[HTML][HTML] Functional impact of missense variants in BRCA1 predicted by supervised learning

R Karchin, ANA Monteiro, SV Tavtigian… - PLoS computational …, 2007 - journals.plos.org
PLoS computational biology, 2007journals.plos.org
Many individuals tested for inherited cancer susceptibility at the BRCA1 gene locus are
discovered to have variants of unknown clinical significance (UCVs). Most UCVs cause a
single amino acid residue (missense) change in the BRCA1 protein. They can be
biochemically assayed, but such evaluations are time-consuming and labor-intensive.
Computational methods that classify and suggest explanations for UCV impact on protein
function can complement functional tests. Here we describe a supervised learning approach …
Many individuals tested for inherited cancer susceptibility at the BRCA1 gene locus are discovered to have variants of unknown clinical significance (UCVs). Most UCVs cause a single amino acid residue (missense) change in the BRCA1 protein. They can be biochemically assayed, but such evaluations are time-consuming and labor-intensive. Computational methods that classify and suggest explanations for UCV impact on protein function can complement functional tests. Here we describe a supervised learning approach to classification of BRCA1 UCVs. Using a novel combination of 16 predictive features, the algorithms were applied to retrospectively classify the impact of 36 BRCA1 C-terminal (BRCT) domain UCVs biochemically assayed to measure transactivation function and to blindly classify 54 documented UCVs. Majority vote of three supervised learning algorithms is in agreement with the assay for more than 94% of the UCVs. Two UCVs found deleterious by both the assay and the classifiers reveal a previously uncharacterized putative binding site. Clinicians may soon be able to use computational classifiers such as those described here to better inform patients. These classifiers can be adapted to other cancer susceptibility genes and systematically applied to prioritize the growing number of potential causative loci and variants found by large-scale disease association studies.
PLOS