Identification and analysis of deleterious human SNPs

P Yue, J Moult - Journal of molecular biology, 2006 - Elsevier
Journal of molecular biology, 2006Elsevier
We have developed two methods of identifying which non-synonomous single base
changes have a deleterious effect on protein function in vivo. One method, described
elsewhere, analyzes the effect of the resulting amino acid change on protein stability,
utilizing structural information. The other method, introduced here, makes use of the
conservation and type of residues observed at a base change position within a protein
family. A machine learning technique, the support vector machine, is trained on single amino …
We have developed two methods of identifying which non-synonomous single base changes have a deleterious effect on protein function in vivo. One method, described elsewhere, analyzes the effect of the resulting amino acid change on protein stability, utilizing structural information. The other method, introduced here, makes use of the conservation and type of residues observed at a base change position within a protein family. A machine learning technique, the support vector machine, is trained on single amino acid changes that cause monogenic disease, with a control set of amino acid changes fixed between species. Both methods are used to identify deleterious single nucleotide polymorphisms (SNPs) in the human population. After carefully controlling for errors, we find that approximately one quarter of known non-synonymous SNPs are deleterious by these criteria, providing a set of possible contributors to human complex disease traits.
Elsevier