[PDF][PDF] A multipoint method for detecting genotyping errors and mutations in sibling-pair linkage data

JA Douglas, M Boehnke, K Lange - The American Journal of Human …, 2000 - cell.com
JA Douglas, M Boehnke, K Lange
The American Journal of Human Genetics, 2000cell.com
The identification of genes contributing to complex diseases and quantitative traits requires
genetic data of high fidelity, because undetected errors and mutations can profoundly affect
linkage information. The recent emphasis on the use of the sibling-pair design eliminates or
decreases the likelihood of detection of genotyping errors and marker mutations through
apparent Mendelian incompatibilities or close double recombinants. In this article, we
describe a hidden Markov method for detecting genotyping errors and mutations in …
The identification of genes contributing to complex diseases and quantitative traits requires genetic data of high fidelity, because undetected errors and mutations can profoundly affect linkage information. The recent emphasis on the use of the sibling-pair design eliminates or decreases the likelihood of detection of genotyping errors and marker mutations through apparent Mendelian incompatibilities or close double recombinants. In this article, we describe a hidden Markov method for detecting genotyping errors and mutations in multilocus linkage data. Specifically, we calculate the posterior probability of genotyping error or mutation for each sibling-pair-marker combination, conditional on all marker data and an assumed genotype-error rate. The method is designed for use with sibling-pair data when parental genotypes are unavailable. Through Monte Carlo simulation, we explore the effects of map density, marker-allele frequencies, marker position, and genotype-error rate on the accuracy of our error-detection method. In addition, we examine the impact of genotyping errors and error detection and correction on multipoint linkage information. We illustrate that even moderate error rates can result in substantial loss of linkage information, given efforts to fine-map a putative disease locus. Although simulations suggest that our method detects ≤50% of genotyping errors, it generally flags those errors that have the largest impact on linkage results. For high-resolution genetic maps, removal of the errors identified by our method restores most or nearly all the lost linkage information and can be accomplished without generating false evidence for linkage by removing incorrectly identified errors.
cell.com