Selecting a maximally informative set of single-nucleotide polymorphisms for association analyses using linkage disequilibrium.

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Common genetic polymorphisms may explain a portion of the heritable risk for common diseases. Within candidate genes, the number of common polymorphisms is finite, but direct assay of all existing common polymorphism is inefficient, because genotypes at many of these sites are strongly correlated. Thus, it is not necessary to assay all common variants if the patterns of allelic association between common variants can be described. We have developed an algorithm to select the maximally informative set of common single-nucleotide polymorphisms (tagSNPs) to assay in candidate-gene association studies, such that all known common polymorphisms either are directly assayed or exceed a threshold level of association with a tagSNP. The algorithm is based on the r(2) linkage disequilibrium (LD) statistic, because r(2) is directly related to statistical power to detect disease associations with unassayed sites. We show that, at a relatively stringent r(2) threshold (r2>0.8), the LD-selected tagSNPs resolve >80% of all haplotypes across a set of 100 candidate genes, regardless of recombination, and tag specific haplotypes and clades of related haplotypes in nonrecombinant regions. Thus, if the patterns of common variation are described for a candidate gene, analysis of the tagSNP set can comprehensively interrogate for main effects from common functional variation. We demonstrate that, although common variation tends to be shared between populations, tagSNPs should be selected separately for populations with different ancestries.

Related collections

Author and article information

Journal

Journal ID (iso-abbrev): Am J Hum Genet

Title: American journal of human genetics

Publisher: University of Chicago Press

ISSN: 0002-9297

ISSN (Print): 0002-9297

Publication date (Electronic): Jan 2004

Volume: 74

Issue: 1

Affiliations

[1 ] Department of Genome Sciences, University of Washington, Seattle, WA, 98195, USA. csc47@u.washington.edu

Article

Publisher Item ID: S0002-9297(07)61949-1

DOI: 10.1086/381000

PMC ID: 1181897

PubMed ID: 14681826

SO-VID: 6a2e1e36-006b-4492-95fe-bc18fd57c852

History

Data availability:

Comments

Comment on this article

scite_

Cited by 290

See all cited by

- Version 1