
Principal components analysis corrects for stratification in genome-wide association studies.

Nature genetics

Principal Component Analysis; Algorithms; Polymorphism, Single Nucleotide; Phenotype; Humans; Genotype; statistics & numerical data; Genomics; Genome, Human; Genetic Markers; Databases, Nucleic Acid; Case-Control Studies; Alleles


Population stratification (allele frequency differences between cases and controls due to systematic ancestry differences) can cause spurious associations in disease studies. We describe a method that enables explicit detection and correction of population stratification on a genome-wide scale. Our method uses principal components analysis to explicitly model ancestry differences between cases and controls. The resulting correction is specific to a candidate marker's variation in frequency across ancestral populations, minimizing spurious associations while maximizing power to detect true associations. Our simple, efficient approach can easily be applied to disease studies with hundreds of thousands of markers.
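
The abstract gives no implementation details, so the following is only a minimal sketch of the general idea, not the authors' code. It assumes genotypes are coded as 0/1/2 allele counts in a samples-by-markers NumPy array, infers ancestry axes from the top principal components of the normalized genotype matrix, regresses those axes out of both the candidate marker and the phenotype, and then scores the residual correlation. The function names (`top_axes`, `residualize`, `adjusted_stat`), the per-marker normalization, and the `(n - k - 1) * r**2` form of the statistic are assumptions chosen for illustration.

```python
import numpy as np


def top_axes(genotypes, k):
    """Return the top-k principal axes of variation across samples.

    genotypes: (n_samples, n_markers) array of 0/1/2 allele counts.
    Assumption: each marker is centered and scaled by sqrt(p * (1 - p)),
    where p is its estimated allele frequency.
    """
    g = genotypes.astype(float)
    p = g.mean(axis=0) / 2.0
    keep = (p > 0) & (p < 1)  # drop monomorphic markers to avoid dividing by zero
    z = (g[:, keep] - 2.0 * p[keep]) / np.sqrt(p[keep] * (1.0 - p[keep]))
    u, _, _ = np.linalg.svd(z, full_matrices=False)
    return u[:, :k]  # orthonormal columns, one row per sample


def residualize(x, axes):
    """Center x and remove its projection onto the (orthonormal) ancestry axes."""
    x = x - x.mean()
    return x - axes @ (axes.T @ x)


def adjusted_stat(genotype, phenotype, axes):
    """Association statistic for one marker after ancestry adjustment.

    Assumption: (n - k - 1) times the squared correlation of the residuals,
    compared against a chi-square distribution with 1 degree of freedom.
    """
    g = residualize(genotype.astype(float), axes)
    y = residualize(phenotype.astype(float), axes)
    n, k = axes.shape
    r = (g @ y) / np.sqrt((g @ g) * (y @ y))
    return (n - k - 1) * r ** 2
```

Because the candidate marker itself is residualized, a marker whose frequency varies strongly across ancestries is adjusted heavily, while a marker with little ancestry-related variation is left nearly unchanged. Passing an axes array with zero columns, for example `np.zeros((n, 0))`, gives the unadjusted statistic for comparison.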
