Inference of Human Population History From Whole Genome Sequence of A Single Individual

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

The history of human population size is important to understanding human evolution. Various studies ^{1-
5} have found evidence for a founder event (bottleneck) in East Asian and European populations associated with the human dispersal out-of-Africa event around 60 thousand years ago (kya) before present. However, these studies have to assume simplified demographic models with few parameters and do not precisely date the start and stop times of the bottleneck. Here, with fewer assumptions on population size changes, we present a more detailed history of human population sizes between approximately ten thousand to a million years ago, using the pairwise sequentially Markovian coalescent (PSMC) model applied to the complete diploid genome sequences of a Chinese male (YH) ⁶, a Korean male (SJK) ⁷, three European individuals (Venter ⁸, NA12891 and NA12878 ⁹) and two Yoruba males (NA18507 ¹⁰ and NA19239). We infer that European and Chinese populations had very similar population size histories before 10–20kya. Both populations experienced a severe bottleneck between 10–60kya while African populations experienced a milder bottleneck from which they recovered earlier. All three populations have an elevated effective population size between 60–250kya, possibly due to a population structure ¹¹. We also infer that the differentiation of genetically modern humans may have started as early as 100–120kya ¹², but considerable genetic exchanges may still have occurred until 20–40kya.

Related collections

Most cited references 15

Record: found
Abstract: found
Article: not found

Linkage disequilibrium in the human genome.

D. E. Reich, M Cargill, S Bolk … (2001)

With the availability of a dense genome-wide map of single nucleotide polymorphisms (SNPs), a central issue in human genetics is whether it is now possible to use linkage disequilibrium (LD) to map genes that cause disease. LD refers to correlations among neighbouring alleles, reflecting 'haplotypes' descended from single, ancestral chromosomes. The size of LD blocks has been the subject of considerable debate. Computer simulations and empirical data have suggested that LD extends only a few kilobases (kb) around common SNPs, whereas other data have suggested that it can extend much further, in some cases greater than 100 kb. It has been difficult to obtain a systematic picture of LD because past studies have been based on only a few (1-3) loci and different populations. Here, we report a large-scale experiment using a uniform protocol to examine 19 randomly selected genomic regions. LD in a United States population of north-European descent typically extends 60 kb from common alleles, implying that LD mapping is likely to be practical in this population. By contrast, LD in a Nigerian population extends markedly less far. The results illuminate human history, suggesting that LD in northern Europeans is shaped by a marked demographic event about 27,000-53,000 years ago.

0 comments Cited 317 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Estimate of the mutation rate per nucleotide in humans.

M W Nachman, S L Crowell, Daniel Ho … (2000)

Many previous estimates of the mutation rate in humans have relied on screens of visible mutants. We investigated the rate and pattern of mutations at the nucleotide level by comparing pseudogenes in humans and chimpanzees to (i) provide an estimate of the average mutation rate per nucleotide, (ii) assess heterogeneity of mutation rate at different sites and for different types of mutations, (iii) test the hypothesis that the X chromosome has a lower mutation rate than autosomes, and (iv) estimate the deleterious mutation rate. Eighteen processed pseudogenes were sequenced, including 12 on autosomes and 6 on the X chromosome. The average mutation rate was estimated to be approximately 2.5 x 10(-8) mutations per nucleotide site or 175 mutations per diploid genome per generation. Rates of mutation for both transitions and transversions at CpG dinucleotides are one order of magnitude higher than mutation rates at other sites. Single nucleotide substitutions are 10 times more frequent than length mutations. Comparison of rates of evolution for X-linked and autosomal pseudogenes suggests that the male mutation rate is 4 times the female mutation rate, but provides no evidence for a reduction in mutation rate that is specific to the X chromosome. Using conservative calculations of the proportion of the genome subject to purifying selection, we estimate that the genomic deleterious mutation rate (U) is at least 3. This high rate is difficult to reconcile with multiplicative fitness effects of individual mutations and suggests that synergistic epistasis among harmful mutations may be common.

0 comments Cited 296 times – based on 0 reviews      Review now

Bookmark

Record: found
Abstract: found
Article: not found

Calibrating a coalescent simulation of human genome sequence variation.

Stephen F. Schaffner, Catherine Foo, Stacey Gabriel … (2005)

Population genetic models play an important role in human genetic research, connecting empirical observations about sequence variation with hypotheses about underlying historical and biological causes. More specifically, models are used to compare empirical measures of sequence variation, linkage disequilibrium (LD), and selection to expectations under a "null" distribution. In the absence of detailed information about human demographic history, and about variation in mutation and recombination rates, simulations have of necessity used arbitrary models, usually simple ones. With the advent of large empirical data sets, it is now possible to calibrate population genetic models with genome-wide data, permitting for the first time the generation of data that are consistent with empirical data across a wide range of characteristics. We present here the first such calibrated model and show that, while still arbitrary, it successfully generates simulated data (for three populations) that closely resemble empirical data in allele frequency, linkage disequilibrium, and population differentiation. No assertion is made about the accuracy of the proposed historical and recombination model, but its ability to generate realistic data meets a long-standing need among geneticists. We anticipate that this model, for which software is publicly available, and others like it will have numerous applications in empirical studies of human genetics.

0 comments Cited 226 times – based on 0 reviews      Review now

Bookmark

All references

Author and article information

Journal

Journal ID (nlm-journal-id): 0410462

Journal ID (pubmed-jr-id): 6011

Journal ID (nlm-ta): Nature

Journal ID (iso-abbrev): Nature

Title: Nature

ISSN (Print): 0028-0836

ISSN (Electronic): 1476-4687

Publication date Nihms-submitted: 21 July 2011

Publication date (Electronic): 13 July 2011

Publication date PMC-release: 28 January 2012

Volume: 475

Issue: 7357

Pages: 493-496

Affiliations

[1 ]The Wellcome Trust Sanger Institute, Hinxton, Cambridge, CB10 1SA, United Kingdom

[2 ]Broad Institute of Harvard and MIT, Cambridge, Massachusetts, 02142, USA

Author notes

Correspondence should be addressed to: Richard Durbin ( rd@ 123456sanger.ac.uk ) or Heng Li ( lh3@ 123456sanger.ac.uk )

Author contribution R.D. proposed the basic strategy and designed the overall study. H.L. developed the theory, implemented the algorithm and analyzed results. R.D. and H.L. wrote the manuscript.

Article

Manuscript ID: UKMS36009

DOI: 10.1038/nature10231

PMC ID: 3154645

PubMed ID: 21753753

SO-VID: f8cc6acc-7706-40ab-af78-cb30e2ea89ba

License:

Users may view, print, copy, download and text and data- mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms

History

Funding

Funded by: Wellcome Trust :

Award ID: 077192 || WT

Comments

Comment on this article

scite_

Cited by 1,111

See all cited by

Most referenced authors 2,011

See all reference authors

- Version 1

Inference of Human Population History From Whole Genome Sequence of A Single Individual

Read this article at

Abstract

Related collections

The Journal of Population and Sustainability

Most cited references 15

Linkage disequilibrium in the human genome.

Estimate of the mutation rate per nucleotide in humans.

Calibrating a coalescent simulation of human genome sequence variation.

Author and article information

Journal

Affiliations

Author notes

Article

History

Funding

Categories

Comments

Comment on this article

Similar content 164

Cited by 1,111

Most referenced authors 2,011