19
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Genetic Diversity, Population Structure and Ancestral Origin of Australian Wheat

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Since the introduction of wheat into Australia by the First Fleet settlers, germplasm from different geographical origins has been used to adapt wheat to the Australian climate through selection and breeding. In this paper, we used 482 cultivars, representing the breeding history of bread wheat in Australia since 1840, to characterize their diversity and population structure and to define the geographical ancestral background of Australian wheat germplasm. This was achieved by comparing them to a global wheat collection using in-silico chromosome painting based on SNP genotyping. The global collection involved 2,335 wheat accessions which was divided into 23 different geographical subpopulations. However, the whole set was reduced to 1,544 accessions to increase the differentiation and decrease the admixture among different global subpopulations to increase the power of the painting analysis. Our analysis revealed that the structure of Australian wheat germplasm and its geographic ancestors have changed significantly through time, especially after the Green Revolution. Before 1920, breeders used cultivars from around the world, but mainly Europe and Africa, to select potential cultivars that could tolerate Australian growing conditions. Between 1921 and 1970, a dependence on African wheat germplasm became more prevalent. Since 1970, a heavy reliance on International Maize and Wheat Improvement Center (CIMMYT) germplasm has persisted. Combining the results from linkage disequilibrium, population structure and in-silico painting revealed that the dependence on CIMMYT materials has varied among different Australian States, has shrunken the germplasm effective population size and produced larger linkage disequilibrium blocks. This study documents the evolutionary history of wheat breeding in Australia and provides an understanding for how the wheat genome has been adapted to local growing conditions. This information provides a guide for industry to assist with maintaining genetic diversity for long-term selection gains and to plan future breeding programs.

          Related collections

          Most cited references43

          • Record: found
          • Abstract: found
          • Article: not found

          Genome plasticity a key factor in the success of polyploid wheat under domestication.

          Wheat was domesticated about 10,000 years ago and has since spread worldwide to become one of the major crops. Its adaptability to diverse environments and end uses is surprising given the diversity bottlenecks expected from recent domestication and polyploid speciation events. Wheat compensates for these bottlenecks by capturing part of the genetic diversity of its progenitors and by generating new diversity at a relatively fast pace. Frequent gene deletions and disruptions generated by a fast replacement rate of repetitive sequences are buffered by the polyploid nature of wheat, resulting in subtle dosage effects on which selection can operate.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            A genetic atlas of human admixture history.

            Modern genetic data combined with appropriate statistical methods have the potential to contribute substantially to our understanding of human history. We have developed an approach that exploits the genomic structure of admixed populations to date and characterize historical mixture events at fine scales. We used this to produce an atlas of worldwide human admixture history, constructed by using genetic data alone and encompassing over 100 events occurring over the past 4000 years. We identified events whose dates and participants suggest they describe genetic impacts of the Mongol empire, Arab slave trade, Bantu expansion, first millennium CE migrations in Eastern Europe, and European colonialism, as well as unrecorded events, revealing admixture to be an almost universal force shaping human populations.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found

              Genomic analyses inform on migration events during the peopling of Eurasia

              High-coverage whole-genome sequence studies have so far focused on a limited number1 of geographically restricted populations 2–5, or targeted at specific diseases, e.g. cancer6. Nevertheless, the availability of high-resolution genomic data has led to the development of new methodologies for inferring population history7–9 and refuelled the debate on the mutation rate in humans10. Here we present the Estonian Biocentre human Genome Diversity Panel (EGDP), a dataset of 483 high-coverage human genomes from 148 populations worldwide, including 379 new genomes from 125 populations, which we group into Diversity and Selection Sets (ED1-2; SI1:1.1-7). We analyse this dataset to refine estimates of continent-wide patterns of heterozygosity, long- and short-distance gene flow, archaic admixture, and changes in effective population size through time. We find a genetic signature in present-day Papuans suggesting that at least 2% of their genome originates from an early and largely extinct expansion of anatomically modern humans (AMH) Out-of-Africa (xOoA). Together with evidence from the Western Asian fossil record11, and admixture between AMHs and Neanderthals predating the main Eurasian expansion12, our results contribute to the mounting evidence for the presence of AMH out of Africa earlier than 75kya. We also screen for signals of positive or balancing selection. The paths taken by AMHs out of Africa (OoA) have been the subject of considerable debate over the past two decades. Fossil and archaeological evidence13,14, and craniometric studies15 of African and Asian populations, demonstrate that Homo sapiens was present outside of Africa ca. 120-70 kya11. However, this colonization has been viewed as a failed expansion OoA16 since genetic analyses of living populations have been consistent with a single OoA followed by serial founder events17. Ancient DNA (aDNA) sequencing studies have found support for admixture between early Eurasians and at least two archaic human lineages18,19, and suggests modern human reached Eurasia at around 100kya12. In addition, aDNA from modern humans suggests population structuring and turnover, but little additional archaic admixture, in Eurasia over the last 35-45 thousand years20–22. Overall, these findings indicate that the majority of human genetic diversity outside Africa derives from a single dispersal event that was followed by admixture with archaic humans18,23. We used ADMIXTURE to analyse the genetic structure in our Diversity Set (ED1). We further compared the individual-level haplotype similarity of our samples using fineSTRUCTURE (ED3). Despite small sample sizes, we inferred 106 genetically distinct populations forming 12 major regional clusters, corresponding well to the 148 self-identified population labels. This clustering forms the basis for the groupings used in the scans of natural selection. Similar genetic affinities are highlighted by plotting the outgroup f 3 statistic9 in the form f 3(X, Y; Yoruba), which here measures shared drift between a non-African population X and any modern or ancient population Y from Yoruba as an African outgroup (SI1:2.2.6, ED4). Our sampling allowed us to consider geographic features correlated with gene flow by spatially interpolating genetic similarity measures between pairs of populations (SI1:2.2.2). We considered several measures and report gradients of allele frequencies in Figure 1, which was compared to gene flow patterns from EEMS24 as a validation (ED5). Controlling for pairwise geographic distance, we find a correlation between these genetic gradients and geographic and climatic features such as precipitation and elevation (inset of Figure 1, SI1:2.2.2). We screened for evidence of selection by first focusing on loci that showed the highest allelic differentiation among groups (SI1:3). We then performed positive and purifying selection scans (Methods), and find some candidate loci that replicate previously known and functionally-supported findings (SI2:3.3.4-I, SI1:3.1, ED6; SI2:3.1-IV,VI). Additionally, we infer more purifying selection in Africans in genes involved in pigmentation (bootstrapping p value - bpv for RX/Y-scores 500Kb) run of homozygosity. We ran ChromoPainter for each individual on only these regions, meaning each individual was only painted where it had been perfectly phased. This did not change the qualitative features (SI1:2.2.1). Removal of similar samples Papuans are genetically distinct from other populations due to tens of thousands of years of isolation. We wanted to check whether the length of haplotypes assigned as African were biased by the inclusion of a large number of relatively homogeneous Eurasians with few Papuans. To do this we repeated the N=447 painting allowing only donors from dissimilar populations, including only individuals who donated 40 kya, their mtDNA and Y lineages could have been lost by genetic drift even assuming an initial xOoA mixing component of up to 35%. Similar findings have been reported recently13. Extended Data ED1 Sample Diversity and Archaic signals. A: Map of location of samples highlighting the Diversity/Selection Sets; B: ADMIXTURE plot (K=8 and 14) which relates general visual inspection of genetic structure to studied populations and their region of origin; C: Sample level heterozygosity is plotted against distance from Addis Ababa. The trend line represents only non-African samples. The inset shows the waypoints used to arrive at the distance in kilometres for each sample. D: Boxplots were used to visualize the Denisova (red), Altai (green) and Croatian Neanderthal (blue) D distribution for each regional group of samples. Oceanian Altai D values show a remarkable similarity with the Denisova D values for the same region, in contrast with the other groups of samples where the Altai boxplots tend to be more similar to the Croatian Neanderthal ones. ED2 Data quality checks and heterozygosity patterns. Concordance of DNA sequencing (Complete Genomics Inc.) and DNA genotyping (Illumina genotyping arrays) data (ref-ref; het-ref-alt and hom-alt-alt, see SI 1.6) from chip (A) and sequence data (B). Coverage (depth) distribution of variable positions, divided by DNA source (Blood or Saliva) and Complete Genomic calling pipeline (release version) (C). Genome-wide distribution of Transition/Transversion ratio subdivided by DNA source (Saliva or Blood) and by Complete Genomic calling pipeline (D). Genome-wide distribution of Transition/Transversion ratio subdivided by chromosomes (E). Inter-chromosome differences in observed heterozygosity in 447 samples from the Diversity Set (F). Inter-chromosome differences in observed heterozygosity in a set of 50 unpublished genomes from the Estonian Genome Center, sequenced on an Illumina platform at an average coverage exceeding 30x (G). Inter-chromosome differences in observed heterozygosity in the phase 3 of the 1000 Genomes Project (H). The total number of observed heterozygous sites was divided by the number of accessible basepairs reported by the 1000 Genomes Project. ED3 FineSTRUCTURE shared ancestry analysis. ChromoPainter and FineSTRUCTURE results, showing both inferred populations with the underlying (averaged) number of haplotypes that an individual in a population receives (rows) from donor individuals in other populations (columns). 108 populations are inferred by FineSTRUCTURE. The dendrogram shows the inferred relationship between populations. The numbers on the dendrogram give the proportion of MCMC iterations for which each population split is observed (where this is less than 1). Each “geographical region” has a unique colour from which individuals are labeled. The number of individuals in each population is given in the label; e.g. “4Italians; 3Albanians” is a population of size 7 containing 4 individuals from Italy and 3 from Albania. ED4 MSMC genetic split times and outgroup f3 results. The MSMC split times estimated between each sample and a reference panel of 9 genomes were linearly interpolated to infer the broader square matrix (A). Summary of outgroup f3 statistics for each pair of non-African populations (B) or to an ancient sample (C) using Yoruba as an outgroup. Populations are grouped by geographic region and are ordered with increasing distance from Africa (left to right for columns and bottom to top for rows). Colour bars at the left and top of the heat map indicate the colour coding used for the geographical region. Individual population labels are indicated at the right and bottom of the heat map. The f3 statistics are scaled to lie between 0 and 1, with a black colour indicating those close to 0 and a red colour indicating those close to 1. Let m and M be the minimum and maximum f3 values within a given row (i.e., focal population). That is, for focal population X (on rows), m = minY,Y≠X f3(X, Y ; Yoruba) and M = maxY,Y≠X f3(X, Y ; Yoruba). The scaled f3 statistic for a given cell in that row is given by f3scaled=(f3-m)/(M-m), so that the smallest f3 in the row has value f3scaled=0 (black) and the largest has value f3scaled=1 (red). By default, the diagonal has value f3scaled=1 (red). The heat map is therefore asymmetric, with the population closest to the focal population at a given row having value f3scaled=1 (red colour) and the population farthest from the focal population at a given row having value f3scaled=0 (black colour). Therefore, at a given row, scanning the columns of the heat map reveals the populations with the most shared ancestry with the focal population of that row in the heat map. ED5 Geographical patterns of genetic diversity. Isolation by distance pattern across areas of high genetic gradient, using Europe as a baseline. The samples used in each analysis are indicated by coloured lines on the maps to the right of each plot. The panels show F ST as a function of distance across the Himalayas (A), the Ural mountains (B), and the Caucasus (C) as reported on the color-coded map (D). Effect of creating gaps in the samples in Europe (E): we tested the effect of removing samples from stripes, either north to south (F) or west to east (G), to create gaps comparable in size to the gaps in samples in the dataset. Effective migration surfaces inferred by EEMS (H). ED6 Summary of positive selection results Barplot comparing frequency distributions of functional variants in Africans and non-Africans (A). The distribution of exonic SNPs according to their functional impact (synonymous, missense and nonsense) as a function of allele frequency. Note that the data from both groups was normalised for a sample size of n=21 and that the Africans show significantly (Chisq p-value 500kb) Run of Homozygosity using the PLINK command “--homozyg-window-kb 500000 --homozyg-window-het 0 --homozyg-density 10”. Because there are so few such regions, we report only the population average for populations with two or more individuals, as well as the standard error in that estimate. Populations for whom the 95% CI passed 0 were also excluded. Note the logarithmic axis. D: Ancient DNA panel results. We used a different panel of 109 individuals which included 3 ancient genomes. We painted Chromosomes 11, 21 & 22 and report as crosses the population averages for populations with 2 or more individuals. The solid thin lines represent the position of each population when modern samples only are analysed. The dashed lines lead off the figure to the position of the ancient hominins and the African samples. ED8 MSMC Linear behavior of MSMC split estimates in presence of admixture. The examined Central Asian (A), East African (B), and African-American (C) genomes yielded a signature of MSMC split time (Truth, left-most column) that could be recapitulated (Reconstruction, second left most column) as a linear mixture of other MSMC split times. The admixture proportions inferred by our method (top of each admixture component column) were remarkably similar to the ones previously reported from the literature. MSMC split times (D) calculated after re-phasing an Estonian and a Papuan (Koinanbe) genome together with all the available West African and Pygmy genomes from our dataset to minimize putative phasing artefacts. The cross coalescence rate curves reported here are quantitatively comparable with the ones of Figure 2 A, hence showing that phasing artefacts are unlikely to explain the observed past-ward shift of the Papuan-African split time. Boxplot (E) showing the distribution of differences between African-Papuan and African-Eurasian split times obtained from coalescent simulations assembled through random replacement to make 2000 sets of 6 individuals (to match the 6 Papuans available from our empirical dataset), each made of 1.5 Gb of sequence. The simulation command line used to generate each chromosome made of 5Mb was as follows, being *DIV*=0.064; 0.4 or 0.8 for the xOoA, Denisova (Den) and Divergent Denisova (DeepDen) cases, respectively: ms0ancient2 10 1 .065 .05 -t 5000. -r 3000. 5000000 -I 7 1 1 1 1 2 2 2 -en 0. 1 .2 -en 0. 2 .2 -en 0. 3 .2 -en 0. 4 .2 -es .025 7 .96 -en .025 8 .2 -ej .03 7 6 -ej .04 6 5 -ej .060 8 3 -ej .061 4 3 -ej .062 2 1 -ej .063 3 1 -ej *DIV* 1 5 ED9 Modelling the xOoA components with FineSTRUCTURE. A: Joint distribution of haplotype lengths and Derived allele count, showing the median position of each cluster and all haplotypes assigned to it in the Maximum A Posteriori (MAP) estimate. Note that although a different proportion of points is assigned to each in the MAP, the total posterior is very close to 1/K for all. The dashed lines show a constant mutation rate. Haplotypes are ordered by mutation rate from low to high. B: Residual distribution comparison between the two component mixture using EUR.AFR and EUR.PNG (left), and the three component mixture including xOoA (using the same colour scale) (right). The residuals without xOoA are larger (RMSE 0.0055 compared to RMSE 0.0018) but more importantly, they are also structured. C: Assuming a mutational clock and a correct assignment of haplotypes, we can estimate the relative age of the splits from the number of derived alleles observed on the haplotypes. This leads to an estimate of 1.5 times older for xOoA compared to the Eurasian-Africa split. ED10 Proposed xOoA model. A subway map figure illustrating, as suggested by the novel results presented here, a model of an early, extinct Out-of-Africa (xOoA) signature in the genomes of Sahul populations at their arrival in the region. Given the overall small genomic contribution of this event to the genomes of modern Sahul individuals, we could not determine whether the documented Denisova admixture (question marks) and putative multiple Neanderthal admixtures took place along this extinct OoA. We also speculate (question mark) people who migrated along the xOoA route may have left a trace in the genomes of the Altai Neanderthal as reported by Kuhlwilm and colleagues12. Supplementary Information Additional results are reported in two Supplementary Information files online: SI1 including description of additional analyses, and SI2 including results in table format. Supplementary Information SI1 Table SI2
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Plant Sci
                Front Plant Sci
                Front. Plant Sci.
                Frontiers in Plant Science
                Frontiers Media S.A.
                1664-462X
                12 December 2017
                2017
                : 8
                : 2115
                Affiliations
                [1] 1Department of Animal, Plant and Soil Sciences, La Trobe University , Bundoora, VIC, Australia
                [2] 2Agriculture Victoria Research, AgriBio, Centre for Agribioscience , Bundoora, VIC, Australia
                [3] 3School of Applied Systems Biology, La Trobe University , Bundoora, VIC, Australia
                [4] 4School of Life and Environmental Sciences, The University of Sydney Plant Breeding Institute , Cobbitty, NSW, Australia
                Author notes

                Edited by: Rinaldo W. Wellerson Pereira, Universidade Católica de Brasília, Brazil

                Reviewed by: Yessica Rico, Institute of Ecology (INECOL), Mexico; Marco Pessoa-Filho, Brazilian Agricultural Research Corporation, Brazil; Mulatu Geleta, Swedish University of Agricultural Sciences, Sweden

                *Correspondence: Reem Joukhadar 17823013@ 123456students.latrobe.edu.au

                This article was submitted to Evolutionary and Population Genetics, a section of the journal Frontiers in Plant Science

                Article
                10.3389/fpls.2017.02115
                5733070
                29312381
                056a5b5d-8b1c-45bf-96e0-ca880e57ec76
                Copyright © 2017 Joukhadar, Daetwyler, Bansal, Gendall and Hayden.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) or licensor are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 23 July 2017
                : 28 November 2017
                Page count
                Figures: 6, Tables: 2, Equations: 0, References: 58, Pages: 15, Words: 10197
                Categories
                Plant Science
                Original Research

                Plant science & Botany
                australian wheat,geographical ancestor,genetic diversity,in-silico chromosome painting,population structure,single nucleotide polymorphism

                Comments

                Comment on this article