7
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Genomic preselection with genotyping-by-sequencing increases performance of commercial oil palm hybrid crosses

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          There is great potential for the genetic improvement of oil palm yield. Traditional progeny tests allow accurate selection but limit the number of individuals evaluated. Genomic selection (GS) could overcome this constraint. We estimated the accuracy of GS prediction of seven oil yield components using A × B hybrid progeny tests with almost 500 crosses for training and 200 crosses for independent validation. Genotyping-by-sequencing (GBS) yielded +5000 single nucleotide polymorphisms (SNPs) on the parents of the crosses. The genomic best linear unbiased prediction method gave genomic predictions using the SNPs of the training and validation sets and the phenotypes of the training crosses. The practical impact was illustrated by quantifying the additional bunch production of the crosses selected in the validation experiment if genomic preselection had been applied in the parental populations before progeny tests.

          Results

          We found that prediction accuracies for cross values plateaued at 500 to 2000 SNPs, with high (0.73) or low (0.28) values depending on traits. Similar results were obtained when parental breeding values were predicted. GS was able to capture genetic differences within parental families, requiring at least 2000 SNPs with less than 5% missing data, imputed using pedigrees. Genomic preselection could have increased the selected hybrids bunch production by more than 10%.

          Conclusions

          Finally, preselection for yield components using GBS is the first possible application of GS in oil palm. This will increase selection intensity, thus improving the performance of commercial hybrids. Further research is required to increase the benefits from GS, which should revolutionize oil palm breeding.

          Electronic supplementary material

          The online version of this article (10.1186/s12864-017-4179-3) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references64

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The variant call format and VCFtools

          Summary: The variant call format (VCF) is a generic format for storing DNA polymorphism data such as SNPs, insertions, deletions and structural variants, together with rich annotations. VCF is usually stored in a compressed manner and can be indexed for fast data retrieval of variants from a range of positions on the reference genome. The format was developed for the 1000 Genomes Project, and has also been adopted by other projects such as UK10K, dbSNP and the NHLBI Exome Project. VCFtools is a software suite that implements various utilities for processing VCF files, including validation, merging, comparing and also provides a general Perl API. Availability: http://vcftools.sourceforge.net Contact: rd@sanger.ac.uk
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            A Robust, Simple Genotyping-by-Sequencing (GBS) Approach for High Diversity Species

            Advances in next generation technologies have driven the costs of DNA sequencing down to the point that genotyping-by-sequencing (GBS) is now feasible for high diversity, large genome species. Here, we report a procedure for constructing GBS libraries based on reducing genome complexity with restriction enzymes (REs). This approach is simple, quick, extremely specific, highly reproducible, and may reach important regions of the genome that are inaccessible to sequence capture approaches. By using methylation-sensitive REs, repetitive regions of genomes can be avoided and lower copy regions targeted with two to three fold higher efficiency. This tremendously simplifies computationally challenging alignment problems in species with high levels of genetic diversity. The GBS procedure is demonstrated with maize (IBM) and barley (Oregon Wolfe Barley) recombinant inbred populations where roughly 200,000 and 25,000 sequence tags were mapped, respectively. An advantage in species like barley that lack a complete genome sequence is that a reference map need only be developed around the restriction sites, and this can be done in the process of sample genotyping. In such cases, the consensus of the read clusters across the sequence tagged sites becomes the reference. Alternatively, for kinship analyses in the absence of a reference genome, the sequence tags can simply be treated as dominant markers. Future application of GBS to breeding, conservation, and global species and population surveys may allow plant breeders to conduct genomic selection on a novel germplasm or species without first having to develop any prior molecular tools, or conservation biologists to determine population structure without prior knowledge of the genome or diversity in the species.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Rapid and accurate haplotype phasing and missing-data inference for whole-genome association studies by use of localized haplotype clustering.

              Whole-genome association studies present many new statistical and computational challenges due to the large quantity of data obtained. One of these challenges is haplotype inference; methods for haplotype inference designed for small data sets from candidate-gene studies do not scale well to the large number of individuals genotyped in whole-genome association studies. We present a new method and software for inference of haplotype phase and missing data that can accurately phase data from whole-genome association studies, and we present the first comparison of haplotype-inference methods for real and simulated data sets with thousands of genotyped individuals. We find that our method outperforms existing methods in terms of both speed and accuracy for large data sets with thousands of individuals and densely spaced genetic markers, and we use our method to phase a real data set of 3,002 individuals genotyped for 490,032 markers in 3.1 days of computing time, with 99% of masked alleles imputed correctly. Our method is implemented in the Beagle software package, which is freely available.
                Bookmark

                Author and article information

                Contributors
                +33-467615800 , david.cros@cirad.fr
                +33-467615800 , stephanie.bocs@cirad.fr
                +33-467615800 , virginie.riou@cirad.fr
                +33-467615800 , Enrique.Ortega@cefe.cnrs.fr
                +33-467615800 , sebastien.tisne@cirad.fr
                +33-467615800 , xavier.argout@cirad.fr
                +33-467615800 , virginie.pomies@cirad.fr
                anleifi@yahoo.com
                zulkiflilubis@socfindo.co.id
                benoit.cochard@palmelit.com
                tristan.durand-gasselin@palmelit.com
                Journal
                BMC Genomics
                BMC Genomics
                BMC Genomics
                BioMed Central (London )
                1471-2164
                2 November 2017
                2 November 2017
                2017
                : 18
                : 839
                Affiliations
                [1 ]ISNI 0000 0001 2153 9871, GRID grid.8183.2, CIRAD, UMR AGAP (Genetic Improvement and Adaptation of Mediterranean and Tropical Plants Research Unit), ; F-34398 Montpellier, France
                [2 ]South Green Bioinformatics Platform, Montpellier, France
                [3 ]P.T. SOCFINDO Medan, Medan, 20001 Indonesia
                [4 ]INRAB, CRAPP, Pobè, Benin
                [5 ]PalmElit SAS, 34980 Montferrier sur Lez, France
                Author information
                http://orcid.org/0000-0002-8601-7991
                Article
                4179
                10.1186/s12864-017-4179-3
                5667528
                29096603
                aa527a43-28a1-4d85-bf18-6117768c7955
                © The Author(s). 2017

                Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 20 April 2017
                : 5 October 2017
                Categories
                Research Article
                Custom metadata
                © The Author(s) 2017

                Genetics
                genetic gain,genomic selection,genotyping-by-sequencing,hybrid,oil palm,reciprocal recurrent selection

                Comments

                Comment on this article