1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Enviromic-based kernels may optimize resource allocation with multi-trait multi-environment genomic prediction for tropical Maize

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Success in any genomic prediction platform is directly dependent on establishing a representative training set. This is a complex task, even in single-trait single-environment conditions and tends to be even more intricated wherein additional information from envirotyping and correlated traits are considered. Here, we aimed to design optimized training sets focused on genomic prediction, considering multi-trait multi-environment trials, and how those methods may increase accuracy reducing phenotyping costs. For that, we considered single-trait multi-environment trials and multi-trait multi-environment trials for three traits: grain yield, plant height, and ear height, two datasets, and two cross-validation schemes. Next, two strategies for designing optimized training sets were conceived, first considering only the genomic by environment by trait interaction (GET), while a second including large-scale environmental data (W, enviromics) as genomic by enviromic by trait interaction (GWT). The effective number of individuals (genotypes × environments × traits) was assumed as those that represent at least 98% of each kernel (GET or GWT) variation, in which those individuals were then selected by a genetic algorithm based on prediction error variance criteria to compose an optimized training set for genomic prediction purposes.

          Results

          The combined use of genomic and enviromic data efficiently designs optimized training sets for genomic prediction, improving the response to selection per dollar invested by up to 145% when compared to the model without enviromic data, and even more when compared to cross validation scheme with 70% of training set or pure phenotypic selection. Prediction models that include G × E or enviromic data + G × E yielded better prediction ability.

          Conclusions

          Our findings indicate that a genomic by enviromic by trait interaction kernel associated with genetic algorithms is efficient and can be proposed as a promising approach to designing optimized training sets for genomic prediction when the variance-covariance matrix of traits is available. Additionally, great improvements in the genetic gains per dollar invested were observed, suggesting that a good allocation of resources can be deployed by using the proposed approach.

          Related collections

          Most cited references56

          • Record: found
          • Abstract: found
          • Article: not found

          Efficient methods to compute genomic predictions.

          P VanRaden (2008)
          Efficient methods for processing genomic data were developed to increase reliability of estimated breeding values and to estimate thousands of marker effects simultaneously. Algorithms were derived and computer programs tested with simulated data for 2,967 bulls and 50,000 markers distributed randomly across 30 chromosomes. Estimation of genomic inbreeding coefficients required accurate estimates of allele frequencies in the base population. Linear model predictions of breeding values were computed by 3 equivalent methods: 1) iteration for individual allele effects followed by summation across loci to obtain estimated breeding values, 2) selection index including a genomic relationship matrix, and 3) mixed model equations including the inverse of genomic relationships. A blend of first- and second-order Jacobi iteration using 2 separate relaxation factors converged well for allele frequencies and effects. Reliability of predicted net merit for young bulls was 63% compared with 32% using the traditional relationship matrix. Nonlinear predictions were also computed using iteration on data and nonlinear regression on marker deviations; an additional (about 3%) gain in reliability for young bulls increased average reliability to 66%. Computing times increased linearly with number of genotypes. Estimation of allele frequencies required 2 processor days, and genomic predictions required <1 d per trait, and traits were processed in parallel. Information from genotyping was equivalent to about 20 daughters with phenotypic records. Actual gains may differ because the simulation did not account for linkage disequilibrium in the base population or selection in subsequent generations.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Prediction of Total Genetic Value Using Genome-Wide Dense Marker Maps

            Recent advances in molecular genetic techniques will make dense marker maps available and genotyping many individuals for these markers feasible. Here we attempted to estimate the effects of ∼50,000 marker haplotypes simultaneously from a limited number of phenotypic records. A genome of 1000 cM was simulated with a marker spacing of 1 cM. The markers surrounding every 1-cM region were combined into marker haplotypes. Due to finite population size (Ne = 100), the marker haplotypes were in linkage disequilibrium with the QTL located between the markers. Using least squares, all haplotype effects could not be estimated simultaneously. When only the biggest effects were included, they were overestimated and the accuracy of predicting genetic values of the offspring of the recorded animals was only 0.32. Best linear unbiased prediction of haplotype effects assumed equal variances associated to each 1-cM chromosomal segment, which yielded an accuracy of 0.73, although this assumption was far from true. Bayesian methods that assumed a prior distribution of the variance associated with each chromosome segment increased this accuracy to 0.85, even when the prior was not correct. It was concluded that selection on genetic values predicted from markers could substantially increase the rate of genetic gain in animals and plants, especially if combined with reproductive techniques to shorten the generation interval.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              A high-performance computing toolset for relatedness and principal component analysis of SNP data.

              Genome-wide association studies are widely used to investigate the genetic basis of diseases and traits, but they pose many computational challenges. We developed gdsfmt and SNPRelate (R packages for multi-core symmetric multiprocessing computer architectures) to accelerate two key computations on SNP data: principal component analysis (PCA) and relatedness analysis using identity-by-descent measures. The kernels of our algorithms are written in C/C++ and highly optimized. Benchmarks show the uniprocessor implementations of PCA and identity-by-descent are ∼8-50 times faster than the implementations provided in the popular EIGENSTRAT (v3.0) and PLINK (v1.07) programs, respectively, and can be sped up to 30-300-fold by using eight cores. SNPRelate can analyse tens of thousands of samples with millions of SNPs. For example, our package was used to perform PCA on 55 324 subjects from the 'Gene-Environment Association Studies' consortium studies.
                Bookmark

                Author and article information

                Contributors
                raysagevartosky@alumni.usp.br
                Journal
                BMC Plant Biol
                BMC Plant Biol
                BMC Plant Biology
                BioMed Central (London )
                1471-2229
                5 January 2023
                5 January 2023
                2023
                : 23
                : 10
                Affiliations
                [1 ]GRID grid.11899.38, ISNI 0000 0004 1937 0722, Department of Genetics, Luiz de Queiroz College of Agriculture, , University of São Paulo, ; Piracicaba, São Paulo, Brazil
                [2 ]GRID grid.5690.a, ISNI 0000 0001 2151 2978, Centro de Biotecnología y Genómica de Plantas (CBGP, UPM-INIA), , Universidad Politécnica de Madrid (UPM), ; Madrid, Spain
                [3 ]GRID grid.5386.8, ISNI 000000041936877X, Institute for Genomics Diversity, , Cornell University, ; Ithaca, NY USA
                [4 ]GRID grid.412887.0, ISNI 0000 0001 2375 8971, Facultad de Telemática, , Universidad de Colima, ; Colima, Mexico
                [5 ]GRID grid.433436.5, ISNI 0000 0001 2289 885X, International Maize and Wheat Improvement Center (CIMMYT), ; Km 45, Carretera Mexico-Veracruz, CP 52640 Texcoco, Edo. de México Mexico
                [6 ]GRID grid.418752.d, ISNI 0000 0004 1795 9752, Colegio de Postgraduados, ; CP 56230 Montecillos, Edo. de México Mexico
                Author information
                http://orcid.org/0000-0003-1495-0261
                https://orcid.org/0000-0003-0745-7583
                https://orcid.org/0000-0003-1137-6786
                https://orcid.org/0000-0002-3973-6547
                https://orcid.org/0000-0001-9429-5855
                https://orcid.org/0000-0003-4310-0047
                Article
                3975
                10.1186/s12870-022-03975-1
                9814176
                36604618
                5150b846-f023-4f68-982f-ab11c4cf1dc6
                © The Author(s) 2023

                Open AccessThis article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated in a credit line to the data.

                History
                : 19 August 2022
                : 24 November 2022
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/501100002322, Coordenação de Aperfeiçoamento de Pessoal de Nível Superior;
                Award ID: 001
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/100000865, Bill and Melinda Gates Foundation;
                Award ID: INV-003439, BMGF/FCDO, Accelerating Genetic Gains in Maize and Wheat for Improved Livelihoods (AG2MW)
                Categories
                Research Article
                Custom metadata
                © The Author(s) 2023

                Plant science & Botany
                genomic prediction,training population,enviromics,response to selection
                Plant science & Botany
                genomic prediction, training population, enviromics, response to selection

                Comments

                Comment on this article