14
views
0
recommends
+1 Recommend
2 collections
    0
    shares

      Why publish your research Open Access with G3: Genes|Genomes|Genetics?

      Learn more and submit today!

      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Phantom Epistasis in Genomic Selection: On the Predictive Ability of Epistatic Models

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Genomic selection uses whole-genome marker models to predict phenotypes or genetic values for complex traits. Some of these models fit interaction terms between markers, and are therefore called epistatic. The biological interpretation of the corresponding fitted effects is not straightforward and there is the threat of overinterpreting their functional meaning. Here we show that the predictive ability of epistatic models relative to additive models can change with the density of the marker panel. In more detail, we show that for publicly available Arabidopsis and rice datasets, an initial superiority of epistatic models over additive models, which can be observed at a lower marker density, vanishes when the number of markers increases. We relate these observations to earlier results reported in the context of association studies which showed that detecting statistical epistatic effects may not only be related to interactions in the underlying genetic architecture, but also to incomplete linkage disequilibrium at low marker density (“Phantom Epistasis”). Finally, we illustrate in a simulation study that due to phantom epistasis, epistatic models may also predict the genetic value of an underlying purely additive genetic architecture better than additive models, when the marker density is low. Our observations can encourage the use of genomic epistatic models with low density panels, and discourage their biological over-interpretation.

          Related collections

          Most cited references39

          • Record: found
          • Abstract: found
          • Article: not found

          Efficient methods to compute genomic predictions.

          P VanRaden (2008)
          Efficient methods for processing genomic data were developed to increase reliability of estimated breeding values and to estimate thousands of marker effects simultaneously. Algorithms were derived and computer programs tested with simulated data for 2,967 bulls and 50,000 markers distributed randomly across 30 chromosomes. Estimation of genomic inbreeding coefficients required accurate estimates of allele frequencies in the base population. Linear model predictions of breeding values were computed by 3 equivalent methods: 1) iteration for individual allele effects followed by summation across loci to obtain estimated breeding values, 2) selection index including a genomic relationship matrix, and 3) mixed model equations including the inverse of genomic relationships. A blend of first- and second-order Jacobi iteration using 2 separate relaxation factors converged well for allele frequencies and effects. Reliability of predicted net merit for young bulls was 63% compared with 32% using the traditional relationship matrix. Nonlinear predictions were also computed using iteration on data and nonlinear regression on marker deviations; an additional (about 3%) gain in reliability for young bulls increased average reliability to 66%. Computing times increased linearly with number of genotypes. Estimation of allele frequencies required 2 processor days, and genomic predictions required <1 d per trait, and traits were processed in parallel. Information from genotyping was equivalent to about 20 daughters with phenotypic records. Actual gains may differ because the simulation did not account for linkage disequilibrium in the base population or selection in subsequent generations.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            The Pfam protein families database: towards a more sustainable future

            In the last two years the Pfam database (http://pfam.xfam.org) has undergone a substantial reorganisation to reduce the effort involved in making a release, thereby permitting more frequent releases. Arguably the most significant of these changes is that Pfam is now primarily based on the UniProtKB reference proteomes, with the counts of matched sequences and species reported on the website restricted to this smaller set. Building families on reference proteomes sequences brings greater stability, which decreases the amount of manual curation required to maintain them. It also reduces the number of sequences displayed on the website, whilst still providing access to many important model organisms. Matches to the full UniProtKB database are, however, still available and Pfam annotations for individual UniProtKB sequences can still be retrieved. Some Pfam entries (1.6%) which have no matches to reference proteomes remain; we are working with UniProt to see if sequences from them can be incorporated into reference proteomes. Pfam-B, the automatically-generated supplement to Pfam, has been removed. The current release (Pfam 29.0) includes 16 295 entries and 559 clans. The facility to view the relationship between families within a clan has been improved by the introduction of a new tool.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Julia: A Fresh Approach to Numerical Computing

                Bookmark

                Author and article information

                Journal
                G3 (Bethesda)
                Genetics
                G3: Genes, Genomes, Genetics
                G3: Genes, Genomes, Genetics
                G3: Genes, Genomes, Genetics
                G3: Genes|Genomes|Genetics
                Genetics Society of America
                2160-1836
                23 July 2020
                September 2020
                : 10
                : 9
                : 3137-3145
                Affiliations
                [* ]Universidad de Buenos Aires, Facultad de Agronomía, Buenos Aires, Argentina
                []International Maize and Wheat Improvement Center (CIMMYT), Texcoco, Mexico
                []Center for Integrated Breeding Research, Department of Animal Sciences, University of Göttingen, Germany
                [§ ]Department of Epidemiology and Biostatistics, Michigan State University, East Lansing
                [** ]Center for Computational and Theoretical Biology (CCTB), University of Würzburg, Germany
                [†† ]National Scientific and Technical Research Council (CONICET), Argentina
                Author notes
                [1 ]Corresponding author: Universidad de Buenos Aires, Facultad de Agronomía, Av. San Martín 4453, Buenos Aires, Argentina CP 1417. E-mail: matiasfschrauf@ 123456agro.uba.ar
                Author information
                http://orcid.org/0000-0001-8841-6155
                http://orcid.org/0000-0003-0628-6794
                http://orcid.org/0000-0002-7551-3797
                http://orcid.org/0000-0001-5692-7129
                http://orcid.org/0000-0001-6282-146X
                http://orcid.org/0000-0003-0831-1463
                http://orcid.org/0000-0003-2666-2566
                Article
                GGG_401300
                10.1534/g3.120.401300
                7466977
                32709618
                0701f3aa-8fb1-4239-b9dd-2c7965c70511
                Copyright © 2020 Schrauf et al.

                This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 15 April 2020
                : 05 July 2020
                Page count
                Figures: 2, Tables: 4, Equations: 7, References: 43, Pages: 9
                Categories
                Genomic Prediction

                Genetics
                epistasis,additive effects,genomics,breeding,genpred,genomic prediction,shared data resources
                Genetics
                epistasis, additive effects, genomics, breeding, genpred, genomic prediction, shared data resources

                Comments

                Comment on this article