20
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Model based heritability scores for high-throughput sequencing data

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Heritability of a phenotypic or molecular trait measures the proportion of variance that is attributable to genotypic variance. It is an important concept in breeding and genetics. Few methods are available for calculating heritability for traits derived from high-throughput sequencing.

          Results

          We propose several statistical models and different methods to compute and test a heritability measure for such data based on linear and generalized linear mixed effects models. We also provide methodology for hypothesis testing and interval estimation. Our analyses show that, among the methods, the negative binomial mixed model (NB-fit), compound Poisson mixed model (CP-fit), and the variance stabilizing transformed linear mixed model (VST) outperform the voom-transformed linear mixed model (voom). NB-fit and VST appear to be more robust than CP-fit for estimating and testing the heritability scores, while NB-fit is the most computationally expensive. CP-fit performed best in terms of the coverage of the confidence intervals. In addition, we applied the methods to both microRNA (miRNA) and messenger RNA (mRNA) sequencing datasets from a recombinant inbred mouse panel. We show that miRNA and mRNA expression can be a highly heritable molecular trait in mouse, and that some top heritable features coincide with expression quantitative trait loci.

          Conclusions

          The models and methods we investigated in this manuscript is applicable and extendable to sequencing experiments where some biological replicates are available and the environmental variation is properly controlled. The CP-fit approach for assessing heritability was implemented for the first time to our knowledge. All the methods presented, as well as the generation of simulated sequencing data under either negative binomial or compound Poisson mixed models, are provided in the R package HeritSeq.

          Electronic supplementary material

          The online version of this article (doi:10.1186/s12859-017-1539-6) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references27

          • Record: found
          • Abstract: found
          • Article: not found

          A genome-wide association study of global gene expression.

          We have created a global map of the effects of polymorphism on gene expression in 400 children from families recruited through a proband with asthma. We genotyped 408,273 SNPs and identified expression quantitative trait loci from measurements of 54,675 transcripts representing 20,599 genes in Epstein-Barr virus-transformed lymphoblastoid cell lines. We found that 15,084 transcripts (28%) representing 6,660 genes had narrow-sense heritabilities (H2) > 0.3. We executed genome-wide association scans for these traits and found peak lod scores between 3.68 and 59.1. The most highly heritable traits were markedly enriched in Gene Ontology descriptors for response to unfolded protein (chaperonins and heat shock proteins), regulation of progression through the cell cycle, RNA processing, DNA repair, immune responses and apoptosis. SNPs that regulate expression of these genes are candidates in the study of degenerative diseases, malignancy, infection and inflammation. We have created a downloadable database to facilitate use of our findings in the mapping of complex disease loci.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            The Method of Path Coefficients

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Molecular networks as sensors and drivers of common human diseases.

              The molecular biology revolution led to an intense focus on the study of interactions between DNA, RNA and protein biosynthesis in order to develop a more comprehensive understanding of the cell. One consequence of this focus was a reduced attention to whole-system physiology, making it difficult to link molecular biology to clinical medicine. Equipped with the tools emerging from the genomics revolution, we are now in a position to link molecular states to physiological ones through the reverse engineering of molecular networks that sense DNA and environmental perturbations and, as a result, drive variations in physiological states associated with disease.
                Bookmark

                Author and article information

                Contributors
                pratyaydipta.rudra@ucdenver.edu
                wen.2.shi@ucdenver.edu
                brian.vestal@ucdenver.edu
                pamela.russell.ucdenver@gmail.com
                aarono@uoregon.edu
                robin.dowell@colorado.edu
                richard.radcliffe@ucdenver.edu
                laura.saba@ucdenver.edu
                katerina.kechris@ucdenver.edu
                Journal
                BMC Bioinformatics
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central (London )
                1471-2105
                2 March 2017
                2 March 2017
                2017
                : 18
                : 143
                Affiliations
                [1 ]ISNI 0000 0001 0703 675X, GRID grid.430503.1, Department of Biostatistics and Informatics, , University of Colorado School of Public Health, ; Aurora, CO 80045 USA
                [2 ]ISNI 0000 0001 0703 675X, GRID grid.430503.1, Computational Bioscience Program, , University of Colorado School of Medicine, ; Aurora, CO 80045 USA
                [3 ]ISNI 0000 0004 1936 8008, GRID grid.170202.6, Department of Biology, , University of Oregon, ; Eugene, OR USA
                [4 ]ISNI 0000000096214564, GRID grid.266190.a, Department of Molecular, Cellular and Developmental Biology, , University of Colorado at Boulder, ; Boulder, CO 80303 USA
                [5 ]BioFrontiers Institute, Boulder, CO 80303 USA
                [6 ]ISNI 0000 0001 0703 675X, GRID grid.430503.1, Department of Pharmaceutical Sciences, , University of Colorado Skaggs School of Pharmaceutical Sciences, ; Aurora, CO 80045 USA
                Article
                1539
                10.1186/s12859-017-1539-6
                5333443
                28253840
                9d76d122-de94-42d4-a508-70cbf45c9f5d
                © The Author(s) 2017

                Open Access This article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                History
                : 21 October 2016
                : 7 February 2017
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/100000027, National Institute on Alcohol Abuse and Alcoholism;
                Award ID: R01AA021131
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/100000027, National Institute on Alcohol Abuse and Alcoholism;
                Award ID: R01AA016957
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/100000092, U.S. National Library of Medicine;
                Award ID: T15LM009451
                Award Recipient :
                Categories
                Methodology Article
                Custom metadata
                © The Author(s) 2017

                Bioinformatics & Computational biology
                heritability,rnaseq,recombinant inbred panel,negative binomial mixed model,compound poisson mixed model,variance partition coefficient

                Comments

                Comment on this article