10
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Improving the Identification of Phenotypic Abnormalities and Sexual Dimorphism in Mice When Studying Rare Event Categorical Characteristics

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Biological research frequently involves the study of phenotyping data. Many of these studies focus on rare event categorical data, and functional genomics studies typically study the presence or absence of an abnormal phenotype. With the growing interest in the role of sex, there is a need to assess the phenotype for sexual dimorphism. The identification of abnormal phenotypes for downstream research is challenged by the small sample size, the rare event nature, and the multiple testing problem, as many variables are monitored simultaneously. Here, we develop a statistical pipeline to assess statistical and biological significance while managing the multiple testing problem. We propose a two-step pipeline to initially assess for a treatment effect, in our case example genotype, and then test for an interaction with sex. We compare multiple statistical methods and use simulations to investigate the control of the type-one error rate and power. To maximize the power while addressing the multiple testing issue, we implement filters to remove data sets where the hypotheses to be tested cannot achieve significance. A motivating case study utilizing a large scale high-throughput mouse phenotyping data set from the Wellcome Trust Sanger Institute Mouse Genetics Project, where the treatment is a gene ablation, demonstrates the benefits of the new pipeline on the downstream biological calls.

          Related collections

          Most cited references6

          • Record: found
          • Abstract: found
          • Article: not found

          Interval estimation for the difference between independent proportions: comparison of eleven methods

          Several existing unconditional methods for setting confidence intervals for the difference between binomial proportions are evaluated. Computationally simpler methods are prone to a variety of aberrations and poor coverage properties. The closely interrelated methods of Mee and Miettinen and Nurminen perform well but require a computer program. Two new approaches which also avoid aberrations are developed and evaluated. A tail area profile likelihood based method produces the best coverage properties, but is difficult to calculate for large denominators. A method combining Wilson score intervals for the two proportions to be compared also performs well, and is readily implemented irrespective of sample size.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Independent filtering increases detection power for high-throughput experiments.

            With high-dimensional data, variable-by-variable statistical testing is often used to select variables whose behavior differs across conditions. Such an approach requires adjustment for multiple testing, which can result in low statistical power. A two-stage approach that first filters variables by a criterion independent of the test statistic, and then only tests variables which pass the filter, can provide higher power. We show that use of some filter/test statistics pairs presented in the literature may, however, lead to loss of type I error control. We describe other pairs which avoid this problem. In an application to microarray data, we found that gene-by-gene filtering by overall variance followed by a t-test increased the number of discoveries by 50%. We also show that this particular statistic pair induces a lower bound on fold-change among the set of discoveries. Independent filtering-using filter/test pairs that are independent under the null hypothesis but correlated under the alternative-is a general approach that can substantially increase the efficiency of experiments.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              A modified Bonferroni method for discrete data.

              R Tarone (1990)
              The Bonferroni adjustment for multiple comparisons is a simple and useful method of controlling the overall false positive error rate when several significance tests are performed in the evaluation of an experiment. In situations with categorical data, the test statistics have discrete distributions. The discreteness of the null distributions can be exploited to reduce the number of significance tests taken into account in the Bonferroni procedure. This reduction is accomplished by using only the information contained in the marginal totals.
                Bookmark

                Author and article information

                Journal
                Genetics
                Genetics
                genetics
                genetics
                genetics
                Genetics
                Genetics Society of America
                0016-6731
                1943-2631
                February 2017
                5 December 2016
                5 December 2016
                : 205
                : 2
                : 491-501
                Affiliations
                [* ]Mouse Informatics Group, Wellcome Trust Sanger Institute, Cambridge, CB10 1SA, United Kingdom
                []Department of Statistics and Operations Research, School of Mathematical Sciences, Tel Aviv University, Israel
                []Mouse Genetics Project, Wellcome Trust Sanger Institute, Cambridge, CB10 1SA, United Kingdom
                [§ ]The Sagol School of Neuroscience, Tel Aviv University, 69978 Israel
                Author notes
                [1 ]Corresponding author: Darwin Building (Unit 310), Cambridge Science Park, Milton Road, Cambridge, CB4 0WG, United Kingdom. E-mail: natasha.karp@ 123456astrazeneca.com
                Author information
                http://orcid.org/0000-0002-8404-2907
                Article
                195388
                10.1534/genetics.116.195388
                5289831
                27932544
                c301efdb-442f-43e9-91c6-7a2a693ed5ca
                Copyright © 2017 Karp et al.

                Available freely online through the author-supported open access option.

                This is an open-access article distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 30 August 2016
                : 11 November 2016
                Page count
                Figures: 5, Tables: 3, Equations: 0, References: 33, Pages: 11
                Categories
                Investigations
                Methods, Technology, and Resources

                Genetics
                gene–phenotype map,mouse models,multiple testing,rare events,sexual dimorphism
                Genetics
                gene–phenotype map, mouse models, multiple testing, rare events, sexual dimorphism

                Comments

                Comment on this article