10
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Using deep learning to associate human genes with age-related diseases

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Motivation

          One way to identify genes possibly associated with ageing is to build a classification model (from the machine learning field) capable of classifying genes as associated with multiple age-related diseases. To build this model, we use a pre-compiled list of human genes associated with age-related diseases and apply a novel Deep Neural Network (DNN) method to find associations between gene descriptors (e.g. Gene Ontology terms, protein–protein interaction data and biological pathway information) and age-related diseases.

          Results

          The novelty of our new DNN method is its modular architecture, which has the capability of combining several sources of biological data to predict which ageing-related diseases a gene is associated with (if any). Our DNN method achieves better predictive performance than standard DNN approaches, a Gradient Boosted Tree classifier (a strong baseline method) and a Logistic Regression classifier. Given the DNN model produced by our method, we use two approaches to identify human genes that are not known to be associated with age-related diseases according to our dataset. First, we investigate genes that are close to other disease-associated genes in a complex multi-dimensional feature space learned by the DNN algorithm. Second, using the class label probabilities output by our DNN approach, we identify genes with a high probability of being associated with age-related diseases according to the model. We provide evidence of these putative associations retrieved from the DNN model with literature support.

          Availability and implementation

          The source code and datasets can be found at: https://github.com/fabiofabris/Bioinfo2019.

          Supplementary information

          Supplementary data are available at Bioinformatics online.

          Related collections

          Most cited references21

          • Record: found
          • Abstract: not found
          • Article: not found

          The genetic association database.

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Diversity of ageing across the tree of life.

            Evolution drives, and is driven by, demography. A genotype moulds its phenotype's age patterns of mortality and fertility in an environment; these two patterns in turn determine the genotype's fitness in that environment. Hence, to understand the evolution of ageing, age patterns of mortality and reproduction need to be compared for species across the tree of life. However, few studies have done so and only for a limited range of taxa. Here we contrast standardized patterns over age for 11 mammals, 12 other vertebrates, 10 invertebrates, 12 vascular plants and a green alga. Although it has been predicted that evolution should inevitably lead to increasing mortality and declining fertility with age after maturity, there is great variation among these species, including increasing, constant, decreasing, humped and bowed trajectories for both long- and short-lived species. This diversity challenges theoreticians to develop broader perspectives on the evolution of ageing and empiricists to study the demography of more species.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              A dirty dozen: twelve p-value misconceptions.

              The P value is a measure of statistical evidence that appears in virtually all medical research papers. Its interpretation is made extraordinarily difficult because it is not part of any formal system of statistical inference. As a result, the P value's inferential meaning is widely and often wildly misconstrued, a fact that has been pointed out in innumerable papers and books appearing since at least the 1940s. This commentary reviews a dozen of these common misinterpretations and explains why each is wrong. It also reviews the possible consequences of these improper understandings or representations of its meaning. Finally, it contrasts the P value with its Bayesian counterpart, the Bayes' factor, which has virtually all of the desirable properties of an evidential measure that the P value lacks, most notably interpretability. The most serious consequence of this array of P-value misconceptions is the false belief that the probability of a conclusion being in error can be calculated from the data in a single experiment without reference to external evidence or the plausibility of the underlying mechanism.
                Bookmark

                Author and article information

                Contributors
                Role: Associate Editor
                Journal
                Bioinformatics
                Bioinformatics
                bioinformatics
                Bioinformatics
                Oxford University Press
                1367-4803
                1367-4811
                01 April 2020
                17 December 2019
                17 December 2019
                : 36
                : 7
                : 2202-2208
                Affiliations
                [1 ] School of Computing, University of Kent , Canterbury, Kent CT2 7NF, UK
                [2 ] Integrative Genomics of Ageing Group, Institute of Ageing and Chronic Disease, University of Liverpool , Liverpool L7 8TX, UK
                Author notes
                To whom correspondence should be addressed. E-mail: A.A.Freitas@ 123456kent.ac.uk
                Article
                btz887
                10.1093/bioinformatics/btz887
                7141856
                31845988
                832cdd5d-609a-479c-995b-bfd3693406b2
                © The Author(s) 2019. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 08 July 2019
                : 06 September 2019
                : 13 December 2019
                Page count
                Pages: 7
                Funding
                Funded by: Leverhulme Trust, DOI 10.13039/501100000275;
                Award ID: RPG-2016-015
                Funded by: Wellcome Trust, DOI 10.13039/100004440;
                Award ID: 208375/Z/17/Z
                Funded by: Biotechnology and Biological Sciences Research Council, DOI 10.13039/501100000268;
                Award ID: BB/R014949/1
                Categories
                Original Papers
                Data and Text Mining

                Bioinformatics & Computational biology
                Bioinformatics & Computational biology

                Comments

                Comment on this article