22
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Predicting the clinical impact of human mutation with deep neural networks

      research-article

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Millions of human genomes and exomes have been sequenced, but their clinical applications remain limited due to the difficulty of distinguishing disease-causing mutations from benign genetic variation. Here we demonstrate that common missense variants in other primate species are largely clinically benign in human, enabling pathogenic mutations to be systematically identified by process of elimination. Using hundreds of thousands of common variants from population sequencing of six non-human primate species, we train a deep neural network that identifies pathogenic mutations in rare disease patients with 88% accuracy, and enables the discovery of 14 new candidate genes in intellectual disability at genome-wide significance. Cataloging common variation from additional primate species would improve interpretation for millions of variants of uncertain significance, further advancing the clinical utility of human genome sequencing.

          Related collections

          Most cited references34

          • Record: found
          • Abstract: found
          • Article: not found

          Amino acid substitution matrices from protein blocks.

          Methods for alignment of protein sequences typically measure similarity by using a substitution matrix with scores for all possible exchanges of one amino acid with another. The most widely used matrices are based on the Dayhoff model of evolutionary rates. Using a different approach, we have derived substitution matrices from about 2000 blocks of aligned sequence segments characterizing more than 500 groups of related proteins. This led to marked improvements in alignments and in searches using queries from each of the groups.
            Bookmark
            • Record: found
            • Abstract: not found
            • Book Chapter: not found

            Identity Mappings in Deep Residual Networks

              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              De novo gene disruptions in children on the autistic spectrum.

              Exome sequencing of 343 families, each with a single child on the autism spectrum and at least one unaffected sibling, reveal de novo small indels and point substitutions, which come mostly from the paternal line in an age-dependent manner. We do not see significantly greater numbers of de novo missense mutations in affected versus unaffected children, but gene-disrupting mutations (nonsense, splice site, and frame shifts) are twice as frequent, 59 to 28. Based on this differential and the number of recurrent and total targets of gene disruption found in our and similar studies, we estimate between 350 and 400 autism susceptibility genes. Many of the disrupted genes in these studies are associated with the fragile X protein, FMRP, reinforcing links between autism and synaptic plasticity. We find FMRP-associated genes are under greater purifying selection than the remainder of genes and suggest they are especially dosage-sensitive targets of cognitive disorders. Copyright © 2012 Elsevier Inc. All rights reserved.
                Bookmark

                Author and article information

                Journal
                9216904
                2419
                Nat Genet
                Nat. Genet.
                Nature genetics
                1061-4036
                1546-1718
                31 May 2018
                23 July 2018
                August 2018
                23 January 2019
                : 50
                : 8
                : 1161-1170
                Affiliations
                [1 ]Illumina Artificial Intelligence Laboratory, Illumina Inc, San Diego, California, USA
                [2 ]Department of Computer Science, Stanford University, Stanford, California, USA
                [3 ]National Science Foundation Center for Big Learning, University of Florida, Gainesville, Florida, USA
                [4 ]Analytic and Translational Genetics Unit (ATGU), Department of Medicine, Massachusetts General Hospital and Harvard Medical School, Boston Massachusetts, USA
                [5 ]Toyota Technological Institute at Chicago, Chicago, Illinois, USA
                Author notes
                []Correspondence to: kfarh@ 123456illumina.com
                [*]

                These authors contributed equally to this work.

                Correspondence and requests for materials should be addressed to K.K.F. ( kfarh@ 123456illumina.com ).

                Article
                NIHMS971466
                10.1038/s41588-018-0167-z
                6237276
                30038395
                5e55310d-51e5-4c56-9b6b-cc79deb57da2

                Users may view, print, copy, and download text and data-mine the content in such documents, for the purposes of academic research, subject always to the full Conditions of use: http://www.nature.com/authors/editorial_policies/license.html#terms

                History
                Categories
                Article

                Genetics
                Genetics

                Comments

                Comment on this article