2
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      CADA: phenotype-driven gene prioritization based on a case-enriched knowledge graph

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Many rare syndromes can be well described and delineated from other disorders by a combination of characteristic symptoms. These phenotypic features are best documented with terms of the Human Phenotype Ontology (HPO), which are increasingly used in electronic health records (EHRs), too. Many algorithms that perform HPO-based gene prioritization have also been developed; however, the performance of many such tools suffers from an over-representation of atypical cases in the medical literature. This is certainly the case if the algorithm cannot handle features that occur with reduced frequency in a disorder. With Cada, we built a knowledge graph based on both case annotations and disorder annotations. Using network representation learning, we achieve gene prioritization by link prediction. Our results suggest that Cada exhibits superior performance particularly for patients that present with the pathognomonic findings of a disease. Additionally, information about the frequency of occurrence of a feature can readily be incorporated, when available. Crucial in the design of our approach is the use of the growing amount of phenotype–genotype information that diagnostic labs deposit in databases such as ClinVar. By this means, Cada is an ideal reference tool for differential diagnostics in rare disorders that can also be updated regularly.

          Related collections

          Most cited references22

          • Record: found
          • Abstract: not found
          • Conference Proceedings: not found

          Optuna : A Next-generation Hyperparameter Optimization Framework

            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Phenolyzer: phenotype-based prioritization of candidate genes for human diseases.

            Prior biological knowledge and phenotype information may help to identify disease genes from human whole-genome and whole-exome sequencing studies. We developed Phenolyzer (http://phenolyzer.usc.edu), a tool that uses prior information to implicate genes involved in diseases. Phenolyzer exhibits superior performance over competing methods for prioritizing Mendelian and complex disease genes, based on disease or phenotype terms entered as free text.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Clinical diagnostics in human genetics with semantic similarity searches in ontologies.

              The differential diagnostic process attempts to identify candidate diseases that best explain a set of clinical features. This process can be complicated by the fact that the features can have varying degrees of specificity, as well as by the presence of features unrelated to the disease itself. Depending on the experience of the physician and the availability of laboratory tests, clinical abnormalities may be described in greater or lesser detail. We have adapted semantic similarity metrics to measure phenotypic similarity between queries and hereditary diseases annotated with the use of the Human Phenotype Ontology (HPO) and have developed a statistical model to assign p values to the resulting similarity scores, which can be used to rank the candidate diseases. We show that our approach outperforms simpler term-matching approaches that do not take the semantic interrelationships between terms into account. The advantage of our approach was greater for queries containing phenotypic noise or imprecise clinical descriptions. The semantic network defined by the HPO can be used to refine the differential diagnosis by suggesting clinical features that, if present, best differentiate among the candidate diagnoses. Thus, semantic similarity searches in ontologies represent a useful way of harnessing the semantic structure of human phenotypic abnormalities to help with the differential diagnosis. We have implemented our methods in a freely available web application for the field of human Mendelian disorders.
                Bookmark

                Author and article information

                Contributors
                Journal
                NAR Genom Bioinform
                NAR Genom Bioinform
                nargab
                NAR Genomics and Bioinformatics
                Oxford University Press
                2631-9268
                September 2021
                03 September 2021
                03 September 2021
                : 3
                : 3
                : lqab078
                Affiliations
                Institute for Genomic Statistics, University Bonn , 53129 Bonn, Germany
                Institute for Genomic Statistics, University Bonn , 53129 Bonn, Germany
                Institute for Genomic Statistics, University Bonn , 53129 Bonn, Germany
                Fraunhofer SCAI, Department of Bioinformatics , 53757 Sankt Augustin, Germany
                Institute for Genomic Statistics, University Bonn , 53129 Bonn, Germany
                Genetikum Counseling Center , 70173 Stuttgart, Germany
                Genetikum Counseling Center , 70173 Stuttgart, Germany
                Genetikum Counseling Center , 70173 Stuttgart, Germany
                Institute of Medical Genetics and Applied Genomics, University Tübingen , 72076 Tübingen, Germany
                Institute of Medical Genetics and Applied Genomics, University Tübingen , 72076 Tübingen, Germany
                Institute for Human Genetics, Technical University Munich , 81675 Munich, Germany
                Institute for Human Genetics, Technical University Munich , 81675 Munich, Germany
                Institute for Medical Genetics, Charité University Medicine , 13353 Berlin, Germany
                Institute for Medical Genetics, Charité University Medicine , 13353 Berlin, Germany
                GeneTalk GmbH , 53129 Bonn, Germany
                GeneTalk GmbH , 53129 Bonn, Germany
                FDNA Inc , FL 33325 Sunrise, USA
                FDNA Inc , FL 33325 Sunrise, USA
                Fraunhofer SCAI, Department of Bioinformatics , 53757 Sankt Augustin, Germany
                Bonn-Aachen International Center for IT, University Bonn , 53115 Bonn, Germany
                Author notes
                To whom correspondence should be addressed. Tel: +49 228 287 14733; Email: pkrawitz@ 123456uni-bonn.de
                Author information
                https://orcid.org/0000-0002-9765-4400
                https://orcid.org/0000-0003-0047-0670
                https://orcid.org/0000-0002-4454-8823
                https://orcid.org/0000-0002-3194-8625
                Article
                lqab078
                10.1093/nargab/lqab078
                8415429
                34514393
                8c6b54d2-ecf6-43bc-b06d-9ffbf60a85a8
                © The Author(s) 2021. Published by Oxford University Press on behalf of NAR Genomics and Bioinformatics.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution-NonCommercial License ( https://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@ 123456oup.com

                History
                : 05 March 2021
                : 16 August 2021
                : 31 August 2021
                Page count
                Pages: 7
                Categories
                AcademicSubjects/SCI00030
                AcademicSubjects/SCI00980
                AcademicSubjects/SCI01060
                AcademicSubjects/SCI01140
                AcademicSubjects/SCI01180
                Methods Article

                Comments

                Comment on this article