1
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      GenOtoScope: Towards automating ACMG classification of variants associated with congenital hearing loss

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Since next-generation sequencing (NGS) has become widely available, large gene panels containing up to several hundred genes can be sequenced cost-efficiently. However, the interpretation of the often large numbers of sequence variants detected when using NGS is laborious, prone to errors and is often difficult to compare across laboratories. To overcome this challenge, the American College of Medical Genetics and Genomics and the Association for Molecular Pathology (ACMG/AMP) have introduced standards and guidelines for the interpretation of sequencing variants. Additionally, disease-specific refinements have been developed that include accurate thresholds for many criteria, enabling highly automated processing. This is of particular interest for common but heterogeneous disorders such as hearing impairment. With more than 200 genes associated with hearing disorders, the manual inspection of possible causative variants is particularly difficult and time-consuming.

          To this end, we developed the open-source bioinformatics tool GenOtoScope, which automates the analysis of all ACMG/AMP criteria that can be assessed without further individual patient information or human curator investigation, including the refined loss of function criterion (“PVS1”). Two types of interfaces are provided: (i) a command line application to classify sequence variants in batches for a set of patients and (ii) a user-friendly website to classify single variants.

          We compared the performance of our tool with two other variant classification tools using two hearing loss data sets, which were manually annotated either by the ClinGen Hearing Loss Gene Curation Expert Panel or the diagnostics unit of our human genetics department. GenOtoScope achieved the best average accuracy and precision for both data sets. Compared to the second-best tool, GenOtoScope improved the accuracy metric by 25.75% and 4.57% and precision metric by 52.11% and 12.13% on the two data sets, respectively. The web interface is accessible via: http://genotoscope.mh-hannover.de:5000 and the command line interface via: https://github.com/damianosmel/GenOtoScope.

          Author summary

          New high-throughput sequencing technologies can produce massive amounts of information and are used by laboratories to explain the often complex genetic etiology of hereditary diseases. To use these sequencing technologies effectively, software tools have been developed that can aid researchers interpreting genetic data by semi-automatically classifying the biologic (and, thus, potentially medical) impact of the detected variants (i.e. alterations of the patient’s genome compared to the human reference genome). Variant classification itself is a very complex process: various information has to be taken into account and carefully judged, allowing room for interpretation. Over the last decade, the quality of genetic analysis evaluation and the consistency between different laboratories have improved substantially. This is in particular a result of increasing standardization, which extends to gene- or disease-specific considerations. Hereby, a largely uniform variant interpretation is possible even for common but heterogeneous disorders.

          To design a reliable tool that can accommodate these precise specifications, we chose hearing loss as a model disease because it is the most common sensory disorder, often hereditary and has a high impact on patients every-day lifes. The currently available genetic variant classification tools are either not designed specifically for the interpretation of variants detected in subjects with hearing loss or they do not allow researchers to use them for batch classification of all variants detected, e.g. in a study group. To address this drawback, we developed GenOtoScope, an open-source tool that automates the pathogenicity classification of variants potentially associated with congenital hearing loss. By adjusting specific parameters, the algorithm could also be easily adapted to other medical conditions. GenOtoScope can be applied for the automatic classification of all variants detected in a set of individuals.

          Related collections

          Most cited references34

          • Record: found
          • Abstract: found
          • Article: not found

          Standards and Guidelines for the Interpretation of Sequence Variants: A Joint Consensus Recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology

          The American College of Medical Genetics and Genomics (ACMG) previously developed guidance for the interpretation of sequence variants. 1 In the past decade, sequencing technology has evolved rapidly with the advent of high-throughput next generation sequencing. By adopting and leveraging next generation sequencing, clinical laboratories are now performing an ever increasing catalogue of genetic testing spanning genotyping, single genes, gene panels, exomes, genomes, transcriptomes and epigenetic assays for genetic disorders. By virtue of increased complexity, this paradigm shift in genetic testing has been accompanied by new challenges in sequence interpretation. In this context, the ACMG convened a workgroup in 2013 comprised of representatives from the ACMG, the Association for Molecular Pathology (AMP) and the College of American Pathologists (CAP) to revisit and revise the standards and guidelines for the interpretation of sequence variants. The group consisted of clinical laboratory directors and clinicians. This report represents expert opinion of the workgroup with input from ACMG, AMP and CAP stakeholders. These recommendations primarily apply to the breadth of genetic tests used in clinical laboratories including genotyping, single genes, panels, exomes and genomes. This report recommends the use of specific standard terminology: ‘pathogenic’, ‘likely pathogenic’, ‘uncertain significance’, ‘likely benign’, and ‘benign’ to describe variants identified in Mendelian disorders. Moreover, this recommendation describes a process for classification of variants into these five categories based on criteria using typical types of variant evidence (e.g. population data, computational data, functional data, segregation data, etc.). Because of the increased complexity of analysis and interpretation of clinical genetic testing described in this report, the ACMG strongly recommends that clinical molecular genetic testing should be performed in a CLIA-approved laboratory with results interpreted by a board-certified clinical molecular geneticist or molecular genetic pathologist or equivalent.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            The mutational constraint spectrum quantified from variation in 141,456 humans

            Genetic variants that inactivate protein-coding genes are a powerful source of information about the phenotypic consequences of gene disruption: genes that are crucial for the function of an organism will be depleted of such variants in natural populations, whereas non-essential genes will tolerate their accumulation. However, predicted loss-of-function variants are enriched for annotation errors, and tend to be found at extremely low frequencies, so their analysis requires careful variant annotation and very large sample sizes 1 . Here we describe the aggregation of 125,748 exomes and 15,708 genomes from human sequencing studies into the Genome Aggregation Database (gnomAD). We identify 443,769 high-confidence predicted loss-of-function variants in this cohort after filtering for artefacts caused by sequencing and annotation errors. Using an improved model of human mutation rates, we classify human protein-coding genes along a spectrum that represents tolerance to inactivation, validate this classification using data from model organisms and engineered human cells, and show that it can be used to improve the power of gene discovery for both common and rare diseases.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data

              High-throughput sequencing platforms are generating massive amounts of genetic variation data for diverse genomes, but it remains a challenge to pinpoint a small subset of functionally important variants. To fill these unmet needs, we developed the ANNOVAR tool to annotate single nucleotide variants (SNVs) and insertions/deletions, such as examining their functional consequence on genes, inferring cytogenetic bands, reporting functional importance scores, finding variants in conserved regions, or identifying variants reported in the 1000 Genomes Project and dbSNP. ANNOVAR can utilize annotation databases from the UCSC Genome Browser or any annotation data set conforming to Generic Feature Format version 3 (GFF3). We also illustrate a ‘variants reduction’ protocol on 4.7 million SNVs and indels from a human genome, including two causal mutations for Miller syndrome, a rare recessive disease. Through a stepwise procedure, we excluded variants that are unlikely to be causal, and identified 20 candidate genes including the causal gene. Using a desktop computer, ANNOVAR requires ∼4 min to perform gene-based annotation and ∼15 min to perform variants reduction on 4.7 million variants, making it practical to handle hundreds of human genomes in a day. ANNOVAR is freely available at http://www.openbioinformatics.org/annovar/ .
                Bookmark

                Author and article information

                Contributors
                Role: Formal analysisRole: InvestigationRole: MethodologyRole: SoftwareRole: ValidationRole: VisualizationRole: Writing – original draftRole: Writing – review & editing
                Role: Data curationRole: InvestigationRole: Project administrationRole: SupervisionRole: ValidationRole: Writing – review & editing
                Role: Data curationRole: Formal analysisRole: InvestigationRole: SupervisionRole: Writing – review & editing
                Role: Data curationRole: InvestigationRole: SupervisionRole: ValidationRole: Writing – review & editing
                Role: Data curationRole: Writing – review & editing
                Role: ConceptualizationRole: Funding acquisitionRole: Writing – review & editing
                Role: Funding acquisitionRole: Project administration
                Role: ConceptualizationRole: Funding acquisitionRole: InvestigationRole: Project administrationRole: SupervisionRole: Writing – review & editing
                Role: Editor
                Journal
                PLoS Comput Biol
                PLoS Comput Biol
                plos
                PLoS Computational Biology
                Public Library of Science (San Francisco, CA USA )
                1553-734X
                1553-7358
                September 2022
                21 September 2022
                : 18
                : 9
                : e1009785
                Affiliations
                [1 ] L3S Research Center, Leibniz University Hannover, Hannover, Germany
                [2 ] Department of Human Genetics, Hannover Medical School, Hannover, Germany
                [3 ] Department of Otorhinolaryngology, Hannover Medical School, Hannover, Germany
                [4 ] Hearing4all Cluster of Excellence, Hannover Medical School, Hannover, Germany
                [5 ] Knowledge-based Systems Laboratory, Leibniz University Hannover, Hannover, Germany
                Hebrew University of Jerusalem, ISRAEL
                Author notes

                The authors have declared that no competing interests exist.

                ‡ WN and BA also contributed equally to this work.

                Author information
                https://orcid.org/0000-0001-8503-1962
                https://orcid.org/0000-0001-7165-7294
                https://orcid.org/0000-0002-3680-7450
                https://orcid.org/0000-0003-3374-2193
                https://orcid.org/0000-0003-1880-291X
                Article
                PCOMPBIOL-D-21-02304
                10.1371/journal.pcbi.1009785
                9529123
                36129964
                199023fc-c0a3-48e5-bcef-c3a8701ef5c3
                © 2022 Melidis et al

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 26 December 2021
                : 22 August 2022
                Page count
                Figures: 12, Tables: 3, Pages: 24
                Funding
                Funded by: funder-id http://dx.doi.org/10.13039/100011937, Niedersächsische Ministerium für Wissenschaft und Kultur;
                Award ID: ZN3429
                Award Recipient :
                The authors would like to acknowledge the financial support through the project Understanding Cochlear Implant Outcome Variability using Big Data and Machine Learning Approaches, project id: ZN3429, funded by Volkswagen Foundation, through the Ministry for Science and Culture of Lower Saxony Germany (MWK: Ministerium fuer Wissenschaft und Kultur). SvH, ALS, WN and BA received funding. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Medicine and Health Sciences
                Pathology and Laboratory Medicine
                Pathogenesis
                Biology and Life Sciences
                Computational Biology
                Genome Analysis
                Genome Annotation
                Biology and Life Sciences
                Genetics
                Genomics
                Genome Analysis
                Genome Annotation
                Medicine and Health Sciences
                Otorhinolaryngology
                Otology
                Hearing Disorders
                Deafness
                Computer and Information Sciences
                Computer Applications
                Web-Based Applications
                Biology and Life Sciences
                Genetics
                Genomics
                Research and Analysis Methods
                Database and Informatics Methods
                Bioinformatics
                Research and Analysis Methods
                Database and Informatics Methods
                Biological Databases
                Genomic Databases
                Biology and Life Sciences
                Computational Biology
                Genome Analysis
                Genomic Databases
                Biology and Life Sciences
                Genetics
                Genomics
                Genome Analysis
                Genomic Databases
                Biology and Life Sciences
                Biochemistry
                Proteins
                Protein Domains
                Custom metadata
                vor-update-to-uncorrected-proof
                2022-10-03
                The code is accessible for the following personal repository: https://github.com/damianosmel/GenOtoScope Please find how to install and use the genotoscope command line interface from respective sections of the wiki page: https://github.com/damianosmel/genotoscope_wiki/wiki Please run the genotoscope with example inputs that can be found on the folder (of the first repository): https://github.com/damianosmel/GenOtoScope/tree/main/toy_dataset The web interface is freely accessible via: http://genotoscope.mh-hannover.de:5000/home.

                Quantitative & Systems biology
                Quantitative & Systems biology

                Comments

                Comment on this article