661
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Increasing the Yield in Targeted Next-Generation Sequencing by Implicating CNV Analysis, Non-Coding Exons and the Overall Variant Load: The Example of Retinal Dystrophies

      research-article
      1 , 1 , 2 , 1 , 3 , 3 , 1 , 4 , 4 , 4 , 5 , 6 , 2 , 6 , 7 , 8 , 9 , 10 , 11 , 12 , 13 , 14 , 15 , 16 , 17 , 7 , 8 , 18 , 19 , 20 , 20 , 21 , 22 , 8 , 23 , 8 , 23 , 24 , 25 , 1 , 26 , 3 , 1 , 7 , *
      PLoS ONE
      Public Library of Science

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Retinitis pigmentosa (RP) and Leber congenital amaurosis (LCA) are major causes of blindness. They result from mutations in many genes which has long hampered comprehensive genetic analysis. Recently, targeted next-generation sequencing (NGS) has proven useful to overcome this limitation. To uncover “hidden mutations” such as copy number variations (CNVs) and mutations in non-coding regions, we extended the use of NGS data by quantitative readout for the exons of 55 RP and LCA genes in 126 patients, and by including non-coding 5′ exons. We detected several causative CNVs which were key to the diagnosis in hitherto unsolved constellations, e.g. hemizygous point mutations in consanguineous families, and CNVs complemented apparently monoallelic recessive alleles. Mutations of non-coding exon 1 of EYS revealed its contribution to disease. In view of the high carrier frequency for retinal disease gene mutations in the general population, we considered the overall variant load in each patient to assess if a mutation was causative or reflected accidental carriership in patients with mutations in several genes or with single recessive alleles. For example, truncating mutations in RP1, a gene implicated in both recessive and dominant RP, were causative in biallelic constellations, unrelated to disease when heterozygous on a biallelic mutation background of another gene, or even non-pathogenic if close to the C-terminus. Patients with mutations in several loci were common, but without evidence for di- or oligogenic inheritance. Although the number of targeted genes was low compared to previous studies, the mutation detection rate was highest (70%) which likely results from completeness and depth of coverage, and quantitative data analysis. CNV analysis should routinely be applied in targeted NGS, and mutations in non-coding exons give reason to systematically include 5′-UTRs in disease gene or exome panels. Consideration of all variants is indispensable because even truncating mutations may be misleading.

          Related collections

          Most cited references40

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          NCBI Reference Sequences (RefSeq): current status, new features and genome annotation policy

          The National Center for Biotechnology Information (NCBI) Reference Sequence (RefSeq) database is a collection of genomic, transcript and protein sequence records. These records are selected and curated from public sequence archives and represent a significant reduction in redundancy compared to the volume of data archived by the International Nucleotide Sequence Database Collaboration. The database includes over 16 000 organisms, 2.4 × 106 genomic records, 13 × 106 proteins and 2 × 106 RNA records spanning prokaryotes, eukaryotes and viruses (RefSeq release 49, September 2011). The RefSeq database is maintained by a combined approach of automated analyses, collaboration and manual curation to generate an up-to-date representation of the sequence, its features, names and cross-links to related sources of information. We report here on recent growth, the status of curating the human RefSeq data set, more extensive feature annotation and current policy for eukaryotic genome annotation via the NCBI annotation pipeline. More information about the resource is available online (see http://www.ncbi.nlm.nih.gov/RefSeq/).
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            The UCSC Genome Browser database: extensions and updates 2013

            The University of California Santa Cruz (UCSC) Genome Browser (http://genome.ucsc.edu) offers online public access to a growing database of genomic sequence and annotations for a wide variety of organisms. The Browser is an integrated tool set for visualizing, comparing, analysing and sharing both publicly available and user-generated genomic datasets. As of September 2012, genomic sequence and a basic set of annotation ‘tracks’ are provided for 63 organisms, including 26 mammals, 13 non-mammal vertebrates, 3 invertebrate deuterostomes, 13 insects, 6 worms, yeast and sea hare. In the past year 19 new genome assemblies have been added, and we anticipate releasing another 28 in early 2013. Further, a large number of annotation tracks have been either added, updated by contributors or remapped to the latest human reference genome. Among these are an updated UCSC Genes track for human and mouse assemblies. We have also introduced several features to improve usability, including new navigation menus. This article provides an update to the UCSC Genome Browser database, which has been previously featured in the Database issue of this journal.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Improved splice site detection in Genie.

              We present an improved splice site predictor for the genefinding program Genie. Genie is based on a generalized Hidden Markov Model (GHMM) that describes the grammar of a legal parse of a multi-exon gene in a DNA sequence. In Genie, probabilities are estimated for gene features by using dynamic programming to combine information from multiple content and signal sensors, including sensors that integrate matches to homologous sequences from a database. One of the hardest problems in genefinding is to determine the complete gene structure correctly. The splice site sensors are the key signal sensors that address this problem. We replaced the existing splice site sensors in Genie with two novel neural networks based on dinucleotide frequencies. Using these novel sensors, Genie shows significant improvements in the sensitivity and specificity of gene structure identification. Experimental results in tests using a standard set of annotated genes showed that Genie identified 86% of coding nucleotides correctly with a specificity of 85%, versus 80% and 84% in the older system. In further splice site experiments, we also looked at correlations between splice site scores and intron and exon lengths, as well as at the effect of distance to the nearest splice site on false positive rates.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS One
                PLoS ONE
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, USA )
                1932-6203
                2013
                12 November 2013
                : 8
                : 11
                : e78496
                Affiliations
                [1 ]Bioscientia Center for Human Genetics, Ingelheim, Germany
                [2 ]Division of Pediatric Ophthalmology, King Khaled Eye Specialist Hospital, Riyadh, Saudi Arabia
                [3 ]Department of Ophthalmology, Justus-Liebig-University Giessen, University Hospital Giessen and Marburg GmbH, Giessen Campus, Giessen, Germany
                [4 ]Department of Ophthalmology, University of Bonn, Bonn, Germany
                [5 ]Human Molecular Genetics Laboratory, Health Biotechnology Division, National Institute for Biotechnology and Genetic Engineering, Faisalabad, Pakistan
                [6 ]Institute of Human Genetics, University of Lübeck, Lübeck, Germany
                [7 ]Institute of Human Genetics, University Hospital of Cologne, Cologne, Germany
                [8 ]Center for Molecular Medicine Cologne (CMMC), University of Cologne, Cologne, Germany
                [9 ]Cologne Excellence Cluster on Cellular Stress Responses in Aging-Associated Diseases (CECAD), University of Cologne, Cologne, Germany
                [10 ]MRC Holland, Amsterdam, The Netherlands
                [11 ]Comprehensive Genetic Center, Tehran University of Medical Sciences, Tehran, Iran
                [12 ]Avicenna Biotechnology Research Institute, Tehran, Iran
                [13 ]Pränatalzentrum Hamburg und Humangenetik, Hamburg, Germany
                [14 ]Institute of Clinical Genetics, Technical University Dresden, Dresden, Germany
                [15 ]Division of Human Genetics, Medical University Innsbruck, Innsbruck, Austria
                [16 ]Humangenetik, Bremen, Germany
                [17 ]Institute of Human Genetics, Westfälische Wilhelms-University, Münster, Germany
                [18 ]Praenatal-Medizin und Genetik Düsseldorf, Düsseldorf, Germany
                [19 ]Praxis für Humangenetik am DRK-Klinikum Westend, Berlin, Germany
                [20 ]Princess Al Jawhara Albrahim Center of Excellence in Research of Hereditary Disorders, King Abdulaziz University, Jeddah, Saudi Arabia
                [21 ]Medizinisch Genetisches Zentrum, Munich, Germany
                [22 ]Department of Pediatrics, University Hospital of Cologne, Cologne, Germany
                [23 ]Cologne Center for Genomics and Center for Molecular Medicine, University of Cologne, Cologne, Germany
                [24 ]Department of Ophthalmology, Zayed Military Hospital, Abu Dhabi, United Arab Emirates
                [25 ]Institute of Human Genetics, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
                [26 ]Center for Clinical Research, University Hospital of Freiburg, Freiburg, Germany
                National Eye Institute, United States of America
                Author notes

                Competing Interests: TE, CN, CD, AB, CB and HJB are employees of Bioscientia, which is part of a publicly traded diagnostic company. There are no patents, products in development or marketed products to declare. This does not alter the authors' adherence to all the PLOS ONE policies on sharing data and materials, as detailed online in the guide for authors.

                Conceived and designed the experiments: HJB CB TE. Performed the experiments: GN PN TE CN CD A. Bieg. Analyzed the data: HJB TE CD A. Bieg CN GN PN CB. Contributed reagents/materials/analysis tools: AOK MNP CF MG PCI FGH SMB YH A. Galvez KP BW SRG MR EB ST DB A. BBohring JS SKJ CSA KB JYA TN PH JSD A. Gal BL CB HJB. Wrote the paper: HJB TE.

                Article
                PONE-D-13-29248
                10.1371/journal.pone.0078496
                3827063
                24265693
                e48f048b-50a9-4f82-a99d-edf000299ae8
                Copyright @ 2013

                This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 15 July 2013
                : 12 September 2013
                Page count
                Pages: 18
                Funding
                The authors received no specific funding for this study.
                Categories
                Research Article

                Uncategorized
                Uncategorized

                Comments

                Comment on this article