71
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Discovery of Candidate Disease Genes in ENU–Induced Mouse Mutants by Large-Scale Sequencing, Including a Splice-Site Mutation in Nucleoredoxin

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          An accurate and precisely annotated genome assembly is a fundamental requirement for functional genomic analysis. Here, the complete DNA sequence and gene annotation of mouse Chromosome 11 was used to test the efficacy of large-scale sequencing for mutation identification. We re-sequenced the 14,000 annotated exons and boundaries from over 900 genes in 41 recessive mutant mouse lines that were isolated in an N-ethyl- N-nitrosourea (ENU) mutation screen targeted to mouse Chromosome 11. Fifty-nine sequence variants were identified in 55 genes from 31 mutant lines. 39% of the lesions lie in coding sequences and create primarily missense mutations. The other 61% lie in noncoding regions, many of them in highly conserved sequences. A lesion in the perinatal lethal line l11Jus13 alters a consensus splice site of nucleoredoxin ( Nxn), inserting 10 amino acids into the resulting protein. We conclude that point mutations can be accurately and sensitively recovered by large-scale sequencing, and that conserved noncoding regions should be included for disease mutation identification. Only seven of the candidate genes we report have been previously targeted by mutation in mice or rats, showing that despite ongoing efforts to functionally annotate genes in the mammalian genome, an enormous gap remains between phenotype and function. Our data show that the classical positional mapping approach of disease mutation identification can be extended to large target regions using high-throughput sequencing.

          Author Summary

          Here we show that tiny DNA lesions can be found in huge amounts of DNA sequence data, similar to finding a needle in a haystack. These lesions identify many new candidates for disease genes associated with birth defects, infertility, and growth. Further, our data suggest that we know very little about what mammalian genes do. Sequencing methods are becoming cheaper and faster. Therefore, our strategy, shown here for the first time, will become commonplace.

          Related collections

          Most cited references43

          • Record: found
          • Abstract: found
          • Article: not found

          The transcriptional landscape of the mammalian genome.

          This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Genome-wide in situ exon capture for selective resequencing.

            Increasingly powerful sequencing technologies are ushering in an era of personal genome sequences and raising the possibility of using such information to guide medical decisions. Genome resequencing also promises to accelerate the identification of disease-associated mutations. Roughly 98% of the human genome is composed of repeats and intergenic or non-protein-coding sequences. Thus, it is crucial to focus resequencing on high-value genomic regions. Protein-coding exons represent one such type of high-value target. We have developed a method of using flexible, high-density microarrays to capture any desired fraction of the human genome, in this case corresponding to more than 200,000 protein-coding exons. Depending on the precise protocol, up to 55-85% of the captured fragments are associated with targeted regions and up to 98% of intended exons can be recovered. This methodology provides an adaptable route toward rapid and efficient resequencing of any sizeable, non-repeat portion of the human genome.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Highly conserved non-coding elements on either side of SOX9 associated with Pierre Robin sequence.

              Pierre Robin sequence (PRS) is an important subgroup of cleft palate. We report several lines of evidence for the existence of a 17q24 locus underlying PRS, including linkage analysis results, a clustering of translocation breakpoints 1.06-1.23 Mb upstream of SOX9, and microdeletions both approximately 1.5 Mb centromeric and approximately 1.5 Mb telomeric of SOX9. We have also identified a heterozygous point mutation in an evolutionarily conserved region of DNA with in vitro and in vivo features of a developmental enhancer. This enhancer is centromeric to the breakpoint cluster and maps within one of the microdeletion regions. The mutation abrogates the in vitro enhancer function and alters binding of the transcription factor MSX1 as compared to the wild-type sequence. In the developing mouse mandible, the 3-Mb region bounded by the microdeletions shows a regionally specific chromatin decompaction in cells expressing Sox9. Some cases of PRS may thus result from developmental misexpression of SOX9 due to disruption of very-long-range cis-regulatory elements.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS Genet
                plos
                plosgen
                PLoS Genetics
                Public Library of Science (San Francisco, USA )
                1553-7390
                1553-7404
                December 2009
                December 2009
                11 December 2009
                : 5
                : 12
                : e1000759
                Affiliations
                [1 ]Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas, United States of America
                [2 ]The Wellcome Trust Sanger Institute, Hinxton, Cambridgeshire, United Kingdom
                [3 ]Faculty of Life Sciences, University of Manchester, Manchester, United Kingdom
                [4 ]The University of Texas MD Anderson Cancer Center, Houston, Texas, United States of America
                [5 ]Department of Pediatrics, Baylor College of Medicine, Houston, Texas, United States of America
                [6 ]Texas Children's Hospital, Houston, Texas, United States of America
                [7 ]Laboratory of Intracellular Signaling, Institute for Protein Research, Osaka University, Osaka, Japan
                Harvard Medical School, United States of America
                Author notes

                Conceived and designed the experiments: JR AB DJA MJJ. Performed the experiments: MKB BMW LGW JH DG LPW AM KM MDR PMG LM AJC AP TJH JR. Analyzed the data: MKB BMW LGW BL LPW AM DJA MJJ. Contributed reagents/materials/analysis tools: BL KEH RJ KH YF HM AB DJA MJJ. Wrote the paper: MKB BMW FJP KEH JRL AB DJA MJJ.

                Article
                09-PLGE-RA-0851R3
                10.1371/journal.pgen.1000759
                2782131
                20011118
                14b14dbd-36e4-4aa6-8dd7-4ff71006549c
                Boles et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
                History
                : 22 May 2009
                : 9 November 2009
                Page count
                Pages: 13
                Categories
                Research Article
                Biochemistry/Molecular Evolution
                Cell Biology/Developmental Molecular Mechanisms
                Computational Biology/Comparative Sequence Analysis
                Computational Biology/Genomics
                Computational Biology/Molecular Genetics
                Developmental Biology/Developmental Molecular Mechanisms
                Developmental Biology/Molecular Development
                Evolutionary Biology/Developmental Molecular Mechanisms
                Evolutionary Biology/Evolutionary and Comparative Genetics
                Evolutionary Biology/Genomics
                Genetics and Genomics/Animal Genetics
                Genetics and Genomics/Comparative Genomics
                Genetics and Genomics/Functional Genomics
                Genetics and Genomics/Gene Discovery
                Molecular Biology/Molecular Evolution

                Genetics
                Genetics

                Comments

                Comment on this article