316
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Complex Loci in Human and Mouse Genomes

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Mammalian genomes harbor a larger than expected number of complex loci, in which multiple genes are coupled by shared transcribed regions in antisense orientation and/or by bidirectional core promoters. To determine the incidence, functional significance, and evolutionary context of mammalian complex loci, we identified and characterized 5,248 cis–antisense pairs, 1,638 bidirectional promoters, and 1,153 chains of multiple cis–antisense and/or bidirectionally promoted pairs from 36,606 mouse transcriptional units (TUs), along with 6,141 cis–antisense pairs, 2,113 bidirectional promoters, and 1,480 chains from 42,887 human TUs. In both human and mouse, 25% of TUs resided in cis–antisense pairs, only 17% of which were conserved between the two organisms, indicating frequent species specificity of antisense gene arrangements. A sampling approach indicated that over 40% of all TUs might actually be in cis–antisense pairs, and that only a minority of these arrangements are likely to be conserved between human and mouse. Bidirectional promoters were characterized by variable transcriptional start sites and an identifiable midpoint at which overall sequence composition changed strand and the direction of transcriptional initiation switched. In microarray data covering a wide range of mouse tissues, genes in cis–antisense and bidirectionally promoted arrangement showed a higher probability of being coordinately expressed than random pairs of genes. In a case study on homeotic loci, we observed extensive transcription of nonconserved sequences on the noncoding strand, implying that the presence rather than the sequence of these transcripts is of functional importance. Complex loci are ubiquitous, host numerous nonconserved gene structures and lineage-specific exonification events, and may have a cis-regulatory impact on the member genes.

          Synopsis

          In the traditional view, most genes occupy their own distinct territory in mammalian genomes. However, it has become apparent that many genes are in fact located in complex regions (complex loci) where they share territory with other genes by utilizing opposite strands of DNA. Such genes either share regions expressed as mRNA (i.e., form cis–antisense pairs) or start from a genome region (called a bidirectional promoter) at which transcription can initiate in both directions along the DNA. In this paper, researchers present the one of the most comprehensive censuses of complex loci to date and investigate their general properties and human–mouse differences to discover the rules of this type of gene organization and its effect on gene regulation. They found about 25% of known human and mouse genes to be in cis–antisense pairs, and estimate the total fraction to be over 40%. At bidirectional promoters, they demonstrated the existence of mirror DNA sequence composition related to the promoters' ability to initiate transcription in two directions. The researchers found over 2,000 “chains” complex arrangements where three or more genes are coupled by cis–antisense pairing and/or bidirectional promoters; among them are many genes whose products control the expression of other genes.

          Related collections

          Most cited references39

          • Record: found
          • Abstract: found
          • Article: not found

          Single-step method of RNA isolation by acid guanidinium thiocyanate-phenol-chloroform extraction.

          A new method of total RNA isolation by a single extraction with an acid guanidinium thiocyanate-phenol-chloroform mixture is described. The method provides a pure preparation of undegraded RNA in high yield and can be completed within 4 h. It is particularly useful for processing large numbers of samples and for isolation of RNA from minute quantities of cells or tissue samples.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            The transcriptional landscape of the mammalian genome.

            This study describes comprehensive polling of transcription start and termination sites and analysis of previously unidentified full-length complementary DNAs derived from the mouse genome. We identify the 5' and 3' boundaries of 181,047 transcripts with extensive variation in transcripts arising from alternative promoter usage, splicing, and polyadenylation. There are 16,247 new mouse protein-coding transcripts, including 5154 encoding previously unidentified proteins. Genomic mapping of the transcriptome reveals transcriptional forests, with overlapping transcription on both strands, separated by deserts in which few transcripts are observed. The data provide a comprehensive platform for the comparative analysis of mammalian transcriptional regulation in differentiation and development.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              JASPAR: an open-access database for eukaryotic transcription factor binding profiles.

              The analysis of regulatory regions in genome sequences is strongly based on the detection of potential transcription factor binding sites. The preferred models for representation of transcription factor binding specificity have been termed position-specific scoring matrices. JASPAR is an open-access database of annotated, high-quality, matrix-based transcription factor binding site profiles for multicellular eukaryotes. The profiles were derived exclusively from sets of nucleotide sequences experimentally demonstrated to bind transcription factors. The database is complemented by a web interface for browsing, searching and subset selection, an online sequence analysis utility and a suite of programming tools for genome-wide and comparative genomic analysis of regulatory regions. JASPAR is available at http://jaspar. cgb.ki.se.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Role: Editor
                Role: Editor
                Role: Editor
                Role: Editor
                Journal
                PLoS Genet
                pgen
                PLoS Genetics
                Public Library of Science (San Francisco, USA )
                1553-7390
                1553-7404
                April 2006
                28 April 2006
                : 2
                : 4
                : e47
                Affiliations
                [1 ] Computational Biology Unit, Bergen Center for Computational Science, University of Bergen, Bergen, Norway
                [2 ] Programme for Genomics and Bioinformatics, Department of Cell and Molecular Biology, Karolinska Institutet, Stockholm, Sweden
                [3 ] Genome Exploration Research Group (Genome Network Project Core Group), RIKEN Genomic Sciences Center, RIKEN Yokohama Institute, Yokohama, Japan
                [4 ] Dulbecco Telethon Institute, Institute of Genetics and Biophysics CNR, Naples, Italy
                [5 ] Department of Biological and Technological Research, San Raffaele Scientific Institute, Milan, Italy
                [6 ] Fondazione Istituto FIRC di Oncologia Molecolare, Milan, Italy
                [7 ] Department of Experimental Oncology, Istituto Europeo di Oncologia, Milan, Italy
                [8 ] Knowledge Extraction Laboratory, Institute for Infocomm Research, Singapore
                [9 ] South African National Bioinformatics Institute, University of the Western Cape, Bellville, South Africa
                [10 ] Department of Biological Sciences, National University of Singapore, Singapore
                [11 ] Chemical and Life Sciences, Nanyang Polytechnic, Singapore
                [12 ] Genomics Institute of the Novartis Research Foundation, San Diego, California, United States of America
                [13 ] Scripps Florida, Jupiter, Florida, United States of America
                [14 ] Genome Science Laboratory, Discovery Research Institute, RIKEN Wako Institute, Wako, Japan
                [15 ] School of Biomolecular and Biomedical Science, Eskitis Institute for Cell and Molecular Therapies, Griffith University, Brisbane, Queensland, Australia
                [16 ] Department of Experimental Oncology, Istituto Nazionale per lo Studio e la Cura dei Tumori, Milan, Italy
                [17 ] Genome Institute of Singapore, Singapore
                The Jackson Laboratory, US
                MRC-Harwell, UK
                NHGRI-NIH, US
                Lawrence Livermore National Laboratory, US
                The Jackson Laboratory, US
                Author notes
                * To whom correspondence should be addressed. E-mail: Boris.Lenhard@ 123456bccs.uib.no (BL); lipovich@ 123456gis.a-star.edu.sg (LL)
                Article
                05-PLGE-RA-0215R3 plge-02-04-13
                10.1371/journal.pgen.0020047
                1449890
                16683030
                ea683227-1e52-4962-83e5-fee48e6796ab
                Copyright: © 2006 Engström et al. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
                History
                : 15 August 2005
                : 13 February 2006
                Page count
                Pages: 14
                Categories
                Research Article
                Bioinformatics - Computational Biology
                Biotechnology
                Cancer Biology
                Development
                Evolution
                Genetics/Genomics
                Genetics/Gene Expression
                Mammals
                Homo (Human)
                Mus (Mouse)
                Custom metadata
                Engström PG, Suzuki H, Ninomiya N, Akalin A, Sessa L, et al. (2006) Complex loci in human and mouse genomes. PLoS Genet 2(4): e47. DOI: 10.1371/journal.pgen.0020047
                Fantom_Logo.jpg

                Genetics
                Genetics

                Comments

                Comment on this article