386
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Multi-species, multi-transcription factor binding highlights conserved control of tissue-specific biological pathways

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          As exome sequencing gives way to genome sequencing, the need to interpret the function of regulatory DNA becomes increasingly important. To test whether evolutionary conservation of cis-regulatory modules (CRMs) gives insight into human gene regulation, we determined transcription factor (TF) binding locations of four liver-essential TFs in liver tissue from human, macaque, mouse, rat, and dog. Approximately, two thirds of the TF-bound regions fell into CRMs. Less than half of the human CRMs were found as a CRM in the orthologous region of a second species. Shared CRMs were associated with liver pathways and disease loci identified by genome-wide association studies. Recurrent rare human disease causing mutations at the promoters of several blood coagulation and lipid metabolism genes were also identified within CRMs shared in multiple species. This suggests that multi-species analyses of experimentally determined combinatorial TF binding will help identify genomic regions critical for tissue-specific gene control.

          DOI: http://dx.doi.org/10.7554/eLife.02626.001

          eLife digest

          Stretches of DNA called cis-regulatory modules (or CRMs for short) could help researchers to identify the regions of DNA that are most important for controlling genes. CRMs are regions where multiple transcription factors—proteins that control when and how genes are expressed—bind to DNA. As important biological pathways are often regulated by more than one transcription factor, CRMs are therefore a good target when looking for DNA regions that, if mutated, are likely to cause disease.

          If a stretch of DNA performs an important role, it is often conserved throughout evolution. This is often observed for genes that make proteins. Indeed, DNA regions that specify critical amino acids that make up proteins are often conserved across distantly related species. However, unlike the changes made to the amino acid encoding parts of genes, it is currently a challenge to predict which changes in the rest of the genome will affect gene expression.

          One reason for this challenge is that transcription factor binding sites are rapidly evolving. This rapid evolution means that strictly comparing DNA sequences between species may fail to identify where transcription factors like to bind in the genome. Numerous experimental efforts have therefore been made to map these sites. These have revealed that there are a huge number of regions in the human genome that can bind transcription factors: hundreds of thousands of sites, far more than there are genes. For this reason, there is a great interest in revealing which of these regulatory regions are critical for maintaining normal levels and timings of gene expression.

          Ballester et al. compared the binding sites of four transcription factors responsible for regulating liver function in humans, macaques, mice, rats, and dogs. About two-thirds of these binding sites were found in CRMs. Less than half of the CRMs in humans were also CRMs in another species—but Ballester et al. found that these shared CRMs are predominantly in charge of regulating the essential biological pathways that allow the liver to function correctly. In addition, Ballester et al. identified several examples of disease-causing DNA mutations in shared CRMs that affected the expression of genes that make up pathways such as the blood clotting cascade. Genome-wide association studies also uncovered common variants for liver-related traits that were enriched for the CRMs found in more than one species, further supporting their importance.

          As transcription factors work in different ways in different tissues, further studies are now required to expand these observations to organs other than the liver. Future work is also needed to investigate the function of thousands of conserved CRMs whose role in liver gene regulation remains unknown.

          DOI: http://dx.doi.org/10.7554/eLife.02626.002

          Related collections

          Most cited references59

          • Record: found
          • Abstract: found
          • Article: not found

          JASPAR: an open-access database for eukaryotic transcription factor binding profiles.

          The analysis of regulatory regions in genome sequences is strongly based on the detection of potential transcription factor binding sites. The preferred models for representation of transcription factor binding specificity have been termed position-specific scoring matrices. JASPAR is an open-access database of annotated, high-quality, matrix-based transcription factor binding site profiles for multicellular eukaryotes. The profiles were derived exclusively from sets of nucleotide sequences experimentally demonstrated to bind transcription factors. The database is complemented by a web interface for browsing, searching and subset selection, an online sequence analysis utility and a suite of programming tools for genome-wide and comparative genomic analysis of regulatory regions. JASPAR is available at http://jaspar. cgb.ki.se.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            TimeTree: a public knowledge-base of divergence times among organisms.

            Biologists and other scientists routinely need to know times of divergence between species and to construct phylogenies calibrated to time (timetrees). Published studies reporting time estimates from molecular data have been increasing rapidly, but the data have been largely inaccessible to the greater community of scientists because of their complexity. TimeTree brings these data together in a consistent format and uses a hierarchical structure, corresponding to the tree of life, to maximize their utility. Results are presented and summarized, allowing users to quickly determine the range and robustness of time estimates and the degree of consensus from the published literature. TimeTree is available at http://www.timetree.net
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Evolutionary and biomedical insights from the rhesus macaque genome.

              The rhesus macaque (Macaca mulatta) is an abundant primate species that diverged from the ancestors of Homo sapiens about 25 million years ago. Because they are genetically and physiologically similar to humans, rhesus monkeys are the most widely used nonhuman primate in basic and applied biomedical research. We determined the genome sequence of an Indian-origin Macaca mulatta female and compared the data with chimpanzees and humans to reveal the structure of ancestral primate genomes and to identify evidence for positive selection and lineage-specific expansions and contractions of gene families. A comparison of sequences from individual animals was used to investigate their underlying genetic diversity. The complete description of the macaque genome blueprint enhances the utility of this animal model for biomedical research and improves our understanding of the basic biology of the species.
                Bookmark

                Author and article information

                Contributors
                Role: Reviewing editor
                Journal
                eLife
                eLife
                eLife
                eLife
                eLife Sciences Publications, Ltd
                2050-084X
                2050-084X
                03 October 2014
                2014
                : 3
                : e02626
                Affiliations
                [1]deptEuropean Molecular Biology Laboratory , European Bioinformatics Institute, Wellcome Trust Genome Campus , Cambridge, United Kingdom
                [2]deptAix-Marseille Université, UMR1090 TAGC , Marseille, France
                [3]deptINSERM, UMR1090 TAGC , Marseille, France
                [4]deptGenetics and Genome Biology Program , SickKids Research Institute , Toronto, Canada
                [5]deptCancer Research UK–Cambridge Institute University of Cambridge , Cambridge, United Kingdom
                [6]deptSchool of Electronic and Computing Systems , University of Cincinnati , Cincinnati, United States
                [7]deptSchool of Biotechnology and Biomolecular Sciences , University of New South Wales , Kensington, Australia
                [8]deptCenter for Autoimmune Genomics and Etiology , Cincinnati Children's Hospital Medical Center , Cincinnati, United States
                [9]deptDivisions of Biomedical Informatics and Developmental Biology , Cincinnati Children's Hospital Medical Center , Cincinnati, United States
                [10]deptWellcome Trust Sanger Institute , Wellcome Trust Genome Campus , Cambridge, United Kingdom
                [11]deptDepartment of Molecular Genetics , University of Toronto , Toronto, Canada
                University of Washington , United States
                University of Washington , United States
                Author notes
                [* ]For correspondence: benoit.ballester@ 123456inserm.fr (BB);
                [* ]For correspondence: michael.wilson@ 123456sickkids.ca (MDW)
                [†]

                These authors contributed equally to this work.

                [‡]

                Syncona Partners LLP, London, United Kingdom.

                [¶]

                Wellcome Trust Sanger Institute, Wellcome Trust Genome Campus, Cambridge, United Kingdom.

                [§]

                Division of Biology and Biological Engineering, California Institute of Technology, Pasadena, United States.

                Article
                02626
                10.7554/eLife.02626
                4359374
                25279814
                7e9eb9a6-2b73-418b-a591-454afa2cef46
                © 2014, Ballester et al

                This article is distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use and redistribution provided that the original author and source are credited.

                History
                : 23 February 2014
                : 02 September 2014
                Funding
                Funded by: FundRef http://dx.doi.org/10.13039/501100000781, European Research Council;
                Award ID: 202218
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100003043, EMBO;
                Award ID: Young Investigator Award
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100000165, SickKids Foundation;
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100000038, Natural Sciences and Engineering Research Council of Canada;
                Award ID: 436194-2013
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/100004440, Wellcome Trust;
                Award ID: WT098051
                Award Recipient :
                Funded by: INSERM;
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100001804, Canada Research Chairs;
                Award Recipient :
                Funded by: Marie Curie Reintegration Grant;
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100001711, Swiss National Science Foundation;
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100000289, Cancer Research UK;
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/501100003141, Consejo Nacional de Ciencia y Tecnología;
                Award Recipient :
                Funded by: European Molecular Biology Laboratory (EMBL);
                Award Recipient :
                Funded by: FundRef http://dx.doi.org/10.13039/100004440, Wellcome Trust;
                Award ID: WT095908
                Award Recipient :
                Funded by: European Molecular Biology Laboratory;
                Award ID: International PhD Progam
                Award Recipient :
                Funded by: Heart and Stroke Foundation of Ontario;
                Award ID: Bridge grant:7486
                Award Recipient :
                The funders had no role in study design, data collection and interpretation, or the decision to submit the work for publication.
                Categories
                Research Article
                Genomics and Evolutionary Biology
                Human Biology and Medicine
                Custom metadata
                2
                Combinatorial transcription factor binding shared by multiple species enriches for essential biological pathways and coincides with disease-causing regulatory DNA mutations.

                Life sciences
                cis regulatory module,transcription factors,molecular evolution,macaque,dog,liver,human,mouse,rat,other
                Life sciences
                cis regulatory module, transcription factors, molecular evolution, macaque, dog, liver, human, mouse, rat, other

                Comments

                Comment on this article