4
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Global Analysis of Transcription Start Sites in the New Ovine Reference Genome ( Oar rambouillet v1.0)

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The overall aim of the Ovine FAANG project is to provide a comprehensive annotation of the new highly contiguous sheep reference genome sequence ( Oar rambouillet v1.0). Mapping of transcription start sites (TSS) is a key first step in understanding transcript regulation and diversity. Using 56 tissue samples collected from the reference ewe Benz2616, we have performed a global analysis of TSS and TSS-Enhancer clusters using Cap Analysis Gene Expression (CAGE) sequencing. CAGE measures RNA expression by 5′ cap-trapping and has been specifically designed to allow the characterization of TSS within promoters to single-nucleotide resolution. We have adapted an analysis pipeline that uses TagDust2 for clean-up and trimming, Bowtie2 for mapping, CAGEfightR for clustering, and the Integrative Genomics Viewer (IGV) for visualization. Mapping of CAGE tags indicated that the expression levels of CAGE tag clusters varied across tissues. Expression profiles across tissues were validated using corresponding polyA+ mRNA-Seq data from the same samples. After removal of CAGE tags with <10 read counts, 39.3% of TSS overlapped with 5′ ends of 31,113 transcripts that had been previously annotated by NCBI (out of a total of 56,308 from the NCBI annotation). For 25,195 of the transcripts, previously annotated by NCBI, no TSS meeting stringent criteria were identified. A further 14.7% of TSS mapped to within 50 bp of annotated promoter regions. Intersecting these predicted TSS regions with annotated promoter regions (±50 bp) revealed 46% of the predicted TSS were “novel” and previously un-annotated. Using whole-genome bisulfite sequencing data from the same tissues, we were able to determine that a proportion of these “novel” TSS were hypo-methylated (32.2%) indicating that they are likely to be reproducible rather than “noise”. This global analysis of TSS in sheep will significantly enhance the annotation of gene models in the new ovine reference assembly. Our analyses provide one of the highest resolution annotations of transcript regulation and diversity in a livestock species to date.

          Related collections

          Most cited references56

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The Sequence Alignment/Map format and SAMtools

          Summary: The Sequence Alignment/Map (SAM) format is a generic alignment format for storing read alignments against reference sequences, supporting short and long reads (up to 128 Mbp) produced by different sequencing platforms. It is flexible in style, compact in size, efficient in random access and is the format in which alignments from the 1000 Genomes Project are released. SAMtools implements various utilities for post-processing alignments in the SAM format, such as indexing, variant caller and alignment viewer, and thus provides universal tools for processing read alignments. Availability: http://samtools.sourceforge.net Contact: rd@sanger.ac.uk
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Fast gapped-read alignment with Bowtie 2.

            As the rate of sequencing increases, greater throughput is demanded from read aligners. The full-text minute index is often used to make alignment very fast and memory-efficient, but the approach is ill-suited to finding longer, gapped alignments. Bowtie 2 combines the strengths of the full-text minute index with the flexibility and speed of hardware-accelerated dynamic programming algorithms to achieve a combination of high speed, sensitivity and accuracy.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              BEDTools: a flexible suite of utilities for comparing genomic features

              Motivation: Testing for correlations between different sets of genomic features is a fundamental task in genomics research. However, searching for overlaps between features with existing web-based methods is complicated by the massive datasets that are routinely produced with current sequencing technologies. Fast and flexible tools are therefore required to ask complex questions of these data in an efficient manner. Results: This article introduces a new software suite for the comparison, manipulation and annotation of genomic features in Browser Extensible Data (BED) and General Feature Format (GFF) format. BEDTools also supports the comparison of sequence alignments in BAM format to both BED and GFF features. The tools are extremely efficient and allow the user to compare large datasets (e.g. next-generation sequencing data) with both public and custom genome annotation tracks. BEDTools can be combined with one another as well as with standard UNIX commands, thus facilitating routine genomics tasks as well as pipelines that can quickly answer intricate questions of large genomic datasets. Availability and implementation: BEDTools was written in C++. Source code and a comprehensive user manual are freely available at http://code.google.com/p/bedtools Contact: aaronquinlan@gmail.com; imh4y@virginia.edu Supplementary information: Supplementary data are available at Bioinformatics online.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Genet
                Front Genet
                Front. Genet.
                Frontiers in Genetics
                Frontiers Media S.A.
                1664-8021
                23 October 2020
                2020
                : 11
                : 580580
                Affiliations
                [1] 1The Roslin Institute, Royal (Dick) School of Veterinary Studies, The University of Edinburgh , Edinburgh, United Kingdom
                [2] 2Centre for Tropical Livestock Genetics and Health (CTLGH), Roslin Institute, University of Edinburgh , Midlothian, United Kingdom
                [3] 3AgResearch, Invermay Agricultural Centre , Mosgiel, New Zealand
                [4] 4Genetics Otago, Department of Biochemistry, University of Otago , Dunedin, New Zealand
                [5] 5Genetics Core, Edinburgh Clinical Research Facility, The University of Edinburgh , Edinburgh, United Kingdom
                [6] 6MRC Human Genetics Unit, The University of Edinburgh , Edinburgh, United Kingdom
                [7] 7USDA, Agricultural Research Service, U.S. Meat Animal Research Center , Clay Center, NE, United States
                [8] 8Baylor College of Medicine , Houston, TX, United States
                [9] 9Department of Animal, Dairy and Veterinary Sciences, Utah State University , Logan, UT, United States
                [10] 10Department of Animal, Veterinary and Food Sciences, University of Idaho , Moscow, ID, United States
                Author notes

                Edited by: Huaijun Zhou, University of California, Davis, United States

                Reviewed by: Oleg Gusev, RIKEN, Japan; Zhihua Jiang, Washington State University, United States

                *Correspondence: Emily L. Clark, emily.clark@ 123456roslin.ed.ac.uk

                These authors have contributed equally to this work

                This article was submitted to Livestock Genomics, a section of the journal Frontiers in Genetics

                Article
                10.3389/fgene.2020.580580
                7645153
                74d7d6f8-0298-4360-bb4c-8fff5bca83fc
                Copyright © 2020 Salavati, Caulton, Clark, Gazova, Smith, Worley, Cockett, Archibald, Clarke, Murdoch and Clark.

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 06 July 2020
                : 09 September 2020
                Page count
                Figures: 10, Tables: 2, Equations: 2, References: 61, Pages: 21, Words: 0
                Funding
                Funded by: U.S. Department of Agriculture 10.13039/100000199
                Award ID: USDA-NIFA-2017-67016-26301
                Award ID: USDA-NIFA-2013-67015-21228
                Funded by: Utah State University 10.13039/100006630
                Award ID: UTA-1172
                Funded by: Biotechnology and Biological Sciences Research Council 10.13039/501100000268
                Award ID: BBS/E/D/2021550
                Award ID: BBS/E/D/10002070
                Funded by: University of Otago 10.13039/100008247
                Award ID: NZ MBIE C10X1906
                Funded by: Agricultural Research Service 10.13039/100007917
                Award ID: 3040-31000-100-00D
                Funded by: National Institute of Food and Agriculture 10.13039/100005825
                Award ID: USDA-NIFA-2017-67016-26301
                Award ID: USDA-NIFA-2013-67015-21228
                Categories
                Genetics
                Original Research

                Genetics
                ovine,tss,cage,wgbs,promoter,enhancer,transcriptome,faang
                Genetics
                ovine, tss, cage, wgbs, promoter, enhancer, transcriptome, faang

                Comments

                Comment on this article