55
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      HTSstation: A Web Application and Open-Access Libraries for High-Throughput Sequencing Data Analysis

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The HTSstation analysis portal is a suite of simple web forms coupled to modular analysis pipelines for various applications of High-Throughput Sequencing including ChIP-seq, RNA-seq, 4C-seq and re-sequencing. HTSstation offers biologists the possibility to rapidly investigate their HTS data using an intuitive web application with heuristically pre-defined parameters. A number of open-source software components have been implemented and can be used to build, configure and run HTS analysis pipelines reactively. Besides, our programming framework empowers developers with the possibility to design their own workflows and integrate additional third-party software. The HTSstation web application is accessible at http://htsstation.epfl.ch.

          Related collections

          Most cited references14

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The UCSC genome browser and associated tools

          The UCSC Genome Browser (http://genome.ucsc.edu) is a graphical viewer for genomic data now in its 13th year. Since the early days of the Human Genome Project, it has presented an integrated view of genomic data of many kinds. Now home to assemblies for 58 organisms, the Browser presents visualization of annotations mapped to genomic coordinates. The ability to juxtapose annotations of many types facilitates inquiry-driven data mining. Gene predictions, mRNA alignments, epigenomic data from the ENCODE project, conservation scores from vertebrate whole-genome alignments and variation data may be viewed at any scale from a single base to an entire chromosome. The Browser also includes many other widely used tools, including BLAT, which is useful for alignments from high-throughput sequencing experiments. Private data uploaded as Custom Tracks and Data Hubs in many formats may be displayed alongside the rich compendium of precomputed data in the UCSC database. The Table Browser is a full-featured graphical interface, which allows querying, filtering and intersection of data tables. The Saved Session feature allows users to store and share customized views, enhancing the utility of the system for organizing multiple trains of thought. Binary Alignment/Map (BAM), Variant Call Format and the Personal Genome Single Nucleotide Polymorphisms (SNPs) data formats are useful for visualizing a large sequencing experiment (whole-genome or whole-exome), where the differences between the data set and the reference assembly may be displayed graphically. Support for high-throughput sequencing extends to compact, indexed data formats, such as BAM, bigBed and bigWig, allowing rapid visualization of large datasets from RNA-seq and ChIP-seq experiments via local hosting.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            Genome-Wide and Phase-Specific DNA-Binding Rhythms of BMAL1 Control Circadian Output Functions in Mouse Liver

            Introduction Circadian clocks provide higher organisms with cell-autonomous and organ-based metronomes that control temporally gated and tissue-specific gene expression or metabolic programs [1]–[4]. In the liver, such programs have been implicated in detoxification [5], glucose homeostasis [6],[7], cholesterol biosynthesis [8],[9], and gating of the cell cycle [10],[11]. The mammalian clock depends on a cell-autonomous [11] core oscillator that is built around interlocked transcriptional feedback loops. These use a variety of transcriptional regulators: the basic helix-loop-helix (bHLH) PAS domain proteins CLOCK, NPAS2, and BMAL1 [12],[13], orphan nuclear receptors of the REV-ERB [14] and ROR families [15], and the DEC bHLH repressors [16]. In addition, important co-regulators such as PER and CRY proteins mediate negative feedback by repressing their own transcriptional activators, BMAL1/CLOCK [17]–[20]. Among all these regulators, the Bmal1 gene is the only single gene in the circadian network whose knockout results in arrhythmicity [21],[22]. BMAL1 functions as a heterodimeric complex, BMAL1/CLOCK, that activates transcription of its targets via E-boxes [12],[23],[24]. The DNA-binding activity of BMAL1/CLOCK is thought to cycle because of circadian changes in post-translational modifications [25],[26]. The core oscillator exerts its function by controlling temporally gated outputs, notably metabolic functions [5],[7],[27]. Transcriptional regulation of circadian output is known to occur both directly via the core clock transcription factors and indirectly, as, for example, via the PAR-bZip regulators DBP/TEF/HLF, which are themselves controlled by BMAL1/CLOCK [28]. Thus, circadian output function is controlled via a hierarchical network of transcription regulators that drives vast programs of tissue-specific gene expression both in the suprachiasmatic nucleus [29] and in peripheral tissues [29]–[34] in the mouse. Notably, these transcript rhythms cover the full range of expression phases, which thus begs the question about the mechanism behind phase-specific circadian gene expression. It has been proposed that virtually any peak expression phase can be achieved by suitably tuned regulatory sequences that integrate a small number of phase-specific core regulators [35]. Here we investigate the degree to which BMAL1 recruitment to the genomic DNA is itself rhythmic and to what extent peak binding carries phase information for downstream circadian mRNA expression. To address these questions and further dissect the hierarchical structure of circadian clock networks, we perform a time series chromatin immunoprecipitation (ChIP) analysis for the master clock regulator BMAL1 in mouse liver. This allows us to identify a comprehensive set of direct BMAL1 targets in a circadianly controlled tissue, to model the DNA-binding specificity of BMAL1 in vivo, and to determine how tightly the phase of mRNA output follows rhythmic protein-DNA interactions. Our results reveal the pervasiveness of circadian protein-DNA interactions in a mammalian tissue by showing widespread rhythmic and phase-specific binding of BMAL1 to coding and non-coding genes. This enables us to characterize the cooperative interactions of BMAL1/CLOCK complexes at tandem E-box elements (E1-E2), and to emphasize the complexity of circadian phase control that involves transcriptional and post-transcriptional mechanisms. Results BMAL1 Binds Rhythmically to Thousands of Genomic Regions in Mouse Liver To obtain a time-resolved and genome-wide map of BMAL1 target sites, we performed ChIP in mouse liver at 4-h time intervals during one light-dark cycle. Following initial testing of ChIP efficiency by quantitative PCR (qPCR) (Figure S1), two independent BMAL1 ChIP time courses were subjected to ultra-high-throughput sequencing to yield about 20 million tags per time point (Table S1) and were analyzed via a bioinformatics pipeline that combines existing and novel methods. Briefly, we used the MACS software [36] to detect regions with enriched BMAL1 binding compared to an input chromatin sample (see Materials and Methods). To efficiently reject spurious signals and accurately estimate the location of binding sites, we developed a model-based deconvolution method for ChIP combined with deep sequencing (ChIP-Seq) data (see Text S1). We identified 2,049 bona fide BMAL1 binding sites in mouse liver. Among the top 200 sites, more than 90% are significantly rhythmic (Fisher test, p 0.2) are colored. (B) BMAL1 targets in the insulin signaling pathway. (C) BMAL1 targets in the Pparα signaling pathway. These graphs were generated using KEGG Mapper (http://www.genome.jp/kegg/tool/color_pathway.html). (1.31 MB PDF) Click here for additional data file. Figure S6 Weight matrix and structure of the HMM used for sequence analysis. (A) Logo of the E-box PSWM used for autocorrelation analysis. At each position of the PSWM, the most probable letter has p = 0.96875, while the others have p = 0.03125. (B) Structure of the HMM. E1 and E2 model, respectively, the collection of hidden states of the first and second E-box. M states allow for filtering of spurious signal, namely GTGT repeats. B1 and SP represent, respectively, background and spacer states. For simplicity, the reverse complement of the HMM is not shown here. (0.35 MB PDF) Click here for additional data file. Figure S7 Pre-mRNA and mRNA measurements of longer lived transcripts. (A) mRNA transcript stability may explain lag and relative amplitude between pre-mRNA and mRNA accumulation in the Gys2, March8, and Qdpr transcripts. Experiments were performed as described in Figure 7A–7C. Approximate half-lives for March8 and Qdpr are 5.4 h and >10 h, while that for Gys2 is not available (see [B]). (B) mRNA half-lives from mouse embryonic stem cells [61] and mouse fibroblasts [62] for the genes in Figure 7A–7C and (A). When several measurements from the same cell line were available, we took the mean. (0.51 MB PDF) Click here for additional data file. Figure S8 The anti-BMAL1 antibody recognizes specifically BMAL1. Ponceau staining (A) and Western blot (B and C) of nuclear extracts (15 ug) from wild-type and Bmal1 −/− mouse liver at ZT6. The nuclear extracts were electrophoresed on a 12% SDS-PAGE gel, transferred onto a nitrocellulose membrane, and detected using anti-POLII Cter (ab817-100, Abcam) (B) and anti-BMAL1 antibodies (C). The sequence of the peptide used for the immunization is located at the C-terminal of the mouse BMAL1 protein: LEADAGLGGPVDFSDLPWPL. (3.48 MB PDF) Click here for additional data file. Table S1 Sequencing data: number of sequenced and non-redundant tags at each time point. (0.05 MB PDF) Click here for additional data file. Table S2 Functional annotation clustering of putative BMAL1 targets using DAVID tools. These annotations link the sites to the closest gene irrespective of the distance. In total, 1,551 out of 2,049 sites have a functional annotation. For details regarding the positions and binding strength of these sites, see Text S2. For the small clusters, we list the gene symbols in the most significant subcategory. (0.12 MB PDF) Click here for additional data file. Table S3 Putative BMAL1 targets with transcription factor activity (DAVID, GO:003700). Additional columns include the rank of BMAL1 binding strength, the p-value for cyclic mRNA expression (data as in Figure 6; significant values, p<0.05, are in bold), and phase of mRNA expression. For the nuclear receptors, we also indicate results for mRNA expression patterns by real-time PCR in mouse liver [27]. According to those analyses, all 18 bound receptors are expressed and 9/18 show circadian accumulation. (0.12 MB PDF) Click here for additional data file. Table S4 Enriched KEGG pathways identified with DAVID (p<0.05). The BMAL1 putative targets in the three most significant pathways are shown in Figure S5. (0.23 MB PDF) Click here for additional data file. Table S5 PSWM for the E1-E2 motif. E1 goes from position 1 to 13, position 14 corresponds to the spacer, and E2 goes from position 15 to 27. (0.05 MB PDF) Click here for additional data file. Table S6 TaqMan probes for ChIP-PCR measurements. (0.04 MB PDF) Click here for additional data file. Table S7 TaqMan probes for mRNA measurements. (0.05 MB PDF) Click here for additional data file. Table S8 Annealing primers for EMSA. (0.04 MB PDF) Click here for additional data file. Table S9 Annealing primers for transactivation assays. (0.05 MB PDF) Click here for additional data file. Text S1 Supplementary methods. (0.09 MB PDF) Click here for additional data file. Text S2 List of all BMAL1 sites with annotations and binding strength at each time point. (0.20 MB TXT) Click here for additional data file. Text S3 List of BMAL1 sites near RCGs. (0.01 MB TXT) Click here for additional data file.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              A switch between topological domains underlies HoxD genes collinearity in mouse limbs.

              Hox genes are major determinants of the animal body plan, where they organize structures along both the trunk and appendicular axes. During mouse limb development, Hoxd genes are transcribed in two waves: early on, when the arm and forearm are specified, and later, when digits form. The transition between early and late regulations involves a functional switch between two opposite topological domains. This switch is reflected by a subset of Hoxd genes mapping centrally into the cluster, which initially interact with the telomeric domain and subsequently swing toward the centromeric domain, where they establish new contacts. This transition between independent regulatory landscapes illustrates both the modularity of the limbs and the distinct evolutionary histories of its various pieces. It also allows the formation of an intermediate area of low HOX proteins content, which develops into the wrist, the transition between our arms and our hands. This regulatory strategy accounts for collinear Hox gene regulation in land vertebrate appendages.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS One
                PLoS ONE
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, USA )
                1932-6203
                2014
                27 January 2014
                : 9
                : 1
                : e85879
                Affiliations
                [1 ]School of Life Sciences, Ecole Polytechnique Fédérale de Lausanne (EPFL), Lausanne, Switzerland
                [2 ]Swiss Institute of Bioinformatics (SIB), Lausanne, Switzerland
                [3 ]Swiss Institute for Experimental Cancer Research (ISREC), Lausanne, Switzerland
                Université Paris-Sud, France
                Author notes

                Competing Interests: The authors have declared that no competing interests exist.

                Conceived and designed the experiments: ML JR FD. Analyzed the data: FD JD SC JR ML. Contributed reagents/materials/analysis tools: FD JD SC FR GL JR ML DN LS YJ. Wrote the paper: FD JD SC JR ML.

                [¤a]

                Current address: Laboratoire de Génétique Moléculaire, Institut de Biologie, CHU de Nantes Hôtel-Dieu, Nantes, France

                [¤b]

                Current address: Nestlé Institute of Health Sciences SA, Functional Genomics, Lausanne, Switzerland

                [¤c]

                Current address: Limnology Department, Evolutionary Biology Centre, Uppsala University, Sweden

                Article
                PONE-D-13-36266
                10.1371/journal.pone.0085879
                3903476
                24475057
                c4c3b5ac-5f69-406d-bb6b-60483f1d2d76
                Copyright @ 2014

                This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                : 3 September 2013
                : 3 December 2013
                Page count
                Pages: 9
                Funding
                This work has been supported by SystemsX.ch, the Swiss Initiative in Systems Biology, through the SyBIT (JD, FJR, LS, YJ), CycliX (JR), LipidX (FPAD) and DynamiX (JR) projects, by the ERC grant SystemsHox.ch to Denis Duboule (DN, ML), by a Swiss National Science Foundation Sinergia Grant CRSI33_130662 to Daniel Constam (SC), and by the EPFL. The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Biology
                Computational Biology
                Genomics
                Genome Analysis Tools
                Genetic Networks
                Transcriptomes
                Functional Genomics
                Genome Expression Analysis
                Genome Sequencing
                Sequence Analysis
                Genomics
                Functional Genomics
                Genome Analysis Tools
                Genome Databases
                Genome Expression Analysis
                Genome Sequencing
                Computer Science
                Computer Applications
                Web-Based Applications

                Uncategorized
                Uncategorized

                Comments

                Comment on this article