82
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      JASPAR 2016: a major expansion and update of the open-access database of transcription factor binding profiles

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          JASPAR ( http://jaspar.genereg.net) is an open-access database storing curated, non-redundant transcription factor (TF) binding profiles representing transcription factor binding preferences as position frequency matrices for multiple species in six taxonomic groups. For this 2016 release, we expanded the JASPAR CORE collection with 494 new TF binding profiles (315 in vertebrates, 11 in nematodes, 3 in insects, 1 in fungi and 164 in plants) and updated 59 profiles (58 in vertebrates and 1 in fungi). The introduced profiles represent an 83% expansion and 10% update when compared to the previous release. We updated the structural annotation of the TF DNA binding domains (DBDs) following a published hierarchical structural classification. In addition, we introduced 130 transcription factor flexible models trained on ChIP-seq data for vertebrates, which capture dinucleotide dependencies within TF binding sites. This new JASPAR release is accompanied by a new web tool to infer JASPAR TF binding profiles recognized by a given TF protein sequence. Moreover, we provide the users with a Ruby module complementing the JASPAR API to ease programmatic access and use of the JASPAR collection of profiles. Finally, we provide the JASPAR2016 R/Bioconductor data package with the data of this release.

          Related collections

          Most cited references18

          • Record: found
          • Abstract: found
          • Article: not found

          JASPAR: an open-access database for eukaryotic transcription factor binding profiles.

          The analysis of regulatory regions in genome sequences is strongly based on the detection of potential transcription factor binding sites. The preferred models for representation of transcription factor binding specificity have been termed position-specific scoring matrices. JASPAR is an open-access database of annotated, high-quality, matrix-based transcription factor binding site profiles for multicellular eukaryotes. The profiles were derived exclusively from sets of nucleotide sequences experimentally demonstrated to bind transcription factors. The database is complemented by a web interface for browsing, searching and subset selection, an online sequence analysis utility and a suite of programming tools for genome-wide and comparative genomic analysis of regulatory regions. JASPAR is available at http://jaspar. cgb.ki.se.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            JASPAR, the open access database of transcription factor-binding profiles: new content and tools in the 2008 update

            JASPAR is a popular open-access database for matrix models describing DNA-binding preferences for transcription factors and other DNA patterns. With its third major release, JASPAR has been expanded and equipped with additional functions aimed at both casual and power users. The heart of the JASPAR database—the JASPAR CORE sub-database—has increased by 12% in size, and three new specialized sub-databases have been added. New functions include clustering of matrix models by similarity, generation of random matrices by sampling from selected sets of existing models and a language-independent Web Service applications programming interface for matrix retrieval. JASPAR is available at http://jaspar.genereg.net.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Amino acid-base interactions: a three-dimensional analysis of protein-DNA interactions at an atomic level.

              To assess whether there are universal rules that govern amino acid-base recognition, we investigate hydrogen bonds, van der Waals contacts and water-mediated bonds in 129 protein-DNA complex structures. DNA-backbone interactions are the most numerous, providing stability rather than specificity. For base interactions, there are significant base-amino acid type correlations, which can be rationalised by considering the stereochemistry of protein side chains and the base edges exposed in the DNA structure. Nearly two-thirds of the direct read-out of DNA sequences involves complex networks of hydrogen bonds, which enhance specificity. Two-thirds of all protein-DNA interactions comprise van der Waals contacts, compared to about one-sixth each of hydrogen and water-mediated bonds. This highlights the central importance of these contacts for complex formation, which have previously been relegated to a secondary role. Although common, water-mediated bonds are usually non-specific, acting as space-fillers at the protein-DNA interface. In conclusion, the majority of amino acid-base interactions observed follow general principles that apply across all protein-DNA complexes, although there are individual exceptions. Therefore, we distinguish between interactions whose specificities are 'universal' and 'context-dependent'. An interactive Web-based atlas of side chain-base contacts provides access to the collected data, including analyses and visualisation of the three-dimensional geometry of the interactions.
                Bookmark

                Author and article information

                Journal
                Nucleic Acids Res
                Nucleic Acids Res
                nar
                nar
                Nucleic Acids Research
                Oxford University Press
                0305-1048
                1362-4962
                04 January 2016
                03 November 2015
                03 November 2015
                : 44
                : Database issue , Database issue
                : D110-D115
                Affiliations
                [1 ]Centre for Molecular Medicine and Therapeutics at the Child and Family Research Institute, Department of Medical Genetics, University of British Columbia, Vancouver, V5Z 4H4, BC, Canada
                [2 ]Laboratoire Physiologie Cellulaire & Végétale, Université Grenoble Alpes, CNRS, CEA, iRTSV, INRA, 38054 Grenoble, France
                [3 ]Computational Regulatory Genomics, MRC Clinical Sciences Centre, Imperial College London, Du Cane Road, London W12 0NN, UK
                [4 ]The Bioinformatics Centre, Department of Biology and Biotech Research and Innovation Centre, Copenhagen University, Ole Maaloes Vej 5, DK-2200, Denmark
                Author notes
                [* ]To whom correspondence should be addressed. Tel: +1 604 875 3812; Fax: +1 604 875 3819; Email:  wyeth@ 123456cmmt.ubc.ca
                Correspondence may also be addressed to Boris Lenhard. Tel: +44 208 383 8353; Fax: +44 208 383 8577; Email: b.lenhard@ 123456csc.mrc.ac.uk
                Correspondence may also be addressed to Albin Sandelin. Tel: +45 353 21285; Fax:+45 3532 5669; Email: albin@ 123456binf.ku.dk
                Article
                10.1093/nar/gkv1176
                4702842
                26531826
                0be3b023-c78f-4aef-9073-8601eca53611
                © The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@ 123456oup.com

                History
                : 22 October 2015
                : 19 October 2015
                : 12 September 2015
                Page count
                Pages: 6
                Categories
                Database Issue
                Custom metadata
                04 January 2016

                Genetics
                Genetics

                Comments

                Comment on this article