33
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      An Evolutionary View on Disulfide Bond Connectivities Prediction Using Phylogenetic Trees and a Simple Cysteine Mutation Model

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Disulfide bonds are crucial for many structural and functional aspects of proteins. They have a stabilizing role during folding, can regulate enzymatic activity and can trigger allosteric changes in the protein structure. Moreover, knowledge of the topology of the disulfide connectivity can be relevant in genomic annotation tasks and can provide long range constraints for ab-initio protein structure predictors. In this paper we describe PhyloCys, a novel unsupervised predictor of disulfide bond connectivity from known cysteine oxidation states. For each query protein, PhyloCys retrieves and aligns homologs with HHblits and builds a phylogenetic tree using ClustalW. A simplified model of cysteine co-evolution is then applied to the tree in order to hypothesize the presence of oxidized cysteines in the inner nodes of the tree, which represent ancestral protein sequences. The tree is then traversed from the leaves to the root and the putative disulfide connectivity is inferred by observing repeated patterns of tandem mutations between a sequence and its ancestors. A final correction is applied using the Edmonds-Gabow maximum weight perfect matching algorithm. The evolutionary approach applied in PhyloCys results in disulfide bond predictions equivalent to Sephiroth, another approach that takes whole sequence information into account, and is 26–29% better than state of the art methods based on cysteine covariance patterns in multiple sequence alignments, while requiring one order of magnitude fewer homologous sequences (10 3 instead of 10 4), thus extending its range of applicability. The software described in this article and the datasets used are available at http://ibsquare.be/phylocys.

          Related collections

          Most cited references14

          • Record: found
          • Abstract: found
          • Article: not found

          PSICOV: precise structural contact prediction using sparse inverse covariance estimation on large multiple sequence alignments.

          The accurate prediction of residue-residue contacts, critical for maintaining the native fold of a protein, remains an open problem in the field of structural bioinformatics. Interest in this long-standing problem has increased recently with algorithmic improvements and the rapid growth in the sizes of sequence families. Progress could have major impacts in both structure and function prediction to name but two benefits. Sequence-based contact predictions are usually made by identifying correlated mutations within multiple sequence alignments (MSAs), most commonly through the information-theoretic approach of calculating mutual information between pairs of sites in proteins. These predictions are often inaccurate because the true covariation signal in the MSA is often masked by biases from many ancillary indirect-coupling or phylogenetic effects. Here we present a novel method, PSICOV, which introduces the use of sparse inverse covariance estimation to the problem of protein contact prediction. Our method builds on work which had previously demonstrated corrections for phylogenetic and entropic correlation noise and allows accurate discrimination of direct from indirectly coupled mutation correlations in the MSA. PSICOV displays a mean precision substantially better than the best performing normalized mutual information approach and Bayesian networks. For 118 out of 150 targets, the L/5 (i.e. top-L/5 predictions for a protein of length L) precision for long-range contacts (sequence separation >23) was ≥ 0.5, which represents an improvement sufficient to be of significant benefit in protein structure prediction or model quality assessment. The PSICOV source code can be downloaded from http://bioinf.cs.ucl.ac.uk/downloads/PSICOV.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            DISULFIND: a disulfide bonding state and cysteine connectivity prediction server

            DISULFIND is a server for predicting the disulfide bonding state of cysteines and their disulfide connectivity starting from sequence alone. Optionally, disulfide connectivity can be predicted from sequence and a bonding state assignment given as input. The output is a simple visualization of the assigned bonding state (with confidence degrees) and the most likely connectivity patterns. The server is available at .
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Disulfide bonds and protein folding

                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS One
                PLoS ONE
                plos
                plosone
                PLoS ONE
                Public Library of Science (San Francisco, CA USA )
                1932-6203
                2015
                10 July 2015
                : 10
                : 7
                : e0131792
                Affiliations
                [1 ]Interuniversity Institute of Bioinformatics in Brussels, ULB-VUB, Brussels, Belgium
                [2 ]Structural Biology Brussels, Vrije Universiteit Brussel, Brussels, Belgium
                [3 ]Department of Structural Biology, VIB, Brussels, Belgium
                [4 ]Machine Learning Group, ULB, Brussels, Belgium
                Instituto de Biociencias—Universidade de São Paulo, BRAZIL
                Author notes

                Competing Interests: The authors have declared that no competing interests exist.

                Conceived and designed the experiments: GO DR WV. Performed the experiments: DR GO. Analyzed the data: GO DR WV. Contributed reagents/materials/analysis tools: DR GO WV. Wrote the paper: DR GO WV.

                ‡ These authors contributed equally to this work.

                Article
                PONE-D-15-11094
                10.1371/journal.pone.0131792
                4498770
                26161671
                e8219903-e6ca-4fdc-9d07-9e40e3d70cad
                Copyright @ 2015

                This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited

                History
                : 13 March 2015
                : 7 June 2015
                Page count
                Figures: 5, Tables: 5, Pages: 18
                Funding
                DR is funded by the Agency for Innovation by Science and Technology in Flanders (IWT) ( http://www.iwt.be/). WFV is funded by the Brussels Institute for Research and Innovation (Innoviris) grant BB2B 2010-1-12 ( http://www.innoviris.be/nl). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.
                Categories
                Research Article
                Custom metadata
                The data are available at http://dx.doi.org/10.6084/m9.figshare.1422074 for the oxcysnr dataset and http://dx.doi.org/10.6084/m9.figshare.1422073 for the oxcys15 dataset.

                Uncategorized
                Uncategorized

                Comments

                Comment on this article