20
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Assessment of contact predictions in CASP12: Co‐evolution and deep learning coming of age

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Following up on the encouraging results of residue‐residue contact prediction in the CASP11 experiment, we present the analysis of predictions submitted for CASP12. The submissions include predictions of 34 groups for 38 domains classified as free modeling targets which are not accessible to homology‐based modeling due to a lack of structural templates. CASP11 saw a rise of coevolution‐based methods outperforming other approaches. The improvement of these methods coupled to machine learning and sequence database growth are most likely the main driver for a significant improvement in average precision from 27% in CASP11 to 47% in CASP12. In more than half of the targets, especially those with many homologous sequences accessible, precisions above 90% were achieved with the best predictors reaching a precision of 100% in some cases. We furthermore tested the impact of using these contacts as restraints in ab initio modeling of 14 single‐domain free modeling targets using Rosetta. Adding contacts to the Rosetta calculations resulted in improvements of up to 26% in GDT_TS within the top five structures.

          Related collections

          Most cited references59

          • Record: found
          • Abstract: found
          • Article: not found

          Protein homology detection by HMM-HMM comparison.

          Protein homology detection and sequence alignment are at the basis of protein structure prediction, function prediction and evolution. We have generalized the alignment of protein sequences with a profile hidden Markov model (HMM) to the case of pairwise alignment of profile HMMs. We present a method for detecting distant homologous relationships between proteins based on this approach. The method (HHsearch) is benchmarked together with BLAST, PSI-BLAST, HMMER and the profile-profile comparison tools PROF_SIM and COMPASS, in an all-against-all comparison of a database of 3691 protein domains from SCOP 1.63 with pairwise sequence identities below 20%.Sensitivity: When the predicted secondary structure is included in the HMMs, HHsearch is able to detect between 2.7 and 4.2 times more homologs than PSI-BLAST or HMMER and between 1.44 and 1.9 times more than COMPASS or PROF_SIM for a rate of false positives of 10%. Approximately half of the improvement over the profile-profile comparison methods is attributable to the use of profile HMMs in place of simple profiles. Alignment quality: Higher sensitivity is mirrored by an increased alignment quality. HHsearch produced 1.2, 1.7 and 3.3 times more good alignments ('balanced' score >0.3) than the next best method (COMPASS), and 1.6, 2.9 and 9.4 times more than PSI-BLAST, at the family, superfamily and fold level, respectively.Speed: HHsearch scans a query of 200 residues against 3691 domains in 33 s on an AMD64 2GHz PC. This is 10 times faster than PROF_SIM and 17 times faster than COMPASS.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            ROSETTA3: an object-oriented software suite for the simulation and design of macromolecules.

            We have recently completed a full re-architecturing of the ROSETTA molecular modeling program, generalizing and expanding its existing functionality. The new architecture enables the rapid prototyping of novel protocols by providing easy-to-use interfaces to powerful tools for molecular modeling. The source code of this rearchitecturing has been released as ROSETTA3 and is freely available for academic use. At the time of its release, it contained 470,000 lines of code. Counting currently unpublished protocols at the time of this writing, the source includes 1,285,000 lines. Its rapid growth is a testament to its ease of use. This chapter describes the requirements for our new architecture, justifies the design decisions, sketches out central classes, and highlights a few of the common tasks that the new software can perform. © 2011 Elsevier Inc. All rights reserved.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              Hidden Markov model speed heuristic and iterative HMM search procedure

              Background Profile hidden Markov models (profile-HMMs) are sensitive tools for remote protein homology detection, but the main scoring algorithms, Viterbi or Forward, require considerable time to search large sequence databases. Results We have designed a series of database filtering steps, HMMERHEAD, that are applied prior to the scoring algorithms, as implemented in the HMMER package, in an effort to reduce search time. Using this heuristic, we obtain a 20-fold decrease in Forward and a 6-fold decrease in Viterbi search time with a minimal loss in sensitivity relative to the unfiltered approaches. We then implemented an iterative profile-HMM search method, JackHMMER, which employs the HMMERHEAD heuristic. Due to our search heuristic, we eliminated the subdatabase creation that is common in current iterative profile-HMM approaches. On our benchmark, JackHMMER detects 14% more remote protein homologs than SAM's iterative method T2K. Conclusions Our search heuristic, HMMERHEAD, significantly reduces the time needed to score a profile-HMM against large sequence databases. This search heuristic allowed us to implement an iterative profile-HMM search method, JackHMMER, which detects significantly more remote protein homologs than SAM's T2K and NCBI's PSI-BLAST.
                Bookmark

                Author and article information

                Contributors
                a.m.j.j.bonvin@uu.nl
                Journal
                Proteins
                Proteins
                10.1002/(ISSN)1097-0134
                PROT
                Proteins
                John Wiley and Sons Inc. (Hoboken )
                0887-3585
                1097-0134
                07 November 2017
                March 2018
                : 86
                : Suppl Suppl 1 , Twelfth Meeting on the Critical Assessment of Techniques for Protein Structure Prediction ( doiID: 10.1002/prot.v86.S1 )
                : 51-66
                Affiliations
                [ 1 ] Faculty of Science ‐ Chemistry Computational Structural Biology Group, Bijvoet Center for Biomolecular Research, Utrecht University Utrecht The Netherlands
                [ 2 ] Genome Center, University of California Davis California
                Author notes
                [*] [* ] Correspondence Alexandre M.J.J. Bonvin, Faculty of Science, Utrecht University, Padualaan 8, 3584 CH Utrecht, the Netherlands. Email: a.m.j.j.bonvin@ 123456uu.nl
                Author information
                http://orcid.org/0000-0002-4389-2366
                http://orcid.org/0000-0001-5066-7178
                http://orcid.org/0000-0001-7369-1322
                Article
                PROT25407
                10.1002/prot.25407
                5820169
                29071738
                f06306d0-2a14-438b-9366-dd12201f3340
                © 2017 The Authors Proteins: Structure, Function and Bioinformatics Published by Wiley Periodicals, Inc.

                This is an open access article under the terms of the Creative Commons Attribution‐NonCommercial License, which permits use, distribution and reproduction in any medium, provided the original work is properly cited and is not used for commercial purposes.

                History
                : 14 July 2017
                : 06 October 2017
                : 24 October 2017
                Page count
                Figures: 16, Tables: 2, Pages: 16, Words: 8985
                Funding
                Funded by: National Institute of General Medical Sciences (NIGMS/NIH)
                Award ID: GM100482
                Funded by: The FP7 WeNMR
                Award ID: # 261572
                Funded by: Horizon 2020 West‐Life
                Award ID: # 675858
                Funded by: European e‐Infrastructure projects
                Categories
                Research Article
                Research Articles
                Custom metadata
                2.0
                prot25407
                March 2018
                Converter:WILEY_ML3GV2_TO_NLMPMC version:version=5.3.2.2 mode:remove_FC converted:05.03.2018

                Biochemistry
                casp,contact prediction,correlated mutations,co‐variation,evolutionary coupling,de novo structure prediction

                Comments

                Comment on this article