44
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Combining Experiments and Simulations Using the Maximum Entropy Principle

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          A key component of computational biology is to compare the results of computer modelling with experimental measurements. Despite substantial progress in the models and algorithms used in many areas of computational biology, such comparisons sometimes reveal that the computations are not in quantitative agreement with experimental data. The principle of maximum entropy is a general procedure for constructing probability distributions in the light of new data, making it a natural tool in cases when an initial model provides results that are at odds with experiments. The number of maximum entropy applications in our field has grown steadily in recent years, in areas as diverse as sequence analysis, structural modelling, and neurobiology. In this Perspectives article, we give a broad introduction to the method, in an attempt to encourage its further adoption. The general procedure is explained in the context of a simple example, after which we proceed with a real-world application in the field of molecular simulations, where the maximum entropy procedure has recently provided new insight. Given the limited accuracy of force fields, macromolecular simulations sometimes produce results that are at not in complete and quantitative accordance with experiments. A common solution to this problem is to explicitly ensure agreement between the two by perturbing the potential energy function towards the experimental data. So far, a general consensus for how such perturbations should be implemented has been lacking. Three very recent papers have explored this problem using the maximum entropy approach, providing both new theoretical and practical insights to the problem. We highlight each of these contributions in turn and conclude with a discussion on remaining challenges.

          Related collections

          Most cited references31

          • Record: found
          • Abstract: found
          • Article: not found

          Direct-coupling analysis of residue coevolution captures native contacts across many protein families.

          The similarity in the three-dimensional structures of homologous proteins imposes strong constraints on their sequence variability. It has long been suggested that the resulting correlations among amino acid compositions at different sequence positions can be exploited to infer spatial contacts within the tertiary protein structure. Crucial to this inference is the ability to disentangle direct and indirect correlations, as accomplished by the recently introduced direct-coupling analysis (DCA). Here we develop a computationally efficient implementation of DCA, which allows us to evaluate the accuracy of contact prediction by DCA for a large number of protein domains, based purely on sequence information. DCA is shown to yield a large number of correctly predicted contacts, recapitulating the global structure of the contact map for the majority of the protein domains examined. Furthermore, our analysis captures clear signals beyond intradomain residue contacts, arising, e.g., from alternative protein conformations, ligand-mediated residue couplings, and interdomain interactions in protein oligomers. Our findings suggest that contacts predicted by DCA can be used as a reliable guide to facilitate computational predictions of alternative protein conformations, protein complex formation, and even the de novo prediction of protein domain structures, contingent on the existence of a large number of homologous sequences which are being rapidly made available due to advances in genome sequencing.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Systematic Validation of Protein Force Fields against Experimental Data

            Molecular dynamics simulations provide a vehicle for capturing the structures, motions, and interactions of biological macromolecules in full atomic detail. The accuracy of such simulations, however, is critically dependent on the force field—the mathematical model used to approximate the atomic-level forces acting on the simulated molecular system. Here we present a systematic and extensive evaluation of eight different protein force fields based on comparisons of experimental data with molecular dynamics simulations that reach a previously inaccessible timescale. First, through extensive comparisons with experimental NMR data, we examined the force fields' abilities to describe the structure and fluctuations of folded proteins. Second, we quantified potential biases towards different secondary structure types by comparing experimental and simulation data for small peptides that preferentially populate either helical or sheet-like structures. Third, we tested the force fields' abilities to fold two small proteins—one α-helical, the other with β-sheet structure. The results suggest that force fields have improved over time, and that the most recent versions, while not perfect, provide an accurate description of many structural and dynamical properties of proteins.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Long-timescale molecular dynamics simulations of protein structure and function.

              Molecular dynamics simulations allow for atomic-level characterization of biomolecular processes such as the conformational transitions associated with protein function. The computational demands of such simulations, however, have historically prevented them from reaching the microsecond and greater timescales on which these events often occur. Recent advances in algorithms, software, and computer hardware have made microsecond-timescale simulations with tens of thousands of atoms practical, with millisecond-timescale simulations on the horizon. This review outlines these advances in high-performance molecular dynamics simulation and discusses recent applications to studies of protein dynamics and function as well as experimental validation of the underlying computational models.
                Bookmark

                Author and article information

                Contributors
                Role: Editor
                Journal
                PLoS Comput Biol
                PLoS Comput. Biol
                plos
                ploscomp
                PLoS Computational Biology
                Public Library of Science (San Francisco, USA )
                1553-734X
                1553-7358
                February 2014
                20 February 2014
                : 10
                : 2
                : e1003406
                Affiliations
                [1 ]Structural Biology and NMR Laboratory, Department of Biology, University of Copenhagen, Copenhagen, Denmark
                [2 ]Cellular Signal Integration Group, Center for Biological Sequence Analysis, Technical University of Denmark, Lyngby, Denmark
                Stanford University, United States of America
                Author notes

                The authors have declared that no competing interests exist.

                Article
                PCOMPBIOL-D-13-01414
                10.1371/journal.pcbi.1003406
                3930489
                24586124
                87db7776-22da-40f7-8e09-cc10901b7c7c
                Copyright @ 2014

                This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

                History
                Page count
                Pages: 9
                Funding
                WB and KLL were supported by a Hallas Møller stipend from the Novo Nordisk Foundation ( www.novonordiskfonden.dk). The funders had no role in the preparation of the manuscript.
                Categories
                Perspective
                Biology
                Biophysics
                Biophysics Simulations
                Computational Biology
                Biophysic Al Simulations
                Chemistry
                Computational Chemistry
                Molecular Mechanics
                Physics
                Biophysics
                Biophysics Simulations
                Physical Laws and Principles
                Thermodynamics
                Entropy

                Quantitative & Systems biology
                Quantitative & Systems biology

                Comments

                Comment on this article