6
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: not found
      • Article: not found

      Enspara: Modeling molecular ensembles with scalable data structures and parallel computing

      1 , 1 , 1
      The Journal of Chemical Physics
      AIP Publishing

      Read this article at

      ScienceOpenPublisherPMC
      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Markov state models (MSMs) are quantitative models of protein dynamics that are useful for uncovering the structural fluctuations that proteins undergo, as well as the mechanisms of these conformational changes. Given the enormity of conformational space, there has been ongoing interest in identifying a small number of states that capture the essential features of a protein. Generally, this is achieved by making assumptions about the properties of relevant features—for example, that the most important features are those that change slowly. An alternative strategy is to keep as many degrees of freedom as possible and subsequently learn from the model which of the features are most important. In these larger models, however, traditional approaches quickly become computationally intractable. In this paper, we present enspara, a library for working with MSMs that provides several novel algorithms and specialized data structures that dramatically improve the scalability of traditional MSM methods. This includes ragged arrays for minimizing memory requirements, message passing interface-parallelized implementations of compute-intensive operations, and a flexible framework for model construction and analysis.

          Related collections

          Most cited references43

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          The NumPy array: a structure for efficient numerical computation

          In the Python world, NumPy arrays are the standard representation for numerical data. Here, we show how these arrays enable efficient implementation of numerical computations in a high-level language. Overall, three techniques are applied to improve performance: vectorizing calculations, avoiding copying data in memory, and minimizing operation counts. We first present the NumPy array structure, then show how to use it for efficient computation, and finally how to share array data with other libraries.
            Bookmark
            • Record: found
            • Abstract: not found
            • Article: not found

            OpenMP: an industry standard API for shared-memory programming

              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Cython: The Best of Both Worlds

                Bookmark

                Author and article information

                Journal
                The Journal of Chemical Physics
                J. Chem. Phys.
                AIP Publishing
                0021-9606
                1089-7690
                January 28 2019
                January 28 2019
                : 150
                : 4
                : 044108
                Affiliations
                [1 ]Department of Biochemistry and Molecular Biophysics, Washington University School of Medicine, 660 South Euclid Avenue, St. Louis, Missouri 63110, USA
                Article
                10.1063/1.5063794
                6910589
                30709308
                eeb7ef40-f70f-4f54-b00b-153045e16508
                © 2019
                History

                Comments

                Comment on this article