11
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Geometric potentials from deep learning improve prediction of CDR H3 loop structures

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Motivation

          Antibody structure is largely conserved, except for a complementarity-determining region featuring six variable loops. Five of these loops adopt canonical folds which can typically be predicted with existing methods, while the remaining loop (CDR H3) remains a challenge due to its highly diverse set of observed conformations. In recent years, deep neural networks have proven to be effective at capturing the complex patterns of protein structure. This work proposes DeepH3, a deep residual neural network that learns to predict inter-residue distances and orientations from antibody heavy and light chain sequence. The output of DeepH3 is a set of probability distributions over distances and orientation angles between pairs of residues. These distributions are converted to geometric potentials and used to discriminate between decoy structures produced by RosettaAntibody and predict new CDR H3 loop structures de novo.

          Results

          When evaluated on the Rosetta antibody benchmark dataset of 49 targets, DeepH3-predicted potentials identified better, same and worse structures [measured by root-mean-squared distance (RMSD) from the experimental CDR H3 loop structure] than the standard Rosetta energy function for 33, 6 and 10 targets, respectively, and improved the average RMSD of predictions by 32.1% (1.4 Å). Analysis of individual geometric potentials revealed that inter-residue orientations were more effective than inter-residue distances for discriminating near-native CDR H3 loops. When applied to de novo prediction of CDR H3 loop structures, DeepH3 achieves an average RMSD of 2.2 ± 1.1 Å on the Rosetta antibody benchmark.

          Availability and Implementation

          DeepH3 source code and pre-trained model parameters are freely available at https://github.com/Graylab/deepH3-distances-orientations.

          Supplementary information

          Supplementary data are available at Bioinformatics online.

          Related collections

          Most cited references22

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          SAbDab: the structural antibody database

          Structural antibody database (SAbDab; http://opig.stats.ox.ac.uk/webapps/sabdab) is an online resource containing all the publicly available antibody structures annotated and presented in a consistent fashion. The data are annotated with several properties including experimental information, gene details, correct heavy and light chain pairings, antigen details and, where available, antibody–antigen binding affinity. The user can select structures, according to these attributes as well as structural properties such as complementarity determining region loop conformation and variable domain orientation. Individual structures, datasets and the complete database can be downloaded.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Conformations of immunoglobulin hypervariable regions.

            On the basis of comparative studies of known antibody structures and sequences it has been argued that there is a small repertoire of main-chain conformations for at least five of the six hypervariable regions of antibodies, and that the particular conformation adopted is determined by a few key conserved residues. These hypotheses are now supported by reasonably successful predictions of the structures of most hypervariable regions of various antibodies, as revealed by comparison with their subsequently determined structures.
              Bookmark
              • Record: found
              • Abstract: not found
              • Article: not found

              Sub-angstrom accuracy in protein loop reconstruction by robotics-inspired conformational sampling.

                Bookmark

                Author and article information

                Journal
                Bioinformatics
                Bioinformatics
                bioinformatics
                Bioinformatics
                Oxford University Press
                1367-4803
                1367-4811
                July 2020
                13 July 2020
                13 July 2020
                : 36
                : Suppl 1 , ISMB 2020 Proceedings
                : i268-i275
                Affiliations
                [b1 ] Program in Molecular Biophysics, The Johns Hopkins University , Baltimore, MD 21218, USA
                [b2 ] Department of Computer Science, George Mason University , Fairfax, VA 22030, USA
                [b3 ] Department of Chemical and Biomolecular Engineering, The Johns Hopkins University , Baltimore, MD 21218, USA
                [b4 ] Department of Biomedical Engineering, The Johns Hopkins University , Baltimore, MD 21218, USA
                [b5 ] Mathematical Institute for Data Science, The Johns Hopkins University , Baltimore, MD 21218, USA
                Author notes
                To whom correspondence should be addressed. E-mail: jgray@ 123456jhu.edu
                Article
                btaa457
                10.1093/bioinformatics/btaa457
                7355305
                32657412
                0a9f0a5e-5d20-40c4-8e92-5b5cf03266ed
                © The Author(s) 2020. Published by Oxford University Press.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution Non-Commercial License ( http://creativecommons.org/licenses/by-nc/4.0/), which permits non-commercial re-use, distribution, and reproduction in any medium, provided the original work is properly cited. For commercial re-use, please contact journals.permissions@oup.com

                History
                Page count
                Pages: 8
                Funding
                Funded by: National Institutes of Health, DOI 10.13039/100000002;
                Award ID: R01-GM078221
                Award ID: T32-GM008403
                Funded by: National Science Foundation Research Experience for Undergraduates;
                Award ID: DBI-1659649
                Funded by: Maryland Advanced Research Computing Cluster;
                Funded by: MARCC;
                Categories
                Macromolecular Sequence, Structure, and Function

                Bioinformatics & Computational biology
                Bioinformatics & Computational biology

                Comments

                Comment on this article