1
views
0
recommends
+1 Recommend
1 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      A Structure-Based B-cell Epitope Prediction Model Through Combing Local and Global Features

      methods-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          B-cell epitopes (BCEs) are a set of specific sites on the surface of an antigen that binds to an antibody produced by B-cell. The recognition of BCEs is a major challenge for drug design and vaccines development. Compared with experimental methods, computational approaches have strong potential for BCEs prediction at much lower cost. Moreover, most of the currently methods focus on using local information around target residue without taking the global information of the whole antigen sequence into consideration. We propose a novel deep leaning method through combing local features and global features for BCEs prediction. In our model, two parallel modules are built to extract local and global features from the antigen separately. For local features, we use Graph Convolutional Networks (GCNs) to capture information of spatial neighbors of a target residue. For global features, Attention-Based Bidirectional Long Short-Term Memory (Att-BLSTM) networks are applied to extract information from the whole antigen sequence. Then the local and global features are combined to predict BCEs. The experiments show that the proposed method achieves superior performance over the state-of-the-art BCEs prediction methods on benchmark datasets. Also, we compare the performance differences between data with or without global features. The experimental results show that global features play an important role in BCEs prediction. Our detailed case study on the BCEs prediction for SARS-Cov-2 receptor binding domain confirms that our method is effective for predicting and clustering true BCEs.

          Related collections

          Most cited references43

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          A pneumonia outbreak associated with a new coronavirus of probable bat origin

          Since the outbreak of severe acute respiratory syndrome (SARS) 18 years ago, a large number of SARS-related coronaviruses (SARSr-CoVs) have been discovered in their natural reservoir host, bats 1–4 . Previous studies have shown that some bat SARSr-CoVs have the potential to infect humans 5–7 . Here we report the identification and characterization of a new coronavirus (2019-nCoV), which caused an epidemic of acute respiratory syndrome in humans in Wuhan, China. The epidemic, which started on 12 December 2019, had caused 2,794 laboratory-confirmed infections including 80 deaths by 26 January 2020. Full-length genome sequences were obtained from five patients at an early stage of the outbreak. The sequences are almost identical and share 79.6% sequence identity to SARS-CoV. Furthermore, we show that 2019-nCoV is 96% identical at the whole-genome level to a bat coronavirus. Pairwise protein sequence analysis of seven conserved non-structural proteins domains show that this virus belongs to the species of SARSr-CoV. In addition, 2019-nCoV virus isolated from the bronchoalveolar lavage fluid of a critically ill patient could be neutralized by sera from several patients. Notably, we confirmed that 2019-nCoV uses the same cell entry receptor—angiotensin converting enzyme II (ACE2)—as SARS-CoV.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Long Short-Term Memory

            Learning to store information over extended time intervals by recurrent backpropagation takes a very long time, mostly because of insufficient, decaying error backflow. We briefly review Hochreiter's (1991) analysis of this problem, then address it by introducing a novel, efficient, gradient-based method called long short-term memory (LSTM). Truncating the gradient where this does not do harm, LSTM can learn to bridge minimal time lags in excess of 1000 discrete-time steps by enforcing constant error flow through constant error carousels within special units. Multiplicative gate units learn to open and close access to the constant error flow. LSTM is local in space and time; its computational complexity per time step and weight is O(1). Our experiments with artificial data involve local, distributed, real-valued, and noisy pattern representations. In comparisons with real-time recurrent learning, back propagation through time, recurrent cascade correlation, Elman nets, and neural sequence chunking, LSTM leads to many more successful runs, and learns much faster. LSTM also solves complex, artificial long-time-lag tasks that have never been solved by previous recurrent network algorithms.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Gapped BLAST and PSI-BLAST: a new generation of protein database search programs.

              S Altschul (1997)
              The BLAST programs are widely used tools for searching protein and DNA databases for sequence similarities. For protein comparisons, a variety of definitional, algorithmic and statistical refinements described here permits the execution time of the BLAST programs to be decreased substantially while enhancing their sensitivity to weak similarities. A new criterion for triggering the extension of word hits, combined with a new heuristic for generating gapped alignments, yields a gapped BLAST program that runs at approximately three times the speed of the original. In addition, a method is introduced for automatically combining statistically significant alignments produced by BLAST into a position-specific score matrix, and searching the database using this matrix. The resulting Position-Specific Iterated BLAST (PSI-BLAST) program runs at approximately the same speed per iteration as gapped BLAST, but in many cases is much more sensitive to weak but biologically relevant sequence similarities. PSI-BLAST is used to uncover several new and interesting members of the BRCT superfamily.
                Bookmark

                Author and article information

                Contributors
                Journal
                Front Immunol
                Front Immunol
                Front. Immunol.
                Frontiers in Immunology
                Frontiers Media S.A.
                1664-3224
                01 July 2022
                2022
                01 July 2022
                : 13
                : 890943
                Affiliations
                [1] 1 School of Computer and Artificial Intelligence, Zhengzhou University , Zhengzhou, China
                [2] 2 School of Life Sciences, Zhengzhou University , Zhengzhou, China
                [3] 3 Longhu Laboratory of Advanced Immunology , Zhengzhou, China
                Author notes

                Edited by: Roland Dunbrack, Fox Chase Cancer Center, United States

                Reviewed by: Philippe Auguste Robert, University of Oslo, Norway; Kannan Sankar, Novartis Institutes for BioMedical Research, United States

                *Correspondence: Xiaofei Nan, iexfnan@ 123456zzu.edu.cn ; Shoutao Zhang, zhangst@ 123456zzu.edu.cn

                This article was submitted to B Cell Biology, a section of the journal Frontiers in Immunology

                Article
                10.3389/fimmu.2022.890943
                9283778
                35844532
                e0e378b9-564a-493f-8608-d0fc90186339
                Copyright © 2022 Lu, Li, Ma, Nan and Zhang

                This is an open-access article distributed under the terms of the Creative Commons Attribution License (CC BY). The use, distribution or reproduction in other forums is permitted, provided the original author(s) and the copyright owner(s) are credited and that the original publication in this journal is cited, in accordance with accepted academic practice. No use, distribution or reproduction is permitted which does not comply with these terms.

                History
                : 07 March 2022
                : 23 May 2022
                Page count
                Figures: 6, Tables: 4, Equations: 17, References: 43, Pages: 11, Words: 5202
                Funding
                Funded by: Science and Technology Department of Henan Province , doi 10.13039/501100011447;
                Award ID: 194200510002
                Funded by: National Health Commission of the People's Republic of China , doi 10.13039/501100018684;
                Funded by: Xinjiang Production and Construction Corps , doi 10.13039/501100009967;
                Categories
                Immunology
                Methods

                Immunology
                bi-lstm,gcn,sars-cov-2,structure-based,attention,b-cell epitopes prediction
                Immunology
                bi-lstm, gcn, sars-cov-2, structure-based, attention, b-cell epitopes prediction

                Comments

                Comment on this article