5
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      GlyStruct: glycation prediction using structural properties of amino acid residues

      research-article
      1 , , 1 , 2 , 3 , 4 , , 5 , 2 , 4 , 6 , 7 , 1 , 2 , 4 , 7
      BMC Bioinformatics
      BioMed Central
      17th International Conference on Bioinformatics (InCoB 2018)
      26-28 September 2018
      Post-translational modification, Lysine glycation, Protein sequences, Amino acids, Prediction, Support vector machine

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Background

          Glycation is a one of the post-translational modifications (PTM) where sugar molecules and residues in protein sequences are covalently bonded. It has become one of the clinically important PTM in recent times attributed to many chronic and age related complications. Being a non-enzymatic reaction, it is a great challenge when it comes to its prediction due to the lack of significant bias in the sequence motifs.

          Results

          We developed a classifier, GlyStruct based on support vector machine, to predict glycated and non-glycated lysine residues using structural properties of amino acid residues. The features used were secondary structure, accessible surface area and the local backbone torsion angles. For this work, a benchmark dataset was extracted containing 235 glycated and 303 non-glycated lysine residues. GlyStruct demonstrated improved performance of approximately 10% in comparison to benchmark method of Gly-PseAAC. The performance for GlyStruct on the metrics, sensitivity, specificity, accuracy and Mathew’s correlation coefficient were 0.7013, 0.7989, 0.7562, and 0.5065, respectively for 10-fold cross-validation.

          Conclusion

          Glycation has emerged to be one of the clinically important PTM of proteins in recent times. Therefore, the development of computational tools become necessary to predict glycation, which could help medical professionals administer drugs and manage patients more effectively. The proposed predictor manages to classify glycated and non-glycated lysine residues with promising results consistently on various cross-validation schemes and outperforms other state of the art methods.

          Electronic supplementary material

          The online version of this article (10.1186/s12859-018-2547-x) contains supplementary material, which is available to authorized users.

          Related collections

          Most cited references65

          • Record: found
          • Abstract: found
          • Article: found
          Is Open Access

          PhosphoSitePlus: a comprehensive resource for investigating the structure and function of experimentally determined post-translational modifications in man and mouse

          PhosphoSitePlus (http://www.phosphosite.org) is an open, comprehensive, manually curated and interactive resource for studying experimentally observed post-translational modifications, primarily of human and mouse proteins. It encompasses 1 30 000 non-redundant modification sites, primarily phosphorylation, ubiquitinylation and acetylation. The interface is designed for clarity and ease of navigation. From the home page, users can launch simple or complex searches and browse high-throughput data sets by disease, tissue or cell line. Searches can be restricted by specific treatments, protein types, domains, cellular components, disease, cell types, cell lines, tissue and sequences or motifs. A few clicks of the mouse will take users to substrate pages or protein pages with sites, sequences, domain diagrams and molecular visualization of side-chains known to be modified; to site pages with information about how the modified site relates to the functions of specific proteins and cellular processes and to curated information pages summarizing the details from one record. PyMOL and Chimera scripts that colorize reactive groups on residues that are modified can be downloaded. Features designed to facilitate proteomic analyses include downloads of modification sites, kinase–substrate data sets, sequence logo generators, a Cytoscape plugin and BioPAX download to enable pathway visualization of the kinase–substrate interactions in PhosphoSitePlus®.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            Proteomic analysis of post-translational modifications.

            Post-translational modifications modulate the activity of most eukaryote proteins. Analysis of these modifications presents formidable challenges but their determination generates indispensable insight into biological function. Strategies developed to characterize individual proteins are now systematically applied to protein populations. The combination of function- or structure-based purification of modified 'subproteomes', such as phosphorylated proteins or modified membrane proteins, with mass spectrometry is proving particularly successful. To map modification sites in molecular detail, novel mass spectrometric peptide sequencing and analysis technologies hold tremendous potential. Finally, stable isotope labeling strategies in combination with mass spectrometry have been applied successfully to study the dynamics of modifications.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Cell-PLoc: a package of Web servers for predicting subcellular localization of proteins in various organisms.

              Information on subcellular localization of proteins is important to molecular cell biology, proteomics, system biology and drug discovery. To provide the vast majority of experimental scientists with a user-friendly tool in these areas, we present a package of Web servers developed recently by hybridizing the 'higher level' approach with the ab initio approach. The package is called Cell-PLoc and contains the following six predictors: Euk-mPLoc, Hum-mPLoc, Plant-PLoc, Gpos-PLoc, Gneg-PLoc and Virus-PLoc, specialized for eukaryotic, human, plant, Gram-positive bacterial, Gram-negative bacterial and viral proteins, respectively. Using these Web servers, one can easily get the desired prediction results with a high expected accuracy, as demonstrated by a series of cross-validation tests on the benchmark data sets that covered up to 22 subcellular location sites and in which none of the proteins included had > or =25% sequence identity to any other protein in the same subcellular-location subset. Some of these Web servers can be particularly used to deal with multiplex proteins as well, which may simultaneously exist at, or move between, two or more different subcellular locations. Proteins with multiple locations or dynamic features of this kind are particularly interesting, because they may have some special biological functions intriguing to investigators in both basic research and drug discovery. This protocol is a step-by-step guide on how to use the Web-server predictors in the Cell-PLoc package. The computational time for each prediction is less than 5 s in most cases. The Cell-PLoc package is freely accessible at http://chou.med.harvard.edu/bioinf/Cell-PLoc.
                Bookmark

                Author and article information

                Contributors
                hamendra.reddy@gmail.com
                alok.sharma@griffith.edu.au
                i.dehzangi@gmail.com
                d.shigemizu@gmail.com
                chandra_ab@usp.ac.fj
                tatsuhiko.tsunoda@riken.jp
                Conference
                BMC Bioinformatics
                BMC Bioinformatics
                BMC Bioinformatics
                BioMed Central (London )
                1471-2105
                4 February 2019
                4 February 2019
                2019
                : 19
                : Suppl 13
                : 547
                Affiliations
                [1 ]ISNI 0000 0001 2171 4027, GRID grid.33998.38, School of Engineering & Physics, , University of the South Pacific, ; Suva, Fiji
                [2 ]Laboratory for Medical Science Mathematics, RIKEN Center for Integrative Medical Sciences, Tokyo, Japan
                [3 ]ISNI 0000 0004 0437 5432, GRID grid.1022.1, Institute for Integrated and Intelligent Systems, Griffith University, ; Brisbane, Australia
                [4 ]ISNI 0000 0004 1754 9200, GRID grid.419082.6, CREST, JST, ; Tokyo, Japan
                [5 ]ISNI 0000 0001 2224 4258, GRID grid.260238.d, Department of Computer Science, , Morgan State University, ; Baltimore, MD USA
                [6 ]ISNI 0000 0004 1791 9005, GRID grid.419257.c, Division of Genomic Medicine, , Medical Genome Center, National Center for Geriatrics and Gerontology, ; Obu, Aichi Japan
                [7 ]ISNI 0000 0001 1014 9130, GRID grid.265073.5, Department of Medical Science Mathematics, , Medical Research Institute, Tokyo Medical and Dental University, ; Tokyo, Japan
                Author information
                http://orcid.org/0000-0003-4852-3778
                Article
                2547
                10.1186/s12859-018-2547-x
                7394324
                30717650
                adb217b6-f67f-4807-9ca1-04b0e6df4ac9
                © The Author(s). 2019

                Open AccessThis article is distributed under the terms of the Creative Commons Attribution 4.0 International License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted use, distribution, and reproduction in any medium, provided you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The Creative Commons Public Domain Dedication waiver ( http://creativecommons.org/publicdomain/zero/1.0/) applies to the data made available in this article, unless otherwise stated.

                17th International Conference on Bioinformatics (InCoB 2018)
                New Delhi, India
                26-28 September 2018
                History
                : 23 May 2018
                : 29 November 2018
                Funding
                Funded by: The University of the South Pacific
                Award ID: FST14/F3205
                Award Recipient :
                Categories
                Research
                Custom metadata
                © The Author(s) 2019

                Bioinformatics & Computational biology
                post-translational modification,lysine glycation,protein sequences,amino acids,prediction,support vector machine

                Comments

                Comment on this article