4
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Plant-mSubP: a computational framework for the prediction of single- and multi-target protein subcellular localization using integrated machine-learning approaches

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          The subcellular localization of proteins is very important for characterizing its function in a cell. Accurate prediction of the subcellular locations in computational paradigm has been an active area of interest. Most of the work has been focused on single localization prediction. Only few studies have discussed the multi-target localization, but have not achieved good accuracy so far; in plant sciences, very limited work has been done. Here we report the development of a novel tool Plant-mSubP, which is based on integrated machine learning approaches to efficiently predict the subcellular localizations in plant proteomes. The proposed approach predicts with high accuracy 11 single localizations and three dual locations of plant cell. Several hybrid features based on composition and physicochemical properties of a protein such as amino acid composition, pseudo amino acid composition, auto-correlation descriptors, quasi-sequence-order descriptors and hybrid features are used to represent the protein. The performance of the proposed method has been assessed through a training set as well as an independent test set. Using the hybrid feature of the pseudo amino acid composition, N-Center-C terminal amino acid composition and the dipeptide composition (PseAAC-NCC-DIPEP), an overall accuracy of 81.97 %, 84.75 % and 87.88 % is achieved on the training data set of proteins containing the single-label, single- and dual-label combined, and dual-label proteins, respectively. When tested on the independent data, an accuracy of 64.36 %, 64.84 % and 81.08 % is achieved on the single-label, single- and dual-label, and dual-label proteins, respectively. The prediction models have been implemented on a web server available at http://bioinfo.usu.edu/Plant-mSubP/. The results indicate that the proposed approach is comparable to the existing methods in single localization prediction and outperforms all other existing tools when compared for dual-label proteins. The prediction tool will be a useful resource for better annotation of various plant proteomes.

          Abstract

          In order for various cellular processes to be carried out in a cell, the correct identification of subcellular localization of a protein is crucial. Determining the localization of proteins through experiments can be laborious, expensive and time-consuming. Computational prediction is thus essential to decipher protein function and faster genome annotation. It also aids the identification of drug targets. We present here a software, Plant-mSubP, which is a publicly available web platform at http://bioinfo.usu.edu/Plant-mSubP/. Plant-mSubP uses artificial intelligence to effectively predict the protein localizations (11 single- and three dual-target) inside the plant cell. This resource can assess a high number of proteins in order to find their desired location in a cell, and thus will be useful in better annotation of various plant proteomes.

          Related collections

          Most cited references46

          • Record: found
          • Abstract: not found
          • Article: not found

          Recent progress in protein subcellular location prediction.

            Bookmark
            • Record: found
            • Abstract: found
            • Article: found
            Is Open Access

            LOCALIZER: subcellular localization prediction of both plant and effector proteins in the plant cell

            Pathogens secrete effector proteins and many operate inside plant cells to enable infection. Some effectors have been found to enter subcellular compartments by mimicking host targeting sequences. Although many computational methods exist to predict plant protein subcellular localization, they perform poorly for effectors. We introduce LOCALIZER for predicting plant and effector protein localization to chloroplasts, mitochondria, and nuclei. LOCALIZER shows greater prediction accuracy for chloroplast and mitochondrial targeting compared to other methods for 652 plant proteins. For 107 eukaryotic effectors, LOCALIZER outperforms other methods and predicts a previously unrecognized chloroplast transit peptide for the ToxA effector, which we show translocates into tobacco chloroplasts. Secretome-wide predictions and confocal microscopy reveal that rust fungi might have evolved multiple effectors that target chloroplasts or nuclei. LOCALIZER is the first method for predicting effector localisation in plants and is a valuable tool for prioritizing effector candidates for functional investigations. LOCALIZER is available at http://localizer.csiro.au/.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: found
              Is Open Access

              BUSCA: an integrative web server to predict subcellular localization of proteins

              Abstract Here, we present BUSCA (http://busca.biocomp.unibo.it), a novel web server that integrates different computational tools for predicting protein subcellular localization. BUSCA combines methods for identifying signal and transit peptides (DeepSig and TPpred3), GPI-anchors (PredGPI) and transmembrane domains (ENSEMBLE3.0 and BetAware) with tools for discriminating subcellular localization of both globular and membrane proteins (BaCelLo, MemLoci and SChloro). Outcomes from the different tools are processed and integrated for annotating subcellular localization of both eukaryotic and bacterial protein sequences. We benchmark BUSCA against protein targets derived from recent CAFA experiments and other specific data sets, reporting performance at the state-of-the-art. BUSCA scores better than all other evaluated methods on 2732 targets from CAFA2, with a F1 value equal to 0.49 and among the best methods when predicting targets from CAFA3. We propose BUSCA as an integrated and accurate resource for the annotation of protein subcellular localization.
                Bookmark

                Author and article information

                Contributors
                Role: Associate Editor
                Journal
                AoB Plants
                AoB Plants
                aobpla
                AoB Plants
                Oxford University Press (US )
                2041-2851
                June 2020
                17 October 2019
                17 October 2019
                : 12
                : 3
                : plz068
                Affiliations
                [1 ] Department of Electronics and Communication Engineering, Birla Institute of Technology , Mesra, Ranchi, India
                [2 ] Department of Plants, Soils, and Climate/Center for Integrated BioSystems, College of Agriculture and Applied Sciences, Utah State University , Logan, UT, USA
                [3 ] Bioinformatics Facility, Center for Integrated BioSystems, Utah State University , Logan, UT, USA
                Author notes
                Corresponding author’s e-mail address: rkaundal@ 123456usu.edu

                These authors contributed equally to this work.

                Article
                plz068
                10.1093/aobpla/plz068
                7274489
                32528639
                a1a67f1b-88c5-4f18-bdc5-55bb5bbdbcb2
                © The Author(s) 2019. Published by Oxford University Press on behalf of the Annals of Botany Company.

                This is an Open Access article distributed under the terms of the Creative Commons Attribution License ( http://creativecommons.org/licenses/by/4.0/), which permits unrestricted reuse, distribution, and reproduction in any medium, provided the original work is properly cited.

                History
                : 28 April 2019
                : 23 August 2019
                : 11 October 2019
                : 10 March 2020
                Page count
                Pages: 10
                Categories
                Tools
                AcademicSubjects/SCI01210

                Plant science & Botany
                artificial intelligence,machine learning,multi-location,prediction tool,protein science,subcellular localization,web server

                Comments

                Comment on this article