3
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: found
      Is Open Access

      Multi-species Identification of Polymorphic Peptide Variants via Propagation in Spectral Networks*

      research-article

      Read this article at

      Bookmark
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Peptide and protein identification remains challenging in organisms with poorly annotated or rapidly evolving genomes, as are commonly encountered in environmental or biofuels research. Such limitations render tandem mass spectrometry (MS/MS) database search algorithms ineffective as they lack corresponding sequences required for peptide-spectrum matching. We address this challenge with the spectral networks approach to (1) match spectra of orthologous peptides across multiple related species and then (2) propagate peptide annotations from identified to unidentified spectra. We here present algorithms to assess the statistical significance of spectral alignments (Align-GF), reduce the impurity in spectral networks, and accurately estimate the error rate in propagated identifications. Analyzing three related Cyanothece species, a model organism for biohydrogen production, spectral networks identified peptides from highly divergent sequences from networks with dozens of variant peptides, including thousands of peptides in species lacking a sequenced genome. Our analysis further detected the presence of many novel putative peptides even in genomically characterized species, thus suggesting the possibility of gaps in our understanding of their proteomic and genomic expression. A web-based pipeline for spectral networks analysis is available at http://proteomics.ucsd.edu/software.

          Related collections

          Most cited references33

          • Record: found
          • Abstract: found
          • Article: not found

          Improving the coverage of the cyanobacterial phylum using diversity-driven genome sequencing.

          The cyanobacterial phylum encompasses oxygenic photosynthetic prokaryotes of a great breadth of morphologies and ecologies; they play key roles in global carbon and nitrogen cycles. The chloroplasts of all photosynthetic eukaryotes can trace their ancestry to cyanobacteria. Cyanobacteria also attract considerable interest as platforms for "green" biotechnology and biofuels. To explore the molecular basis of their different phenotypes and biochemical capabilities, we sequenced the genomes of 54 phylogenetically and phenotypically diverse cyanobacterial strains. Comparison of cyanobacterial genomes reveals the molecular basis for many aspects of cyanobacterial ecophysiological diversity, as well as the convergence of complex morphologies without the acquisition of novel proteins. This phylum-wide study highlights the benefits of diversity-driven genome sequencing, identifying more than 21,000 cyanobacterial proteins with no detectable similarity to known proteins, and foregrounds the diversity of light-harvesting proteins and gene clusters for secondary metabolite biosynthesis. Additionally, our results provide insight into the distribution of genes of cyanobacterial origin in eukaryotic nuclear genomes. Moreover, this study doubles both the amount and the phylogenetic diversity of cyanobacterial genome sequence data. Given the exponentially growing number of sequenced genomes, this diversity-driven study demonstrates the perspective gained by comparing disparate yet related genomes in a phylum-wide context and the insights that are gained from it.
            Bookmark
            • Record: found
            • Abstract: found
            • Article: not found

            PepNovo: de novo peptide sequencing via probabilistic network modeling.

            We present a novel scoring method for de novo interpretation of peptides from tandem mass spectrometry data. Our scoring method uses a probabilistic network whose structure reflects the chemical and physical rules that govern the peptide fragmentation. We use a likelihood ratio hypothesis test to determine whether the peaks observed in the mass spectrum are more likely to have been produced under our fragmentation model than under a model that treats peaks as random events. We tested our de novo algorithm PepNovo on ion trap data and achieved results that are superior to popular de novo peptide sequencing algorithms. PepNovo can be accessed via the URL http://www-cse.ucsd.edu/groups/bioinformatics/software.html.
              Bookmark
              • Record: found
              • Abstract: found
              • Article: not found

              Community proteomics of a natural microbial biofilm.

              Using genomic and mass spectrometry-based proteomic methods, we evaluated gene expression, identified key activities, and examined partitioning of metabolic functions in a natural acid mine drainage (AMD) microbial biofilm community. We detected 2033 proteins from the five most abundant species in the biofilm, including 48% of the predicted proteins from the dominant biofilm organism, Leptospirillum group II. Proteins involved in protein refolding and response to oxidative stress appeared to be highly expressed, which suggests that damage to biomolecules is a key challenge for survival. We validated and estimated the relative abundance and cellular localization of 357 unique and 215 conserved novel proteins and determined that one abundant novel protein is a cytochrome central to iron oxidation and AMD formation.
                Bookmark

                Author and article information

                Journal
                Mol Cell Proteomics
                Mol. Cell Proteomics
                mcprot
                mcprot
                MCP
                Molecular & Cellular Proteomics : MCP
                The American Society for Biochemistry and Molecular Biology
                1535-9476
                1535-9484
                November 2016
                8 September 2016
                8 September 2016
                : 15
                : 11
                : 3501-3512
                Affiliations
                [1]From the ‡Dept. of Computer Science and Engineering, University of California, San Diego, La Jolla, California, 92093;
                [2]§Center for Computational Mass Spectrometry, University of California, San Diego, La Jolla, California, 92093;
                [3]¶Pacific Northwest National Laboratory, Richland, Washington 99354,
                [4]‖Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California, San Diego, La Jolla, California, 92093
                Author notes
                ** To whom correspondence should be addressed: Center for Computational Mass Spectrometry, Department of Computer Science and Engineering, University of California, San Diego, 9500 Gilman Drive, Mail Code 0404, La Jolla, CA 92093-0404. Tel.: 1-858-534-8666; E-mail: bandeira@ 123456ucsd.edu .
                Author information
                http://orcid.org/0000-0002-5159-2048
                http://orcid.org/0000-0002-8351-1994
                Article
                O116.060913
                10.1074/mcp.O116.060913
                5098046
                27609420
                9160c063-2e3a-4b00-ac5f-42a7ab3a69c0
                © 2016 by The American Society for Biochemistry and Molecular Biology, Inc.

                Author's Choice—Final version free via Creative Commons CC-BY license.

                History
                : 10 May 2016
                : 29 August 2016
                Funding
                Funded by: National Institutes of Health http://dx.doi.org/10.13039/100000002
                Award ID: GM103493
                Award ID: Grant 2 P41 GM103484–06A1
                Funded by: U.S. Department of Energy http://dx.doi.org/10.13039/100000015
                Award ID: Early Career Program
                Funded by: Alfred P. Sloan Foundation http://dx.doi.org/10.13039/100000879
                Award ID: Sloan Research Fellowship
                Categories
                Technological Innovation and Resources

                Molecular biology
                Molecular biology

                Comments

                Comment on this article