14
views
0
recommends
+1 Recommend
0 collections
    0
    shares
      • Record: found
      • Abstract: found
      • Article: not found

      Fast open modification spectral library searching through approximate nearest neighbor indexing

      research-article

      Read this article at

      ScienceOpenPublisherPMC
          There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

          Abstract

          Open modification searching (OMS) is a powerful search strategy that identifies peptides carrying any type of modification by allowing a modified spectrum to match against its unmodified variant by using a very wide precursor mass window. A drawback of this strategy, however, is that it leads to a large increase in search time. Although performing an open search can be done using existing spectral library search engines by simply setting a wide precursor mass window, none of these tools have been optimized for OMS, leading to excessive runtimes and suboptimal identification results.

          Here we present the ANN-SoLo tool for fast and accurate open spectral library searching. ANN-SoLo uses approximate nearest neighbor indexing to speed up OMS by selecting only a limited number of the most relevant library spectra to compare to an unknown query spectrum. This approach is combined with a cascade search strategy to maximize the number of identified unmodified and modified spectra while strictly controlling the false discovery rate, as well as a shifted dot product score to sensitively match modified spectra to their unmodified counterparts.

          ANN-SoLo achieves state-of-the-art performance in terms of speed and the number of identifications. On a previously published human cell line data set, ANN-SoLo confidently identifies more spectra than SpectraST or MSFragger and achieves a speedup of an order of magnitude compared to SpectraST.

          ANN-SoLo is implemented in Python and C++. It is freely available under the Apache 2.0 license at https://github.com/bittremieux/ANN-SoLo.

          Related collections

          Author and article information

          Journal
          101128775
          30137
          J Proteome Res
          J. Proteome Res.
          Journal of proteome research
          1535-3893
          1535-3907
          9 September 2018
          13 September 2018
          05 October 2018
          05 October 2019
          : 17
          : 10
          : 3463-3474
          Affiliations
          [1 ]Department of Mathematics and Computer Science, University of Antwerp, 2020 Antwerp, Belgium;
          [2 ]Biomedical Informatics Network Antwerpen (biomina), 2020 Antwerp, Belgium;
          [3 ]Department of Genome Sciences, University of Washington, Seattle WA 98195, USA;
          [4 ]Department of Computer Science and Engineering, University of Washington, Seattle WA 98195, USA
          Author notes
          Article
          PMC6173621 PMC6173621 6173621 nihpa988286
          10.1021/acs.jproteome.8b00359
          6173621
          30184435
          03aad4ce-397f-4125-b681-dbf37a3aae5e
          History
          Categories
          Article

          Comments

          Comment on this article