Probability-based protein identification by searching sequence databases using mass spectrometry data

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

Several algorithms have been described in the literature for protein identification by searching a sequence database using mass spectrometry data. In some approaches, the experimental data are peptide molecular weights from the digestion of a protein by an enzyme. Other approaches use tandem mass spectrometry (MS/MS) data from one or more peptides. Still others combine mass data with amino acid sequence data. We present results from a new computer program, Mascot, which integrates all three types of search. The scoring algorithm is probability based, which has a number of advantages: (i) A simple rule can be used to judge whether a result is significant or not. This is particularly useful in guarding against false positives. (ii) Scores can be compared with those from other types of search, such as sequence homology. (iii) Search parameters can be readily optimised by iteration. The strengths and limitations of probability-based scoring are discussed, particularly in the context of high throughput, fully automated protein identification.

Related collections

Author and article information

Journal

Title: Electrophoresis

Abbreviated Title: Electrophoresis

Publisher: Wiley

ISSN (Print): 0173-0835

ISSN (Electronic): 1522-2683

Publication date Created: December 01 1999

Publication date (Print): December 01 1999

Volume: 20

Issue: 18

Pages: 3551-3567

Article

DOI: 10.1002/(SICI)1522-2683(19991201)20:18<3551::AID-ELPS3551>3.0.CO;2-2

SO-VID: 2cde7c90-73cb-49f2-958e-4a64e8fbc4f1

License:

http://doi.wiley.com/10.1002/tdm_license_1.1

History

Data availability:

Comments

Comment on this article

scite_

Cited by 1,590

See all cited by