Record: found
Abstract: found
Article: not found

A statistical model for identifying proteins by tandem mass spectrometry.

Author(s): Alexey I. Nesvizhskii, Andrew Keller, Eugene Kolker, Ruedi Aebersold

Publication date: 2003-09-01

Keywords: Amino Acid Sequence, Humans, Mass Spectrometry, methods, Models, Statistical, Molecular Sequence Data, Peptides, analysis, chemistry, Proteins

Read this article at

ScienceOpenPubMed

There is no author summary for this article yet. Authors can add summaries to their articles on ScienceOpen to make them more accessible to a non-specialist audience.

Abstract

A statistical model is presented for computing probabilities that proteins are present in a sample on the basis of peptides assigned to tandem mass (MS/MS) spectra acquired from a proteolytic digest of the sample. Peptides that correspond to more than a single protein in the sequence database are apportioned among all corresponding proteins, and a minimal protein list sufficient to account for the observed peptide assignments is derived using the expectation-maximization algorithm. Using peptide assignments to spectra generated from a sample of 18 purified proteins, as well as complex H. influenzae and Halobacterium samples, the model is shown to produce probabilities that are accurate and have high power to discriminate correct from incorrect protein identifications. This method allows filtering of large-scale proteomics data sets with predictable sensitivity and false positive identification error rates. Fast, consistent, and transparent, it provides a standard for publishing large-scale protein identification data sets in the literature and for comparing the results obtained from different experiments.

Related collections

Author and article information

Journal

PubMed ID:: 14632076

ScienceOpen disciplines: Chemistry

Keywords: Amino Acid Sequence,Humans,Mass Spectrometry,methods,Models, Statistical,Molecular Sequence Data,Peptides,analysis,chemistry,Proteins

Data availability:

ScienceOpen disciplines: Chemistry

Keywords: Amino Acid Sequence, Humans, Mass Spectrometry, methods, Models, Statistical, Molecular Sequence Data, Peptides, analysis, chemistry, Proteins

A statistical model for identifying proteins by tandem mass spectrometry.

Read this article at

Abstract

Related collections

EPA CompTox Chemicals Dashboard

Author and article information

Journal

Comments

Comment on this article

Similar content 148

Cited by 1,340