MCP Thermo Scientific TMT Isobaric Mass Tagging Kits
HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
 QUICK SEARCH:   [advanced]


     


A more recent version of this article appeared on June 1, 2008.
This Article
Right arrow Full Text (PDF)
Right arrow Supplemental Data
Right arrow All Versions of this Article:
M700239-MCP200v1
7/6/1135    most recent
Right arrow Submit a response
Right arrow Alert me when this article is cited
Right arrow Alert me when eLetters are posted
Right arrow Alert me if a correction is posted
Services
Right arrow Email this article to a friend
Right arrow Similar articles in this journal
Right arrow Similar articles in PubMed
Right arrow Alert me to new issues of the journal
Right arrow Download to citation manager
Right arrow Glossary
Right arrow reprints & permissions
Citing Articles
Right arrow Citing Articles via Google Scholar
Google Scholar
Right arrow Articles by Martínez-Bartolomé, S.
Right arrow Articles by Vázquez, J.
Right arrow Search for Related Content
PubMed
Right arrow PubMed Citation
Right arrow Articles by Martínez-Bartolomé, S.
Right arrow Articles by Vázquez, J.
Social Bookmarking
 Add to CiteULike   Add to Complore   Add to Connotea   Add to Del.icio.us   Add to Digg   Add to Reddit   Add to Technorati  
What's this?

Submitted on May 23, 2007
Revised on February 19, 2008
Accepted on February 25, 2008

Properties of average score distributions of SEQUEST: the Probability Ratio method

Salvador Martínez-Bartolomé, Pedro Navarro, Fernando Martín-Maroto, Daniel López-Ferrer, Antonio Ramos-Fernández, Margarita Villar, Josefa P. García-Ruiz, and Jesús Vázquez

Centro de Biología Molecular Severo Ochoa, Cantoblanco, Madrid 28049

Corresponding Author: jvazquez{at}cbm.uam.es

High-throughput identification of peptides in databases from tandem mass spectrometry data is a key technique in modern Proteomics. Common approaches to interpret large-scale peptide identification results are based on the statistical analysis of average score distributions, which are constructed from the set of best scores produced by large collections of MS/MS spectra by using searching engines such as SEQUEST. Other approaches calculate individual peptide identification probabilities on the basis of theoretical models or from single-spectrum score distributions constructed by the set of scores produced by each MS/MS spectrum. In this work, we study the mathematical properties of average SEQUEST score distributions by introducing the concept of spectrum quality and expressing these average distributions as compositions of single-spectrum distributions. We predict and demonstrate in the practice that average score distributions are dominated by the quality distribution in the spectra collection, except in the low probability region, where it is possible to predict the dependence of average probability on database size. Our analysis leads to a novel indicator, the probability ratio, which takes optimally into account the statistical information provided by the first and second best scores. We also demonstrate that the probability ratio is a robust indicator that allows a peptide identification performance at least better, on the basis of error rates, than that obtained by more sophisticated empirical algorithms. The probability ratio also compares favorably with statistical probability indicators obtained by the construction of single-spectrum SEQUEST score distributions by extensive computation. These results make the conceptual simplicity and ease of automation of the probability ratio algorithm a very attractive alternative to determine peptide identification confidences and error rates in high-throughput experiments.


Add to CiteULike CiteULike   Add to Complore Complore   Add to Connotea Connotea   Add to Del.icio.us Del.icio.us   Add to Digg Digg   Add to Reddit Reddit   Add to Technorati Technorati    What's this?





HOME HELP FEEDBACK SUBSCRIPTIONS ARCHIVE SEARCH
 All ASBMB Journals   Journal of Biological Chemistry 
 Journal of Lipid Research   ASBMB Today 
Copyright © 2008 by the American Society for Biochemistry and Molecular Biology.