|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Submitted on July 27, 2005
Biological Chemistry, University of Michigan, Ann Arbor, MI 48109
Corresponding Author: pulintz{at}umich.edu
Manual analysis of mass spectrometry data is a current bottleneck in high-throughput proteomics. In particular, the need to manually validate the results of mass spectrometry database searching algorithms can be prohibitively time-consuming. Development of software tools that attempt to quantify the confidence in the assignment of a protein or peptide identity to a mass spectrum is an area of active interest. We seek to extend work in this area by investigating the potential of recent machine learning algorithms to improve the accuracy of these approaches, and as a flexible framework for accommodating new data features. Specifically, we demonstrate the ability of boosting and random forest approaches to improve the discrimination of true hits from false positive identifications in the results of mass spectrometry database search engines, compared to thresholding and other machine learning approaches. We accommodate additional attributes obtainable from database search results, including a factor addressing proton mobility. Performance is evaluated using publically available electrospray data and a new collection of MALDI data generated from purified human reference proteins.
Revised on November 10, 2005
Accepted on November 30, 2005
Improved classification of mass spectrometry database search results using newer machine learning approaches
![]()
CiteULike
Complore
Connotea
Del.icio.us
Digg
Reddit
Technorati What's this?
This article has been cited by other articles:
![]() |
M. Brosch, S. Swamy, T. Hubbard, and J. Choudhary Comparison of Mascot and X!Tandem Performance for Low and High Accuracy Mass Spectrometry and the Development of an Adjusted Mascot Threshold Mol. Cell. Proteomics, May 1, 2008; 7(5): 962 - 970. [Abstract] [Full Text] [PDF] |
||||
![]() |
D. A. Stead, N. W. Paton, P. Missier, S. M. Embury, C. Hedeler, B. Jin, A. J. P. Brown, and A. Preece Information quality in proteomics Brief Bioinform, March 1, 2008; 9(2): 174 - 188. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. D. Jaffe, D. R. Mani, K. C. Leptos, G. M. Church, M. A. Gillette, and S. A. Carr PEPPeR, a Platform for Experimental Proteomic Pattern Recognition Mol. Cell. Proteomics, October 1, 2006; 5(10): 1927 - 1941. [Abstract] [Full Text] [PDF] |
||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH |
| All ASBMB Journals | Journal of Biological Chemistry |
| Journal of Lipid Research | ASBMB Today |