A more recent version of this article appeared on February 1, 2002.
Submitted on August 7, 2001
Revised on November 13, 2001
Accepted on December 12, 2001
Getting more from less: algorithms for rapid protein identification with multiple short peptide sequences
Aaron J. Mackey, Timothy A.J. Haystead, and William R. Pearson
Biochemistry and Molecular Genetics, University of Virginia, Charlottesville, VA 22908
Corresponding Author: wrp{at}virginia.edu
We describe two novel sequence similarity search algorithms, FASTS and FASTF, that use multiple short peptide sequences to identify homologous sequences in protein or DNA databases. FASTS searches with peptide sequences of unknown order, as obtained by mass spectrometry-based sequencing, evaluating all possible arrangements of the peptides. FASTF searches with mixed peptide sequences, as generated by Edman sequencing of unseparated mixtures of peptides. FASTF deconvolutes the mixture, using a greedy heuristic that allows rapid identification of high scoring alignments while reducing the total number of explored alternatives. Both algorithms use the heuristic FASTA comparison strategy to accelerate the search, but use alignment probability, rather than similarity score, as the criterion for alignment optimality. Statistical estimates are calculated using an empirical correction to a theoretical probability. These calculated estimates were accurate within a factor of 10 for FASTS and 1000 for FASTF on our test dataset. FASTS requires only 1520 total residues in three or four peptides to robustly identify homologues sharing 50% or greater protein sequence identity. FASTF requires about 25% more sequence data than FASTS for equivalent sensitivity, but additional sequence data is usually available from mixed-Edman experiments. Thus, both algorithms can identify homologues that diverged 100 to 500 million years ago, allowing proteomic identification from organisms whose genomes have not been sequenced.

CiteULike Complore Connotea Del.icio.us Digg Reddit Technorati What's this?
This article has been cited by other articles:

|
 |

|
 |
 
I. J. Tetlow, K. G. Beisel, S. Cameron, A. Makhmoudova, F. Liu, N. S. Bresolin, R. Wait, M. K. Morell, and M. J. Emes
Analysis of Protein Complexes in Wheat Amyloplasts Reveals Functional Interactions among Starch Biosynthetic Enzymes
Plant Physiology,
April 1, 2008;
146(4):
1878 - 1891.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
L. K. Iwai, M. Yoshida, A. Sadahiro, W. R. da Silva, M. L. Marin, A. C. Goldberg, M. A. Juliano, L. Juliano, M. A. Shikanai-Yasuda, J. Kalil, et al.
T-Cell Recognition of Paracoccidioides brasiliensis gp43-Derived Peptides in Patients with Paracoccidioidomycosis and Healthy Individuals
Clin. Vaccine Immunol.,
April 1, 2007;
14(4):
474 - 476.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. Sandra, P. Dolashka-Angelova, B. Devreese, and J. Van Beeumen
New insights in Rapana venosa hemocyanin N-glycosylation resulting from on-line mass spectrometric analyses
Glycobiology,
February 1, 2007;
17(2):
141 - 156.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
Q. Luo, E. Nieves, J. Kzhyshkowska, and R. H. Angeletti
Endogenous Transforming Growth Factor-{beta} Receptor-mediated Smad Signaling Complexes Analyzed by Mass Spectrometry
Mol. Cell. Proteomics,
July 1, 2006;
5(7):
1245 - 1260.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
K. I. Orsborn, L. F. Shubitz, T. Peng, E. M. Kellner, M. J. Orbach, P. A. Haynes, and J. N. Galgiani
Protein Expression Profiling of Coccidioides posadasii by Two-Dimensional Differential In-Gel Electrophoresis and Evaluation of a Newly Recognized Peroxisomal Matrix Protein as a Recombinant Vaccine Candidate
Infect. Immun.,
March 1, 2006;
74(3):
1865 - 1872.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. E. Kremer, T. Haystead, and I. G. Macara
Mammalian Septins Regulate Microtubule Stability through Interaction with the Microtubule-binding Protein MAP4
Mol. Biol. Cell,
October 1, 2005;
16(10):
4648 - 4659.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. D. Halligan, V. Ruotti, S. N. Twigger, and A. S. Greene
DeNovoID: a web-based tool for identifying peptides from sequence and mass tags deduced from de novo peptide sequencing by mass spectroscopy
Nucleic Acids Res.,
July 1, 2005;
33(suppl_2):
W376 - W381.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
D. Vergote, P.-E. Sautiere, F. Vandenbulcke, D. Vieau, G. Mitta, E. R. Macagno, and M. Salzet
Up-regulation of Neurohemerythrin Expression in the Central Nervous System of the Medicinal Leech, Hirudo medicinalis, following Septic Injury
J. Biol. Chem.,
October 15, 2004;
279(42):
43828 - 43837.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
S. M. Baxter, J. S. Rosenblum, S. Knutson, M. R. Nelson, J. S. Montimurro, J. A. Di Gennaro, J. A. Speir, J. J. Burbaum, and J. S. Fetrow
Synergistic Computational and Experimental Proteomics Approaches for More Accurate Detection of Active Serine Hydrolases in Yeast
Mol. Cell. Proteomics,
March 1, 2004;
3(3):
209 - 225.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
B. Habermann, J. Oegema, S. Sunyaev, and A. Shevchenko
The Power and the Limitations of Cross-Species Protein Identification by Mass Spectrometry-driven Sequence Similarity Searches
Mol. Cell. Proteomics,
March 1, 2004;
3(3):
238 - 249.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
I. J. Tetlow, R. Wait, Z. Lu, R. Akkasaeng, C. G. Bowsher, S. Esposito, B. Kosar-Hashemi, M. K. Morell, and M. J. Emes
Protein Phosphorylation in Amyloplasts Regulates Starch Branching Enzyme Activity and Protein-Protein Interactions
PLANT CELL,
March 1, 2004;
16(3):
694 - 708.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
E. E. Corcoran, J. D. Joseph, J. A. MacDonald, C. D. Kane, T. A. J. Haystead, and A. R. Means
Proteomic Analysis of Calcium/Calmodulin-dependent Protein Kinase I and IV in Vitro Substrates Reveals Distinct Catalytic Preferences
J. Biol. Chem.,
March 14, 2003;
278(12):
10516 - 10522.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. R. Graves and T. A.J. Haystead
A Functional Proteomics Approach to Signal Transduction
Recent Prog. Horm. Res.,
January 1, 2003;
58(1):
1 - 24.
[Abstract]
[Full Text]
[PDF]
|
 |
|

|
 |

|
 |
 
P. R. Graves, J. J. Kwiek, P. Fadden, R. Ray, K. Hardeman, A. M. Coley, M. Foley, and T. A. J. Haystead
Discovery of Novel Targets of Quinoline Drugs in the Human Purine Binding Proteome
Mol. Pharmacol.,
December 1, 2002;
62(6):
1364 - 1372.
[Abstract]
[Full Text]
[PDF]
|
 |
|
Copyright © 2001 by the American Society for Biochemistry and Molecular Biology.
|
Advertisement
Advertisement
|