|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 5:172-181, 2006.
© 2006 by The American Society for Biochemistry and Molecular Biology, Inc.
,
,¶
,||,**




From the
Department of Pathology, Harvard Medical School and Childrens Hospital, Boston, Massachusetts 02115,
Department of Systems Biology, Harvard Medical School, Boston, Massachusetts 02115, || Neurobiology Program, Childrens Hospital, Boston, Massachusetts 02115, ** MDS Sciex, Concord, Ontario L4K 4V8, Canada, 
Cell Signaling Technology, Beverly, Massachusetts 01915, and 
MRC Protein Phosphorylation Unit, University of Dundee, Dundee, United Kingdom DD1 5EH
| ABSTRACT |
|---|
|
|
|---|
30% of all proteins in a cell are phosphorylated at any given time. However, this number is in stark contrast to the actual number of phosphorylation sites found so far. For instance, the Phospho.ELM database (phospho.elm.eu.org) currently lists 1703 experimentally verified phosphorylation sites for 556 different proteins derived from eukaryotes, the human protein reference database (www.hprd.org) lists 3652 reported phosphorylation sites on 1240 human proteins, and PhosphoSite (www.phosphosite.org) lists 6084 non-redundant phosphorylation sites on 2430 human and mouse proteins. However, a recent study by Beausoleil et al. (1) gave some idea about the expected number of phosphorylation in the cell when the authors performed a large scale phosphoproteomics study on HeLa cell nuclei. Although their method was biased against basic phosphopeptides where His, Lys-Pro, and/or Arg-Pro residues are in close proximity to the phosphoamino acid residue, more than 2000 phosphorylation sites on 967 nuclear proteins were found. Considering the fact that protein phosphorylation analysis is of major interest in numerous laboratories around the world it is surprising that more information about protein phosphorylation sites has not been gathered since the discovery of protein phosphorylation. This raises the question as to why it is still such a challenge to perform unbiased protein phosphorylation analysis. One inherent reason is the generally low phosphorylation stoichiometry of most of the proteins such that phosphopeptides are grossly underrepresented in the generated complex peptide mixtures (see below). In addition, numerous reasons can be listed for not being able to identify and localize a phosphorylation site in a given protein, which affects all stages of phosphorylation analysis, i.e. sample preparation, analysis, and data interpretation. Examples for problems during the sample preparation include e.g. omission of phosphatase inhibitor, overestimation of the degree of phosphorylation because of the lack of information about the phosphorylation stoichiometry when using phosphospecific antibodies, and/or underestimation of the heterogeneity of the phosphorylation due to nonspecific 32P-labeling.
However, even if it was ensured that a particular protein sample is phosphorylated by using other methods such as radiolabeling it is often the case that the mass spectrometric analysis does not provide any information about the sites of phosphorylation. Three main reasons are commonly used to explain as to why the phosphopeptides were missed during the mass spectrometric analysis: (i) increased hydrophilicity and hence reduced retention of phosphopeptides on reversed-phase materials, (ii) selective suppression of their ionization/detection efficiencies in the presence of large amounts of unphosphorylated peptides, and (iii) lower detection efficiencies of phosphopeptides as compared with their unphosphorylated cognates. In this study we tried to scrutinize these different arguments often brought forward with the aim to draw the attention to the most prevalent problem(s) of mass spectrometric phosphorylation analysis and to improve the analysis where it is most fruitful/promising.
| MATERIALS AND METHODS |
|---|
|
|
|---|
LC/MS Analysis
All electrospray ionization experiments were performed using a QSTAR XL hybrid mass spectrometer (AB/MDS Sciex) hyphenated with microscale capillary reversed-phase HPLC (Famos autosampler (LC Packings), Agilent 1100 HPLC pump (Agilent)). The columns were packed in-house using Magic C18 (5 µm, 200 Å, Michrom BioResources) beads. The buffer compositions are as follows: buffer A: 2.5% acetonitrile, 0.2% formic acid; buffer B: 2.5% H2O, 0.2% formic acid. For the quantitation experiments a 5-min gradient was used with mass spectra being acquired every 0.15 s. Data analysis and quantitation was done using the Analyst software package provided by Applied Biosystems/MDS Sciex.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
|
The described elution order can be observed in the absence of strong ion pairing reagents such as trifluoroacetic acid (TFA) as these agents mask the positive charges on peptides, whereas the negative charges on peptides caused by carboxylic groups are neutralized at acidic pH. This is contrasted by peptides with highly acidic functional groups such as phospho moieties for which one negative charge on the phosphate group remains even at pH 2 such that phosphopeptides elute before their unphosphorylated peptide counterparts when ion pairing reagent is present to neutralize the positive charges. Because strong ion pairing reagents such as TFA are often not required (and not desired) in LC/MS experiments the elution order of phosphopeptides and their unmodified cognates are easily modulated.
More systematic work is necessary to evaluate the effects of different stationary phases and buffer compositions on the elution order of peptides and their phosphorylated cognates. This notion is underscored by a recently published example of a doubly phosphorylated peptide and its partially phosphorylated/unphosphorylated cognates; in this example, which employed only formic acid without TFA in the LC buffers, the peptides eluted in the following order: monophosphopeptide A < diphosphopeptide < unmodified peptide < monophosphopeptide B (2). Such an elution order is hard to rationalize irrespective of the presence or absence of any ion pairing reagents. Nevertheless, the data presented clearly show that phosphorylation does not necessarily lead to an overall increase in hydrophilicity. This is of particular interest for tryptic peptides, which generate peptides with at least one basic amino acid residue (exception: C-terminal peptide) such that singly phosphorylated peptides should elute after the unphosphorylated cognate; this minimizes the risk of losing phosphopeptides during the loading of the reversed-phase column. Multiple phosphorylations, however, can indeed lead to decreased retention if the number of phosphorylation sites is not matched by the number of basic amino acid residues. However, because phosphorylation affects the proteolysis kinetics of cleavage sites proximal to the site of phosphorylation (3), it can be expected that multiply phosphorylated peptides will show an increased frequency of more than one basic amino acid residues, thus partially compensating this problem. Nevertheless, alternative proteolysis strategies should be considered (e.g. Lys-C) which (a) increase the average number of basic residues within the proteolytic peptides and/or (b) increase the length of the proteolytic peptides, which generally also increases the retention time, thus reducing the potential loss of phosphopeptides during the loading of the reversed-phase column. That said, the loss of phosphopeptides because of increased hydrophilicity can be a serious problem if nonspecific proteases are used generating small peptides without any basic amino acid residues. This notion was confirmed when a highly acidic and hydrophilic peptide/phosphopeptide pair with the sequence EDADS(pY)ENMD was analyzed by LC/MS; whereas the unmodified form of the peptide was retained on the C18 column, the phosphorylated cognate was detected in the flow through under standard LC/MS conditions.
"Phosphorylated species are selectively suppressed in the presence of unmodified peptides" is another often used general statement that is made without detailed examination of this phenomenon. To test the validity of this generalization for LC/MS, constant amounts of several synthetic phosphopeptides and their unphosphorylated cognates were mixed with increasing amounts of a tryptic BSA digest, ranging from equal amount to 100-fold excess. The ratios of the signal intensities of peptide versus phosphopeptide were calculated for each pair and are shown in Fig. 2A. It is evident from the almost horizontal curves that there is no significant change in the relative ionization/detection efficiencies with increasing "background" of unmodified peptides. To ensure that this observation is valid not only for this particular set of peptides/phosphopeptides and tryptic BSA digest, another set of six different phosphopeptide/peptide pairs was spiked into seven individual tryptic digest of numerous commercially available proteins: (ubiquitin (8.5 kDa), avidin (17 kDa),
-casein (25 kDa), ß-casein (25 kDa), carbonic anhydrase (29 kDa), alcohol dehydrogenase (40 kDa), ovalbumin (43 kDa)) and into a mixture thereof. This mixture simulated the digest of a
200 kDa protein. The normalized signal intensity ratios for the different peptide/phosphopeptide pairs present in those samples are shown in Fig. 2B. The horizontal trend of all curves again shows that the signal intensity ratios of peptide versus phosphorylated cognate did not significantly change, irrespective of the complexity of the digest, i.e. irrespective of the number of unmodified peptides co-eluting with the species of interest. This is exemplified in Fig. 3, AD, which display mass spectra at the peak elution of one of the synthetic test peptides (marked with an arrowhead) spiked into various tryptic digests; differences in the sample complexity at the time of elution are obvious. The findings in either of the two sets of experiments do not support the notion that selective suppression of phosphopeptides occurs when analyzed by electrospray ionization mass spectrometry in the presence of unmodified peptides.
|
|
"Phosphopeptides have lower ionization/detection efficiencies than their unphosphorylated counterpart" is one of the most commonly used arguments for not observing phosphopeptides in a particular sample, i.e. protein digests. This notion is based on the idea that the phosphate group of phosphorylated species is negatively charged, thereby affecting the ionization and detection efficiencies under acidic conditions used for mass spectrometric experiments in positive ion mode. To address this question, whether phosphopeptides indeed have lower detection/ionization efficiencies than their unphosphorylated cognates, the synthetic peptides and phosphopeptides used for the experiments described above were extensively purified by HPLC to homogeneity prior to amino acid analysis in replicate for quantitation. Stock solutions of all peptides were prepared using freshly calibrated pipettes to minimize the chances of pipetting errors, which would give rise to systematic deviations. Each peptide/phosphopeptide pair was mixed in at least two different defined ratios before performing repeated electrospray ionization mass spectrometric analyses. Microscale capillary HPLC was used to introduce the samples into the mass spectrometer. Magic C18 was used for the LC experiments because this reversed-phase material is a commonly used stationary phase in the field of proteomics. The buffers used were the same as described above. To determine the relative ionization/detection efficiencies for the peptide/phosphopeptide pairs, selected ion chromatograms were generated for each observed charge state of every peptide/phosphopeptide pair. Subsequently, the areas under the curves were calculated using the quantitation feature in Analyst QS. These integral values were then corrected for the defined concentration ratios (ranging from 4:1 to 1:4). The corrected ratios of those integrals reflect the ionization/detection efficiency ratio for the different peptide/phosphopeptide pairs. Throughout the article, the ionization/detection efficiency ratio for each peptide versus phosphopeptide is calculated such that a number >1 indicates that the unphosphorylated peptide has a better ionization/detection efficiency, whereas a ratio <1 indicates a better ionization/detection efficiencies for the phosphorylated form.
A quadrupole TOF mass spectrometer was used for the experiments. This type of mass spectrometer has a limited dynamic range. However, potential saturation-related problems are compensated for by the isotopic resolution provided by the instrument, which allowed choosing for the calculation of the ionization/detection efficiency ratios the signals of isotopes unaffected by saturation. The results of this set of experiments are listed in Table I, column 4. The peptides are grouped accounting for the number of basic amino acid residues and number of phosphorylation sites. The outcome of the experiments can be summarized as follows:
More phosphopeptides species show better ionization/detection efficiencies than their unphosphorylated cognates as compared with the reversed situation. This trend is more pronounced when the peptide contains more than one basic amino acid residue. Even the phosphopeptides with only one basic amino acid residue which resemble as such His-free tryptic peptides show relative ionization/detection efficiencies (peptide versus phosphopeptide) ranging from 0.3 to 1.7 Table I, Peptide species 15).
About 70% of the peptide pairs show relative ionization/detection efficiencies in the range of 0.52. However, it is not obvious (apart from the exceptions mentioned below) which peptide sequences give rise to ionization/detection efficiency differences >2 upon phosphorylation. This reflects the lack of general understanding of what determines the absolute ionization/detection efficiencies of peptides.
The ionization/detection efficiency ratio unphosphorylated peptide versus phosphopeptide increases with increasing the charge state. The extent of increase, however, varies significantly. The observed increases varied from 11% (SRARIG(pS)DPLAYEPK: 2+
3+ Table I, Peptide species 21 and 22)) to about 800% (KTQA(pS)QGTLQTR: 2+
3+ Table I, Peptide species 8 and 9)). One example is presented in Fig. 4, which shows charge states of 2+ to 4+ of the peptide GLGTRTG(pS)NLDRDKL Table I, Peptide species 1416; please note that the peptide/phosphopeptide were combined in a 2:1 mixture). A clear decrease of the phosphopeptide ion signal relative to the unphosphorylated form is apparent with increasing charge state.
|
It was observed that for the three highly basic peptides, the most highly charged species of the unmodified cognate had ionization/detection efficiencies 2 to 3 orders of magnitude better than the phosphorylated complement (peptide species 16, 20, and 23). One example is given in Fig. 4. The triply charged phosphopeptide GLGTRTGpSNLDRDKL Table I, Peptide species 16) is more efficiently ionized and detected than the unmodified cognate GLGTRTGSNLDRDKL. However, the quadruply charged unmodified species shows a vastly better ionization/detection efficiency as compared with its phosphorylated form, which is almost completely absent. A significant fraction of the molecules of these "exceptional" species show this very high charge state because of the overall proton affinity of the peptides. As soon as the proton affinity is reduced by e.g. phosphorylation, harboring four protons becomes energetically unfavorable, and the ion signal intensity of this charge state becomes negligible. The rather large standard deviations in these exceptional cases result from the limited dynamic range of the instrument used for the study.
The findings in this part of the study clearly show a general statement, stating that phosphopeptides show lower ionization/detection efficiencies than their unphosphorylated cognates, does not apply to electrospray ionization mass spectrometry as the majority of the phosphopeptides tested actually show better ionization/detection efficiencies than their unphosphorylated cognates. Similar ionization/detection efficiencies were also found for tryptic-like peptides, i.e. those with only one basic amino acid residue. This finding is supported by other studies from our own and other groups, which determined similar ionization/detection efficiencies for truly tryptic peptide/phosphopeptide pairs, i.e. peptides that were generated by tryptic digestion of proteins (46); all these studies report similar ionization/detection efficiencies for tryptic peptides and their unphosphorylated cognates.
This raises the questions (or highlights the lack of knowledge about) what determines the ionization/detection efficiencies of peptides. Some preliminary studies have been published investigating the effect of different physicochemical properties of peptides on the ionization/detection efficiencies during electrospray ionization. These studies highlighted a positive correlation between the hydrophobicities and the ionization/detection efficiencies of peptides as well as between pI values and ionization/detection efficiencies (e.g. Refs. 79). However, when following their line of argument that the phosphorylated form of a particular peptide should have lower ionization/detection efficiencies than their unmodified cognate because both the hydrophobicity and pI are thought to decrease with the covalent attachment of a phosphate group. Although the pI of a peptide is certainly reduced upon phosphorylation, the effect of phosphorylation on the hydrophobicity is not as clear. As described above, some kind of charge compensation/internal salt bridges/zwitterion formation might actually decrease the hydrophilicity of basic amino acid-containing peptides in diluted formic acid/acetonitrile/water solutions upon phosphorylation. However, a more detailed understanding of the determinants of the ionization/detection efficiencies of peptides still has to be established. Such a study must include the composition of the peptides as well as instrumental parameters and the composition of the spray solution. The importance of the latter was underscored by a phosphorylation analysis on the yeast transcription factor Pho4 performed in our laboratory. The same sample was analyzed twice. The following parameters were varied in the two LC/MS experiments: (i) a different electrospray emitter was used, and (ii) the LC buffer system was changed from water/acetonitrile/0.4% acetic acid/0.005% heptafluorobutyric acid to water/acetonitrile/0.2% formic acid. When observing the signal intensities of the ions corresponding to the phosphopeptide TSSSAEGVVVASE(pS)PVIAPHGSTHAR and its unmodified cognate it was noted that the triply charged species show almost identical intensity ratios for both sets of experimental conditions, whereas the signal intensity ratio of the quadruply charged peptide/phosphopeptide is significantly different, changing from 1.2 to 0.33 (Fig. 5). It should be noted that measuring relative ionization/detection efficiencies of peptides and their phosphorylated cognates is not only of educational importance in the context of this study but can be used for stable-isotope-free quantitation of protein phosphorylation stoichiometries as we have recently shown (4). Once such an ionization/detection efficiency ratio has been determined the degree of phosphorylation can easily be calculated based on the ion signal intensity ratio corrected by the relative ionization/detection efficiencies. Because there is a lack of information regarding the factors influencing ionization/detection efficiencies, the ratios have to be determined empirically. However, it is foreseeable that this property can be theoretically estimated in the future once a better understanding of ionization/detection efficiencies has been provided. This would greatly simplify the task of quantitation of protein phosphorylation stoichiometries.
|
By testing how hydrophilicity of peptides is altered upon phosphorylation, compelling evidence is provided that calculating nominal hydrophilicities by simply adding up hydrophilicity coefficients for each amino acid side chain is a suboptimal descriptor for the expected elution order of peptides and their phosphorylated cognates. Although phosphorylation clearly increases the nominal hydrophilicity of peptides with respect to their unphosphorylated cognate, this effect is overcompensated under certain LC conditions if the overall net charge of the peptide is decreased by the addition of phosphate groups. In other words, conditions can be used under which phosphopeptides are better retained on the reversed-phase column as long as the number of phosphorylation sites does not exceed the number of basic amino acid residues (Arg, His, and Lys) within the peptides. Under our chosen conditions the risk of losing singly phosphorylated tryptic peptides caused by reduced retention can be minimized. However, retention problems associated with increased hydrophilicity can still be problematic for multiply phosphorylated tryptic peptides without additional basic amino acid residues or phosphorylated peptides without basic residues.
The issue of selective ionization suppression of phosphopeptides in the presence of unphosphorylated peptides was tested a) by mixing increasing amounts of a standard protein digest with constant amounts of various peptide/phosphopeptide pairs and b) by mixing different protein digests with numerous peptide/phosphopeptide pairs. None of the data in our test series supported the notion of selective ionization suppression of phosphopeptides in the presence of unphosphorylated species; instead the signal intensity ratio peptide/phosphopeptide and the total ion signal even remained constant within the experimental error over the concentration and complexity range tested.
Similarly, the common belief that phosphopeptides show lower ionization/detection efficiencies than their unphosphorylated cognates could not be substantiated as a general fact. When testing the peptide/phosphopeptide pairs by using online LC electrospray ionization MS, the majority of the phosphopeptides tested actually showed better ionization/detection efficiencies than their unmodified cognates and even the tryptic-like peptides showed similar ionization/detection efficiencies corroborating earlier reports (46). This indicates that LC combined with electrospray ionization mass spectrometry is a good choice for routine phosphorylation analyses.
If several of the most commonly used arguments for not observing phosphorylation sites are questionable in light of this study, this raises the question as to why so many phosphopeptides are not observed. There are numerous reasons that can be grouped into biological and technical ones:
The human genome encodes not only 518 protein kinases, but also up to 65 protein phosphatases that can dephosphorylate proteins rapidly with high specificity (1012) such that proteins phosphorylated in e.g. a signal transduction cascade in response to an extracellular stimuli are usually dephosphorylated rapidly back to the resting state once the stimuli has been removed (13). Also only a subset of the cellular complement of a particular protein may be involved in a particular signaling pathway, and hence only a small fraction of the total cellular pool of that protein may be phosphorylated at any one time. So in the preparation of phosphoproteins from cell lysates by immunoprecipitation or organelle isolation, the majority of the isolated protein is not likely to be stoichiometrically phosphorylated, and the subsequent proteolytic digest will then be dominated by unmodified peptides. These reasons underscore the notion that most of the phosphorylation sites are substoichiometrically modified.
This substoichiometric nature of protein modification highlights one of the limitations of automated, data-dependent MS/MS routines to select precursors for fragmentation based on ion signal intensity. In the situation where the stoichiometry of phosphorylation at any given site is <10%, the ion intensity of the unphosphorylated peptide will always dominate the survey mass spectrum, and hence it is these precursors that will be automatically selected for MS/MS and not the lower abundance phosphopeptides. This applies especially to highly complex peptide mixtures in which the number of peptides species eluting off the column exceeds the number of peptides that can be sequenced in the same time period.
Thus, to improve the detection of phosphopeptides from proteolytic digests either phosphoprotein enrichment, phosphopeptide enrichment, or phosphopeptide selective mass spectrometric detection methods must be employed. This selective enrichment of phosphoproteins can involve the use of metal chelate affinity of phosphoproteins prior to digestion (14) or phosphospecific antibody enrichment of phosphoproteins, such as those phosphorylated on tyrosine residues (15). Affinity purification of phosphopeptides from proteolytic digests can be performed by immunoaffinity methods (16), metal chelate affinity chromatography (1720), ion exchange fractionation (1), or a combination thereof (21) prior to reverse phase LC-MS. The result of this enrichment of phosphoproteins and/or phosphopeptides is that the proteolytic digest analyzed by LC/MS is highly simplified, and phosphopeptide ions are more likely to be selected for MS/MS fragmentation because they now represent the more abundant peptide ions. Alternatively, mass spectrometric scanning features for the selective detection of phosphorylated species on a triple quadrupole or hybrid triple quadrupole linear ion trap mass spectrometer such as precursor ion scanning (22, 23) or neutral loss scanning can detect the generation of either PO3 ions or the neutral loss of H3PO4 from collision induced dissociation of phosphorylated precursor ions (24). Once these ions are detected they can be either automatically selected for MS/MS, or if necessary, manually assigned for MS/MS on a subsequent LC/MS analysis.
Furthermore, improved sequence coverage upon proteolysis of the proteins of interest is of pivotal importance to identify and locate the protein phosphorylation sites. For protein identification purposes any set of peptide suffices, whereas for more comprehensive protein characterization which includes phosphorylation analysis, significantly improved sequence coverage is necessary. This means that more effort should be directed toward (a) miniaturizing and minimizing of the sample handling to reduce losses (25) and (b) the analysis of smaller and/or larger proteolytic peptides, i.e. those outside the normally observed m/z range. Alternatively, in solution protein separation, handling and data interpretation tools should be improved such that FTICR-based top-down analyses of proteins become more routine. Top-down analyses by definition provide 100% sequence coverage. If following this approach the dynamic range of the mass spectrometer becomes very important to be able to analyze protein isoforms of lower abundance in the presence of high abundance proteins.
Another obstacle for the analysis of phosphopeptides and the exact localization of the phosphorylation site is the facile loss of phosphoric acid that is commonly observed for serine- and threonine-phosphorylated species. This facile loss leads to information-poor MS/MS spectra dominated by the neutral loss of phosphoric acid. Hence, some phosphopeptides might have been picked during data-dependent acquisition routines, but the following database search did not provide any meaningful hit or the unambiguous localization of the modification site was not possible. However, this limitation can be partially compensated for by using automated routines that trigger MS/MS/MS experiments if a significant neutral loss of 98/z (z being the charge state of the selected precursor ion) is observed (1); nevertheless, such an approach still requires that the modified peptides has been selected for fragmentation based on its intensity (see above).
In conclusion, the data presented above suggest that phosphopeptides are difficult to identify in phosphoprotein digests because of the substoichiometric nature of this modification and not because they have poor ionization efficiencies. Thus, most of the technical limitations discussed above would profit from mass spectrometers with improved detection limits and dynamic ranges and the development of more robust and reliable selective detection and enrichment procedures.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published, MCP Papers in Press, October 4, 2005, DOI 10.1074/mcp.M500135-MCP200
1 The abbreviation used is: XIC, selected ion chromatogram. ![]()
* The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
¶ To whom correspondence should be addressed: Childrens Hospital Boston, Department of Pathology/Enders 1130, 300 Longwood Ave., Boston, MA 02115. Tel.: 617-919-2629; E-mail: hanno.steen{at}childrens.harvard.edu
| REFERENCES |
|---|
|
|
|---|
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Journal of Biological Chemistry |
| Journal of Lipid Research | ASBMB Today |