|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 5:1610-1627, 2006.
© 2006 by The American Society for Biochemistry and Molecular Biology, Inc.
,
,
,**
From the
Barnett Institute, Northeastern University, Boston, Massachusetts 01225, ¶ Laboratory of Pathology, NCI, National Institutes of Health, Bethesda, Maryland 20892, and || Center for Applied Proteomics and Molecular Medicine, George Mason University, Manassas, Virginia 20110
| ABSTRACT |
|---|
|
|
|---|
Previous studies of post-translational modifications of EGFR, which have focused mainly on the phosphorylation in the cytoplasmic tail, have indicated that phosphorylation events are transient, rapidly fluctuating over time (2). Indeed the temporal fluctuation of EGFR is thought to be the major mechanism by which a small family of receptors can influence a wide diversity of cellular functions, such as regulation, differentiation, and even apoptosis (3, 4). Importantly the spatial and temporal fluctuations of phosphorylation of EGFR could be causal determinants in diseases such as cancer (5) as these autophosphorylation sites, as noted, are the initiation points for various signaling pathways that control the above cellular functions (6). Thus, it is important to identify patterns of binding to the epidermal growth factor (EGF) receptor that are the first steps in signaling. Another factor known to affect phosphorylation and the resultant intracellular protein binding is sequence mutation in the cytoplasmic tail. Indeed an important therapeutic agent for lung cancer is only successful for that level of the population that has a specific mutation (79).
Kinase-driven phosphorylation is only one of several classes of functionally important post-translational modifications. The extracellular domains of EGFR and other receptors are highly glycosylated. Although much less studied, because of their complexity, glycosylation is significant in the control of ligand binding and signaling. For example, elimination of glycosylation at Asn420 and Asn579 on EGFR has been shown to alter ligand binding to the receptor as well as to affect membrane receptor dimerization and subsequent phosphorylation (10, 11). Although the study of glycosylation patterns and their role in signaling is a relatively new area of exploration, it is clear that such study will be a necessary component in the characterization of tyrosine kinase receptors.
To elucidate the molecular basis of signal transduction, i.e. interaction partners and the kinetics of post-translational modifications, new technologies are needed for global characterization of signaling network components. Moreover to extend this analysis ultimately to in vivo systems and clinical material, such as tissue, such technologies must be sufficiently sensitive for analysis at the trace level. The ideal technology would provide a suite of receptor information including high sequence coverage, post-translational analysis, and quantitation of various phospho- and glyco-isoforms and identification of its binding partners. Although methodologies have been implemented for portions of the above, a full analysis has heretofore not been developed. It was the purpose of this study to demonstrate a methodology, Extended Range Proteomic Analysis (ERPA), that can be utilized for comprehensive characterization of the above components of signaling. In the present study we applied this technology to analyze comprehensively EGFR/ERBB1 signal transduction with a particular emphasis on kinetic analysis that captures the critical dynamic nature of cellular signaling.
With respect to protein characterization, high sequence coverage has been a focus of top-down proteomics using FTMS (12). Although some impressive results have been obtained, the method is relatively insensitive (pmol or higher), is not readily useable above protein sizes of 50 kDa, has not been applied to date for glycosylation analysis, and requires relatively pure proteins for analysis. With respect to phosphorylation site identification, LC-MS approaches using proteolytic products of proteins, often obtained by immunoprecipitation, have recently been actively pursued (13). Because of the low stoichiometry of phosphorylation, enrichment approaches using IMAC (14), titanium oxide (15), ion exchange (16), or phosphotyrosine antibodies (17) are often used. Although these approaches represent a means for low level phosphorylation identification, a concern is the bias induced by the enrichment approach (18, 19) and the neglect of the glycosylated peptides. With regard to glycosylation, a typical strategy has often involved the enrichment of the glycopeptides from a proteolytic digest (20) or the affinity capture and separation of oligosaccharides released after glycosidase treatment (21) Again the phosphorylated peptides and non-glycosylated peptides are often neglected by these approaches. To overcome the limits of these approaches, we describe a comprehensive strategy to discover, map, and quantitatively characterize protein phosphorylation sites, glycosylation sites, and protein binding partners.
In a previous study, we introduced the on-line LC-MS method ERPA for the analysis of post-translationally modified EGFR (22) in which the determination of phosphorylation and glycosylation in one experiment was demonstrated. This intermediate approach between bottom-up (23) and top-down (24) proteomics uses proteolytic enzymes such as Lys-C (C-terminal Lys) to cut proteins less frequently than trypsin (C-terminal Arg and Lys). As a consequence, the molecular weight distribution of peptide fragments is generally wider than with tryptic digests, leading to larger fragments and simpler mixtures relative to trypsin (average of 23 times larger in size with 23 times fewer peptide fragments). Using a hybrid LTQ-FT mass spectrometer and a protocol of MS2 followed by MS3 of the most intense MS2 fragment, high sequence coverage (
95%) as well as the sites and structures of phosphorylation and glycosylation were obtained at the hundreds of fmol to 1 pmol level. In this approach, enzymatically digested fragments up to 10 kDa could be analyzed in which the mass resolution of the FT (100,000) allowed determination of the high charge state of the large peptide ion. Importantly a 1050-fold increase in signal intensity of post-translationally modified Lys-C fragments (relative to tryptic fragments) was observed as a consequence of additional basic amino acid residues in the peptide backbone as well as an increase in chromatographic retention. In a separate study, we achieved high sequence coverage for EGFR with an order of magnitude decrease in detection limits (low fmol) by using a narrow bore monolithic columns (20-µm inner diameter).2
In addition to comprehensive structural characterization, the ERPA approach can significantly enhance LC-MS as a quantitative method, an area of current active interest (26, 27). Many approaches for quantitation involve the use of stable isotopes either incorporated metabolically (28, 29) or chemically (3032). Recently label-free methods of quantitation using LC-MS have been introduced (3336). Because one of our ultimate goals is to conduct quantitative proteomics in tissue, we have focused on label-free approaches. As a result of the high sequence coverage obtained by the ERPA method, normalization for the amount of EGFR present in each time point of stimulation using non-post-translationally modified peptides of EGFR or intrinsic internal standards (i.e. non-phosphorylated counterparts) can be readily achieved. Thus, in addition to the ability to quantify accurately the level of phosphorylation and glycosylation and the abundance of the binding partners, the stoichiometry of phosphorylation and glycosylation can be estimated.
In this study, we measured the temporal changes of the EGFR upon stimulation by EGF in an A 431 cancer cell line. Using an antibody that binds to the intracellular C-terminal region but not to the autophosphorylation sites, EGFR was immunoprecipitated after EGF stimulation for 0, 0.5, 2, and 10 min as well as 4 h. Sequence coverage of 96% for EGFR was obtained at each time point by use of ERPA methodology. A total of 13 phosphorylation sites were identified without the need of enrichment. Direct analysis of EGFR allowed quantitation and estimation of stoichiometry of each site as a function of time of EGF stimulation. Furthermore 10 extracellular glycosylation sites were identified with the major glycosylation patterns characterized, and the relative contribution of individual major glycoforms at a given site was determined. Finally binding partners and their temporal changes with EGF stimulation were determined from quantitative analysis of proteins pulled down in the immunoprecipitation of EGFR. Correction for nonspecific binding proteins and normalization to the amount of EGFR precipitated was also incorporated into the analysis. Importantly because quantitation was achieved by a label-free strategy, the comprehensive analysis of targets in clinical tissue samples will be possible.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
were purchased from Santa Cruz Biotechnology (Santa Cruz, CA). Rabbit phosphospecific EGFR (Tyr(P)992, Tyr(P)1068, Tyr(P)1148, and Tyr(P)1173) antibodies were from Cell Signaling Technology (Beverly, MA), rabbit anti-Tyr(P)1148 EGFR was from BioSource (Camarillo, CA), and mouse anti-GAPDH was from Ambion (Austin, TX). Recombinant human EGF, Nonidet P-40, Tween 20, and protease inhibitor mixture I were obtained from EMD Biosciences (La Jolla, CA). Dimethyl pimelimidate dihydrochloride (DMP), Micro BCA protein assay, horseradish peroxidase-conjugated goat anti-mouse, goat anti-rabbit, and rabbit anti-goat were from Pierce. BSA, phosphatase inhibitor mixtures I and II, sodium fluoride, sodium orthovanadate, citrate, ß-mercaptoethanol, DTT, iodoacetamide, guanidine hydrochloride, and ammonium bicarbonate were obtained from Sigma. Paramagnetic protein A Dynabeads, NuPAGE protein gel reagents (4x lithium dodecyl sulfate sample buffer, Novex Bis-Tris protein gels, MOPS and MES SDS running buffers, transfer buffer, and antioxidant), Invitrolon 0.45-µm PVDF, and SYPRO Ruby gel stain were purchased from Invitrogen. ECL Plus enhanced chemiluminescence kit was from GE Healthcare. Carnation nonfat dry milk was from Nestle USA (Solon, OH). Achromobacter protease I (Lys-C) was obtained from Wako (Richmond, VA). Formic acid, acetone, and acetonitrile were purchased from Fisher, and the HPLC grade water used in all experiments was from J. T. Baker Inc.
Cell Culture
Cultures of the A 431 human epidermoid carcinoma cell line were seeded at 1.5 x 107 cells/150-mm dish in complete growth medium (0.22-µm filtered DMEM with 10% FBS, 2 mM L-glutamine, 1.5 g/liter sodium bicarbonate, 4.5 g/liter glucose, 100 units/ml penicillin, and 100 µg/ml streptomycin) and grown at 37 °C with 5% CO2. After 2 days, the growth medium was removed, the cells were washed twice with TBS (0.22-µm filtered 50 mM Tris-HCl, pH 7.4, 150 mM NaCl) and then serum-starved by continuing cell culture overnight in serum-free medium (DMEM without FBS or BSA).
A 431 cells were either untreated (controls) or treated by removing the serum-free medium and replacing with warmed serum-free medium containing 25 ng/ml EGF and then incubated for 30 s, 2 min, 10 min, or 4 h at 37 °C. Each time point produced 1.5 x 108 cells (four 150-mm dishes at 80% confluency).
Cell Lysis
Immediately following EGF stimulation, the A 431 cells were washed three times with ice-cold TBS and lysed on ice with 5 ml of modified radioimmunoprecipitation assay (RIPA) buffer (50 mM Tris-HCl, pH 7.4, 150 mM NaCl, 1% Nonidet P-40, 1x protease inhibitor mixture set I, and 1x phosphatase inhibitor mixture sets I and II) per plate for 5 min, scraped, and then transferred to a 50-ml tube. Pooled lysates from each time point were adjusted to 50 ml with additional RIPA buffer followed by five passages through a 21-gauge needle, incubation on ice for 30 min, and centrifugation at 10,000 x g for 10 min. The clarified supernatants were transferred to new 50-ml tubes and normalized by total protein concentration (Micro BCA assay).
EGFR Immunoprecipitation
The clarified A 431 lysates were precleared with 75 µg of normal rabbit IgG (cross-linked to 750 µl of paramagnetic protein A bead slurry with 20 mM DMP) by tumbling end-over-end for 4 h at 4 °C. Each rabbit IgG-protein A precleared complex was isolated by magnetic capture and washed six times with chilled RIPA buffer and then twice with chilled TBS (containing protease and phosphatase inhibitors). Precleared products were eluted in four washes of 100 mM citrate, pH 2.8, pooled, and neutralized with 1 M Tris-HCl, pH 9.5. For EGFR immunoprecipitation, the precleared A 431 lysates from above were incubated with 75 µg of anti-EGFR (cross-linked to 750 µl of paramagnetic protein A bead slurry with 20 mM DMP) overnight at 4 °C with end-over-end tumbling. The anti-EGFR-protein A immunoprecipitated complex was isolated, washed, and eluted as above. Eluates were aliquoted and prepared for Western blotting or enzymatic digestion and LC-MS analysis.
Western Blotting
Protein samples were mixed with 4x lithium dodecyl sulfate buffer and ß-mercaptoethanol (2% final concentration) and then heated at 70 °C for 10 min. Cell lysates and immunoprecipitation products were separated by SDS-PAGE (NuPAGE Novex Bis-Tris gels) and transferred onto 0.45-µm Invitrolon PDVF membranes. After overnight incubation at 4 °C in blocking buffer (7.5% glycine, 5% nonfat dry milk, 0.1% Tween 20, 1 mM activated sodium orthovanadate, 50 mM sodium fluoride), the membranes were incubated with primary antibody diluted in wash buffer (TBS, 0.1% Tween 20, 0.1% BSA, 1 mM activated sodium orthovanadate, 50 mM sodium fluoride) containing 1% BSA for 2 h at room temperature. Membranes were then washed five times (3 min each) and incubated with secondary antibody (horseradish peroxidase-conjugated goat anti-mouse, goat anti-rabbit, or rabbit anti-goat) for 1 h at room temperature followed by five washes (3 min each). Blots were developed with ECL Plus reagents (GE Healthcare), and the resulting fluorescent signals were detected using an Eastman Kodak Co. 4000MM unit equipped with 400 nm excitation and 535 nm emission filters. Blot data were quantified using Kodak molecular imaging software. For publication, image brightness and contrast were adjusted with Adobe Photoshop software, and control bands/lanes within each blot were repositioned for continuity.
Enzymatic Digestion
After immunoprecipitation, proteins (mainly EGFR) eluted from the cross-linked antibody/protein A beads were buffer-exchanged with 6 M guanidine hydrochloride using a Microcon spin column (10-kDa molecular mass cutoff; Millipore, Bedford, MA). The exchanged protein solution (200 µl) was reduced with 20 mM DTT for 30 min at 37 °C and alkylated with 50 mM iodoacetamide in the dark at room temperature for 1.5 h. After buffer exchange (desalting) over a Microcon spin column (10-kDa molecular mass cutoff), endoproteinase Lys-C (1:100, w/w) was added to digest the protein for 4 h at 37 °C. Digestion was stopped by addition of 1% formic acid, and then the solution was concentrated down to
60 µl. Approximately 300 nl of the digest was directly analyzed by LC-MS, and the rest was stored for repeat use. For the analysis of the 10-kDa peptide, acetonitrile was added to the digest solution to make a 5% acetonitrile solution prior to LC injection.
LC-MS
LC-MS experiments were performed on an LTQ-FTMS instrument (Thermo Electron, San Jose, CA) with an Ultimate nano-LC pump (Dionex, Mountain View, CA) using a monolithic column (polystyrene divinylbenzene, 50-µm inner diameter x 10 cm), which was prepared in-house.2 Mobile phase A was 0.1% formic acid in water, and mobile phase B was 0.1% formic acid in acetonitrile. A shallow gradient was used for all analyses: (i) 40 min at 0% B for sample loading, (ii) linear gradient to 35% B over 30 min, then (iii) to 80% B over 10 min, and finally (iv) constant 80% B for 10 min. The flow rate of the column (at the initial mobile phase condition) was measured as
100 nl/min.
The parameters of the LTQ-FT mass spectrometer and the acquisition modes for MS2 and MS3 were similar to those in our previous study (22). Briefly the mass spectrometer was operated in the data-dependent mode to switch automatically between MS, MS2, and MS3 acquisition. Survey full scan MS spectra with two microscans (m/z 4002000) were acquired in the FTICR cell with mass resolution of 100,000 at m/z 400 (after accumulation to a target value of 2 x 106 ions in the linear ion trap) followed by four pairs of sequential MS2 and MS3 scans. Dynamic exclusion was utilized with an exclusion duration of 30 s and no repeat counts. The total cycle time (one FTICR survey scan with two microscans plus four pairs of sequential linear ion trap MS2 and MS3 scans) was
2.7 s.
Peptide Assignment
The assignments of peptides (for charge state
3+), large peptides (for charge state
4+), phosphopeptides, and glycopeptides were similar to our previous report (22). Briefly the Sequest algorithm in the BioWorks software (Version 3.2, Thermo Electron Corp.) was used to search all MS2 and MS3 spectra against spectra of theoretical fragmentations of a human Swiss-Prot annotated database downloaded in January 2006 containing 14,094 protein entries with a mass tolerance of ±1.4 Da. Peptide ions (
3+ ions) were assigned by BioWorks 3.2 software, which identified the peptides with a double filter: 1) peptide probability greater than 95% confidence and 2) peptide Xcorr scores above the following thresholds:
2.5 for 3+ and higher charge state ions,
2.0 for 2+ ions, and
1.5 for 1+ ions, with semi-Lys-C specificity and up to three internal missed cleavages. The individual identifications were further confirmed by the high mass accuracy (<5 ppm). For larger ions (
4+ charge), no rigorous statistics are available correlating the probability of correct identification to the Xcorr scores, therefore Sequest was used to select and rank the most probable peptides, and the top assignment was further confirmed by (i) the mass accuracy (<5 ppm) and (ii) preferred fragmentation patterns in the observed MS2 and MS3 spectra (22). For EGFR phosphopeptides, the data were searched against a single database corresponding to the sequence of EGFR with the parameter of differential STY equal to +80 Da. The locations of the phosphorylated sites in the identified phosphopeptides were further confirmed by manual inspection of related b and y ions. For EGFR glycopeptides, the likely glycan structure in a glycopeptide was initially assigned by applying the mass obtained from the difference of a glycopeptide and its deglycosylated counterpart to match against the masses of the glycans in the Glycosuite database (Proteome Systems, Sydney, Australia). Among these likely glycostructure candidates, the best assignment was then selected from the preferred fragmentation patterns obtained in the related MS2 and MS3 spectra.
Quantitation
The charge state with the highest intensity of a given peptide was used for quantitation. In this charge state, the chromatographic peak area derived from the three highest isotopic ions extracted from the precursor ion in the FTICR scans was calculated by an in-house algorithm to automate the selection of chromatographic peaks for quantitation. In addition to the use of the MS scan of the FTICR for quantitation, the mass spectrometer was also operated in the selected ion monitoring (SIM) mode using the linear ion trap for increased sensitivity when necessary. In this mode, a survey full scan MS spectrum was acquired in the FTICR cell followed by several sequential SIM scans in the linear ion trap. Each SIM scan was the average of at least 10 microscans with the selected mass window of ±2.5 m/z. The SIM scan was operated by segments (or retention time windows). For example, six SIM scans were used in the time window between 20 and 25 min for six specific peptides eluting in that time window. The peak areas of the ions in the SIM scans were used for quantitation, and the accurate masses of the ions obtained in the FTICR full scans were used for confirmation of the specific ions and for alignment of the chromatographic peaks from run to run. In addition, selected reaction monitoring (SRM) or multiple reaction monitoring, which measured the specific fragment ions in the MS/MS mode, was used for identification and quantitation of a specific ion, particularly if the given peptide (e.g. a 10-kDa phosphopeptide) had multiple phosphorylation sites with the same mass (e.g. phosphorylated at either Tyr992 or Ser1002). In this mode, the mass spectrometer was set to acquire the FTICR MS scan followed by a targeted MS/MS scan on the particular m/z ion (average of two microscans) in the linear ion trap within a given retention window. The specific ions in the MS/MS spectra were used to differentiate the phosphorylation sites (e.g. b47 ion for Tyr(P)992 site and y43 ion for Ser(P)1002 site). Similarly a targeted MS3 scan was applied to further confirm the site of phosphorylation. The peak areas of these specific ions from the MS/MS spectra (SRM) were extracted for quantitation (i.e. phosphopeptides) or normalization (i.e. non-phosphopeptides). All chromatographic peak areas derived from the average of all isotopic peaks, obtained either from the SIM or SRM scans in the linear ion trap, were calculated manually using the "add peaks" tool in Xcalibur Qual Browsers Version 1.4 SR1 program.
| RESULTS |
|---|
|
|
|---|
108 cells at each time point were lysed, immunoextracted, and analyzed by the ERPA approach. Separately immunoextracted EGFR was also analyzed by SDS-PAGE with anti-EGFR blotting or SYPRO staining (for total protein determination and normalization) as shown in Fig. 2. The data of Fig. 2 support the absence of IgG interference and demonstrate the success of the immunoextraction procedure using antibodies cross-linked to beads. Furthermore the specificity of the anti-EGFR can be seen in the single band resulting from the blotting of either crude cell lysate or immunoextracted EGFR samples. Enrichment can be observed in the immunoextracted samples (lanes CG, darker bands) as compared with the lysate sample (lane B, lighter band).
|
|
We determined 96% sequence coverage for EGFR in the immunoprecipitated fraction, a level similar to our previous work with a commercial source of the protein (22). For the present study, all results were obtained with three injections or a total of
200 fmol for each time point of EGF stimulation. With 200 fmol, we were also able to achieve extensive post-translational analysis for phosphorylation and glycosylation. The complete list of phosphorylated and glycosylated peptides are shown in Tables I and II, respectively. Representative mass spectra are provided in the supplemental material (Supplemental Figs. A, B, and C), and further details on spectral interpretation can be found elsewhere (22). Based on these results, we next sought to analyze dynamic EGFR-mediated events and characterize relative changes in post-translational modification as a function of time after stimulation with physiologic levels of EGF. Our label-free strategy for quantitation and the subsequent results are detailed below.
|
|
Fig. 3A presents the extracted ion chromatogram (XIC) of Lys-C fragment 10381075 in which four different phosphorylation sites were separated. The high resolution of the monolithic column is important because all four monophosphorylated peptides are isobaric. It can be seen that the corresponding non-phosphorylated peptide eluted close to the phosphorylated forms. Fig. 3B shows similar behavior for peptide fragment 10761136 where again the non-phosphorylated peptide eluted in a region of the gradient similar to that of the two phosphorylated forms.
|
![]() |
where SA = chromatographic peak area of peptide A, pSA1 = the peak area of phosphorylated peptide A at site 1, pSA2 = the peak area of phosphorylated peptide A at site 2, etc. The peak area of the XIC, as described under "Experimental Procedures," was determined from the sum of the peak areas of the three most intense isotopes of the most intense charge state of the peptide (see Fig. 3, A and B, insets). Importantly because the phospho- and non-phosphocounterparts eluted close to one another, quantitative variations in ESI due to solvent changes were minimized. In addition, the sum of the chromatographic peak areas of the various forms of the given peptide minimizes variation in quantitation between experiments. Also it should be noted that we examined the use of other peptide fragments of EGFR in the same LC-MS runs as internal standards (36, 37). The trends in change of phosphorylation with time of stimulation were similar using %ES or other peptide fragments of EGFR as internal standards; however, the variation in quantitation from run to run was found to be greater with the latter approach.
It should be noted that the stoichiometry as measured by Equation 1 is estimated because the response factors of Ser(P) and Ser are not identical (13). However, a recent study notes that the response factor differences between the single phosphorylated and non-phosphorylated peptides are generally not great (38). It can be further noted that the large Lys-C-digested peptides (2.510 kDa), along with the additional arginines present, should minimize the differences in response factors between the phospho- and non-phosphomoieties further.
We searched for multiply phosphorylated peptides using, as described under "Experimental Procedures," SIM in the ion trap to enhance detection sensitivity. Only three examples of doubly phosphorylated peptides were found, all with relatively low signal even at the maximum level (see Table I). The assumption is made that the response factors of the doubly phosphorylated peptides do not differ in a major way from the corresponding monophosphorylated peptides given the length of the peptides and the acid mobile phase buffer conditions. Therefore, as a result of the relatively low levels observed for the doubly phosphorylated forms and the expected relatively small differences in response factors between the monophospho- and non-phosphopeptides, we concluded that the ES value should be a reasonable estimate of stoichiometry.
Using Equation 1, the phosphorylation changes at all 13 sites on EGFR were quantitated as a function of the time of EGF stimulation. As an example, Fig. 4 shows the temporal changes that occur in phosphorylation for Tyr1068 as measured by both Western blotting (Fig. 4A) and mass spectrometry (Fig. 4B). The blotting revealed an increase in phosphorylation over time with the total amount of immunoprecipitated EGFR remaining approximately constant. The MS results showed the same trend while providing more details on the quantitation. Phosphorylation at this tyrosine site was seen to rise rapidly within 30 s of EGF stimulation, remained roughly constant to 10 min, and finally increased during the extended time to 4 h. The MS results also exhibited variation in %ES for three runs at each time point, demonstrating the reproducibility of the approach.
|
|
Quantitation of EGFR Glycosylation
As reported in our previous study (22), EGFR has 12 potential N-linked glycosylation sites; 10 of these sites are glycosylated with either high mannose or complex-type structures. As expected, these glycosylation sites are located on the outer membrane surface where ligand binding can occur. Table II lists all the peptide fragments and the glycosylation patterns observed. Note again that in many cases there are additional basic amino acid residues in the Lys-C fragments, leading to an order of magnitude or greater enhanced ionization intensity than the related tryptic fragments. We mapped six sites that were fully glycosylated (Asn151, Asn328, Asn337, Asn420, Asn504, and Asn599) and four sites that were partially glycosylated (Asn32, Asn389, Asn544, and Asn579). All of these EFGR sites were previously known. The soluble variant of EGFR with one additional glycosylation site was not identified because the anti-EGFR used here was against only the C-terminal end of EGFR (10).
As a result of the high sequence coverage obtained by ERPA, the glycostructures of EGFR and the relative amounts of specific glycoforms at individual sites could be determined. The likely glycan structure was initially assigned by comparing the mass obtained from the difference of the glycopeptide and its deglycosylated counterpart against the masses of the glycans in the Glycosuite database (22). Among these likely glycostructure candidates, the most probable assignment was then selected from the preferred fragmentation patterns obtained in the MS2 and MS3 spectra. As noted, the major forms of the glycans (such as high mannose, sialylated complex type, fucosylation, and branching) attached to specific sites were determined but not the specific linkage (such as
16 or ß14 linkage) or isoform (such as galactose or mannose). The latter, however, are usually well conserved in human cells and can be extrapolated from the mass assignment in a human glycostructure database (40).
The glycoforms attached to the same peptide generally eluted close to one another in the chromatogram, and the relative percentage at a given site could then be readily estimated. As with phosphorylation, the measurement of the relative percentage can minimize the bias against the immunoextraction and LC-MS loading procedures. For simplification, the response factors of different glycopeptide isoforms at a given site are assumed to be similar because the major contribution of the response factors will most likely be from the peptide moieties, particularly as the peptide size is increased. In addition, for partially glycosylated sites, because of potential differences in ionization efficiencies between glycosylated and non-glycosylated peptides, the relative level of individual glycan forms at a given site would represent an estimate of the actual percentage. The deglycosylated peptide could in principle be quantitated, leading to an estimate of the relative amount of the partially glycosylated forms. As with phosphorylation, we thus report the relative amounts of the different glycoforms at a given site as percent estimated stoichiometry where the amount of a specific glycopeptide is divided by all forms of the given peptide.
All quantitation followed the procedure used for phosphorylation. Specifically the XIC peak area for a given form was divided by the XIC peak areas of all forms of the proteolytic peptide fragment. All Lys-C peptide fragments that were fully or partially glycosylated were found with only one modified site (see Table II).
An example for the calculation of a fully glycosylated high mannose site, namely Asn337, is presented in Fig. 6. The base ion chromatogram is shown in Fig. 6A where it can be seen that the glycopeptide eluted late in the chromatogram. Elution under high organic content, as well as an additional basic amino acid (see Table II), led to a significant increase in signal relative to the corresponding tryptic peptide fragment. Fig. 6B displays the highest charge state for each glycosylation form at the Asn337 site, and Fig. 6C presents an expanded scale of the 5+ state for Man8 from which quantitation was made. The percentage of Man7, Man8, and Man9 as a function of stimulation time is shown in Fig. 6D. There was no change in the relative amounts of each form at the different time points. Our previous work involved a commercial source of EGFR that was purified from A 431 cells cultured under unknown (to us) growth conditions. Although the specific sites that were fully or partially glycosylated agreed with what was found in the present study, significant differences in the percentage of the forms were observed. For example, at Asn337, the percentage of Man9 was 5-fold higher, and Man6 was 5-fold lower in the present sample relative to the commercial sample. Thus, measurement of the relative amounts of the glycoforms at specific sites is important for comprehensive characterization.
|
|
A double filter was used to minimize identification of proteins that were nonspecifically pulled down in the precipitation. The first filter was the standard negative control experiment consisting of immunoextraction of the A 431 cell line by a non-EGFR antibody (see "Experimental Procedures"). Proteins appearing in both the negative control and EGFR pulldown were eliminated. Then the additional filter was applied such that only proteins that changed by at least a factor of 3 over the time of stimulation were included in the list of likely binding partners. A 3-fold change was selected to minimize variations due to washing of the precipitate from run to run and also to exclude non-changing binding partners during EGF stimulation. Normalization over the time points of the experiment was achieved by dividing the XIC peak area of a specific peptide of the interaction partner by the average XIC peak area of three representative Lys-C peptides of EGFR that were not post-translationally modified. This normalization approach has the further advantage of correcting for differences due to LC sample loading or to immunoprecipitation procedures.
Fig. 8 illustrates the approach for the known EGFR interaction partner GRB2 (41). Fig. 8A, Western blotting, shows a general increase, relative to EGFR, in the amount of GRB2 pulled down in the immunoprecipitation as a function of stimulation time. Fig. 8B indicates a similar trend from the ERPA LC-MS analysis in which the relative change in GRB2 level from 0 to 4 h was over 6-fold. It is interesting to note that the temporal changes for GBR2 follow those of phosphorylation of Tyr(P)1068 (Fig. 4), the known position of association (42).
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
As a demonstration of the depth of coverage, we report procedures for label-free quantitation of PTMs, both phosphorylation and glycosylation, following stimulation of the EGFR by EGF. The estimated phosphorylation in its cytoplasmic tail or glycosylation stoichiometries in its extracellular domain fluctuating over time represent a unique picture of EGFR upon EGF stimulation. In addition, the quantitative changes that occur over time for PTMs can potentially be matched to the quantitative fluctuation of binding partners, thereby providing clues as to which binding partners relate to specific PTMs. Finally the estimated stoichiometries reveal significant differences in the relative levels of phosphorylation, and these differences likely also play a role in the signaling processes.
Although past studies of tyrosine kinase receptors, including EGFR, have evaluated sites of phosphorylation following stimulation, these studies have typically involved some form of enrichment such as IMAC or phosphotyrosine antibody pulldown. Enrichment was required because phosphorylation at individual sites is low. Although effective, especially in global searches for phosphorylation sites on proteins in cells (17), the enrichment approach may be less useful for quantitation of changes over time because enrichment can lead to bias (15). Moreover enrichment requires normalization to correct for (a) initial sample amount differences, (b) differential losses during sample preparation, and (c) differences in sample injection. Temporal studies of EGFR by proteomic approaches have previously involved either isotopic growth medium, e.g. SILAC (29), or chemical labeling, e.g. iTRAQ (isobaric tags for relative and absolute quantitation) (46), but without estimation of the stoichiometry because of the difficulty to obtain the non-phosphorylated or non-glycosylated counterparts in the same run. Our goal was to develop quantitative methods using mass spectrometry that did not include enrichment steps or the use of SILAC or a labeling procedure to provide a general approach that could ultimately be used in tissue.
The developed methodology will allow in-depth exploration of key protein molecules that may presently or ultimately be drug targets. The approach also opens up possibilities to explore such molecules in tissue or blood under various physiological states, e.g. disease versus normal. Such capabilities could prove highly effective in the discovery phase of a study. Although antibodies have been demonstrated to be a powerful approach to quantitation in later stages (47), specific site modifications must first be known. Moreover not all isoforms will produce highly specific antibodies; this is especially the case for the carbohydrate forms.
The results in Figs. 4 and 5 reveal quite distinct kinetic behavior of individual phosphorylation sites. Six tyrosine phosphorylation sites increased with the extent of EGF stimulation. In contrast, a seventh site, Tyr(P)992, decreased upon stimulation (shown in both MS and Western blotting results). A similar downward trend by Western blotting has also been observed by others (48). The temporal trends in Tyr(P)1068, Tyr(P)1086, and Tyr(P)1148 generally agree with those found using labeling approaches (17, 48, 49), leading to further confidence in our label-free results.
We demonstrated temporal changes of phosphorylation for specific serine and threonine sites of EGFR. Although there is a growing compendium of antibodies recognizing specific Ser/Thr site residues, phospho-Ser and -Thr antibodies have historically performed poorly when used for blotting or pulldown purposes. Although phosphorylation did not vary greatly with stimulation time for Ser(P)1057, there are examples where increases, or decreases, in phosphorylation were observed, e.g. Ser(P)1002, Ser(P)1046, and Ser(P)967. Although the biological significance of these specific phosphorylation is unclear, it has been established previously that serine hreonine phosphorylation of the EGFR can play a role in the attenuation of kinase activity (50).
The relationship of extracellular glycosylation to phosphorylation kinetics using the ERPA approach is an avenue that can now be explored and correlated with other functional information. Although, as expected, no temporal changes in glycosylation were observed in this study, variations in growth conditions or the use of different cell lines would be expected to yield a variety of glycosylation patterns that could affect phosphorylation and thus signaling. In addition, it is well known that glycosylation is altered in disease, e.g. cancer (51, 52), and this ability to characterize in detail the glycoforms will prove useful in future studies.
We additionally were able to determine binding partners and estimate the kinetics of binding of these proteins to EGFR with a tight criteria, i.e. only proteins that changed by at least a factor of 3 over the time of stimulation. We acknowledge that some binding partners may not reach that extent of change upon stimulation. To avoid the potential nonspecific interaction and also the measurement variations due to washing of the precipitate from run to run, only 19 of 49 possible interaction partners (data not shown) were included at this stage, and many of these 19 proteins had a trend similar to that found by Western blotting (Fig. 9). We showed that some proteins are up-regulated, whereas others are down-regulated. In some other cases, there appears to be release of specific proteins upon initial stimulation. It may well be that this release is either important for receptor homo- or heterodimerization to occur or that removal of protein exposes sites on the EGFR tail for phosphorylation to take place.
As noted earlier, one of our ultimate goals is to provide comprehensive characterization of such molecules in tissue. Key to this will be the capability to characterize and quantitate at low levels of sample. Although LC-MS is a sensitive method, we can lower detection levels at least 10-fold relative to the several hundred fmol in this study by using LC columns narrower than 50 µm (as discussed in a separate study)2 and by targeting individual precursors for SIM. With these and other advances, the ability for comprehensive characterization of EGFR in disease tissue will be available. Using reverse protein tissue arrays to determine pathways (25), the relationship of the structure of EGFR, e.g. glycosylation, to signaling pathways in disease will be possible to determine. Furthermore the high sequence coverage will allow analysis of specific mutant forms known to influence phosphorylation patterns (6).
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published, MCP Papers in Press, June 23, 2006, DOI 10.1074/mcp.M600105-MCP200
1 The abbreviations used are: EGFR, epidermal growth factor receptor; ERPA, Extended Range Proteomic Analysis; EGF, epidermal growth factor; PTM, post-translation modification; DMEM, Dulbeccos modified Eagles medium; FBS, fetal bovine serum; DMP, dimethyl pimelimidate dihydrochloride; Bis-Tris, 2-[bis(2-hydroxyethyl)amino]-2-(hydroxymethyl)propane-1,3-diol; RIPA, radioimmunoprecipitation assay; SIM, selected ion monitoring; SRM, selected reaction monitoring; XIC, extracted ion chromatogram; ES, estimated stoichiometry; Man, mannose; SILAC, stable isotope labeling by amino acids in cell culture; GAPDH, glyceraldehyde-3-phosphate dehydrogenase; LTQ-FT, linear ion trap integrated with Fourier transform. ![]()
2 J. Zhang, J. K. Kim, S. L. Wu, and B. L. Karger, manuscript in preparation. ![]()
* This work was supported by National Institutes of Health Grant GM 15847. This research was also supported by the Intramural Research Program of the National Institutes of Health, NCI Center for Cancer Research. This paper is Contribution Number 880 from the Barnett Institute. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. ![]()
Both authors contributed equally to this work. ![]()
** To whom correspondence should be addressed. E-mail: b.karger{at}neu.edu
| REFERENCES |
|---|
|
|
|---|