|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 2:215-233, 2003.
© 2003 by The American Society for Biochemistry and Molecular Biology, Inc.

,
Department of Genetics and Developmental Biology, University of Connecticut Health Center, 263 Farmington Avenue, Farmington, Connecticut 06030-3301
¶ Universitätsklinikum Hamburg-Eppendorf, Martinistrasse 52, 20246 Hamburg, Germany
| ABSTRACT |
|---|
|
|
|---|
0.05%). The presence of a constitutively activated protein tyrosine kinase (PTK) oncogene product might increase this percentage 10-fold (to
0.5% of the total), and this relatively minor change is sufficient to induce malignant transformation (1). Because unregulated PTK signaling causes a breakdown in the normal regulation of processes such as cell proliferation and motility, leading directly to human diseases including cancer (2), tyrosine kinase signaling pathways are now a major focus of biomedical research. Given the importance of tyrosine phosphorylation, a major challenge is to develop the means to rationally control and manipulate the cellular tyrosine phosphorylation state. The potential benefits are clearly illustrated by the remarkable success of the small molecule drug Imatinib (Gleevec, STI571; Novartis, Basel, Switzerland) in treating chronic myelogenous leukemia and other malignancies (3). Chronic myelogenous leukemia is caused by a chromosome rearrangement leading to the expression of a constitutively active PTK, the BCR-Abl fusion protein, which is strongly inhibited by Imatinib. A number of other agents that target PTKs are also in various stages of development; for example Trastuzumab (Herceptin; Genentech, South San Francisco, CA), an inhibitor of HER2/Neu/Erb2 receptor-type tyrosine kinase, has shown some success in combination with other anticancer agents in treating advanced HER2-overexpressing breast cancers (4). These success stories have clearly validated the usefulness of specific PTK inhibitors for treating human disease and also provide a persuasive justification for the identification of downstream effectors of PTK signaling, which may be expected to include novel therapeutic targets.
How should we best explore the downstream target molecules in PTK or protein tyrosine phosphatase (PTP) signaling pathways? Recent technical advances, including the availability of the complete human genome sequence, have set the stage for comprehensive or global analyses of PTK/PTP signaling. There are two broad approaches to this goal. One is the comprehensive identification of all PTKs/PTPs existing in the human genome, followed by elucidation of their function and regulation in the cell. Manning et al. have termed the full complement of protein kinases the "kinome," and they currently estimate there are 90 individual PTKs in the human genome (5). Also taking into consideration the number of PTPs in the genome, the spatiotemporal regulation PTK/PTP activity is sure to be extremely complex. Therefore, the comprehensive profiling of these enzymatic activities in vivo would be an enormous challenge at the moment, although activity-based probe technology has begun to emerge as a promising tool for such studies (6, 7).
A second direction is the comprehensive identification of all tyrosine-phosphorylated proteins in the cell, the tyrosine phosphoproteome. This may be a more realistic goal, in part due to the availability of tools such as pTyr-specific antibodies (anti-pTyr) that can be used to detect or enrich for tyrosine-phosphorylated proteins. Historically, major tyrosine-phosphorylated proteins such as FAK, paxillin, p130CAS, etc. have been identified as prominent substrates in cells transformed by PTK oncogenes (810), implying biological relevance to transformation parameters such as cell growth, morphological alteration, or adhesion/motility, and indeed their biological importance has been validated in many cases (1113). In contrast, the comprehensive detection of phosphoproteins must be unbiased. Furthermore, low-abundance proteins (or those phosphorylated at relatively low stoichiometry) are likely to play critical roles in vivo. Therefore, in order to explore new molecular targets in PTK/PTP signaling, reliable technologies that are both highly sensitive and selective are clearly needed.
To address this challenging problem, we have taken advantage of Src homology 2 (SH2) domains to develop a strategy for profiling the global tyrosine phosphorylation state (14). The SH2 domain is a small modular protein domain that binds specifically to tyrosine-phosphorylated peptide ligands; it is the most prevalent type of tyrosine phosphorylation binding motif in the cell, found in a large number of different proteins in metazoan organisms (15, 16). Because these domains play a critical role in normal signaling by mediating the formation of protein-protein complexes in response to changes in tyrosine phosphorylation, the SH2 binding pattern is likely to reflect functionally relevant aspects of the PTK signaling state. In the first part of this article, advantages and disadvantages of current mass spectrometry (MS)-based approaches for analysis of the phosphoproteome are discussed. In the next section, the SH2 domain assay (SH2 profiling) will be described, and finally prospects for the development of high-throughput SH2 profiling formats will be discussed.
| PHOSPHOPROTEOMIC APPROACHES |
|---|
|
|
|---|
|
32P Labeling
32P labeling is frequently used for phosphoproteomics as a highly selective and sensitive means of detecting phosphopeptides (2325). Direct visualization and quantitation of phosphoprotein spots on 2D gels are possible with 32P labeling. Using high-resolution narrow-range 2DE, it is feasible to detect and quantitate differentially phosphorylated forms of a protein, which exhibit similar molecular mass but different isoelectric points (23). Moreover, 32P labeling coupled to immunoprecipitation (IP) allows phosphoproteomic analysis of particular protein complexes or organelles such as the 26S proteasome (26). However, the advantages of 32P are diminishing with the emergence of MS technologies that enable the direct detection of phosphoproteins as well as identification of the phosphorylation site. In addition, inconvenience and safety issues regarding the handling of radioactive materials are an unavoidable drawback of 32P labeling and preclude its use for human tissue samples.
Immunoaffinity-based Methods
IP with anti-pTyr is a powerful tool to enrich for low-abundance tyrosine-phosphorylated proteins, thereby improving sensitivity of detection in subsequent MS analysis. Anti-pTyr IP coupled to either one-dimensional (1D) gel electrophoresis (20, 2730) or 2DE (3133) are now the most prevalent formats for tyrosine phosphoproteomic analysis, allowing unambiguous identification of tyrosine-phosphorylated proteins. Gel staining can be a critical factor for the sensitivity of this approach; Maguire et al. used anti-pTyr IP and 2DE stained with SYPRO-Ruby, which is more sensitive than conventional silver staining, ultimately detecting 67 spots by 2DE and identifying 10 proteins including FAK and Syk (32). As an alternative to anti-pTyr, phosphoprotein binding motifs such as SH2 domains or 14-3-3 proteins can also be used to purify phosphoproteins from complex mixtures (3436). Using a glutathione S-transferase (GST)-Grb2 SH2 domain fusion, Blagoev et al. identified 228 proteins from epidermal growth factor-stimulated cells, of which 28 were specifically induced upon epidermal growth factor stimulation and 2 were unknown (34). It should be noted that not all detected proteins are phosphorylated when IP or pull-down methods are used for enrichment, because unmodified proteins can bind to and coprecipitate with phosphoproteins.
Combining anti-pTyr immunoblotting with 2DE provides considerable detection sensitivity for tyrosine phosphoproteins. Phosphoproteins can be identified by MS analysis of gel spots excised from a reference gel, which correspond to spots detected by immunoblotting (22, 3739). Although this approach can be sensitive for protein identification, in many cases the amount of protein in a spot is not sufficient for identification of the specific phosphorylation site.
Because antibodies for phosphoserine (pSer) and phosphothreonine (pThr) are generally not thought to have sufficient specificity or affinity for IP, enrichment of serine- or threonine-phosphorylated proteins by IP with those antibodies has not been widely used. Recently, however, Gronborg et al. demonstrated that an anti-pSer/pThr PKA substrate antibody was capable of enriching pSer/pThr-containing proteins, leading to the identification of a novel signaling molecule (40). In addition, two-dimensional Western blot analysis with an AKT kinase substrate antibody was also used for sensitive detection of phosphosubstrates (41). Therefore, improved pSer/pThr antibodies have the potential to play a larger role in the future.
Immobilized Metal Affinity Chromatography
Gel-based "off-line" approaches have inherently low throughput, because electrophoresis, staining, and spot-picking are relatively slow compared with MS analysis. Immobilized metal affinity chromatography (IMAC) is a chromatographic technique for phosphopeptide enrichment based on the affinity of Fe(III) or Ga(III) for the negatively charged phosphate group and can be used in an on-line high-throughput format; accordingly, it has been widely used for phosphoproteomic studies (4246). Although IMAC purification is not absolute due to the binding of acidic peptides (19, 47), Ficarro et al. have reduced this background binding by methylester modification of carboxyl groups prior to IMAC, thereby improving detectability of phosphopeptides in subsequent MS (45). They could detect more than a thousand phosphopeptides and identified 383 phosphorylation sites in 216 peptides starting with 500 µg of yeast protein. This is outstanding sensitivity and throughput compared with other published reports. Stensballe et al. reported that custom-made nanoscale Fe(III)-IMAC columns, in combination with 2DE, increased the likelihood of identification of phosphorylation sites (48). Of course this off-line format is incompatible with an automated high-throughput system.
Chemical Modification and Isotopic Labeling
Tagging phosphopeptides by specific chemical modification is attractive because it is amenable to large-scale analysis. Methods in which the phosphate moiety is chemically modified, e.g. by biotinylation, allow enrichment of phosphopeptides by affinity chromatography and the subsequent unambiguous identification of the phosphorylated site (Table I, nos. 1618) (4952). On the other hand, methods in which peptides are differentially labeled with stable isotopes such as 12C/13C or 14N/15N allow accurate determination of the abundance of specific phosphopeptides in one sample relative to another by measuring relative MS signal intensity (Table I, nos. 15, 1821) (34, 39, 51, 5355).
The goal of the isotope-coded affinity tag (ICAT) method is to quantitate relative protein amounts in two samples without separation by 2DE. All cysteine residues in one sample are modified with a biotinylated "heavy" isotope tag, and those of a second sample with a similar "light" isotope tag; the two samples are then combined and the relative intensity of corresponding heavy and light peptides is determined by MS (56). Phosphoprotein isotope-coded affinity tags (PhIATs) is conceptually similar to ICAT, but pSer and pThr residues are tagged instead of cysteine residues. This permits the simultaneous enrichment, quantitation, and identification of phosphopeptides via a biotinylated isotope tag (51). While chemical modification methods based on the ß-elimination reaction, including PhIAT, cannot modify tyrosine residues (Table I, nos. 16, 18, and 19), a different approach based on a carbodiimide condensation reaction can be applied to pTyr (Table I, no. 17) (50).
It has been pointed out that current chemical modification approaches can only detect relatively abundant phosphoproteins (19). Methods aimed at enriching phosphorylated peptides prior to modification have been somewhat disappointing, with increased sensitivity compromised by losses due to multiple additional purification steps. However, a recent report demonstrated the identification of low-abundance phosphoproteins by the PhIAT system (52).
Mass Spectrometry
There is no doubt that current MS technologies have enormous advantages over traditional Edman sequencing, both in sensitivity and throughput. For instance, Yoshimura et al. reported the identification of a total of 28 32P-labeled spots (potential substrates of Ca2+/calmodulin-dependent protein kinase II) by nanoflow liquid chromatography-tandem mass spectrometry (MS/MS) analysis, of which 12 were previously not detectable by Edman sequencing (25). Several approaches are currently used to detect phosphopeptides by MS (20). The matrix-assisted laser desorption ionization time-of-flight (MALDI-TOF) MS "peptide fingerprint" method is a standard tool for peptide identification. In this type of analysis, phosphorylated peptides can be detected by the characteristic 80-Da mass reduction upon alkaline phosphatase treatment. Collecting MS spectra from samples treated with different proteases increases the likelihood of detecting phosphopeptides (24). The lower sensitivity of detection for phosphopeptides in positive ion mode can be overcome by direct scanning of fragment ion PO3- in negative ion mode of MS/MS analysis. An m/z value of -79 for its fragment ion is specific and sensitive for its detection; however, this approach cannot differentiate between pSer, pThr, and pTyr, and subsequent peptide sequence analysis is needed with different polarity. Another approach takes advantage of precursor scanning for the immonium ion of phosphotyrosine (C8H10NO4P), with m/z of 216.043. Using this method for specific detection of phosphotyrosine, coupled to pTyr-IP-1D gel, it is possible to identify a large number of known/unknown signal transduction molecules (2729, 57). Undoubtedly, this MS approach will become a standard tool for tyrosine phosphoproteomics. However, because the pTyr immonium ion (m/z 216.043) must be differentiated from other ions with similar nominal mass (e.g. C9H14NO3S, m/z 216.069), an MS system with extremely high accuracy is required, such as a quadrupole time-of-flight hybrid tandem mass spectrometer (58, 59).
Comparison of Different Approaches
While a variety of different approaches for phosphoproteomics are currently available, it is difficult to assess their relative merits because of differences in the cell systems analyzed or downstream MS analysis in different studies, which may greatly affect sensitivity and data quality. In this regard, direct comparison of different approaches in the same system is invaluable, especially for the analysis of low-abundance tyrosine-phosphorylated proteins. Ficarro et al. compared multiple IMAC-based approaches using capacitated human sperm (39). A total of 200 distinct phosphopeptides were detected from the crude protein mixture, in which 60 phosphorylation sites were mapped. However, in this study only five pTyr sites were mapped, four of which were in peptides that also contained pSer, consistent with the notion that multiply phosphorylated peptides are preferably enriched by IMAC (19).
Immunoaffinity approaches employing pTyr-2D Western blot assays and pTyr-IP-IMAC were also compared in that study (39). Surprisingly, pTyr-2D immunoblotting was more sensitive than pTyr-IP-IMAC for detecting and identifying tyrosine phosphoproteins. This fact reminds us that the pTyr-2D Western blot assay is a powerful tool for tyrosine phosphoproteomics. Phosphoprotein identification by the combination of pTyr-2D Western blot assays and MALDI-TOF MS has become standard, but mapping the phosphorylation sites by this approach remains a challenge. In contrast, IMAC or chemical modification may increase the likelihood of unambiguous determination of the site, and these on-line approaches are well adapted for high-throughput large-scale analysis. Indeed, pTyr-IP-IMAC was recently used to map 64 known/unknown tyrosine-phosphorylated sites on 32 different proteins from human hematopoietic cells (60), and mapping of several hundred phosphorylation sites by IMAC has been reported (45).
In summary, it is obvious that specific approaches need to be chosen depending on the purpose. At the moment, the pTyr-2D Western blot assay is highly sensitive for detection, and low-abundance phosphoproteins can be sequenced by enrichment with anti-pTyr IP. In contrast, IMAC or stable isotope labeling may have the advantage for detecting phosphorylation sites or the relative quantitation of phosphopeptides, respectively. Importantly, for all of these approaches appropriate sample preparation prior to MS analysis is crucial for obtaining high-quality data (61).
Profiling Tyrosine Phosphoproteins
Like genomics, phosphoproteomics is an identification-based strategy, where the primary end point is the comprehensive identification and quantitation of phosphoproteins in a specific cellular context. However, to facilitate the development of new diagnostic and therapeutic targets, the functional profiling of the phosphoproteome is essential (62, 63). Such functional data would include the identity of the phosphorylated protein, the sites phosphorylated, and the kinetics of phosphorylation/dephosphorylation under various physiological conditions. As important, but undoubtedly more challenging, would be characterization of the upstream enzymes responsible for changes in phosphorylation, as well as the downstream consequences of changes in phosphorylation. At the moment, global phosphoproteomic profiling is technically difficult because the sensitivity of available methods is insufficient to identify all PTK substrates in a given sample.
Even without the comprehensive identification of all phosphorylated proteins in a sample, however, a great deal of useful information can be gleaned by profiling the phosphorylation of specific proteins or specific classes of phosphorylation sites. Phosphorylation site-specific antibodies (phospho-Abs), which can detect specific phosphorylated proteins by conventional Western blot assay, have become important tools for analyzing signaling pathways. The coupling of phospho-Abs to protein chip technologies based on high-throughput MS systems such as surface-enhanced laser desorption/ionization time-of-flight (SELDI-TOF) MS may emerge as a valuable approach (6466). Kinase-substrate antibodies that specifically detect individual kinase substrates are also a promising tool for phosphorylation profiling (41). However, generating phospho-Abs involves significant labor and cost, and those currently available do not recognize all known sites (and the number of unknown sites is presumably enormous), so it is not feasible to profile all the tyrosine-phosphorylated proteins in the cell by this approach.
| SH2 PROFILING |
|---|
|
|
|---|
Methodology
SH2 domains are modular, and the recombinant domains can easily be produced in bacteria and bind well to their tyrosine-phosphorylated ligands in vitro; these properties suggested that SH2 domains could be used to probe the tyrosine phosphorylation state of a protein sample. We use the term "SH2 profiling" to describe methods in which a battery of SH2 domain probes are used to provide a qualitative and quantitative fingerprint, or profile, of the overall state of tyrosine phosphorylation (14). In the current method based on the far-Western filter binding assay, a protein extract is separated by conventional 1D gel, transferred to a nitrocellulose or polyvinylidene difluoride membrane, and subsequently probed with labeled GST-SH2 fusion proteins. Replicate filters are probed with different SH2 domains having different binding specificities, and the pattern of binding (intensity and apparent molecular mass of specific bands) for each probe can be compared among different samples.
One general concern with far-Western blotting is the ability to detect specific signals above the background of nonspecific binding. This is a particularly important issue for detection of low-abundance binding partners such as tyrosine-phosphorylated proteins in normal cells. We found that specific signal was enhanced and background reduced when glutathione-horseradish peroxidase conjugate (GSH-HRP) was used to label the GST-SH2 fusion protein probe, as compared with standard detection methods including anti-GST antibody or direct biotinylation of the GST fusion protein (14). While we have not exhaustively examined the reasons for this, one possible explanation is the fact that GSH-HRP can bind only to native, functional GST; therefore, any denatured GST fusion protein, which might bind nonspecifically to proteins on the filter, is not labeled. Another advantage is rapidity; because there is no need to incubate with secondary detection agents (2° antibody, streptavidin conjugate, etc.), washing and incubation times are minimized. Indeed, a 10-min binding reaction followed by brief washing is often sufficient. Finally, the GSH-HRP conjugate is highly oligomerized, containing large numbers of GSH and HRP moieties per complex; thus, incubation of GSH-HRP with GST-SH2 fusions generates a probe with very high avidity of binding (because it has many copies of the SH2 domain) and high specific activity (because it has many HRP molecules), thereby increasing the specific signal.
A central premise of this approach is that different SH2 domains maintain their phosphorylation-dependent and sequence-dependent binding specificities in the far-Western filter binding assay. The phosphorylation dependence of binding has been confirmed by tyrosine phosphatase treatment of the membrane, which led to complete loss of SH2 domain binding (14). Peptide sequence specificity was also addressed. When binding was performed with a single SH2 domain probe at moderate concentrations (e.g. 1 µg/ml), specificity was in some cases compromised; this is most likely because the probe was present in great excess relative to the phosphorylated targets on the membrane, leading to some nonspecific binding to suboptimal sites. We found that specificity could be dramatically improved by performing binding under competitive conditions, where a single labeled SH2 domain competes in the binding reaction with multiple unlabeled heterologous SH2 domains (14). When GSH-HRP is used for detection, however, competitors must be chemically modified to ensure that they cannot bind GSH (otherwise the label would equilibrate between the probe and "competitors"). This can be easily accomplished by treatment of competitors with ethacrynic acid, a GST inhibitor that stably binds to the GSH binding site of GST (14).
In experiments using GSH-HRP-labeled SH2 domain probes and ethacrynic acid-blocked competitors, we showed that SH2 domains could detect specific tyrosine-phosphorylated sites in a protein expressed at endogenous levels in a whole-cell protein lysate. In cells expressing either wild-type platelet-derived growth factor (PDGF) ß receptors or mutants in which specific SH2 binding sites were eliminated, the specificities of the phospholipase C
(PLC
), phosphatidylinositol 3-kinase (PI3-kinase), and p120 Ras-GTPase-activating protein (GAP) SH2 domain probes were tested. As expected, each SH2 domain detected the wild-type PDGF receptor when autophosphorylated in response to PDGF treatment. Each SH2 domain probe no longer bound, however, when the tyrosine residue corresponding to its known binding site on the receptor was mutated (14). This clearly demonstrates the feasibility of using SH2 profiling to characterize PTK signaling: the assay has both the specificity to discriminate among different tyrosine-phosphorylated sites and the sensitivity to detect phosphorylation of a single site on a signaling protein expressed at relatively low levels.
It is important to consider several practical issues regarding the sensitivity and specificity of the SH2 profiling approach. We have shown that in many cases binding of SH2 domains can visualize bands that are not apparent by standard anti-pTyr immunoblotting. Anti-pTyr is accepted to be a very sensitive technique for detecting tyrosine-phosphorylated proteins; thus, we can conclude that the sensitivity of SH2 profiling is as good if not better. In competitive binding experiments, however, the presence of unlabeled competitor SH2 domains affects both sensitivity and specificity. Increased binding specificity under competitive conditions is accompanied by a significant decrease in overall signal intensity compared with the noncompetitive situation. Therefore, for some SH2 domains, especially those with relatively low affinity and/or high intrinsic specificity, noncompetitive binding conditions can give superior results. The mix of competitor SH2 domains must also be carefully considered. Ideally, the competitors will include many SH2 domains with different binding specificities, but not domains closely related to the labeled probe, which would be expected to have similar binding specificity and thus diminish the specific signal through direct competition. The mixture of competitors that maximizes specificity and signal strength may vary for each probe and must be determined empirically. However, we have found that a "standard" competitor mixture of 510 SH2 domains can be used successfully for many probes.
Applications
We have used the SH2 profiling approach to study a variety of biological systems. In one example, we have profiled changes in tyrosine phosphorylation patterns in the course of neuronal differentiation in a cell culture system where PTK signaling has been suggested to play an important role (70, 71). Fig. 1A shows the morphological changes in p19 embryonal carcinoma cells that have been induced to differentiate in response to retinoic acid (RA). RA-treated cells initially withdraw from the cell cycle and become more tightly adherent, then after 1014 days they begin to display neuron-like morphology (72, 73). The tyrosine phosphorylation state was determined by anti-pTyr Western blot and by the far-Western SH2 profiling assay (Fig. 1B) in the course of RA-induced differentiation. The binding of three representative SH2 probes, whose binding patterns were clearly distinct, were superimposed and represented in different colors (right panel of Fig. 1B).
|
We further asked whether this approach could serve as a basis to classify human malignancies based on their tyrosine phosphorylation state. There is currently great interest in developing novel molecular diagnostic methods that can classify tumor cells in biologically and clinically relevant ways. In particular, classification methods that have predictive value in terms of the course of disease, response to therapy, and other clinical parameters will be of great value in choosing the best treatment options for each tumor. cDNA expression profiling has been proposed as a method for both class identification and class assignment and has shown some promise in pilot studies (7476). Because tyrosine phosphorylation can play a critical role in tumorigenesis, as well as regulating factors important for progression including angiogenesis and motility, we reasoned that a global fingerprint or profile of the tyrosine phosphorylation state of a tumor might serve as a functionally relevant means of classifying otherwise similar tumors. Consistent with this idea, SH2 profiling of various human cancer cell lines including prostate, breast, and colon cancer demonstrated that each cell line has a distinct SH2 binding profile (Fig. 2). Furthermore, some cell lines of similar origin (for example the CX-1 and DLD1 colon carcinoma cell lines) show similar patterns, while others are quite distinct.
|
|
|
Fig. 4 shows representative profile data obtained from clinical samples, in this case from two patients with acute myeloid leukemia (AML; FAB classification M2). In both cases, bone marrow as well as peripheral blood mononucleated cells (PBMC) were analyzed for each patient. Several observations are relevant in the context of molecular diagnostics. First, the SH2 profiles of the PBMC samples from the two patients are clearly distinct, despite the similar pathological classification of the malignancies. Second, the profiles for the PBMC and bone marrow of patient 1 are clearly very similar, as expected if the same tumor cells are present in both blood and marrow. This (and other data not shown) demonstrates that SH2 profiles are reproducible from sample to sample, despite differences in preparation. In contrast, very little signal is seen in the bone marrow from patient 2, consistent with the clinical diagnosis of bone marrow remission for this patient at the time the sample was taken. Taken together, these results suggest it is likely that SH2 profiling will provide novel information that can be used to classify tumors and may even serve to identify novel diagnostic markers or targets for therapy.
Practical Considerations and Other Formats
Data Analysis
Data from SH2 profiling experiments can be analyzed in several different ways. 1D gel band patterns or "fingerprints" for a number of SH2 domains can be digitally acquired for each sample and normalized (81). This permits the overall patterns for different samples to be compared and classified based on quantitative similarities in binding pattern, and statistical analyses can be applied to identify potentially significant correlations between binding pattern and biologically or clinically relevant parameters (response to therapy, survival, etc.). While technically feasible, we have not yet established optimal automated protocols for normalizing and comparing profile data because the number of samples to date has been relatively modest. Instead, we manually generate images in which the binding patterns for individual SH2 domains are rendered in false color and superimposed (Figs. 1 4). For this process, and indeed for any method of archiving and comparing SH2 profiling data, it is critical to use standardized and reproducible conditions for gel electrophoresis and membrane transfer. Commercially available precast gels and blotting methods that generate multiple membranes from a single gel will be useful in this regard. It should be noted that stripping and reprobing of membranes is possible, but often results in loss of signal intensity and increased background (data not shown). Manual image analysis is adequate to identify major differences and similarities between samples, but obviously automated computer-based methods for data analysis are an important priority for large-scale quantitative studies.
Protein Identification
While differences in the overall pattern of SH2 binding will be useful for class identification and class assignment, the unambiguous identification of the proteins corresponding to specific bands will be important for understanding the biological significance of profile data. An obvious approach for identification is to take advantage of SH2 domains as affinity reagents to purify binding proteins from a sample, then to identify the bound proteins (and specific phosphorylation sites) using standard MS-based methods. Given the low abundance of most tyrosine-phosphorylated proteins and the unfavorable solution binding kinetics that this implies, relatively large amounts of starting material (108109 cells) are necessary. We are currently purifying SH2 binding proteins of interest by this approach, but have found that low-abundance phosphoproteins are sometimes undetectable. It is likely that the binding of phosphorylated peptides to endogenous SH2 domain-containing proteins present in the cell lysate and other steric factors contribute to making simple affinity purification with SH2 domains relatively inefficient. However, Blagoev et al. reported identifying 228 proteins from epidermal growth factor-stimulated cells by GST-Grb2 SH2 domain binding followed by MS analysis (34), so the combination of SH2 profiling and protein identification may evolve into a powerful tool for phosphoproteomics.
Relevance to Signaling Pathways
It is important to keep in mind that SH2 binding data can provide information about binding sites for endogenous SH2 domain-containing proteins corresponding to the probe. For example, we showed earlier that in GIST cells, the PI3-kinase SH2 domain was unique among the wide variety of SH2 probes examined in that its binding sites were virtually eliminated by Imatinib treatment. Given the effectiveness of Imatinib in inhibiting the growth of these cells, two implications of this result can be explored. First, this suggests that activation of the PI3-kinase signaling pathway by c-Kit may be critical for the growth of these cells, and thus drugs that target this pathway may be useful therapeutically. Second, binding of the PI3-kinase SH2 may be useful as a marker for predicting the sensitivity of GISTs to Imatinib and monitoring response to therapy.
It will also be very useful to begin identifying the components of SH2 domain-phosphoprotein complexes that are present in the cell, and to assemble databases of such complexes in various cells where SH2 profiling has identified potentially important differences in phosphorylation. For example, in the GIST example cited above, it would be of great interest to identify complexes containing phosphoproteins that bind the PI3-kinase SH2 domains. Such an approach has the potential to reveal upstream and downstream components in critical signaling pathways in the cell of interest. Several comprehensive studies have been published in which stable protein complexes have been identified by the combination of expression of a tagged protein of interest, followed by isolation of the tagged protein (along with binding partners) and subsequent MS analysis (82, 83). Similar approaches are feasible for identifying SH2 binding proteins in cell lines, and as discussed above direct affinity purification using SH2 domain probes may also be useful in this regard.
Binding Considerations
To maximize the information obtained from SH2 profiling experiments, ideally one would use a comprehensive panel of probes consisting of all SH2 domains with distinct binding specificities. We are currently using a total of 44 different SH2 probes for the SH2 profiling assay, derived from 24 different SH2 domain-containing proteins. This number is still far from the total number of SH2 domains existing in the human genome (at least 115).2 PTB domains also can bind to specific tyrosine-phosphorylated proteins (68, 69); the binding specificity and sensitivity of the only PTB domain we have tried to date, that of Shc, is sufficient for the far-Western assay (Fig. 3A). Thus, other phosphotyrosine-dependent PTB domains are good candidates for additional probes recognizing distinct classes of tyrosine-phosphorylated sites.
In the end, because of overlapping binding specificities we expect that the complete set of SH2/PTB domain probes will not be necessary to comprehensively sample all classes of tyrosine-phosphorylated binding sites. For example, the structure and binding specificity of the SH2 domains of the eight human Src family tyrosine kinases are very similar (84, 85), thus only one or two representatives from this family need be included in profiling studies. Similarly, many other SH2-containing proteins belong to closely related families with two or more members, only one of which need be included. SH2 domains may be further classified into subgroups according to sequence, three-dimensional structure, peptide screening data, and far-Western binding data. Thus, in the future we anticipate a standard screening strategy using a representative panel of
20 SH2 domains, chosen for distinct binding specificity and relatively strong signals and low background. Alternatively, the comprehensive set of all SH2 and PTB domains could be assayed using various high-throughput multiplexed assay formats outlined in the next section.
Although the competitive SH2 profiling assay allows highly specific detection of tyrosine-phosphorylated proteins, it is obviously not equivalent to protein-protein interaction in the cell, where SH2 domain-phosphopeptide interactions are tightly regulated by local concentration, other protein domains, and binding partners. Therefore, in vitro binding data must be interpreted thoughtfully. Each SH2 domain has a distinct spectrum of affinities for target phosphopeptides, with dissociation constants (Kd) for specific targets in the range of 100 nM to 1 µM (86). Thus, the binding of an SH2 domain probe that is highly specific but has relatively low affinity for its ligands might not be detected in vitro, where conditions have been optimized for high-affinity probes. We have therefore explored ways to improve the apparent affinity of some SH2 probes.
Multimerization of a protein binding module would be expected to increase the avidity of binding, and indeed "natural" tandem SH2 domains, such as those of SHP-2, Zap-70, and PLC
, have been shown to have greater affinity than single domains (87). Such avidity effects would be expected to be exaggerated in the solid-phase far-Western binding assay, where high local concentrations of a phosphorylated binding protein are present in a band on the membrane, allowing the multiple SH2 domains of a tandem probe to bind simultaneously. We have applied a recombinant tandem SH2 domain strategy to the c-Abl SH2, which is normally present in a single copy, and PLC
, which normally contains a tandem pair of SH2 domains. In each case, GST fusion proteins were constructed containing various numbers of tandemly repeated SH2 domains. In both cases the multimerized SH2 domain probes gave stronger binding signals without apparent loss of specificity. As shown in Fig. 5, significant improvement in signal intensity was observed for clinical samples, for which specific binding is often difficult to detect for some probes. It should be noted that all of our SH2 probes are fusions with GST, which exists as a dimer in solution, thus even single SH2 probes are in actuality homodimers. Consistent with the importance of multimerization, we have previously observed that SH2 domains cleaved from GST bind much more poorly than the corresponding GST fusions in far-Western assays (Ref. 14 and data not shown).
|
We are also exploring an alternative approach to SH2 profiling that can be applied to intact cells and tissues. As mentioned above, SH2 profiling can differentiate tumor cells based on their SH2 domain binding patterns; however, clinical specimens, especially solid tumors, generally contain a mixture normal and tumor cells that may themselves exhibit considerable heterogeneity. Furthermore, biopsy samples often contain only a small number of tumor cells, insufficient for analysis by the far-Western approach. Thus, a method allowing visualization of SH2 binding at the single-cell level would be quite useful, and we are therefore developing an immunohistochemistry-based method in which SH2 domain probes are used to stain the tumor cell directly.
Detection of phosphorylation in fixed, paraffin-embedded tumor sections is challenging, but successful staining by phosphospecific antibodies has been reported (8891). We found that by modifying SH2 domain binding and staining conditions, we could also detect SH2 domain-specific signals from paraffin-embedded sections of breast carcinomas (Fig. 6). Of course unlike the far-Western binding assay, no information about the identity (molecular mass) or number of SH2-binding proteins is obtained by this approach. However, absolute quantitative binding information (strong binding, weak binding, no binding) for a large panel of SH2 domain probes is likely to provide sufficient information to classify tumor cells based on similarity in binding patterns. An obvious advantage of this approach is that archived samples linked to existing patient records can be analyzed, permitting large-scale retrospective studies to assess possible correlations between tyrosine phosphorylation patterns and clinical outcomes. Furthermore, quantitative data analysis is greatly simplified compared with the far-Western format. For clinical samples, SH2 profiling in both the 1D far-Western and tissue-staining formats may provide complementary data, representing protein-specific and cell-specific data, respectively.
|
| MULTIPLEXED ASSAY FORMATS FOR SH2 PROFILING |
|---|
|
|
|---|
In principle, two different formats can be applied to study global SH2 binding profiles with multiple SH2 domains (Fig. 7). The first approach is based on the conventional protein array format. A large panel of different SH2 domains is spotted on the surface of an array, and SH2 binding profiles are quantitatively determined after the binding of labeled sample proteins (Fig. 7A). Several recent publications demonstrated that protein-protein interactions or protein expression profiles can effectively be analyzed with high throughput using different microarray-based formats (92, 93). In these investigations, different antibodies, peptides, kinases, or a large panel of recombinant yeast proteins (representing almost 80% of the yeast proteome) were spotted on the surface of microarrays. Incubation of arrays was performed with single binding proteins, an artificial mixture of different target proteins, or complex mixtures of proteins derived from tissue culture supernatants or whole cellular lysates (94). Bound analytes were detected either by direct labeling of proteins prior to incubation (e.g. single or dual labeling by fluorescent dyes), by target-specific secondary antibodies, or by measuring the enzymatic activity of kinases subsequent to the binding reaction (92, 93, 95).
|
SH2 Arrays
In contrast to antibody-based or other protein arrays, there is very little data on comparable array formats for immobilized SH2 domains. In pilot experiments, we tested two different formats for profiling the binding of tyrosine-phosphorylated proteins to immobilized SH2 domains. As a model system, whole cellular lysates from 3T3 fibroblasts and Abl-transformed 3T3 cells were used; the Abl-transformed cells have a much higher level of tyrosine phosphorylation due to the constitutive kinase activity of the Abl oncoprotein (Fig. 8A). In the first approach, different SH2 domains (generated as GST fusion proteins) were immobilized on GSH-derivatized nitrocellulose membranes,3 and binding reactions were performed with 32P-labeled or biotinylated whole cellular extracts (Fig. 8, B and C). In the second approach, pTyr binding profiles were examined by SELDI-TOF after immobilization of different SH2 domains on protein chip arrays and incubation with whole cellular lysates (data not shown). In contrast to anti-pTyr blots and far-Western blot analyses, we found that no specific pTyr binding profile data could be detected using either format under a variety of assay conditions tested (Fig. 8 and data not shown).
|
In general, signaling proteins such as tyrosine kinase receptors are expressed at low levels of several hundred or thousand molecules per cell. Assuming that a crude protein extract from 106 cells is applied to an array in a volume of 1 ml, limits of detection of between 1 pg/ml and 100 pg/ml are theoretically required for the detection of these low-abundance proteins. As outlined above, comparable sensitivities have been achieved by different antibody-based microarray formats, suggesting that low-abundance proteins such as phosphorylated signaling proteins might be detectable by SH2-based microarray formats. The discrepancy between our observations and antibody-based microarrays may be explained by differences in the binding affinities of antibodies and SH2 domains. While the Kd for antigen-antibody reactions is typically below 1 nM, interactions of SH2 domains with tyrosine-phosphorylated proteins are significantly weaker, with affinities in the range of 100 nM to 1 µM (15, 86). Specific binding is further compromised by poor kinetics. Because the rate of binding is proportional to the on-rate and the concentration of analyte, relatively modest on-rates and low analyte concentrations necessarily imply that specific binding will be relatively slow, while nonspecific binding may be more rapid.
Thus, for binding to SH2 arrays, both more sensitive labeling and detection methods and increasing the concentration of the analyte are likely to be required for acceptable signal strength and signal-to-noise ratio. Enrichment of tyrosine-phosphorylated proteins prior to the binding reaction may be achieved by methods already discussed, for example IMAC or anti-pTyr IP (47). However, enrichment can be technically demanding when only small starting amounts of protein are available, and the differences in the relative efficiency of enrichment of different phosphoproteins may bias results. Sensitivity and specificity might also be improved by analyzing different fractions of cellular proteins after chromatographic separation, e.g. by ion exchange or reverse-phase chromatography, but such an approach limits potential throughput (99).
The sensitivity of SH2-based arrays may also be affected by nonspecific binding. Compared with highly sensitive antibody-based sandwich assays in which the analyte is captured by immobilized antibodies and specifically detected by secondary antibodies recognizing the analyte, the binding partners for immobilized SH2 domains in whole cellular extracts are a priori unknown (although a sandwich assay format with anti-pTyr as a detecting antibody is a promising approach to explore (100)). Therefore, SH2-based array analyses require that the protein or peptide sample be unselectively labeled, with the disadvantage of increased background and low signal-to-noise ratios. Moreover, direct labeling is commonly achieved by chemical coupling of labels to peptide reactive groups (e.g. to amino groups by N-hydroxysuccinimide esters); coupling can lead to the inactivation of binding sites and/or decreased solubility, and quantitation may be biased by disproportionate labeling depending on the number of reactive groups present in a given peptide. Taking together current data and theoretical considerations, it remains to be established whether SH2 domain-based array formats are capable of profiling the cellular phosphorylation state of complex protein mixtures.
Reverse-phase Arrays
A second strategy for profiling the global cellular SH2-binding state is based on "reverse-phase" formats, in which phosphoproteins present in immobilized cellular lysates are detected by soluble SH2 domain probes (Fig. 7B). The feasibility of this approach is clearly validated by the success of the 1D far-Western approach described above. For immobilization, protein samples are spotted on solid surfaces (membranes, plastic, or glass) or are covalently attached to beads either in their native or denatured state (92, 101). A reverse-phase microarray approach was recently applied for comparative expression analysis of several prosurvival proteins in microdissected normal and cancerous prostate specimens (65). Levels of protein expression were determined by antibodies, and signal amplification was achieved with biotinylated secondary antibodies in combination with the streptavidin-peroxidase complex. For single immobilized recombinant proteins, the limit of detection was
1 ng/ml and specific signals were detectable in whole cellular extracts derived from less than 200 cells; intra- and interassay reproducibility varied between 2 and 40% and increased with decreasing amounts of protein immobilized (65).
In pilot experiments, we tested the reverse-phase approach for SH2 profiling in slot-blot or dot-blot formats using immobilized protein extracts of normal and v-Abl-transformed cells (Fig. 9 and data not shown). In accordance with previous far-Western blot analyses, differential binding of individual labeled SH2 domains to whole cellular lysates was reproducibly detectable. To this point, pTyr-specific signals were detected from as little as 0.5 µg of whole cellular lysate, equivalent to
2500 cells. Although not yet tested on microarray formats, our data suggest that quantitative SH2 profiling could be performed on miniaturized reverse-phase arrays with relatively low amounts of immobilized protein. However, it is likely that for the detection of low-abundance proteins in extracts of small numbers of cells, more sensitive detection methods will be required.
|