Large-scale Proteomics Analysis of the Human Kinome

Members of the human protein kinase superfamily are the major regulatory enzymes involved in the activity control of eukaryotic signal transduction pathways. As protein kinases reside at the nodes of phosphorylation-based signal transmission, comprehensive analysis of their cellular expression and site-specific phosphorylation can provide important insights into the architecture and functionality of signaling networks. However, in global proteome studies, low cellular abundance of protein kinases often results in rather minor peptide species that are occluded by a vast excess of peptides from other cellular proteins. These analytical limitations create a rationale for kinome-wide enrichment of protein kinases prior to mass spectrometry analysis. Here, we employed stable isotope labeling by amino acids in cell culture (SILAC) to compare the binding characteristics of three kinase-selective affinity resins by quantitative mass spectrometry. The evaluated pre-fractionation tools possessed pyrido[2,3-d]pyrimidine-based kinase inhibitors as immobilized capture ligands and retained considerable subsets of the human kinome. Based on these results, an affinity resin displaying the broadly selective kinase ligand VI16832 was employed to quantify the relative expression of more than 170 protein kinases across three different, SILAC-encoded cancer cell lines. These experiments demonstrated the feasibility of comparative kinome profiling in a compact experimental format. Interestingly, we found high levels of cytoplasmic and low levels of receptor tyrosine kinases in MV4–11 leukemia cells compared with the adherent cancer lines HCT116 and MDA-MB-435S. The VI16832 resin was further exploited to pre-fractionate kinases for targeted phosphoproteomics analysis, which revealed about 1200 distinct phosphorylation sites on more than 200 protein kinases. This hitherto largest survey of site-specific phosphorylation across the kinome significantly expands the basis for functional follow-up studies on protein kinase regulation. In conclusion, the straightforward experimental procedures described here enable different implementations of kinase-selective proteomics with considerable potential for future signal transduction and kinase drug target analysis.

Reversible protein phosphorylation represents the most common type of post-translational modification (PTM) 1 in eukaryotic organisms. A plethora of studies on a large variety of proteins have established that site-specific phosphorylation events fulfill key functions in the activity control of signaling cascades and networks (1). Cellular protein phosphorylation is controlled by more than 500 members of the protein kinase superfamily, which comprises one of the largest enzyme families encoded by the human genome (2). Protein kinases represent the key elements in phosphorylation-based signal transmission. Aberrant protein kinase expression and/or activity, often because of gene amplification or mutational changes, is involved in pathological processes leading to malignant transformation and tumor development (3). Therefore, protein kinases have emerged as a major class of drug targets for therapeutic intervention (4 -6). Given the diversity of molecular mechanisms related to de-regulated kinase function in human cancers, proteomic approaches could significantly enhance our understanding of disease-relevant kinase function and also help to optimize and adjust therapeutic strategies. In addition to assessing protein expression, the analysis of site-specific phosphorylations on protein kinases is of particular relevance, as these PTMs can be indicative of their cellular catalytic activities (7,8). Protein kinases can not only modulate each other's functions and activities through site-specific phosphorylation events, but often also undergo site-specific autophosphorylation once they get activated (9). Thus, the comprehensive assessment of kinase-derived phosphopeptides can provide important insights into the regulation of these key players in phosphorylation-controlled signaling.
Regulatory enzymes such as protein kinases are often expressed at low cellular levels. This can impede their detection by LC-MS in highly complex peptide mixtures derived from total cell or tissue extracts. These analytical challenges are further aggravated in phosphoproteomic experiments due to the fact that many phosphopeptide species result from substoichiometric phosphorylation events (10). Consequently, phosphopeptide isolation methods have proven to be essential. Among others, techniques such as immobilized metal affinity chromatography or enrichment by means of titanium dioxide (TiO 2 )-coated beads have found widespread use in MS-based phosphoproteomics (11)(12)(13). In addition, to reduce initial sample complexity, either protein fractionation by gel electrophoresis or peptide separation by strong cation exchange chromatography is typically included in contemporary phosphoproteomics workflows (14 -16). These separation techniques in combination with LC-MS on state-of-the-art mass spectrometers enabled the identification of thousands of phosphorylation sites from total cellular extracts (15,17,18). Despite these impressive advances, such large-scale efforts require considerable instrument time, and the current methodology is still not comprehensive across the full dynamic range of the entire phosphoproteome. This creates the rationale for sub-proteome analyses to achieve high coverage and analytical sensitivity, which is particularly relevant for members of the protein kinase enzyme family.
To date, the only pre-fractionation techniques permitting the enrichment of more than a few protein kinases are affinity capture methods relying on immobilized and kinaseselective small molecule inhibitors (19 -21). We and others have demonstrated that combinations of such kinase inhibitor resins efficiently pre-fractionate kinases for subsequent phosphorylation analysis (7,22,23). Ideally, capture molecules for kinase proteomics have two properties. First, they should exhibit high non-selectivity within the kinase superfamily. Second, they should efficiently discriminate between protein kinases and other classes of cellular proteins under the biochemical conditions of the pre-fractionation procedure.
In our efforts to characterize affinity reagents fulfilling these criteria, we quantitatively compared a selection of immobilized pyrido [2,3-d]pyrimidine-based inhibitors with respect to their proteome-wide kinase binding properties. Based on this assessment, an affinity matrix displaying the small molecule VI16832 was used as an enrichment tool for the comparative expression analysis of protein kinases in different cancer cell lines. The highly efficient VI16832 affinity resin further enabled a large-scale phosphoproteomics survey resulting in the identification and confident assignment of about 1200 phosphorylation sites on more than 200 distinct protein kinases.
Cell Lysis and Affinity Enrichment-Cells were lysed in 50 mM Hepes-NaOH, pH 7.5, 150 mM NaCl, 0.5% Triton X-100, 1 mM EDTA, 1 mM EGTA, 1 mM phenylmethylsulfonyl fluoride, 10 mM NaF, 2.5 mM Na 3 VO 4 , 50 ng/ml calyculin A (Alexis Biochemicals, San Diego, CA), 10 g/ml aprotinin, 10 g/ml leupeptin, and 1% phosphatase inhibitor mixture 1 and 2 (Sigma) for 1 h at 4°C. The cell debris was removed by centrifugation (20 min at 13,000 rpm) and by filtering through 0.22-m mixed esters of cellulose membranes (Millipore). Protein concentration was measured using the BCA assay (Pierce). For comparative SILAC analysis of different inhibitor resins, 1.5 mg from each differentially labeled MV4 -11 lysate was subjected to in vitro association with the respective kinase inhibitor resins. 30 l of drained beads coupled with the respective kinase inhibitor were washed three times with lysis buffer and further three times with lysis buffer containing 1 M NaCl. Washed beads were incubated for 2.5 h at 4°C in the dark with the lysates that had been adjusted to 1 M NaCl in a final volume of 650 l. In each experiment, aliquots of the three differentially labeled lysates were pooled to determine the initial SILAC ratios and resulting correction factors for the quantification after affinity enrichment. Beads were washed twice with lysis buffer containing 1 M NaCl and twice with lysis buffer containing 150 mM NaCl. For elution, resin-bound proteins were incubated for 10 min with 50 l 0.5% LDS buffer (Invitrogen) containing 50 mM dithiothreitol at 70°C. Elution fractions were pooled and concentrated by a factor of three in a vacuum concentrator (Eppendorf). Moreover, aliquots of the different elution fractions were compared by immunoblotting with kinase-specific antibodies.
For SILAC-based comparison of protein kinases in MV4 -11, HCT116, and 435S cells, total cell lysates were prepared as described above and all adjusted to 1.5-mg protein in a volume of 500 l. This amount of protein was obtained upon lysis of 17 ϫ 10 6 MV4 -11, 7.3 ϫ 10 6 HCT116, and 5.3 ϫ 10 6 435S cells, respectively. The three lysates were pooled prior to incubation with 90 l of drained VI16832 beads according to the same protocol as used for the inhibitor resin comparisons.
Protein kinase enrichment for phosphorylation site mapping was performed using an Ä KTA explorer system and Tricorn 5/20 chromatography columns (GE Healthcare) packed with 500 l of VI16832 resin. Cells were lysed in a volume of 35-40 ml per experiment. The protein amounts of the starting extracts used in the first and second experiments were: 435S, 85 and 120 mg; HCT116, 240 and 175 mg; MV4 -11, 180 and 120 mg. Lysates were adjusted to 1 M NaCl prior to loading onto the VI16832 column at a flow rate of 0.07 ml/min. Subsequent washing and elution steps were performed as described previously (22). Protein-containing elution fractions were lyophilized, re-suspended in one tenth of the initial volume, and then desalted by protein precipitation prior to gel electrophoresis (28).
Sample Preparation and MS Analysis-For gel electrophoresis, ready-made 10% NuPAGEா Bis-Tris gels (Invitrogen) were used according to the manufacturer's instructions. Resolved proteins were stained using the Collodial Blue staining kit (Invitrogen). In all SILAC experiments, gels were cut into three slices followed by in-gel digestion with trypsin and peptide purification with StageTips as described (29,30).
For phosphopeptide identifications, gels were cut in either three (experiment 1) or 6 (experiment 2) molecular weight regions prior to in-gel proteolysis with trypsin (29). Phosphopeptides were specifically enriched using titanium dioxide (TiO 2 ) microspheres (31,32). The TiO 2 beads (GL Science, Tokyo, Japan) were first equilibrated by consecutive incubations with 20 mM NH 4 OH in 20% acetonitrile (ACN), pH 10.5, washing buffer (50% ACN, 0.1% trifluoroacetic acid) and loading buffer (5 g/liter 2,5-dihydrobenzoic acid in 55% ACN). Fractions of extracted peptides were adjusted to loading conditions and incubated for 30 min with 5 mg TiO 2 beads at room temperature on a rotating wheel. Afterward, beads were washed once with 100 l of loading buffer, three times with 1.5 ml of washing buffer, and phosphopeptides were eluted by incubating twice with 30 l of 20 mM NH 4 OH in 20% ACN, pH 10.5. Eluates were combined and passed through C 8 StageTips followed by a 30-l rinse with 80% ACN, 0.5% acetic acid. After adjusting to a pH of 6, samples were concentrated to ϳ3 l and mixed with an equal volume of 4% ACN, 0.2% trifluoroacetic acid. MS analyses were done as described previously (7,15). Briefly, peptide separations were done on 15-cm analytical columns (75-m inner diameter) in-house packed with 3-m C 18 beads (Reprosil-AQ Pur, Dr. Maisch) using a nanoflow high pressure liquid chromatography system (Agilent Technologies 1100), which was coupled online to a LTQ-Orbitrap mass spectrometer (Thermo Fisher Scientific) via a nanoelectrospray ion source (Proxeon Biosystems). The LTQ-Orbitrap was operated in the data-dependent mode to automatically switch between full scan MS in the orbitrap analyzer (with resolution r ϭ 60,000 at m/z 400) and the fragmentation of the five most intense peptide ions by either MS/MS or multi-stage activation in the LTQ part of the instrument, the latter being triggered on neutral loss species at 97.97, 48.99, or 32.66 m/z below the precursor ion for 30 ms (33). For all measurements with the orbitrap detector, a lock-mass strategy was used for internal calibration as described (34).
Peptide Identification, Quantitation, and Data Analysis-Raw MS files acquired from individual experiments were merged using the Raw2msm software (34), and the resulting msm files were searched against concatenated forward and reversed versions of the human IPI protein database version 3.13 (SILAC-based inhibitor comparison), version 3.19 (phosphorylation site mapping), or version 3.24 (SILAC-based kinome profiling) containing 57,032, 60,397, and 66,921 protein entries, using the MASCOT search engine (Matrix Science). All databases contained frequently occurring contaminants including human keratins, porcine trypsin, and endopeptidase Lys-C. Search parameters were set to up to three missed cleavages, mass tolerances of 25 ppm for MS, and 0.5 Da for MS/MS scans. Carbamidomethylation of cysteine was set as fixed modification; variable modifications included oxidized methionine, phosphorylation of serine, threonine and tyrosine, N-acetyl protein, N-pyroglutamine and in the SILAC experiments, the isotopic variants Lys 4 , Lys 8 , Arg 6 , and Arg 10 .
The html output files generated by MASCOT together with the raw data files were then further processed using the MSQuant software, version 1.4.0 used for SILAC-based inhibitor comparison and version 1.4.3 for SILAC-based kinome profiling and phosphorylation site identification). Prior to peptide quantification or computation of PTM scores, peptide datasets were filtered for a false-discovery rate (FDR) of less than 1% (p Ͻ 0.01) according to a target/decoy database searching strategy. To achieve a FDR of less than 1%, filtering criteria such as a peptide length Ն 6 and a mass error Ͻ 5ppm were applied together with a minimal MASCOT score that ranged from 21 to 29 depending on the experiment.
MSQuant determines the average ratio over the peptide elution profile, and all precursor ion assignments used for quantitation were manually validated (15). Upon normalization for the initial SILAC pooling error, protein ratios were calculated as the mean of all ratios from uniquely assigned peptides.
To identify highly significant differences in relative protein abundance, the relative ratios of the protein quantifications from the two biological replicate experiments were analyzed for their normal distribution to account for the combined biological and technical variation in the quantitative MS analyses. Protein abundance was considered as significantly different (p Ͻ 0.01) in case ratios differed from the mean by 2.58 as determined from the "ratio of ratios" distributions of the biological replicate analyses.
The assignment of phosphorylation sites in identified phosphopeptides was done with the PTM scoring algorithm implemented in MS-Quant as described previously by Olsen et al. (15). In our present study, phosphorylation sites were rated as class I in case of a localization probability of at least 0.95. The localization p values for all identified phosphopeptides as well as the corresponding annotated MS/MS spectra can be accessed online (35).For enrichment analysis of gene ontology (GO) categories, Cytoscape (36) together with the BinGO plugin (36,37) was used to identify statistically over-represented GO molecular function terms compared with a reference dataset consisting of all IPI entries and their respective GO identifiers essentially as described (38).

RESULTS
Comparative Target Profiling for Kinase-selective Pre-fractionation Reagents-To enable broad kinase enrichment, immobilized kinase inhibitors should ideally exhibit considerable non-selectivity in conjunction with high affinity for many members of the protein kinase superfamily. The previously described pyrido [2,3-d]pyrimidine-based capture molecule PP58 exhibited high potency and non-selectivity for a subset of protein kinases comprising about 25% of the human kinome (39). These kinases possess a small amino acid (often a threonine residue) at a critical "gate-keeper" position, which does not interfere with the positioning of the inhibitor's dichlorophenyl moiety in a hydrophobic pocket located at their ATP binding sites (40). In contrast, this cavity is rather inaccessible in the majority of protein kinases with more spacious "gatekeeper" residues (40). We initially reasoned that this part of the kinome might be targeted by a PP58-related compound that lacks the dichlorophenyl group but is otherwise identical in structure. However, such a capture molecule exhibited fairly weak kinase affinity (data not shown). Therefore, based on previously described structure-activity relationship data for Cdk4 inhibition by pyrido [2,3-d]pyrimidine kinase antagonists (25), we have recently introduced compounds with larger cyclopentyl and norbornyl moieties (designated VI16743 and VI16832) at the N8 position to compensate for the observed drop in potency (7) 2 . As expected, these capture molecules retained considerable numbers of kinases. However, the actual effect of the N8 substituent on cellular target binding profiles has not been systematically analyzed. Such information would help to adjust pre-fractionation toward the cellular kinases that are of interest in individual projects. Therefore, we prepared pyrido [2,3-d]pyrimidine ligands with the cyclopentyl and norbornyl moieties (VI16743 and VI16832) as well as a derivative with a smaller ethyl moiety in the N8 position (VI16741) and immobilized all three compounds through their primary amino groups (Fig. 1A) (7).
To compare the VI16741, VI16743, and VI16832 resins, we performed stable isotope labeling by amino acids in cell culture (SILAC) with the acute myelogenous leukemia (AML) cell line MV4 -11 to enable quantitative MS analysis (41). These cells harbor an internal tandem duplication in the juxtamembrane domain of the FMS-like tyrosine kinase 3 (FLT3). This mutation is also present in a subset of AML patients and results in constitutive up-regulation of growth-promoting FLT3 tyrosine kinase activity (42). Upon quantitative incorporation of either normal arginine and lysine (Arg 0 /Lys 0 ) or their isotopic variants (Arg 6 /Lys 4 or Arg 10 /Lys 8 ), lysates from three differentially SILAC-encoded MV4 -11 cell populations were incubated with the affinity resins carrying covalently immobilized VI16741, VI16743, or VI16832 as capture ligands (Fig. 1, A and B). After in vitro association and elution of retained proteins, we first analyzed small aliquots of the elution fractions by gel electrophoresis and silver staining. Most proteins appeared to be present in all three fractions, albeit typically at somewhat higher levels in the VI16743 and VI16832 resin eluates (Fig. 1C). In addition, some protein bands showed specific resin binding, such as a prominent VI16741 and VI16743 resin-interacting 35 kDa protein identified as mitochondrial delta (3,5),delta(2,4)-dienoyl-CoA isomerase and a VI16832 resin-bound 50 kDa protein identified as the multifunctional protein ADE2 (data not shown), indicating that different hydrophobic moieties in the N8 position can confer selectivity as evident for these abundant purine-binding nonprotein kinases.
The large remainders of the elution fractions were combined prior to gel electrophoresis and tryptic digestion of proteins from three different molecular weight regions. The resulting mixtures of proteolytically derived peptides were analyzed by LC-MS on a LTQ-Orbitrap hybrid mass spectrometer. After database searching and filtering for an identification certainty greater than 99%, relative peptide abundances between VI16741-, VI16743-, or VI16832-enriched fractions were determined by SILAC-based quantification. Quantitative peptide data was then used to calculate the relative protein levels in the different resin eluates (supplemental Tables 1 and 2). In total, more than 130 distinct protein kinases could be identified and quantified in the inhibitor resin eluates. This demonstrates the capacity of the pyrido[2,3d]pyrimidine-based affinity ligands to pre-fractionate and detect almost half of the expressed kinome, which is estimated to consist of up to 300 distinct protein kinases in a given mammalian cell (43). A considerable subset of protein kinases interacted more strongly with the VI16743 and V16832 resins than with the VI16741-containing beads, indicating that the space-filling cyclopentyl and norbonyl moieties in the N8 position resulted in an overall increase in potency with respect to kinase binding ( Fig. 2A). In a few cases, bulky N8-substituents were not well tolerated, notably by various NIMA-related expressed kinase (NEK) family members (NEK3, 6, and 7) that were found in higher abundance in the VI16741 resin eluates. The differences between the VI16743 and V16832 resins were less pronounced, with relatively small subsets of protein kinases preferentially bound by either affinity matrix. Similar comparisons were also made for the more than 250 nonprotein kinase proteins quantified from the affinity resin eluates (supplemental Fig. 1A). To verify the quantitative MS approach with a second assay, immunoblotting was done with a selection of kinase-specific antibodies. The outcome of this analysis was found in excellent agreement with the MS results (Fig. 2, B and C). To evaluate the reproducibility of the SILAC-based quantification, the protein kinase ratios obtained in biological replicate analysis were visualized in scatter blots (Fig. 2D). Notably, independent experimental ratios for VI16743 versus VI16741 as well as for VI16832 versus VI16741 resin binding were similar, demonstrating the accuracy and reliability of the quantitative MS approach. Comparable results were obtained for the identified non-protein kinases (supplemental Fig. 1B). Moreover, as determined from the distribution of the ratios of the replicate, log 2 -transformed protein quantifications, values of more than 2.65 or less than 0.38 indicated differential binding with high confidence (p Ͻ 0.01) (supplemental Fig. 2). Hence, the SILACenabled strategy applied here not only enabled the compar-ative profiling of pyrido [2,3-d]pyrimidine-derived kinase enrichment reagents, but further exemplifies the general utility of such approaches to quantitatively assess cellular target binding to distinct immobilized small molecule ligands in a compact experimental format (44).
The pyrido [2,3-d]pyrimidine-based affinity resins are efficient and easy-to-use purification reagents for a considerable subset of the expressed kinome. Importantly, many of the isolated protein kinases have not been tractable biochemically by conventional immunoprecipitation approaches, due to the lack of high affinity antibodies for that purpose. In agreement with efficient protein kinase-selective enrichment, GO analysis revealed highly significant overrepresentation of corresponding GO molecular function terms in the resinbound protein fraction (data not shown). Moreover, although protein kinases accounted for just one third of all protein identifications, they were on average identified and quantified with three times more peptides than non-protein kinases, indicating even higher purification efficiency as evident from GO analysis.
To further explore the binding characteristics of the pyrido[2,3-d]pyrimidine-based affinity ligands, we performed a GO analysis for over-represented molecular functions in the subset of resin-bound proteins that were not protein kinases ( Fig. 3 and supplemental Table 3). This assessment revealed considerable enrichment for nucleotide-dependent enzymes such as oxidoreductases, dehydrogenases, and lipid kinases, which likely resulted from direct interactions with the purinelike pharmacophore provided by the immobilized inhibitors. Notably, important signaling factors were among these resinbound enzymes such as phosphatidylinositol 3-kinase, phosphatidylinositol-4,5-bisphosphate 3-kinase and others. GO analysis of non-protein kinases further indicated over-representation of the molecular function "protein kinase binding", which resulted from the specific co-purification of protein kinase interactors such as various cyclins as well as other regulatory kinase subunits.
Comparative Kinase Expression Analysis in Different Cancer Cell Lines-The quantitative comparison of pyrido [2,3d]pyrimidine resins demonstrated efficient kinome fractionation by the VI16743 and VI16832 resins, and we chose VI16832-based affinity enrichment to monitor cell-type specific differences of kinase expression profiles. The identification of up-regulated kinases has considerable potential in tumor cell analysis, given that over-expression of protein kinases is frequently involved in carcinogenesis. To test this experimental strategy, three cancer cell lines (MV4 -11 leukemia, HCT116 colon carcinoma, and 435S melanoma-derived cells) were differentially SILAC-encoded prior to cell  lysis. Total cell extracts were combined and incubated with VI16832 inhibitor beads followed by sample processing and quantitative MS as described above (Fig. 4A). In total, more than 170 protein kinases and almost 40 other nucleotideutilizing enzymes could be identified and quantified in their relative expression levels in the three cancer cell lines (supplemental Tables 4 and 5). Our affinity resin-based strategy covered more than twice as many protein kinases as a previously reported approach, which involved the use of reactive and biotinylated ATP conjugates for kinase expression profiling (45). Moreover, the VI16832 resin enabled the analysis of an almost similar number of protein kinases in a single sample as detected upon enrichment with up to seven distinct affinity ligands for protein kinases (7,23). The comparison of replicate experiments indicated high reproducibility of the quantified protein ratios (supplemental Fig.  3). To generate expression profiles, protein levels detected in the cell line with the highest abundance were set to 100%. The quantified ratios were then used to determine percentage values for relative expression in the other two cell lines. In cases where no peptide ion signals were recorded in one or two of the analyzed cell lines, expression values were set to 0%. We then generated a heat map to visualize quantified protein kinases and other nucleotidebinding enzymes according to their relative expression patterns across the three cancer cell lines (Fig. 4B).
The summed relative ratios of protein kinases were rather similar for the seven major groups of the kinome as well as for the other and atypical kinases (Fig. 4C). Interestingly, when we divided the tyrosine kinase group into its cytoplasmic and receptor-type members, the overall relative expression of cytoplasmic PTKs was higher in the MV4 -11 suspension cells compared with the adherent HCT116 and 435S cell lines. Syk, Tec family kinases (Btk, Tec) and several members of the Src family (Lyn, Fgr, HCK) were substantially higher expressed or exclusively detected in MV4 -11 cells. Furthermore, we found two examples of an inverse correlation of closely related cytoplasmic PTKs; Fes and PYK2 were far more prominent in MV4 -11, whereas their close relatives Fer and FAK were found in much higher levels in HCT116 and 435S cells. Unlike for cytoplasmic PTKs, HCT116 and 435S cells exhibited a higher overall level of receptor tyrosine kinases according to our analysis (Fig. 4C). However, FLT3 receptor tyrosine kinase expression was exclusively found in MV4 -11 cells, demonstrating that our comparative analysis revealed this key oncogenic kinase in receptor tyrosine kinase pathogenesis.
To verify that differential resin binding measured by quantitative MS reflects cell-type specific kinase expression, we cultivated all three cell lines in normal growth media and prepared total cell extracts using denaturing lysis conditions. Subsequently, cell extracts were compared by immunoblotting against a selection of kinases, which exhibited significant cell-type specific differences according to the SILAC-based quantification of resin eluates. As shown in Fig. 4D, immunoblotting results were found in good agree- FIG. 3. Gene ontology analysis of non-protein kinases enriched with immobilized kinase inhibitors. Non-protein kinases that were identified with at least one unique peptide upon inhibitor affinity enrichment from MV4 -11 cell extracts were compared with the entire list of IPI entries. Significantly over-represented GO molecular function terms (p Ͻ 0.001) are shown. Ratios represent the numbers of either inhibitor-enriched proteins or all IPI entries annotated to the listed GO molecular function terms divided by the respective numbers of all proteins with annotated GO molecular function terms. ment with the measured SILAC ratios. This indicated that other potential sources of variation that could affect the affinity purification approach, such as cell-type specific expression changes upon SILAC or differential solubilization of kinases due to non-denaturing cell lysis, apparently had no major influence on the quantitative cell line comparisons.
Large-scale Phosphoproteomics Analysis of Cancer Cell Lines upon Kinase Affinity Enrichment-The efficient pre-fractionation of protein kinases from total cell extracts provides an experimental basis for the analysis of post-translational modifications with high analytical sensitivity. To exploit this potential for phosphoproteomics analysis, we employed the VI16832 resin for protein kinase enrichments from MV4 -11, HCT116, and 435S cell lysates. In contrast to the batch purification protocol described above, a column chromatography set-up was employed to enable processing of larger amounts of starting material and thereby enhance the sensitivity of phosphopeptide detection. Affinity-purified proteins from the different cancer cell lysates were separated by gel electrophoresis prior to in-gel digestion with trypsin and subsequent phosphopeptide enrichment with TiO 2 microspheres (31,32). Raw data from the LC-MS analyses were filtered for phosphopeptide identifications within a false-discovery rate of less than 1% for each individual experiment. Phosphoproteomics analysis of VI16832-enriched fractions from MV4 -11, HCT116, or 435S cells resulted in more than 8500 phosphopeptide identifications. These translated into almost 1700 distinct phosphopeptide species derived from 212 different members of the protein kinase superfamily. We further identified more than 1300 distinct phosphopeptides on 563 nonprotein kinases (Table I). Notably, about 30% of the protein kinases and 50% of the other proteins were not detected in our previous analysis of kinase-enriched fractions from HeLa S3 cells. Using computational PTM scoring, more than 1200 phosphorylation sites on kinases and 900 on other proteins could be localized with high confidence (Table I and supplemental Table 6). All identified phosphopeptides can be accessed through the Phosida database, which also provides links to annotated fragmentation spectra harboring the identified phosphopeptides (35). The more than 200 identified protein kinases were rather evenly distributed in the dendrogram of the human kinome (Fig. 5A). This indicated that VI16832 did not select for sequence-related determinants restricted to certain subsets of the kinome and further highlighted the utility of this reagent as broadly kinase-selective enrichment tool for sensitive PTM analysis. The Venn diagrams show the cell line distribution of the identified phosphorylation sites on protein kinases. Importantly, the analysis of VI16832-retained proteins from the three cancer cell lines considerably increased the overall number of identified phosphorylation sites on protein kinases (Fig. 5B). However, as kinase-enriched fractions from the different cell extracts were subjected to individual, qualitative phosphopeptide mapping experiments, selective identifications did not necessarily indicate cell-type specific differences but could also be due to run-to-run variability inherent to LC-MS in the data-dependent acquisition mode. Phosphorylation sites were also identified on various nucleotide-binding enzymes as well as other proteins including regulatory subunits of protein kinases (Table I and supplemental Fig. 4A and supplemental Table 6). Although the retained non-kinase phosphoproteins accounted for 60 -70% of all identifications, they were typically found with fewer phosphopeptides per protein. Moreover, on average, peptides from these proteins exhibited considerably lower signal intensities than protein kinase-derived peptides. Considering the sum of all phosphopeptide intensities as a measure for VI16832-enriched protein amount, more than 80% was derived from protein kinases. This value demonstrates the remarkable kinase selectivity of the phosphoproteomics workflow presented here (supplemental Fig. 4B).
When we analyzed the distribution of phosphoserine, -threonine, and -tyrosine sites, we found tyrosine phosphorylation to be more frequent on protein kinases than on other identified proteins (Table II). Even more intriguingly, tyrosine phosphorylation accounted for more than one third of the identified sites in the activation segment regions of  protein kinases (Table II and Fig. 6). Activation segment phosphorylations often induce conformational changes that allow protein kinases to adopt and maintain their catalytically active states. Therefore, they are involved in the regulation of many members of the protein kinase superfamily (9). In this context, our results highlight the key role of tyrosine phosphorylation for the activity control of kinasemediated cellular signaling. This is remarkable considering the low overall abundance of Tyr(P), which by comparison accounted for just 1.8% of all identified phosphorylation sites in the large-scale analysis of growth factor-induced signaling in total cell extracts by Olsen et al. (15). Moreover, as activation segment phosphorylation can serve as a direct biochemical read-out for cellular kinase activity; VI16832mediated enrichment provides an experimental basis to monitor these key signaling events for more than 50 kinases per cell line (Fig. 6).
The comparison of the current dataset with the three largest previous phosphoproteomics studies on human proteins revealed both overlapping as well as complementary information about phosphorylation sites on protein kinases (supplemental Fig. 4C). Compared with a large-scale study of epidermal growth factor signaling by Olsen et al. (15) in which 254 out of 5674 confidently assigned phosphorylation sites mapped to protein kinases; about four times more sites were found on protein kinases. Moreover, we identified almost twice the number of phosphorylation sites on protein kinases as found in a recent large-scale study on mitotic phosphorylation (18). The total number of 1201 phosphosites that could be assigned in this study even sur- FIG. 5. Phosphorylation site mapping across the human kinome. A, protein kinase-derived phosphopeptides identified upon enrichment by VI16832 affinity chromatography are marked in the dendrogram of the human kinome (2). Colors indicate in which of the analyzed cell lines phosphopeptides were found. The kinome tree illustration was adapted with permission from Cell Signaling Technology, Inc. B, cell line distribution of the identified phosphorylation sites on protein kinases, which could be confidently localized to specific serines, threonines, or tyrosines (class I sites; p Ͼ 0.95). Numbers are shown for all phosphorylations sites combined (pSTY) and separately for phosphoserine, -threonine, and -tyrosine (pS, pT, and pY).
passed the 1007 confidently localized phosphorylation events reported in our previous analysis of cell cycle-regulated changes in kinase-enriched fractions (7). Moreover, despite an overlap of 555 phosphosites between these two studies, we find almost 650 additional site-specific phosphorylations on protein kinases. In contrast to these earlier quantitative studies using more complex, SILAC-encoded samples, the analyses in our current study were done in a qualitative manner with the goal to promote comprehensive phosphorylation site mapping, which, to the best of our knowledge, resulted in the most extensive phosphoproteomics analysis of protein kinases reported to date. DISCUSSION In our present study, we immobilized kinase inhibitors from the pyrido[2,3-d]pyrimidine class of compounds to generate affinity resins for the pre-fractionation of protein kinase-enriched sub-proteomes (25). Cellular target capture was compared for three immobilized pyrido[2,3-d]pyrimidine derivatives using SILAC-based quantitative MS (41). The VI16743 and VI16832 resins were particularly efficient as purification tools, as these affinity resins were capable of retaining more than 130 distinct protein kinases from a single cell extract. Thus, they represent straightforward and easy-to-use purification reagents for a considerable subset of the expressed human kinome. Due to a lack of affine antibodies, many protein kinases are difficult to study by conventional immunoprecipitation approaches. In such cases, small molecule-based isolation can provide a straightforward alternative for targeted analysis; for example when sample processing and MS analysis is restricted to the molecular weight region comprising the kinase-of-interest. Thus, the datasets reported here specify a large number of protein kinases amenable for focused signal transduction analysis, which might involve quantitative MS to monitor PTM regulation upon different types of cell treatment.
The quantitative MS strategy for the comparison of pyrido[2,3-d]pyrimidine derivatives represents a generic approach. It can easily be extended to characterize other kinase-selective capture molecules, which retain subsets of the expressed kinome not efficiently purified by the VI16743 and VI16832 resins. Such comparative analyses should be useful to further improve on previously described multi-resin approaches, which combine immobilized kinase inhibitors with distinct target profiles for maximal coverage of the expressed kinome (7,22,23).
Kinase-selective proteomics focuses on a subset of the proteome, which is of high relevance for targeted therapeutic intervention in diseases such as human cancer. Our comparative analysis of three different cancer cell lines demonstrates quantitative profiling of kinase expression in a compact experimental format. Although only three cell lines can be compared by SILAC in a single experiment, further multiplexing is possible by merging data from parallel triple-labeling experiments through a shared reference sample (15). SILAC-based kinase profiling across larger collections of cancer cell lines could reveal subgroup-specific expression patterns, which might help to adjust targeted therapeutic interventions to the kinases involved in disease progression. Conceptually similar strategies could also be applied to the analysis of protein kinase-enriched fractions from primary tumor specimens; for example by employing chemical tagging with iTRAQ reagents as an alternative to SILAC-based quantification. In a recently published large-scale study by Rikova et al. (8), non-small cell lung cancer cell lines and tumors were comparatively analyzed for their phosphotyrosine-containing proteomes. Remarkably, this survey allowed clustering of the analyzed samples into different groups with distinct tyrosine kinase patterns. Compared with phosphotyrosine-directed approaches, proteomics of kinase-enriched sub-proteomes can be expected to provide both overlapping information (with respect to tyrosine kinases) as well as complementary data regarding serine/threonine kinases (and also other nucleotidebinding proteins). Protein expression data from the serine/ threonine kinase branches of the human kinome likely reveal further insights into cancer cell biology, as exemplified by findings that overexpression of mitotic serine/threonine kinases, such as Aurora A and B, polo-like kinase1 and NEK2, can result in chromosomal instability and has been implicated in malignant transformation (46,47). The pyrido [2,3-d]pyrimidine inhibitor resins described in this study are particularly useful for such sub-proteome surveys due to their ability to capture these key mitotic enzymes as well as many other additional serine/threonine kinases. Here, we have further exploited the enrichment abilities of the pyrido[2,3-d]pyrimidine-based capture ligand VI16832 to extensively map phosphorylation sites on protein kinases. Results from these largescale analyses provide a multitude of new starting points for further functional studies on cellular kinase regulation. Combination of kinase-selective enrichment, quantitative MS, and phosphopeptide purification exhibits considerable potential for future studies, as kinome-wide comparisons on both the protein as well as the post-translational level across different cancer cell lines or tumor samples might significantly expand FIG. 6. Identified phosphorylation sites in the kinase activation loop region of protein kinases. Phosphopeptides with at least one confidently assigned phosphorylation site (class I, in bold red) in at least on cell line (marked by X) are shown and highlighted in yellow. Additional class III phosphorylation sites are shown in light red. Cell lines in which the same sequence and number of phosphorylations was found in the absence of site-determining information are indicated by X. The conserved tripeptide motifs DFG and APE, which define the borders of the activation segment, are highlighted in turquoise and green, respectively. In case activation segment phosphopeptides are shared among different members of the expressed kinome, alternative protein kinases are indicated in parentheses. our knowledge about kinase drug targets and their oncogenic activities on a system-wide level.