|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
,
,¶

,

,¶¶
From the
Howard Hughes Medical Institute, Division of Biology, California Institute of Technology, Pasadena, California 91125,
Department of Biochemistry and Molecular Biology, UBC Centre for Proteomics and || Department of Statistics, Michael Smith Laboratories, University of British Columbia, Vancouver, British Columbia V6T 1Z4, Canada, and 
Department of Genome Sciences, Health Sciences Center, University of Washington, Seattle, Washington 98195
| ABSTRACT |
|---|
|
|
|---|
) (or only its UIM (ubiquitin-interacting motif) polyubiquitin-binding domain (uim
)). Twenty-seven percent of the UPS substrates accumulated as ubiquitylated species in rpn10
cells, whereas only one-fifth as many accumulated in uim
cells. These findings underscore a broad role for Rpn10 in turnover of ubiquitylated substrates but a relatively modest role for its ubiquitin-binding UIM domain. This approach illustrates the feasibility of systems-level quantitative analysis to map enzyme-substrate networks in the UPS.
The yeast Rpn10 protein is a stoichiometric component of the 26 S proteasome and was the first protein found to bind polyubiquitin chains (8). Its amino-terminal domain consists of a conserved von Willebrand A (VWA) motif that docks Rpn10 to the proteasome. Recruitment of ubiquitin chains to Rpn10 is mediated by the 20-amino acid ubiquitin-interacting motif (UIM) domain located near its carboxyl terminus (9). S5a protein, the human Rpn10 ortholog, contains a second UIM domain that is thought to mediate the recruitment of other receptor proteins (10). The general impact of Rpn10 on the turnover of proteasome substrates is not known. Given that budding yeast rpn10
mutants are viable (11, 12), Rpn10 may be required for the turnover of only a small subset of ubiquitylated proteins, or Rpn10 may target a large number of substrates that, in its absence, are targeted by other proteasomal receptors (e.g. Rad23 or Dsk2). Even less well understood is the contribution of the two domains of Rpn10 to substrate turnover. Complete deletion of RPN10 (i.e. rpn10
) stabilizes the cell cycle regulator Sic1 and the transcription factor Gcn4. Paradoxically removal of the UIM domain by itself (i.e. uim
) has no discernable effect on either of these substrates (5)2 suggesting that Rpn10 function may rely solely on an uncharacterized biochemical activity associated with its VWA domain.
To understand fully the biological roles of protein ubiquitylation and the functions of individual components of the UPS such as Rpn10, it will be necessary to identify UPS substrates on a proteome-wide scale. Several studies have started to address this challenge using mass spectrometry to analyze the ubiquitin proteome (13–18). Although these seminal studies illustrate that shotgun mass spectrometry is a powerful tool that can provide a systems-level view of the ubiquitin proteome, it is clear that application of this technology to the ubiquitin system remains in an embryonic state. For example, no proteomics study has yet succeeded in identifying even one of the 11 yeast G1 and mitotic cyclins that are well known substrates of the UPS. Indeed many ubiquitin conjugates identified in proteomics experiments might be stably accumulating species that are not substrates of the UPS. To obtain more focused information from shotgun mass spectrometry experiments, we and others have previously applied subtractive approaches to identify conjugates that accumulate in rpn10
(15) and in npl4ts but not ubc7
mutants (18). Although this strategy allowed the identification of several ubiquitylated proteins, by its nature the subtractive approach excludes substrates whose accumulation is only partially dependent upon a given factor. This is a major concern given the redundancy of many UPS pathways (5, 19). No fewer than six ubiquitin ligases (Mdm2, Pirh2, p300, PARC, Cul7, and Cop1) have been implicated in p53 regulation (20–22), and at least three different ubiquitin chain receptors contribute to turnover of ubiquitylated Sic1 (5). Clearly a method that allows for more subtle quantitative comparisons is needed.
In this study, we adapted stable isotope labeling techniques that have been used previously to address a variety of biological problems to perform relative quantitative analysis of polyubiquitylated proteins in two distinct cell cultures. By applying a statistical approach based on p and q values, we were able to identify ubiquitylated proteins whose levels are altered in response to a specific perturbation (chemical or genetic). After validating the approach, we used this method to identify putative substrates of the proteasome and to determine the contribution of the Rpn10 proteasome receptor pathway in the targeting of UPS substrates. We further dissected the function of Rpn10 by assessing the role of its UIM domain.
| MATERIALS AND METHODS |
|---|
|
|
|---|
and uim
(5) were crossed with RJD3314 to obtain RJD3315 and RJD3318, respectively.
Isotopic Labeling and Two-step Purification—
Cells were grown in YNB-D (0.17% yeast nitrogen base without amino acids and ammonium sulfate, 5% dextrose, 20 µg/ml ampicillin) supplemented with 0.5% ammonium sulfate (14N) at 25 °C to an A600 between 0.5 and 1, washed three times with YNB-D only, and then diluted to an A600 of 0.008 in 2 liters of YNB-D with 0.5% 14N- or 15N-labeled (>98%, Cambridge Isotope Laboratories) ammonium sulfate. The two different isotopically labeled cell cultures were grown in parallel at 25 °C to an A600 of 1 (seven generations), and a drug treatment was applied for 30 min where indicated. Each isotopically labeled culture was then harvested by centrifugation and resuspended in 400 ml of ice-cold TBS. The A600 was carefully measured, and equal amounts of cells for each isotope variant were mixed together, centrifuged, resuspended in 200 ml of ice-cold TBS with 1 mM 1,10-phenanthroline and 10 mM iodoacetamide, recentrifuged, and snap frozen in liquid nitrogen. The two-step purification and tryptic digestion of ubiquitin conjugates was carried out as described previously (23) with the following minor changes: cells were broken in 40 ml of lysis buffer after which 1.8 mg each of recombinant GST-Rad23 and GST-Dsk2 coupled to Sepharose were used for the first step and 100 µl of nickel magnetic bead slurry (Promega) were used for the second. Typically only one-half of the sample was analyzed in a single LCQ analysis (corresponding to 1 liter of each labeled cell sample), whereas with the LTQ, material corresponding to 500 ml of each labeled culture was analyzed. An aliquot (10 µl) of cleared total cell extract was collected after the first centrifugation, precipitated (25), and resuspended in UB buffer (8 M urea, 100 mM NaH2PO4, 10 mM Tris-HCl, pH 8.0). 5 µg of protein were subjected to tryptic digest, mass spectrometry, and quantitative analysis to calculate the average ratio of isotopically labeled proteins in the total cell extract (median of 14N/15N ratio values).
Mass Spectrometry Analysis, Computation, and Statistical Analysis—
The proteolytically digested proteins analyzed in Fig. 1 were evaluated by multidimensional chromatography coupled in line to ESI-MS using an LCQ-Deca mass spectrometer (ThermoFisher) as described previously (15, 26). Samples in Figs. 3–5 were analyzed with an LTQ ion trap (ThermoFisher) using a modified protocol (supplemental information). Sequest and DTASelect (27, 28) were used to analyze the mass spectrometry profiles using two sets of parameters for 14N and 15N searches (supplemental information). Proteins identified by at least two peptides (Supplemental Table 2) were retained for the quantitative analysis, and peptides (14N and 15N) were separately analyzed with RelEx (versions 0.91 and 0.92 (29)) using the following parameters: (i) width option was ±75 scans with LCQ and ±25 scans with LTQ data; (ii) for purified material, ratio correction was applied using the (median ratio)–1 value derived from total cell extract; (iii) regression filter was applied (minimum correlation, 0.7 at 1 and 0.4 at 10); and (iv) signal to noise filter was applied (minimum of 3 for LCQ and 5 for LTQ data). The files from the two separate analyses (14N and 15N) were merged and sorted using RelEx (Supplemental Table 3).
|
|
|
2) of log ratios was estimated from the data observed in the reference experiment (supplemental information), which specified a null distribution for log ratios, i.e. a normal distribution with mean 0 and variance
2/n where n is the peptide number. For each ORF quantified in RelEx, the observed log ratio and peptide number were compared with this null distribution to obtain a two-sided p value.
![]() |
To permit the control of the false discovery rate (FDR) (30) in subsequent lists of putative proteasome substrates, we computed adjusted p values, also known as "q values" (31),
![]() |
where p(i) is the ith order statistic of the observed p values,
0 is the (estimated) proportion of unresponsive proteins, and m is the number of proteins quantified. To calculate the recovery rate associated with a q value cutoff equal to q(i), note that, on average, i x q(i) of the i"discoveries" are false and conversely that i x (1 – q(i)) of the i discoveries are true. Overall m x (1 –
0) ORFs are thought to be truly responsive; therefore we can estimate the fraction of these that have been recovered in any given list of putative proteasome substrates.
![]() |
All statistical analysis was performed using the software environment R (The R Project for Statistical Computing) and the q value package.3 Analysis scripts are provided in the supplemental information. Frequency distribution, Deming regression, and Fisher's test analyses were performed with Prism 4.0 (GraphPad), and list analysis was performed with Excel (Microsoft) using filter functions.
One-step Purification/Immunoblot Analysis of Ubiquitin Conjugates—
For IMAC purification of His8-ubiquitin, cells transformed with RDB1851 (15), a URA3-based plasmid constitutively expressing His8-ubiquitin, were grown in 100 ml of SD-URA medium (0.67% yeast nitrogen base, 5% dextrose) at 30 °C to an A600 of 1 and then treated for 30 min with MG132 (20 µM) or DMSO only (160 µl). Samples were prepared as described previously (15) with the following modifications: trichloroacetic acid was added to a 10% final concentration to the cell cultures, and 50 µl of nickel magnetic bead slurry (Promega) were used. TAP tagged proteins were detected using the anti-calmodulin-binding peptide antibody (Upstate; Fig. 2) or the anti-TAP polyclonal antibody (Open Biosystems; Fig. 4), and Cdc28 was detected for loading control in total cell extract using the PSTAIRE antibody (Santa Cruz Biotechnology).
|
|
| RESULTS |
|---|
|
|
|---|
To identify proteasome substrates, we had to articulate exactly what constituted compelling evidence that a protein is truly enriched or depleted in the MG132-treated cells. An absolute threshold, such as requiring an observed 2-fold increase or decrease, suffers from two disadvantages. 1) It does not normalize the observed ratio by comparison with the background variability arising purely from biological and experimental noise, and 2) it does not acknowledge the increased confidence that arises when a protein is represented by, for example, 10 distinct peptides as opposed to just three. To avoid these pitfalls, we opted for a statistical approach. The quantitative data taken for each protein is the average log ratio of all sequenced peptides derived from it using a base 2 logarithm. We obtained a "null distribution" of log ratios defined as the distribution of log ratios observed in a reference experiment in which both the 14N- and 15N-labeled cells received the same mock treatment (Fig. 1B, top panel). Data were obtained for peptides representing 159 different proteins (all quantitative data are provided in Supplemental Table 3). Based on the assumption that log ratios in a null distribution will be centered at 0, it follows that the null distribution for a protein represented by averaging log ratios from n peptides will also be centered at 0 with variance
2/n. Our distributional assumptions (normal distribution, centered at 0) and the implied relationship between variance and peptide number (
2/n) are consistent with the data observed in the reference experiment (Fig. 1, B and C, top panels). Using an averaging approach that accounted for differences in peptide number (supplemental information), we estimated the variance (
2) of the null distribution to be 0.24.
To assess the impact of proteasome inhibition on the composition of the ubiquitin proteome, we treated cells labeled with 14N with the proteasome inhibitor MG132 for 30 min, whereas DMSO was added to 15N control cells. Log ratios were obtained for 140 proteins, and as expected, the observed distribution was shifted toward positive values and was generally more spread out, suggesting that the treatment induced meaningful differences in protein abundance with evidence for a higher prevalence of enrichment (ratios >1 or positive log ratios) than depletion (Fig. 1B, middle panel; examples of ratio calculation for two 14N-enriched peptides are shown in Supplemental Fig. 1). To evaluate the reproducibility of the ratio measurements and ensure that the enrichment of proteins in the 14N sample was due to the proteasome inhibitor, we performed a "reciprocal" experiment in which MG132 was added to the 15N-labeled cells and DMSO was added to 14N cells. The observed data distribution was as expected with a general shift to negative log ratio values and a much greater range of values relative to the reference experiment (Fig. 1B, bottom panel). Further evidence of reproducibility is found in the high correlation between the peptide numbers of proteins appearing in both the original MG132 analysis and the reciprocal experiment (Supplemental Fig. 2A). Moreover a Deming regression analysis of the log ratios exhibited by the proteins identified in the two experiments yielded a fit that is compatible with the ideal x = y relationship predicted by theory (Supplemental Fig. 2B). Taken together, this evidence confirms that the MG132 treatment similarly affected ubiquitylation levels of proteins analyzed in both experiments.
For each protein, we obtained a two-sided p value by using the null distribution to compute the probability that its log ratio in the MG132 experiment could arise by stochastic fluctuation (Supplemental Table 4). With this approach, one expects that ubiquitylated proteins that are insensitive to MG132 treatment will exhibit p values close to 1, whereas ubiquitylated proteins that exhibit robust accumulation upon MG132 treatment will exhibit p values close to 0 (in Fig. 1C the dotted lines indicate log ratios that have an associated p value of 0.1). Because we computed and interpreted p values for hundreds of proteins simultaneously, it was desirable to adjust them such that by thresholding the adjusted p values at some fixed level we could specify an appropriate error rate for the entire collection of p values. We used a method of p value adjustment that provides control of the FDR (30) defined as the expected proportion of discoveries (i.e. proteins claimed to be enriched or depleted due to treatment) that are actually false discoveries (i.e. proteins unaffected by treatment). These adjusted p values are often called q values (31), and a useful and interpretable by-product of the q value computation is an estimate of the proportion of studied proteins that are truly unresponsive to the experimental perturbation, often denoted
0. Further computation also allowed us to estimate the "recovery rate" associated with each potential q value cutoff defined as the proportion of proteins that respond to the perturbation that would be discovered. Supplemental Table 4 contains p and q values,
0, and recovery rates for all experiments, and Fig. 1D represents the results for the MG132-analysis. We chose to set the FDR at the level of 5%, which produces a high quality list of candidates (on average, only 5% will be false "hits") while also promising a very high recovery rate (estimated to be
80% in both MG132 analysis and reciprocal experiments). To identify those proteins that are the best candidates for proteasome substrates, we then narrowed our focus to proteins enriched in cells treated with MG132 (Supplemental Table 5).
Validation of UPS Substrates—
To evaluate the legitimacy of our approach, we sought to validate candidate proteasome substrates using a completely orthogonal method. Yeast cells expressing His8-ubiquitin and carrying a TAP tagged allele at the endogenous locus of each tested candidate gene were treated with or without MG132, lysed, and subjected to IMAC followed by immunoblotting for the TAP tag (Fig. 2A). Control experiments were done in parallel with an untagged strain and in the absence of His8-ubiquitin expression to evaluate specificity. Upon addition of MG132, increased ubiquitylation was observed for 17 of the 18 tested candidates (Fig. 2B). Notably several proteins with q values close to the 0.05 threshold (e.g. Rps8A) were shown to accumulate as ubiquitylated species after proteasome inhibition. These data provide strong evidence that the list of proteins derived by our approach is highly enriched for proteasome substrates.
The UPS Proteome—
During the course of this study, we acquired an LTQ linear ion trap mass spectrometer that enabled us to obtain quantitative information on more than 500 proteins in a single 10-h analysis (Fig. 3A), thereby revealing 225 proteins that were specifically enriched in MG132-treated cells (using the same approach as in Fig. 1; Supplemental Table 6). Thus, 3–4% of all proteins encoded in the yeast genome detectably accumulated as ubiquitylated species when the proteasome was inhibited. Using this more substantial collection of candidate UPS substrates, we performed in silico analyses to determine the functions, localization, and expression level of proteins targeted to the UPS. Candidate UPS substrates were distributed across many different functional classes (Fig. 3B) with proteins involved in small molecule metabolism accounting for the biggest fraction (24%). Notably we also identified many cell cycle control proteins (10%), including Cdc5, Cdc20, Clb2, Cln1, Cln2, and Far1, which are known substrates of the proteasome (34–38). Comparison of our dataset with a global green fluorescent protein localization study (39) revealed that UPS substrates were recovered from all major cellular compartments but were notably de-enriched in mitochondria and peroxisomes (Fig. 3C). This is not surprising given that there is no known retrotranslocation pathway for the proteasome-dependent degradation of mitochondrial and peroxisomal proteins that are sequestered from the cytoplasm (note that the few identified mitochondrial proteins may have been ubiquitylated prior to their translocation into the organelle). When we assessed the abundance of UPS substrates using a previously reported global analysis of protein expression levels (40), we found that highly abundant proteins (>105 molecules/cell) were enriched 6-fold in the pool of proteasome substrates (Fig. 3D). Although our dataset was slightly biased against proteins present at less than 1000 molecules/cell (i.e. 0.7-fold enrichment), we nevertheless detected some of the least abundant yeast proteins, including Gcn4, Ecm3, and Cpr4 (estimated to be present at less than 50 molecules/cell (40)). Taken together, these data indicate that our approach provided a "broad and deep" view of the ubiquitylated proteins that are targeted to the proteasome for degradation regardless of their function, localization, or abundance.
The Rpn10-dependent Ubiquitin Conjugates—
Previous studies have resulted in the identification of several ubiquitylated substrates that depend on the proteasome receptor Rpn10 for targeting and degradation (5, 15). However, it is unclear which portion of ubiquitylated proteins is influenced by Rpn10. Given that rpn10
mutants are viable and have modest phenotypes, it is possible that Rpn10 contributes to the degradation of only a small fraction of proteasome substrates. A competing hypothesis is that Rpn10 contributes to the degradation of a relatively large number of proteasome substrates, but these substrates can use other receptor pathways when Rpn10 is absent. Such substrates might be missed in a subtractive search for substrates that are found uniquely in rpn10
but not wild-type cells (15) because this type of search is biased toward identifying those substrates that poorly engage an alternative pathway in the absence of Rpn10. However, a quantitative method that can detect changes in accumulation of ubiquitylated proteins might reveal substrates influenced by RPN10 even if their degradation is only modestly retarded in the absence of Rpn10 due to their ability to engage redundant targeting mechanisms.
To address the question of how many ubiquitylated species exhibit a change in accumulation upon loss of Rpn10, we compared the ubiquitin conjugate proteomes of wild-type and rpn10
cells. Cells lacking RPN10 grown in 14N and wild-type cells grown in 15N were mixed together, lysed, and subjected to sequential affinity purification of ubiquitin conjugates followed by MudPIT and quantitative analysis as described in Fig. 1. In this analysis, quantitative information was obtained for 530 proteins (Fig. 4A, middle panel). We identified 122 proteins that were specifically enriched in rpn10
cells (Supplemental Fig. 3A and Table 7). Among these proteins, both Sic1 and Gcn4, which require RPN10 for turnover at normal rates (5, 15), were identified. Thus, these proteins serve as internal standards to lend confidence that this approach can successfully identify proteins whose degradation is influenced by RPN10. It is important to note that this does not mean that these substrates depended, in an absolute sense, upon Rpn10 for their turnover. Rather in the absence of Rpn10, the amount of these substrates that accumulated as ubiquitylated species was higher. This can be due to several reasons. The most likely is that degradation of the ubiquitylated intermediate was slowed in the absence of Rpn10. However, it is also possible for some substrates that loss of Rpn10 repressed editing by deubiquitylating (DUB) enzymes or increased the rate of ubiquitylation. Regardless of the exact mechanism at play, the quantification approach used here reveals that Rpn10 has a broad impact on the UPS.
Quantitative Analysis of the UIM Domain of Rpn10—
We sought to determine whether this approach could also uncover UPS substrates whose metabolism was influenced by the UIM domain of Rpn10. We had previously failed to identify UIM-dependent targets using a subtractive approach.4 Deletion of UIM yields no known cellular phenotype unless the uim
mutation is combined with other mutations that compromise the UPS (5). This scenario is compatible with at least two hypotheses. On the one hand, the UIM domain of Rpn10 might only target a small portion of the UPS proteome. Alternatively the UIM domain might not be involved in substrate degradation unless other targeting mechanisms are compromised.
To address the role of the UIM domain, we compared the ubiquitin conjugate proteomes of wild-type and uim
cells. Mutant uim
cells grown in 14N and wild-type cells grown in 15N were processed as for the RPN10 analysis (Fig. 4A, right panel). We found that the distribution of log ratios, when compared with that obtained in the reference experiment, exhibited a shift to the right and increased spread but to a much lesser extent than that seen in the RPN10 and MG132 analyses. We performed a reciprocal experiment to ensure that this increase was specific to the UIM deletion and found a very similar result (data not shown). Our results suggest the existence of specific proteins that accumulate as ubiquitin conjugates when the UIM domain is mutated. Because the perturbation was relatively subtle (1 –
0 = 15%), low q value thresholds (e.g. 0.05) were associated with poor recovery rates (Supplemental Fig. 3A). Hence in this particular case, we used a less restrictive threshold (<0.15) to increase the recovery rate (>75%). Among the 35 proteins that specifically accumulated in uim
cells (Supplemental Table 8), we selected three candidate genes for validation according to their availability (CPA1, DSE1, and YBR071W). Using the same approach as in Fig. 2A, we were able to confirm an increase of ubiquitin conjugate levels in uim
cells for these three candidates (Fig. 4B; note that both DSE1 and YBR071W have q values between 0.1 and 0.15). To date, no such protein has ever been described. This result illustrates the ability of shotgun-based isotopic quantification to identify UPS substrates whose metabolism is influenced by a particular functional element (i.e. the UIM domain of Rpn10) even when mutation of that element has little or no discernable phenotype.
Proportional Impact of the Rpn10 Deletion on Proteasome Substrates—
To gain a better idea of how broad a contribution Rpn10 makes to the operation of the UPS, we sought to determine what fraction of UPS substrates identifiable by our method (i.e. those conjugates that accumulated upon treatment of cells with MG132) also accumulated upon deletion of RPN10. The estimated proportion of proteins affected by the loss of RPN10 (1 –
0 = 45%) is lower than in MG132-treated cells (Fig. 3A; 1 –
0 = 75%). This indicates that, on a proteome-wide level, the loss of RPN10 has less impact on polyubiquitylated proteins than does inhibition of the proteasome. This is not surprising given the existence of multiple receptors that guide ubiquitylated proteins to the proteasome. To identify the UPS substrates whose degradation is influenced by RPN10, we focused on the 225 proteins enriched in MG132-treated cells (Supplemental Table 6) and evaluated their enrichment in the rpn10
analysis. Among the 225 ubiquitylated species that accumulated in MG132-treated cells, 159 were also present in the rpn10
versus WT analysis, including 43 proteins that were enriched in the mutant cells (Fig. 4C). This degree of overlap (159/225 = 70%) is anticipated by the observation that any single MudPIT run reveals 
of the identifiable proteins in a complex sample (41). This result suggests that metabolism of 27% (43 of 159) of all UPS substrates detectable in the two separate analyses was influenced by Rpn10 (Fig. 4C).
To further address whether a particular class of substrates was specifically affected by Rpn10 absence, we compared the distributions of the different functional classes. As in the previously analysis (Fig. 3B), a large array of classes were represented among the proteins enriched in Rpn10
cells, suggesting a broad function of Rpn10 in the UPS (Supplemental Fig. 3B). However, some variations could be observed when the two analyses were compared (RPN10 versus MG132). Notably fewer proteins associated to small molecule metabolism were identified in rpn10
cells (Supplemental Fig. 3B). Concurrently the proportions of proteins functioning in translation and DNA-associated proteins were significantly enhanced (Fig. 4D). This suggests that Rpn10 may be more important for the degradation of that particular subset of ubiquitylated proteasome substrates.
Proportional Impact of the UIM Domain—
We next performed a comparative analysis, as for the prior analysis of rpn10
, to gauge the influence of the UIM domain of Rpn10 on UPS substrates. In this case, of the 225 putative UPS substrates found to accumulate in the MG132 experiment, 106 were detected in the uim
versus WT analysis. Of these, only eight were enriched in uim
cells, representing
7.5% (8 of 106) of the UPS-pooled substrates (Fig. 4E, left diagram). This shows that deletion of the UIM domain has far less impact than deletion of RPN10. Indeed when we considered the overlap between the RPN10 and UIM analysis, only about 22% of proteins enriched in rpn10
cells were also enriched in uim
cells (11 of 51; Fig. 4E, right diagram). The implication of this result is that, of the UPS substrates influenced by Rpn10, only about one-fifth are sensitive to loss of the UIM domain. This shows that Rpn10 functions on different pools of proteasome substrates using two distinctive mechanisms. The dominant function of Rpn10 is provided by the VWA domain that does not directly bind to ubiquitin, whereas the UIM domain, which can bind to polyubiquitin chains, targets a smaller portion of proteasome substrates.
| DISCUSSION |
|---|
|
|
|---|
The Rpn10 Pathway—
By comparing datasets for ubiquitin conjugates that accumulate when the proteasome is inhibited with MG132 (putative UPS substrates) and those that accumulate in rpn10
cells, we estimated that Rpn10 influenced the steady-state level of ubiquitin conjugates for up to
27% of all UPS substrates (Fig. 5). The simplest interpretation of this result is that Rpn10 contributes to the turnover of a significant number of ubiquitylated proteins. However, we cannot exclude the possibility that in some cases the role of Rpn10 in turnover is indirect or that the increase in conjugates is due to increased ubiquitylation or decreased deubiquitylation. Interestingly there were also several proteins that were significantly de-enriched in rpn10
cells suggesting that Rpn10 function might be broader than suspected. Given that rpn10
cells exhibit mild phenotypes (12) this important impact of rpn10
in our experiments is somewhat surprising. Presumably the "Rpn10-dependent" candidate substrates reported here do not rely exclusively on Rpn10 for delivery to the proteasome. Instead it is more likely that substrates that use Rpn10 can also use other receptor pathways, albeit with reduced overall efficiency, when Rpn10 is absent as is the case for Sic1 (5).
The role of the UIM domain of Rpn10 in the UPS has been perplexing. Although it was the first polyubiquitin binding domain to be identified, it remained unclear whether it plays any role in proteolysis in wild-type cells. The Rpn10-dependent substrates Sic1 and Gcn4 are unaffected by the UIM deletion unless it is combined with deletions of other receptors such as RAD23 (5).2 In this analysis we identified several ubiquitylated substrates that accumulated in uim
cells (Supplemental Table 8). To our knowledge, these represent the first physiological UPS substrates that have been shown to be affected by loss of the UIM domain by itself. This shows that there is a dual role for Rpn10 function at the proteome-wide level (Fig. 5). Those identified proteins can now be used as "indicator proteins" to unravel the physiological role of the UIM domain. The notion of using mass spectrometry to identify an "indicator protein" should be readily applicable to other poorly understood components of the UPS.
Of the proteins accumulating as ubiquitylated species upon RPN10 deletion, only one-fifth accumulated upon the selective deletion of the UIM domain. This is consistent with the observation that rpn10
cells have a more severe growth defect than uim
cells (12). The substrates that accumulated in rpn10
but not uim
cells presumably are dependent upon the VWA domain. The VWA domain enhances the degradation of ubiquitylated Sic1 docked to the proteasome by the Rad23 receptor, but the biochemical function of the VWA domain remains unknown. Interestingly ribosome and DNA-associated proteins were found to be significantly enriched in the Rpn10 analysis but not in the UIM analysis (data not shown). These proteins are part of large and tight complexes. It is possible that the VWA might participate in the structural conformation of the 19 S subunit of the proteasome, favoring a larger entry site for the substrate or better alignment with other proteasomal functions (e.g. Rpn11-DUB or the chaperone activities of Rpt).
The UPS Proteome—
Our analysis revealed a large number of ubiquitylated proteins that accumulated upon inhibition of the proteasome, including relatively low abundance cell cycle control proteins whose activity is known to be regulated by proteolysis (G1 cyclins Cln1, Cln2, and Pcl1; B-type cyclin Clb2; and Cdk inhibitors Sic1 and Far1). Interestingly we also found and confirmed that abundant, presumably stable proteins (ribosomal protein Rps8A, enolase, and phosphoglycerate kinase) accumulated as ubiquitylated species upon inhibition of the proteasome with MG132 (Fig. 2B). It is possible that these proteins succumbed to quality control mechanisms that eliminate improperly translated, misfolded, or damaged proteins (42, 43). If so, the methods reported here may be useful for studying on a proteome-wide scale the chaperones involved in protein folding and assembly. An important feature of our method is that we use consecutive affinity purification steps to focus our analysis on proteasome substrates and bias against contaminating proteins. The effectiveness of this strategy is underscored by the fact that >22% of our candidate UPS substrates are present at <1000 molecules/cell, whereas only 6.8% of proteins identified in a single MudPIT analysis of crude extract are of equivalent abundance (41). The positive attributes of this method suggest that it should be generally useful for identifying targets of specific ubiquitin ligases and deubiquitylating enzymes.
General Issues in Quantitative Profiling of UPS Substrates—
The accumulation of a particular substrate in any given experiment was likely guided by several factors, including the amount of co-accumulating substrates, the relative activity of individual ubiquitin ligases and DUB enzymes that act upon the substrate, the relative ability of different ubiquitylation pathways to incorporate His6-ubiquitin or compete for free ubiquitin, and the degradation rate of the ubiquitylated protein. Because of these factors, we believe there is little value in investing great significance in individual ratios when drawing conclusions in a proteomics manner. Thus, we limited our analysis to classifying proteins as being enriched or not.
In many cases the enrichment values for UPS substrates were lower than the threshold of 4 (or 2 when expressed in log2 scale) often applied in microarray analyses of mRNA expression (44). In our experiments, the total enrichment for ubiquitin recovered from MG132-treated compared with untreated cells averaged only
1.9-fold (in Fig. 1). Thus, one would expect the average ubiquitylated substrate to accumulate 2–3-fold. This value agrees with immunoblot analysis of ubiquitin conjugates in total cell extracts. Although we could have increased this value by longer treatment with MG132, excessive accumulation of conjugates runs the risk of depleting cellular ubiquitin, thereby causing its redistribution to different proteins (45, 46). Remarkably despite the limited extent of accumulation of total ubiquitin conjugates, large variations in accumulation of specific conjugates were seen.
We used a statistical analysis based on q values to identify proteins for which the ubiquitylation level was altered. The reference experiment, in which two biologically equivalent pools were compared, provided crucial information regarding the typical variation arising from simple biological variability and experimental noise. Protein-specific p values provide a statistical measure of the inconsistency between the observed log ratio and a null hypothesis of no enrichment or depletion. We further modified our p values prior to forming lists to address the perennial problem in "omic" analyses, namely the large scale multiple testing problem. A simple filter for p values only allows a selection based on the error rate (proportion of unaffected ORFs that are considered enriched, i.e. false negative), whereas q values allow the setting of a threshold FDR (based on false positive rate). By setting a q value threshold of 0.05 (or 0.15 in the UIM analysis), the lists of putative proteome substrates have, on average, a false positive proportion smaller than 5% (or 15%). Note that in all experiments the error rates were below or close to 10%. In addition to the absolute meaning of the q value cutoff in terms of the FDR, our choice of cutoff was guided by a desire to recover a high proportion (which we estimated to be close to 80% in most of our analyses) of the ORFs that responded to a chemical (MG132) or genetic (rpn10
, uim
) perturbation of the UPS. Manual validation of a subset of proteins from the MG132 and uim
analyses confirmed the efficacy of the approach. To our best knowledge, this is the first analysis that combines the null distribution of a reference experiment and q value test to identify responsive or affected proteins in quantitative mass spectrometry analysis. This method offers considerable promise that could be broadly applied to other proteomics studies.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published, MCP Papers in Press, July 20, 2007, DOI 10.1074/mcp.M700264-MCP200
1 The abbreviations used are: UPS, ubiquitin proteasome system; VWA, von Willebrand A; UIM, ubiquitin-interacting motif; FDR, false discovery rate; TAP, tandem affinity purification; MudPIT, multidimensional protein identification technology; WT, wild-type; DUB, deubiquitylating. ![]()
2 J. R. Lipford, personal communication. ![]()
3 A. Dabney and J. D. Story, personal communication. ![]()
4 T. Mayor and R. J. Deshaies, unpublished data. ![]()
* This work was supported in part by the Howard Hughes Medical Institute. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. ![]()
** Supported by Natural Sciences and Engineering Research Council (Canada) and the Michael Smith Foundation for Health Research. ![]()

Supported by National Institutes of Health Grant P41-RR011823. ![]()
¶¶ An Investigator of the Howard Hughes Medical Institute. ![]()
¶ To whom correspondence should be addressed: Dept. of Biochemistry and Molecular Biology, UBC Centre for Proteomics, University of British Columbia, 301-2185 East Mall, Vancouver, British Columbia V6T 1Z4, Canada. Tel.: 604-822-5144; Fax: 604-822-2114; E-mail: mayor{at}interchange.ubc.ca
| REFERENCES |
|---|
|
|
|---|