Abstract
The small ubiquitin-like modifier (SUMO) family in vertebrates includes three different family members that are conjugated as post-translational modifications to target proteins. SUMO-2 and -3 are nearly identical but differ substantially from SUMO-1. We used quantitative proteomics to investigate the target protein preferences of SUMO-1 and SUMO-2. HeLa cells were established that stably express His6-SUMO-1 or His6-SUMO-2. These cell lines and control HeLa cells were labeled with stable arginine isotopes, and His6-SUMOs were enriched from lysates using immobilized metal affinity chromatography. 53 SUMO-conjugated proteins were identified, including 44 novel SUMO targets. 25 proteins were preferentially conjugated to SUMO-1, 19 were preferentially conjugated to SUMO-2, and nine proteins were conjugated to both SUMO-1 and SUMO-2. SART1 was confirmed by immunoblotting to have both SUMO-1- and SUMO-2-linked forms at similar levels. SUMO-1 and SUMO-2 are thus shown to have distinct and overlapping sets of target proteins, indicating that SUMO-1 and SUMO-2 may have both redundant and non-redundant cellular functions. Interestingly, 14 of the 25 SUMO-1-conjugated proteins contain zinc fingers. Although both SUMO family members play roles in many cellular processes, our data show that sumoylation is strongly associated with transcription because nearly one-third of the identified target proteins are putative transcriptional regulators.
Conjugation of ubiquitin-like protein modifiers to target proteins regulates a wide variety of cellular processes (1). The ubiquitin family includes small ubiquitin-like modifiers (SUMOs)1 that are similar in structure to ubiquitin (2), but in contrast to polyubiquitination, sumoylation does not target proteins for degradation by the proteasome (3–5). Sumoylation can regulate the function of proteins by affecting protein-protein interactions, which in turn can influence subcellular localization. This was first demonstrated for RanGAP1, a protein that is targeted to the nuclear pore component RanBP2 by sumoylation (6–9).
The conjugation pathway of SUMO is similar to the conjugation pathway of ubiquitin and consists of E1, E2, and E3 enzymes (3–5). SUMO is activated by the SUMO-activating enzyme 1/2 dimeric E1 enzyme, and subsequently SUMO is transferred to target proteins by a single E2 enzyme designated Ubc9. Several E3-like factors have been identified, including RanBP2 and the protein inhibitor of activated signal transducer and activator of transcription family, that enhance SUMO conjugation to proteins (3–5, 10, 11). Sumoylation is a reversible process; SUMO-specific proteases can remove SUMO from target proteins (12). These SUMO proteases are also essential for SUMO maturation because SUMO precursor proteins require C-terminal cleavage to expose a diglycine motif essential for conjugation. RNA interference and genetic studies of several components of the sumoylation pathway have established that sumoylation is critical for eukaryotic cell viability (13–17).
Many target proteins have been identified for Smt3, the single SUMO in budding yeast (18–23). These include transcription factors, replication factors, RNA-binding and processing proteins, translation factors, transport factors, cytoskeleton components, and metabolic enzymes, highlighting the broad impact of SUMO on cellular processes.
In contrast to the single SUMO found in Saccharomyces cerevisiae, Caenorhabditis elegans, and Drosophila melanogaster, higher eukaryotes express multiple different SUMOs. A complex SUMO family has been identified in Arabidopsis thaliana with up to eight members (24, 25). Humans express three SUMO family members, SUMO-1, SUMO-2, and SUMO-3. Mature SUMO-2 and SUMO-3 are nearly identical (∼95% identity) but differ substantially from SUMO-1 (∼50% identity) (26–28). In addition to genes that encode functional SUMOs, extensive sets of SUMO pseudogenes exist (29).
We have previously purified and identified a set of target proteins for human SUMO-2 (30), and other groups have identified target proteins for SUMO-1 and SUMO-3 (31–35). These studies have emphasized the broad impact of SUMO on multiple cellular processes. It is not clear, however, whether different SUMO family members have unique cellular roles or whether they act in a redundant manner. Recent developments in quantitative proteomics now enable the systematic investigation of target protein preferences for different SUMO family members. The power of these new techniques has been demonstrated in recent studies. For example, in our laboratories Andersen et al. (36) have investigated the flux of 489 endogenous nucleolar proteins in response to metabolic inhibitors, Kratchmarova et al. (37) have compared the closely related signaling pathways of epidermal growth factor and platelet-derived growth factor, and Trinkle-Mulcahy et al. (38) have identified PP1α- and -γ-binding proteins using the quantitative proteomics technique SILAC (39, 40). Here we used SILAC to identify and compare target protein sets for SUMO-1 and SUMO-2.
EXPERIMENTAL PROCEDURES
Cell Culture—
HeLa cells stably expressing His6-SUMO-1 or His6-SUMO-2 were described previously (30, 41). HeLa cells were grown in Dulbecco’s modified Eagle’s medium supplemented with 10% FCS and 100 units/ml penicillin and streptomycin (Invitrogen). Stable isotope labeling was carried out essentially as described previously (36, 37, 42) using [12C6,14N4]arginine (referred to as Arg0), [13C6,14N4]arginine (referred to as Arg6), or [13C6,15N4]arginine (referred to as Arg10) as indicated.
Purification of His6-SUMO-conjugated Proteins—
His6-SUMO conjugates were purified essentially as described previously (30). Briefly cells were isolated by trypsinization and washed twice with ice-cold PBS. Nuclei were isolated and washed in ice-cold CSK buffer (10 mm PIPES, pH 6.8, 300 mm sucrose, 100 mm NaCl, 3 mm MgCl2, 1 mm EGTA, 0.5% Triton X-100) supplemented with protease inhibitor mixture 1873580 (Roche Diagnostics GmbH). Subsequently, proteins were solubilized in lysis buffer (8 m urea, 100 mm Na2HPO4/NaH2PO4, 10 mm Tris/HCl, pH 8.0) and sonicated. His6-SUMO conjugates were enriched on Talon beads (BD Biosciences) and washed extensively with lysis buffer. Conjugates were eluted in lysis buffer containing 200 mm imidazole.
Mass Spectrometry and Data Analysis—
Mass spectrometric analysis was performed by nanoscale LC-MS/MS using a linear ion trap (LTQ)-FT-ICR mass spectrometer (ThermoFinnigan, Bremen, Germany). Eluates were analyzed by one-dimensional gel electrophoresis. The two gel lanes used were cut in 10 slices and subjected to in-gel digestion with trypsin. The resulting peptides were extracted, concentrated, and then loaded onto a fused silica capillary with a 75-μm inner diameter and an 8-μm tip opening (New Objective, Woburn, MA) filled with Reprosil 3-μm reverse phase material (Dr. Maisch, Ammerbuch, Germany). Peptides were eluted with a 140-min linear gradient of 95% buffer A (0.5% acetic acid in H2O) to 50% buffer B (80% acetonitrile, 0.5% acetic acid in H2O). The LTQ-FT-ICR instrument was operated in the data-dependent mode to acquire high resolution precursor ion spectra (from m/z 300 to 1,500, R = 25,000, and ion accumulation to a target value of 10,000,000) in the ICR cell. The three most intense ions were sequentially isolated for accurate mass measurements by selected ion monitoring scans (10-Da mass window, R = 50,000, and a target accumulation value of 50,000). The ions were simultaneously fragmented in the linear ion trap with a normalized collision energy setting of 27% and a target value of 2,000.
Combined peak lists were searched in the International Protein Index (IPI) database (www.ebi.ac.uk/IPI/IPIhelp.html) using the Mascot program (Matrix Science, London, UK). LTQ-FT-ICR data were searched with a peptide mass tolerance of 5 ppm and a fragment mass tolerance of 0.8 Da. Iterative calibration algorithms on the basis of identified peptides resulted in an average absolute peptide mass accuracy of better than 1 ppm. Stringent criteria were required for protein identification based on the LTQ-FT-ICR data: at least two matching peptides per protein, a mass accuracy within 3 ppm, a Mascot score for individual ions of better than 20, and a delta score of better than 5.
Protein ratios were calculated for each arginine-containing peptide as the peak area of Arg6 divided by the peak area of Arg0 and the peak area of Arg10 divided by the peak area of Arg0 for each single scan mass spectrum. The peptide ratios were averaged for all arginine-containing peptides sequenced for each protein. MSQuant, an in-house developed software program, was used to extract information from the Mascot HTML database search files and to manually validate the certainty in peptide identification and in peptide abundance ratio. The program is available as open source (53).
Proteins, Antibodies, Immunoprecipitations, Protein Electrophoresis, and Immunoblotting—
SUMO-1 and SUMO-2 proteins were produced in Escherichia coli and purified as described previously (43). The amino acid sequence of the mature protein that we refer to as SUMO-2 is MSEEKPKEGVKTENDHINLKVAGQDGSVVQFKIKRHTPLSKLMKAYCERQGLSMRQIRFRFDGQPINETDTPAQLEMEDEDTIDVFQQQTGG (43). Peptide antibody AV-SM23-0100 against SUMO-2/3 was generated in rabbit using the peptide MEDEDTIDVFQQQTG (Eurogentec) (30). Peptide antibody 1607 against SART1 was also generated in rabbit by Eurogentec using peptides CSLSIEETNKLRAKLGLKPLEV and CNLDEEKQQQDFSASSTT as described previously (44). Monoclonal antibodies 21C7 against SUMO-1 and 19C7 against RanGAP1 were obtained from Zymed Laboratories Inc., monoclonal antibody His.Tag against His5 was obtained from Novagen, polyclonal antibody AB1380 against Sp100 was obtained from Chemicon, monoclonal antibody 1814460 against green fluorescent protein was obtained from Roche Diagnostics Corp., and polyclonal antibody SC-551 against retinoic acid receptor α was obtained from Santa Cruz Biotechnology. Secondary antibodies used were anti-rabbit horseradish peroxidase and anti-mouse horseradish peroxidase (1:5,000, Pierce).
HeLa cells were lysed in RIPA buffer (50 mm Tris-HCl, pH 7.5, 150 mm NaCl, 1% Nonidet P-40, 1% sodium deoxycholate, 5 mm EDTA) supplemented with 1 mm DTT, 1 mm PMSF, 20 mm N-ethylmaleimide, and protease inhibitor mixture 1873580 (Roche Diagnostics GmbH). Lysates were precleared by centrifugation, and RanGAP1 and Sp100 were immunoprecipitated using specific antibodies. Species-matched control antibodies were directed against green fluorescent protein and retinoic acid receptor α. Lysates were incubated with antibodies at 4 °C for 1 h and cleared again by centrifugation, and immunocomplexes were subsequently purified on protein G-Sepharose 4 fast flow beads (Amersham Biosciences) for 3 h at 4 °C. After extensive washing with lysis buffer, immunoprecipitates were eluted in lithium dodecyl sulfate protein sample buffer (Invitrogen).
Protein samples were size-fractionated on Novex 4–12% Bis-Tris gradient gels using 4-morpholinepropanesulfonic acid buffer (Invitrogen). Total protein was visualized using the colloidal blue staining kit according to the instructions of the manufacturer (Invitrogen). For immunoblotting experiments, size-fractionated proteins were subsequently transferred onto Hybond-C extra membranes (Amersham Biosciences) using a submarine system (Invitrogen). The membranes were incubated with specific antibodies as indicated. The monoclonal antibody His.Tag against His5 was used according to the instructions of the manufacturer (Novagen). Bound antibodies were detected via chemiluminescence with ECL Plus (Amersham Biosciences).
RESULTS
SUMO-1 and SUMO-2/3 Conjugation Profiles—
To study the target protein profiles for the nearly identical human SUMO family members SUMO-2 and SUMO-3, a polyclonal antiserum was raised in a rabbit against a peptide from the identical C-terminal regions of both SUMO-2 and SUMO-3. This antiserum specifically recognizes SUMO-2/3 but not SUMO-1 as judged by immunoblotting experiments, whereas the commercially available monoclonal antibody 21C7 specifically recognizes SUMO-1 but not SUMO-2/3 (Fig. 1, A and B). Endogenous SUMO target protein profiles in HeLa lysates were studied by immunoblotting using both this SUMO-2/3-specific antiserum and monoclonal antibody 21C7 (Fig. 1). Using short exposure times, it became apparent that the major 75-kDa band recognized by the SUMO-1 antibody (indicated by an asterisk) is not prominent in the SUMO-2/3 target protein profile. Previously, this major SUMO-1 target was shown to be RanGAP1 (8), and these results thus indicate that RanGAP1 is a preferential target for SUMO-1. Other parts of the target protein profiles also differ for SUMO-2/3 and SUMO-1, indicating that substantial numbers of proteins are preferentially conjugated either to SUMO-1 or to SUMO-2/3.
Differential conjugation of endogenous SUMO-1 and endogenous SUMO-2/3 to target proteins.A and B, SUMO-1 and SUMO-2 proteins were produced in E coli and purified. 5-ng protein samples were subjected to SDS-PAGE, transferred to membranes, and probed using monoclonal antibody 21C7 directed against SUMO-1 (A) and polyclonal antibody AV-SM23-0100 directed against SUMO-2/3 (B). C and D, whole cell extracts of HeLa cells were separated by SDS-PAGE, transferred to membranes, and probed using antibodies 21C7 or AV-SM23-0100. Unconjugated SUMO is indicated by an arrow, and SUMO-conjugated RanGAP1 is indicated by an asterisk. The figure is composed of immunoblotting results representing short exposure times (C) and long exposure times (D).
Purification and Identification of SUMO-1 and SUMO-2 Conjugates by Quantitative Proteomics—
To investigate the SUMO-1- and SUMO-2-conjugated proteins in more detail and to identify novel SUMO targets, a quantitative proteomics approach was chosen utilizing the SILAC method (39, 40) (Fig. 2). An important technical aspect of the purification of sumoylated proteins is the use of denaturing buffers to inactivate SUMO proteases and to prevent co-purification of non-sumoylated proteins because they are bound to sumoylated proteins in a non-covalent manner. The use of non-denaturing buffers can instead lead to the inclusion of false positives. We used His6 to tag SUMO in these experiments because the small hexahistidine tag is less likely to interfere with enzymatic attachment to target proteins and should therefore maximize the range of SUMO targets selected, and this tag is compatible with the use of denaturing buffers. Stable HeLa cell lines expressing either His6-SUMO-1 or His6-SUMO-2 were established and labeled with either carbon-substituted arginine (Arg6) or carbon plus nitrogen-substituted arginine (Arg10), respectively. Control HeLa cells were labeled using unsubstituted arginine (Arg0). Nuclear lysates were prepared in 8 m urea from the three different cell lines because nearly all sumoylated proteins are present in the nucleus (30). 10% of these input fractions were separately affinity-purified on Talon beads, size-separated by SDS-PAGE, and immunoblotted to detect the His tag (Fig. 3A). Similar expression levels of His6-SUMO-1 and His6-SUMO-2 were found.
A quantitative proteomics strategy to identify SUMO-conjugated proteins. HeLa cells were labeled with Arg0, HeLaHis6-SUMO-1 cells were labeled with Arg6, and HeLaHis6-SUMO-2 cells were labeled with Arg10. Equal amounts of lysates from the three different populations were mixed, and proteins conjugated to His6-SUMO were affinity-purified on cobalt-agarose. The SUMO-enriched fraction was separated by SDS-PAGE, the gel lane was cut in slices, and the proteins present in these slices were digested by trypsin and identified by mass spectrometry. Peptide mass spectra of proteins were quantified to identify proteins potentially conjugated to SUMO-1 and/or SUMO-2.
Purification of SUMO conjugates. HeLa cells were labeled with Arg0, HeLaHis6-SUMO-1 cells were labeled with Arg6, and HeLaHis6-SUMO-2 cells were labeled with Arg10. Nuclei were isolated and lysed in 8 m urea. Equal amounts of protein were used in purification experiments. A, three separate purifications were performed on cobalt columns, using 10% of each lysate, to purify His6-SUMO-1- and His6-SUMO-2-conjugated proteins separately. Proteins were size-separated by SDS-PAGE, transferred to a membrane, and probed using an antibody directed against the His tag. A nonspecific (n.s.) band is indicated by an asterisk. B and C, equal amounts of protein from the three different lysates were mixed 1:1:1, and sumoylated proteins were purified on a cobalt column. Proteins were eluted from the column in fractions. Small aliquots of each fraction were size-separated by SDS-PAGE, transferred to membranes, and probed using antibody 21C7 directed against SUMO-1 (B) or antibody AV-SM23-0100 directed against SUMO-2/3 (C). D, fraction 3 was size-separated by SDS-PAGE and stained for total protein. The top part of the gel-lanes was cut in 10 slices, and proteins present in each slice were identified by trypsin digestion and mass spectrometric analysis. E, short summary of the identified proteins. 53 potential SUMO target proteins were identified with a SILAC score of at least 1.5, including 25 preferential SUMO-1 targets, 19 preferential SUMO-2 targets, and nine proteins that show little or no preference for SUMO-1 or SUMO-2.
The three nuclear lysates were mixed in a 1:1:1 ratio for large scale His6-SUMO affinity purification on Talon beads. Small aliquots of the eluted fractions were used to detect SUMO-1 and SUMO-2/3 by immunoblotting (Fig. 3, B and C). Purified proteins present in the third eluate were size-separated by SDS-PAGE in two lanes and stained using colloidal Coomassie (Fig. 3D). The top part of the gel lanes that contain the majority of the SUMO conjugates was cut in 10 slices, and proteins were in-gel digested by trypsin. Peptides were eluted from the gel slices and analyzed by LTQ-FT-ICR. A complete list of the 1,656 arginine-containing peptides matching 324 different proteins identified in this screen is available as supplemental material.
Although the SILAC technique can quantify changes smaller than 10% (45, 46), we chose 1.5 as a conservative cutoff ratio. 53 proteins were detected whose identity could be confirmed by at least two arginine-containing peptides and that were enriched at least 1.5-fold in the heavy arginine form (Fig. 3E and Table I). 25 of these proteins were enriched in the Arg6-labeled form, indicating conjugation to SUMO-1, whereas 19 were enriched in the Arg10-labeled form, indicating conjugation to SUMO-2. Nine proteins were enriched in both heavy arginine-labeled forms as compared with the control fraction, indicating conjugation to both SUMO-1 and SUMO-2.
SUMO target proteins
Proteins identified by at least two arginine-containing peptides with a minimum SILAC cutoff ratio of 1.5 are presented here and include 25 SUMO-1 targets (top part), 19 SUMO-2 targets (middle part), and nine proteins conjugated to both SUMO family members (bottom part). 14 of the 25 SUMO-1 targets contain one or more zinc fingers (ZF). A complete list of identified arginine-containing peptides can be found in the supplemental dataset. RB, retinoblastoma susceptibility protein; KRAB, Krueppel-associated box.
Of the 53 SUMO target proteins selected, nine have previously been found in SUMO target protein screens. These proteins are RanGAP1 (6–9), PML (47–50), Δ1-pyrroline-5-carboxylate synthetase (35, 33), 82-kDa FMRP-interacting protein, Ataxin-2-like protein, PP1 regulator, hypothetical protein DKFZp434D1319 (33), and SART1 (31–35). Three proteins were previously identified in our screen for SUMO-2 targets, SART1, Ataxin-2-like protein, and a homolog of zinc finger protein 106 (30). The lowest ratio of heavy arginine to light arginine that we observed for peptides from these nine known SUMO target proteins was 1.53 for the PP1 regulator. This confirms the validity of using 1.5 as the SILAC cutoff ratio to assess specificity. We recognize that using the relatively low cutoff ratio of 1.5 harbors the risk of including false positive SUMO targets in Table I. On the other hand, we feel this is justified to avoid rejecting genuine SUMO targets. The relatively low SILAC ratios observed are likely due to the low abundance of sumoylated proteins and the co-purification of contaminating, non-sumoylated proteins.
Many proteins are conjugated to SUMOs via lysines present in the sumoylation consensus motif ψKX(E/D) where ψ is Val, Ile, Leu, Met, or Phe (3–5). We searched for the presence of this sumoylation consensus motif in the 53 proteins identified. As shown in Table II, a total of 112 consensus sumoylation sites was found in 39 of the 53 proteins (74% of total). Based on a small number of SUMO target proteins, the sumoylation consensus motif was initially defined as ψKXE where ψ is Val, Ile, or Leu (47). A total of 75 sumoylation sites that match this motif was found in 33 of the 53 proteins (62% of total). These frequencies were compared with the frequencies of sumoylation sites in the 13,124 human proteins present in Swiss-Prot release 48.5. 13,207 (V/I/L/M/F)KX(E/D) type sumoylation sites were found in 6,849 proteins (52% of total), and 6,709 (V/I/L)KXE type sumoylation sites were found in 4,318 proteins (33% of total). Thus, our set of SUMO target proteins is enriched in sumoylation consensus motifs, indicating the validity of our strategy to identify endogenous SUMO target proteins. The 14 proteins that are lacking a consensus site for sumoylation are not necessarily false positives, however, because several proteins have previously been found to be conjugated to SUMO via lysines that are not situated in sumoylation consensus sites (3, 4, 19, 21).
Consensus sumoylation sites found in the identified SUMO target proteins
A total of 112 (V/I/L/M/F)KX(E/D) consensus sites for sumoylation were identified in 39 of the 53 SUMO targets. Indicated are the positions of the sumoylated lysines in the respective proteins and the tetramer sequences involved.
A striking feature of many SUMO-1 target proteins is the presence of one or more zinc fingers (Table I) in agreement with a previous study (32). 14 of the 25 SUMO-1-conjugated proteins contain zinc fingers in contrast to three zinc finger-containing proteins of 19 SUMO-2 conjugates and one zinc finger-containing protein of nine SUMO-1 and SUMO-2 conjugates. These zinc fingers are mainly C2H2 type, but C3H1 type and C3HC4 type zinc fingers are also found.
Many zinc finger proteins play a role in transcription, therefore it is not surprising that one-third of the identified proteins are transcriptional regulators. This functional group includes 10 preferential SUMO-1 targets, six preferential SUMO-2 targets, and one protein that is possibly conjugated to both SUMOs, showing that both SUMO family members play a role in transcription. Furthermore, SUMO target proteins are involved in signaling, metabolism, cell cycle regulation, glycosylation, DNA repair, pre-mRNA splicing, RNA editing, and other cellular processes, providing more evidence for the broad impact of SUMOs on cells (Table I).
We compared our dataset with data from previous studies on the target proteins that were identified for yeast SUMO, Smt3 (19–23). Eight yeast homologs of the human SUMO target proteins were also identified as targets for Smt3 (Table III). Homologs were found for four preferential SUMO-1 targets, three preferential SUMO-2 targets, and three target proteins for SUMO-1 and SUMO-2. A single yeast homolog, CHD1, exists for the human proteins chromodomain helicase DNA-binding protein 2 and chromodomain helicase DNA-binding protein HELSNF1, and a single yeast homolog, PBP1, exists for the Ataxin-2 protein and the Ataxin-2-like protein. All the proteins in Table III contain sumoylation consensus sites with the exception of Histone H2B and PBP1.
A comparison between previously published Smt3 target proteins and the SUMO target proteins identified in this study
Ten proteins identified in our screen for SUMO-1 and SUMO-2 targets have yeast homologs that were previously identified as targets for Smt3. Consensus sumoylation sites are found in all these proteins except for Histone H2B and PBP1.
Several examples of peptide spectra corresponding to SUMO-1 and/or SUMO-2 target proteins are given in Fig. 4. Chromodomain helicase DNA-binding protein (CHD) 2 is an example of a preferential SUMO-1 target (Fig. 4B), whereas CHD1 is an example of a non-sumoylated protein (Fig. 4A). Interestingly, CHD1 is missing five of the eight consensus sites for sumoylation that are found in CHD2 (Table II). RanGAP1 was preferentially conjugated to SUMO-1, but the data also indicated the existence of a SUMO-2-linked form (Fig. 4C). SART1 was a target for both SUMO family members (Fig. 4D), whereas PML (Fig. 4E) and the Ataxin-2-related domain protein (Fig. 4F) were preferential SUMO-2 targets.
Peptide mass spectra of SUMO target proteins. Peptide mass spectra of proteins were quantified, and peptides matching the control fraction (Arg0), the SUMO-1 enriched fraction (Arg6), and the SUMO-2 enriched fraction (Arg10) are indicated. Chromodomain helicase DNA-binding protein 1 is an example of a non-sumoylated protein (A), chromodomain helicase DNA-binding protein 2 (B) and RanGAP1 (C) are examples of preferential SUMO-1 targets, PML (E) and Ataxin-2-related domain protein (RDP) (F) are examples of preferential SUMO-2 targets, and SART1 is an example of a protein that is conjugated to both SUMO family members (D). Ctrl, control.
Confirmation of SUMO-1 and SUMO-2 Target Protein Preferences by Immunoblotting.—
To confirm that SUMO-1 and SUMO-2 display target protein preferences and also share target proteins, immunoblotting experiments were performed (Fig. 5A). His6-SUMO-1- and His6-SUMO-2-conjugated proteins were purified separately, and control HeLa cells were included in the experiment. Proteins were size-separated by SDS-PAGE and blotted to membranes, and SUMO target proteins were detected by specific antibodies. In line with the results shown in Figs. 1C and 4C, RanGAP1 was detected as a preferential target for SUMO-1; however, a SUMO-2-conjugated form was also detectable (Fig. 5A). Note that SILAC was able to find small amounts of RanGAP1 in the negative control fraction, whereas no RanGAP1 could be found in the negative control by immunoblotting. SILAC is apparently more sensitive in detecting proteins in the negative control fraction than immunoblotting. Sp100 (50) is an example of a specific target for SUMO-2, and SART1 was conjugated to both SUMO family members at similar levels (Fig. 5A).
Preferential conjugation of proteins to SUMO-1 or SUMO-2.A, His6-SUMO conjugates were purified from HeLaHis6-SUMO-1 nuclei and HeLaHis6-SUMO-2 nuclei. Control purifications from HeLa nuclei were included in the experiment. Proteins were size-separated by SDS-PAGE, transferred to membranes, and probed using antibodies directed against endogenous SUMO target proteins. RanGAP1 is preferentially conjugated to SUMO-1, Sp100 is conjugated to SUMO-2, and SART1 is conjugated to both SUMOs at similar levels. B, preferential conjugation of endogenous target proteins to endogenous SUMO-1 or endogenous SUMO-2/3. The SUMO-1 target protein RanGAP1 and the SUMO-2 target protein Sp100 were immunoprecipitated from HeLa cell lysates, transferred to a membrane, and probed using antibody 21C7 directed against SUMO-1 or antibody AV-SM23-0100 directed against SUMO-2/3.
These results were obtained using endogenous SUMO target proteins but exogenous His6-SUMOs. To also investigate endogenous SUMOs conjugated to endogenous proteins, immunoprecipitation assays were used (Fig. 5B). A major technical challenge in this type of experiment is the presence of SUMO proteases in extracts that remove SUMOs from target proteins. To inhibit these proteases, N-ethylmaleimide was added to lysis buffers. Sp100 and RanGAP1 were immunoprecipitated, and SUMO attachment was studied by immunoblotting. The antiserum against SART1 failed to function in these immunoprecipitation experiments. The results confirmed that endogenous RanGAP1 is preferentially conjugated to endogenous SUMO-1 and that endogenous Sp100 is preferentially conjugated to endogenous SUMO-2/3.
DISCUSSION
In contrast with the single SUMO found in lower eukaryotes, vertebrates express three SUMO family members that are attached to substrate proteins (3–5). The recently developed quantitative proteomics tool SILAC (39) enabled us to identify and quantify target proteins conjugated preferentially to SUMO-1 and SUMO-2, respectively. Three sets of proteins were identified: SUMO-1 preferential targets, SUMO-2 preferential targets, and also several proteins that are conjugated to both SUMO-1 and to SUMO-2. We confirmed these results by immunoblotting experiments for a subset of SUMO target proteins.
The purification and identification of sumoylated proteins has been hampered by the low abundance of many SUMO targets, the finding that usually only a small fraction of a protein is sumoylated at any time, and the high activity of SUMO proteases (3). We chose to deal with these serious technical challenges using the novel approach of combining immobilized metal affinity chromatography with stable isotope labeling. Because it is essential to both enrich the sumoylated target proteins and block the action of SUMO proteases by using denaturing buffers, this limits in practice the choice of the affinity tag that can be used. The His6 tag is compatible with the use of a denaturing 8 m urea buffer, and significant enrichment of tagged proteins can be obtained using immobilized metal affinity chromatography. Nevertheless, the resulting purified fractions, although enriched, are never completely pure and inevitably contain a variety of contaminating, nonspecific proteins. Such contaminants are always observed using this and related methodologies and arise for several reasons. For example, they include the proteins that interact with the immobilized metal cobalt via internal histidine-rich regions and other abundant, “sticky” proteins that bind via lower affinity ionic interactions. SILAC is used to discriminate between such inevitable contaminants and the bona fide SUMO targets by accurately and objectively quantitating the specific enrichment of proteins above background levels. In addition, SILAC enables the quantitation of proteins that are preferentially conjugated to either His6-SUMO-1 or His6-SUMO-2. As an example, Fig. 5A shows that a small amount of SART1 could interact in its non-sumoylated form with immobilized cobalt. Sumoylated forms can in addition be purified from lysates of His6-SUMO-expressing cells, and SILAC is able to detect the larger amounts of SART1 present in the heavy arginine-labeled forms, corresponding to the His6-SUMO-1- and His6-SUMO-2-conjugated fractions. Importantly, our successful identification of a number of known SUMO target proteins provides a powerful positive control that further underlines the validity of our approach.
Several lines of evidence further support the notion that different SUMO family members display target protein preferences in vivo. Preferential conjugation of RanGAP1 to SUMO-1 was noted previously by Saitoh and Hinchey (28). In two proteomics approaches using SUMO-1 and SUMO-3, the sets of identified target proteins were also only partially overlapping, but the interpretation of these results is more complicated due to the use of non-quantitative proteomics approaches (33, 35).
Interestingly, proteins are conjugated in vitro to SUMO-1 and SUMO-2 by the E2 enzyme Ubc9 with similar efficiency (51). This indicates that, in addition to SUMOs, target proteins, and the E1 and E2 enzymes that are used in sumoylation reactions in vitro, additional factors may be present in cells that regulate the preferential usage of SUMO-1 or SUMO-2. E3 enzymes are likely candidates to fulfill this role in vivo. In agreement with this hypothesis, it was shown that adding a fragment of the SUMO E3 ligase RanBP2 to in vitro sumoylation assays involving either PML or Sp100 stimulates the preferential usage of SUMO-2 over SUMO-1 (51). This is in line with our results demonstrating the preferential conjugation of Sp100 to SUMO-2. However, it is currently unclear whether RanBP2 also regulates SUMO-2 conjugation of PML and Sp100 in cells. The elucidation of the cellular mechanism underlying target protein preferences for different SUMO family members is therefore an important future objective.
Previously, it has been shown that Sp100 can also be conjugated to SUMO-1 in an interferon-dependent manner (50). This could indicate that SUMO target protein preferences can be stimulus-dependent. A more detailed study of conditional sumoylation is required to obtain better insight into the target protein preferences of different SUMO family member upon activation of specific cellular signaling pathways.
In addition to differences in target protein preferences for SUMO-1 and SUMO-2/3, the relative amount of conjugated SUMO compared with free SUMO is also different between these SUMO family members. It has been shown that a large pool of free non-conjugated SUMO-2/3 exists in COS-7 cells compared with SUMO-1 that mainly exists in the protein-conjugated form (28). The free SUMO-2/3 pool is conjugated to target proteins in a stress-dependent manner. In contrast to the situation in COS-7 cells, the pool of free SUMO-2/3 in the HeLa cells we used appears to be small, although it is probably larger than the pool of free SUMO-1, and many proteins are SUMO-2/3-conjugated in a stress-independent manner (Fig. 1). Thus, cell type-specific differences in conjugation-efficiencies of SUMO-2/3 appear to exist.
The three SUMOs also differ in their ability to form SUMO chains. This occurs via an internal sumoylation site that is present in SUMO-2 and SUMO-3 but is missing in SUMO-1 (43). SUMO-2 chains are formed on PML in vitro, and SUMO-2 dimers have been found attached to HDAC4 (43).
Evidence exists that the closely related SUMO-2 and SUMO-3 proteins also display functional differences. Although mature SUMO-2 and SUMO-3 are nearly identical, the precursor proteins differ substantially in their C termini (43). This could indicate that the processing of the precursor proteins occurs differently or is mediated by different SUMO proteases. The SUMO protease SENP2 has indeed been shown to catalyze the maturation of pre-SUMO-2 and pre-SUMO-3 with strikingly different efficiencies, and this difference can be attributed to the differences in C termini of these proteins (52). Whether these differences between SUMO-2 and SUMO-3 affect their conjugation to target proteins is currently unclear.
In summary, we identified a set of novel potential SUMO target proteins and in addition confirmed several previously identified SUMO conjugates. Some of these proteins were preferentially conjugated to SUMO-1, other proteins were preferentially conjugated to SUMO-2, and a third set of proteins was found to be conjugated to both SUMO-1 and SUMO-2. This indicates that SUMO-1 and SUMO-2 probably have both redundant and non-redundant cellular functions.
Acknowledgments
We thank Drs. Peter ten Dijke and Hans Tanke for critically reading the manuscript.
Footnotes
-
Published, MCP Papers in Press, September 25, 2006, DOI 10.1074/mcp.M600212-MCP200
-
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked “advertisement” in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
-
↵ 1 The abbreviations used are: SUMO, small ubiquitin-like modifier; Arg0, [12C6,14N4]arginine; Arg6, [13C6,14N4]arginine; Arg10, [13C6,15N4]arginine; Bis-Tris, 2-[bis(2-hydroxyethyl)amino]-2-(hydroxymethyl)propane-1,3-diol; CHD, chromodomain helicase DNA-binding protein; E1, SUMO-activating enzyme; E2, SUMO protein carrier protein; E3, SUMO ligase; LTQ, linear quadrupole ion trap; PML, promyelocytic leukemia protein; RanGAP1, Ran GTPase-activating protein 1; SART, squamous cell carcinoma antigen recognized by T-cells; SILAC, stable isotope labeling by amino acids in cell culture; Ubc9, ubiquitin-conjugating enzyme 9; ψKX(E/D), consensus sumoylation site where ψ is Val, Leu, Ile, Met, or Phe and X is any amino acid; PIPES, 1,4-piperazinediethanesulfonic acid; FMRP, fragile X mental retardation protein.
-
↵* This work was supported in part by the Netherlands Organisation for Scientific Research (NWO) (to A C O V.) as part of the Innovational Research Incentives Scheme and by a fellowship from the Dutch Cancer Society (to A C O V.), by a Wellcome Trust program grant (to A I L.), by a Biotechnology and Biological Sciences Research Council grant (to R T H.), and by a generous grant from the Danish National Research Foundation (to J S A. and M M.).
-
↵S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material.
-
↵§§ A Wellcome Trust principal research fellow.
- Received June 6, 2006.
- Revision received August 28, 2006.
- © 2006 The American Society for Biochemistry and Molecular Biology