|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 7:58-70, 2008.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
60 known soluble hydrolytic enzymes and associated proteins to achieve this purpose (1). Most of these proteins are targeted to the lysosome by the mannose 6-phosphate (Man-6-P)1 pathway. Here, shortly after biosynthesis, the N-linked oligosaccharides on lysosomal proteins are selectively modified to generate the Man-6-P recognition marker. The phosphorylated proteins bind to Man-6-P receptors (MPRs) in the trans Golgi network (TGN). The receptors and bound ligands enter transport vesicles and travel to a prelysosomal compartment where the receptor and ligands dissociate. The lysosomal proteins subsequently travel to the lysosome while the MPRs recycle back to the TGN to repeat the process, or to the cell surface.
Most mammalian cells contain two distinct MPRs, a
46 kDa cation-dependent (CD) MPR and a
300 kDa cation-independent (CI) MPR (2, 3). Both receptors are type I transmembrane glycoproteins that cycle between the TGN, endosomes, and the plasma membrane. In addition to its role in the transport of newly synthesized lysosomal proteins, the CIMPR is also involved in the endocytosis of extracellular Man-6-P glycoproteins, whereas the CDMPR, despite being present on the cell surface, does not function in endocytosis under physiological conditions. The CIMPR also binds insulin-like growth factor II (IGF2) (4, 5), a ligand that does not contain Man-6-P, and mediates its endocytosis and clearance from the bloodstream.
The physiological rationale for the existence of two distinct MPRs has long been a question of fundamental interest (6, 7), and one approach to investigate their individual functions has been to generate mutant mice lacking either type of MPR. CIMPR mutants have developmental abnormalities and usually die perinatally (8, 9), but this is independent of disruptions in lysosomal enzyme targeting and results from decreased clearance of IGF2 and subsequent embryonic overgrowth. Mice lacking both IGF2 and the CIMPR are dwarfs due to the lack of IGF2 but otherwise viable and fertile (9–11). CDMPR mutants are fertile and have no obvious defects (12, 13). Mice lacking both the CDMPR and CIMPR in an IGF2-deficient background have a short lifespan and exhibit a lysosomal storage phenotype (10, 14).
Serum from mice lacking either MPR (in an IGF2-deficient background) contain elevated levels of Man-6-P glycoproteins (12, 15), suggesting that both MPRs are required for the complete targeting of all lysosomal proteins under physiological conditions. This conclusion is supported by studies of primary fibroblasts derived from these mice that demonstrate that the lack of either MPR results in the missorting of multiple lysosomal proteins while the lack of both MPRs results in the secretion of even greater amounts (12, 13, 16, 17). In general, the levels of Man-6-P glycoproteins secreted in the absence of the CIMPR appeared higher than those secreted in the absence of the CDMPR. This is consistent with binding studies employing purified receptors that indicate that all of the proteins bound by the CDMPR were also bound by the CIMPR (but not vice-versa) and with higher affinity (18). However, if there are Man-6-P-containing ligands that are preferentially transported by either MPR, they have not been identified to date.
Thus, the precise molecular basis for the requirement for two MPRs still remains to be elucidated. One explanation is that they are required for the targeting of overlapping yet distinct subsets of lysosomal proteins. Previous investigations using a blotting assay revealed that Man-6-P glycoproteins of
26 and
114 kDa were selectively elevated in serum of CDMPR- and CIMPR-deficient mice, respectively (15). In the current study, we have used mass spectrometric approaches to confirm and extend these observations and to identify Man-6-P glycoproteins that are selectively elevated in the absence of each type of MPR. Identification of these proteins represents an important step toward understanding the respective function of each receptor in lysosomal protein targeting.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
Mice—
All experiments and procedures involving live animals were conducted in compliance with approved Institutional Animal Care and Use Committee protocols. Adult mice (2–12 months) were killed as described previously (19). Blood was collected by heart puncture, allowed to clot for 1 h at room temperature, and serum was obtained by centrifugation at 300 x g for 15 min and stored at –80 °C until use. Blood was collected from wild-type (wt) mice (mixed genetic background; strains BALB/c, 129/Sv, and C57BL/6J), or mice deficient in both the CIMPR and IGF2 (designated here as CI(-), official designation Igf2r(–/–) Igf2(–/–), mixed genetic background; strains C57BL/6J and 129/Sv (15)) or mice deficient in the CDMPR (designated here as CD(-), official designation M6pr(–/–), congenic C57BL/6J (12)). While the genetic backgrounds of the groups of mice were not matched, comparison of current blotting experiments (see "Results") with previous blotting experiments on serum from 16-day-old littermates of similar genetic backgrounds lacking IGF2 (15) indicated that that strain differences were unlikely to affect the results. (Note that the receptor deficiencies in the juvenile animals analyzed previously appeared to result in a greater magnitude of Man-6-P glycoprotein secretion compared with the current measurements, possibly due to age differences, but the patterns were quite similar.) However, we cannot exclude some minor variations due to strain dependence.
Purification and Immobilization of Each MPR—
Soluble bovine CIMPR (sCIMPR) was purified from fetal bovine serum by affinity chromatography on immobilized phosphomannan (20) and was coupled to Affi-Gel 15 as described (21) at a density of 2.3 to 4 mg/ml (
9 to 16 µM). Soluble bovine CDMPR (sCDMPR) was produced from a Pichia pastoris strain expressing a construct encoding the extracytoplasmic domain engineered to eliminate four of the five N-linked glycosylation sites (generously provided by Dr. Nancy Dahms) (22). sCDMPR was first isolated from culture medium by absorption on immobilized phosphomannan. The protein was eluted with 20 mM sodium acetate pH 4.5 and applied to a Mono S 10/100 GL column (GE Healthcare) which was then eluted with a 0 to 2 M sodium chloride gradient in 20 mM sodium acetate, pH 4.5. Fractions containing sCDMPR were further purified by gel filtration on a 26 x 60 cm Superdex 75 column (GE Healthcare) equilibrated with 50 mM MES buffer, pH 6.5 containing 100 mM NaCl. Sypro Ruby (Molecular Probes, Eugene, OR) gel stain of purified sCDMPR indicated no visible contaminants. Purified sCDMPR was immobilized on Affi-Gel 15 at a density of 1.4–2.4 mg/ml (
80 to 140 µM).
Affinity Purification of Mouse Serum Man-6-P Glycoproteins—
Serum was thawed at 37 °C and centrifuged at 13,000 x g for 1.5 h at 4 °C. Supernatants were filtered using 6 layers of cheesecloth and diluted into an equal volume of buffer A (100 mM imidazole, pH 6.5, 150 mM NaCl, 20 mM MnCl2, 10 mM β-glycerophosphate, 2% Triton X-100, 2 mM Pefabloc, 2 µg/ml leupeptin, 2 µg/ml pepstatin A). Affinity chromatography was conducted using columns (1 x 10 cm) containing between 2 and 4 ml bed volume of immobilized MPR equilibrated with buffer B (50 mM imidazole, pH 6.5, 150 mM NaCl, 10 mM MnCl2, 5 mM β-glycerophosphate, 1 µg/ml leupeptin, 1 µg/ml pepstatin A) containing 1% Triton X-100 and 1 mM Pefabloc. Diluted serum was loaded onto the columns, which were then washed with eight column volumes of buffer B containing 1% Triton X-100 followed by washing with buffer B until A280 returned to baseline. After washing, the column was mock eluted using four column volumes of buffer B containing 10 mM glucose 6-phosphate (Glu-6-P) followed by specific elution with four column volumes of buffer B lacking MnCl2 and containing 10 mM Man-6-P. Finally, each column was eluted with four column volumes of 100 mM glycine pH 2.5 to remove tightly bound and nonspecifically bound material. Fractions containing Man-6-P glycoproteins were identified by monitoring the activities of β-glucuronidase and β-mannosidase (15). In some cases, Man-6-P-containing glycoproteins were repurified by MPR affinity chromatography after removal of Man-6-P by buffer exchange to buffer B by ultrafiltration (Ultra-15, Millipore, Billerica, MA). The final Man-6-P eluates were concentrated and buffer exchanged to 100 mM ammonium bicarbonate by ultrafiltration. Protein concentrations were determined using the Bradford method (23) with bovine serum albumin as a standard. Aliquots of purified Man-6-P glycoproteins were stored at –80 °C prior to use.
Two-dimensional Gel Electrophoresis—
IPG strips were rehydrated for 14 h with sample in 200 µl rehydration buffer (7 M urea, 1.25% CHAPS, 2 M thiourea, 32.5 mM DTT, 0.65% v/v IPG buffer pH 3–10 NL, 0.5% w/v Zwittergent 3–10, 0.5% w/v ASB-14, 0.5% v/v Surfact-Amps Triton X-100, 2.5 mM TCEP hydrochloride, and 0.0002% bromphenol blue). Isoelectric focusing was conducted using a PROTEAN IEF Cell (Bio-Rad) with a current limitation of 50 µA per strip for a total of 54,000 voltage hours. IPG strips were stored at –80 °C prior to two-dimensional SDS-PAGE. Proteins in the IPG strips were reduced with DTT, alkylated with iodoacetamide, and fractionated on 11.5% polyacrylamide gels run at constant current (60 mA per gel) for 6 h. For visualization, preparative gels were stained first with Sypro Ruby and then stained with colloidal Coomassie blue (24) and were scanned using either a Typhoon 9400 (GE Healthcare) or an Epson scanner.
Detection of Man-6-P Containing Glycoproteins—
After fractionation by one- or two-dimensional gel electrophoresis, proteins were transferred to nitrocellulose and glycoproteins containing the Man-6-P modification were detected using 125I-labeled sCIMPR as described previously (25). Signal was visualized and quantified by storage phosphor autoradiography using a Typhoon 9400 scanner (GE Healthcare) and ImageQuant 5.2 software, respectively.
In-gel Tryptic Digestion—
Gel slices or spots excised from one-dimensional and two-dimensional gels were digested with modified trypsin (Promega, Madison, WI) as described. In addition, for Preparation B, digested gel slices were extracted using 60% acetonitrile (ACN)/5% formic acid (FA) and pooled with the supernatants prior to drying samples and subsequent LC-MS/MS.
iTRAQ Labeling—
iTRAQ labeling (26) was performed according to the manufacturers protocol (Applied Biosystems). We conducted two iTRAQ experiments. The first iTRAQ experiment was conducted on material from B1 (iTRAQ 1, Fig. 1). Three samples representing equal amounts of protein (30 µg) purified from wt, CD(-) and CI(-) mouse serum were each spiked with 1/100 (w/w) chicken ovalbumin (Sigma) as an internal standard to monitor recovery. Samples were denatured with guanidine hydrochloride, reduced with DTT, and alkylated with iodoacetamide as described previously (27). Samples were buffer exchanged by ultrafiltration using YM-10 Microcon devices (Millipore) to 50 mM ammonium bicarbonate and modified trypsin (1:50 w/w ratio trypsin:substrate) was added to the retentate. After incubation at 37 °C for 16 h, peptides were eluted by centrifugation at 13,000 rpm for 15 min with two washes of 200 µl 50 mM ammonium bicarbonate. Pooled fractions were dried using a vacuum concentrator and residual ammonium bicarbonate was removed using three cycles of drying and resuspension in 200 µl 50% methanol. A 50 µl portion was removed from each of the wt, CD(-) and CI(-) samples and pooled to generate a fourth sample that served as an internal control for subsequent steps. Samples were dried, dissolved in 30 µl 0.5 M triethylammonium bicarbonate, and iTRAQ reagent dissolved in 70 µl of ethanol was added before incubation for 1 h at 20 °C. Labeling was verified by matrix-assisted laser desorption ionization (MALDI)/time-of-flight (TOF) (MALDI)/time-of-flight (TOF)/TOF analysis using an Applied Biosystems 4800. Labeling was nearly complete, as revealed by subsequent database searching with the LC-MALDI/TOF/TOF data allowing for variable iTRAQ modification, which failed to detect a significant number of unlabeled peptides. Samples labeled with different iTRAQ reagents were dried using a vacuum concentrator, dissolved in 50% methanol, and pooled. Aliquots were vacuum-dried and stored at –80 °C until use.
|
Two-dimensional Liquid Chromatography of iTRAQ Samples—
Forty micrograms of an iTRAQ-labeled mixture was dissolved in 100 µl 0.1% FA, and loaded onto a pipette C18 tip (SPEC, Varian, Lake Forest, CA) equilibrated with 0.1% FA. The absorbed sample was washed with 300 µl 0.1% FA and peptides eluted with 100 µl of 50% ACN containing 0.1% FA and then 100 µl of 80% ACN/0.1% FA. Eluates were pooled and vacuum-dried. To reduce sample complexity prior to tandem mass spectrometry, iTRAQ labeled peptides were fractionated by two-dimensional liquid chromatography consisting of strong cation exchange (SCX) followed by reverse phase liquid chromatography. For SCX, the dried mixture was dissolved in 25 µl buffer A (5 mM KH2PO4 and 25% ACN, pH 3.0) and applied to a polysulfoethyl A column (PolyLC Inc, Columbia, MD), 0.30 x 150 mm, 5 µm, 300Å, using an Ultimate LC system (Dionex, Sunnyvale, CA) with flow rate of 4 µl/min. Buffer B consisted of 5 mM KH2PO4, 25% ACN and 400 mM KCl, pH 3.0. The column was washed with Buffer A for 5 min; then peptides eluted with a gradient of 0–100% Buffer B in 30 min followed by 100% Buffer B for 5 min. Fractions were collected at 4 min intervals and vacuum-dried. For second dimension reverse phase liquid chromatography, SCX fractions of interest were dissolved in 10 µl 0.1% TFA, and 5 µl was loaded onto a 300 µm x 5 mm C18 trap column (Dionex). The trap column was washed with 0.1% TFA. The flow was reversed and connected to a 75 µm x 12 cm column (in-house packed with Magic C18AQ, 3 µm, 200Å, Michrom BioResources Inc., Auburn, CA). Peptides were eluted using a linear gradient of 0–50% ACN in 0.1% TFA at a flow rate of 200 nl/min for 30 min with an infusion of 5 mg/ml alpha-cyano-4-hydroxy cinnamic acid in 50% ACN containing 0.1% TFA at a flow rate of 1.5 µl/min. Fractions were collected every 10 s and spotted onto a 384-well MALDI plate using a Probot microfraction collector (Dionex).
Tandem Mass Spectrometry—
Nanospray LC-MS/MS of tryptic digests was conducted using an LTQ linear ion trap mass spectrometer (Thermo Electron, San Jose, CA) as described previously (28). Tandem MALDI mass spectrometry was conducted using an Applied Biosystems 4800 MALDI-TOF/TOF. MS spectra were acquired in a window of m/z 800–4000 in positive ion reflectron mode. The 15 most abundant precursor ions with a signal-to-noise ratio >10 were selected for top-down MS/MS scans, excluding identical precursor ions contained in adjacent spots from a given LC-MALDI run. MS/MS was conducted using medium collision energy (1 kV) in positive ion mode.
Identification of Sites of Man-6-P Modification—
Affinity purified protein samples (10 µg) were denatured, reduced, and alkylated (27). Samples were buffer exchanged by ultrafiltration to 50 mM ammonium bicarbonate and then digested with trypsin alone or trypsin together with Glu-C (Roche). The Man-6-P-containing peptides were purified on an immobilized sCIMPR microcolumn and deglycosylated with Endo Hf as described previously (29).
Data Analysis—
For analysis of LTQ data, peak lists were generated using the Sequest module of Bioworks 3.3 (Thermo Electron) using previously described parameters (29). For analysis of 4800 data, peak lists were generated using the "Launch Peaks to Mascot" function of the 4000 Series Explorer software (Applied Biosystems) using the following parameters: minimum signal to noise for both MS and MS/MS peaks was 20; minimum peak area of 1000 for both MS and MS/MS; MS spectra range, m/z from 500 to 4000; MS/MS spectra range, m/z from 60 to 20 units below the m/z of the selected precursor ion. Database searches were conducted using a local version of the Global Proteome Machine (GPM) (30, 31) (GPM-USB, Beavis Informatics Ltd., Winnipeg, Canada) using a combined ENSEMBL mouse NCBIm36.42 database (29784 unique sequences) and a database of non-mouse protein contaminants (cRAP, GPM common Repository of Adventious Proteins). Data files generated with a given instrument were searched both individually and together to produce a merged output using the "Mudpit" option. LTQ LC-MS/MS data were analyzed as described (29), and 4800 MALDI-TOF/TOF data were searched using a precursor ion mass error of 100 ppm and a fragment mass error of 0.4 Da. Thresholds for the minimum number of MS/MS fragments were 5 and 15 for the MALDI-TOF/TOF and LTQ data, respectively. For all MS/MS searches, cysteine carbamidomethylation was a constant modification, oxidation of methionine was a variable modification, and one missed cleavage was allowed during the preliminary model development. The threshold used for model refinement was a peptide expectation score of 0.0001. During refinement, deamidation at asparagine and glutamine residues was allowed. For analysis of iTRAQ experiments, modifications were as described above except a fixed iTRAQ modification at lysines and peptide amino termini were specified during preliminary model development and refinement.
The spectral count method (32) was used to estimate the relative abundance of a given protein in two different sample types analyzed using LC-MS/MS. The total number of spectra assigned to a given gene product was tabulated for each type of sample (note that in this analysis, spectral counts assigned to different accession numbers of the same gene, polymorphisms and splice variants, were combined). The proportion of counts in a given sample was calculated as
![]() |
where a and b represent the spectral counts of a given protein in samples a and b, respectively. The statistical tests used are described in (33) unless noted otherwise and computed using the open source language R (R Foundation for Statistical Computing). Upper and lower 95% confidence intervals were calculated using Wilsons method (34) to estimate counting errors. One-sided p values were calculated using the exact binomial test to determine if values were significantly different from 1, 1.5, and 2 (see "Results"). p values were transformed to q values to control for the false discovery rate among multiple comparisons (35). To estimate fold differences, data are presented as log2(a/b), which is formally,
![]() |
For clarity of presentation, for a = 0 or a/b
0.1, the log2(a/b) value was replaced by –3.32 (log2 0.1), while for b = 0 or a/b
10, it was replaced by 3.32 (log210). Fishers exact test was also used to calculate confidence intervals and significance values for odds ratios, estimated as
![]() |
where ai and bi represent the spectral counts of a given protein i in samples a and b, respectively, and at and bt represent the total spectral counts in samples a and b, respectively. Statistics are included in supplemental Table VII.
For calculation of iTRAQ reporter ion ratios, in-house PERL scripts were used to generate a list of identified spectra and associated information from the GPM xml files and to extract relevant peak areas from the Oracle database used by the TOF-TOF for data acquisition. These data were subsequently manipulated using Microsoft Excel. Matrix functions were used to adjust iTRAQ reporter ion peak areas for crossover using correction factors supplied by the vendor. The following inclusion criteria were applied to increase the quality of the data set: 1) only proteins with GPM expectation scores of 10–5 or better were chosen for further analysis; 2) for all comparisons, data were excluded if the average iTRAQ reporter ion intensity was below 10,000 or if any individual intensity was below 1000 (see "Results"); and 3) proteins were required to be represented by at least two independent peptides for inclusion in the final analysis. The log2 values of the remaining peak area ratios were used to calculate a mean and standard deviation for each protein. Students two-sample t test (assuming equal variance) was used to calculate p values, and q values were subsequently calculated as described above.
Western Blot Analysis—
Samples were fractionated by electrophoresis on 10% NuPAGE Bis-Tris precast gels (Invitrogen, Carlsbad, CA). Proteins were transferred to nitrocellulose and baked for 2 h under vacuum. Membranes were probed with a goat antibody against cellular repressor of E1A-stimulated genes 1 (R&D Systems), rabbit antibodies against alpha-mannosidase B1 (kindly provided by Dr. Daulat Tulsiani) (36), tripeptidyl peptidase I (37), Niemann Pick type C2 (38), and cathepsin D (Calbiochem). Bound antibodies were visualized using SuperSignal West Pico chemiluminescent substrate after probing with horseradish peroxidase-conjugated goat anti-rabbit or rabbit anti-goat secondary antibodies.
| RESULTS |
|---|
|
|
|---|
Given these concerns, we chose to identify the spectrum of Man-6-P-containing glycoproteins present in serum of mice lacking either type of MPR and to estimate their relative abundances. The rationale was that, if intracellular lysosomal targeting for a given newly synthesized protein is selectively mediated by a single MPR, it would be selectively secreted in the absence of that MPR and thus may be elevated in blood (15). To this end, we conducted two large scale purification experiments on wt, CD(-), and CI(-) serum, with the overall experimental design shown in Fig. 1. The two purifications served as both biological and technical replicates, using pooled sera obtained from different animals (
100 animals of each genotype were used per purification). In addition, the two purifications were designed to evaluate different experimental factors.
The first large scale preparation was designed to monitor column capacity and specificity of the immobilized MPRs. Mouse serum was applied to two columns connected in tandem, the first containing immobilized sCDMPR, the second containing immobilized sCIMPR. This geometry was chosen because in vitro binding studies indicated that, at submicromolar concentrations, the CIMPR was more effective in capturing Man-6-P glycoproteins than the CDMPR and, thus, the second column could capture ligands that might not be bound by the first (18). After extensive washing, the two columns were disconnected. Each column was mock eluted with 10 mM Glu-6-P, and specifically eluted with 10 mM Man-6-P. Finally, each column was eluted with 100 mM glycine pH 2.5. The material specifically eluted from the columns by Man-6-P was analyzed by 125I-labeled sCIMPR blotting to estimate recovery of Man-6-P glycoproteins. For all sera tested, we found that the first sCDMPR column retained almost all of the Man-6-P glycoproteins (see legend, Fig. 2). Essentially no signal was detected in material eluting with glycine, indicating that Man-6-P was sufficient to elute all Man-6-P-containing glycoproteins (data not shown). Moreover, there were no unique bands apparent from MPR blotting that were selectively retained by the second sCIMPR column (Fig. 2, compare A1 and A1'). In addition, LC-MS/MS analysis on equivalent amounts of protein from each column eluate did not reveal any lysosomal proteins that did not adhere to the sCDMPR column and that were selectively retained by the sCIMPR column (supplemental Table II).
|
10–50 µg/ml serum, representing a purification factor >1000, given a total serum protein concentration of
70 mg/ml. Blotting revealed that, compared with wild-type controls, Man-6-P glycoprotein levels per milliliter of serum were elevated
3 and
5 fold in CD(-) and CI(-) mouse serum, respectively (Fig. 2, middle panel, and supplemental Table III). Note that analysis of unfractionated sera by MPR-blotting reveals
26 kDa and
114 kDa proteins enriched in the CD(-) and CI(-) sera, respectively, as seen previously in MPR-deficient mice (15). While these bands were also observed in the MPR-affinity purified samples, the MPR purification step also allowed visualization of numerous additional Man-P glycoproteins that were previously obscured by albumin and other abundant serum proteins (Fig. 2). The second large scale purification was designed to corroborate the first experiment as well as to help distinguish abundant contaminants from proteins that specifically associate with the immobilized sCDMPR. Here, after application of each type of serum to a single sCDMPR column, a portion of Man-6-P eluate was buffer exchanged by ultrafiltration to remove Man-6-P and repurified using a fresh sCDMPR column. This procedure helped remove some nonspecific contaminants, but comparison of the Coomassie stained gels did not reveal likely candidates that correspond to the selectively elevated Man-6-P glycoproteins identified by blotting (supplemental Fig. 2).
Two-dimensional Gel Electrophoresis Analysis—
In an attempt to identify proteins selectively elevated in the different serum sources, MPR affinity purified material was further fractionated by two-dimensional gel electrophoresis and analyzed by MPR blotting and staining (supplemental Fig. 3). While the overall pattern of stained proteins was remarkably similar in all three types of serum samples, sCIMPR blotting revealed a series of spots that appeared to contain the
26 kDa protein elevated in the CD(-) sample. LC-MS/MS analysis revealed multiple proteins per spot analyzed but clearly indicated that cellular repressor of E1A-stimulated genes 1 (Creg1) was an excellent candidate for the 26 kDa protein (supplemental Table IV). However, a candidate for the
114 kDa protein was not apparent from this analysis.
Spectral Count Analysis—
We also characterized the composition of the different purified samples using LC-MS/MS. For each of the three mouse genotypes, we analyzed tryptic digests of the total material in preparations A1, B1, and B2. The samples from A1 and B2 were also fractionated by SDS-PAGE, and 32 gel slices from each were analyzed following in-gel digestion with trypsin. After excluding likely contaminants such as highly abundant mouse proteins known not to be lysosomal and non-mouse proteins introduced during sample handling, 271 gene products in the mouse database were identified, including 60 known or likely lysosomal proteins (henceforth referred to as lysosomal, see supplemental Table V). Finally, to help identify true Man-6-P glycoproteins from potential contaminants, we digested the MPR-purified proteins with trypsin and isolated and identified Man-6-P glycopeptides as described previously (29). This resulted in the identification of 57 phosphorylation sites on 37 lysosomal proteins (including palmitoyl protein thioesterase 2 and deoxyribonuclease 2, which were not identified in the total purified protein mixture) and 81 phosphorylation sites on 52 additional proteins (supplemental Table VI).
We analyzed the LC-MS/MS results using the spectral count method (32) to identify serum proteins that are specifically elevated in the absence of either MPR. To conveniently visualize fold-differences in the relative abundance of a given protein in two different samples, data are presented as the log2 of the spectral count ratio (see "Experimental Procedures"), with a value of 1 representing a relative 2-fold increase in abundance and a value of –1 representing a relative 2-fold decrease in abundance. Moreover, we can determine confidence intervals based on the total number of spectral counts and the relative ratios (see "Experimental Procedures"). The distribution of all proteins in the different serum samples is shown in supplemental Fig. 4. The relative levels of lysosomal proteins in CD(-) and CI(-) samples are indicated in Fig. 3. We consider proteins to be highly elevated in the CD(-) serum if the relative ratios of both CD(-)/CI(-) and CD(-)/wt are
2 and reach statistical significance (Fig. 3 and supplemental Table VII). The Man-6-P forms of myeloperoxidase (Mpo), Creg1, and tripeptidyl peptidase I (Tpp1) meet these criteria. In addition, two proteins not currently classified as lysosomal, a member of the C-type lectin family 1b (Clec1b) and glutamate-ammonia ligase (Glul), also were identified using this filter. We use the same statistical tests for classifying proteins highly elevated in CI(-) serum using both the CI(-)/CD(-) and CI(-)/wt ratios, and find that cathepsin D (Ctsd), alpha mannosidase 2 B1 (Man2b1), and prosaposin (Psap) meet these criteria. Based on the distribution of spectral counts in the individual excised gel slices, Man2b1 is likely to represent the 114 kDa protein elevated in CI(-) serum that is detected by sCIMPR blotting. We classify proteins as moderately elevated if the ratios are between 1.5 and 2. By these criteria, heparanase (Hpse) was elevated in the CD(-) serum while lysosomal protective protein (Ppgb) and granulin (Grn), a protein currently not classified as lysosomal, were elevated in the CI(-) serum.
|
There are two major potential sources of experimental variation: 1) variation in steps prior to sample pooling (e.g. reduction, alkylation, digestion, and labeling); and 2) variation due to inadequate reporter ion statistics arising from insufficient signal intensity. We have addressed these potential sources of error in two ways. First, analysis of the chicken ovalbumin internal standard added to each sample prior to processing and labeling (see "Experimental Procedures") indicated that these steps did not significantly contribute to experimental variation (supplemental Fig. 5). Second, to establish signal thresholds for inclusion of data, we conducted a control experiment where, before labeling, a portion of each of the three different samples to be analyzed was removed and pooled to create a fourth sample that should contain peptides at levels corresponding to the average of the three original samples if the labeling efficiencies are identical (see "Experimental Procedures"). While the vast majority of MS/MS analyses yielded reporter ions with the expected intensity ratio, there were a number of outliers, particularly at low signal intensities (supplemental Fig. 6). Given this, we filtered the data based on signal threshold as described under "Experimental Procedures," which excluded
5% of the total spectra (supplemental Fig. 6).
The relative abundance of the Man-6-P forms of lysosomal proteins in the three types of serum samples is shown in Fig. 4. As expected, the relative abundance of most lysosomal proteins present in the wild-type serum sample (hatched bars) was lower when compared with samples from the MPR-deficient animals. In contrast, when comparing the CI(-) (open bars) and CD(-) (filled bars) samples, the relative abundance was highly dependent on the individual lysosomal protein analyzed.
|
2-fold elevated in the CD(-) samples while Man2b1, Ctsd, and Psap were
2 fold elevated in the CI(-) samples (Fig. 5). Results from the iTRAQ and spectral count analyses are generally in excellent agreement (Fig. 6 and supplemental Table IX).
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
Spectral Count Analysis—
Our initial data sets were generated using essentially qualitative LC-MS/MS methods. However, we explored the use of the spectral count method (32) in an attempt to abstract quantitative information from these data. There are several statistical methods that could be used to analyze spectral count data (39, 40). We have chosen simply to compare the spectral counts in two specimens of interest without normalization and calculate confidence intervals using the Wilsons method and significance using the exact binomial test with a false discovery rate correction (see "Results" and supplemental Table VII). We also calculated an odds ratio, taking into account the total spectral counts associated with each specimen, and determine confidence intervals and statistical significance using Fishers exact test (supplemental Table VII). For the serum samples, both methods yield similar results, which is expected because the total number of spectral counts were similar in the wt, CD(-), and CI(-) data sets. However, when comparing levels of proteins in Man-6-P and glycine eluates of MPR-purified brain samples, the odds ratio analysis appears to overestimate the abundance of proteins in the glycine sample where there were far fewer total protein assignments (see below).
We chose to merge results from separate MS-based analyses (solution digests and in-gel digests) and purifications (A, B1, and B2, Fig. 1). We think that this is justified as analysis of each individual experiment reveals a similar trend in terms of relative abundance of a given protein in the wt, CD(-), and CI(-) samples (Supplemental Table V). Data pooling tends to average out random variation between experiments and also increases the total number of spectral counts. This is important, as there is considerable inherent uncertainty in the calculated ratios when the spectral counts are low. For instance, for the case where the spectral counts for a given protein in two samples differ by a factor of 2, then one would need at least 42 total spectral counts (a ratio of 28:14 or 14:28) in order to achieve statistical significance (two-sided p < 0.05). (For examples of different simulated ratios, see (40)). Nonetheless, the spectral count method clearly revealed a number of candidates for MPR-selective ligands.
iTRAQ Analysis—
The iTRAQ method produced a high-quality data set after filtering to remove spectra that gave low reporter ion intensities (see "Experimental Procedures" and "Results"). Low ion intensities are expected to yield poorer estimates of abundance due to signal to noise considerations, as has been observed previously (41). In Experiment 1, we had an internal control that could be used to filter the data either independently or in addition to the intensity criteria that we applied. However, this required the use of one of the reporter ions and decreases the numbers of comparisons that can be made in a given iTRAQ experiment, and thus increases the expense and effort associated with a project. However, either filtering criteria generated similar results (data not shown) so we chose to apply intensity thresholds established from analysis of iTRAQ Experiment 1 to filter the data from iTRAQ Experiment 2.
Comparison of Analytical Methods—
The results generated using the iTRAQ and spectral count methods were in excellent agreement. One advantage in the iTRAQ approach is that a single spectrum yields abundance ratios; and in cases where the abundance ratios of different spectra or peptides assigned to a given protein vary, it is possible to inspect each on a case-by-case basis. One limitation was that the total proteome coverage in the LC-MALDI experiments was less than that obtained by LC-MS/MS on unlabeled samples (e.g. only 39 lysosomal proteins were identified in the LC-MALDI/TOF/TOF experiments on iTRAQ labeled samples, two of which did not yield usable iTRAQ ratios, while 60 proteins were found using LC-MS/MS on unlabeled samples.). This in part may arise from oversampling the more abundant proteins at the expense of detecting less abundant proteins. For example, we collected 434 acceptable spectra for murinoglobulin 1 but only 2 for beta-galactosidase. This redundancy might be reduced by implementing improvements in precursor ion selection for MS/MS analysis such as excluding repeated analysis of peptides that elute in multiple ion exchange fractions or limiting the number different peptides analyzed that are derived from a single protein. In addition, we typically only obtain
3 usable MS/MS spectra (identified peptides with acceptable reporter ion statistics) per MALDI spot, and this is open for further optimization.
Despite the difference in protein coverage using LC-MALDI/TOF/TOF on iTRAQ labeled samples and LC-MS/MS on unlabeled samples, both yielded similar quantitative results in terms of discovering true differences in protein abundances. (supplemental Table IX). For instance, a total of 10 lysosomal proteins were identified as having a
1.5-fold difference when comparing the CD(-) and CI(-) samples by the spectral count and/or iTRAQ approaches (one-sided q < 0.025). Eight proteins were identified using both approaches, one was only identified using iTRAQ, and one was only identified using the spectral count analysis. In the latter two cases, both approaches yielded similar trends, but data from only one of the approaches reached the specified significance level (supplemental Table IX).
In addition to the agreement of the two MS-based approaches, these results also were supported by the Western blot analysis. It is important to note that the Western blot experiments were not quantitative, as revealed by the nonlinearity of the response from the different amounts of a given sample loaded on the gel, despite a wide range of exposure times (Fig. 8 and data not shown). This is not simply due to nonlinearity of chemiluminescence and film detection, there are cases where the same amount of a given protein loaded on two different lanes of a gel yields dramatically different signals in Western blotting (42, 43). There are a variety of factors that can account for this, including differences in transfer efficiency of the protein from gel to membrane (42). Our impression is that the MS-based results more accurately reflect the relative abundance of the proteins of interest in the different samples, but further work is required to test this rigorously.
Potential New Man-6-P Glycoproteins and Other Ligands for the MPRs—
In the course of this study, we analyzed MPR-purified preparations from mouse brain and serum. As mentioned under "Results," the brain Man-6-P glycoproteins were affinity purified on sCIMPR or sCDMPR columns, but data from these experiments were not used to classify ligands as CDMPR or CIMPR specific due to the high concentration of immobilized MPRs. In addition, the brain Man-6-P glycoproteins are largely in their processed forms (21, 44, 45), and as processing typically occurs after targeting, these do not reflect the forms that the receptors would encounter in the TGN. This also could affect interaction with the immobilized receptors in different ways. For instance, by 125I-labled sCIMPR blotting, we found that in the analysis of brain extracts, a
46 kDa Man-6-P protein was predominantly located in the glycine eluates from both of the MPR affinity columns. This contrasts markedly with the serum samples, where the levels of Man-6-P glycoproteins detected in the glycine eluate by MPR blotting were extremely low (data not shown). MS analysis indicated that the
46 KDa brain protein represents the mature form of Tpp1 (supplemental Fig. 1 and supplemental Table I). The inactive precursor form of this protein is soluble and can be eluted from MPR affinity columns using Man-6-P (46) but, after low pH-triggered autocatalytic processing, converts to a form that tends to aggregate and absorb to surfaces (46).
Data from the mouse brain and serum analyses can contribute to ongoing studies to identify novel components of the luminal lysosomal proteome (21, 28, 29, 44, 47–53). LC-MS/MS analysis of MPR affinity purified proteins from mouse brain and serum samples have revealed a total of 63 different lysosomal proteins as well as numerous others that are not known to reside in the lysosome (supplemental Tables I and V). The latter may represent Man-6-P modified proteins, proteins that specifically interact with Man-6-P modified proteins (e.g. protease inhibitors and lectins), contaminants, or proteins that interact with a given MPR in a Man-6-P-independent manner (i.e. IGF2). Distinguishing between these possibilities is a formidable challenge. Most known lysosomal proteins were found in both the brain and serum samples (Fig. 9, left). In contrast, of the total 313 different proteins (after excluding likely contaminants) not known to be lysosomal, only 29 were found in both types of samples (Fig. 9, right and supplemental Tables I and V). These proteins are thus worthy of further consideration. Other criteria can also be used for prioritization (discussed in (28)). These include the detection of Man-6-P glycopeptides and enrichment of proteins following two sequential rounds of MPR-affinity purification. When combining these classification strategies, 5 proteins meet all three criteria, 11 meet two of the three criteria, and 68 meet one of the criteria (supplemental Table X).
|
147 amino acids units that are 14–38% identical to each other and that are similar to the 159 residue luminal domain of the CDMPR. The luminal CDMPR monomer contains a single Man-6-P binding site while two of the repeating units on the CIMPR contain the high affinity binding sites (domains 3 and 9) and domain 11 contains the IGF2 binding site. The CIMPR may also bind urokinase-type plasminogen activator receptor at a site distinct from the Man-6-P and IGF2 binding sites (54), but this is controversial (55). Given the multiple domains of the CIMPR, it is possible that there are additional, as yet unidentified, ligands for this receptor. We would expect that these would be enriched in the CIMPR glycine eluate compared with the glycine eluate of the CDMPR column or the Man-6-P eluates of both MPR columns. Interestingly, IGF2 as well as four IGF2 binding proteins were found only in the CIMPR column glycine eluate, and there were a number of other proteins that had similar behavior that may be worthy of further investigation (supplemental Table I).
Relative Functions of the Two MPRs—
The two MPRs are present in a wide variety of vertebrates (56), suggesting that each has a specific and conserved biological function. However, despite over 20 years of investigation (6, 7), the specific role of each MPR in lysosomal protein targeting remains unclear. Studies using mutant mice indicate that animals lacking either the CDMPR (12, 13) or the CIMPR (9) exhibit no apparent health defects, whereas the combined MPR deficiency results in a profound lysosomal storage phenotype and early death (10, 14). This indicates that each MPR can partially compensate for the absence of the other, which complicates determination of MPR-specific functions. Previous studies have directly examined the affinities of the two MPRs for different Man-6-P-containing ligands with the aim of identifying differences that might have functional significance. However, apart from the observations that the CDMPR binds Man-6-P-containing ligands with lower affinity than the CIMPR (18, 57), and that the CIMPR but not the CDMPR can bind mannose 6-phosphodiesters (57), no clear differences were found that suggest specialized biological functions. Other studies have examined Man-6-P glycoproteins secreted from cultured cells lacking either MPR with the rationale that ligands specific to either MPR should be secreted only in the absence of that MPR (16, 17). These studies conclude that, for the most part, the two MPRs direct the lysosomal targeting of distinct but overlapping populations of Man-6-P glycoproteins and that neither MPR can fully compensate for the absence of the other. The CIMPR appears more efficient in targeting newly synthesized lysosomal proteins to the lysosome (17, 58), but this in part may be due to its specific ability to function in endocytosis and thus recapture secreted lysosomal proteins (59). In particular, cathepsin D was preferentially secreted in the absence of the CIMPR (17). However, to our knowledge, no Man-6-P glycoproteins that appear preferentially secreted in the absence of the CDMPR have been identified to date.
One indication that there may be Man-6-P glycoproteins that are specifically targeted by each MPR arose from analysis of MPR-deficient mice (15). Using 125I-labeled sCIMPR as an affinity probe to visualize Man-6-P glycoproteins, bands migrating at
26 kDa and
114 kDa were highly elevated in the CD(-) and CI(-) serum samples, respectively. However, there is some uncertainty in interpretation of these blotting experiments, as an elevated signal could be due to increased protein levels and/or the number of phosphorylated oligosaccharides on the protein. Our quantitative MS-based and Western blotting analyses demonstrate that levels of multiple Man-6-P glycoproteins in serum vary in an MPR-specific manner. Of these, alpha-mannosidase 2B1 is likely to represent the
114 kDa Man-6-P glycoprotein detected in unpurified CI(–) serum, whereas Creg1 is likely to represent the
26 kDa Man-6-P glycoprotein in CD(–) serum.
The molecular basis for the selective secretion of these proteins in an MPR-specific manner remains to be elucidated. It is possible that there is a significant difference in the respective affinities of the MPRs for these proteins. For example, a protein bound with high affinity and targeted efficiently by one MPR but not the other would be predicted to be secreted only in the absence of its high affinity receptor. Proteins that are recognized in a comparable manner by both MPRs would be secreted into the bloodstream to a similar extent in the absence of either MPR, and this appears to be the case with most of the Man-6-P glycoproteins identified. It is possible that both protein and carbohydrate determinants could modulate receptor specificity (58, 60, 61). Another possibility is that serum Man-6-P glycoprotein levels could reflect differences in the cell or tissue distribution of the MPRs and their ligands. For example, a lysosomal protein that is highly expressed in a cell type that predominantly contains a single type of MPR would appear disproportionately elevated in serum in the absence of that MPR. Support for this possibility comes from the observation that, while the MPRs are expressed in most cell types, their relative expression is tissue-dependent, with heart and spleen containing 3–4-fold higher molar levels of CDMPR than CIMPR (62). While the molecular bases for the selective secretion of these various proteins in the absence of either MPR is not yet apparent, we anticipate that the identification of these proteins in this study will provide a useful basis for the future investigation of the relative functions of the two MPRs.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published, MCP Papers in Press, September 11, 2007, DOI 10.1074/mcp.M700217-MCP200
1 The abbreviations used are: Man-6-P, mannose 6-phosphate; MPR, mannose 6-phosphate receptor; CD, cation-dependent; TGN, trans Golgi network; CI, cation-independent; IGF2, insulin-like growth factor II; FA, formic acid; ACN, acetonitrile; SCX, strong cation exchange; wt, wild type; MALDI, matrix-assisted laser desorption ionization; TOF, time-of-flight. ![]()
* This work was supported by National Institutes of Health Grant DK054317 (to P. L.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental Figs. 1–7 and Tables I–X. ![]()
|| To whom correspondence should be addressed: Center for Advanced Biotechnology and Medicine, 679 Hoes Lane, Piscataway, NJ 08854. Tel.: 732-235-5032; Fax: 732-235-5289; E-mail: lobel{at}cabm.rutgers.edu
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
Y. Qian, I. Lee, W.-S. Lee, M. Qian, M. Kudo, W. M. Canfield, P. Lobel, and S. Kornfeld Functions of the {alpha}, {beta}, and {gamma} Subunits of UDP-GlcNAc:Lysosomal Enzyme N-Acetylglucosamine-1-phosphotransferase J. Biol. Chem., January 29, 2010; 285(5): 3360 - 3370. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Song, Y. Lasanajak, L. J. Olson, M. Boonen, N. M. Dahms, S. Kornfeld, R. D. Cummings, and D. F. Smith Glycan Microarray Analysis of P-type Lectins Reveals Distinct Phosphomannose Glycan Recognition J. Biol. Chem., December 11, 2009; 284(50): 35201 - 35214. [Abstract] [Full Text] [PDF] |
||||
![]() |
N. M Dahms, L. J Olson, and J.-J. P Kim Strategies for carbohydrate recognition by the mannose 6-phosphate receptors Glycobiology, September 1, 2008; 18(9): 664 - 678. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Journal of Biological Chemistry |
| Journal of Lipid Research | ASBMB Today |