|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 5:1942-1956, 2006.
© 2006 by The American Society for Biochemistry and Molecular Biology, Inc.
,
,¶



,
,**
From the
Center for Advanced Biotechnology and Medicine and
Department of Pharmacology, University of Medicine and Dentistry of New Jersey, Piscataway, New Jersey 08854 and the || Department of Biochemistry and Molecular Biology, New Jersey Medical School, Newark, New Jersey 01701-1709
| ABSTRACT |
|---|
|
|
|---|
The primary location and site of function of lysosomal enzymes is by definition intracellular. However, lysosomal activities have also been identified in a variety of extracellular environments including plasma (3). These proteins may be delivered to the plasma through leakage from dead or dying cells, through mobilization of secretory lysosome or granule contents, or by the release of lysosomal residual bodies by cell defecation. It is also possible that a portion of the newly synthesized lysosomal proteins escape the intracellular targeting pathway for transport to the lysosome and are secreted. Evidence for the latter is well documented; for example, in cultured mouse embryonic fibroblasts, wild-type cells secrete a small proportion of Man-6-P glycoproteins (
10% of that secreted in the absence of both MPRs, depending on the protein) (47), indicating that the sorting of newly synthesized lysosomal proteins is not absolutely efficient.
The biological significance of circulating lysosomal proteins is unclear, but there are several possibilities. First, some hydrolases may be synthesized and released by one cell type and delivered to another, and so circulating lysosomal proteins may represent intermediates in this process. Second, circulating lysosomal proteins might have a specific function in plasma. Third, the presence of lysosomal proteins in plasma may simply represent the steady state levels reflecting the balance between the rate of unwanted appearance in plasma (due to leakage and/or lack of absolute fidelity in the intracellular targeting system) versus the rate of uptake and clearance.
Regardless of biological function, circulating lysosomal hydrolases potentially represent valuable biomarkers for the study and diagnosis of human diseases. Mutations in genes encoding lysosomal proteins result in over 40 storage diseases (for a review, see Ref. 8). In addition, lysosomal activities have also been indirectly implicated in more widespread pathogenic processes (912) including tumor invasion and metastasis, Alzheimer disease, rheumatoid arthritis, and atherosclerosis as well as in normal processes such as aging (13) and immune system function (14). As a class, lysosomal proteins represent a relatively small subgroup of the plasma proteome that should be amenable to quantitative analysis.
In this study, we set out to characterize the human plasma proteome of mannose 6-phosphate glycoproteins to provide a reference dataset as a diagnostic tool for lysosomal storage diseases that could also facilitate further studies of the role of these proteins in other human conditions. We also viewed plasma as a promising source for the identification of potentially novel lysosomal proteins in our ongoing effort to identify components of the lysosomal proteome (1517).
Given the complexity of the plasma protein population and the wide range of protein abundances, it is not currently feasible to directly detect lysosomal proteins in unfractionated plasma using proteomic methods due to their extremely low abundance compared with classical plasma proteins (18). We have therefore used an approach in which the Man-6-P-containing lysosomal proteins are purified by affinity chromatography on immobilized MPR (1517, 1922). As expected, we identified many lysosomal proteins, but we also purified many proteins that have not previously been assigned either Man-6-P or lysosomal function. Many of these proteins are highly abundant plasma glycoproteins that we initially suspected to represent likely contaminants that do not contain Man-6-P. Therefore, to distinguish between true Man-6-P glycoproteins and contaminants, we used a technique to directly identify Man-6-P-modified glycopeptides. Unexpectedly we found that a small proportion of multiple abundant classical plasma proteins exist as Man-6-P-containing glycoforms.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
2 liters of frozen human plasma were rapidly thawed at 37 °C and then clarified by centrifugation at 13,000 x g for 90 min at 4 °C. The supernatant was filtered through six layers of cheesecloth and then added to an equal volume of ice-cold PBS containing 10 mM ß-glycerophosphate, 10 mM EDTA, 2% Triton X-100, 0.4% Tween 20, and 0.2 mM Pefabloc (Pentafarm, Basel, Switzerland). The diluted plasma was then applied to a column of immobilized pentamannosyl phosphate-aminoethyl agarose (100-ml bed volume) to remove circulating soluble cation-independent MPR (sCI-MPR), and the flow-through was applied to a column of immobilized sCI-MPR (100-ml bed volume at a coupling density of 3.3 mg/ml) at a flow rate of
200 ml/h to capture circulating Man-6-P glycoproteins. The MPR resin was batch washed with 3 x 1 column volume of PBS containing 5 mM EDTA, 5 mM ß-glycerophosphate, 1% Triton X-100 and then with 3 x 1 column volume of the same buffer without Triton X-100. The column was flow-washed with 20 column volumes of PBS/EDTA/ß-glycerophosphate at 125 ml/h and then eluted with 2 column volumes of the same buffer containing 10 mM Man-6-P at a flow rate of 100 ml/h. Fractions containing Man-6-P glycoproteins were identified by protein and ß-mannosidase assay, pooled, concentrated by ultrafiltration (Centriprep-10, Millipore, Billerica, MA), and buffer-exchanged to either 100 mM ammonium bicarbonate (single purification protocol) or PBS/EDTA/ß-glycerophosphate (first elution from the double purification protocol). For repurification, the buffer-exchanged Man-6-P eluate was reapplied to the column of immobilized MPR and purified essentially as described above. The Man-6-P eluate was buffer-exchanged with 100 mM ammonium bicarbonate. Purified proteins were stored at 80 °C, and volatile salts were removed by lyophilization prior to subsequent analyses.
Two-dimensional Gel Electrophoresis
First dimension isoelectric focusing was conducted using pH 37 or 310 non-linear Immobiline DryStrips (Amersham Biosciences), and second dimension gel electrophoresis was conducted using bis-Tris·HCl-buffered (pH 6.4) 10% Duracryl (Genomic Solutions, Ann Arbor, MI) gels run with MOPS running buffer. Transfer to nitrocellulose and detection of Man-6-P glycoproteins using iodinated sCI-MPR were as described previously (15, 22). Fractionated proteins were visualized using either a colloidal Coomassie Blue staining protocol (23) or SYPRO Ruby (Invitrogen).
Tandem Mass Spectrometry
Proteolytic digests of MPR-affinity purified plasma proteins were analyzed by nanospray LC-MS/MS using a ThermoElectron LTQ linear ion trap (ThermoElectron, San Jose, CA) and a Waters Micromass Q-TOF API US (Waters, Milford, MA) mass spectrometer. For both the singly and doubly purified human plasma samples, MPR-affinity purified mixtures were digested with trypsin in solution prior to LC-MS/MS. Forty micrograms of each sample were also fractionated by 1D SDS-PAGE (Novex 10% NuPage gels with MOPS buffer (Invitrogen)), and
30 gel slices were digested with trypsin. For the doubly purified sample alone, 40 µg of purified proteins were fractionated by two-dimensional (2D) gel electrophoresis, and
60 spots were excised and digested with trypsin.
The LTQ was used to analyze digests of all the samples discussed above. Solution digests of each of the singly and doubly purified samples were analyzed in duplicate. Instrument, chromatographic conditions, and generation of peak lists were as described previously (24) except for the analysis of gel slices, which was conducted using a 50-min gradient. For the Q-TOF, tryptic digests of each MPR-affinity purified sample (5 µg) or tryptic digests of 1D gel slices were fractionated using a Micromass CapLC. Peptides were first desalted on a 300-µm x 1-mm PepMap C18 trap column with 0.1% formic acid in HPLC grade water at a flow rate of 20 µl/min. After desalting for 3 min, peptides were back-flushed onto an LC Packings 75-µm x 15-cm C18 nanocolumn (3 µm, 100 Å) at a flow rate of 200 nl/min. For solution digests of MPR-affinity purified proteins, peptides were eluted on a 170-min gradient of 342% acetonitrile in 0.1% formic acid. For digests of 1D gel slices, peptides were eluted on a 30-min gradient of 342% acetonitrile in 0.1% formic acid. The scan time for MS and MS/MS were 1.5 and 2.0 s, respectively. Mass ranges for the MS survey scan and MS/MS were m/z 3001900 and m/z 501900, respectively. The top three multiply charged ions with an MS peak intensity greater than 30 counts/scan were chosen for MS/MS fragmentation with a precursor ion dynamic exclusion of 60 s. Instrument switching from MS/MS to MS mode occurred when either total MS/MS ion counts were over 5000 or when the total MS/MS scan time was over 6.0 s. Raw data were processed by ProteinLynx 2.0 to generate PKL files without peak threshold intensity limits (fast deisotoping) and with automatic charge state determination.
Database Searching
Peptide assignment and protein identification were conducted using a local implementation of X!Tandem Version 2005.12.01 (GPM-USB, Beavis Informatics Ltd., Winnipeg, Canada) to search the human proteome database (the February 2006 assembly of the National Center for Biotechnology Information (NCBI) 35, which contains a total of 33,869 entries representing 30,352 unique sequences) as described previously (24). Database searching using data generated with the Q-TOF was identical except that the parent ion mass error was ±0.5 Da. Methods for analysis of LC-MS/MS data from proteolytic digests of unfractionated mixtures or gel slices and for deglycosylated Man-6-P glycopeptides (see below) have been described previously (24). Data were exported into Microsoft Excel spreadsheets and filtered for threshold significance as described below. To avoid redundancy arising from proteins present in the human proteome database under multiple names and/or accession numbers, a list of ENSP numbers was compiled with all numbers corresponding to a single protein referenced to a single primary number. Protein assignments were compared with this database to ensure that multiple accession numbers representing the same protein were not listed.
Purification and Identification of Proteolytic Peptides Containing Mannose 6-Phosphate
The presence of Man-6-P on MPR-purified proteins was verified using a recently developed protocol for the purification and identification of glycopeptides containing Man-6-P from an unfractionated proteolytic digest of MPR-purified proteins (24).
Criteria for Protein Identification or Peptide Assignment
The GPM protein expectation score is a measure of the confidence of protein identification for which the lower the log value, the smaller the probability that this represents a random match. When identifying proteins in the MPR affinity-purified protein mixtures, the score of highest confidence obtained at random when searching a reversed orientation human proteome was used as the threshold for acceptance of protein assignments. Method-specific thresholds were determined for unfractionated digests (log(e) = 5.1 and 5.4 for the LTQ and Q-TOF, respectively) and for proteins fractionated by 1D PAGE when each gel slice was analyzed individually (log(e) = 4.3 and 4.1 for the LTQ and Q-TOF, respectively). Each threshold was used to filter the respective datasets obtained from searching the forward orientation human proteome. Samples were analyzed using different instruments (LTQ and Q-TOF), were singly and doubly purified, and were prepared for analysis using several different methods (solution digest and gel electrophoresis). Thus, as an additional level of stringency, proteins identified with one approach alone were rejected if the protein assignment scores failed to reach a threshold of log(e) less than 10. For analysis of purified Man-6-P glycopeptides, the threshold for acceptance of peptide assignments to proteins not previously classified as lysosomal was the score of lowest confidence of a deglycosylated peptide assigned to a known lysosomal species (peptide log(e) of 2.3 for ß-glucuronidase) with the added stringency that peptides with log(e) more than 3 were only included if the assignment could be clearly verified following manual inspection.
MS/MS Spectra and General Summary of MS Data
A summary of data obtained with each instrument with different samples is presented in Supplemental Table 1. For each protein identified, the sequence coverage and number of unique peptides assigned are summarized in Supplemental Table 2. xml files for each database search that include protein assignments and MS/MS spectra are also included as supplemental data. (Saved xml data can be viewed on line by uploading to the GPM (human.thegpm.org/tandem/thegpm_upview.html). Experimental parameters for each xml file are summarized in Supplemental Table 3.
| RESULTS |
|---|
|
|
|---|
500 µg/liter, which represents
0.001% of the total starting protein content. Staining of the mixture after fractionation by two-dimensional gel electrophoresis using pH 310 isoelectric gradients revealed a complex pattern of purified proteins with >1500 spots resolved (Fig. 1A).
|
Comparison of Analytical Methods
A number of mass spectrometric approaches were used to characterize the singly and doubly purified mixtures of MPR-affinity purified proteins. All proteins that reached threshold significance (see "Experimental Procedures") are listed in Supplemental Table 2 along with identification statistics achieved with each sample preparation method and mass spectrometric approach. These data are summarized in Supplemental Table 1. Both tryptic digests of the total unfractionated mixtures and gel slices of material fractionated by 1D gel electrophoresis were analyzed. Nanospray LC-MS/MS was conducted using both a linear ion trap (ThermoElectron LTQ) and a quadrupole time-of-flight instrument (Micromass Q-TOF). Results obtained using the different instruments and samples are summarized in Supplemental Table I. The doubly purified material was also further fractionated by 2D gel electrophoresis and analyzed using the LTQ (Supplemental Table 1). For the Q-TOF, we found that prefractionation of the singly purified sample by SDS-PAGE provided more overall unambiguous lysosomal protein identifications than analysis of the unfractionated solution digest. In contrast, prefractionation of the doubly purified sample by SDS-PAGE resulted in fewer lysosomal protein identifications with this instrument when compared with the unfractionated solution digest. This likely reflects the fine balance between sample losses due to 1D SDS-PAGE and inefficiencies in tryptic digestion of proteins in gel slices versus the reduction in sample complexity. The doubly purified sample, being less complex than the singly purified sample, will thus receive less benefit from prefractionation. With the LTQ, prefractionation of either sample by SDS-PAGE resulted in fewer protein identifications than the unfractionated mixture; this probably reflects the faster duty cycle of the LTQ, meaning that sample complexity is less likely to be a limiting factor. Prefractionation by 2D gel electrophoresis resulted in a significant decrease in identification in the number of proteins identified. In terms of LC-MS/MS approach, of a total of 148 unique protein assignments exceeding threshold expectation values, 147 were made with the LTQ compared with 94 with the Q-TOF (Supplemental Table 2).
|
|
|
|
|
The Plasma Proteome Project of the Human Proteome Organization (www.bioinformatics.med.umich.edu/hupo) has cataloged a high confidence (>95%) subset of the total plasma proteome database representing 889 plasma proteins (25). It is worth noting that of the known lysosomal proteins identified in this study, only one (
-glutamyl hydrolase) is currently included in the high confidence dataset. This reflects the relatively low concentration of lysosomal proteins in plasma and the corresponding difficulties associated with their isolation and identification in a highly complex mixture.
Identification of Lysosomal Candidates
One of the aims of this and similar MPR affinity purification studies of the human proteome is to identify novel lysosomal candidates for further study. In our previous analysis of human brain (17), the number of proteins identified that were not classified as lysosomal was relatively small compared with the number of known lysosomal proteins; thus selection of candidates, based upon their respective biological and biochemical properties, was relatively straightforward. In contrast, in our analysis of the unfractionated solution digests of the mixture of proteins purified from plasma, we identified 80 different proteins (excluding contaminant immunoglobulin chains) that are not currently classified as lysosomal (Table II). Here selection of candidates of interest is not straightforward, especially given the significant challenges inherent in distinguishing a subset of plasma proteins with specific properties (e.g. the presence of carbohydrates containing Man-6-P) from highly abundant contaminants. We have therefore developed several experimental and bioinformatic approaches to help in the process of distinguishing the more probable lysosomal candidates from other purified proteins.
One approach is based on the hypothesis that true Man-6-P glycoproteins (and specific contaminants) will be enriched after two rounds of purification compared with a single purification as contaminants are removed. We used the X!Tandem log(I) score (the sum log intensities for all matched MS/MS fragment ions for every peptide assigned to a given protein) from the LTQ analyses of unfractionated mixtures as a measure of protein abundance. As expected, of the 37 known lysosomal proteins identified in both the singly and doubly purified samples, the relative abundance of 35 increased in the doubly compared with the singly purified preparation and is graphically illustrated (Fig. 2A). In contrast, of the 44 proteins that are not currently classified as lysosomal that were found in both the singly and doubly purified samples, only 27 were enriched after two rounds of purification (Fig. 2B). Sixteen proteins were identified only in the doubly purified sample. Based on our observations with known lysosomal proteins, we predict that most of the remaining 37 that were either depleted or not found after the second round of purification are likely to represent nonspecific contaminants.
|
Although a comparison of the intensity scores obtained for individual proteins after a single versus double purification may provide some clues toward the identification of true Man-P glycoproteins, this is an indirect approach that cannot differentiate between true Man-6-P glycoproteins and specific contaminants. In addition, low abundance and/or sticky lysosomal proteins might be lost due to the increased sample handling. As an alternative experimental classification approach, we have recently developed an affinity purification protocol for the isolation of Man-6-P glycopeptides in a form suitable for identification by LC-MS/MS, allowing direct verification of the presence of Man-6-P (24). Briefly peptides containing Man-6-P from a proteolytic digest of a reduced and alkylated preparation of purified Man-6-P glycoproteins are purified by microscale MPR affinity chromatography, deglycosylated, and identified by LC-MS/MS. The presence of Man-6-P is verified when a characteristic mass increment resulting from deglycosylation with endoglycosidase H (N+203) can be assigned in the context of an N-linked glycosylation motif (NX(S/T)).
Forty-two Man-6-P glycopeptides were assigned to 28 of 44 (
64%) of the known lysosomal proteins found in human plasma (Table III and Fig. 2A). Most of these peptides have been confirmed to contain Man-6-P in the analysis of lysosomal proteins from human brain (24), although a novel Man-6-P site was found here for di-N-acetylchitobiase, and a site was also identified for CPVL, a putative peptidase not identified in brain. In brain, we identified Man-6-P glycopeptides corresponding to 39 of 50 (78%) of the lysosomal proteins identified in the total mixture. This is a greater proportion than observed for plasma and may reflect the greater complexity of the Man-6-P glycopeptide mixture from this source.
Sites for Man-6-phosphorylation were also identified in 28 proteins that were not previously assigned lysosomal function (Table III). Similar to the set of known lysosomal proteins identified in plasma, these plasma proteins were enriched in the doubly purified sample. A number of these proteins (acid sphingomyelinase-like protein 3a (SMPDL3A), biotinidase (BTD), cellular repressor of E1A-regulated gene transcription (CREG1), mammalian ependymin-related protein (EPDR1), prostaglandin-H2 D-isomerase (PTGDS), and RNASET2) were also previously found to contain Man-6-P in human brain (24). However, an intriguing observation was the identification of Man-6-P glycopeptides derived from a large number of classical plasma proteins. Some of these proteins, e.g. haptoglobin, transferrin, hemopexin, and
2-HS-glycoproteins, are extremely abundant plasma species in the 0.53g/liter range that we originally considered to be likely nonspecific contaminants. Classical plasma glycoproteins for which we identified Man-6-P isoforms in this study can be classified into several broad categories.
Protease Inhibitors
One striking observation was the number of protease inhibitors that were purified from human plasma by MPR affinity chromatography (Table IV). In total, 21 protease inhibitors or proteins containing protease inhibitor domains were found, representing 26% of the proteins identified that have not been assigned lysosomal function, and of these, eight are members of the SERPIN family. Given that many lysosomal Man-6-P glycoproteins are proteases, we previously suggested that such protease inhibitors probably represent specific contaminants (17). This is clearly the case with some, e.g. cystatins, which cannot contain Man-6-P as their primary sequence lacks the consensus motif for N-linked glycosylation. However, the presence of Man-6-P is demonstrated in eight of the protease inhibitors found in the mixture of MPR-purified proteins (as well as in thyroxin-binding globulin (SERPINA7), which was not identified with threshold confidence in the analysis of the unfractionated solution digests), indicating that there are glycoforms of these proteins that do contain Man-6-P. Interestingly previous studies have demonstrated the presence of Man-6-P on several proteins containing the SERPIN domain that are secreted by the uterus of pregnant sheep and pigs including uterine milk proteins and uteroferrin-associated protein (26, 27). In addition, another member of the SERPIN family, C1 esterase inhibitor, may also contain Man-6-P under some circumstances (28).
Plasma Transporters
Vitamin E-binding protein afamin and metalloproteins ceruloplasmin (CP) and serotransferrin are all plasma transporters that appear to contain Man-6-P. Haptoglobin and hemopexin scavenge free hemoglobin from the bloodstream, and both were also found to contain Man-6-P. Other transport proteins found here include
1-acid glycoprotein (ORM1 and ORM2) and fatty acid-binding protein 5.
Enzymes
Twenty-two of the proteins that are not classified as lysosomal have known enzymatic function or may be predicted to have catalytic function based on sequence homology with known enzymes. This category includes a number of abundant plasma glycoproteins such as CP, cholinesterase (BCHE), BTD, and PTGDS and complement factors (e.g. C2, C3, and BF) as well as a number of less abundant species that are not found in the Plasma Proteome Project high confidence dataset. The relatively high prevalence of proteins with enzymatic function probably reflects the inclusion in this category of a number of novel lysosomal hydrolases.
Circulating Glycoproteins of Unknown or Unclear Function
Man-6-P glycopeptides were assigned to a number of plasma proteins of unknown or unclear function including
1-acid glycoprotein 2 (ORM2),
1B-glycoprotein (A1BG), leucine-rich
2-glycoprotein (LRG1), and zinc-
2-glycoprotein (AZGP1).
| DISCUSSION |
|---|
|
|
|---|
There are several approaches for proteomic profiling (29) that could be applicable to plasma Man-6-P glycoproteomes. One of the more frequently utilized methods is comparative two-dimensional gel electrophoresis, and in this study, we generated a 2D map of plasma Man-6-P glycoproteins (Supplemental Fig. 1 and Supplemental Table IV) as a potential clinical reference. Two problems were immediately apparent with this approach. First, LC-MS/MS analysis of even well resolved 2D gel spots (e.g. Supplemental Fig. 1 and Supplemental Table IV, gel spots 58) revealed a mixture of proteins; thus a deficiency in any given protein could potentially be difficult to identify. Second, limitations in the amount of clinical samples available, especially in young children undergoing diagnosis, would greatly limit use of the 2D approach. In terms of yield, we obtained
200 µg of Man-6-P glycoproteins/liter after a double purification; thus in a clinical setting, yields of
1 µg/5 ml of plasma could be achievable with appropriate method development. With such amounts, most of the individual spots on a stained 2D gel would be below the limits of detection. Sensitivity could be increased by visualization of Man-6-P glycoproteins using a labeled MPR probe (22), but this would result in a decrease in resolution (for example, in Fig. 1, compare A with B and C with D). Therefore, although the comparative profiling of plasma Man-6-P glycoproteins by 2D gel electrophoresis might have investigational applications, these issues combined with the technical expertise required suggest that this approach is probably unsuitable as a widely applicable method for the diagnosis of lysosomal storage diseases.
An alternative approach that may overcome the drawbacks of 2D gel electrophoresis is quantitative mass spectrometry for the comparison of Man-6-P glycoproteins from patients and normal reference controls (30). In this way, Man-6-P glycoproteins that are deficient as a consequence of a lysosomal storage disease may be readily identifiable, and a preliminary diagnosis may be made that can be verified using standard biochemical and/or DNA-based tests. This approach is more specific than comparative 2D gel electrophoresis and may be compatible with the projected yields of Man-6-P glycoproteins from clinical samples. Quite apart from its utility in the diagnosis of storage disorders, mass spectrometry-based clinical methods could also prove useful in investigating the role of lysosomal proteins in more widespread human diseases, including cancer, where alterations in lysosomal activities have been implicated. In addition, such approaches could also prove useful in evaluating the potential of lysosomal proteins as plasma biomarkers of prognostic or diagnostic value. Implementation of quantitative methods (e.g. isobaric peptide tagging (31)) combined with the enrichment methods outlined in this study may be useful in this respect.
Another aim was to identify Man-6-P glycoproteins for further study that could potentially represent new lysosomal proteins. A large number of proteins were identified in this study that have not been assigned lysosomal function so we have used several criteria to filter the data to identify the most promising potential candidates. The first criterion is that a candidate should be enriched after a second round of purification reflecting the loss of nonspecific contaminants. The second criterion for a candidate is that it should either be or be predicted to be a hydrolase like most soluble lysosomal proteins. Forty-one of 44 lysosomal proteins found in plasma here fall into this class of enzymes. The third criterion is that a candidate should not represent an abundant plasma protein. Lysosomal proteins are low abundance constituents of plasma, and this is reflected by the fact that only one of the known lysosomal proteins found here is currently represented in the Plasma Proteome Project high confidence dataset (25) of plasma proteins (Table I). Categorization of both plasma lysosomal proteins and proteins not assigned lysosomal function with respect to these three criteria is illustrated in Fig. 3. Of the known lysosomal proteins, most (38 of 44) are hydrolases that are absent from the Plasma Proteome Project and that are enriched by two rounds of affinity purification. Based upon the assumption that novel lysosomal proteins will have properties similar to the known lysosomal proteins, we were able to identify nine highly promising candidates. These are summarized in Table V. Most of these proteins, i.e. the two phospholipase-like hypothetical proteins, SMPDL3A, plasma
-fucosidase (FUCA2), and several ribonucleases, have homology to known lysosomal proteins and are thus particularly promising. EPDR1 and CREG1 are also worth further investigation as they are low abundance, confirmed Man-6-P glycoproteins for which database searching reveals no similarity to other proteins that might provide clues to function. However, it should be emphasized that the objective of this and other large scale proteomic studies is to identify candidates of interest that will require further extensive characterization at the functional and compartmental levels before their actual cellular roles can be precisely ascertained.
|
1-acid glycoprotein, ceruloplasmin, and haptoglobin. It is possible that detection of Man-6-phosphorylated glycoforms of these proteins may arise from a lack of absolute fidelity that occurs in biological processes (see below) and may result in a very small proportion of the proteins receiving Man-6-P. Thus, it is worth considering the relative amount of a given protein that is in the phosphorylated form in plasma. We previously used enzyme activity measurements and MPR affinity chromatography to estimate that, for any given lysosomal protein, 550% of the total enzyme activity present in plasma was in the Man-6-phosphorylated form (16). Here after two rounds of purification, we obtained
200 µg of Man-6-P glycoproteins/liter of plasma, representing 132 different proteins with an average yield of
1.5 µg/liter for each individual protein, which is consistent with the predicted concentrations of lysosomal proteins containing Man-6-P in plasma. For example, total
-glucosidase is normally present at a median concentration of 17 µg/liter of plasma (32); thus purification of 1.5 µg/liter would correspond to 9% in the Man-6-P-containing form. In accordance, we previously determined by enzymatic assay that 28% of
-glucosidase in plasma actually contains Man-6-P (16). In contrast, we estimate that the proportion of the individual classical plasma proteins that contains Man-6-P is much lower. The average concentration of the classical plasma proteins purified here in Man-6-P glycoforms is
500 mg/liter (Table VI); thus the average proportion of each that contains Man-6-P would be 1.5 x 104% of each respective total; this is many orders of magnitude lower than that observed for known lysosomal proteins.
|
In addition to receiving the Man-6-P modification, isoforms of non-lysosomal proteins must also escape the cellular pathway for targeting to the lysosome to allow secretion or must be delivered into the plasma by some other mechanism (see the Introduction). Given that they may be Man-6-phosphorylated aberrantly, these proteins could also represent low affinity ligands for the MPRs that are not efficiently recognized and targeted to the lysosome but that are instead secreted. However, the high concentration of MPR used in the affinity chromatography would allow such proteins to be purified. For example, cathepsin L is a protein that binds to the MPR with relatively low affinity and, as a result, is largely secreted in cell culture (35, 36), but it is still purified by MPR chromatography (15, 17, 21).
Irrespective of biological significance and source, the observation of Man-6-P glycoforms of abundant classical plasma proteins has important implications for the increasing number of studies directed toward the characterization of lysosomal proteomes using MPR affinity chromatography (15, 17, 19, 20, 37). Although the Man-6-P modification is suggestive of lysosomal localization and can provide a valuable clue toward novel candidates, its detection does not obviate the necessity for detailed characterization of subcellular localization and functional analyses. Regardless the ability to detect and quantify a small and distinct subset of plasma proteins with both established and proposed biomedical importance is likely to have valuable future application in the discovery of useful biomarkers for human disease.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published, MCP Papers in Press, May 17, 2006, DOI 10.1074/mcp.M600030-MCP200
The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact.
1 The abbreviations used are: Man-6-P, mannose 6-phosphate; MPR, Man-6-P receptor; CI-MPR, cation-independent MPR; sCI-MPR, soluble cation-independent MPR; HS, Heremans-Schmid; bis-Tris, 2-[bis(2-hydroxyethyl)amino]-2-(hydroxymethyl)propane-1,3-diol; 1D, one-dimensional; 2D, two-dimensional; GPM, Global Proteome Machine; BTD, biotinidase; PTGDS, prostaglandin-H2 D-isomerase; CP, ceruloplasmin; CPVL, carboxypeptidase vitellogenic-like; SMPDL3A, acid sphingomyelinase-like protein 3a; CREG1, cellular repressor of E1A-regulated gene transcription; EPDR1, mammalian ependymin-related protein. ![]()
* This work was supported by National Institutes of Health Grants DK054317 and S10RR017992 (to P. L.) and NS046593 and NSF0100831 (to H. L.). ![]()
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. ![]()
¶ To whom correspondence may be addressed: Center for Advanced Biotechnology and Medicine, 679 Hoes Lane, Piscataway, NJ 08854. E-mail: sleat{at}cabm.rutgers.edu
** To whom correspondence may be addressed: Center for Advanced Biotechnology and Medicine, 679 Hoes Lane, Piscataway, NJ 08854. E-mail: lobel{at}cabm.rutgers.edu
| REFERENCES |
|---|
|
|
|---|
-Glucosidase and N-acetylglucosamine-6-sulphatase are the major mannose-6-phosphate glycoproteins in human urine.
Biochem. J.
324, 33
39[Medline]
-glucosidase protein: evaluation as a screening marker for Pompe disease and other lysosomal storage disorders.
Clin. Chem.
46, 1318
1325
-2 HS glycoprotein during the inflammatory process: evidence that
-2 HS glycoprotein is a negative acute-phase reactant.
J. Clin. Investig.
64, 1118
1129[Medline]
1-microglobulin. Purification procedure, chemical and physiochemical properties.
J. Biol. Chem.
252, 8048
8057