|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 4:1459-1470, 2005.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
2,000 proteins per cell group by over 12,000 peptides with mass deviations of less than 4.5 ppm. Datasets obtained by LTQ FT analysis provided a significant increase in the number of proteins identified and greater confidence in those identifications and improved reproducibility in replicate analyses. Because CD45 and not CD45+ cells are able to regenerate functional pancreatic islet cells in a mouse model of type I diabetes, protein expression was further compared by a subtractive proteomic approach in search of an exclusive protein expression profile in CD45 cells. Characterization of the proteins exclusively identified in CD45 cells was performed using gene ontology terms via the Javascript GoMiner. The CD45 cell subset readily revealed proteins involved in development, suggesting the persistence of a fetal stem cell in an adult animal.
Due in part to the dynamic range challenge, recent proteomic studies have profiled proteins expressed in cellular organelles rather than whole cell extracts. By reducing initial mixture complexity, investigators have been able to identify lower abundance proteins involved in nuclear pore trafficking (5), centrosome function (6), and chromatin organization (7, 8) increasing our knowledge of basic cellular physiology. Because mass spectrometers are generally poor at measuring quantitative differences in peptide concentration purely by ion intensity, a number of methods have been developed to measure biological changes between samples. These include a diverse set of labeling methods including stable isotope labeling with amino acids in cell culture (SILAC)1 (9) and the ICAT (10) strategies. In addition, investigators have also proposed non-stable isotope-based methods to profile protein expression differences between complex mixtures including several based on peptide counting or protein coverage (1113). In general, relative extent of protein coverage by detected peptides scales with the expression level of the observed protein.
Recently investigators have utilized subtractive proteomics to identify proteins from cellular organelles by subtracting out common protein contaminants. In one elegant example, Schirmer et al. (14) utilized a subtractive proteomic technique to identify nuclear envelope proteins with possible disease links. Thirteen known and 67 potentially new integral nuclear membrane proteins were described by removing common nuclear membrane preparation contaminants. Another study identified candidate substrates for sumoylation in yeast via large scale subtraction of proteomic datasets (15). Subtractive proteomics relies on the exclusive or differential detection of peptides from a given protein during a shotgun proteomic experiment. As might be expected, the effectiveness of subtractive proteomics is greatly influenced by the sampling rate of peptides in the complex mixture and the number of unique peptides identified for each protein. Characterization of the proteome of interest by subtractive proteomics depends primarily on obtaining a pure (approaching 100%) protein extract of at least one of the two samples to be compared.
In the present studies, we performed subtractive proteomic analysis with a new (LTQ FT) and more traditional (LCQ Deca XP) instrumentation to characterize two populations of cells from mouse spleen. The LTQ FT is a novel hybrid mass spectrometer consisting of a linear ion trap and an FT ICR mass analyzer. Conventional three-dimensional (spherical trap) ion trap mass spectrometers exhibit lower mass accuracy and limited ion trapping efficiency. In contrast, the linear (two-dimensional) ion trap of the LTQ has a relatively high ion capacity, a feature that directly results in improved dynamic range. In addition, improvements to the ion source and transfer optics have allowed for increased sensitivity. Huge increases in both resolution and mass accuracy of precursor ions are achieved via mass analysis in the ICR cell (16, 17). With the improved mass accuracy provided by the ICR cell and the substantial increase in scan rate for the LTQ ion trap, this hybrid instrument has the potential to significantly improve shotgun analysis of very complex protein samples.
Protein mixtures were generated by whole cell fractions prepared from CD45 and CD45+ spleen cells. These cell populations were chosen due to the observation that CD45 cells were able to regenerate functional pancreatic islet cells in a non-obese mouse model of type I diabetes (18). Because the CD45 spleen cell population also contains the stem cells responsible for the regeneration, the stated goal of these experiments was to identify proteins expressed exclusively in CD45 cells through subtractive proteomics of the CD45+ cell fraction. We define subtractive proteomics as the set of proteins identified exclusively by at least two unique peptides in each cell population. When replicates are considered an exclusively identified protein should be identified by at least two unique peptides in any of the replicate studies but not in any of the replicate control samples. The assumption is that the exclusive expression of a protein in one cell population would result in the exclusive detection of peptides from that protein.
After LC-MS/MS analysis using identical chromatographic conditions on both instruments, proteins were identified by database searching using the SEQUEST algorithm (19). Proteins from CD45 and CD45+ cells were compared by a subtractive proteomic approach in search of exclusive expression in CD45 cells. To our knowledge no studies have utilized this subtractive proteomic approach to identify differences between complex tissue samples. Our goal was to evaluate proteins involved in the regeneration of pancreatic islet cells by subtractive proteomics with new and exciting instrumentation.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
Isolation of CD45 and CD45+ Cell Populations from C57B1/6 Mice
Splenoctyes from seven mice were harvested from spleen tissue mechanically disrupted with forceps. Following the lysis of red blood cells (140 mM ammonium chloride in 100 mM Tris buffer, pH 7.5) CD45+ and CD45 spleen cells were separated using mouse-specific CD45 MicroBeads (Miltenyi Biotec, Auburn, CA) according to the manufacturers instructions. Briefly pooled splenocytes were counted, and 107 cells were resuspended in 10 µl of CD45 MicroBeads in 90 µl of 1x PBS, pH 7.2 and incubated for 15 min at 4 °C. Cells were washed with 1x PBS, centrifuged at 1,800 rpm for 5 min, and diluted in 3 ml of 1x PBS. Labeled cells were placed in a SuperMACS column in the magnetic field and washed three times with 3 ml of 1x PBS to recover the unlabeled, negatively selected CD45 cells. The column was removed from the magnetic field, and the positively selected CD45+ cell fraction was collected in 5 ml of 1x PBS.
Protein Preparation, Separation, and In-gel Digestion
Whole cell fractions were prepared from either pooled CD45 or CD45+ cell populations using RIPA buffer (1x phosphate-buffered saline (Invitrogen), 0.5% deoxycholic acid, 1% Nonidet P-40, 0.1% SDS, 0.02 mM phenylmethylsulfonyl fluoride, 2 mM dithiothreitol, and 20 mM sodium orthovanadate (Sigma)) and stored at 80 °C. Protein concentrations were determined by the Bio-Rad DC protein assay (Bio-Rad), and a total of 1 mg of protein was resolved by one-dimensional polyacrylamide gel electrophoresis using a 412% bis-Tris gel on a Novex Mini-Cell (Invitrogen). Proteins prepared from CD45 and CD45+ cells were separated by molecular weight to
5 cm from the origin in independent 1-well Invitrogen gels using 1x MES, SDS running buffer. Gels were removed from the cassette, stained with 0.1% Coomassie Brilliant Blue R250 (Pierce) for 2 min, and destained overnight in a solution of 10% acetic acid and 30% methanol. Gel bands were then excised and used to prepare 14 independent in-gel trypsin (Promega, Madison, WI) digests for each gel as described previously (20). Peptides were extracted by washing gel pieces two times for 20 min at room temperature with a solution containing 5% formic acid and 50% acetonitrile, then dried to complete dryness by vacuum concentration, and stored at 20 °C until analysis by mass spectrometry.
Peptide Sample Preparation
Prior to analysis, peptide samples were prepared by using in-house nanocolumns to remove excess salt and particulates. Nanocolumns were constructed using Eppendorf GeloaderTM tips (Brinkmann Instruments) pinched 1 mm from the end and filled to 1.5 cm with Oligo R3 reverse phase packing resin (PerSeptive Biosystems, Framingham, MA) in 100% 2-propanol with the aid of a 1-ml syringe with a flow of 1 µl/min without drying the resin. The column was washed with 20 µl of 2-propanol and 40 µl of elution buffer (97.4% acetonitrile, 2.5% H2O, and 0.1% formic acid) and conditioned with loading buffer (98% H2O, 0.1% TFA, and 2% acetonitrile). Peptide samples were diluted in 30 µl of loading buffer, incubated for 10 min at room temperature, and loaded onto the column. The column was washed with 40 µl of 0.1% TFA, and peptides were eluted with 2030 µl of elution buffer. Samples were vacuum-dried and reconstituted in 100200 µl of sample buffer A (95% H2O, 5% acetonitrile, and 0.5% formic acid), and 2% was loaded via autosampler for MS analysis.
LCQ Deca-XP Mass Spectrometry
LC-MS/MS was performed using an LCQ Deca XPPlus ion trap mass spectrometer (ThermoElectron, San Jose, CA). Samples were autoloaded (Famos autosampler, LC Packings, Sunnyvale, CA) to a 125-µm-inner diameter fused silica C18 capillary column packed to 14 cm with Magic (Michrom BioResources) C18 resin (200-Å pore size, 5-µm diameter) using an Agilent 1100 series binary pump with an in-line flow splitter. Peptides were loaded onto the column for 15 min at 120 bars in buffer A (2.5% acetonitrile and 0.15% formic acid). Peptides were then resolved by applying a gradient of 533% buffer B (97.5% ACN and 0.15% formic acid) for 55 min at 60 bars. Five MS/MS spectra were acquired per cycle in a data-dependent manner (2).
LTQ FT Mass Spectrometry
LC-MS/MS was performed using an LTQ FT hybrid linear ion trap FT ICR MS system (Thermoelectron, San Jose, CA) in similar fashion to the XP with slight modifications. All aspects of the microcapillary separation were identical including the column, autosampler, sample amounts, HPLC pumps, and HPLC gradient formation. Within the LTQ, 10 MS/MS spectra were acquired per cycle in a data-dependent manner from a preceding FT scan (4001,800 m/z at a resolution setting of 105) with an automatic gain control (AGC) setting of 2 x 106. Charge state screening was used such that singly charged peptides were not selected, and a threshold of 500 counts was required to trigger MS/MS spectra. Where possible, the instrument operated in a parallel processing mode where the LTQ and ICR cell were both detecting ions.
Database Searching
Raw MS/MS data were searched against the mouse NCBI non-redundant database with no enzyme constraint using SEQUEST (19) (version 27, revision 9). Parameters included a precursor mass tolerance of 1.08 and 2.0 Da for LTQ FT and XP data, respectively. Fragment ion tolerance was set at the default, and dynamic modification to methionine (+15.9949) was allowed. Cysteines were searched with a static modification (+71.0370). Only fully tryptic peptides were considered for further processing with Xcorr and mass accuracy thresholds as described in the text. A simple target/decoy database approach was used to estimate false positive rates through distraction of random hits and to establish threshold criteria (3) such that <1% false positives were included in the peptide list. An Excel spreadsheet file is available (see supplemental tables) that contains all MCP-required information concerning peptide identifications for more than 30,000 identified peptides.
| RESULTS |
|---|
|
|
|---|
|
570 FT-MS spectra and
5,700 MS/MS spectra from a single gel slice over a 55-min gradient. From the example shown in Fig. 2A, 1,258 unique peptides (421 proteins) were identified (gel slice 7). In contrast, the analysis on the XP acquired five MS/MS scans/cycle with an average scan time of 7.3 s. Approximately 450 MS and 2,300 MS/MS spectra were acquired per gel slice. In the example shown, 442 unique peptides (155 proteins) were identified from gel slice 7.
|
|
|
10,000 peptides had mass accuracies of <1 ppm (Fig. 4B). The distribution of all peptides identified within 10 ppm is shown in Fig. 4B. To contain >99% of the correct answers, 4.5 ppm was used as the final cutoff for correct matches. The average absolute mass deviation was 0.84 ppm for all accepted peptides. It should be noted that only a minimal XCorr cutoff (1.0) was needed when both a tryptic peptide requirement and mass accuracy were used. When a mass accuracy cutoff was not used to identify correct answers, we were required to increase both XCorr and
Corr to maintain a false positive rates of <1%, significantly reducing the total number of peptides and proteins identified (Fig. 4B, inset).
|
10% increase in the number of overlapping proteins between cell groups when using the LTQ FT. In an analysis of the most abundant differences between cell groups (excluding one-hit proteins), relatively few proteins were identified exclusively in CD45+ samples by both the LTQ FT (31 proteins) and the LCQ XP (23 proteins) likely due to the higher purity of that preparation (Fig. 3C). Although the potential biological significance of the differences in proteins identified in spleen cells will be presented elsewhere, a preliminary description of the 220 and 31 most abundant proteins exclusively identified in CD45 and CD45+ cells, respectively, is presented in Supplemental Tables 1 and 2.
|
30% were also identified by subtractive analysis in experiment 2 (XP2-CD45). Exclusive CD45 cell proteins identified by only one peptide were observed in a replicate analysis only 17% of the time (59 of 340 proteins) with many of the proteins not identified in either CD45 or CD45+ cells. A similar comparison of LTQ FT results showing exclusive CD45 cell proteins identified in experiment 1 (FT1-CD45) with results obtained in experiment 2 (FT2-CD45) showed improved reproducibility and many more proteins identified (Table III). As expected, reproducibility decreased with decreasing number of peptides identified. Fig. 6 shows a partial list of exclusive CD45 cell proteins characterized by replicate LCQ XP and LTQ FT studies.
|
|
|
|
90% of the proteins identified by subtractive proteomic analysis. A partial list of spleen cell proteins described by seven different gene ontology terms (cell adhesion, cell cycle, development, DNA binding, extracellular space, integral to membrane, and signal transduction) that may identify relevant CD45 spleen cell proteins is shown in Supplemental Table 3. Because we were investigating the ability of CD45 cells to regenerate pancreatic tissue, we focused on proteins involved in development and other related gene ontology terms. Of the 126 exclusive proteins 24 were identified as being derived from platelet or blood cells and were excluded from further gene ontology characterization. Of the remaining 112 exclusive proteins, 49 were characterized by the gene ontology terms described above, and 14 proteins not associated with any gene ontology terms are also described (Supplemental Table 3). The remaining 49 proteins were not identified by any of the seven gene terms of interest. Of particular interest for future analysis and characterization are seven proteins involved in development. Another interesting observation is that
50 of the proteins described in Supplemental Table 3 (shaded proteins) have been observed previously in mouse tissue at various stages of development, and many proteins are involved in spermatogenesis and other biological processes relevant to stem cell biology (www.informatics.jax.org/). A list of all the gene ontology terms associated with the 126 exclusive CD45 proteins is shown in Supplemental Table 4. | DISCUSSION |
|---|
|
|
|---|
2,000 proteins per preparation from more than 12,000 peptides by LTQ FT analysis and, by comparison,
1,000 proteins (
3,700 peptides) by LCQ XP analysis. Each of these peptide datasets had a false positive rate estimated to be <1% based on a target/decoy database searching strategy (3). A subtractive proteomic comparison of proteins expressed by CD45 and CD45+ cells was also improved by replicating the entire experiment. The field of quantitative proteomics provides a discipline for differential protein expression profiling. Stable isotope labeling is the surest way to precisely measure differences in protein abundance. However, the object of some proteomic experiments is to identify proteins that are exclusively expressed in one state but not the other. Two recent studies have provided useful experience with subtractive proteomics as a quantitative proteomic technique (14, 15). Our interest was in finding stem cell-specific proteins that were exclusively expressed in the CD45 cell population. Some evidence of the usefulness of the technique comes from the finding of red blood cell- and platelet-specific proteins exclusively in the CD45 cells. This is because a very small amount of red blood cells and platelets are not completely removed during cell preparations and remain within CD45 cell populations. Twenty-four proteins of 126 from subtracted CD45 datasets were either red blood or platelet cell proteins.
We believe the idea of using shotgun sequencing of whole proteomes followed by subtractive analysis will be useful for many specific proteomic applications because it (i) can be used on primary tissue, (ii) makes use of mature technologies (peptide sequencing) that require little refinement, (iii) can be improved by simple replication of the experiment to provide more significant differences, and (iv) relies on instrumentation that now can provide much deeper analysis of protein sequences because of increased scan rates and mass accuracy.
One stated purpose of this project was to initiate the protein profiling of a newly identified stem cell population in the spleen of adult mice (18). This stem cell population is contained within the capsular regions of the CD45 regions of the spleen and has been proposed to be a remnant of an embryonic stem cell region called the aorta gonad mesoderm (23). Mass spectrometry analysis of the CD45 cell subset readily revealed proteins involved in development, indicative of the persistence of a fetal stem cell in an adult animal. Subtractive proteomics showed development-specific proteins that control the formation of the fetal nervous system (cerebellum, neurogenesis, and axon guidance axonogenesis), blood vessels, muscle, skin, and gonads (gametogenesis and spermatogenesis). Although many of the proteins exclusive to CD45 samples had unknown function, the majority are abundantly expressed on day 1011 of gestation (Supplemental Table 1). Therefore, this preliminary analysis of the CD45 fractions of the spleen is consistent with a stem cell population that might represent a frozen fetal cluster of a midstage murine embryo (24). These interesting candidate proteins were (i) identified by two or more unique peptides, (ii) exclusive to CD45 samples, (iii) detected by LTQ FT analysis, and (iv) consistent across replicate studies. These results indicate that the improved performance of the LTQ FT significantly aided our ability to characterize complex protein mixtures by shotgun proteomics. Our primary focus for future analysis will be proteins implicated in development and exclusive integral plasma membrane proteins that maybe used to more specifically isolate potential stem cells from the CD45 cell population.
Shotgun proteome analysis by LC-MS/MS with new instrumentation provided significantly larger datasets for characterizing protein expression differences. Yet a number of concerns remain in particular with respect to the reproducibility of the results and with data processing. The reproducibility was considerably improved by LTQ FT analysis (Table III) but remained low for the majority of the proteins, which were typically identified by five or less peptides. This can be somewhat misleading for a truly subtractive approach. It is likely that many proteins are expressed differentially between the two populations but not exclusively. If a 10-fold difference in protein expression results in a peptide being detected in the control sample of one replicate but not in the other, this protein would not be included in the subtracted list but may be of great interest to the investigator due to the 10-fold difference. Repeated sample analysis can be used to provide more confidence that the protein is truly not present in a cell population or that it is present at significantly less abundant quantities. To improve the reliability of our final data we only accepted subtractive differences that were consistent in multiple analyses.
Gene ontology classification of the proteins identified was used to characterize proteins in an automated fashion to improve data processing. In the present study most of the proteins were associated with at least one gene ontology term, but
10% were completely unknown. Furthermore the evidence for term assignments vary by protein, and because many proteins have unknown functions, the gene ontology database is incomplete. Studies to characterize the biological relevance of differences observed in these experiments are currently being performed.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published, MCP Papers in Press, July 21, 2005, DOI 10.1074/mcp.M500137-MCP200
1 The abbreviations used are: SILAC, stable isotope labeling with amino acids in cell culture; bis-Tris, 2-[bis(2-hydroxyethyl)amino]-2-(hydroxymethyl)propane-1,3-diol; AGC, automatic gain control; FP, false positive. ![]()
* This work was funded by National Institutes of Health Grants HG003456 (to S. P. G.) and TgT32DK07028 (to D. F.) and by the Iacocca Foundation. ![]()
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. ![]()
¶ To whom correspondence should be addressed. Tel.: 617-432-3155; Fax: 617-432-1144
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
T. Vaisar, S. Y. Kassim, I. G. Gomez, P. S. Green, S. Hargarten, P. J. Gough, W. C. Parks, C. L. Wilson, E. W. Raines, and J. W. Heinecke MMP-9 Sheds the {beta}2 Integrin Subunit (CD18) from Macrophages Mol. Cell. Proteomics, May 1, 2009; 8(5): 1044 - 1060. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Hu, M. Arellano, P. Boontheung, J. Wang, H. Zhou, J. Jiang, D. Elashoff, R. Wei, J. A. Loo, and D. T. Wong Salivary Proteomics for Oral Cancer Biomarker Discovery Clin. Cancer Res., October 1, 2008; 14(19): 6246 - 6252. [Abstract] [Full Text] [PDF] |
||||
![]() |
X. Hu, S. Eszterhas, N. Pallazzi, E. E. Bouhassira, J. Fields, O. Tanabe, S. A. Gerber, M. Bulger, J. D. Engel, M. Groudine, et al. Transcriptional interference among the murine {beta}-like globin genes Blood, March 1, 2007; 109(5): 2210 - 2216. [Abstract] [Full Text] [PDF] |
||||
![]() |
W.-J. Qian, J. M. Jacobs, T. Liu, D. G. Camp II, and R. D. Smith Advances and Challenges in Liquid Chromatography-Mass Spectrometry-based Proteomics Profiling for Clinical Applications Mol. Cell. Proteomics, October 1, 2006; 5(10): 1727 - 1744. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. J. Kauffman, B. A. Ogunnaike, and J. S. Edwards Designing experiments that aid in the identification of regulatory networks Brief Funct Genomic Proteomic, February 1, 2006; 4(4): 331 - 342. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Journal of Biological Chemistry |
| Journal of Lipid Research | ASBMB Today |