|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 7:486-498, 2008.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
To develop these tests, molecular markers that are predictive of cancer development need first to be identified within whole saliva. Many researchers have studied the soluble fraction of whole saliva as a source of these markers (4–8) with some of these studies using large scale mass spectrometry-based proteomics technologies to catalogue its protein components (9–14). Fewer researchers, however, have investigated using the cells found in whole human saliva for oral cancer diagnostics, and none have used proteomics methods to identify cellular protein markers for potential OSCC detection. Instead these studies have concentrated on detecting genetic or pathological changes associated with OSCC in these cells (15–17).
To catalogue proteins from complicated biological samples using mass spectrometry, peptide fractionation methods are commonly used. Complex proteolytic protein digests are divided into less complex subsets of peptide fractions from which lower abundance proteins can be detected by the mass spectrometer. Generally these methods combine multiple different chromatography or electrophoresis steps that fractionate peptide sequences using different physiochemical properties (18, 19). For historical reasons (20), the different fractionation steps used are commonly thought of as being "orthogonal" to each other, leading to the generally used description of these methods as "multidimensional" separations. However, because for the case of modern proteomics applications, spatial dimensionality no longer holds for the fractionation steps comprising these methods, these are more accurately termed "multistep" peptide fractionation methods. Two-step peptide fractionation using a first step of strong cation exchange (SCX) chromatography followed by a second step of microcapillary reverse-phase liquid chromatography (µLC) on line with ESI-MS/MS has become a standard for proteomics analysis (18, 21, 22). With increasing popularity, an alternative two-step method using preparative peptide IEF followed by µLC-ESI-MS/MS for sensitive analysis of complex protein mixtures is being used as described by our group (11, 23) and others (9, 24–29). This alternative gets valuable peptide pI information from the IEF fractionation that aids in the high confidence identification of peptide sequence matches determined by sequence database searching of the MS/MS data. One group has also demonstrated the effectiveness of a three-step, LC-based peptide fractionation method for increasing the number of proteins identified by MS/MS (30), although none have described a three-step method using preparative IEF as one of the steps.
Here we introduce a novel three-step peptide fractionation method and demonstrate its effectiveness for proteomics cataloguing of cells found in whole saliva from patients with diagnosed OSCC lesions. Furthermore we show that whole saliva contains reliable amounts of cells, mainly exfoliated from the oral epithelium, which provide adequate amounts of protein for proteomics studies using mass spectrometry. The steps of our three-step method are: 1) preparative IEF using free flow electrophoresis (FFE), 2) SCX chromatography, and 3) µLC on line with ESI-MS/MS. We show that our method effectively fractionates complex peptide mixtures and increases significantly the number of proteins identified compared with our earlier described two-step method while retaining the benefits of peptide pI for high confidence protein identification (25, 26, 31). We identified over 1000 human proteins (each matched by at least two peptides) from the cells in whole saliva from OSCC patients, with a number of these being low abundance proteins having a previously described association in oral cancer progression, and identified for the first time directly from whole saliva. We also identified proteins from over 30 different bacteria, many of which have not been described previously in whole human saliva and several of which have possible links to cancer. These results provide the first description of the potential for using cells in whole human saliva to identify protein markers of oral cancer progression and the effectiveness of our proteomics method for such studies.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
4 ml or more of total saliva from each subject. Following collection, the samples were immediately placed on ice and stored at –70 °C until further processing.
Measurement of Cell Numbers and Cellular Protein Concentrations in Whole Saliva—
1 ml of whole saliva from each of the four different subjects was separately centrifuged at 1500 x g at 4 °C for 10 min, and each was processed identically in parallel. After carefully discarding the supernatant, the insoluble cell pellet was washed twice using ice-cold PBS buffer (Invitrogen), and the cells were resuspended into 1 ml of ice-cold PBS. Next 100 µl was removed, and the cells were stained with trypan blue followed by counting using a light microscope (Nikon). The remaining 900 µl of solution containing cells was again pelleted by centrifugation, the supernatant was removed, and the cells were lysed using radioimmune precipitation assay buffer containing Triton X-100 (Boston BioProducts, Worcester, MA) and a protease inhibitor mixture (Roche Applied Science). The protein concentration was determined by the BCA protein assay (Pierce).
Cytological Characterization of Cells in Whole Saliva—
For these experiments, 500 µl of whole saliva from each of the four subject saliva samples was combined together, and the cell pellet was isolated by centrifugation as described above. Immunostaining of the cells was carried out following a previously described protocol (32). After washing the cell pellet with ice-cold PBS buffer, it was redissolved in 1 ml of PBA (0.1% BSA in 1x PBS buffer), and an equal volume of 0.2% Triton X-100 in PBS was added. The solution was incubated for 3 min on ice and then centrifuged at 500 x g for 10 min, and the supernatant was removed. The cells were redissolved with 1 ml of PBA and transferred to a 1.5-ml microcentrifuge tube with protection from light, and 5 µl of FITC-labeled anti-cytokeratin mouse monoclonal antibody was added (Abcam, Cambridge, MA) followed by gentle shaking for 30 min at room temperature. The cells were then centrifuged, washed with ice-cold PBS, redissolved in 1 ml PBS, and then stained with DAPI DNA stain (Fisher Scientific) by adding 3 µl of a 3 mg/ml DAPI stock solution in water with gentle shaking for 12 min at room temperature. The cells were again centrifuged, washed with ice-cold PBS, redissolved in 300 µl of PBS, and counted using a fluorescence microscope (BX60, Olympus, New York, NY).
Cellular Protein and Peptide Preparation—
For proteomics studies, new 2-ml aliquots of whole saliva from each of the four subjects were combined, the cell pellet was isolated by centrifugation, the cells were lysed as described above, and total protein was measured using the BCA method. From this mixture, 200 µg of total protein was exchanged into a buffer containing 50 mM Tris-HCl, 100 mM NaCl, and 1% SDS using an Amicon ultracentrifugal filter device (5-kDa molecular mass cutoff, Millipore, Billerica, MA). The sample was boiled briefly to denature proteins, and additional Tris/NaCl buffer was added to achieve a final buffer consisting of 50 mM Tris-HCl, 100 mM NaCl, 0.1% SDS, pH 7.5. Tris(2-carboxyethyl)phosphine reducing agent (Pierce) was added to reach a final concentration of 5 mM, 20 µg of modified trypsin (Promega, Madison, WI) was added, and the mixture was incubated overnight at 37 °C. The resulting peptides were desalted using a mixed mode cation exchange cartridge (Waters, Milford, MA) and concentrated by vacuum centrifugation. The peptides were labeled with the iTRAQ reagent (33) (Applied Biosystems) using the manufacturer's protocol. Although the objective of this study was not to gain quantitative information, these samples were labeled with the iTRAQ reagent to test the amenability of labeled peptides to our three-step peptide fractionation method for its potential use in future quantitative studies. After labeling, the peptides were desalted again by mixed mode cation exchange cartridge chromatography and dried by vacuum centrifugation.
FFE Fractionation—
The overall proteomics method used for peptide fractionation and protein identification is summarized in Fig. 2 and described under "Results." Here we provide the relevant experimental details that go with this figure and description. The peptides (200 µg) were redissolved in 250 µl of FFE buffer (pH
8.5) and fractionated using an FFE system (BD Biosciences) enabling preparative IEF of peptides and collection into a 96-deepwell microtiter plate as we have described previously (11, 23). Initially a 50-µl aliquot (
10% of total) was taken from each of the FFE fractions and processed using an Amicon Ultrafree-MC centrifugal filter device (5-kDa molecular mass cutoff, Millipore) to remove high molecular weight hydroxypropylmethylcellulose polymer as described previously (23). The flow-through from the centrifugal device, containing purified peptides, was dried by vacuum centrifugation and reconstituted in 30 µl of HPLC load buffer (0.1% formic acid, 2% acetonitrile in water). Each sample was analyzed by µLC-MS/MS analysis (as described below) to obtain an initial profile of the peptide distribution across the microtiter plate fractions and to identify those fractions containing the most complex peptide mixtures, necessitating their further fractionation by SCX as described below.
|
µLC-MS/MS Analysis—
All on-line µLC separations were done on an automated Paradigm MS4 system (Michrom Bioresources, Inc., Auburn, CA). Each processed FFE/SCX fraction was automatically loaded across a Paradigm Platinum Peptide Nanotrap (Michrom Bioresources, Inc.) precolumn (0.15 x 50 mm, 400-µl volume) for sample concentrating and desalting at a flow rate of 50 µl/min in HPLC buffer A (0.1% formic acid in a solution of 5% acetonitrile and 95% water). The in-line analytical capillary column (75 µm x 12 cm) was home-packed using C18 resin (5 µm, 200-Å Magic C18AG, Michrom Bioresources, Inc.) with the exception that the electrospray tip was made with a hand-held torch. Peptides were eluted using a linear gradient of 10–35% HPLC buffer B (0.1% formic acid in a solution of 95% acetonitrile and 5% water) over 60 min followed by isocratic elution at 80% buffer B for 5 min with a flow rate of 0.25 µl/min across the capillary column.
All mass spectrometry was done on an LTQ linear ion trap mass spectrometer (Thermo Fisher, Inc.) using Xcalibur version 2.0 operating software. Ionized peptides eluting from the capillary column were automatically selected for MS/MS using a data-dependent procedure that alternated between one MS scan followed by four MS/MS scans for the four most abundant precursor ions in the MS survey scan. Those m/z values selected for MS/MS were dynamically excluded for 30 s using a repeat count of 2. The electrospray voltage applied was 2.0 kV. MS and MS/MS spectra were acquired with a maximum fill time of 50 and 100 ms for MS and MS/MS analysis, respectively. MS spectra were acquired from a single microscan, whereas MS/MS spectra were acquired using two microscans. For MS scans, the m/z scan range was set from 400 to 1800 daltons.
Sequence Database Searching and Data Analysis—
To identify the human proteins from the cellular samples, obtained MS/MS spectra were searched using SEQUEST (34) (Bioworks version 3.2, Thermo Finnigan, San Jose, CA) against a non-redundant human protein sequence database from the European Bioinformatics Institute (ipi.HUMAN.v3.18.fasta, containing 62,000 entries). This database was chosen because it is manually curated to minimize protein redundancy due to identical protein sequences appearing under different accession codes. A reversed-sequence version of the same database was appended to the end of the forward version for the purpose of false positive rate estimation (22). Search parameters included static mass shift (+144.0 Da) for the N terminus and lysine due to modification with the iTRAQ reagent. Differential amino acid mass shifts for oxidized methionine (+16 Da) were also included. Precursor peptide mass tolerance was ±2.0 Da with no tryptic specificity. Fragment ion tolerance was set at 1.0. To each matched peptide sequence a predicted pI using the Shimura et al. (35) algorithm was automatically assigned using a script developed in house. The pKa values for the N terminus and lysine
amino group were set to a value of 8.6 to account for the addition of the N-methylpiperazine ring of the iTRAQ reagent (33). The search results were validated using the peptide validation program PeptideProphet (36), which assigns a comprehensive probability (p) score from 0 to 1 to each peptide sequence match based on its SEQUEST scores (Xcorr,
Cn, Sp, and RSp) and additional information, including mass difference between the precursor ion and the assigned peptide and the number of tryptic termini. The peptide sequence match results were organized and interpreted using the software tool Interact (37), allowing up to two missed cleavage for identified peptides.
For each FFE fraction, an average pI value was calculated from peptide matches with p
0.9. Our previous studies (11, 23) have shown that the average pI of the FFE fraction approximates the pH of the fraction and is useful for filtering peptide sequence matches. Using this average pI assigned to each FFE fraction, only peptide matches (regardless of assigned p score) were kept for further consideration if their predicted pI was within ±0.5 units, based on the FFE resolution (23), of the average pI value for the FFE fraction from which they were identified and if the peptide sequence was at least partially tryptic to maximize the high confidence matches (31). Those peptides passing this first phase of filtering were then further filtered using a two-step procedure similar to that we have described previously (11). In the first step, we took the entire dataset of filtered peptide matches and determined the lowest p score at which the estimated false positive rate was maintained at 1%, which was p = 0.75 (by comparison, without the initial pI filtering step the p score threshold needed to be 0.98 to maintain this low false positive rate). The second step filtered the peptide sequence matches first using a low stringency p score of 0.2 with the added stipulation that a protein would be accepted only if it was matched by two or more unique peptides. This filtering step also resulted in a false positive rate below 1% (again by comparison, without the initial pI filtering step, the p score using this method must be raised to 0.5 to maintain the same false positive rate). The protein matches resulting from both of these filtering steps were combined, and only proteins matched from at least two or more unique peptides were included in our final catalogue of proteins. The estimated false positive rate for our protein catalogue was 0.24%.
To identify proteins derived from bacteria within the cell pellet of whole saliva, the obtained MS/MS spectra were also searched against the updated, comprehensive UniProt/Swiss-Prot protein sequence database release 6.0 (38) containing 194,317 entries from over 10,000 different organisms as well as over 135,000 protein sequences from bacteria and viruses. A reversed-sequence version of the same database was appended to the end of the forward version to enable false positive rate estimation. Average FFE pI values (derived from the human peptide sequences identified in each fraction) and calculated peptide pI values were used to filter the resulting bacterial peptide sequence matches as described above for the human protein sequence database results, and the same two-step filtering method was used to obtain a final catalogue of proteins with an estimated false positive rate of 1%. Protein identifications matching to non-human organisms (i.e. bacteria and viruses) were manually extracted and further inspected. To ensure high confidence, bacteria were only considered to be present in the sample if peptides matched to two or more proteins derived from the bacteria were observed, or if only a single protein from the bacteria was identified, two or more unique peptides from the protein were matched. Additionally at least one of the peptide sequences identified from each of the bacteria had to be expressed only in that bacterium (i.e. not a redundant peptide sequence found in proteins known to be expressed in multiple bacteria types).
Immunoblotting Confirmation Experiments—
For these studies, the washed cell pellet was lysed by suspending the pellet in 2 volumes of 50 mM Tris-HCl (pH 6.8) with 2% (w/v) SDS, 5% (v/v) β-mercaptoethanol, and a protein inhibitor mixture (Roche Applied Science 13457200) and boiling for 10 min. After cooling, the solution was centrifuged at 16,000 x g for 15 min at 4 °C to remove cellular debris. Proteins in the supernatant were concentrated by acetone precipitation and redissolved in
200 µl of 50 mM Tris-HCl (pH 7.4) containing 2% SDS.
Western blots were performed for three proteins, whose genes are denoted in Table II as STAT3, PRDX3, and SCCA1. SDS-PAGE was performed on a Bio-Rad Tris-HCl minigel (10, 12, and 12%) with Tris-glycine-SDS buffer at 144 V. The proteins were transferred to a PVDF membrane for 60 min at 100 V in a cold room. The membrane was blocked with 1x PBS containing 2.5 mg/ml BSA and 0.25% (v/v) Tween 20 for 15, 80, and 120 min. This was followed by incubation with the primary antibody (mouse monoclonal (23G5) to STAT3 (phospho-Ser-727), Abcam ab24922, at 1:200 dilution for P40736; anti-AOP1, Sigma-Aldrich A7674, at 1:10,000 dilution for P30048; SCCA1 (8H11), Santa Cruz Biotechnology, Inc. sc-21767, at 1:400 dilution for P29508) in 1x PBS containing 2.5 mg/ml BSA and 0.25% (v/v) Tween 20. After three washes, the membranes were incubated with secondary antibody solution (horseradish peroxidase-conjugated goat anti-mouse IgG1, Immunology Consultants Laboratory GG1-90P) at 1:20,000, 1:10,000, and 1:30,000 dilution in 1x PBS containing 5 mg/ml BSA and 0.25% (v/v) Tween 20) for 60, 45, and 50 min. Finally the blot was visualized using the SuperSignal West Pico ECL Substrate (Thermo Scientific 34078) and autoradiography film.
|
| RESULTS |
|---|
|
|
|---|
|
|
To sensitively identify proteins from these highly complex cellular samples, we developed a novel, three-step peptide fractionation method prior to MS/MS analysis. This three-step method is a modification to our previously described, two-step peptide fractionation method (23) combing FFE and µLC wherein we have added the intervening step of SCX to more extensively fractionate the complex peptide mixtures with the objective of increasing the coverage of proteins and our sensitivity for low abundance proteins. Each of these steps are numbered and in bold text in the flowchart shown in Fig. 2 and shown in a simplified schematic on the right side of the figure. After a first step of preparative IEF fractionation using FFE, single or combined FFE fractions are each subjected to a second step of fractionation by SCX step elution chromatography, and each SCX fraction is analyzed by µLC-ESI-MS/MS using an LTQ linear ion trap mass spectrometer. Identical to our two-step fractionation method, peptide pI information added during the first stage of FFE fraction is used for filtering of peptide matches determined by sequence database searching to provide a high confidence catalogue of proteins (see description of this process below).
Given the novelty of our method, before proceeding to the large scale proteomics analysis of cells in whole saliva, we initially undertook studies to test its effectiveness. We first investigated the compatibility of coupling IEF peptide fractionation with a subsequent step of SCX chromatography. Because peptide fractionation using IEF and SCX have closely related physiochemical properties of peptide pI and solution phase charge, respectively, it was necessary to confirm that SCX effectively fractionated peptides of similar pI values contained in each FFE fraction. We selected an FFE fraction containing acidic peptide sequences (average predicted peptide pI,
4.6) and a fraction containing basic peptide sequences (average predicted peptide pI,
8.7) to test the compatibility of these different FFE fractions to SCX fractionation. Fig. 3, A and B, shows the results of each of these FFE fractions where each was separately loaded to the SCX cartridge and eluted by step gradient chromatography using increasing concentrations of salt at each elution step. Each elution fraction was then analyzed by µLC-ESI-MS/MS. We used a slightly higher salt concentration (100 mM) for the third SCX elution step for the basic peptide fraction compared with the acidic fraction (50 mM) because of the expectation that more of the basic peptides would have stronger affinity for the SCX column and elute at higher salt concentrations. For both the acidic and basic FFE fractions shown in Fig. 3, A and B, qualitative differences between the chromatograms of each different SCX fraction were observed, and the number of unique peptides identified in each fraction were evenly distributed, indicating resolution of distinct subsets of peptides with each SCX elution step.
|
We next applied our method to the remaining FFE fractions to obtain a catalogue of proteins from cells in whole saliva. Single FFE fractions from the acidic region of the pH gradient were subjected to SCX fractionation followed by µLC-ESI-MS/MS analysis. For basic pH fractions, adjacent pairs of FFE fractions were combined prior to SCX fractionation because these fractions are less abundant in peptides than the acidic fractions (23) and combining these fractions helped to minimize the number of µLC-MS/MS analyses. After subjecting the resulting MS/MS spectra to sequence database searching using SEQUEST (34), we used the pI information, determined in the first FFE fractionation step, to filter the peptide sequence matches and obtain high confidence protein matches. As shown in the bottom portion of Fig. 2, for each FFE fraction or pair of adjacent fractions, we calculated an average peptide pI value using only high confidence peptide sequence matches (p score of 0.90 or greater) for this calculation and the algorithm described by Shimura et al. (35) to predict the pI value for each sequence. Although a quantitative comparison was not the objective of this study, the peptide mixture was labeled with the iTRAQ reagent to test the amenability of the labeled peptides with our fractionation method for possible future quantitative studies. Therefore, we accounted for iTRAQ reagent labeling at the N terminus and lysine residues in our pI calculations (see "Experimental Procedures"). The calculated average pI values for each FFE fraction were compared with their measured pH, and these values were within 0.5 pH unit (data not shown), similar to our past descriptions of preparative peptide fractionation by FFE (23). Based upon this finding, only peptide sequence matches with predicted pI values within 0.5 unit of the average pI for the FFE fraction from which they were derived were accepted as we have done in our past studies using FFE (23). p score thresholds were then applied to these peptide sequence matches at a level such that the estimated false positive rate was 1%.
Our proteomics method generated a catalogue of over 1000 human proteins identified by two or more unique peptides (5884 total peptides were identified from these proteins). Supplemental Table 1 shows all of these proteins along with annotations and relevant information on the peptide sequence matches determined by SEQUEST. Although not displayed in Supplemental Table 1, we identified another 847 proteins from matches to single peptides (i.e. "single hits"). The use of pI filtering increases the confidence of these single hit proteins despite the limited sequence coverage, and the catalogue of these proteins is available upon request. The human proteins were identified by searching the MS/MS data against the non-redundant International Protein Index (IPI) human protein database (40), minimizing the potential for matching to identical protein sequences that may be listed under multiple accession numbers. Use of pI information in the filtering of peptide sequence matches (as described under "Experimental Methods") enabled a lowering of the p score threshold for peptide matches while still maintaining a low estimated false positive rate of 0.24%. Similar to our past descriptions using peptide pI information (11, 23), the decreased p score thresholds resulted in significantly more proteins identified with high confidence, identifying an additional 1027 peptide sequences and adding 144 proteins to our catalogue when considering only those proteins identified from two or more peptides. The additional peptide sequences identified as a consequence of the pI filtering process are shown in red text in Supplemental Table 1.
A number of identified human proteins play a role in cell signaling and tumorigenesis pathways of OSCC. We searched our catalogue for proteins with known association to OSCC whose identification may be a result of our analysis of samples derived from patients with OSCC lesions. Table II shows a selection of these proteins from the catalogue, all of which are known to be expressed in epithelial cells. Three of these proteins (STAT3, IKBKB, and LY6D) were identified from a single peptide sequence (matched to multiple MS/MS spectra), and Supplemental Fig. 1 shows annotated MS/MS spectra supporting these sequence matches. Proteins involved in OSCC signaling pathways include the transcription factor STAT3, which putatively plays an early role in the development of OSCC (41–43), and also several proteins known to regulate the activity of the transcription factor NF-
B, which has a well described role in early events of cancer onset including OSCC (43, 44). The proteins SCCA1 and SCCA2 are tumor markers of OSCC and may play a role in its progression (45). Also identified were numerous membrane-spanning proteins with either confirmed or putative roles in cell adhesion and growth mechanisms important to OSCC tumor growth.
We also independently confirmed the presence of some of the proteins shown in Table II in cells from whole saliva collected from three additional patients diagnosed with OSCC. These were separate samples from the four patient samples that were combined and used for our proteomics analysis. We selected the proteins STAT3, PRDX3, and SCCA1 from Table II for confirmation studies by immunoblotting. We detected each of these proteins in each of the three patient samples, further confirming the presence of these protein in cells from whole saliva and validating the results from our proteomics analysis. Results from our immunoblotting experiments are shown in Supplemental Fig. 2.
Because bacteria are known to be present within the oral environment (46) and play an important role in oral health and possibly even oral cancer development (47), we also interrogated our MS/MS dataset for the presence of proteins derived from bacterial sources. For this investigation, the entire MS/MS dataset was subjected to sequence database searching against the comprehensive UniProt protein sequence database (38), which contains proteins expressed in a variety of organisms including bacteria. As with the human proteins, the resulting peptide sequence matches, regardless of the organism from which they were derived, were filtered using peptide pI information and p score thresholds to obtain high confidence protein matches. Next those proteins identified from bacteria were extracted from the larger dataset for further analysis. The high confidence protein matches were grouped based upon the distinct bacteria in which they are expressed, and only bacteria with multiple peptide matches or matches to multiple different proteins were considered for acceptance. Of these, at least one of the peptide sequence matches also had to be specific to that bacterium (i.e. not a redundant peptide sequence found in proteins expressed in other organisms). Using these strict criteria, we identified proteins from 34 different bacteria. These bacteria are summarized in Table III. The organisms in bold text in Table III have not been reported previously in the literature as appearing in saliva.
|
| DISCUSSION |
|---|
|
|
|---|
3 ml or more) of saliva collected in the clinic using non-invasive methods. These amounts of protein provide more than enough starting material for large scale proteomics studies, which usually require 100 µg or more of total starting protein. Therefore, our finding that the cells from non-invasively collected whole saliva dependably provide adequate protein material for proteomics analysis demonstrates its potential in studies discovering proteins relevant to the diagnosis of oral cancer and other conditions. We also pursued cytological studies to characterize the makeup of the cells in whole saliva using immunostaining for cytokeratin expression, a well described marker for epithelial cells (32, 48). Because many researchers are interested in using whole saliva as a bodily fluid for the early detection and diagnosis of OSCC, which predominantly affects oral epithelial cells (49, 50), these experiments provided an assessment of the suitability of exfoliated cells present in non-invasively collected whole saliva for studies of OSCC progression. The results showing the presence of mainly intact, keratin-expressing cells in whole saliva (Fig. 1) support their potential as samples for diagnostic tests of diseases of the oral epithelium such as OSCC. Our findings are consistent with a previous study by members of our research team that also detected a high proportion of intact cells present in saliva with the majority of these being dead cells based upon trypan blue staining (15). Our observation of different cellular keratin staining patterns is consistent with other descriptions using an antibody with broad affinity for different types of keratin, which have also shown a heterogeneous mixture of staining patterns indicating epithelial cells at different stages of differentiation (48, 51, 52).
Our findings answer two questions critical to evaluating our novel three-step peptide fractionation method. First, is the peptide IEF step orthogonal enough to the SCX step to be effectively coupled together? Because the effective fractionation of peptide mixtures in a multistep method is dependent upon each step resolving peptides based upon different (i.e. orthogonal) physiochemical properties, the closely related properties of peptide pI (IEF) and solution phase charge (SCX) could potentially limit the effectiveness of coupling these two steps in our method. Particularly acidic peptides often contain fewer basic amino acids, which might limit their retention and chromatographic resolution by SCX due to its dependence on the affinity of positively charged basic amino acid side chains for the negatively charged SCX packing material. However, the qualitatively different mass chromatograms shown in Fig. 3 demonstrate that both acidic peptides (Fig. 3A) and basic peptides (Fig. 3B) are effectively resolved by a second SCX fractionation step. Furthermore the roughly equal number of unique peptides identified from each different SCX fraction demonstrates the resolution of the peptide mixtures by SCX. Additionally although iTRAQ reagent-labeled peptides were used in the described studies, we have observed similar results on complex mixtures of non-labeled peptides. Collectively these results provide a strong "yes" to the question of orthogonality between the IEF and SCX steps.
Second, does the inclusion of a third peptide fractionation step significantly increase peptide sequence matches relative to a two-step method combining preparative IEF and µLC? Fig. 3C clearly answers this question, showing a greater than 6-fold increase in peptide sequence matches using the three-step fractionation method compared with a two-step fractionation on the same FFE peptide fraction. Our three-step fractionation method should therefore prove an effective and general method for large scale cataloguing of complex protein mixtures. Given the increasing use of preparative IEF in proteomics studies using a variety of different instrumental platforms (9, 11, 23–29), our findings should be useful for researchers wishing to further fractionate complex peptide mixtures post-IEF.
One slight drawback to our three-step fractionation method is decreased throughput due to multiple sample handling steps and the generation of the large number of fractions generated by the first two fractionation steps. From each FFE fraction (20 total) we generated four SCX fractions, resulting in 80 fractions needing to be analyzed by µLC-MS/MS. Using automated sample loading, µLC-MS/MS analysis of this set of fractions took over 4 days of instrument time. Although the instrument time needed to analyze such a large number of fractions cannot be easily decreased, automation of the peptide fractionation steps could decrease time spent on manual sample processing and help to increase throughput. Notably the second and third steps of our method (SCX and µLC-MS/MS) could be combined in an automated platform, like the popular multidimensional protein identification technology (MudPIT) method (18, 21), increasing the throughput of our method.
Our method retains peptide pI information, enabling the identification of more proteins with high confidence. Preparative peptide IEF using FFE adds information on the pI of fractionated peptides shown by our group (31) and others (24–26) to provide a valuable constraint for high confidence peptide sequence matching from MS/MS data. Because preparative IEF is the first step in our three-step method, peptide pI information is retained for all subsequent fractionation steps and can still be used for filtering the peptide sequence matches after sequence database searching. As shown at the bottom of Fig. 2, for each FFE fraction, after SCX fractionation and µLC-MS/MS analysis, an average peptide pI, adjusted for the presence of the iTRAQ reagent label, was calculated for the FFE fraction based upon high confidence peptide matches (p score >0.9) from all of the SCX fractions, and this average pI value was used to further filter the peptide matches. Our results showed that the average peptide pI calculated for these iTRAQ reagent-labeled peptides approximates the pH of the FFE fraction with accuracy similar to our previous descriptions of peptide fractionation by FFE (11, 23). Consequently only considering peptide sequence matches that have predicted pI values within 0.5 pH units of the average pI value to be correct increases the confidence of these matches. Others have similarly demonstrated the utility of using average peptide pI values to filter peptide sequence matches (24–26). This filtering enables the lowering of the p score threshold while still maintaining a low false positive rate (
1%) as estimated by reverse database searching. This decrease in the p score threshold increased the number of peptides identified by over 1000 and increased the size of our protein catalogue by almost 14%, further supporting the value of preparative IEF fractionation of peptides for MS/MS-based proteomics studies. The overall estimated false positive rate for our protein catalogue was 0.24%.
Among the proteins in our catalogue, the identification of proteins involved in cellular pathways linked to OSCC has 2-fold importance. First, it demonstrates for the first time that these epithelial cellular proteins can be detected in the cells contained in non-invasively collected whole saliva using proteomics methods or otherwise. All work in the literature describing proteins and biochemical pathways associated with OSCC development have been identified either from studies on model cell lines or from invasively collected patient tissue biopsies. For example, the role of the transcription factor STAT3, shown in Table II, has been characterized from a head and neck cancer cell line (43) and oral tissue biopsies (42); likewise the tumor inhibitor role of the SCCA1 and SCCA2 proteins was characterized in a model head and neck cancer cell line (45). The presence of both of these proteins in the cells from whole saliva was also confirmed by immunoblotting experiments in independent patient samples (see Supplemental Fig. 2). Our detection of these proteins with ties to OSCC directly from cells in whole saliva bolsters its potential as a sample for clinical diagnostics of oral cancer.
Second, the identification of proteins from signaling and tumorigenesis pathways associated with OSCC demonstrates the ability of our proteomics method for discovering proteins with potential diagnostic value in oral cancer detection directly from the cells in whole saliva. In particular, the identification of proteins involved in transcriptional control, such as STAT3, and kinases involved in signal transduction, such as RIPK2 and IKBKB, indicate the ability of our proteomics method to detect relatively low abundance proteins with roles in OSCC development. The identification of numerous transmembrane-bound proteins with roles in cell adhesion and proliferation and ties to OSCC development demonstrates the ability of our method to detect this class of hydrophobic proteins, which challenge other proteomics methods such as those based upon two-dimensional gel electrophoresis (53). Although the identification of these proteins of potential diagnostic importance directly from cells in whole saliva is a significant first step, quantitative studies will be necessary to confirm their diagnostic power and, importantly, discover new protein markers of OSCC. Our demonstration here of the effectiveness of our proteomics method provides a foundation for these quantitative proteomics studies of the cells in whole saliva from patients at different stages of OSCC progression.
We also identified proteins derived from dozens of interesting bacteria within the cell pellet of whole saliva. Many of the bacteria have not been detected previously in whole saliva as determined by a search of the literature, which included results from a recent study of the soluble fraction of whole saliva using proteomics methods (9). Interestingly several of the bacteria detected have known associations with cancer, and their detection may be a consequence of our analysis of OSCC patient samples in this study, consistent with another recent study suggesting a role for bacteria in diagnosing OSCC (47). For example, Eikenella corrodens has been found in elevated levels of infection in head and neck cancer patients (54), and Bacteroides fragilis and Pasteurella multocida both produce toxins that are thought to promote cell proliferation in cancer (55, 56). Although further study is necessary to elucidate the role, if any, of these bacteria in OSCC development, their detection demonstrates the potential of our proteomics method for detecting bacteria within the oral environment and investigating their possible role in cancer development using quantitative proteomics methods.
| CONCLUSIONS |
|---|
|
|
|---|
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published, MCP Papers in Press, November 28, 2007, DOI 10.1074/mcp.M700146-MCP200
1 The abbreviations used are: OSCC, oral squamous cell carcinoma; FFE, free flow electrophoresis; µLC, microcapillary LC; SCX, strong cation exchange; DAPI, 4',6-diamidino-2-phenylindole; iTRAQ, isobaric tags for relative and absolute quantitation. ![]()
* This work was supported by NIDCR, National Institutes of Health Grant 1 R01 DE17734 and in part by grants from the Minnesota Medical Foundation, American Cancer Society Institutional Research Grant IRG-58-001-46, and a research award from Eli Lilly and Co. (to T. J. G.). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. ![]()
** To whom correspondence should be addressed: University of Minnesota, 321 Church St. SE, 6-155 Jackson Hall, Minneapolis, MN 55455. Phone: 612-624-5249; Fax: 612-624-0432; E-mail: tgriffin{at}umn.edu
| REFERENCES |
|---|
|
|
|---|
B associated cytokines: TNF-
, IL-1
, IL-6, and IL-8 in whole saliva for the malignant transformation of oral lichen planus.
Mol. Carcinog.
44, 77
–82[CrossRef][Medline]
B dependent cytokine levels in saliva of patients with oral preneoplastic lesions and oral squamous cell carcinoma.
Cancer Detect. Prev.
29, 42
–45[CrossRef][Medline]
B and STAT3 signaling pathways in head and neck squamous cell carcinoma.
Neoplasia
8, 733
–746[CrossRef][Medline]
B and gelatinases in the tumorigenesis of oral squamous cell carcinoma.
Head Neck
28, 916
–925[CrossRef][Medline]
B and inhibitor of
B kinase pathways in oncogenic initiation and progression.
Oncogene
25, 6817
–6830[CrossRef][Medline]
B kinase activity is responsible for the augmented activity of NF-
B in human head and neck carcinoma cells.
Cancer Lett.
171, 165
–172[CrossRef][Medline]
B activation.
J. Biol. Chem.
272, 30952
–30961This article has been cited by other articles:
![]() |
T. B.M. Schaaij-Visser, A. P. Graveland, S. Gauci, B. J.M. Braakhuis, M. Buijze, A. J.R. Heck, D. J. Kuik, E. Bloemena, C. R. Leemans, M. Slijper, et al. Differential Proteomics Identifies Protein Biomarkers That Predict Local Relapse of Head and Neck Squamous Cell Carcinomas Clin. Cancer Res., December 15, 2009; 15(24): 7666 - 7675. [Abstract] [Full Text] [PDF] |
||||
![]() |
L.-M. Chi, C.-W. Lee, K.-P. Chang, S.-P. Hao, H.-M. Lee, Y. Liang, C. Hsueh, C.-J. Yu, I-N. Lee, Y.-J. Chang, et al. Enhanced Interferon Signaling Pathway in Oral Cancer Revealed by Quantitative Proteome Analysis of Microdissected Specimens Using 16O/18O Labeling and Integrated Two-dimensional LC-ESI-MALDI Tandem MS Mol. Cell. Proteomics, July 1, 2009; 8(7): 1453 - 1474. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Hu, M. Arellano, P. Boontheung, J. Wang, H. Zhou, J. Jiang, D. Elashoff, R. Wei, J. A. Loo, and D. T. Wong Salivary Proteomics for Oral Cancer Biomarker Discovery Clin. Cancer Res., October 1, 2008; 14(19): 6246 - 6252. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Journal of Biological Chemistry |
| Journal of Lipid Research | ASBMB Today |