|
Advertisement | ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 8:1278-1294, 2009.
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
5,500 pooled tumor cells (corresponding to
550 ng of protein lysate/analysis) obtained through laser capture microdissection (LCM) from two independently processed data sets (n = 24 and n = 27) containing both tamoxifen therapy-sensitive and therapy-resistant tumors. Peptides and proteins were identified by matching mass and elution time of newly acquired LC-MS features to information in previously generated accurate mass and time tag reference databases. A total of 17,263 unique peptides were identified that corresponded to 2,556 non-redundant proteins identified with
2 peptides. 1,713 overlapping proteins between the two data sets were used for further analysis. Comparative proteome analysis revealed 100 putatively differentially abundant proteins between tamoxifen-sensitive and tamoxifen-resistant tumors. The presence and relative abundance for 47 differentially abundant proteins were verified by targeted nano-LC-MS/MS in a selection of unpooled, non-microdissected discovery set tumor tissue extracts. ENPP1, EIF3E, and GNB4 were significantly associated with progression-free survival upon tamoxifen treatment for recurrent disease. Differential abundance of our top discriminating protein, extracellular matrix metalloproteinase inducer, was validated by tissue microarray in an independent patient cohort (n = 156). Extracellular matrix metalloproteinase inducer levels were higher in therapy-resistant tumors and significantly associated with an earlier tumor progression following first line tamoxifen treatment (hazard ratio, 1.87; 95% confidence interval, 1.25–2.80; p = 0.002). In summary, comparative proteomics performed on laser capture microdissection-derived breast tumor cells using nano-LC-FTICR MS technology revealed a set of putative biomarkers associated with tamoxifen therapy resistance in recurrent breast cancer.
, which is expressed in
70% of all primary breast tumors and is known to be important in the development and course of the disease. When diagnosed at an early stage, adjuvant systemic tamoxifen therapy can cure
10% of the patients (1). In recurrent disease,
50% of patients have no benefit from tamoxifen (intrinsic resistance). From the other half of patients who initially respond to therapy with an objective response (OR)1 or no change (NC), a majority eventually develop progressive disease (PD) due to acquired tamoxifen resistance (2, 3). With the markers available to date we can insufficiently predict therapy response. Therefore, identification of new biomarkers that can more effectively predict response to treatment and that can potentially function as drug targets is a major focus of research. The search for new biomarkers has been enhanced by the introduction of microarray technology. Gene expression studies have resulted in a whole spectrum of profiles for e.g. molecular subtypes, prognosis, and therapy prediction in breast cancer (4–10). Corresponding studies at the protein level are lagging behind because of immature technology. However, protein-level information is crucial for the functional understanding and the ultimate translation of molecular knowledge into clinical practice, and proteomics technologies continue to progress at a rapid pace.
Proteomics studies reported so far have mainly been performed with breast cancer cell lines using either two-dimensional gel electrophoresis (11–14) or LC-MS for protein separation (15–17). However, it is known that the proteomic makeup of a cultured cell is rather different from that of a tumor cell surrounded by its native microenvironment (18). Furthermore cell lines lack the required follow-up information for answering important clinical questions. In addition, tumor tissues in general and breast cancer tissues in particular are very heterogeneous in the sense that they harbor many different cell types, such as stroma, normal epithelium, and tumor cells. LCM technology has emerged as an ideal tool for selectively extracting cells of interest from their natural environment (19) and has therefore been an important step forward in the context of genomics and proteomics cancer biomarker discovery research. LCM-derived breast cancer tumor cells have been used for comparative proteomics analyses in the past using both two-dimensional gel electrophoresis (20, 21) and LC-MS (22). This has resulted in the identification of proteins involved in breast cancer prognosis (21) and metastasis (20, 22). Although these studies demonstrated that proteomics technology has advanced to the level where it can contribute to biomarker discovery, major drawbacks, such as large sample requirements (42–700 µg) and low proteome coverage (50–76 proteins), for small amounts of starting material (
1 µg) persist. Because clinical samples are often available in limited quantities, in-depth analysis of minute amounts of material (<1 µg) necessitates advanced technologies with sufficient sensitivity and depth of coverage.
Recently we demonstrated the applicability of nano-LC-FTICR MS in combination with the accurate mass and time (AMT) tag approach for proteomics characterization of
3,000 LCM-derived breast cancer cells (23). This study showed that proteome coverage was improved compared with conventional techniques. The AMT tag approach initially utilizes conventional LC-MS/MS measurements to establish a reference database of AMT tags specific for a particular proteome sample (e.g. breast cancer tissue). Each tag consists of a theoretical mass calculated from the peptide sequence, an LC normalized elution time (NET) value, and an indicator of quality. The AMT tag database serves as a "lookup table" for identifying peptides in subsequent quantitative LC-MS analyses. Substituting routine LC-MS/MS analyses (shotgun approach) with LC-FTICR MS analyses (AMT tag approach) significantly increases overall throughput and sensitivity while reducing sample requirements. Additionally quantitative intensity information related to the abundance of the protein can be discerned from these MS analyses (24). In the present study, we used the same strategy to analyze eight pools of tumor cells in duplicate or triplicate (resulting in 19 samples) derived from 51 fresh frozen primary invasive breast carcinomas that appeared to be either sensitive or resistant to tamoxifen treatment after recurrence. This work resulted in the identification of a putative protein profile associated with tamoxifen therapy resistance. In addition, the top discriminating protein of the putative profile, extracellular matrix metalloproteinase inducer (EMMPRIN), was validated in an independent patient cohort and was significantly associated with resistance to tamoxifen therapy and shorter time to progression upon tamoxifen treatment in recurrent breast cancer.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
expression as assessed by ligand binding assay or enzyme-linked immunosorbent assay (
10 fmol/mg of cytosolic protein). Tumor tissues were divided into two classes based on the type of response to tamoxifen therapy. 24 tumors were sensitive to tamoxifen therapy, showing either complete remission (CR) or partial remission (PR), and were assigned as OR. 27 tumors were resistant to therapy, showing an increase in tumor size, and were designated as PD. Clinical response was defined by standards of the International Union against Cancer criteria of tumor response (25). 20 of the above mentioned tumor tissues were selected for the verification study. Tissues were included based on their high tumor cell content of >70%. Tumor cell content was judged after hematoxylin/eosin stain of a separately cut 4-µm tissue section.
For immunohistochemical validation, a primary breast tissue microarray (TMA) containing 0.6-µm cores of formalin-fixed paraffin-embedded tumors was used. Within the TMA, there were 156 tumor tissues from patients that received tamoxifen as first line treatment upon recurrence. Median follow-up of patients alive after primary surgery was 103 months (range, 16–222 months) and 51 months after the onset of tamoxifen treatment (range, 9–136 months). Included patients showed CR, PR, PD, and NC of >6 and
6 months. Further patient and tumor characteristics are summarized in Table IV.
|
Laser Capture Microdissection
LCM was performed on 8-µm tissue cryosections that were fixed in ice-cold 70% ethanol and stained with hematoxylin as described previously (27). Briefly slides were washed in Milli-Q water, stained for 30 s in hematoxylin, washed again in Milli-Q water, subsequently dehydrated twice in 50, 70, 95, and 100% ethanol for 30 s each, and air-dried. Laser microdissection and pressure catapulting was performed directly after staining. Tumor epithelial cells were collected, using a P.A.L.M. LCM device, type P-MB (P.A.L.M. Microlaser Technologies AG, Bernried, Germany). From each cryosection an area of
500,000 µm2 that corresponds to
4,000 cells (area x slide thickness/1,000-µm3 cell volume) was collected in P.A.L.M. tube caps containing 10 µl of 0.1% RapiGest (Waters Corp., Milford, MA) and then spun down into 0.5-ml Eppendorf Protein LoBind tubes (Eppendorf, Hamburg, Germany). Collected cells were stored at –80 °C until further processing. Because we used small numbers of microdissected cells in this study, the protein concentration was typically below the detection limit of any protein assay. Hence the protein concentration for samples undergoing LC-MS analysis was estimated based on microdissected tissue area and extrapolations from protein assays performed on whole tissue lysates (i.e.
4,000 cells corresponds to
400 ng of total protein).
Sample Preparation
Microdissected cell batches were pooled into OR and PD tumor groups (corresponding to
25,000 cells/pool) prior to sample preparation. Briefly cells were lysed by sonication directly in RapiGest solution using an Ultrasonic Disruptor Sonifier II (Model W-250/W-450, Branson Ultrasonics, Danbury, CT) for 1 min at 60% amplitude. Proteins were subsequently equilibrated for 2 min at 37 °C, denatured at 99 °C for 5 min, and processed for overnight trypsin digestion according to the instructions of the manufacturer using MS-grade porcine modified trypsin gold (Promega, Madison, WI) at a 1:20 (w/v) ratio as described previously (23). Digestion was stopped by incubation with 0.5% TFA at 37 °C for 30 min. Remaining cellular debris were spun down for 20 min at 10,600 x g, and supernatant was transferred to a new Eppendorf LoBind cup. Peptides were lyophilized and stored at –80 °C until further analysis. Prior to FTICR MS analysis, samples were reconstituted in 18 µl of NH4HCO3, vortexed briefly, and spun down again for 10 min at 10,600 x g to pellet any contaminating particulate material.
For the verification study, whole tissue lysates were prepared from 20 tumor tissues from which 6 x 4-µm cryosections per sample were cut. Tissue cryosections were placed in a Teflon container, frozen in liquid N2, and then pulverized in a frozen state in a microdismembrator (Braun Biotech International). The resulting powder was resuspended in 100 µl of 0.1% RapiGest. Cell lysis and trypsin digestion were performed as described above. Prior to trypsin digestion, a BCA protein assay (Pierce) was performed to determine protein concentration. From each total tissue sample, 50 µg of protein lysate was used for trypsin digestion at a trypsin:protein ratio of 1:50 (w/w) and further handled as described above.
Nano-LC-FTICR MS
Nano-LC-FTICR MS was performed using a slightly modified procedure as described previously (23, 28). Each pooled sample was analyzed in triplicate by injecting 4 µl (equivalent to
5,500 cells or
550 ng) directly via a 3-µl sample loop onto a custom-built reversed-phase (RP) 80-cm x 50-µm-inner diameter fused silica capillary column (Polymicro Technologies, Phoenix, AZ) packed in house with 3-µm C18 particles (300-Å pore size; Jupiter, Phenomenex, Torrence, CA) and subjected to an applied pressure of 10,000 p.s.i. through a high pressure syringe pump (ISCO, Lincoln, NE). Flow rate over the column was
250 nl/min. After an injection period of 45 min, peptides were eluted from the column using a gradient from 100% mobile phase A (99.75% H2O, 0.2% acetic acid, 0.05% TFA) to
70% mobile phase B (90% acetonitrile, 9.9% H2O, 0.1% TFA) over a
200-min period. The nano-LC column outlet was coupled on line to a 7-tesla FTICR mass spectrometer through a nano-ESI emitter; 4,000 mass spectra were acquired in each LC-MS analysis using 0.3-s ion accumulation time and 50-µs gas pulse (29).
LC-MS/MS
In the verification study, tryptic digests of 20 different whole tissue lysates (8 OR and 12 PD) were analyzed on a custom-built RPLC system via ESI utilizing an ion funnel (30) coupled to a ThermoFisher Scientific LTQ-Orbitrap mass spectrometer (Thermo Fisher Scientific, San Jose, CA). Separation was performed using a custom-made column (60 cm x 75-µm inner diameter) packed in house with Jupiter particles (C18 stationary phase, 5-µm particles, 300-Å pore size). The capillary RPLC system used for peptide separations has been described previously (23, 28). Mobile phase A consisted of 0.1% formic acid in water, and mobile phase B consisted of 100% acetonitrile. The column was equilibrated at 10,000 p.s.i. with 100% mobile phase A. A mobile phase selection valve was switched 50 min after injection to create a near exponential gradient as mobile phase B displaced mobile phase A in a 2.5-ml mixer. A split was used to provide an initial flow rate through the column of
400 nl/min. The column was coupled to the mass spectrometer using an in-house manufactured ESI interface with homemade 20-µm-inner diameter chemically etched emitters (31). The heated capillary temperature and spray voltage were 200 °C and 2.2 kV, respectively. Mass spectra were acquired for 80 min over the m/z range 400–2,000 at a resolving power of 100,000. An inclusion list with m/z values corresponding to peptide masses of 100 target proteins was used to select precursor ions. In cases when no targeted precursor ion was present, a maximum of six data-dependant LTQ tandem mass spectra were recorded for the most intense peaks in each survey mass spectrum.
Protein Identification and Quantitation
FT mass spectra, acquired with the 7-tesla FTICR or LTQ-Orbitrap, were processed using ICR-2LS, Decon2LS (32), and VIPER v3.39 software developed in house (33). The output data files were visualized as two-dimensional displays of peptide monoisotopic mass versus LC elution time (i.e. spectrum number). Next MS peaks with similar measured neutral masses and LC elution times were clustered to form LC-MS features (or unique mass classes). LC elution times were converted into NET to make multiple LC-MS runs comparable (34). The assembled set of LC-MS features was then searched against the human mammary epithelial cell line AMT tag database (35), MCF-7 epithelial breast carcinoma cell line AMT tag database (36), and a composite database for a mixture of human mammary epithelial cells and MCF-7-c18, BT-474, MDA-231, and SKBR-3 breast cancer cell lines (37) using stringent filtering criteria: Xcorr
1.5, 2.7, and 3.3 for 1+, 2+, and 3+ fully tryptic peptides, respectively, and Xcorr
3.0, 3.7, and 4.5 for 1+, 2+, and 3+ partially tryptic peptides (with a minimum length of 6 amino acids), respectively, as reported previously (23). The LCMSWARP (liquid chromatography-based mass spectrometric warping and alignment of retention times of peptides) algorithm (38) was used to match LC-MS features to AMT tags. A tolerance window of mass measurement accuracy <6 ppm and NET error <0.025 was applied to ensure reliable peptide identification with false discovery rate of
10%. Identified peptides were coupled to their corresponding proteins using the human International Protein Index (IPI) databases, 2006 version 3.20 including 61,255 protein entries (discovery phase) and 2008 version 3.39 including 69,731 protein entries (verification phase), and in-house built Qrollup v2.2 software. Two or more constituent peptides were required to confidently identify a protein. In the case of proteins with multiple splice isoforms, these isoforms were only specifically listed if they were identified by at least one unique peptide (in addition to overlapping peptide sequences). For average abundance calculation, only highly abundant and, where possible, unique peptides were used. Protein names and descriptions were then converted to TrEMBL, NCBI (National Center for Biotechnology Information), and Swiss-Prot database formats. Protein information was retrieved from European Molecular Biology Laboratory-European Bioinformatics Institute databases. Proteins identified from all available AMT tag databases were assembled into a single list, giving rise to some redundancy. A final non-redundant protein list was generated using ProteinProphet software (SourceForge, Inc.). MS peak intensities were used as a measure of the relative peptide abundances. The mean abundance of the LC-MS features was used, and the relative abundances of constituent peptides were averaged to derive the relative abundance of the parent protein.
Tandem mass spectra acquired with the LTQ-Orbitrap were searched against the human IPI 2008 database using TurboSEQUEST v27. We used in-house developed DeconMSn software to correct the monoisotopic masses prior to generation of the dta files used for subsequent database search. Peptide sequences were considered confident with the following filtering criteria: Xcorr of 1.9, 2.2, and 3.75 for 1+, 2+, and
3+ peptides and
Cn
0.1. We also applied the AMT tag strategy to identify peptides in survey mass spectra acquired with the LTQ-Orbitrap by matching the accurate masses and elution times against the composite breast cancer cell line AMT database. Peak intensities measured in high resolution survey spectra were used to retrieve relative abundance information as described above.
Immunohistochemistry
Immunohistochemical validation was performed with an in-house prepared TMA. The TMA was established in close collaboration with a dedicated pathologist (M. A. d. B.) who evaluated all tissues for histology, grade, and Bloom and Richardson scoring (39). Tissue sections of 4 µm were stained overnight at 4 °C for EMMPRIN using a 1:100 diluted antibody directed against the C terminus of the protein (8D6, sc-21746, Santa Cruz Biotechnology, Inc., Santa Cruz, CA). Antigen retrieval was performed prior to antibody incubation for 40 min at 95 °C using DAKO retrieval solution, pH 6 (DakoCytomation, Carpinteria, CA) after which the slides were cooled down to room temperature. Staining was visualized using the anti-mouse EnVision+® System-HRP (DAB) (DakoCytomation) according to the instructions provided by the manufacturer. Scoring of immunostaining was performed by two independent observers who recorded both percentage of positive tumor cells and staining intensity.
Data Analysis and Statistics
Relative abundance levels of all identified proteins in one sample were intra- and intersample normalized by log2 transformation using in-house developed MultiAlign software v1.1. Subsequently Z-score normalization was applied to each protein across the samples using the formula (value – mean)/standard deviation. Sample sets 1 and 2 were separately Z-score-normalized to correct for time and experimental variation. Normalized values were subjected to class comparison and prediction analysis using BRB-ArrayTools version 3.5.0 beta1 developed by Dr. Richard Simon and Amy Peng Lam. Class comparison involved finding differentially abundant proteins between therapy-sensitive (OR) and therapy-resistant (PD) tumors using a univariate two-sample t test with a significance threshold of 0.05. All data from sample sets 1 and 2 were combined to create a general list of differentially abundant proteins between OR and PD tumors and subjected to a Mann-Whitney Wilcoxon rank sum test performed with the STATA statistical package, release 10.0 (STATA, College Station, TX).
Hierarchical clustering of the data was performed using the OmniViz Desktop 3.8.0 package. For clustering, average linkage and the Euclidian similarity metric were used. Principal component analysis (PCA) was performed using Spotfire DecisionSite 8.1, version 14.3.
Kaplan-Meier survival analysis as a function of time to progression after the onset of first line tamoxifen treatment as well as correlation with response and other clinical parameters was performed using STATA. The primary end point for the Cox proportional hazard model was disease progression after the onset of tamoxifen treatment.
| RESULTS |
|---|
|
|
|---|
|
550 ng of protein lysate were analyzed using nano-LC-FTICR MS. Resulting data sets were visualized in a form of a two-dimensional plot, displaying monoisotopic mass versus spectrum number (NET) as shown in supplemental Fig. 1. On average
40,000 LC-MS features were detected in each analysis. These features were matched against previously established breast (cancer) cell line AMT tag databases. On average,
20% of LC-MS features matched with peptides in the database and were thus identified as illustrated in supplemental Fig. 1B. For this study, two sample sets were independently prepared and analyzed, using a different set of tumors, as shown in Fig. 2. Sample set 1 consisted of 24 tumors of which 11 were sensitive (OR) and 13 were resistant (PD) to tamoxifen treatment. Sample set 2 contained 27 tumors, 13 OR and 14 PD tissues. Microdissected cells were pooled to average sample heterogeneity and to enable triplicate analysis and were analyzed by nano-LC-FTICR MS. Replicate MS analyses, for which technical problems such as clogged tips were observed, were excluded from further data analysis, leaving 19 LC-MS data sets for further analysis (Table I). In total, 17,263 peptides corresponding to 2,556 proteins were identified through AMT tag database matching. Between the two sample sets 1,713 proteins, identified by 13,729 peptides, were identical, corresponding to an overlap of 67% (Table I). Protein abundance was computed by averaging intensities of the highly abundant peptides identified for the given protein and, where possible, using unique peptide sequences to account for multiple splice isoforms. It needs to be mentioned that it is difficult to correctly assess average protein abundance of highly homologous proteins that may have different abundance levels if these proteins are identified through identical peptides. In those cases, the additional use of unique peptide sequences may partly overcome this problem. Information on protein identification, such as filtering scores, assigned peptides and number of peptides used for abundance, mass and NET errors, and additional information is reported in supplemental Table S1. Normalized protein abundances for 1,713 proteins are displayed in supplemental Table S2.
|
|
|
|
|
Similar results were obtained by PCA (supplemental Fig. 2). In the PCA complex information is reduced to three principal components, represented by the x, y, and z axes. Samples are visualized in a three-dimensional plot and cluster according to their relative protein abundance. From this PCA it is clear that, in this sample set, OR (green squares) and PD samples (red squares) were completely separated from each other based on their protein abundance profile.
To verify that individual peptides showed differential abundance similar to that of their corresponding proteins, we performed hierarchical clustering on all peptides corresponding to the putative 100-protein profile. As expected, clustering based on peptides resembled the results of protein clustering (data not shown).
Verification of Differential Protein Abundance
Our next goal was to verify the presence and abundance level of all profile proteins in separate tumor samples. Because we used pooled microdissected tumor cells for the discovery study, information on the single tumor level as well as the relation with clinical factors was lost. To verify our putative profile proteins, we performed targeted LC-MS/MS analyses using an inclusion list (supplemental Table S4) compiled from the m/z values of the peptides that corresponded to the 100 putative profile proteins. We prepared whole tissue protein lysates from tumors (eight OR and 12 PD) with a high tumor cell content (>70%) so that microdissection could be omitted. Using this approach, we identified and therefore verified the presence of 50 proteins from the inclusion list. In addition, peak intensities of survey mass spectra (on average
14,000 LC-MS features per sample) were used for quantitation. In this case, peptide identity was derived by matching LC-MS features from survey spectra to the composite breast cancer cell line AMT tag database. This resulted in the identification and quantitation of 47 target proteins of which 42 were also identified by MS/MS sequencing (Fig. 5). Overall a total of 55 proteins (50 by MS/MS sequencing and five additional by LC-MS feature (survey mass spectra) matching with the AMT database of the 100-putative protein list) were verified in an independent targeted LC-MS/MS experiment. The 47 proteins for which relative abundance was available were used in further analyses. Surprisingly the top discriminating protein in the original profile, EMMPRIN, was not identified through this targeted approach. Raw MS/MS data obtained for verified proteins and relative abundance ratios for verified proteins are listed in supplemental Tables S5 and S6, respectively.
|
|
|
To independently validate differential EMMPRIN protein abundance between OR and PD patients, IHC was performed using our primary breast cancer TMA. Among the different tissues, there were 156 breast tumors of patients who received first line tamoxifen therapy after recurrence. This set of tumors had no overlap with the discovery set tumors. In total, 130 tumors showed reproducible IHC staining on the TMA when assays were performed in triplicate. Patient and tumor characteristics are described in Table IV. Different staining outcomes were categorized as undetectable, weak, medium, and strong membrane staining. Weak membrane staining, present in <10% of tumor cells, was scored as 1+. Medium membrane staining, present in 10–50% of tumor cells, was scored as 2+. Strong membrane staining, observed in >50% of tumor cells, was assigned score 3+ (Fig. 7). These scoring outcomes were subsequently related to clinical endpoints. We observed that none of the CR tumors displayed EMMPRIN staining, whereas highest EMMPRIN staining (3+) was observed in PD tumors (Table V). This finding, originally indicated using LC-MS-based technology, was thus confirmed by IHC. For comparison, we defined a "clinical benefit" group composed of tumors showing NC for >6 months, CR, and PR and a "no clinical benefit" group representing NC for
6 months and PD tumors. Absence of detectable EMMPRIN levels showed a significant clinical benefit with an odds ratio of 2.98 (95% CI, 1.32–6.73; p = 0.009). The presence of detectable EMMPRIN levels was more frequently observed in premenopausal women (X2 = 11.7; p < 0.001) and in patients with a shorter disease-free interval (X2 = 11.2; p = 0.004) defined as the time from primary diagnosis to recurrence (Table VI). In addition, Cox regression analysis showed that presence of EMMPRIN significantly correlated with shorter progression-free survival from the start of tamoxifen treatment (HR, 1.87; 95% CI, 1.25–2.80; p = 0.002) (Fig. 8). Thus, high EMMPRIN levels correlate with poor outcome on first line tamoxifen treatment.
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
Protein Identification by Nano-LC-FTICR MS
Many different proteomics technologies are available nowadays that all aid in the quest for cancer biomarkers. The method of choice will depend on the type of question asked, the type of material being investigated, and the availability of resources. Several studies have shown that the combination of dedicated nano-LC separation coupled to high end FT MS offers the best potential for in-depth analysis of limited sample quantity, which is usually the case with clinical material (23, 28, 36, 37, 40). In the present study, we used nano-LC-FTICR MS and a composite breast cancer cell line AMT tag database for the identification of peptides from as little as
550 ng of protein lysate. Overall we identified over 17,000 unique peptides corresponding to over 2,500 unique proteins, a significantly larger fraction of the proteome than attainable with more conventional proteomics techniques (20, 22). Furthermore we believe there is more to gain if a breast cancer tissue-specific AMT tag database becomes available. Although breast cancer cell lines represent aspects of normal and malignant breast tissue, it is well known that cultured cell lines have quite a distinct proteomic profile compared with primary cells or tissues. This was clearly demonstrated by Ornstein et al. (18) who compared proteomes of microdissected prostate tumor cells with proteomes of matching cell lines from the same patient. They showed that protein expression was strikingly altered in cultured cells, which had less than 20% proteins in common with uncultured cells (18). Therefore, it is very well possible that proteins involved in therapy resistance of breast tumors are not expressed in cell lines and thus are missing from the AMT tag database used in this study. To overcome this problem, we are currently constructing an AMT tag database from breast cancer tissues using a selection of tumors that have distinct phenotypic characteristics. A breast cancer tissue-specific AMT tag database will most likely increase the number of identified peptides (i.e. proteome coverage) in LC-MS analyses, thus increasing our chances of identifying relevant biomarkers. Proteome coverage could even be further improved using "smart MS/MS," e.g. by fragmenting currently unidentified LC-MS features.
Discovery and Verification of Putative Tamoxifen Therapy Response-associated Proteins
The putative protein profile described in this study consists of 100 proteins involved in a variety of biological processes. These proteins can be categorized into different functional classes, such as structural proteins, signaling proteins and kinases, metabolic enzymes, proteins involved in apoptosis, and others (see Table II). Several of the putative profile proteins (NAP1L1, pyridoxine-5'-phosphate oxidase, and UQCRFS1) have been previously associated with tamoxifen therapy resistance in breast cancer (41, 42) or chemotherapy resistance (SGPL1 and TUBB3) in vitro and in clinical specimens (43–45) and with aggressiveness of breast cancer (S100A6, S100A9, CLIC4, EBP50, and OCLN) (46–51).
Because the discovery of putative tamoxifen response-predictive proteins was performed in pooled samples, it was important to verify the presence and relative abundance of these proteins in each individual tumor tissue. Using a targeted MS/MS approach, we successfully identified 55 profile proteins in individual, non-microdissected tumor lysates and retrieved quantitative information for 47 of these proteins. Clearly 45 putative proteins were left unverified in individual tumor samples, including our top discriminating protein, EMMPRIN. The relatively low verification rate can be justified by the use of different samples and LC-MS platforms for the discovery and verification part of the study. Microdissected tumor cell lysates were analyzed by ultranarrow LC coupled to FTICR for discovery, whereas whole tissue lysates representing a mixture of cell types were analyzed by a standardized LC-MS/MS platform for verification. Nano-LC-FTICR analysis yielded an average of
40,000 LC-MS features, whereas LC-MS/MS Orbitrap analysis detected on average
14,000 LC-MS features. Therefore, the nano-LC-FTICR platform yielded
3x higher proteome coverage and, one can speculate, resulted in a similar improvement in sensitivity (i.e. limit of detection). Similarly we only used information on accurate mass in targeted MS/MS experiments because it was not possible to use NET information as an inclusion criterion with the software version available at the time. The addition of NET information as an inclusion criterion will most likely increase the success rate of target peptide identification through MS/MS in future studies using updated instrument control software. The compilation of these effects (i.e. LC-MS platform with lower overall sensitivity and inadequate targeted MS/MS strategy) resulted in a failure to confirm the identity of our top discriminating protein as EMMPRIN in the verification study.
Nevertheless the presence of 55 putative profile proteins was verified, and based on the abundance ratios, ENPP1, UQCRFS1, and GNB4 were confirmed to be significantly differentially abundant between OR and PD tumors. In addition ENPP1, EIF3E, and GNB4, were significantly associated with time to progression upon first line tamoxifen treatment of recurrent breast cancer. So far, no link between ENPP1 or GNB4 and breast cancer or response to tamoxifen has been described, although ENPP1 overexpression and polymorphisms have been repeatedly associated with insulin resistance and obesity (52, 53). Obesity is a risk factor for breast cancer (54), and insulin resistance may be linked to tamoxifen therapy resistance.
EIF3E protein expression has been shown to be significantly decreased in breast cancer, which was frequently associated with loss of heterozygosity at the Int-6/eIF3-p48 locus (55). EIF3E is ubiquitously expressed and highly conserved, and it encodes the p48 subunit of the translation initiation factor eIF3, also named INT6. In a multiplex tissue immunoblotting study by Traicoff et al. (56), EIF3E expression was determined in 124 breast cancer tissues. It was shown that breast tissues clustered according to high or low EIF3E expression, and this segregation was not dependent on tumor stage. Furthermore EIF3E expression positively correlated with tumor suppressors, such as p53, suggesting a function in the same signaling pathway (56). It was postulated that EIF3E has diverse functions in cell growth in addition to translation initiation, including tumor suppressive properties. This was particularly clearly shown in studies where truncation or knockdown of EIF3E induced angiogenesis and tumor formation (57, 58). This tumor-suppressive role correlates well with the elevated abundance of EIF3E in OR tumors and its contribution to prolonged progression-free survival upon tamoxifen treatment.
Validation of EMMPRIN
The validation study was focused on our top discriminating protein, EMMPRIN, which is known to be involved in breast cancer and for which an appropriate antibody is conveniently available. EMMPRIN has been previously described to play a role in tumor cell invasion and metastasis (59). In particular, it acts through up-regulation of the urokinase-type plasminogen activator system, thereby promoting tumor cell invasion (60). In an immunohistochemical study using high density breast cancer tissue microarrays, it was shown that positive EMMPRIN staining correlated with various histopathological parameters, in particular with decreased tumor-specific survival in postmenopausal patients (61). EMMPRIN is up-regulated in many types of cancer (62), supporting the previous findings that the involvement of EMMPRIN in urokinase-type plasminogen activator deregulation may be a universal phenomenon in tumorigenesis and is not restricted to breast cancer. In addition, EMMPRIN has been recently shown to predict response and survival following cisplatin-containing chemotherapy in patients with advanced bladder cancer (63). An IHC analysis in 101 advanced bladder cancer patients showed that high EMMPRIN expression strongly correlated with shorter survival time, in particular in patients with metastatic tumors, and that response to chemotherapy could also be predicted with an odds ratio of 4.41 (63). In our study, high expression of EMMRPIN was more frequently observed in PD than OR tumors, and it was significantly associated with an early tumor progression after the onset of first line tamoxifen treatment in recurrent breast cancer. Combining our results with previous findings, one can speculate that EMMPRIN-induced tumor aggressiveness may be the result of therapy resistance in general (i.e. tamoxifen and chemotherapy) and that this mechanism is not restricted to breast cancer.
Concluding Remarks
In this study we demonstrated quantitative analysis of minute amounts of clinically relevant tumor tissues using ultrasensitive nano-LC-FTICR technology. These analyses have put forward a putative protein profile that may predict the outcome of response for tamoxifen therapy in breast cancer patients. Whether this profile as a whole is a good predictor for tamoxifen therapy response in a larger, independent group of patients and whether it is applicable to chemotherapy as well will be the subject of further investigations.
| ACKNOWLEDGMENTS |
|---|
, and Samual Purvine for assistance in MS data management and analysis. Anita Trapman-Jansen and Renée Foekens are acknowledged for technical assistance with IHC and TMA. Portions of this research were performed at the Environmental Molecular Sciences Laboratory, a national scientific user facility sponsored by the Department of Energy's Office of Biological and Environmental Research and located at Pacific Northwest National Laboratory, Richland, WA. | FOOTNOTES |
|---|
Published, February 24, 2009
* This work was supported, in whole or in part, by National Institutes of Health Grant RR18522 from the National Center for Research Resources. This work was also supported by the National Genomics Initiative/Netherlands Organization for Scientific Research (NWO). ![]()
The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. ![]()
1 The abbreviations used are: OR, objective response; AMT, accurate mass and time; BRB, biometric research branch; CI, confidence interval; CLIC4, chloride intracellular channel protein 4; CR, complete remission; EBP50, ezrin-radixin-moesin-binding phosphoprotein 50; EIF3E, eukaryotic translation initiation factor 3 subunit 6/E; EMMPRIN, extracellular matrix metalloproteinase inducer; GNB4, guanine nucleotide-binding protein β subunit 4; HR, hazard ratio; IHC, immunohistochemistry; IPI, International Protein Index; LCM, laser capture microdissection; NAP1L1, nucleosome assembly protein 1-like 1; NC, no change; NET, normalized elution time; OCLN, occluding; PCA, principal component analysis; PD, progressive disease; PR, partial remission; RP, reversed-phase; S100A6, calcyclin; S100A9, calgranulin B; SGPL1, sphingosine-1-phosphate lyase; TMA, tissue microarray; TUBB3, β-tubulin type 3; UQCRFS1, ubiquinol-cytochrome c reductase iron-sulfur subunit mitochondrial precursor; LTQ, linear trap quadrupole; ENPP1, ectonucleotide phosphatase/phosphodiesterase 1..
|| Present address: Dept. of Chemistry, Ajou University, Suwon 443-749, Korea. ![]()

Present address: Dept. of Computer Science, University of Toronto, Toronto, Ontario M5S 3G4, Canada. ![]()
Supported in part through a personal fellowship from the Dutch Cancer Society. To whom correspondence should be addressed: Erasmus Medical Center Rotterdam, Josephine Nefkens Inst., Dept. of Medical Oncology, Laboratory of Genomics and Proteomics of Breast Cancer, Dr. Molewaterplein 50, Be 430c, P. O. Box 2040, 3000 CA Rotterdam, The Netherlands. Tel.:31-10-7043814; Fax:31-10-7044377; E-mail: a.umar{at}erasmusmc.nl
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
L. F. Waanders, K. Chwalek, M. Monetti, C. Kumar, E. Lammert, and M. Mann Quantitative proteomic analysis of single pancreatic islets PNAS, November 10, 2009; 106(45): 18902 - 18907. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Journal of Biological Chemistry |
| Journal of Lipid Research | ASBMB Today |