|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 6:1170-1182, 2007.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
1-antitrypsin achieved the best results with a sensitivity, specificity, predictive value, and positive predictive value of 0.95 each in a logistic regression analysis. In addition, three new potential markers were discovered, whereas two other proteins showed promising trends but were not detected in sufficient numbers of samples to permit statistical validation. Differential expressions of some of these candidate biomarkers were independently verified using immunohistochemistry.
Endometrial carcinoma (EmCa), a cancer of the lining of the uterus, is the fourth most common cancer in Canadian women.2 Current methods of diagnosis rely on invasive techniques (biopsy and curettage), and no screening is available. A panel of biomarkers that helps in early diagnosis would, therefore, be useful especially for high risk groups, e.g. women who are on tamoxifen treatment or have hereditary nonpolyposis colorectal cancer syndrome. Although the eventual diagnostic testing for such biomarkers would be most facile from bodily fluids, such as blood or urine, the iTRAQ experiments performed thus far have been on resected EmCa from uterine tissues (hysterectomy specimens) (2, 3). The rationale for this approach is that the concentration of any biomarker is most likely highest in the cancerous tissue itself and not when diluted in the bodily fluids, thus facilitating discovery. In addition, the use of the cancerous tissue reduces the intrinsic need to demonstrate that any differentially expressed protein detected originates from the endometrial cancer. By contrast, the origins of differentially expressed protein in the blood include a variety of potential sites other than the actual tumor. The use of homogenized tissues yields a heterogeneous sample with the proteome being contributed by the stroma, vasculature, blood, and malignant/benign epithelium. This heterogeneity may attenuate, and even mask, the variation in protein expression levels characteristic of cancerous epithelial cells. One remedy for this drawback that we have adopted is the use of laser capture microdissection to procure the specific, malignant epithelial cells from the samples (5). This approach, however, is not practical when 103104 cells/sample are required for current proteomics techniques in a global biomarker discovery strategy. Thus far, the types of differentially expressed proteins discovered (2, 3) are primarily medium to high abundance proteins as universal detection methods, including the MS/MS technologies that were used, are much more efficient in detecting major rather than minor components in a complex mixture.
As described previously, our strategy in the search for EmCa markers requires a comparison between the cancerous endometrium and the two major phases, proliferative and secretory, of the normal reproductive aged endometrium (3, 6). EmCa occurs primarily in postmenopausal women; normal atrophic endometrium contains few glands and few epithelial cells. Thus we chose normal proliferative and secretory samples as controls. The multiplexing ability afforded by the iTRAQ reagents, which are available in four different tags, is well suited for such a simultaneous comparison especially in view of the fact that endometrial carcinoma itself can have two distinct morphologic and physiologic types. Type I cancers are endometrioid in histologic typing, well differentiated, and estrogen-dependent and have typically a better prognosis. By contrast, Type II carcinomas are serous and clear cell carcinomas, hormone-independent, and aggressive and have generally a poorer clinical outcome (7). In the current study, we differentiated between these two EmCa categories, thus fully utilizing all four iTRAQ labeling categories in the analysis. A key to mass tagging is the ability to mix the samples and perform subsequent isotope dilution mass analysis. Mixing ensures that any differences in expression levels of individual proteins are a result of initial differences between the samples in the set and not an artifact of differences in sample handling or processing. To ameliorate natural and unavoidable variations, such as the inherent interindividual variations that would exist between clinical samples from different patients, the experiments were performed in 10 sets of four samples each. Additionally the 10 normal proliferative samples that were used as the controls in each of the sets were also separately labeled and compared with each other in a separate set of experiments. This permitted the evaluation of the extent of variation attributable to interindividual expression levels for each of the PCMs as well as provided a means of normalizing the ratios of individual protein levels in different sets to the average expression level in the normal proliferative samples.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
1 ml of PBS with a mixture of protease inhibitors as described previously (1 mM 4-(2-aminoethyl)benzenesulfonyl fluoride, 10 µM leupeptin, 1 µg/ml aprotinin, and 1 µM pepstatin) (3). The washed tissue was then homogenized in 0.5 ml of PBS with protease inhibitors using a hand-held homogenizer. These homogenates were then flash frozen in liquid nitrogen and stored at 80 °C until use. Samples were thawed and clarified by centrifugation, and the protein concentration was determined by a Bradford-type assay using Bio-Rad's protein quantification reagent. Two hundred micrograms of each of the 40 samples were then labeled individually with an iTRAQ tag. As we were using double the manufacturer's suggested amounts (Applied Biosystems), we used two individual vials of each tag for labeling each sample. Trypsin digestion and labeling were performed according to the manufacturer's protocol. Normal proliferative, normal secretory, Type I cancer, and Type II cancer samples were labeled with the 114, 115, 116, and 117 tags, respectively. The trypsin-digested and labeled samples were then mixed in sets of four with each set containing one of each type of label, thus resulting in 10 sets in total.
Strong Cation Exchange (SCX) Separation Conditions
Each set of labeled samples was then separated by SCX fractionation using an HP1050 HPLC instrument (Agilent, Palo Alto, CA) with a 2.1-mm-internal diameter x 100-mm-length PolyLC Polysulfoethyl A column packed with 5-µm beads with 300-Å pores (The Nest Group, Southborough, MA). A 2.1-mm-internal diameter x 10-mm-length guard column of the same material was fitted immediately upstream of the analytical column. Separation was performed as described previously (3). Briefly each pooled sample set was diluted with the loading buffer (15 mM KH2PO4 in 25% acetonitrile, pH 3.0) to a total volume of 2 ml, and the pH was adjusted to 3.0 with phosphoric acid. Samples were then filtered using a 0.45-µm syringe filter (Millipore, Cambridge, Ontario, Canada) before loading onto the column. Separation was performed using a linear binary gradient over 1 h. Buffer A was identical in composition to the loading buffer; Buffer B was Buffer A containing 350 mM KCl. Fractions were collected every 2 min using an SF-2120 Super Fraction Collector (Advantec MFS, Dublin, CA) after an initial wait of 2 min to accommodate the void volume. This resulted in a total of 30 SCX fractions per sample set. These fractions were dried by speed vacuuming (Thermo Savant SC110 A, Holbrook, NY) and resuspended in 30 µl of 0.1% formic acid each.
LC-MS/MS Run Conditions
The fractions from 6 to 25 were then analyzed by nano-LC-MS/MS using the LC Packings Ultimate instrument (Amsterdam, The Netherlands) fitted with a 1-µl sample loop. Samples were loaded onto a 5-mm reverse phase (RP) C18 precolumn (LC Packings) at 50 µl/min and washed for 4 min before switching the precolumn in line with the separation column. The separation column used was either a 75-µm-internal diameter x 150-mm-length PepMap RP column from LC Packings packed with 3-µm C18 beads with 100-Å pores or an in-house equivalent packed with similar beads from Kromasil (The Nest Group). The flow rate used for separation on the RP column was 200 nl/min, and the gradient was as shown in Table I.
|
Data Analysis
The software used (Applied Biosystems/MDS SCIEX) for data acquisition for the first set of runs was Analyst 1.0 SP8, whereas the software for the second run onward was Analyst 1.1. Data were analyzed using ProQUANT 1.0 or 1.1, respectively, and the database searched was the Celera human database (human KBMS 20041109) with a total of 178,243 entries provided by Applied Biosystems. Tolerance for the searches was set for 0.4 Da for the MS and 0.2 Da for the MS/MS spectra. The two parameters used to evaluate the quality of the peptide matches were the score and the confidence, which are described in detail in the literature accompanying the software. Briefly the score is a ProQUANT-generated value based on the number of ions that matches the theoretical list of fragments of the peptide in question, whereas the confidence, also a ProQUANT-generated value, is calculated from empirical data. The algorithm used to calculate the confidence incorporates the distance score calculated for the peptide as well as factors such as the total number of results returned in the search. The distance score itself is calculated by determining the difference between the score of the particular peptide and that of the seventh highest scoring peptide for that particular MS/MS spectrum and is a measure of the confidence of the match. Only those peptides scoring higher than a score of 20 and a confidence of 75 were retained in the ProQUANT search. The ProQUANT results were then grouped using ProGroup viewer, which reports the lowest number of non-redundant protein identities that would account for the peptides identified along with the ratios for the relative abundance of these proteins after normalizing. Normalizing was performed by first calculating the median ratio of all proteins reported. Peptides that contribute to the protein identification but with ratios of the iTRAQ signature peaks smaller than 40 counts between the pair of labeled peaks in question were excluded from this calculation. The resulting median ratio was the normalizing factor used and was termed the applied bias. This normalizing factor is based on the assumption that most of the protein levels in the test samples should be similar to those in the control with the exception of those that are specific to the condition of the test sample itself (i.e. malignant or benign), thus minimizing any systematic error. When the ratio for a protein from a set of constituent peptides is calculated, peptide ratios with smaller errors (better ion-counting statistics) are weighted more heavily by the program. All peptides used for this calculation were unique to the given protein; peptides that were common to other isoforms or proteins of the same family were ignored. ProGroup also assesses the confidence of the protein identities reported. The ProGroup confidence score cutoff used was 1.3, which corresponds to a confidence limit of 95%. On occasion, the ratios of some proteins that were not automatically given by the ProGroup software were also reported using the ratios returned by the ProQUANT searches. These were typically instances in which the confidence in the sequence of the identifying peptides was lower than the specified cutoff for reporting by ProGroup but for which we had more confident results for the same peptides from a different sample run. Identities of these peptides were manually verified prior to inclusion. Lastly the ratios for each of the potential markers were averaged across all the runs in which they were identified.
As mentioned previously, the 10 normal proliferative samples were also compared against each other in a separate series of experiments. Samples for this second series of experiments were grouped in three sets. The first of these sets contained the proliferative samples used in the first four sets of samples in the experiments comparing the cancerous samples, i.e. P1P4; the second set comprised proliferative samples P4P7; and the third set comprised P7P10. In cases where the particular protein of interest was identified in all three sample sets in these proliferative sample comparisons, the expression ratios were all recalculated relative to one proliferative sample, typically P1. These adjusted ratios were then used to calculate the average normal proliferative ratio, which was in turn used to normalize all the individual normal proliferative ratios themselves. This calculation was also performed on the individual expression ratios for the EmCa sample comparisons, thus permitting them to first be expressed relative to P1 and then relative to the average normal proliferative level.
Dot-blot and Immunohistochemical Verification
Verification of the differential expression levels of potential markers discovered using iTRAQ analysis was provided by dot-blot analyses and/or immunohistochemical analyses using antibodies specific to the protein of interest. Dot-blot analysis was performed by spotting 2 µg of each homogenate on a nitrocellulose filter (Bio-Rad); after blocking with 5% (w/v) skimmed milk in TBS (20 mM Tris, pH 7.5, 150 mM NaCl), each filter was probed by incubating it with a primary antibody in 5% bovine serum albumin in TBS with 0.1% Tween 20 overnight with shaking. An additional blot was probed with antibody specific for ß-actin. Additionally selected proteins identified in the iTRAQ study were verified and localized using immunohistochemistry of proliferative, secretory, and EmCa tissues fixed in 10% buffered formalin and embedded in paraffin blocks. These tissues were different from those used in the iTRAQ work. The antibodies were applied in an appropriate dilution determined through a pilot study and immunohistochemically visualized using a diaminobenzidine chromogen. Interpretations of the immunohistochemically stained sections were conducted using a standardized microscopic review to assess positive staining (brown) for the targeted proteins in four tissue components: epithelium/carcinoma, endometrial stroma, any white blood cells, and glandular secretions. Antibodies used for these verifications were purchased from various commercial sources: ß-actin, Cell Signaling Technologies (Pickering, Ontario, Canada); polymeric immunoglobulin receptor (PIGR), Cedarlane Laboratories (Hornby, Ontario, Canada); pyruvate kinase (PK) M2, ScheBo Biotech AG (Glessen, Germany); and chaperonin 10 (Cpn 10), Stressgen (Victoria, British Columbia, Canada).
Statistical Analysis
Evaluation of differential expression in the iTRAQ analyses was performed using two statistical approaches. A preliminary evaluation of the data was carried out using a power analysis. The deviation from unity,
, beyond which differential expression is indicated is given by [2 x S.D.2 x (Z
+ Zß)2/N]0.5 where S.D. is the standard deviation, (Z
+ Zß)2 is the power index, and N is the number of sample sets (10). The standard deviations of the cytoplasmic structural proteins, actin and ß5-tubulin, were used to estimate the variation of protein concentrations between individual patients and sets. These averaged to be
0.3 over many iTRAQ analyses (see e.g. Table III). A power index of 10.5 was used for confidence limits of 95% for Type I error (
) and 90% for Type II error (ß) (10). Type I error occurs when a difference is assumed (i.e. when the null hypothesis is rejected) where there is none; by contrast, Type II error occurs when no difference is assumed (i.e. when the null hypothesis is accepted) where there actually is. Thus for N = 2, the ratios must be <1/(1 +
) or >(1 +
) = <0.51 or >1.97 to indicate differential expression; for N = 10, the criteria relax to <0.70 or >1.43. The three most significant and consistent biomarkers were then chosen as explanatory input variables in a logistic regression model as a discriminator between malignant and normal samples. If p denotes the predicted probability that a case i whose observed marker values are given by the vector x(i) = (x(i, marker 1), x(i, marker 2), x(i, marker 3)) is malignant. Then the logistic regression discriminator has the form
![]() |
where the index "i" denotes the individual sample and "j" is a summation index that runs over the markers. Analogously logistic regression discriminators were defined for each of the three markers individually. For a training set S of marker values x(i) (i = 1, ... , n) the model parameters
and ßj were determined by maximizing the multiplicative likelihood over S using R Statistics (version 2.0.1). The discriminators were trained using the average observed iTRAQ ratios as marker observations in the malignant and benign cases. Here the malignant cases comprise the 20 Type I and Type II cancer cases, whereas the benign cases comprise 10 normal proliferative and 10 normal secretory cases. Receiver operating characteristic curves were calculated from the predictive scores of the parametrized logistic regression model by varying thresholds for "positive" calls between 0 and 1. Sensitivities, specificities, predictive values (PVs), and positive predictive values (PPVs) were calculated using a cutoff value of 0.5 on the logistic regression predictor. For any given receiver operating characteristic curve, the area under the curve (AUC) value was determined using the Mann-Whitney statistics (11, 12).
|
| RESULTS |
|---|
|
|
|---|
|
6 of the 10 sets and with
50% showing differential expression). These proteins, all confidently identified with more than two peptide matches in each case (see supplemental data for peptide sequence and coverage for each potential marker) are given in Table III along with two structural proteins, actin and ß5-tubulin, as controls. In addition, two other proteins that showed differential expression in an earlier discovery work (3) and two interesting proteins that show promising trends are also given. Two samples, initially classified as Type II cancers (II6 and II10), were subsequently reclassified as predominantly Type I after histopathological re-examination and are shown in Table III as I6b and I10b. The expression ratios shown are the averages of the replicate analyses. For pyruvate kinase M1/M2, polymeric immunoglobulin receptor precursor, macrophage migration-inhibitory factor (MIF),
1-antitrypsin (AAT), creatine kinase chain B (CKB), transgelin, actin, and ß5-tubulin, the ratios are those relative to the averages of the proliferative phase samples. Observations of the other listed proteins were incomplete in the proliferative phase comparisons; for these proteins, the ratios are relative to the specific proliferative phase samples used in the pairing. Table IV shows the details of PK results as an illustration of the typical analytical precision achievable. Due to the scope of this study, the various runs for each sample set were often temporally separated by as much as 6 months. The ratios determined, however, varied typically by no more than ±20%. PCMs such as PK, PIGR, Cpn 10, MIF, AAT, CKB, and transgelin have been reported in our earlier study (3) and were verified here in this more extensive study. Two proteins reported earlier (3), phosphatidylethanolamine-binding protein (PEBP) and heterogeneous nuclear ribonucleoprotein D0 (hnRNP D0) did not show consistent differential expression in this expanded study and thus were not pursued further. Three new proteins showing differential expression in the 10 sets examined are WAP four-disulfide core domain protein 2 (WFDC2), clusterin, and mucin 5B. In addition, progestagen-associated endometrial protein, also known as PP14 and known to be selectively overexpressed in the secretory phase (13, 14), was evident.
|
The comparisons of the 10 proliferative samples afforded us an estimate on the variation of the abundances of proteins across samples or individual patients. An analysis of the following nine consistently observed proteins, PIGR, PK, Cpn 10, MIF, AAT, CKB, transgelin, actin, and ß5-tubulin, in the proliferative and secretory phases (thus giving 18 cases) showed that 13 of 18 cases have relative S.D. values
30%, three of 18 cases have relative S.D. values of 3140%, and two of 18 cases have relative S.D. values >40%. The two structural proteins, actin and ß5-tubulin, exhibit relative S.D. values of 2532% in the Type I and Type II EmCa samples. However, of the 14 remaining cases in the malignant samples, five of 14 cases have relative S.D. values
30%, three of 14 cases have relative S.D. values of 3140%, and six of 14 cases have relative S.D. values >40%. Thus there are typically much larger patient-to-patient variations across the malignant samples (see supplemental data for more details).
In a second statistical analysis strategy, all listed proteins in Table III were screened for their individual association with malignant or benign status using the two-sample t test. Four proteins were deemed to provide the maximal allowable number of individual components in a panel that constitute robust and reproducible results, i.e. without losing validity due to overfitting. At a t test significance threshold of p = 0.005, the following four proteins were found to be differentially expressed between cancer and normal cases: PK (p = 1.24 x 107), Cpn 10 (p = 2.2 x 103), AAT (p = 8.97 x 104), and CKB (p = 2.06 x 104). AAT was more uniformly expressed than CKB within the combined proliferative and secretory samples and was included in a candidate panel marker together with PK and Cpn 10 (see supplemental data). The performance is shown in Fig. 1. Evidently the use of the panel of three potential markers permits discrimination between cancer and normal samples, achieving an AUC of 0.96 and a sensitivity, specificity, PV, and PPV of 0.95 each. This was an improvement over the result when using the single best marker (PK), which achieved an AUC of 0.95, a sensitivity of 0.85, specificity of 0.90, PV of 0.875, and PPV of 0.895. To assess whether the panel would be reproducible and valid in its predictive performance on independent data, we used two-thirds/one-third cross-validation. The set of 40 samples was split 10 times randomly into training and test sets of, respectively, 26 and 14 samples; the data from the 26 samples were used as input variables to train the logistic regression predictor. To maintain proportions and make the performance of the predictor over the random splits more comparable, the random selection was programmed such that identical absolute numbers of benign and malignant cases were assigned to training and test sets in each of the 10 data splits (i.e. 13 benign/13 malignant in each training set; seven benign/seven malignant in each test set). Once the logistic regression discriminator was parametrically specified on a training set, it was used as a predictor to make calls for each of the 14 "independent" test cases by using a cutoff value of 0.5. The accuracy of these calls, compared with the actual disease status of the test cases, was evaluated in terms of fractions of true positives (sensitivity) and false positives (1 specificity) for each of the 10 test sets (Table V). The similarity in performances between the training and test sets validates the predictability and ruggedness of the panel of biomarkers.
|
|
|
|
| DISCUSSION |
|---|
|
|
|---|
In our preliminary study, pyruvate kinase M1/M2 was demonstrated as being overexpressed in EmCa samples by both cleavable ICAT and iTRAQ methods (3). This result was verified in this study where PK appears to be an effective marker for differentiating between both Types I and II EmCa and normal endometrial tissues. The significance of pyruvate kinase as a cancer biomarker has increasingly been recognized. A number of studies have suggested that PK M2, in particular, is present primarily in a dimeric form in tumors and that it is useful as a biomarker in the early detection of tumors (16, 17). In fact the M2 isoform, after initial expression at the fetal stage, was reported to be prevalent only in proliferating cells and tumors (17). PK overexpression in tumor cells is understandable and can be explained on the basis of the key role that it plays in the generation of ATP in the glycolytic pathway. Under the hypoxic conditions that are typical for tumors, this pathway is a critical route by which tumors satisfy the higher energy requirements needed for proliferation (for a review, see Ref. 18). Another study demonstrated that PK M2, in combination with any of three tumor markers (carcinoembryonic antigen, CA72-4, and CA19-9) for gastrointestinal cancer, results in improved sensitivity for detection of colorectal, gastric, and esophageal cancers (19).
Polymeric immunoglobulin receptor precursor was observed previously to be overexpressed in EmCa and was verified in this study (3). PIGR is part of the immune response system and is typically expressed by epithelial cells. Its primary role is the transport of dimeric IgA from the basolateral surface of the epithelium to the apical surface where it is released into exocrine secretions (20, 21). It is, therefore, plausible that the overexpression is part of the host's response to the presence of the cancerous cells themselves or to the carcinogenic stimulus. This would also suggest possible mechanistic explanations for the less aggressive nature of the Type I cancer. These possible explanations stem from the fact that the cleaved form of PIGR, known as the secretory component, is a known inhibitor of the proinflammatory cytokine IL-8 and acts by forming an inactive complex with this chemokine, thereby preventing chemotaxis of polymorphonuclear neutrophils (PMNs) (22). Although it is generally accepted that PMNs play an antitumorigenic role (23), there are instances where this might not hold true. A recent study showed that melanoma cell extravasation is facilitated by PMNs and that blocking either the IL-8 receptors on PMNs or neutralizing the soluble IL-8 in cell suspensions reduced extravasation of these melanoma cells (24). Thus the inhibition of PMN accumulation might reduce the potential for metastases to occur. PMNs might also facilitate tumor progression through the release of enzymes that are responsible for activation of matrix metalloproteinase-2 (MMP-2) from its inactive pro-MMP-2 form (25). In turn, MMP-2 is known to be involved with angiogenesis and tumor invasion (25). Consequently the increased level of PIGR in the Type I cancer might result in the effective inhibition of angiogenesis and prevention of tumor invasion. Such a contradictory role for cells that are part of the immune response is well documented. A similar role for macrophages was recently described in a review, which demonstrated that macrophages facilitate tumor progression by enabling angiogenesis and tumor cell motility as a result of increased intravasation (26). Thus the inhibition of PMN migration by PIGR overexpression might result in the inhibition of angiogenesis, tumor invasion, and metastases thereby accounting for the less aggressive nature of the Type I cancer.
A closer examination of the factors that affect the expression levels of the potential markers is also enlightening. The factors influencing the expression levels of PIGR include induction by cytokines such as IL-4, tumor necrosis factor
, and interferon-
(21, 27, 28). Signaling pathways that are involved with the response to induction by such ligands include the STAT (signal transducers and activators of transcription), NF
B, and p38-MAPK (mitogen-activated protein kinase) pathways (21, 22, 27, 28). In addition, there are cofactors that are also known to be involved with up-regulation of PIGR expression. One such cofactor is all-trans-retinoic acid (RA), which is a metabolite of vitamin A (29). RA enhances the up-regulation of PIGR expression in response to IL-4 and interferon-
stimulations. RA and NF
B also regulate the expression levels of some of the other potential markers discovered in this study and are discussed below. It is also noteworthy that NF
B has been specifically linked with endometrial cancer by various other studies (30, 31).
WAP four-disulfide core domain protein 2, which is also known as HE4, belongs to a family of proteins that are known to be proteinase inhibitors. WFDC2 is known to be overexpressed in a range of different cell lines including ovarian, renal, lung, colon, and breast lines. In a recent study, WFDC2 showed up-regulation in mRNA levels during the secretory phase in rhesus monkeys (32). This result is consistent with the iTRAQ results that we observed in the secretory phase samples (Table III). The bulk of the initial studies on WFDC2 were focused on using it as a biomarker for ovarian carcinoma (33). However, an investigation on the expression levels of this protein in various human tissues using DNA microarrays followed by validation with immunohistochemistry has confirmed that overexpression is also observed in 90% of endometrial adenocarcinomas (34). It is noteworthy that a recent review has suggested that the overexpression of WFDC2 is a good, early marker for ovarian cancer, even better than CA125 for that purpose. However, WFDC2 did not show as high an overexpression in clear cell as opposed to epithelial ovarian carcinomas and might not prove useful for diagnosis of clear cell cancers (35). This last aspect appears to mirror our results with Type II EmCa in which overexpression levels, on average, were also not as high as those in Type I EmCa; Type II endometrial cancers are serous and/or clear cell cancers (36).
Another noteworthy point is that NF
B might also play a role in regulating the expression levels of WFDC2, through a binding site identified in the promoter region of WFDC2, as well as other proteins belonging to this family (35). This link with NF
B appears to be in common between WFDC2 and PIGR above, thus suggesting a possible common means for the overexpression of both proteins.
Mucin 5B is a new potential EmCa marker found in this study. This protein has not been reported previously to be a marker for or associated with endometrial cancer to the best of our knowledge. Mucins in general, however, have been associated with various cancers and have been proposed to promote tumor cell invasion and metastases (37). In the case of lung cancer, tumors of patients who were smokers showed a higher level of Mucin 5B, and these patients tended to show higher degrees of postoperative relapse (37). Furthermore it has been demonstrated that Mucin 5B mRNA expression is enhanced by RA, a factor in common with PIGR above (38). The 5' flanking region of Mucin 5B has two NF
B binding sites, suggesting another element in common with PIGR and WFDC2 (38).
1-Antitrypsin is a secreted glycoprotein and like WFDC2 is a protease inhibitor. In our study, the expression levels were down-regulated relative to the normal proliferative samples. AAT is known to inhibit angiogenesis and tumor growth; thus underexpression would have foreseeable implications for cancer (39). The precedence for such down-regulation of expression levels for AAT in cancers has already been discussed previously (3).
Clusterin is another new potential biomarker for EmCa found in this study. It is an antiapoptotic glycoprotein that has been implicated in resistance to various cell death triggers (40). Independent validation for our findings is provided by the tissue microarray results available from the Human Protein Atlas (41). Their results show rare, moderately stained cells in the stroma and no staining in the glandular cells or the myometrium in the normal endometrial samples. By contrast, five of 12 endometrial cancer samples show moderate cytoplasmic staining in the epithelial cells, and another four show weak staining. This is in support of our results. Overexpression of clusterin has been reported previously for various cancers, including hepatocellular, breast, prostate, and urothelial bladder carcinoma (4245). Of particular interest is a study that showed that inhibition of clusterin expression enhances sensitivity to chemotherapy, thus making clusterin a useful therapeutic target (43). Moreover another study demonstrated that tamoxifen, a drug used to treat breast cancer, enhances clusterin expression levels, which in turn was linked to an increased potential for metastases of breast cancer cells. This, in their view, suggests a possible mechanism for the increase of endometrial cancer in postmenopausal women undergoing tamoxifen treatment for breast cancer (46).
The small increase observed in the levels of creatine kinase B in the secretory phase in this study was consistent with the findings of another study that had demonstrated a similar increase in the secretory phase over the proliferative phase using two-dimensional gels followed by tryptic digestion and partial sequencing (47). Additionally other independent enzyme activity studies showed a greater than 3-fold increase in the activity for CKB in the secretory phase over the proliferative phase (48). CKB is underexpressed in EmCa; the extent is apparently larger in Type II than Type I samples. This down-regulation has also been observed in various other cancers including colon and lung adenocarcinomas as well as squamous cell carcinomas (49).
Cpn 10, calgizzarin, transgelin, and MIF are all proteins we had detected previously as being differentially expressed in EmCa samples; these have all also been implicated in various other forms of cancer and were discussed previously (3, 50). Macrophage capping protein (Cap-G) and leucine aminopeptidase 3 (LAP3) were not identified in a sufficient number of EmCa samples to justify inclusion in the list of differentially expression proteins in this study. Although the numbers of samples in which they were observed were limited, they did show apparent trends in expression levels in Type II EmCa, suggesting that they might prove useful as subjects of a targeted investigation. Cap-G belongs to the gelsolin family of proteins and upon activation by Ca2+ is responsible for capping barbed ends of actin filaments (51). Thus Cap-G affects the actin filament structure within a cell, and as non-muscle cells require rapid reorganization of the actin filament network to change shape during movement, it is conceivable that Cap-G is one of the proteins involved in the mechanism by which a tumor cell metastasizes. This could be the reason that it appears to be overexpressed to a larger extent in the more aggressive Type II than in Type I EmCa. Currently not much detail is known about the function and the distribution of expression for LAP3. Interestingly placental leucine aminopeptidase (P-LAP) has been linked specifically with EmCa, and an increased expression level of P-LAP is associated with a poor prognosis (52). However, a BLAST (Basic Local Alignment Search Tool) search between the LAP3 and P-LAP amino acid sequence returned no significant homology, thus making LAP3 a potentially novel marker for endometrial cancer.
Some commonalities appear among the various PCMs discussed above. One of these is the possible implication of PMNs. As noted individually above, PMNs and PIGR expression levels are closely linked. In fact, not only can the PIGR expression level affect PMN chemotaxis but also PMN-expressed enzymes, such as neutrophil elastase (NE) and proteinase 3, known to cleave PIGR to form secretory component (22). Furthermore under specific conditions, supernatants from activated PMNs have been shown to induce PIGR expression through the NF
B pathway (22). Thus PMNs might conceivably be the potential common element that was alluded to earlier that could elicit a response through NF
B sites in WFDC2 and PIGR as well as Mucin 5B. Another possible association between PMNs and WFDC2 is the fact that in some cell types other proteins belonging to the WFDC family, namely, secretory leucocyte protease inhibitor (SLPI) and elafin, are known to inhibit NE (22). Antiproteinase activity by WFDC2 has not yet been demonstrated but has been inferred on the basis of its similarity to SLPI and elafin (35); therefore it is possible that WFDC2 may play a role in inhibiting PMN-expressed enzymes in the endometrium akin to the role of secretory leucocyte protease inhibitor (SLPI) and elafin in the other cell types. Another antiproteinase that might have some influence on the possible role of PMNs in this context is AAT, a known inhibitor of the PMN-released enzyme NE. Lastly it has also been proposed that one of the mechanisms by which PMNs cause the overexpression of PIGR is through the release of IL-1ß (22). IL-1 is also known to cause an increase in the clusterin expression level, thus representing another link between clusterin and the aforementioned biomarkers (4). Although it is still too early to draw conclusions regarding the role of PMNs in influencing the expression levels of the above potential markers, the possible confluence of so many links to a single common factor does pose intriguing questions that might merit further investigation.
In our study, we chose to correlate the 10 sample sets by comparing the 10 proliferative samples among themselves. An alternative and arguably simpler strategy is to pool the 10 proliferative samples and compare every other sample to the proliferative pool. In the future, we will evaluate the merits of using a single, pooled sample as the control normal sample for all the sample sets; this will mean, however, that we will lose information on the distributions of protein concentrations in normal samples. As shown above, the relative expression level (ratio) for any given PCM across the 10-sample sets appears to vary with a relative S.D. typically
30%. Some of this variation may reflect genuine person-to-person differences; however, a significant contribution to this observed variation must also stem from differing proportions of cancerous glands within the samples that were homogenized or differing stages and extents of the EmCa. In future analyses, it will be useful to record the proportion of cancerous tissue present in each sample. Accounting for such a factor might help to reduce the range of differential expression observed within each PCM. A perhaps conceptually simpler means to address this issue would be to analyze laser capture microdissected cancerous glands or epithelial cells as mentioned earlier. Relative expression of PCMs would then be evaluated against similarly procured epithelial cells from normal endometrial tissues. To minimize the number of laser capture microdissected cells required, this analysis could conceivably be performed under multiple reaction monitoring mode on a triple quadrupole or linear ion trap instrument, which has long been used for small molecule quantification in the pharmaceutical industry. Such monitoring would target the transitions specific to the peptides of interest from the PCMs. The increased sampling time afforded by multiple reaction monitoring would result in superior sensitivity, thus requiring less protein or fewer cancerous cells.
The presence of Types I and II samples offered an opportunity to discover and identify proteins that may differentiate between Types I and II EmCa. Although there appears to be tantalizing trends in some of the proteins shown in Table III, we prefer to withhold further comments until a large data set is available.
| CONCLUSION |
|---|
|
|
|---|
1-antitrypsin, performed satisfactorily and achieved a sensitivity, specificity, predictive value, and positive predictive value of 0.95 each. In addition we discovered three new potential markers, which will be investigated further in a targeted study.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published, MCP Papers in Press, March 19, 2007, DOI 10.1074/mcp.M600378-MCP200
1 The abbreviations used are: iTRAQ, isobaric tags for relative and absolute quantitation; PCM, potential cancer marker; EmCa, endometrial carcinoma; SCX, strong cation exchange; RP, reverse phase; PIGR, polymeric immunoglobulin receptor; PK, pyruvate kinase; Cpn 10, chaperonin 10; PV, predictive value; PPV, positive predictive value; AUC, area under the curve; MIF, macrophage migration-inhibitory factor; AAT,
1-antitrypsin; CKB, creatine kinase chain B; PEBP, phosphatidylethanolamine-binding protein; hnRNP D0, heterogeneous nuclear ribonucleoprotein D0; WFDC2, WAP four-disulfide core domain protein 2; PMN, polymorphonuclear neutrophil; MMP-2, metalloproteinase-2; RA, retinoic acid; P-LAP, placental leucine aminopeptidase; IL, interleukin; LAP3, leucine aminopeptidase 3; SLPI, secretory leucocyte protease inhibitor; NE, neutrophil elastase. ![]()
2 National Cancer Institute of Canada, Canadian Cancer Statistics 2006, www.ncic.cancer.ca/vgn/images/portal/cit_86751114/31/23/935505938cw_2006stats_en.pdf.pdf. ![]()
* This work was supported in part by operating grants from the National Cancer Institute of Canada and the Ontario Genomics Institute and Genome Canada. Infrastructural support was provided by the Ontario Research and Development Challenge Fund, Applied Biosystems, and MDS SCIEX. The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. ![]()

Supported by a clinical fellowship from Mount Sinai Hospital. ![]()
|||| To whom correspondence should be addressed. Tel.: 416-650-8021; Fax: 416-736-5936; E-mail: kwmsiu{at}yorku.ca
| REFERENCES |
|---|
|
|
|---|
2-globulin, the major secretory protein of the luteal phase and first trimester pregnancy endometrium, is not glycosylated prolactin but related to beta-lactoglobulins.
J. Clin. Endocrinol. Metab.
65, 1067
1071
B- and p38 mitogen-activated protein kinase-dependent mechanisms.
Am. J. Respir. Cell Mol. Biol.
28, 485
498
B in the expression by tumor necrosis factor-
of the human polymeric immunoglobulin receptor (pIgR) gene.
Immunogenetics
51, 289
295[CrossRef][Medline]
.
Mol. Cell. Endocrinol.
194, 95
105[CrossRef][Medline]
B family and related proteins in endometrial carcinoma.
J. Pathol.
204, 569
577[CrossRef][Medline]
B inactivation by progesterone through progesterone receptors in Hec50co poorly differentiated endometrial cancer cells: induction of A20 and ABIN-2.
Gynecol. Oncol.
94, 463
470[CrossRef][Medline]
1-Antitrypsin inhibits angiogenesis and tumor growth.
Int. J. Cancer.
112, 1042
1048[CrossRef][Medline]This article has been cited by other articles:
![]() |
R. Ralhan, L. V. DeSouza, A. Matta, S. Chandra Tripathi, S. Ghanny, S. Datta Gupta, S. Bahadur, and K. W. M. Siu Discovery and Verification of Head-and-neck Cancer Biomarkers by Differential Protein Expression Analysis Using iTRAQ Labeling, Multidimensional Liquid Chromatography, and Tandem Mass Spectrometry Mol. Cell. Proteomics, June 1, 2008; 7(6): 1162 - 1173. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Journal of Biological Chemistry |
| Journal of Lipid Research | ASBMB Today |