|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 1:304-313, 2002.
© 2002 by The American Society for Biochemistry and Molecular Biology, Inc.






,

Department of Surgery, University of Michigan, Ann Arbor, Michigan 48109
Department of Biostatistics, University of Michigan, Ann Arbor, Michigan 48109
|| Department of Epidemiology, University of Michigan, Ann Arbor, Michigan 48109
** Department of Pathology,University of Michigan, Ann Arbor, Michigan 48109
¶ Department of Pediatrics, University of Michigan, Ann Arbor, Michigan 48109
| ABSTRACT |
|---|
|
|
|---|
40% of all new cases of non-small cell lung cancer and are now the most common histologic type. Functional genomics, broadly defined as the comprehensive analysis of genes and their products, have become a recent focus of the life sciences (1). Application of these approaches to lung adenocarcinomas has the potential to aid in the identification of high risk patients with resectable early stage lung cancer that may benefit from adjuvant therapy, as well as to identify new therapeutic targets. In human lung cancer, however, little is currently understood regarding the relationship between gene expression as determined by measuring mRNA levels and the corresponding abundance of the protein products. A number of powerful techniques for analysis of gene expression have been used including differential display (2), serial analysis of gene expression (3), DNA microarrays (4), and proteomics via two-dimensional polyacrylamide gel electrophoresis and mass spectrometry (5). Bioinformatics tools have also been developed to help determine quantitative mRNA/protein expression profiles of all types of cells and tissues (6) and now can be applied to benign and malignant tumors. DNA microarrays (cDNA and oligonucleotide) permit the parallel assessment of thousands of genes and have been utilized in gene expression monitoring (7), polymorphism analysis (8), and DNA sequencing (9). Recent studies have focused on classification or identification of subgroups of lung tumors using DNA microarrays (10, 11). The use of mRNA expression patterns by themselves, however, is insufficient for understanding the expression of protein products, as additional post-transcriptional mechanisms, including protein translation, post-translational modification, and degradation, may influence the level of a protein present in a given cell or tissue. Proteomic analyses, a complementary technology to DNA microarrays for monitoring gene expression, involves protein separation and quantitative assessment of protein spots using 2D 1 -PAGE and protein identification using mass spectrometry. By combining proteomic and transcriptional analyses of the same samples, however, it may be possible to understand the complex mechanisms influencing protein expression in human cancer.
In this study, we determined mRNA and protein levels for 165 proteins (98 genes) in 76 lung adenocarcinomas and nine non-neoplastic lung tissues. Protein levels were determined using quantitative 2D-PAGE analysis, and the separated protein polypeptides were identified using matrix-assisted laser desorption/ionization mass spectrometry (MALDI-MS). The corresponding mRNA levels for the identified proteins within the same samples were determined using oligonucleotide microarrays. Correlation analyses showed that protein abundance is likely a reflection of the transcription for a subset of proteins, but translation and post-translational modifications also appear to influence the expression levels of many individual proteins in lung adenocarcinomas.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
Oligonucleotide Array Hybridization
The HuGeneFL oligonucleotide arrays (Affymetrix, Santa Clara, CA) containing 6800 genes were used in this study. Total RNA was isolated from all samples using Trizol reagent (Invitrogen). The resulting RNA was then subjected to further purification using RNeasy spin columns (Qiagen). Preparation of cRNA, hybridization, and scanning of the HuGeneFL arrays were performed according to the manufacturers protocol (Affymetrix, Santa Clara, CA). Data analysis was performed using GeneChip 4.0 software. The gene expression profile of each tumor was normalized to the median gene expression profile for the entire sample. Details of data trimming and normalization are described elsewhere (11).
2D-PAGE and Quantitative Protein Analysis
Tissue for both protein and mRNA isolation came from contiguous areas of each sample. Protein separation using 2D-PAGE, silver staining, and digitization were performed as described previously (12, 13). Our 2D-PAGE system allows us to run 20 gels at one time (one batch). Spot detection and quantification were accomplished utilizing Bio Image Visage System software (Bioimage Corp., Ann Arbor, MI). The integrated intensity of each spot was calculated as the measured optical density units x mm2. Of the total possible 2000 spots detectable on each gel, 820 spots on the gel of each sample were matched using a Gel-ed match program with the same spots on a chosen "master" gel. In each sample, 250 ubiquitously expressed reference spots were used to adjust for variations between gels, such as that created by subtle differences in protein loading or gel staining. Slight differences because of batch were corrected after spot-size quantification.
Mass Spectrometry and 2D Western Blotting
Preparative 2D gels were run using extracts from A549 lung adenocarcinoma cells (obtained from ATCC) and using the identical experimental conditions as the analytical 2D gels, except 30% more protein was loaded. The resolved protein gels were silver-stained using successive incubations in 0.02% sodium thiosulfate for 2 min, 0.1% silver nitrate for 40 min, and 0.014% formaldehyde plus 2% sodium carbonate for 10 min. For protein identification, protein polypeptides underwent trypsin digestion followed by MALDI-MS using a MALDI-TOF Voyager-DE mass spectrometer (Perseptive Biosystems, Framingham, MA). The masses were compared with known trypsin digest databases using the MS-FIT database (University of California, San Francisco; prospector.ucsf.edu/ucsfhtml3.2/msfit.htm). Some of the polypeptides included in the analysis had been identified prior to this study on the basis of sequencing (14). The identified protein spots used in this paper are shown in Fig. 1A. The method for 2D-PAGE Western blot verification was as described previously (15). The 2D Western blots of GRP58 and Op18 are shown in Fig. 1, C and E; the others, such as GRP78, GRP75, HSP70, HSC70, KRT8, KRT18, KRT19, Vimentin, ApoJ, 143-3, Annexin I, Annexin II, PGP9.5, DJ-1, GST-pi, and PGAM, are described elsewhere. 2
|
log (1 + x) was applied to normalize all protein expression values. The relationship between protein and mRNA expression levels within the same samples was examined using the Spearman correlation coefficient analysis (16). To identify potentially significant correlations between gene and protein expression, we used an analytical strategy similar to SAM (significance analysis of microarrays) (17), which uses a permutation technique to determine the significance of changes in gene expression between different biological states. To obtain permuted correlation coefficients between gene and protein expression, genes were exchanged first in such a way that permutated correlation coefficient were calculated based on pseudo pairs of genes and proteins. The distribution of permutated correlation coefficients became stable after 60 permutations. This procedure was then repeated 60 times to obtain 60 sets of permutated correlation coefficients. For each of the 60 permutations, the correlations of genes and proteins were ranked such that
p(i) denotes the ith largest correlation coefficient for pth permutation. Hence, the expected correlation coefficient,
E(i), was the average over the 60 permutations,
. A scatter plot of observed correlations (
(i)) versus the expected correlations is shown in Fig. 2D. For this study, we chose threshold
= 0.115 so that correlation would be considered significant if absolute value of difference between
(i) and
E(i) was greater than the threshold. Twenty-nine (including one with observed correlation coefficient -0.4672) of 165 pairs of gene and protein expression were called significant in such criteria, and the permuted data generated an average of 5.1 falsely significant pairs of gene and protein expression. This provided an estimated false discovery rate (the percentage of pairs of gene and protein expression identified by chance) for our data set.
|
| RESULTS |
|---|
|
|
|---|
|
|
|
In addition to differences in the relationship between mRNA levels and protein expression among separate isoforms, some genes with very comparable mRNA levels showed a 24-fold difference in their protein expression. Genes with comparable protein expression levels also showed up to a 28-fold variance in their mRNA levels.
Lack of Correlation for mRNA and Protein Expression when Using Average Tumor Values across All 165 Protein Spots (98 Genes)
The relationship between mRNA and protein expression was also examined by using the average expression values for all samples. To analyze this relationship using this approach, the average value for each protein or mRNA was generated using all 85 lung tissue samples. The range of normalized average protein values ranged from -0.0646 to 0.0979 (raw value 0.0036 to 4.1947), and the range for mRNA was from 0 to 15260.5 for all 165 individual protein spots. The Spearman correlation coefficient for the whole data set (165 protein spots/98 genes) was -0.025 (Fig. 3A). Even for the 28 protein spots (Fig. 2D) that were found to have a statistically significant correlation between their mRNA and protein, use of the average value resulted in a correlation coefficient value of -0.035, which was not significant (Fig. 3B).
|
Stage-related Changes in the Protein/mRNA Correlation Coefficients
To determine whether the 21 genes (28 protein spots) showing a significant correlation between the protein and mRNA expression among all samples demonstrate changes in this relationship during tumor progression, the correlations were examined separately for stage I (n = 57) and stage III (n = 19) lung adenocarcinomas (Table III). The number of non-neoplastic lung samples (n = 9) was insufficient for a separate correlation analysis of this group. Many of the protein spots represent one of several known protein isoforms for a given gene. The majority of genes (16/21) did not differ in the protein/mRNA correlation between stage I and stage III tumors indicating a similar regulatory relationship between the mRNA and protein spot. GRP-58, PSMC, SOD1, TPI1, and VIM, however, were found to demonstrate significant differences in the correlation coefficients between stage I and stage III lung adenocarcinomas. For GRP-58, PSMC, and VIM the change in the correlation coefficient was because of a relative increase in protein expression in stage III tumors. For SOD and TPI the change resulted from a relative decrease in expression of this specific protein in stage III tumors.
| DISCUSSION |
|---|
|
|
|---|
-haptoglobin demonstrated a strong negative correlation with its mRNA expression values. This may reflect negative feedback on the mRNA or the protein or the presence of other regulatory influences that are not understood currently. Post-translational modification or processing will result in individual protein products of the same gene migrating to different locations on 2D-PAGE gels (20). Because the identity of all possible isoforms for each protein examined has not been characterized completely, this may influence the correlation analyses performed in this study. This is partly because of limitations of the 2D-PAGE and mass spectrometry technologies (21, 22). Potential inconsistencies between mRNA and protein correlations that have been reported may also be because of differences, even in the same gene, in the mechanisms of protein translation among different cells or as measured in different laboratories (23).
In this study, we examined 165 protein spots identified in lung adenocarcinomas. Ninety-six protein spots, representing the products of 29 genes, contained at least two protein isoforms. Nineteen of 96 protein spots, representing 12 genes, were shown to have a statistically significant correlation between their protein and mRNA expression, suggesting that the levels of these proteins reflects the transcription of the corresponding genes. Differences in protein/mRNA correlations were found among the individual isoforms of a given protein. For example, of the four OP18 isoforms, three showed a statistically significant correlation between the protein and mRNA expression levels. The lack of relationship for the one isoform, however, indicates that individual protein isoforms of the same gene product can be regulated differentially. This is not unexpected and likely reflects other post-translational mechanisms that can influence isoform abundance in tissues and cancer.
In addition to the analyses of the correlation of mRNA/protein within the same tumor samples, we also tested the global relationship between mRNA and the corresponding protein abundance across all 165 protein spots in the lung samples. A protein and mRNA average value for each gene was generated using all 85 lung tissues samples. We observed a very wide range of normalized average protein and mRNA values. The correlation coefficient generated using this average value data set was -0.025, and even for the 28 protein spots that showed a statistically significant correlation between individual mRNA and proteins, the correlation value was only -0.035. This suggests that it is not possible to predict overall protein expression levels based on average mRNA abundance in lung cancer samples. This conclusion is also supported by previous results from Anderson and Seilhamer (24), who examined 19 genes in human liver cells, and by Gygi et al. (25), who examined 106 genes in yeast. Both studies found a lack of correlation between mRNA and protein expression when average or overall levels were used.
A good correlation was reported when the 11 most abundant proteins were examined in yeast (25), suggesting that the level of protein abundance may be a factor that may influence the correlation between mRNA and protein. In the present study, a fairly wide range of mean protein values among 165 protein spots in lung adenocarcinomas was observed, and the correlation coefficients also varied from -0.467 to 0.442. A comparison between the mean value of each protein and the correlation coefficient generated using all 85 tissue samples did not reveal a strong relationship between the overall protein abundance and the correlation coefficients (r = 0.039; p > 0.05). Detailed analysis of different subsets of protein abundance also failed to show a correlation between mRNA and protein expression. Thus in contrast to yeast, a relationship between mRNA/protein correlation coefficient and protein abundance in human lung adenocarcinomas was not observed.
The results of this study indicate that the level of protein abundance in lung adenocarcinomas is associated with the corresponding levels of mRNA in 17% (28 proteins) of the total 165 protein spots examined. This was substantially higher than the amount predicted to result by chance alone (which was 5.1) and suggests that a transcriptional mechanism likely underlies the abundance of these proteins in lung adenocarcinomas. We also demonstrate that the expression of individual isoforms of the same protein may or may not correlate with the mRNA, indicating that separate and likely post-translational mechanisms account for the regulation of isoform abundance. These mechanisms may also account for the differences in the correlation coefficients observed between stage I and stage III tumors, indicating that specific protein isoforms show regulatory changes during tumor progression. Further studies in lung adenocarcinomas will examine the relationship between the expression of individual protein isoforms and specific clinical-pathological features of these tumors, such as the presence of angiolymphatic invasion, and nodal or pleural surface involvement. The potential to identify specific protein isoforms associated with biological behavior in lung adenocarcinomas would be of considerable interest and will add to our understanding of the regulation of gene products by transcriptional, translational, and post-translational mechanisms.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published, MCP Papers in Press, March 12, 2001, DOI 10.1074/mcp.M200008-MCP200
1 The abbreviations used are: 2D, two-dimensional; MALDI-MS, matrix-assisted laser desorption/ionization mass spectrometry. ![]()
2 Chen et al., submitted for publication. ![]()
* This work was supported by NCI, National Institutes of Health Grant U19 CA-85953. ![]()

To whom correspondence should be addressed: General Thoracic Surgery, SRB II, B560, Box 0686, University of Michigan Medical School, Ann Arbor, MI 48109-086; E-mail: dgbeer{at}umich.edu
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
H. Lu, Y. Yang, E. M. Allister, N. Wijesekara, and M. B. Wheeler The Identification of Potential Factors Associated with the Development of Type 2 Diabetes: A Quantitative Proteomics Approach Mol. Cell. Proteomics, August 1, 2008; 7(8): 1434 - 1451. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Hou, Y. H. Wu, H. Grabsch, Y. Zhu, S. H. Leong, K. Ganesan, D. Cross, L. K. Tan, J. Tao, V. Gopalakrishnan, et al. Integrative Genomics Identifies RAB23 as an Invasion Mediator Gene in Diffuse-Type Gastric Cancer Cancer Res., June 15, 2008; 68(12): 4623 - 4630. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. Lin, A. G. Utleg, K. Gravdal, J. T. White, O. J. Halvorsen, W. Lu, L. D. True, R. Vessella, P. H. Lange, P. S. Nelson, et al. WDR19 Expression is Increased in Prostate Cancer Compared with Normal Cells, but Low-Intensity Expression in Cancers is Associated with Shorter Time to Biochemical Failures and Local Recurrence Clin. Cancer Res., March 1, 2008; 14(5): 1397 - 1406. [Abstract] [Full Text] [PDF] |
||||
![]() |
M. L. Slongo, B. Molena, A. M. Brunati, M. Frasson, M. Gardiman, M. Carli, G. Perilongo, A. Rosolen, and M. Onisto Functional VEGF and VEGF receptors are expressed in human medulloblastomas Neuro-oncol, October 1, 2007; 9(4): 384 - 392. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Wu, S.-I. Hwang, K. Rezaul, L. J. Lu, V. Mayya, M. Gerstein, J. K. Eng, D. H. Lundgren, and D. K. Han Global Survey of Human T Leukemic Cells by Integrating Proteomics and Transcriptomics Profiling Mol. Cell. Proteomics, August 1, 2007; 6(8): 1343 - 1353. [Abstract] [Full Text] [PDF] |
||||
![]() |
Q. Luo, L. Siconolfi-Baez, P. Annamaneni, M. T. Bielawski, P. M. Novikoff, and R. H. Angeletti Altered protein expression at early-stage rat hepatic neoplasia Am J Physiol Gastrointest Liver Physiol, May 1, 2007; 292(5): G1272 - G1282. [Abstract] [Full Text] [PDF] |
||||
![]() |
E. Hatjiharissi, H. Ngo, A. A. Leontovich, X. Leleu, M. Timm, M. Melhem, D. George, G. Lu, J. Ghobrial, Y. Alsayed, et al. Proteomic Analysis of Waldenstrom Macroglobulinemia Cancer Res., April 15, 2007; 67(8): 3777 - 3784. [Abstract] [Full Text] [PDF] |
||||
![]() |
G. Chen, X. Wang, J. Yu, S. Varambally, J. Yu, D. G. Thomas, M.-Y. Lin, P. Vishnu, Z. Wang, R. Wang, et al. Autoantibody Profiles Reveal Ubiquilin 1 as a Humoral Immune Response Target in Lung Adenocarcinoma Cancer Res., April 1, 2007; 67(7): 3461 - 3467. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. J. Lengi, R. A. Phillips, E. Karpuzoglu, and S. A. Ahmed Estrogen selectively regulates chemokines in murine splenocytes J. Leukoc. Biol., April 1, 2007; 81(4): 1065 - 1074. [Abstract] [Full Text] [PDF] |
||||
![]() |
U. T. Shankavaram, W. C. Reinhold, S. Nishizuka, S. Major, D. Morita, K. K. Chary, M. A. Reimers, U. Scherf, A. Kahn, D. Dolginow, et al. Transcript and protein expression profiles of the NCI-60 cancer cell panel: an integromic microarray study Mol. Cancer Ther., March 1, 2007; 6(3): 820 - 832. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. G. Helwig, T. I. Musch, R. A. Craig, and M. J. Kenney Increased interleukin-6 receptor expression in the paraventricular nucleus of rats with heart failure Am J Physiol Regulatory Integrative Comp Physiol, March 1, 2007; 292(3): R1165 - R1173. [Abstract] [Full Text] [PDF] |
||||
![]() |
I. M. Frey, I. Rubio-Aliaga, A. Siewert, D. Sailer, A. Drobyshev, J. Beckers, M. H. de Angelis, J. Aubert, A. B. Hen, O. Fiehn, et al. Profiling at mRNA, protein, and metabolite levels reveals alterations in renal amino acid handling and glutathione metabolism in kidney tissue of Pept2-/- mice Physiol Genomics, February 12, 2007; 28(3): 301 - 310. [Abstract] [Full Text] [PDF] |
||||
![]() |
C. C. Whiteford, S. Bilke, B. T. Greer, Q. Chen, T. A. Braunschweig, N. Cenacchi, J. S. Wei, M. A. Smith, P. Houghton, C. Morton, et al. Credentialing Preclinical Pediatric Xenograft Models Using Gene Expression and Tissue Microarray Analysis Cancer Res., January 1, 2007; 67(1): 32 - 40. [Abstract] [Full Text] [PDF] |
||||
![]() |
S. Zvonic, M. Lefevre, G. Kilroy, Z. E. Floyd, J. P. DeLany, I. Kheterpal, A. Gravois, R. Dow, A. White, X. Wu, et al. Secretome of Primary Cultures of Human Adipose-derived Stem Cells: Modulation of Serpins by Adipogenesis Mol. Cell. Proteomics, January 1, 2007; 6(1): 18 - 28. [Abstract] [Full Text] [PDF] |
||||
![]() |
K. M. Waters, J. G. Pounds, and B. D. Thrall Data merging for integrated microarray and proteomic analysis Brief Funct Genomic Proteomic, December 1, 2006; 5(4): 261 - 272. [Abstract] [Full Text] [PDF] |
||||
![]() |
T. Baas, C. R. Baskin, D. L. Diamond, A. Garcia-Sastre, H. Bielefeldt-Ohmann, T. M. Tumpey, M. J. Thomas, V. S. Carter, T. H. Teal, N. Van Hoeven, et al. Integrated Molecular Signature of Disease: Analysis of Influenza Virus-Infected Macaques through Functional Genomics and Proteomics J. Virol., November 1, 2006; 80(21): 10813 - 10828. [Abstract] [Full Text] [PDF] |
||||
![]() |
J. Cui, S.-Y. Han, C. Wang, W. Su, L. Harshyne, M. Holgado-Madruga, and A. J. Wong c-Jun NH2-Terminal Kinase 2{alpha}2 Promotes the Tumorigenicity of Human Glioblastoma Cells. Cancer Res., October 15, 2006; 66(20): 10024 - 10031. [Abstract] [Full Text] [PDF] |
||||
![]() |
L. Puricelli, E. Iori, R. Millioni, G. Arrigoni, P. James, M. Vedovato, and P. Tessari Proteome Analysis of Cultured Fibroblasts from Type 1 Diabetic Patients and Normal Subjects J. Clin. Endocrinol. Metab., September 1, 2006; 91(9): 3507 - 3514. [Abstract] [Full Text] [PDF] |
||||
![]() |
C.-W. Lee, R. Rivera, S. Gardell, A. E. Dubin, and J. Chun GPR92 as a New G12/13- and Gq-coupled Lysophosphatidic Acid Receptor That Increases cAMP, LPA5 J. Biol. Chem., August 18, 2006; 281(33): 23589 - 23597. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Provenzani, R. Fronza, F. Loreni, A. Pascale, M. Amadio, and A. Quattrone Global alterations in mRNA polysomal recruitment in a cell model of colorectal cancer progression to metastasis Carcinogenesis, July 1, 2006; 27(7): 1323 - 1333. [Abstract] [Full Text] [PDF] |
||||
![]() |
A. Jaleel, V. Nehra, X.-M. T. Persson, Y. Boirie, M. Bigelow, and K. S. Nair In vivo measurement of synthesis rate of multiple plasma proteins in humans Am J Physiol Endocrinol Metab, July 1, 2006; 291(1): E190 - E197. [Abstract] [Full Text] [PDF] |
||||
![]() |
B. S. Taylor, S. Varambally, and A. M. Chinnaiyan A Systems Approach to Model Metastatic Progression |