Originally published In Press as doi:10.1074/mcp.D400001-MCP200 on July 21, 2004.
Molecular & Cellular Proteomics 3:1039-1041, 2004.
© 2004 by The American Society for Biochemistry and Molecular Biology, Inc.


Dataset

A Dataset of Human Liver Proteins Identified by Protein Profiling Via Isotope-coded Affinity Tag (ICAT) and Tandem Mass Spectrometry*,S

Wei Yan‡,§, Hookeun Lee‡, Eric W. Deutsch‡, Catherine A. Lazaro, Weiliang Tang, Eric Chen||, Nelson Fausto, Michael G. Katze|| and Ruedi Aebersold‡

From the ‡ Institute for Systems Biology, Seattle, WA; Department of Pathology, University of Washington, Seattle, WA; || Department of Microbiology, University of Washington, Seattle, WA


    ABSTRACT
Proteins from human liver carcinoma Huh7 cells, representing transformed liver cells, and cultured primary human fetal hepatocytes (HFH) and human HH4 hepatocytes, representing nontransformed liver cells, were extracted and processed for proteome analysis. Proteins from stimulated cells (interferon-α treatment for the Huh7 and HFH cells and induction of hepatitis C virus [HCV] proteins for the HH4 cells) and corresponding control cells were labeled with light and heavy cleavable ICAT reagents, respectively. The labeled samples were combined, trypsinized, and subjected to cation-exchange and avidin-affinity chromatography. The resulting cysteine-containing peptides were analyzed by microcapillary LC-MS/MS. The MS/MS spectra were initially analyzed by searching the human International Protein Index database using the SEQUEST™ software (1). Subsequently, new statistical algorithms were applied to the collective SEQUEST search results of each experiment. First, the PeptideProphet™ software (2) was applied to discriminate true assignments of MS/MS spectra to peptide sequences from false assignments, to assign a probability value for each identified peptide, and to compute the sensitivity and error rate for the assignment of spectra to sequences in each experiment. Second, the ProteinProphet™ software (3) was used to infer the protein identifications and to compute probabilities that a protein had been correctly identified, based on the available peptide sequence evidence. The resulting protein lists were filtered by a ProteinProphet probability score p ≥ 0.5, which corresponded to an error rate of less than 5%. A total of 1,296, 1,430, and 1,476 proteins or related protein groups were identified in three subdatasets from the Huh7, HFH, and HH4 cells, respectively. In total, these subdatasets contained 2,486 unique protein identifications from human liver cells. An increase of the threshold to p ≥ 0.9 (corresponding to an error rate of less than 1%) resulted in 2,159 unique protein identifications (1,146, 1,235, and 1,318 for the Huh7, HFH, and HH4 cells, respectively).


This human liver proteomic dataset consists of three subdatasets generated from three protein profiling experiments using the following samples: human liver carcinoma cells (Huh7), primary cultures of human fetal hepatocytes (HFH)1 (4), and an immortalized cell line derived from human fetal hepatocytes (HH4).2

The Huh7 and HFH cells were selected to study the interferon response in transformed (Huh7) and nontransformed (HFH) human liver cells, respectively. About 2 × 10⁷ cells were either treated with interferon-α2b (400 IU/ml Intron-A; Schering-Plough Co., Kenilworth, NJ) or mock-treated for 16 h before harvest. The cells were lysed, and cell lysates were fractionated into cytosolic, membrane, and nuclear fractions by sequential differential centrifugation at 3,000 × g (nuclear fraction from the pellet) and 100,000 × g (cytosolic fraction from the supernatant and membrane fraction from the pellet). Proteins from each subcellular fraction were labeled with isotopically light (¹²C, for stimulated cells) or heavy (¹³C, for control cells) ICAT reagents following the manufacturer’s protocol (Applied Biosystems, Foster City, CA). Corresponding isotopically light- and heavy-labeled samples were then combined and digested with trypsin (Promega, Madison, WI). The resulting peptides were separated by strong cation exchange chromatography, as previously described (5), and affinity purified by avidin cartridges following the manufacturer’s protocol (Applied Biosystems), through which the cysteine (Cys)-containing peptides were enriched. The Cys-containing peptides from ~20 fractions purified above were then subjected to µLC-ESI-MS/MS using an LCQ-DECA-XP ion-trap mass spectrometer (ThermoFinnigan, San Jose, CA) as previously described (6, 7).

All observed MS/MS spectra were subsequently searched against the human International Protein Index (IPI) database (www.ebi.ac.uk/IPI/IPIhelp.html) (v2.28) using the SEQUEST™ software. Search parameters for the cleavable ICAT-labeled samples used in this study were the following: +227.13 Da for static modification on cysteine residues labeled with cleavable ICAT, +9 Da for ¹³C isotopic ICAT-labeled cysteine, +16 Da for oxidized methionine; mass tolerance ± 3 Da; restriction on Cys-containing peptides; and no proteolytic enzyme specified. Accuracy of the SEQUEST assignments of MS/MS spectra to peptide sequences was estimated by the PeptideProphet™ software based on a statistical model (2). For each identified peptide, a probability score was computed on a scale of 0 (for "incorrect") to 1 (for "correct") based on the match of the peptide sequence to the tandem mass spectra and the trypsin proteolytic pattern. These assigned peptides were then subjected to ProteinProphet™ analysis to assign a protein probability score for each identified protein or related protein group inferred from the peptide data (3). The protein probabilities, again on a scale of 0 to 1, discriminate correct (p = 1) from incorrect (p = 0) protein identifications. Validation of initial database search results on the basis of statistical modeling allows the presentation of large-scale proteomics datasets with known sensitivity for positive identifications and error rates for false positive identifications.
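The mass shifts listed in the search parameters can be turned into a small worked calculation. The sketch below is illustrative only: it uses the three shifts quoted above, assumes that the +9 Da shift is added on top of +227.13 Da for each heavy-labeled cysteine, and uses made-up peptide strings. The function names are hypothetical and are not part of SEQUEST.

```python
# Mass shifts quoted in the search parameters above (Da).
LIGHT_ICAT_SHIFT = 227.13   # cleavable ICAT tag on each cysteine
HEAVY_EXTRA_SHIFT = 9.0     # assumed additional shift for the 13C (heavy) tag
MET_OX_SHIFT = 16.0         # oxidized methionine


def modification_shift(peptide: str, heavy: bool, oxidized_met: int = 0) -> float:
    """Total mass added to an unmodified peptide by the listed modifications."""
    n_cys = peptide.count("C")
    if n_cys == 0:
        # The search was restricted to Cys-containing peptides.
        raise ValueError("peptide contains no cysteine")
    if oxidized_met > peptide.count("M"):
        raise ValueError("more oxidized methionines than methionines")
    per_cys = LIGHT_ICAT_SHIFT + (HEAVY_EXTRA_SHIFT if heavy else 0.0)
    return n_cys * per_cys + oxidized_met * MET_OX_SHIFT


def light_heavy_spacing(peptide: str) -> float:
    """Mass difference between the heavy and light forms of one peptide."""
    return modification_shift(peptide, heavy=True) - modification_shift(peptide, heavy=False)


if __name__ == "__main__":
    # Hypothetical peptide with two cysteines: the pair is spaced by 2 x 9 Da.
    print(round(light_heavy_spacing("ACDCK"), 2))
```

Under these assumptions the spacing of a light/heavy pair depends only on the number of cysteines in the peptide, which is what makes the pairs recognizable in the spectra.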

In the Huh7 cells, 23,310 peptides, with a PeptideProphet probability score p ≥ 0.05, were obtained and included for subsequent ProteinProphet analysis. The sequences of the assigned peptides, together with their IPI reference name, PeptideProphet probability, and calculated and measured mass, are presented as reference for future proteomics studies (Supplemental Table Ia). Using ProteinProphet software, 1,146 proteins or related protein groups were identified with an arbitrary probability cut-off of p ≥ 0.9 (Supplemental Table IIa). This value corresponded to 87.5% sensitivity (i.e. 87.5% of all possible identifications were made) and a false positive error rate of 0.7% (Supplemental Fig. 1A). This type of analysis also allows the investigator to compute the implications of changing the probability value on sensitivity and false positive error rate. For example, a reduction of the protein probability from 0.9 to 0.5 in this subdataset increased the number of protein identifications to 1,296, increased the sensitivity to 95.7%, and also increased the error rate to 3.8%.
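The trade-off between sensitivity and error rate at a given cut-off can be sketched as follows. This is a minimal illustration, assuming the probabilities are well calibrated, so that the expected number of correct identifications in any set is the sum of their probabilities. The text does not spell out the computation, so this should not be read as the ProteinProphet implementation, and the probability values are toy numbers rather than data from this study.

```python
from typing import Sequence, Tuple


def threshold_summary(probabilities: Sequence[float], threshold: float) -> Tuple[int, float, float]:
    """Return (number accepted, estimated sensitivity, estimated error rate).

    sensitivity = expected correct among accepted / expected correct overall
    error rate  = expected incorrect among accepted / number accepted
    """
    accepted = [p for p in probabilities if p >= threshold]
    expected_correct_total = sum(probabilities)
    expected_correct_accepted = sum(accepted)
    expected_incorrect_accepted = len(accepted) - expected_correct_accepted

    sensitivity = (
        expected_correct_accepted / expected_correct_total
        if expected_correct_total > 0 else 0.0
    )
    error_rate = expected_incorrect_accepted / len(accepted) if accepted else 0.0
    return len(accepted), sensitivity, error_rate


if __name__ == "__main__":
    toy = [1.0, 0.95, 0.9, 0.6, 0.5, 0.1]  # hypothetical protein probabilities
    for cutoff in (0.9, 0.5):
        n, sens, err = threshold_summary(toy, cutoff)
        print(f"p >= {cutoff}: {n} accepted, sensitivity {sens:.3f}, error rate {err:.3f}")
```

Lowering the cut-off admits more entries, so the estimated sensitivity rises and the estimated error rate rises with it, which is the pattern reported above for the move from 0.9 to 0.5.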

Similarly, a total of 31,641 peptides (p ≥ 0.05) were obtained from the analysis of the HFH cells (Supplemental Table Ib). These assigned peptides were used for subsequent ProteinProphet analysis and led to the identification of 1,235 proteins and related protein groups (p ≥ 0.9) (Supplemental Table IIb), which corresponded to 86.5% sensitivity and 0.8% error rate (Supplemental Fig. 1B). A reduction of the protein probability threshold to 0.5 resulted in 1,430 protein identifications with 96% sensitivity and 4.6% error rate.

The HH4 cells are immortalized human hepatocytes. Two HH4-based cell lines were constructed, based on an ecdysone-inducible expression system (8), to induce expression of the entire hepatitis C virus (HCV) ORF or green fluorescent protein, respectively.3 The ecdysone-regulated gene expression system consists of a modified ecdysone receptor (a heterodimer of VgEcR and RXR) that binds to its recognition sequence (5xE/GRE) and associates with transcription corepressors to repress the downstream promoter. Upon induction by the plant-derived ecdysone analog ponasterone A (ponA), ponA binds to the VgEcR to release the corepressors and recruit cotransactivators to activate transcription of the downstream target genes. We performed two ICAT labeling experiments to investigate HCV-mediated protein expression profiles in human hepatocytes. The first compared total cell extracts from HCV ORF-induced cells (ponA+, light-labeled) with noninduced cells (ponA-, heavy-labeled). The second experiment compared total cell extracts from cells with induced HCV proteins (light-labeled) with cells carrying induced green fluorescent protein (heavy-labeled). The labeled samples were subjected to the same analyses as described above. From HH4 cells we obtained a total of 28,029 peptide assignments with p ≥ 0.05 (Supplemental Table Ic), which contributed to identification of 1,318 (p ≥ 0.9) or 1,476 proteins and related protein groups (p ≥ 0.5) (Supplemental Table IIc and Supplemental Fig. 1C).

Taken together, proteomics analyses of the three human liver cell types, both transformed and nontransformed, led to a total of 2,159 (p ≥ 0.9) or 2,486 (p ≥ 0.5) unique protein identifications. Among them, 496 (p ≥ 0.9) or 540 (p ≥ 0.5) were found in all three liver cell types, while 337/397, 364/457, and 414/456 proteins and related protein groups were uniquely observed in the Huh7, HFH, and HH4 cells, respectively (p ≥ 0.9/p ≥ 0.5). A comparison of the three proteomics subdatasets from human liver cells (Supplemental Table IIIa for p ≥ 0.9 and IIIb for p ≥ 0.5) is also shown as a Venn diagram (Fig. 1), drawn using the on-line Create-A-Venn system at www.venndiagram.com. This human liver proteomics dataset, with more than 2,000 protein identifications presented in a statistically validated and transparent way, illustrates a possible mechanism for publishing large-scale protein identification datasets in the literature and for comparing data from different experiments.
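The three-way comparison behind these counts amounts to set arithmetic on the identifiers of each subdataset. The sketch below shows one way to compute the seven Venn regions. The identifiers are placeholders rather than real IPI entries, and the function name is hypothetical.

```python
from typing import Dict, Iterable, Set


def venn_regions(huh7: Iterable[str], hfh: Iterable[str], hh4: Iterable[str]) -> Dict[str, Set[str]]:
    """Split three identifier lists into the seven disjoint regions of a Venn diagram."""
    a, b, c = set(huh7), set(hfh), set(hh4)
    return {
        "all_three": a & b & c,
        "huh7_only": a - b - c,
        "hfh_only": b - a - c,
        "hh4_only": c - a - b,
        "huh7_hfh_only": (a & b) - c,
        "huh7_hh4_only": (a & c) - b,
        "hfh_hh4_only": (b & c) - a,
    }


if __name__ == "__main__":
    # Placeholder identifiers, not taken from the supplemental tables.
    huh7 = ["ID1", "ID2", "ID3", "ID4"]
    hfh = ["ID1", "ID2", "ID5"]
    hh4 = ["ID1", "ID3", "ID6", "ID7"]

    regions = venn_regions(huh7, hfh, hh4)
    for name, members in regions.items():
        print(name, len(members))

    # The seven regions are disjoint, so their sizes sum to the number of unique entries.
    total_unique = len(set(huh7) | set(hfh) | set(hh4))
    assert sum(len(m) for m in regions.values()) == total_unique
```

If the reported counts partition in the same way, the p ≥ 0.9 figures imply that 2,159 − 496 − (337 + 364 + 414) = 548 entries were shared by exactly two of the three cell types.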



FIG. 1. Comparison of the protein identifications of the three subdatasets from the Huh7, HFH, and HH4 cells. Identified protein or related protein groups from the three proteomics subdatasets (Supplemental Table IIa for Huh7, IIb for HFH, and IIc for HH4) were compared based on the IPI identification of each entry. The results were displayed by Venn Diagram (www.venndiagram.com) at the protein probability threshold of 0.9 and 0.5, respectively.

 


    FOOTNOTES
 
Received, June 14, 2004

1 The abbreviations used are: HFH, human fetal hepatocytes; Cys, cysteine; MS/MS, tandem mass spectrometry; IPI, International Protein Index; HCV, hepatitis C virus; ponA, ponasterone A.

2 W. Tang and N. Fausto, personal communication.

3 W. Tang and N. Fausto, unpublished data.

* This work was supported in part by grants from the National Heart, Lung, and Blood Institute Proteomics Center at the Institute for Systems Biology (N01-HV-28179) and the National Institute on Drug Abuse (1P30DA01562501).

S The on-line version of this manuscript (available at http://www.mcponline.org) contains supplemental material.



§ To whom correspondence should be addressed: Institute for Systems Biology, 1441 N. 34th St., Seattle, WA 98103. Tel.: 206-732-1305; Fax: 206-732-1299; E-mail: wyan@systemsbiology.org


    REFERENCES

  1. Eng, J., McCormack, A., and Yates, J. (1994) An approach to correlate tandem mass spectral data of peptides with amino acid sequences in a protein database. J. Am. Soc. Mass Spectrom. 5, 976

  2. Keller, A., Nesvizhskii, A. I., Kolker, E., and Aebersold, R. (2002) Empirical statistical model to estimate the accuracy of peptide identifications made by MS/MS and database search. Anal. Chem. 74, 5383–5392

  3. Nesvizhskii, A. I., Keller, A., Kolker, E., and Aebersold, R. (2003) A statistical model for identifying proteins by tandem mass spectrometry. Anal. Chem. 75, 4646–4658

  4. Lazaro, C. A., Croager, E. J., Mitchell, C., Campbell, J. S., Yu, C., Foraker, J., Rhim, J. A., Yeoh, G. C., and Fausto, N. (2003) Establishment, characterization, and long-term maintenance of cultures of human fetal hepatocytes. Hepatology 38, 1095–1106

  5. Han, D. K., Eng, J., Zhou, H., and Aebersold, R. (2001) Quantitative profiling of differentiation-induced microsomal proteins using isotope-coded affinity tags and mass spectrometry. Nat. Biotechnol. 19, 946–951

  6. Lee, H., Yi, E. C., Wen, B., Reily, T. P., Pohl, L., Nelson, S., Aebersold, R., and Goodlett, D. R. (2004) Optimization of reversed-phase microcapillary liquid chromatography for quantitative proteomics. J. Chromatogr. B. Analyt. Technol. Biomed. Life Sci. 803, 101–110

  7. Von Haller, P. D., Yi, E., Donohoe, S., Vaughn, K., Keller, A., Nesvizhskii, A. I., Eng, J., Li, X. J., Goodlett, D. R., Aebersold, R., and Watts, J. D. (2003) The application of new software tools to quantitative protein profiling via isotope-coded affinity tag (ICAT) and tandem mass spectrometry: II. Evaluation of tandem mass spectrometry methodologies for large-scale protein analysis, and the application of statistical tools for data analysis and interpretation. Mol. Cell. Proteomics 2, 428–442

  8. No, D., Yao, T. P., and Evans, R. M. (1996) Ecdysone-inducible gene expression in mammalian cells and transgenic mice. Proc. Natl. Acad. Sci. U S A. 93, 3346–3351




