|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 5:1514-1519, 2006.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
Traditional methodologies for proteome wide identification of immunogenic microbial proteins (IMPs)1 involve screening microbial recombinant genomic expression libraries in plasmid/phage expression vectors and laboratory host strains with sera from colonized or infected hosts. However, colony immunoscreening and in vivo induced antigen technology (IVIAT) (1), a variation of colony immunoscreening that defines only partial immunoproteomes, and bacterial surface display coupled with magnetic cell sorting (2) are laborious and require several months or more for definitive IMP identification. Immunoproteomics of pathogens cultured in vitro under either standard laboratory conditions or those that attempt to mimic the host environment are also popular; however, pathogens cultured in vitro might not express the entire spectrum of virulence proteins. In view of the challenging task of accurately reproducing the host environment, such approaches might overlook those immunogenic virulence proteins that are expressed exclusively in response to host environmental cues and contribute significantly to pathogenicity (3). Although these limitations may be circumvented by immunoproteomics of pathogens isolated directly from either biological specimens or host anatomical sites of infection, consistent recovery of sufficient numbers of suitable organisms for analysis presents a significant challenge (4). Protein microarray/chip technology has tremendous potential for rapid, global definition of IMPs but is constrained by bottlenecks in proteome scale purification of microbial proteins and currently permits immunological characterization of only a partial proteome (5). Newer formats such as nucleic acid programmable protein arrays (6) are still experimental, and neither nucleic acid programmable protein arrays nor antibody arrays/protein chips utilizing SELDI-TOF mass spectrometry for antigen identification (7) have been demonstrated to rapidly define microbial immunoproteomes.
To rapidly identify proteins comprising microbial immunomes, we have developed a novel technique, proteomics-based expression library screening (PELS), that couples standard recombinant DNA and immunochemistry techniques with proteomics. The principle of PELS is outlined in Fig. 1 and involves capture of recombinant proteins expressed from an inducible, microbial genomic DNA expression library using polyclonal antibodies (pAbs) affinity-purified from acute/convalescent sera of infected hosts or sera from reservoirs colonized by the cognate pathogen ("bait" pAbs) coupled to a solid support. Proteins captured by the bait pAbs are subjected to one-dimensional (1D) SDS-PAGE ESI nano-LC-MS/MS (GeLC-MS/MS) and identified via SEQUEST database searching (8) (Fig. 1). The entire process, from recombinant genomic expression library construction to definitive protein identification, is accomplished in only 3 weeks without biases inherent to manual screening. To our knowledge, this is the first application of proteomics for rapid, global identification of IMPs from among proteins expressed from genes on inserts within recombinant clones comprising microbial genomic expression DNA libraries.
|
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
or BL21(DE3) using a Gene Pulser (Bio-Rad) as instructed by the manufacturer. Electroporation conditions were 2,500 V at 25-micro-faraday capacitance, producing time constants of 4.84.9 ms. Proteomic analysis, namely GeLC-MS/MS, was performed at the Harvard Partners Center for Genetics and Genomics, Cambridge, MA.
Construction of an Inducible E. coli O157:H7 (O157) Genomic DNA Expression Library Comprising Clones Containing DNA Inserts of Optimal Size ("Optimized" O157 Expression Library)
We generated an O157 genomic DNA expression library as described earlier (10) using genomic DNA isolated from the O157 strain 43894 (an isolate from a human patient with hemorrhagic colitis in the United States) in the pET-30abc series of expression vectors (Novagen, Madison, WI). Vector and insert DNA in the size range of 0.53.0 kbp were prepared as described previously (10). Because accuracy of protein identification using tandem MS/MS data and SEQUEST database searching increases with the number of peptides generated following trypsin digestion of the cognate protein (8), we sought to preferentially ligate insert DNA fragments that were larger than the average size of ORFs in this pathogen. The rationale was that larger DNA fragments were more likely to include genes encoding full-length or near full-length recombinant proteins containing potentially a larger number of recognition sites for cleavage by trypsin such that multiple peptide fragments resulting from trypsin cleavage of such proteins might facilitate robust protein identification. We accomplished this by performing multiple ligation reactions with each containing a different mole ratio of insert to vector. Each ligation reaction was then used to transform competent E. coli DH5
via electroporation according to standard protocols (9). To minimize overrepresentation of sister clones in the O157 expression library, transformants were directly plated onto LB plates supplemented with 50 µg/ml kanamycin (LB-Kan) without allowing for phenotypic expression and incubated overnight at 37 °C. To determine both the percentage of transformants containing inserts as well as the insert size, 100 colonies were randomly picked from each O157 expression library and analyzed by colony PCR using vector-specific primers. Greater than 90% of all transformants examined contained inserts, and >80% of clones of one O157 expression library included inserts that exceeded 1.7 kbp in size. Recombinant clones comprising this particular O157 expression library were scraped off LB-Kan plates, plasmid DNA was isolated using the QIAprep Spin miniprep kit (Qiagen Sciences) and used to transform electrocompetent E. coli BL21(DE3) (Novagen), the recommended expression host, and transformants were plated as described above. The resultant O157 expression library comprised >105 recombinant clones. Given the genome sizes of the sequenced O157 strains EDL933 (11) and Sakai (12) and that the average size of ORFs in these strains is 1 kbp (13), we concluded that this expression library, comprising recombinant clones containing inserts of optimal size (optimized O157 expression library), was adequate for defining components of the O157 immunoproteome (14).
Expression of the Optimized O157 Expression Library and Preparation of Cell Lysate and Pellet Fractions Containing Recombinant Proteins for Immunoaffinity Capture
To expedite sample preparation and circumvent laborious experimentation to define the inducer concentration and postinduction growth conditions (such as incubation temperature and duration of growth after induction) that facilitated recombinant O157 protein expression in a form that permitted optimal capture by pAbs affinity-purified from pooled hyperimmune cattle sera (bait pAbs) and coupled to HiTrap NHS-activated HP columns (see below), several batches of the optimized O157 expression library were cultured simultaneously at 37 °C to an initial A600 of 0.6. Cultures were then induced with 0.25, 0.5, 1, or 2.0 mM isopropyl ß-D-thiogalactopyranoside (IPTG) and incubated at 25, 30, or 37 °C for either 5 h or overnight. Cultures were pooled and centrifuged at 7,000 rpm for 10 min at 4 °C to harvest cell pellets, which were then washed three times with chilled, sterile PBS (pH 7.4). The "Complete" protease inhibitor mixture (Roche Diagnostics) was added to twice the concentration recommended by the manufacturer after which cells were lysed by three freeze-thaw cycles. Lysed cells were resuspended in 2 ml of chilled PBS and microcentrifuged at maximum speed for 10 min, and supernatant (lysate) fractions were decanted into fresh tubes. Pellet fractions were resuspended in 2 ml of PBS containing 2x Complete protease inhibitor mixture (Roche Diagnostics), and the nonionic detergent n-octyl ß-D-glucopyranoside (NOG; 0.2% final concentration) was added to solubilize membrane proteins. Both the lysate and pellet fractions containing recombinant O157 proteins were stored at 70 °C until used.
Coupling of Bait pAbs to HiTrap NHS-activated HP Columns
Prior to coupling, hyperimmune cattle sera against diverse O157 strains were generated and evaluated for reactivity against previously identified O157 antigens (supplemental material and Fig. 2, a and b). Polyclonal antibodies were affinity-purified from pooled hyperimmune cattle sera (bait pAbs) as detailed in the supplemental material. Affinity-purified bait pAb fractions were pooled together and dialyzed overnight against PBS (pH 7.4) at 4 °C and then for 4 h against coupling buffer consisting of 0.2 M NaHCO3, 0.5 M NaCl (pH 8.3) using a dialysis membrane with a molecular weight cutoff of 3500. Coupling of bait pAbs via amine groups was done as instructed by the manufacturer with minor modifications. Following removal of isopropanol, HiTrap NHS-activated columns were equilibrated with 10 column volumes of coupling buffer. Pooled bait IgG pAbs were slowly loaded on the column using a syringe and then recirculated through the column for 30 min at room temperature by attaching another syringe to the outlet. Active groups that did not couple to the ligand were quenched with 10 ml of 1 M tris(hydroxymethyl)aminomethane (pH 9.0), and nonspecifically bound pAbs were eluted with 10 ml of 1 M acetic acid. Columns with immobilized bait pAbs (charged columns) were then rinsed with 10 ml of deionized water to remove the quencher and the eluant.
|
0.751 ml/min). Following a rinse with 20 volumes of loading buffer to remove loosely bound proteins, specifically captured recombinant proteins were eluted with 10 ml of 1 M acetic acid directly into 15-ml Falcon tubes containing 500 µl of ammonium hydroxide (elution with 1 M acetic acid facilitates multiple reuses of the charged column without a significant loss in specific binding capacity). This process was repeated three times to maximize yield of captured recombinant O157 proteins. Nonspecific adsorption of recombinant O157 proteins to the column matrix was assessed by passing optimized O157 expression library lysate and pellet fractions of cells of the O157 expression library through "uncharged" columns (HiTrap NHS-activated HP columns not coupled to bait pAbs) with quenched active groups. Specificity of capture and lack of nonspecific adsorption was confirmed by visualizing recombinant O157 proteins by fractionation of elutions from charged and uncharged columns on SDS-PAGE gels and subsequent staining (Fig. 3, ad).
|
In-gel Digestion/Peptide Extraction
Gel sections from the previous step were placed in 2.0-ml tubes (Axygen, Union City, CA) and moved into a dual isolation biosafety cabinet. Gel pieces were destained with two washes of 50% methanol, 5% acetic acid and rinsed with three alternating washes of 100 mM ammonium bicarbonate and 100% acetonitrile to remove the destain solution. After removal of the acetonitrile following the final wash, gel slices were dried for 10 min in a SpeedVac. Tubes containing the dried gel sections were placed on ice, and 200 µl of Promega sequencing grade trypsin at a concentration of 6.6 µg/ml in 50 mM ammonium bicarbonate was added to each sample. The gel pieces were allowed to swell for 60 min on ice after which the tubes were capped and incubated at 37 °C for 20 h. Peptides were extracted with two washes of 500 µl of 50 mM ammonium bicarbonate and two washes of 500 µl of 50% acetonitrile, 0.1% formic acid. All extracts were frozen at 80 °C, lyophilized to dryness, and redissolved in 60 µl of 5% acetonitrile, 0.1% formic acid. Samples were then loaded onto a 96-well plate for MS analysis (see below).
Mass Spectrometry
Samples for MS were run on either an LCQ DECA XP (LCQ) plus Proteome X work station as described in an earlier report from our group (10) or on a linear ion trap-Fourier transform mass spectrometer (LTQ-FT) from Thermo Finnigan. For each run using the LCQ, 10 µl of each reconstituted sample was injected with a Famos Autosampler, and the separation was done on a 75-µm (inner diameter) x 20-cm column packed with C18 media running at a flow rate of 250 nl/min provided from a Surveyor MS pump with a flow splitter with a gradient of 572% water, 0.1% formic acid, and 5% acetonitrile over the course of 240 min (4.0-h run). For each run using the LTQ-FT, 10 µl of each reconstituted sample was injected with a Famos Autosampler, and the separation was done on a 75-µm (inner diameter) x 20-cm column packed with C18 media running at a 225 nl/min flow rate provided from a Surveyor MS pump with a flow splitter with a gradient of 560% water, 0.1% formic acid, acetonitrile 0.1% formic acid over the course of 120 min (150-min total run). Between each set of samples, standards from a mixture of peptides, 5 Angiotensin (Michrom Bioresources), were run for 2.5 h to ascertain column performance and observe any potential carryover that might have occurred. The LCQ was run in a top five configuration with one MS scan and five MS/MS scans. Dynamic exclusion was set to 1 with a limit of 30 s. The LTQ-FT was run in a top nine configuration with one MS 200,000 resolution full scan and nine MS/MS scans. Dynamic exclusion was set to 1 with a limit of 180 s with early expiration set to two full scans.
Peptide identifications were made using SEQUEST (Thermo Finnigan) through the Bioworks Browser 3.2. Sequential database searches were performed using the E. coli O157:H7 strain EDL933 FASTA database from the European Bioinformatics Institute (www.ebi.ac.uk/newt/display) using static carbamidomethyl-modified cysteines and differential oxidized methionines. A reverse E. coli O157:H7 strain EDL933 FASTA database was spiked in to provide noise and determine validity of peptide hits. In this fashion, the statistical relevance of all the data could be determined in a sample- and mass spectrometer-independent manner (15). LCQ data were searched with a 2-dalton window on the MS precursor with 0.8 dalton on the fragment ions, whereas the FT data were searched at 5 ppm for the precursor ion and 0.5 Da on the fragment ions. Peptide score cutoff values were chosen at cross-correlation values (Xcorr) of 1.8 for singly charged ions, 2.5 for doubly charged ions, and 3.0 for triply charged ions along with delta rank scoring preliminary cutoff (
Cn) values of 0.1, peptide probability values of 1.00E3 and cross-correlation-normalized values (RSp) of <10. The cross-correlation values chosen for each peptide assured a high confidence match for the different charge states, whereas the
Cn ensured the uniqueness of the peptide hit. The RSp value of 1 was used on single peptide hits to ensure that the peptide matched the top hit in the preliminary scoring. At these peptide filter values, very few single peptide reverse database hits were observed, allowing us to place a higher confidence on the few single peptide protein identifications. Single hit proteins were also manually validated to ensure relevance.
Cellular localization and putative functions of hypothetical proteins, identified by querying the E. coli O157:H7 strain EDL933 FASTA database at European Bioinformatics Institute with tandem MS data, were determined using bioinformatics as follows. (i) Cellular localization of such proteins was determined using the PSORTb Version 2.0/PSLpred at psort.org/ and PSORTdb at db.psort.org. (ii) Extracytoplasmic location was confirmed by examining the N terminus of amino acid sequences of cognate proteins from the O157 strain EDL933 database at www.tigr.org for the presence of a signal sequence using the program SignalP 3.0 at www.cbs.dtu.dk/services/SignalP. (iii) Putative functions were designated using the Conserved Domain Database (CDD) at www.ncbi.nlm.nih.gov/Structure/cdd/wrpsb.cgi.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
|
In conclusion, this report details a new application of proteomics and highlights the power of proteomics-based methodologies. A striking feature is the rapidity of proteome wide IMP identification, which renders PELS an ideal alternative/complement to emerging protein chip/array technologies. Other attractive features include broad applicability, robustness, and an elimination of subjective bias. Also because proteins are expressed from genes on inserts within clones of a genomic DNA expression library, a more comprehensive determination of the immunoproteome of the cognate pathogen is possible. Limitations relate to constraints on expression of heterologous proteins in E. coli hosts (1); however, improved prokaryotic cell-free in vitro translation systems coupled with the advent of technologies for facile generation of expression ORFeomes of microbes (21) should further enhance the power of this methodology.
| FOOTNOTES |
|---|
Published, MCP Papers in Press, May 31, 2006, DOI 10.1074/mcp.T600013-MCP200
1 The abbreviations used are: IMP, immunogenic microbial protein; PELS, proteomics-based expression library screening; IVIAT, in vivo induced antigen technology; pAb, polyclonal antibody; 1D, one-dimensional; GeLC-MS/MS, 1D SDS-PAGE ESI nano-LC-MS/MS; NHS, N-hydroxysuccinimide; IPTG, isopropyl ß-D-thiogalactopyranoside; NOG, n-octyl ß-D-glucopyranoside; HP, high performance. ![]()
* This work was supported in part by National Institutes of Health Grants R21 AI055963 (to I. T. K.) and R01 DK52081 (to P. I. T.). ![]()
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. ![]()
¶ Both authors contributed equally to this work. ![]()

Supported in part by the National Research Initiative of the United States Department of Agriculture Cooperative State Research, Education, and Extension Service Grants 99-35201-8539 and 04-04562 and National Institutes of Health Grants NO1-HD-0-3309, U54-AI-57141, P20-RR16454, and P20-RR15587. ![]()
|||| To whom correspondence should be addressed. Tel.: 617-724-7528; Fax: 617-726-7416; E-mail: mjohn1{at}partners.org
| REFERENCES |
|---|
|
|
|---|
This article has been cited by other articles:
![]() |
R. C. LaRocque, B. Krastins, J. B. Harris, L. M. Lebrun, K. C. Parker, M. Chase, E. T. Ryan, F. Qadri, D. Sarracino, and S. B. Calderwood Proteomic Analysis of Vibrio cholerae in Human Stool Infect. Immun., September 1, 2008; 76(9): 4145 - 4151. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Journal of Biological Chemistry |
| Journal of Lipid Research | ASBMB Today |