|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||









,||
From the
Fundamental Science Division, ¶ Environmental Molecular Science Laboratory, Pacific Northwest National Laboratory, Richland, Washington 99352 and
Department of Molecular Microbiology and Immunology, Oregon Health and Sciences University, Portland, Oregon 97239
| ABSTRACT |
|---|
|
|
|---|
Because many peptides are known to have important roles during host-pathogen interactions (e.g. antimicrobial peptides, pro-inflammatory peptides, and bacterial peptide toxins), we expanded our proteomics investigation of Salmonella pathogenesis (6) to include a peptidomic component. The Gram-negative bacterium Salmonella enterica serovar Typhimurium (S. typhimurium) is the major causative agent of salmonellosis (food poisoning) in
7,000 cases per year in the United States (7). The closely related strain S. enterica serovar Typhi causes typhoid fever, which results in
600,000 deaths per year worldwide (8). In response to the emergence of antibiotic-resistant isolates (9) and the threat of bioterrorism (e.g. the contamination of the food supply by bioterrorists in Oregon in 1984 (10)), we are applying high throughput techniques to identify the proteins required for Salmonella pathogenesis. Key to this effort is the premise that expression levels of bacterial genes required for virulence are disproportionately increased during infection. As the survival of phagocytosed Salmonella is an important element of its pathophysiology, we focused our investigation on this stage of infection. Validation of putative virulence factors will require orthogonal experimentation (e.g. pathogenicity assays of Salmonella knockouts).
Despite the fact that peptides are generally incapable of catalysis, binding, transport, or other protein functions, they have enormous functional diversity (1). Unfortunately, their physiological roles are often difficult to identify, and most techniques designed to do so are low throughput. This is further exacerbated by the fact that the majority of peptides are bioinactive byproducts of protein turnover. However, if the above premise is correct, then the putative identification of bioactive peptides may be possible via parallel comparative proteomics and peptidomics investigations. In this article we describe the use of high throughput LC-MS and LC-MS/MS to identify and quantify thousands of peptides generated by S. typhimurium grown in host-free culture conditions designed to mimic the environment of the phagosome. As such, it constitutes the first global description of a bacterial peptidome. Possible physiological roles for subsets of the Salmonella peptidome are discussed.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
18 m
xcm, Barnstead International, Dubuque, IA). Reagents were obtained from Sigma unless otherwise specified and included acetic acid, acetone, acetonitrile, ammonium bicarbonate, isopropanol, methanol, and trifluoroacetic acid.
Cell Culture
Wild type S. enterica serovar typhimurium strains 14028 and LT2 were grown to logarithmic (Log)1 phase, stationary (Stat) phase, magnesium minimal medium- (MgM-) shocked stationary phase, and linear in N-Salts medium phase and then harvested following standard batch culture techniques (11). N-Salts medium consisted of 100 mm Bis-Tris pH 5, 0.5 mm KCl, 7.5 mm (NH4)2SO4, 0.5 mm K2SO4, 1 mm KH2PO4, 0.2% (v/v) glycerol, 0.1% (w/v) casamino acids, 8 µm MgCl2, and was similar to MgM pH 5 (12) but with less potassium (the concentration and role of potassium in the phagosome is the area of active investigation (13, 14)). Cultures were initiated by inoculating 5 ml of either LB (Luria-Bertani) broth (Log, Stat, and MgM samples) or N-Salts medium (N-Salts samples) with cells from a single colony and then grown for 16 h at 37 °C. A 2-liter flask containing 300 ml of either LB broth (Log, Stat, and MgM samples) or N-Salts medium (N-Salts samples) was inoculated with 3 ml of the starter culture and shaken at 200 rpm at 37 °C. The Log, Stat, and N-Salts cultures were harvested at room temperature at an OD600 nm of 0.6, 2.0, and 0.2, respectively. The MgM samples were grown to stationary phase as described above, rinsed twice with MgM pH 5, and then incubated for 4 h in MgM pH 5 at 37 °C while shaking at 200 rpm. Harvested cell cultures were centrifuged at 5000 x g, cell pellets were washed twice with Dulbecco's phosphate buffered saline (Mediatech, Inc., Herndon, VA), and cell pellets with an approximate wet weight of 0.1 g/tube were stored at 80 °C.
Peptide Extraction and LC-MS/MS
Bacterial cells were lysed using 0.1-mm zirconia/silica beads (BioSpec Products, Inc., Bartlesville, OK) and cleared by centrifugation as described in Ref. 6. The S. typhimurium strain 14028 cells were lysed in the presence of a protease inhibitor mixture formulated for use with bacterial cell extracts (Sigma). Also, half of each S. typhimurium strain 14028 cell lysate was incubated at 22 °C for 20 min to mimic poor sample handling, whereas the other half was in a cooling block at
7 °C.
Isopropanol was then added to the cleared lysates to 1:1, 3:2, 2:1, or 5:2 (v/v) proportions (respectively) as indicated. The samples were then microcentrifuged at 16,000 x g at 4 °C for 20 min, and the resulting supernatants were concentrated in a SpeedVac (Thermo Electron Corp., Waltham, MA). Peptide concentrations were determined by BCA assays (Pierce), and SDS-PAGE analyses were performed using 1020% Tris-Tricine Ready Gels® (Bio-Rad). Gels were fixed for 30 min in 40% methanol, 10% acetic acid and then stained for 60 min using GelCode® Blue reagent (Pierce).
Prior to mass spectrometric analysis, the concentrated isopropanol extracts were either solid phase extracted using C-18 pipette tips or were fractionated by strong cation exchange (SCX) HPLC. Each C-18 solid phase extraction was performed by applying 30 µg (peptide mass) of concentrated isopropanol extract to an OMIX® C-18 100-µl pipette tip (Varian, Palo Alto, CA) and by following the manufacturer's instructions. The C-18 SPE extracts were concentrated in a SpeedVac (Thermo Electron Corp.) and then analyzed by mass spectrometry. Each SCX fractionation was performed as described previously (6), each was performed using
150 µg (peptide mass) of concentrated isopropanol extract, and each resulted in 25 fractions. SCX fractionation was performed on four of the concentrated isopropanol extracts (the 14028 strain Log, Stat, MgM, and N-Salts samples that had been kept in the cooling block at
7 °C), and the resulting 100 fractions were concentrated in a SpeedVac. Note that the acidic conditions under which the C-18 SPE and SCX HPLC took place probably resulted in the precipitation and removal of some isopropanol-soluble proteins. The concentrated C-18 SPE extracts and SCX fractions were then analyzed by reversed-phase microcapillary HPLC-nanoelectrospray ionization-LTQ ion trap tandem mass spectrometry as described previously (6). Six of the samples were analyzed by LC-MS/MS in duplicate, and a description of all 126 LC-MS/MS analyses is included as Supplemental Table I. Each LC-MS/MS gradient was 100 min long, and adding to that the time that was consumed performing column regeneration, system maintenance, and LC-MS/MS analyses of quality control standards the total time that the LTQ mass spectrometer was dedicated to these samples was
300 h.
LC-MS/MS Data Analyses
Peptides were identified by using SEQUEST® (TurboSEQUEST® (cluster) v.27 (revision 12), Thermo Electron Corp.) to search the resulting MS/MS spectra against the annotated S. typhimurium FASTA data file of proteins translated from genetic code provided by The Institute for Genomic Research (4,550 protein sequences, September 19, 2004, Stanford University, www.tigr.org/) (15). These analyses used a standard parameter file with peptide_mass_tolerance = 3, fragment_ion_tolerance = 0, and no amino acid modifications. Also, these analyses searched for all possible peptide termini (i.e. not limited to only tryptic termini). Separate SEQUEST searches that used the above FASTA data file but with scrambled amino acid sequences were performed in parallel.
SEQUEST generally returns multiple peptide identifications for each MS/MS spectrum and for each parent ion charge state. Therefore, for each MS/MS spectrum and for each parent ion charge state, only the peptide identification with the highest XCorr value (i.e. the "top ranked hit") was retained. Note that if a MS/MS spectrum returned both normal and scrambled peptide identifications, both top ranked hits were retained.
The estimated percentage of false positive peptide identifications was defined as %FPest. = 100% x (number of scrambled peptide identifications)/(number of normal peptide identifications) (16). %FPest. was calculated for each charge state, XCorr_Cutoff value (i.e. the minimum XCorr value requirement, which ranged from 1.5 to 5 in units of 0.02), and
Cn_Cutoff value (i.e. the minimum
Cn value requirement, which ranged from 0 to 0.4 in units of 0.005). For each parent ion charge state, the optimal pair of cutoff values (XCorr_Cutoff and
Cn_Cutoff) was identified by using a two step process. First, the pair of cutoff values was required to corresponded to %FPest.
5%. Second, of the remaining pairs of cutoff values, the pair that was satisfied by the most normal (i.e. not scrambled) peptide identifications was identified as optimal.
The optimal XCorr_Cutoff and
Cn_Cutoff values at each parent ion charge state (+1 through +5) was determined to be 1.84 and 0.21 (+1), 2.1 and 0.21 (+2), 2.8 and 0.23 (+3), 3.56 and 0.265 (+4), and 4.16 and 0.22 (+5), respectively. Peptide identifications that failed to pass these cutoff values were discarded, as were those that corresponded to multiple S. typhimurium protein sequences. The XCorr_Factor was calculated for each peptide identification (XCorr_Factor = XCorr/XCorr_Cutoff) by using the above optimal XCorr_Cutoff values, and in the rare instance that a single MS/MS spectrum generated multiple filter-passing peptide identifications (each corresponding to a different parent ion charge state), only the peptide identification with the highest XCorr_Factor was retained. The estimated percentage of false positive protein identifications was defined as %FPrest. = 100% x (number of scrambled protein identifications)/(number of normal protein identifications). If
10 filter-passing peptide identifications were required for each protein identification, then %FPrest. = 1.99%. For each protein, spectrum counting (i.e. tallying of filter-passing peptide identifications) was used as a rough measure of the abundance of its fragments within the peptidome (17).
To better estimate the percentage of false positive peptide identifications, an alternative method was also used (18). All of the MS/MS spectra were reanalyzed with SEQUEST using a concatenated normal + scrambled S. typhimurium FASTA data file of proteins (i.e. the scrambled protein sequences were appended to the end of the normal FASTA data file, therefore resulting in 9,100 protein sequences total). This estimated percentage of false positive peptide identifications was defined as %FPest.2 = 100% x 2 x (number of scrambled peptide identifications)/(total number of peptide identifications). Using the above optimal XCorr_Cutoff and
Cn_Cutoff value pairs, %FPest.2 was found to be equal to 3.48%, and 88.1% of the filter-passing peptide identifications that resulted from the search of the normal FASTA file were also filter-passing identifications that resulted from the search of the concatenated FASTA file.
To test for the remote possibility of contamination of the samples, MS/MS spectra that resulted from the analyses of the eight 14028 S. typhimurium C-18 SPE samples were searched using SEQUEST against a non-redundant, multiorganism database from the Protein Information Resource (283,416 protein sequences, PIR-PSD release 78.00, Georgetown University, pir.georgetown.edu/) (19). The resulting data were filtered using the above optimal XCorr_Cutoff and
Cn_Cutoff value pairs, and although 207 new peptides were identified, they seemingly randomly corresponded to 127 apparently unrelated proteins.
LC-MS
The eight S. typhimurium (14028 strain) concentrated C-18 SPE extracts were analyzed by reversed-phase microcapillary HPLC-nanoelectrospray ionization-FTICR-MS (11.5 tesla) (20). These analyses all used the same chromatographic column and were run in the order given in Table II. The resulting MS spectra were analyzed using the accurate mass and elution time (AMT) tag approach (5). Briefly, the calculated mass and the observed normalized elution time (NET) of each filter-passing peptide identification from the LC-MS/MS analyses were used to construct a reference database of AMT tags (note that none of the peptide identifications that had a +4 or +5 parent ion charge state were included in the AMT tag database). Features from LC-MS analyses (i.e. m/z peaks deconvolved of isotopic and charge state effects and then correlated by mass and NET) were matched to AMT tags to identify peptides in a manner that resulted in roughly 5% estimated false positive identifications. For each protein, the sum of its peptide peak areas (NET versus peak height) was used as a measure of the abundance of its fragments within the peptidome.
|
Data Visualization and Cluster Analysis
Maps of the estimated percentage of false positive peptide identifications and of the number of peptide identifications versus XCorr_Cutoff and
Cn_Cutoff were visualized using OmniViz® 3.8 (OmniViz Inc., Maynard, MA) (21) (Supplemental Fig. 1). Protein abundance data (derived by spectrum counting or by summing FTICR peak areas) were subjected to hierarchical cluster analyses using OmniViz with correlations determined by magnitude and shape (Euclidean distance).
| RESULTS |
|---|
|
|
|---|
Ultrafiltration of cleared cell lysates was initially used in an attempt to isolate S. typhimurium peptides. However, the filtration membranes clogged and the number of species that managed to traverse the membrane was negligible when analyzed by LC-MS/MS (data not shown). Consequently, this approach was abandoned, and the use of organic solvents to effect protein precipitation was investigated.
The addition of water-miscible organic solvents to an aqueous solution causes a reduction in its dielectric constant, an increased attraction between charged molecules, and protein aggregation (for a detailed review, see section 4.4 of Ref. 25). In general, high molecular mass proteins (
100 kDa) precipitate at low concentrations of organic solvents (
30% v/v), whereas low molecular mass proteins (
10 kDa) require higher concentrations of organic solvents (
65% v/v). Peptides are generally soluble even at high concentrations of organic solvents. Protein depletion by use of organic solvents is additionally attractive because the addition of organic solvents causes protein denaturation and therefore negligible protein-peptide co-precipitation. However, some peptide precipitation is likely at very high organic solvent concentrations. Therefore, four different organic solvents were tested, and isopropanol was found to perform the best. Afterward, to identify an ideal isopropanol concentration, the results of extractions at four different concentrations were compared. These experiments are described below.
One of four organic solvents (acetone, acetonitrile, isopropanol, or methanol) was added to cleared bacterial cell lysates to a 2:1 (v/v) proportion (respectively). In this experiment, instead of S. typhimurium cells, Shewanella oneidensis cells were used, and this experiment was performed in triplicate (i.e. there were 12 samples total). The extracts were then analyzed by LC-MS/MS, and it was found that isopropanol extraction resulted in the most filter-passing peptide identifications (data not shown). Acetonitrile performed almost as well, but acetone and methanol performed poorly. It was also observed that the protein pellets resulting from isopropanol precipitation were much more compact than the acetonitrile pellets. Consequently, isopropanol was used to extract S. typhimurium peptides from cleared cell lysates. Log, Stat, and MgM phase, LT2 strain S. typhimurium cells were lysed by bead beating and cleared by centrifugation, and four concentrations of isopropanol were added. Cleared cell lysates and isopropanol extracts were then analyzed by SDS-PAGE (Fig. 1). Although peptides <6.5 kDa were too small to be observed by SDS-PAGE, Fig. 1 shows that increasing concentrations of isopropanol progressively precipitated the S. typhimurium proteins, as expected.
|
Cn_Cutoff value (Supplemental Fig. 1). In this way, %FPest. was constrained to
5%.
Highly observed proteins were clustered, and a disproportionate number of protein fragments were observed in the MgM samples (Fig. 2). Because it was possible that native proteases generated these peptides after cell lysis, the experiment was repeated but this time with a bacterial protease inhibitor mixture solution added to the lysis buffer. This time the more virulent and pathophysiologically relevant 14028 strain (compared with the LT2 strain) was used. Also, half of each lysate was incubated at 22 °C for 20 min to mimic poor sample handling, whereas the other half was in a cooling block at
7 °C. All of these samples were extracted with isopropanol (66.7% v/v, final concentration), C-18 solid phase extracted, and analyzed by LC-MS/MS.
|
30 amino acids), highly charged (+4 or +5) parent ions might be identifiable. However, because of the small number of such species detected in our early experiments (Table I), the MS/MS spectra of subsequent experiments were only searched for parent ions with
+3 charge states. Overall, there were 45,965 filter-passing peptide identifications of 14,388 peptides from 2,003 proteins.
|
|
Of note however was the over-representation of carboxy-terminal arginine and lysine residues in the peptide identification data. Peptides with carboxy-terminal arginine and lysine residues accounted for 24.64 and 27.81% of the peptide identifications, respectively. This resulted in a 6.38- and 4.00-fold over-representation (respectively) compared with what would be expected from a random distribution of the observed amino acids (Supplemental Table XI). It is known that peptides that contain arginine and/or lysine residues are more effectively ionized during nanoelectrospray ionization (reviewed in detail in Chapter 3 of Ref. 26). In addition, peptides having carboxy-terminal arginine or lysine residues are detected more efficiently after collisionally induced dissociation (reviewed in detail in Chapter 4 of Ref. 26).
Ten proteins (STM 0359, 0462, 1121, 1244, 1438, 1728, 1810, 1924, 3362, and 3419) observed in the peptidomics data (i.e. had
4 unique (i.e. differing sequences), filter-passing peptide identifications) were not detected in the corresponding bottom-up proteomics data (i.e. had
1 filter-passing peptide identification (total)) from our previous investigation (6). To test whether isopropanol extraction enriched a set of proteins that was not detected during our bottom-up proteomics investigation (e.g. low molecular mass proteins), the 12 S. typhimurium LT2 concentrated isopropanol extracts were digested with trypsin, analyzed by LC-MS/MS, and searched using SEQUEST. Two additional proteins (STM 3363 and 3450) were found (i.e. had
4 unique, filter-passing peptide identifications and were not in the corresponding bottom-up proteomics data). These 12 proteins do not seem to be related to each other in any way, and it was concluded that isopropanol extraction did not enrich a substantial set of proteins that had been missed by our traditional proteomics investigation.
To identify non-annotated S. typhimurium proteins, MS/MS spectra were searched using SEQUEST against a protein FASTA file constructed from translations of S. typhimurium LT2 DNA sequences flanked by stop codons (i.e. stop-to-stop translations). These searches included MS/MS spectra from both the peptidomics experiments described above and the bottom-up proteomics experiments performed previously in our laboratory (6). Care was taken to discard any non-annotated peptide identification that had resulted from the analysis of a MS/MS spectrum that had already been associated with an annotated peptide identification from the prior SEQUEST searches. In addition, care was taken to discard any identification of a non-annotated peptide which could have derived from a "S. enterica" or "S. typhimurium" protein annotated in the UniProt database (7,714 protein sequences, August 9, 2006, us.expasy.org/) (27). Ultimately, nine non-annotated proteins were identified (requiring
3 unique (i.e. differing sequences) and
15 total filter-passing peptide identifications). However, the percentage of these protein identifications that are false positives has yet to be estimated, and none of these identifications have yet been validated using orthogonal experimentation (e.g. Western blotting). Therefore, these results are preliminary. It was noted that seven of the nine non-annotated proteins had a high degree of homology with proteins identified from Basic Local Alignment Search Tool (BLAST) searches against the Entrez protein sequence database provided by the National Center for Biotechnology Information (www.ncbi.nlm.nih.gov/). In addition, seven of the nine non-annotated genes partially overlapped other, annotated genes, and it is possible that these non-annotated genes had been identified as pseudogenes for this reason. Studies of non- and misannotated S. typhimurium proteins are ongoing in our laboratory.
Identification and Quantification of Peptides Using LC-MS
The eight S. typhimurium (14028 strain) concentrated C-18 extracts were analyzed by LC-FTICR-MS. Equalization of the sample masses took place following isopropanol extraction and prior to C-18 SPE (Table II). Equalization was designed to take into account any sample preparation biases, and no subsequent sample or data normalization was performed. Note that the peptidome masses recovered after the C-18 cleanups were similar.
LC-MS features (i.e. m/z peaks deconvolved of isotopic and charge state effects and then correlated by mass and NET) were matched to AMT tags (i.e. filter-passing LC-MS/MS peptide identifications) in a process that resulted in an average mass error of 1.09 ppm and an average NET error of 0.74%. Using the AMT tag approach, the eight LC-MS analyses resulted in 15,025 peptide identifications of 5,163 peptides from 682 proteins. As described above, 14,388 peptides from 2,003 proteins were identified by LC-MS/MS. Most likely, compared with the high resolution LC-MS analyses of the unfractionated samples, a greater depth of coverage resulted from the LC-MS/MS analyses of the SCX fractions. The total number of LC-MS features and the percentage of matching features increased in rough proportion to the stress that the cells underwent (Table II), which correlated well with the spectrum counting results (Fig. 2).
Highly abundant proteins were clustered (Fig. 4, left), and the results were very similar to the spectrum counting results (Fig. 2). The peptidomics data were compared with proteomics data (Fig. 4, right) from a prior study performed in our laboratory (6). Although individual Z-scores of
1.5 are generally not statistically significant by themselves, the overall trend of the data suggests that ribosomal proteins were most abundant during logarithmic phase growth in rich medium, as expected (28). In contrast, a number of known stress response factors were found to be most abundant during suboptimal culture conditions. The two-component regulator phoP (29, 30), the histone-like protein hupB (31), and the universal stress protein ybdQ (32) were most abundant in the Stat samples, and the chaperone dnaK (30, 33, 34) and the histone-like protein hns (31) were most abundant in the N-Salts samples.
|
|
| DISCUSSION |
|---|
|
|
|---|
A number of S. typhimurium proteins were found to be degraded upon exposure of cells to phagosome-like culture conditions. Because the MgM and N-Salts media contain a low concentration of nutrients (including 10% of the concentration of amino acids of LB broth), it is possible that the cells were primarily responding to starvation. During the "stringent response," bacteria down-regulate rRNA and ribosomal protein expression (28), dimerize and inactivate ribosomes (37), and degrade rRNA (35, 36) (which liberates complexed ribosomal proteins (38)). In addition, bacteria produce polyphosphate (PolyP), which results in the degradation of uncomplexed ribosomal proteins by the PolyP-activated Lon protease (3941). In this study, the stationary phase samples had relatively little ribosomal protein fragmentation even though these cells were likely experiencing significant carbon starvation. This suggests that low magnesium and/or low pH are also required for ribosomal protein proteolysis to be up-regulated. Magnesium is known to be required for ribosomal stability (42), and it is also a known inhibitor of the rRNA RNase rna (36, 38).
Most ribosomal proteins are small, alkaline, and function solely by stabilizing rRNA folding by binding to it (and likely to PolyP) with non-globular, positively charged extensions (43, 44). These extensions are probably structurally disordered when unbound to polynucleotides because they are disproportionately detected by proteome-wide predictions of regions of intrinsic structural disorder (45, 46). Lon is known to target defective proteins (4751), probably because its unfoldase can process proteins that contain intrinsically disordered regions faster than proteins without such regions. This suggests a mechanism by which Lon targets ribosomal proteins. First, during starvation Salmonella rRNA is quickly degraded (52), releasing ribosomal proteins. Simultaneously, polyphosphatase is down-regulated, and the non-globular extensions of uncomplexed ribosomal proteins bind to PolyP (note that only uncomplexed ribosomal proteins are vulnerable to Lon proteolysis (refer to Supplemental Fig. 2, Ref. 53). Next, Lon binds to PolyP and possibly traverses up and down the phosphate polymer. Lastly, PolyP-bound, non-globular extensions of ribosomal proteins are rapidly processed by the Lon unfoldase. Bacterial histone-like proteins have similar DNA-binding extensions (54, 55), and three (hupA (56), stpA (57), and ymoA (58)) are known Lon substrates. Paradoxically, although Lon down-regulates the expression of invasion genes and the efficiency of host-cell invasion (59, 60), the study of Lon-null mutants has demonstrated that Lon is essential for systemic infection (61). Because the availability of nutrients within the phagosome is limited, ribosomal protein degradation may foster pathogenesis by generating much needed amino acids, thereby increasing the survivability of phagocytosed Salmonella. In addition, Lon degradation of ribosomal proteins might produce bioactive peptides such as host-cell toxins.
Although many of the detected peptides were derived from ribosomal proteins, a number of other proteins were also observed. Two of these proteins, trigger factor (tig) (62) and protein chain initiation factor I (infA) (63), are known to bind to ribosomes, and it is possible that the uncomplexed forms of these proteins are somehow more vulnerable to degradation. Many of the remaining proteins were found to be most abundant in stressed Salmonella (i.e. in the Stat, MgM, or N-Salts samples) and thus are likely to have stress response functions. Fragments of many of these were most abundant in the N-Salts samples. That Salmonella would up-regulate the expression of stress response factors only to target them for degradation is seemingly paradoxical. However, translational fidelity is known to dramatically decrease during starvation, which results in mistranslated proteins that are significantly less likely to properly fold, and thus their exposure to proteolysis is increased (64). In addition, misfolded proteins are more vulnerable to modification by reactive chemicals (e.g. carbonylation by reactive oxygen species), and these modifications further undermine proper folding, which makes these proteins even more susceptible to degradation (64). Assuming that ribosomes are indeed being catabolized, then this introduces a novel explanation of why starvation results in the production of defective proteins as partially digested ribosomes almost certainly have reduced translational fidelity. Two stress response factors depicted in Fig. 4, dnaK (33, 50, 65) and hns (33, 66), are known to be carbonylated in response to starvation, and it is possible that they are subsequently targeted for proteolysis. Although Lon is the major protease of aberrant proteins in bacteria (4751), ClpABPX (4951, 67), HslUV (50), and HtrA (49, 68) also have important roles. Therefore, two modes of protein degradation are likely up-regulated by phagocytosed Salmonella, PolyP/Lon-dependent directed proteolysis of ribosomal proteins and ClpABPX/HslUV/HtrA/Lon proteolysis of defective stress response factors.
It is likely that some of the identified peptides are bioactive. For example, Salmonella peptides are believed to interfere with host-cell MHC-I peptide presentation via an unknown PhoPQ- and YejABEF-dependent mechanism (69). Some of the amphipathic peptides may act by lysing the host-cell phagosomal and/or plasma membranes or may act as biofilm surfactants (70). Some may be bacteriocins (e.g. the Helicobacter pylori ribosomal L1 derived bacteriocin (71)), quorum sensors (e.g. Salmonella acetyltransferase repressing factor (72)) (perhaps via PhoPQ (29)), quorum sensor-mimics (e.g. against Gram-positive bacteria in the gastrointestinal tract), host hormone-mimics, host-cell toxins, pro-inflammatory peptides, or the cause of autoimmune disorders (i.e. via molecular mimicry on major histocompatibility complex molecules). The extent of peptidomic host-Salmonella interactions is largely unknown because only recently have researchers had the ultrasensitive, high throughput tools necessary to conduct peptidomics investigations. In the future, linguistic models of peptide sequences may be capable of distinguishing bioactive peptides from bioinactive ones (73). For now, however, although comparative peptidomics alone was insufficient to tentatively identify bioactive peptides, it did prove to be a powerful tool for identifying proteins targeted for proteolysis.
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published, MCP Papers in Press, January 16, 2007, DOI 10.1074/mcp.M600282-MCP200
1 The abbreviations used are: Log, logarithmic; Stat, stationary; MgM, acidic, magnesium-depleted minimal medium; LB, Luria-Bertani; SCX, strong cation exchange; SPE, solid phase extraction; %FPest., estimated percentage of false positive peptide identifications; AMT, accurate mass and elution time; NET, normalized elution time; PolyP, polyphosphate. ![]()
* Portions of this work were supported by the National Institute of Allergy and Infectious Diseases, (National Institutes of Health (NIH)/Department of Health and Human Services through interagency agreement Y1-AI-4894-01) and the NIH National Center for Research Resources (RR18522). Additional supporting results and protocols are available at the National Institute of Allergy and Infectious Diseases funded Administrative Resource for Biodefense Proteomics Research Centers under the PNNL data section (www.proteomicsresearch.org). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. ![]()
|| To whom correspondence and reprint requests should be addressed: Dept. of Molecular Microbiology and Immunology, OHSU Mail Drop L220, 3181 S. W. Sam Jackson Park Rd., Portland, OR 97239. E-mail: heffronf{at}ohsu.edu
| REFERENCES |
|---|
|
|
|---|