The Proteome of Shigella flexneri 2a 2457T Grown at 30 and 37 °C*

To upgrade the proteome reference map of Shigella flexneri 2a 2457T, the protein expression profiles of log phase and stationary phase cells grown at 30 and 37 °C were thoroughly analyzed using multiple overlapping narrow pH range (between pH 4.0 and 11.0) two-dimensional gel electrophoresis. A total of 723 spots representing 574 protein entries were identified by MALDI-TOF/TOF MS, including the majority of known key virulence factors. 64 hypothetical proteins and six misannotated proteins were also experimentally identified. A comparison between the four proteome maps showed that most of the virulence-related proteins were up-regulated at 37 °C, and the differences were more notable in stationary phase cells, suggesting that the expressions of these virulence factors were not only controlled by temperature but also controlled by the nutrients available in the environment. The expression patterns of some virulence-related genes under the four different conditions suggested that they might also be regulated at the post-transcriptional level. A further significant finding was that the expression of the protein ArgT was dramatically up-regulated at 30 °C. The results of semiquantitative RT-PCR analysis showed that expression of argT was not regulated at the transcriptional level. Therefore, we carried out a series of experiments to uncover the mechanism regulating ArgT levels and found that the differential expression of ArgT was due to its degradation by a periplasmic protease, HtrA, whose activity, but not its synthesis, was affected by temperature. The cleavage site in ArgT was between position 160 (Val) and position 161 (Ala). These results may provide useful insights for understanding the physiology and pathogenesis of S. flexneri.

Shigella flexneri is a Gram-negative facultative pathogen that causes the majority of communicable bacterial dysenteries in developing countries. Seven years ago, the whole genome sequence of S. flexneri 2a strain 2457T was determined (1), and a proteome reference map was also constructed at the same time (2). Predicting the pI of all proteins encoded by the bacterial genome revealed that about 45.54% (1972 proteins) were basic. However, all of these basic proteins and the vast majority of virulence-related proteins were missed in the old proteome reference map because of the limitations of the experimental conditions used at that time. Now, the proteomics platform of two-dimensional polyacrylamide gel electrophoresis (2-DE) 1 has been optimized and further developed. The resolution and sensitivity of today's 2-DE technique has been greatly improved. As a result, the expression profiles obtained today are difficult to compare with old profiles. An upgrade of the reference proteome map is urgently required for further proteomics research of S. flexneri, particularly for comparative studies.
The virulence genes of Shigella flexneri are located on a large (ϳ230-kb) plasmid acquired by horizontal transfer and encode a type III secretion system apparatus, substrates of this apparatus, and their dedicated chaperones. This enables S. flexneri to invade epithelial cells in the lower gut of humans (3,4). The expression of virulence genes is induced under growth conditions similar to those found at the site of invasion. For example, a temperature of 37°C is a particularly important environmental signal. Maurelli et al. (5) found that S. flexneri cultivated at 37°C produced keratoconjunctivitis in guinea pigs and was able to penetrate and replicate in intestinal epithelial cells, but bacteria grown at 30°C were phenotypically avirulent and non-invasive. The mechanism regulating virulence gene expression in S. flexneri has been well studied (for a review, see Ref. 6). A living cell is a highly complex entity, all parts of which are involved in its normal functions. We can expect that a large number of genes, other than virulence genes, will be simultaneously differentially expressed in response to a change in environmental temperature. To obtain a global view on protein expression level changes in S. flexneri induced by temperature up-shift, we compared the protein expression profiles of log phase and stationary phase cells grown at 30 and 37°C. In agreement with previous reports, most up-regulated genes at 37°C are known virulence-related genes and heat shock proteins, and the differences were more notable in stationary phase cells. At the same time, we found that the level of the protein ArgT is dramatically increased at 30°C.

EXPERIMENTAL PROCEDURES
Bacterial Strains and Growth Conditions-Escherichia coli strain DH5␣, used for plasmid constructions, was grown in Luria-Bertani (LB) agar or LB broth (Difco). Wild-type S. flexneri 2a strain 2457T was grown in tryptic soy agar (Difco) containing 0.01% Congo red or in LB broth. When necessary, antibiotics were added at the following concentrations: ampicillin, 100 g/ml; kanamycin, 50 g/ml; and chloramphenicol, 30 g/ml.
Preparation of Whole-cell Protein Extracts-The virulent S. flexneri serotype 2a strain 2457T was grown aerobically in 100 ml of LB medium at 37°C. Cells were harvested in the log phase (A 600 nm ϭ 1.0) and stationary phase (after about 9 h of culture). The preparation of whole-cell protein extracts was performed as described previously (7). The protein concentration of samples was measured using the PlusOne 2-D Quant kit (GE Healthcare), and 0.8-mg aliquots were stored at Ϫ80°C.

2-DE and
In-gel Protein Digestion-To obtain better separation, pH 4.0 -5.0, pH 4.5-5.5, pH 5.0 -6.0, pH 5.5-6.7, and pH 6 -11 IPG strips (18 cm; GE Healthcare) were used in the IEF analysis. For each analysis, the samples were treated with the 2-D Clean-Up kit (GE Healthcare) according to the kit instructions and resuspended in 350 l of rehydration buffer (7 M urea, 2 M thiourea, 4% (w/v) CHAPS, 50 mM DTT, 0.5% IPG Buffer (same pH range of the IPG strip)). The following 2-DE procedure and the in-gel protein digestion were carried out as described previously (7). Briefly, an 800-g protein sample was used to rehydrate an 18-cm immobilized gradient strip for 12 h at 20°C. Focusing started at 300 V (1 h), was increased to 1000 V (1 h), and was further increased to 8000 V (12 h). After focusing, the strips were equilibrated for 15 min in equilibrium buffer (2% SDS, 50 mM Tris-HCl, pH 8.8, 6 M urea, 30% glycerol, 0.002% bromphenol blue, 1% DTT). The strips were then overlaid onto 12.5% SDS-polyacrylamide gels for the second dimensional separation. The protein spots were carefully excised from the Coomassie-stained 2-DE gel, destained, washed, and then digested for 13 h with sequencing grade modified trypsin (Roche Applied Science). Peptides from digested proteins were used for MALDI-TOF/TOF analysis. Image analysis was processed by ImageMaster 2D Platinum software (GE Healthcare). To facilitate the discrimination between real spots and artifacts, the spot detection parameters were adjusted as follows: smooth, 3; minimum area, 50; and saliency, 6. The images of two samples under different cultivation conditions were compared. The relative volume of each spot was determined from the spot intensities in pixel units and normalized to the sum of the intensities of all the spots on the gel.
MALDI-TOF/TOF MS-MALDI-TOF/TOF MS measurements were performed on a Bruker Ultraflex TM III MALDI-TOF/TOF MS (Bruker Daltonics) operating in reflectron mode with 20-kV accelerating voltage and 23-kV reflecting voltage. A saturated solution of ␣-cyano-4hydroxycinnamic acid in 50% acetonitrile and 0.1% trifluoroacetic acid was used as the matrix. One microliter of the matrix solution and sample solution at a ratio of 1:1 were applied onto the Score384 target well. Routinely, a standard peptide calibration mixture in the mass range 800 -3200 Da (Bruker Daltonics) was analyzed for external calibration of the mass spectrometer. The calibration mixture contained angiotensin II, angiotensin I, substance P, bombesin, ACTH clip 1-17, ACTH clip 18 -39, and somatostatin 28. A series of eight samples was spotted around one external calibration mixture. The SNAP algorithm (signal to noise ratio threshold, 5; quality factor threshold, 30) in FlexAnalysis TM 3.4 was used to pick up the 150 most prominent peaks in the mass range m/z 700 -4000. The subsequent MS/MS analysis was performed in a data-dependent manner, and the five most abundant ions fulfilling certain preset criteria (signal to noise ratio higher than 25 and quality factor higher than 50) were subjected to high energy CID analysis. The collision energy was set to 1 keV, and nitrogen was used as the collision gas.
Data Interpretation and Database Searching-Peptide mass fingerprints (PMFs) were searched by the program Mascot 2.1 (Matrix Science Ltd.) against the database of S. flexneri 2a 2457T (4540 entries, including all of the ORFs predicted on the chromosome of S. flexneri 2a 2457T (GenBank TM accession number GI:30043918), the virulence plasmid pCP301 (GenBank accession number GI:18462515) from S. flexneri 2a 301, and the large IncHI plasmid R27 (GenBank accession number GI:7800243) from Salmonella typhi) to eliminate redundancy resulting from multiple members of the same protein family, and the results were checked against the NCBInr database (version October 21, 2006, 4,072,503 sequences). For those proteins identified in the NCBInr database, the proteins of S. flexneri spp. were selected as the best hits from the homologue protein lists. The search parameters were as following: trypsin digestion with one missed cleavage; carbamidomethyl modification of cysteine as a fixed modification and oxidation of methionine as a variable modification; peptide tolerance maximum, Ϯ0.2 Da; MS/MS tolerance maximum, Ϯ0.6 Da; peptide charge, 1ϩ; monoisotopic mass. Scores greater than 49 were significant (p Ͻ 0.05) for a local PMF search. For unambiguous identification of proteins, more than five peptides must be matched for a PMF search.
RNA Extraction and Semiquantitative RT-PCR-Under different growth conditions, the relative abundance of argT mRNA was determined by semiquantitative RT-PCR. Total RNA was extracted from 2457T cultivated at 30 and 37°C using the RNeasy Mini kit (Qiagen) and treated with the RNase-free DNase set (Qiagen). RNA concentration and purity were evaluated by spectrophotometric analysis at 260 and 280 nm. After digestion by DNase I to eliminate contaminating DNA, reverse transcription was carried out with the reverse Su-perScript Choice system (Invitrogen) using 2 g of total RNA as the template. Specific target RT-PCR products were normalized to an established endogenous control, 16 S ribosomal RNA, the expression of which is relatively constant in bacteria (8). Negative controls were performed using RNA as template to confirm the absence of contaminating DNA in the RNA preparations.
To introduce a point mutation in htrA, PCR was performed using Pfu DNA polymerase (TaKaRa), primers containing the desired mutation, and the appropriate plasmid as the template according to the QuikChangeா site-directed mutagenesis kit (Stratagene). The PCR products were digested by DpnI and then transformed into DH5␣ competent cells. The desired point mutation was confirmed by sequencing. Primers S210A1 and S210A2 were used to generate pGEX-htrA S210A .
An htrA deletion mutant of S. flexneri (2457T⌬htrA) was constructed by a modified form of lambda red recombination, originally described by Datsenko and Wanner (9). Briefly, a 500-bp upstream region and a 500-bp downstream region of htrA were independently amplified by PCR using primers h5p1 and h5p2 and primers h3p1 and h3p3, respectively. The resulting products were cloned into pET22b (Novagen) along with kan amplified from the plasmid pKD4. S. flexneri 2457T, carrying pKD46 (9) containing the red recombinase genes, was grown at 30°C in the presence of 10 mM arabinose (to induce the recombinase genes) and was transformed with gel-purified h5-Kan-h3 PCR product amplified by h5p1 and h3p2 primers from the previously constructed pET22b based vector. Recombinants were selected on LB plates containing kanamycin (50 g/ml). The primers used in this study are listed in supplemental Table 1.
Periplasmic Protein Extraction-Periplasmic protein was extracted by the improved osmotic shock method (10). About 5 ml of early stationary phase cells were harvested and washed twice with 1ϫ PBS. The washed cell pellet was resuspended in 100 l of cell lysis buffer (20% sucrose, 30 mM Tris-Cl, pH 8.0, 1 mM EDTA, 1 mg/ml lysozyme). The cell suspension was shaken gently in ice-cold water for 15 min followed by centrifugation at 6000 rpm for 15 min. After that, the recovered supernatant represents the periplasmic proteins. The pellet, representing the remaining whole-cell proteins, was resuspended in 500 l of water.
Protein Expression and Purification-Expression of fusion proteins was induced by adding 1 mM IPTG when cultures had grown to an A 600 of 0.6 -0.9. After the cell growth continued for another 4 h at 30 or 37°C, an aliquot of 1 ml was harvested. The induced ArgT and HtrA fusion proteins were purified from cultures growing at 20°C for 16 h using Glutathione-Sepharose 4B beads (GE Healthcare). The proteins were eluted with elution buffer (20 mM glutathione, 100 mM Tris-HCl, pH 8.0, 120 mM NaCl).
Antibody Preparation-To generate polyclonal ArgT antibodies, purified GST-ArgT proteins were injected subcutaneously into two BALB/c female mice. Sera from the immunized mice were collected and purified by affinity chromatography according to the manufacturer's instructions (Pierce).
SDS-PAGE and Western Blot Analysis-All samples were boiled and loaded on 12% acrylamide gels for SDS-PAGE, and after electrophoresis the proteins were stained with Coomassie Blue. When necessary, proteins were transferred to PVDF membranes (Immobilon, Millipore) after electrophoresis. The membranes were blocked with 5% nonfat milk and then incubated for 1 h with primary antibodies raised against ArgT. The membranes were then washed, and horseradish peroxidase-conjugated goat anti-mouse IgG secondary antibodies (Santa Cruz Biotechnology) were added for 1 h. The ECL Plus kit from Pierce was used for detection.
Proteolysis Test-Proteolysis was performed as described previously (11). HtrA (0.5 g) was incubated with ArgT (50 g) in 50 mM Tris-Cl, pH 8.0 buffer in a final volume of 100 l. 15-l samples were taken every hour and then resolved by SDS-PAGE.
Coomassie-stained ArgT fragments cleaved by HtrA were cut out, and in-gel digestion was performed by trypsin (7) and Glu-C (12) as described previously. Peptides from digested fragments were resolved in 2 l of 0.5% trifluoroacetic acid. Mass spectrometry was performed on a Bruker Ultraflex III MALDI-TOF/TOF mass spectrometer (Bruker Daltonics) according to the procedures described above.

Two-dimensional Gel Electrophoresis and Protein Identification
Because bacteria only express a limited portion of their proteome under any one condition (13), we analyzed the proteome of S. flexneri at two different temperatures and at two different growth phases to obtain better proteome coverage. According to our previous report (2), most of the wholecell proteins were scattered in the pI ranges of pH 4 -7. So, variable IPG strips of pH 4.0 -5.0, pH 4.5-5.5, pH 5.0 -6.0, and pH 5.5-6.7 were used to resolve protein in the densely populated pH 4.0 -7.0 zone. These basic proteins were also analyzed using IPG strips of pH 6.0 -11.0. The separate graphs of the above different pH gradient range gels were merged to produce a single artificial gel map of pH 4.0 -11.0. The two artificial maps of log phase cells displayed more than 1500 Coomassie-stained protein spots, whereas those of stationary phase cells only displayed about 1000 spots ( Fig. 1 and supplemental Fig. 1). This indicated that the various metabolisms of cells were more active in the log phase. After destaining and in-gel trypsin digestion, a total of 723 spots representing 574 proteins were successfully identified by MALDI-TOF/TOF MS (supplemental Table 2). Maps and information for all identified proteins are available at the Proteome 2D-PAGE Database.

Predicted and Actual Proteome of S. flexneri 2a 2457T
The theoretical 2-DE map was constructed according to the genome annotation of S. flexneri 2a 2457T and the virulence plasmid pCP301 and compared with the identified maps. As shown in supplemental Fig. 2, the actual 2-DE map displayed a bimodal protein distribution pattern similar to the predicted map. The identified basic proteins (13.74%) were less abundant than predicted (45.54%). This is a common scenario in other bacteria (14). The possible explanations are that the basic proteins are inherently difficult to separate by 2-DE or that their levels are very low. Isoelectric point values and molecular weight values estimated by gel electrophoresis matched closely with predicted values, although some discrepancies were seen (supplemental Fig. 3). The differences of isoelectric point or molecular weight values suggested that these proteins had undergone post-translational modifications. The differences in molecular weight values are mainly due to the cleavage of signal peptides or other functional sequences.

Distribution of Proteins According to CAI, GRAVY Value, Cellular Role Categories, and Localization
CAI estimates the degree of synonymous codon adaptation in a coding region compared with the optimal usage (15). The CAI distributions of genes coding for the proteins identified were compared with those of all predicted proteins (Fig. 2). The proteins encoded by genes with a CAI value Ͼ0.4 account for 17.45% of the total proteins but 54.37% of the currently identified proteins, supporting the idea that proteins encoded by genes with a high CAI were abundant and easily identified. The overall hydrophobicity of a protein, expressed as the GRAVY index (15), was also calculated for all identified and predicted proteins. A comparison between identified proteins and total predicted proteins ( Fig. 2) shows that both hydrophobic proteins (with very high GRAVY values) and the very hydrophilic proteins (with very low GRAVY values) are not present among the identified proteins. These results were similar to those of previous reports from E. coli (16), Lactococcus lactis (17), Bacillus subtilis (18), and Bifidobacterium longum (19).
Based on clusters of orthologous groups (COG) information, experimentally identified proteins were grouped into cellular roles and are summarized in Fig. 2. Proteins related to energy production and conversion (category C), translation (category J), and amino acid transport and metabolism (category E) were the three categories containing the most identified proteins. Interestingly, although proteins related to replication, recombination, and repair (category L) were predicted to be the most abundant category (18.02%), only a small proportion of these proteins (1.52%) was identified. However, this observation was not in agreement with similar proteomics data obtained in other bacteria (19,20). When compared with the COG distributions of the theoretical proteomes of related microbes, it was found that only about 5% of the predicted proteins were grouped into category L in other proteobacteria, such as E. coli and Salmonella. The reason for this difference is that there is an astonishingly large number of transposable genetic elements in the genome of Shigella spp. (1,21) that encode proteins related to recombination, such as transposase. The low rate of identification in this category suggested that these transposon-associated proteins were not necessary for cell grown in complex media. The cellular localizations of all identified proteins, predicted by PSORT Version 2.0, are also compared with those of all predicted proteins (Fig. 2). Briefly, 337 proteins identified are cytoplasmic, 13 proteins were predicted to reside in the cytoplasmic membrane, and 18 proteins were predicted to be located in the outer membrane. Proteins located in the outer membrane seem to be more easily identified than those located in the cytoplasmic membrane.

Abundance Analysis of Identified Proteins
The abundance of many proteins varied largely according to the four different conditions, such as the virulence factors shown in Fig. 3 and some growth phase-dependent proteins (such as Spot IDs 125, 205, and 270). Besides these, there were still some highly abundant proteins (volume % Ͼ0.6) common to the different conditions, suggesting that they play important functions. The information about these abundant proteins is summarized in Table I. Among these proteins, only one protein has a calculated CAI value of less than 0.65, indicating that proteins encoded by genes with a high CAI are abundant. Furthermore, we also located the identified proteins in the genome with respect to their abundance. As shown in supplemental Fig. 4, the distribution and organization of expressed genes in the genome does not show any distinct active regions, although it is known that most functional genes are located in gene clusters. Despite this, there are still several regions of ϳ100 kb without any genes expressed (so-called "expression deserts"; such as 1.86M-1.96M and 3.85M-3.95M regions in the genome) and one 20-kb region where abundant proteins are clustered (called an "expression oasis"; 4.33M-4.35M in the genome). Interestingly, this expression-abundant region is not the most highly expressed region (the highest peak in the outer circle in supplemental Fig. 4). a Score greater than 49 is significant. The detailed information about protein identification is summarized in supplemental Table 2. b ENc is the effective number of codons, which is a measure of the extent of codon preference in a gene. c The single letter codes represent a particular functional category as follows: J, translation; L, replication, recombination, and repair; M, cell wall/membrane biogenesis; O, post-translational modification, protein turnover, chaperones; C, energy production and conversion; G, carbohydrate transport and metabolism.

Hypothetical Genes and Misannotated Genes
The lack of homology to other known proteins makes hypothetical proteins interesting as they may have unique functions required in a particular organism. Altogether 64 products of hypothetical genes, whose functions have not been assigned up to now, were experimentally confirmed in this study. Proteome studies focusing on the presence of these proteins during growth phase, stress, or other processes may give insight into their cellular roles (19). Moreover, the expression of six misannotated genes was also confirmed. Among these, four genes (Spot IDs A123, 0673, zx70, and A153) are annotated as pseudogenes due to frameshift, and one gene (Spot ID A020) is unannotated in the genome sequencing project. In fact, previous proteome research of bacteria also provided evidence for expression of genes annotated as pseudogenes (20,22), suggesting that expression of pseudogenes is common in bacteria. The last misannotated gene, previously predicted to encode a 4.3-kDa protein in the same region of the genome, actually encodes a 26-kDa protein homologous to protein GI: 110805376. All of these data will be helpful to correct the genome annotation of S. flexneri.

Comparative Proteomics Analysis of S. flexneri Comparative Proteomics Analysis of S. flexneri in Log Phase and Stationary Phase
The results of this study are similar to a previous report (23). Briefly, compared with the protein expression profiles of cells in log phase, the expression of 2-oxoglutarate dehydrogenase (SucAB; Spot IDs X60 and 205) was reduced. The expressions of the pyruvate dehydrogenase (AceE; Spot ID 198) and fumarate reductase (FrdAB; Spot IDs 322 and 425), aspartase (AspA; Spot ID 170), and periplasmic asparaginase II (AnsB; Spot ID 300) were, however, greatly increased. These results are consistent with bacterial cells converting their aerobic metabolism into anaerobic fumarate metabolism during the stationary phase (24). The expression of the glycerol 3-phosphate (glp) regulon-encoded proteins (GlpABDKQ; Spot IDs 396, 344, 505, 179, and 133) was also increased in the stationary phase. These proteins might be related to stationary autophagy (dwarfing phenomenon), an important survival mechanism of bacteria (25).

Comparative Proteomics Analysis of S. flexneri Grown at 30 and 37°C
Because proteins are the functional outputs of genes, analysis of the changes in the proteome of pathogens grown at conditions similar to their host environments will uncover those precisely regulated and selectively expressed genes. The 2-DE profiles of S. flexneri cultures grown at 30 and 37°C are highly comparable (supplemental Fig. 1). A protein was regarded to be temperature-dependent if the spot density in one condition was significantly different (Ͼ2-fold) from that in the other condition.
Virulence-related proteins-The database search results obtained from MALDI-TOF/TOF MS showed that most upregulated proteins at 37°C were heat shock proteins and virulence factors (such as IpaABCD; Spot IDs 395, 678, 486, and 303) in agreement with previous reports (6). Locating the positions of these virulence factors on 2-DE maps may facilitate the study of their cellular roles in subsequent functional proteomics analyses (i.e. under invasion or other processes). All of the identified known virulence factors encoded by the virulence plasmid are listed in Table II. The abundance of some virulence factors under the different conditions is also shown in Fig. 3. Interestingly, although the abundance of IpaC is highest in the stationary phase at 37°C, its CAI value is only 0.222, which is much lower than those of other abundant proteins encoded by the chromosome. Thus, evaluating synonymous codon usage is not accurate enough for the prediction of expression abundance, particularly for those exogenous genes acquired by horizontal transfer.
As shown in Fig. 3, the expression patterns of virulence factor IpaC and its protein chaperon IpgC are similar. Their abundances were greatly increased at 37°C in the stationary phase but not in the log phase no matter what the growth temperature was. Strangely, this kind of expression pattern is not consistent with that of other virulence factors, such as IpaA and IpaD. The levels of IpaA and IpaD were not apparently increased in the stationary phase, although they were increased at 37°C. Because the encoding genes of the above virulence factors are located in a single cluster in the virulence plasmid of S. flexneri, the difference of their expression patterns might result from a particular post-transcriptional regulation. This regulatory mechanism is probably related to the growth phase transition. Furthermore, the abundance of virulence protein IpaB was significantly lower than its group partners, suggesting that its distinct physical-chemical characteristics probably lead to dissolution difficulties. In addition, the levels of all of these virulence proteins were higher in the log phase than in the stationary phase at 30°C. The reason for this might be that the superhelix conformations of the plasmid DNA were much looser under rapid growth and efficient metabolism.
Another finding is that the abundance of protein SepA (Spot ID 475), whose expression was not previously considered to be controlled by temperature (26), was actually higher at 37°C than at 30°C (Fig. 3). SepA is a secreted serine protease of the IgA1 protease family. Its function is not required for entry into epithelial cells or for intercellular dissemination, but it might play a role in tissue invasion (26). The molecular mass of SepA precursor is 146 kDa, and its mature form is about 110 kDa after autonomous proteolysis. However, the experimental molecular mass of identified spots was only about 26 kDa. Further analysis of the PMF database search revealed that most matched peptides were clustered at the C-terminal outer membrane autotransporter barrel domain (TIGR01414) a Score greater than 49 is significant for a PMF search, and 21 is significant for an ion MS/MS search. b ENc is the effective number of codons, which is a measure of the extent of codon preference in a gene.
of SepA, suggesting that this protein was cut off, and only its C terminus was left. Although this cleavage has been analyzed by traditional biochemical methods (26), it still indicated the capability of proteomics data mining for finding posttranslational modifications.
There are still some known virulence factors that were not identified, such as ospD3, ospE1/2, and ipaH. In fact, the expression of virulence proteins is controlled by two kinds of plasmid-encoded transcriptional regulators: 1) virB and virF, which are active upon temperature up-shift, and 2) mxiE, which is active under conditions of secretion (27). The above unidentified virulence factors are those controlled only by regulator mxiE, which will not be active in our experimental conditions. It should also be noted that the identification of parts of known virulence factors was in agreement with a previous microarray analysis (4). Thus, these proteomics results not only provide a global view on expression of virulence proteins but also provide new evidence for the classification of effectors in the type III secretion system of S. flexneri.
De Novo Biosynthesis of Purine Nucleotides-Five of the down-regulated genes at 37°C (purDEHMT; Spot ID 036/444/ 208/042/229, ratio 0.36/0.26/0.49/0.27/0.46 at stationary phase) were associated with the de novo biosynthesis of purine nucleotides. An additional four genes of this pathway (purABCU; Spot ID 177/335/106/467, ratio 0.69/0.55/0.58/ 0.67 at stationary phase) were also down-regulated, although the ratios were not significant. These genes are unlinked single units in the genome of S. flexneri but may act in a single unit of regulation controlled by the "PurR" repressor protein (28). Therefore, the temperature-dependent expression of these genes is possibly regulated at the level of transcription. Simultaneous detection of the differential expression of these scattered genes also exhibited the power and the confidence of comparative proteomics analysis. Because the bacterium divides more rapidly at higher temperature, it is surprising to see a decrease in abundance of Pur gene products. It might be a result of using rich growth medium and hence no need for de novo synthesis. The decreased expression of these proteins might be caused by the increased expression of virulence-related proteins at 37°C that consume a lot of energy and material resources. Another possible explanation is that the enzyme activities of Pur proteins are higher at 37°C than those at 30°C. Then, if the same units of enzyme are needed for the survival of bacteria, the total abundance of these proteins will be increased.
Amino Acid Transporter ArgT-A further interesting finding is that the spots of protein ArgT (Spot ID 113) disappeared at 37°C (Fig. 3), which has not been previously reported in related species. The argT gene is predicted to encode the lysine/arginine/ornithine-binding protein, a periplasmic protein responsible for binding the amino acid substrates during the process of amino acid transport into the cytoplasm (29). According to studies in E. coli, the expression of ArgT is up-regulated in the stationary growth phase (30) or under glucose-limiting conditions (31), suggesting its role in the physiological adaptation of bacteria to detrimental conditions. Our following experiments were focused on the regulatory mechanism of ArgT expression.

Regulation Mechanism of ArgT Expression Differential Expression of ArgT Is Due to Proteolysis
In bacteria, regulation at the transcriptional level is the most efficient and cost-effective way to control gene expression. However, the results of semiquantitative RT-PCR analysis (Fig. 4A) showed that the quantity of argT mRNA at 37°C is similar to that at 30°C, suggesting that the regulation of ArgT expression did not occur at the transcriptional level.
It is well known that translation is coupled to transcription in bacteria. Because the regulation of ArgT expression did not occur at the transcriptional level, post-translational regulation, such as proteolysis, was the most plausible mechanism; that is the protein might be degraded at 37°C. To test our hypothesis, we constructed an ArgT-overexpressing plasmid, pProEX-HTb-argT (pProEX-argT), and transformed it into competent cells of S. flexneri 2457T. After induction by IPTG at 37 and 30°C, the cultures were collected and analyzed by SDS-PAGE. The results showed that even if 2457T cells were compelled to express ArgT by the exogenous plasmid the expression product could not be detected at 37°C (Fig. 4B). Because transcription and translation of argT in the exogenous expression vector were not subject to the effect of temperature, we concluded that the differential expression of ArgT was due to protein proteolysis.

ArgT Undergoes Proteolysis in Periplasmic Space of S. flexneri
According to the genome annotation of S. flexneri, ArgT is located in the periplasmic space. The ArgT protein sequence contains a signal peptide to traverse the inner membrane and reach the periplasmic space. Hence, is ArgT degraded in the cytoplasm or in the periplasmic space? To answer this question, the periplasmic proteins of 2457T/ pProEX-argT were extracted and probed by Western blot analysis with anti-ArgT antibody. ArgT protein was only detected in the periplasmic proteins extracted from cultures grown at 30°C but not in the remaining whole-cell proteins at 30°C or in the protein samples extracted from cultures grown at 37°C (Fig. 5A).
On the other hand, 2457T strains carrying pGEX-argT and carrying pGEX-argT2, which expressed ArgT without the signal peptide, were induced to express at 30°C and then incubated at 37 and 30°C for 4 h. The total proteins of each culture were analyzed by Western blot with anti-ArgT (Fig.  5B). The results showed that ArgT protein was not degraded when the signal peptide was removed. Thus, we concluded that ArgT undergoes proteolysis in the periplasmic space of S. flexneri.

ArgT Proteolysis Is Not Dependent on de Novo Protein Synthesis
Is the protease responsible for the cleavage of ArgT synthesized at 37°C but not at 30°C? To answer this question, we did the following experiment. 2457T/pProEX-argT cells were cultivated and induced to express ArgT by IPTG at 30°C for 4 h, and then chloramphenicol was added to prevent new protein synthesis. After that, the temperature of the experimental group was increased to 37°C for another 6 h, whereas the temperature of the control group was kept at 30°C. Whole-cell protein samples were prepared every hour and analyzed by Western blot. As shown in Fig. 6, the overexpressed ArgT proteins at 30°C were degraded rapidly after temperature up-shift to 37°C, and this process was not dependent on any newly synthesized proteins. That is, ArgT was degraded at 37°C by a protease synthesized at 30°C but activated at 37°C.

ArgT Was Degraded by HtrA Protease
Because ArgT was degraded in the periplasmic space of 2457T at 37°C but not at 30°C, it is reasonable to suppose that a temperature-activated periplasmic protease may be responsible for the degradation of ArgT. HtrA (also named DegP) attracted our attention because it is a heat shockinduced protein and plays crucial roles with regard to protein quality control in the periplasmic space, functioning as both a molecular chaperone and protease (32,33). It has been reported that the proteolytic activity of HtrA increases with increasing temperature (11,33).
Is ArgT specifically degraded by HtrA at 37°C in vivo, or can another protease perform the same function? To answer this question, an htrA deletion mutant of S. flexneri FIG. 4. Differential expression of ArgT is due to proteolysis. A, semiquantitative RT-PCR analysis of argT. B, overexpression of ArgT at 37 and 30°C. 2457T/pProEX-argT (2457T carrying pProEX-HTb-argT) was induced by IPTG at 37 and 30°C, respectively. M, protein marker (Fermentas).
FIG. 5. ArgT undergoes proteolysis in periplasmic space of S. flexneri. A, the periplasmic proteins (periplasm) and the remaining whole-cell proteins (cytoplasm ϩ membrane) of induced 2457T/ pProEX-argT cells were extracted from cultures grown at 37 and 30°C. ArgT expression products were only detected in the periplasm of 2457T grown at 30°C. B, 2457T/pGEX-argT (with signal peptide) and 2457T/pGEX-argT2 (without signal peptide) were induced by IPTG at 30°C and then incubated at 37 and 30°C for 4 h. Due to the cleavage of the signal peptide, the ArgT detected in 2457T/pGEX-argT is smaller than that in 2457T/pGEX-argT2. M, protein marker (Fermentas).
(2457T⌬htrA) was constructed using the lambda red recombination strategy. Then, the ArgT-overexpressing plasmid was transformed into the 2457T⌬htrA strain. 2457T⌬htrA/pProEX-argT was induced to express ArgT by IPTG at 37 and at 30°C. The results (Fig. 7A) showed that ArgT could not be degraded in the htrA deletion mutant. Thus, we concluded that HtrA is related to ArgT proteolysis in vivo.
Skó rko-Glonek et al. (11) reported that mutation of the Ser residue to Ala at position 210 of the mature HtrA could totally eliminate the proteolytic activity of HtrA without changing its structure. To test whether HtrA protease degraded ArgT directly without the participation of any other proteins, GST-ArgT, GST-HtrA, and GST-HtrA S210A fusion proteins were purified, and proteolysis assays were carried out in vitro. As shown in Fig. 7B, ArgT was degraded by HtrA in vitro at 37°C but not at 30°C. Moreover, ArgT was not degraded by the inactivated HtrA S210A . Thus, we concluded that HtrA is the protease responsible for the temperature-dependent degradation of ArgT. Periplasmic protease HtrA, a known virulence factor (34), is responsible for the degradation of ArgT. It has been reported that HtrA facilitates IcsA surface expression and is required for efficient intercellular spread of S. flexneri (35). HtrA is also found to be a virulence factor in other bacteria, such as Salmonella typhimurium (36) and Listeria monocytogenes (37).

Identification of Cleavage Site in ArgT
To identify the HtrA cleavage sites of ArgT, the fragments of ArgT degraded by HtrA were separated by 15% SDS-PAGE (Fig. 8A). ArgT was cleaved into two fragments (Fig. 8A, frag1  and frag2). The fragments were cut out and digested by trypsin and Glu-C, respectively (trypsin cleaves peptide on the carboxylic side of lysine and arginine; Glu-C cleaves peptide on the carboxylic side of Glu in ammonium bicarbonate at pH 7.8). The resolved digestion products were analyzed by tandem mass spectrometry (MALDI-TOF/TOF). The PMF result of the larger fragment (frag1; about 32 kDa) matched some peptides of GST (m/z 1032.57, 1094.56, 1182.68, 1516.81, 1801.94, 2229.08, 2269.12, 2326.12, and 2357.18), suggesting this fragment was the N terminus of ArgT, and the MS/MS results showed that one of the peptides (molecular weight, 1141.55) of the smaller fragment (frag2; about 14 kDa) matched the C-terminal region of ArgT. Furthermore, the sum of the molecular weights of these two fragments was similar to the molecular weight of intact ArgT. All these observations suggested that these two fragments might be produced by one cleavage in ArgT.
If we can identify the N-terminal or C-terminal sequence of the cleavage fragments, we can determine the cleavage site. That is, our targets are those ion peaks whose experimental ion masses did not match the theoretical values calculated from protein digestion of ArgT. One such peptide digested by trypsin (molecular weight, 1726.842) and one peptide digested by Glu-C (molecular weight, 2583.301) were identified. They represented the same cleavage site according to their amino acid sequences (Fig. 8, B, C, and D). Thus, we concluded that the cleavage site in ArgT was between position 160 (Val) and position 161 (Ala). This is in agreement with a report suggesting a preference for Val and Ile as the residue preceding the HtrA cleavage site (34).
FIG. 6. ArgT, overexpressed at 30°C, was degraded at 37°C without new protein synthesis. The cultures of 2457T/pProEX-argT were induced to express ArgT at 30°C for 4 h. Then, chloramphenicol was added to prevent new protein synthesis. After that, the cells were incubated at 30 (left) and 37°C (right) for another 6 h. CK, the cultures of 2457T/pProEX-argT induced to express at 37°C. M, protein marker (Fermentas).

Conclusion
In this study, the proteome reference maps of the log phase and stationary phase S. flexneri 2a 2457T cells grown at 30 and 37°C were thoroughly analyzed by the use of multiple overlapping narrow pH range (pH 4.0 -5.0, pH 4.5-5.5, pH 5.0 -6.0, pH 5.5-6.7, and pH 6 -11) two-dimensional gel electrophoresis. Altogether, 755 spots representing 588 different protein entries were identified by MALDI-TOF/TOF MS. The majority of the known key virulence factors of S. flexneri were located in the 2-DE maps for the first time. We also analyzed the abundance of identified proteins at different stages and confirmed the expression of 70 hypothetical proteins. Moreover, six misannotated proteins were experimentally identified.
The comparative proteomics analysis also revealed that protein ArgT of S. flexneri was not detected at 37°C but was present at 30°C, contrary to the expression of virulence genes. We then showed that ArgT was degraded by the periplasmic protease HtrA, a known virulence-related protein.
All of these findings suggested that the temperature-dependent proteolysis of ArgT at 37°C is possibly related to the virulence of S. flexneri. Thus, our future work will focus on the mutation of argT to construct a mutant strain that can express functional ArgT proteins at 37°C. We will then be able to evaluate the virulence of this mutant by competitive invasion assays in HeLa cells and BALB/c mice.