Integrated Transcriptomic and Proteomic Analysis of the Global Response of Synechococcus to High Light Stress*

Sufficient light is essential for the growth and physiological functions of photosynthetic organisms, but prolonged exposure to high light (HL) stress can cause cellular damage and ultimately result in the death of these organisms. Synechococcus sp. PCC 7002 (hereafter Synechococcus 7002) is a unicellular cyanobacterium with exceptional tolerance to HL intensities. However, the molecular mechanisms involved in HL response by Synechococcus 7002 are not well understood. Here, an integrated RNA sequencing transcriptomic and quantitative proteomic analysis was performed to investigate the cellular response to HL in Synechococcus 7002. A total of 526 transcripts and 233 proteins were identified to be differentially regulated under HL stress. Data analysis revealed major changes in mRNAs and proteins involved in the photosynthesis pathways, resistance to light-induced damage, DNA replication and repair, and energy metabolism. A set of differentially expressed mRNAs and proteins were validated by quantitative RT-PCR and Western blot, respectively. Twelve genes differentially regulated under HL stress were selected for knockout generation and growth analysis of these mutants led to the identification of key genes involved in the response of HL in Synechococcus 7002. Taken altogether, this study established a model for global response mechanisms to HL in Synechococcus 7002 and may be valuable for further studies addressing HL resistance in photosynthetic organisms.

Sufficient light is essential for the growth and physiological functions of photosynthetic organisms, but prolonged exposure to high light (HL) stress can cause cellular damage and ultimately result in the death of these organisms. Synechococcus sp. PCC 7002 (hereafter Synechococcus 7002) is a unicellular cyanobacterium with exceptional tolerance to HL intensities. However, the molecular mechanisms involved in HL response by Synechococcus 7002 are not well understood. Here, an integrated RNA sequencing transcriptomic and quantitative proteomic analysis was performed to investigate the cellular response to HL in Synechococcus 7002. A total of 526 transcripts and 233 proteins were identified to be differentially regulated under HL stress. Data analysis revealed major changes in mRNAs and proteins involved in the photosynthesis pathways, resistance to light-induced damage, DNA replication and repair, and energy metabolism. A set of differentially expressed mRNAs and proteins were validated by quantitative RT-PCR and Western blot, respectively. Twelve genes differentially regulated under HL stress were selected for knockout generation and growth analysis of these mutants led to the identification of key genes involved in the response of HL in Synechococcus 7002. Cyanobacteria are a large group of prokaryotes that have photosystem II (PS II) 1 , photosystem I (PS I), and carry out oxygenic photosynthesis. They play important roles in global carbon and nitrogen cycles. Using solar energy, cyanobacteria generate the reducing equivalents for CO 2 fixation and synthesis of carbohydrates and other metabolite building blocks. In addition to reducing equivalents, the photosynthetic apparatus generates a proton gradient across the thylakoid membranes for ATP synthesis.
Light is a constantly changing environmental factor and cyanobacteria must have the ability to acclimate to changing light conditions. The acclimation can be divided into shortterm and long-term processes (1, 2). Short-term acclimation includes state transitions, protective energy dissipation (3,4), changes in the energy transfer efficiency from the harvesting complex to PS II (5), and the formation of nonfunctional PS II reaction centers (6,7). These responses occur rapidly and are usually completed within several minutes. The long-term acclimation is much slower and it may take up to several days to complete the processes that involve changes in the composition, structure, and function of the photosynthetic apparatus as well as other photosynthesis-related components. The process occurs at different stages of gene expression, including mRNA synthesis (transcription); protein biosynthesis (translation); and post-translational modification.
Studies have shown that there is an upper limit of light intensity beyond which a cyanobacterium absorbs more energy than its energy consumption and dissipation, leading to photoinhibition and photodamage (8). The light intensity that causes photoinhibition varies in different cyanobacterial strains. Synechococcus sp. strain PCC 7002 (hereafter Synechococcus 7002), a unicellular, euryhaline cyanobacterium, is the one that can grow in the highest light intensity among the cyanobacteria tested and/or reported (9 -11). Although most of the cyanobacteria cannot tolerate a light intensity of 1000 mol photons/m 2 /s, Synechococcus 7002 can grow rapidly at a light intensity of 2000 mol photons/m 2 /s, making it a very nice organism to study the mechanisms of acclimation to high-light (HL). Synechococcus 7002 has a completely sequenced genome (http://www.ncbi.nlm.nih.gov/) and can be easily genetically transformed (12) with a versatile system (13). It has been used for studies on various aspects of photosynthetic electron transport as well as CO 2 fixation and reduction, it is also a model organism for studies in biofuel development (14,15).
Recently, physiological responses to changing light intensity have been studied (1,16,17), and some of the relevant molecular mechanisms have been described (2,4,18). However, the mechanism of HL acclimation is still not well understood. To analyze the molecular components and regulatory mechanisms of HL-acclimation networks, especially for genes/proteins involved in long-term acclimation, we studied transcription and translation of Synechococcus 7002 in response to HL conditions with methods coupling RNA sequencing (RNA-Seq) and mass spectrometry. With the Next-Generation Sequencing (NGS) technology, RNA-Seq has become a powerful tool for transcriptomic profiling (19,20). Tandem mass tags (TMT)-based quantitative proteomics has been widely used and proven to be a reliable method for determining protein expression levels (21)(22)(23). Here, we demonstrated that an integrated study coupling NGS-based RNA-Seq transcriptomics and quantitative TMT-LC-MS/MS proteomics has gained a system level understanding of the functional components involved in acclimation to HL. A large number of genes were found to be responsive to HL as demonstrated at levels of both transcripts and proteins. Further gene knockout and comparative growth analysis revealed several important molecular components of the long-term HL acclimation network in Synechococcus 7002. To the best of our knowledge, this work represents the first combined functional transcriptomic and proteomic analysis of HL response mechanisms in cyanobacteria.

EXPERIMENTAL PROCEDURES
Sample Preparation-Synechococcus 7002 cells were grown in culture tubes (⌽40 mm ϫ 200 mm) containing medium A supplemented with 1 mg/ml NaNO 3 as nitrogen source (15,24). The cultures were grown at 38°C at 250 mol photons m Ϫ2 s Ϫ1 and were bubbled with 1% (v/v) CO 2 in air. Cell density was measured on a UV-1750 spectrophotometer (Shimadzu). Cells were grown to OD 730 nm ϭ 0.7 and were inoculated at an OD 730 nm ϭ 0.05. Then the cultures were continuously illuminated at 2000 mol photons/m 2 /s for HL treatment, or at 50 mol photons/m 2 /s (normal light, NL) as the control. The cultures were grown at 38°C and were bubbled with 1% (v/v) CO 2 in air. Illumination (50 mol photons/m 2 /s or 2000 mol photons/ m 2 /s) was provided by a halogen floodlight and light intensities were measured using a LI-250A light meter (Li-COR, Lincoln, NE). The growth temperature was maintained by using a refrigerated water circulator. One milliliter of culture samples were collected and OD was measured by spectrophotometer every 8 h. For both conditions, two biological replicates were performed independently. Growth experiments were repeated at least three times to confirm the growth patterns. Cells were harvested at the early/mid-exponential phase when OD 730 nm ϭ 2.5 by centrifugation at 6600 ϫ g for 10 minutes (min) at 4°C for RNA-Seq transcriptomic analysis and proteomic analysis, respectively (Fig. 1). For both NL and HL-grown cells, the samples were aliquoted for protein and RNA extraction. A cell suspension from an exponential-phase culture grown at 38°C in medium A ϩ at a light intensity of 50 mol photons/m 2 /s or 2000 mol photons/ m 2 /s with OD 730 ϭ 1.0 contained (0.88 Ϯ 0.1) ϫ 10 8 cells or (1.13 Ϯ 0.1) ϫ 10 8 cells in 1 ml as determined by microscopic count. RNA was extracted with Trizol reagent (Invitrogen, Gaithersburg, MD) according to the manufacturer's protocols. Briefly, liquid nitrogen grinding was used for cell-disruption and Trizol reagent was added to the grinding powder (100 mg/ml). After centrifugation at 10,000 ϫ g, 4°C for 10 min to remove the cell debris, phenol/chloroform was applied for the extraction of RNA and isopropyl alcohol for precipitation of RNA at room temperature for 15 min. Then the RNA was washed with 75% ethanol to remove organic pollution and the air-dried RNA was suspended in DEPC-treated H 2 O. RNase-free DNase I (Fermentas, Hanover, MD) was used to remove the remaining genomic DNA. RNA integrity was examined on a 1% agarose gel and RNA concentration was measured on a Nanodrop 2000 (Thermo Fisher Scientific, Waltham, MA). rRNA was subsequently eliminated with RiboMinus TM Transcriptome Isolation Kits for Bacteria (Invitrogen) and Magnetic stand (Invitrogen, Gaithersburg, MD) with some modifications according to the manufacturer's manual. The cDNA libraries were constructed from 0.5 g RNA sample using a TruSeq Stranded total RNA sample preparation kit (Illumina, San Diego, CA) following the guidelines of the manufacturers. After sequencing libraries were denatured with sodium hydroxide and diluted to 14 pmol/L with hybridization buffer (Illumina), sequencing was performed on an Illumina GAIIx platform at SinoGenoMax Co., Ltd (Beijing, China). A paired-end sequencing strategy was used and the sequencing length was 81 bp. Two biological replicates were sequenced for each condition.

RNA Extraction and Illumina Sequencing-Total
Data Analysis-Quality control was carried out on the Illumina GA reads with a perl script (threshold of Q20) to filter the low quality paired-end reads. The left high quality reads of each replicate obtained from the two conditions were mapped to the complete genome sequence of Synechococcus 7002 individually with a Burrows-Wheeler Aligner (BWA) (25) allowing 3 mismatches on a read. Reads that were not mapped to the reference genome and that were mapped to rRNA-coding regions were eliminated from the alignment results, and ambiguously mapped reads (i.e. those with more than one potential match in the genome) were also removed. As previously described (26,27), the unambiguously mapped reads were used to compile a coverage profile for each sample which reflects the depth of sequence data at each position in the Synechococcus 7002 genome. For comparative purposes, coverage profiles were normalized based on the total number of unambiguously mapped reads across the genome for each sample. The expression level for a given gene in each sample was measured as the mean coverage depth for all nucleotides in that gene. Genes with coverage Ͻ2 were removed.
Protein Extraction and Digestion-The cells were washed twice with PBS buffer, resuspended in lysis buffer containing 20 mM Tris-Cl (pH 7.5), 150 mM NaCl, 1% Triton X-100, 1ϫ protease inhibitor mixture and 1ϫ phosphatase inhibitor mixture (Thermo Fisher Scientific). Samples were sonicated at 135 W for 30 min on ice using an ultrasonic processor (JY92-IIN, Ningbo Scientz Biotechnology Co., Ltd, Ningbo, China). Cellular debris was removed by centrifugation at 12,000 ϫ g for 30 min at 4°C, and the resulting supernatants were stored in aliquots at Ϫ80°C until further use. Protein concentration was determined using the 2D Quant kit (GE Healthcare Waukesha, WI) according to the manufacturer's protocol. The samples were reduced with 5 mM DTT (Sigma, St. Louis, MO) at 56°C for 30 min and alkylated with 15 mM iodoacetamide (IAA) (Sigma) for 30 min at room temperature. Samples were then digested overnight at 37°C with sequencing grade modified trypsin (1:50 w/w) (Promega, Madison, WI).
TMT Labeling-After trypsin digestion, peptides were desalted with a Strata X-C18 SPE column (Phenomenex, Torrance, CA) and vacuum-dried. Peptides from two independent samples were reconstituted in 0.5 M TEAB and processed according to the manufacturer's protocol for the 6-plex TMT kit (the labeling reagent of 128, 129, 130, and 131 was used for 4-plex labeling). Briefly, ten units of TMT reagent (defined as the amount of reagent required to label 1.5 mg of protein) were thawed and reconstituted in 40 l ACN. The peptide mixtures were then pooled and incubated for 2 h at room temperature, desalted, dried by vacuum centrifugation, and reconstituted in 10% formic acid (FA).
Strong Cation Exchange Fractionation-Peptides were fractionated using strong-cation exchange as described previously (28). In brief, strong-cation exchange was performed using a Zorbax BioSCX-Series II column (0.8 mmϫ50 mm, 3.5 m). Solvent A consisted of 0.05% FA in 20% ACN, solvent B consisted of 0.05% formic acid, 0.5 M NaCl in 20% ACN. The following gradient was used: 0 -0.01 min UPLC system. The separated peptides were analyzed with a Q Exactive TM Plus hybrid quadrupole-Orbitrap mass spectrometer (Thermo Fisher Scientific). Intact peptides were detected in the Orbitrap at a resolution of 70,000. Peptides were selected for MS/MS using 27% normalized collision energy (NCE) with 12% stepped NCE; ion fragments were detected in the Orbitrap at a resolution of 17,500. A data-dependent procedure that alternated between one MS scan followed by 20 MS/MS scans was applied for the top 20 precursor ions above a threshold ion count of 3 ϫ 10 4 in the MS survey scan with 15.0 s dynamic exclusion. The electrospray voltage applied was 1.8 kV. Automatic gain control (AGC) was used to prevent overfilling of the ion trap; 1 ϫ 10 5 ions were accumulated for generation of MS/MS spectra. For MS scans, the m/z scan range was 350 to 1600 Da. The fixed first mass was set to 100 m/z for TMT quantification.
Database Search-Raw data files were processed to generate peak list files using Proteome Discoverer software (Thermo Fisher Scientific, v. 1.3.0.339). The filtering parameters used were as follows: (1) Allowed precursor mass range was 350 Da to 5000 Da, (2) Precursor charge state was allowed from 1 to 5, (3) Signal to noise ratio was set as 1. data were processed using MaxQuant software (v.1.4.1.2) with an integrated Andromeda search engine (29,30). The protein database used for MS/MS searches was downloaded from Cyanobase (http://genome.kazusa.or.jp/cyanobase, 3,186 CDSs, released 2012) for Synechococcus 7002. Trypsin/P was specified as the cleavage enzyme allowing up to 2 missed cleavages. The precursor charge states allowed were from 1 to 5. Mass error was set to 10 ppm for precursor ions and 0.02 Da for fragment ions. Carbamidomethylation (C), TMT6plex (K) and TMT6plex (N-term) were set as fixed modifications. Oxidation of methionine (M) was set as a variable modification. Detection of at least two matching peptides per protein was set as a requirement for unambiguous identification. The TMT datasets were quantified using the centroid peak intensity with the 'reporter ions quantifier' mode. For all experiments, only unique peptides were considered for protein quantification. The peptide false discovery rate (FDR) was set to 1% and minimum peptide score was set to 13.0. The minimum peptide length was set at 7. All the other parameters in MaxQuant were set to default values.
Real-Time Quantitative Reverse Transcription PCR (qRT-PCR)-RNA was reverse transcribed into first-strand cDNA with the highcapacity cDNA reverse transcription kit with RNase inhibitor (Invitrogen). Gene transcription was measured using the SYBR Green PCR Master Mix (Applied Biosystems, Foster City, CA) and the LightCycler 480 Real-Time PCR System (Roche Diagnostics Ltd, Mannheim, Germany). The 16S or 23S rRNA gene was used as the endogenous control gene for normalizing expression of the target gene. Triplicate technical replicates were performed for duplicate cultures. ⌬CT values were obtained by subtracting the average values of experimental genes from an average of the control gene for each sample. Using a Welch approximation for unequal group variances, a p value was estimated based on the t-distribution that resulted from a betweensubjects t test evaluating the control RNA relative to a given experimental RNA. Primers used for qRT-PCR are shown in supplemental  Table S1.
Production of Polyclonal Antibodies-Anti-PsaC (PS I subunit VII), CpcG (phycobilisome rod-core linker polypeptide cpcG (L-RC 28.5)), RbcL (ribulose bisphosphate carboxylase large subunit), PsaD (PS I subunit II), ApcA (allophycocyanin alpha subunit) or SYNPCC7002_ F0063 (hereafter F0063) polyclonal antibodies were produced and purified via affinity chromatography by ABclonal Co. (Wuhan, Hubei, China). Briefly, polyclonal antibodies of PsaC, CpcG, PsaD, ApcA were generated against the following synthetic peptides: PsaC, CKA-GQIASSPRTED; CpcG, EQGEIPFNIKSPR; PsaD, VFPSGETQFLY-PLDGVPSEKVNEGR; and ApcA, CDRIKAFVGGAARLR. To produce antibodies against RbcL or F0063, the full-length cDNA of rbcL or F0063 was amplified, PCR products were cloned into the pGEX-4T expression vector (Amersham Pharmacia Biotech, Piscataway, NJ ) at the BamHI-XhoI restriction sites, and the resulting plasmid was transformed into E. coli strain BL21 (DE3) for overexpression of RbcL. Cells growing logarithmically were treated with 1 mM isopropyl-␤-D-thiogalactopyranoside (IPTG) for 4 h at 30°C. The fusion proteins were then purified by His-tag affinity chromatography. Following purification of these antigens, immunization and sampling of the anti-sera from rabbit were performed by ABclonal Co. (Wuhan, China), according to standard operating procedures. The specificity of the generated antibodies was determined by the manufacturer using ELISA and Western blotting.
Western blotting-Equal amounts of proteins (10 g) from both HL and NL grown cells were prepared as described previously, denatured in SDS sample buffer, and separated by 12% SDS-PAGE. Proteins were stained with Coomassie Brilliant Blue R250 or transferred to polyvinylidene fluoride (PVDF) membranes (GE Healthcare). After blocking with 5% nonfat milk, membranes were incubated overnight with PsaC, CpcG, RbcL, PsaD, ApcA and F0063 protein-spe-cific antibodies (1:1000 dilution), followed by a 1 h incubation with a 1:3000 dilution of peroxidase-conjugated anti-rabbit IgG (KPL, Gaithersburg, MD) at room temperature. Chemiluminescence was detected by using the SuperSignal® West Pico Chemiluminescent Substrate (Thermo Fisher Scientific) and the gray-scale of Western blots was recorded using ImageQuant TL (GE Healthcare). Immunoblots were performed in three independent experiments and bands of interest analyzed by ImageJ (http://rsb.info.nih.gov/nih-image/) were expressed as mean Ϯ S.D.
Bioinformatics Analysis-Functional enrichment analysis of differentially expressed transcripts and proteins between HL grown cells and NL grown cells was performed to identify significantly overrepresented GO terms and KEGG pathways using DAVID 6.7 (31). The significance of the enrichment was statistically evaluated with a modified Fisher's exact test (EASE score of p value) (31). For GO term enrichment, the GO FAT annotation available in DAVID was used. GO FAT is a subset of the GO term set created by filtering out the broadest ontology terms in order to not overshadow more specific ones. GO terms with p value Ͻ0.05 and fold enrichment Ͼ1.5 are considered to be significantly enriched.
Construction and Analysis of Gene Knockout Mutants-F0063 was disrupted by replacing an internal EcoRI fragment with a nonpolar cassette conferring clindamycin resistance, and psbU was disrupted by replacing an internal PstI fragment with a nonpolar cassette conferring kanamycin resistance. The gene-deleted mutants for psaC, psaD, psaF, psbC, psbO, apcF, cpcG (L-RC 28.5), SYNPCC7002_ A0568, SYNPCC7002_A1479 or SYNPCC7002_A1480 were constructed by replacement of DNA sequence with a kanamycin cassette.
The resultant plasmid was used to transform motile Synechococcus 7002 wild-type cells and one of the antibiotic-resistant transformants was selected for further study. Complete segregation of the mutation was confirmed by PCR and DNA sequencing. PCR primers for mutant construction and validation are listed in supplemental Table S1 in the Supplementary Material. The knockout mutants of 12 selected gene were comparatively grown under 50 mol photons/ m 2 /s (NL) and 2000 mol photons/m 2 /s (HL) for 72 h, respectively.

RESULTS
Overview of Transcriptomic Analysis-RNA-Seq was performed for four samples at the two light intensities in Synechococcus 7002, and produced 34 million (81bp) uniquely non-rRNA reads. The obtained reads represent an average of ϳ130 times Synechococcus 7002 genome lengths and transcripts were detected for nearly all the predicted ORFs. After filtering, a total of 7,549,172 and 12,874,548 effective reads were obtained when growing Synechococcus 7002 under HL and NL conditions, respectively. Further analysis showed that 3180 and 3179 out of the whole 3186 genes in the genome were covered under HL stress and NL, respectively (supplemental Table S2). Candidate genes involved in HL adaption were chosen according to the following criteria: (1) more than 2-fold change after normalization, and (2) statistically significant level p Ͻ 0.05. Finally, the transcription of 526 genes was detected to be associated with HL response, including 311 up-regulated genes and 215 down-regulated genes (Table S3).
Overview of Quantitative Proteomics Analysis-Protein samples of HL grown and NL grown Synechococcus 7002 cells were subjected to TMT-based proteomic analysis. The overview of the proteomic results such as protein mass distribution, peptide distribution and length of peptides are presented in supplemental Fig. S1. A total of 1746 proteins and 25,229 unique peptides were quantified in our experiment (supplemental Table S4andsupplemental Table S5), representing 54.8% of the 3186 predicted proteins in the Synechococcus 7002 proteome (http://genome.kazusa.or.jp/ cyanobase). Using a cutoff of 1.40-fold change and a p value less than 0.05, we determined that 233 proteins were differentially regulated under HL stress. Among these proteins, 128 were up-regulated and 105 were down-regulated upon HL exposure (supplemental Table S6). All raw data has been deposited in the PeptideAtlas database (http://www. PeptideAtlas.org) with the identifier PASS00642.
Validation of Changes in Gene Expression Using qRT-PCR Analysis-To validate the RNA-Seq results, qRT-PCR was used to quantify changes in the transcript levels of 20 selected genes after HL treatment. These genes have different transcript abundance, length, change tendency (upregulated, down-regulated or unchanged) and distribution (on chromosome or on plasmid). The RNA-Seq and qRT-PCR results for the 20 tested genes were strongly correlated (R 2 ϭ 0.95, slope ϭ 0.9855), and the expected trend in the expression pattern was obtained ( Fig. 2A and 2B). These results further proved that our transcriptome data were reliable.
Validation of Changes in protein Expression Using Western blot Analysis-To confirm the results from the proteomic study, Western blot analysis were performed to examine the expression status of several of the quantified proteins; PsaC, CpcG, RbcL, PsaD, ApcA and F0063. Results from Western blot and densitometric analysis were consistent with the quantitative proteomic results (Fig. 2C and 2D), confirming the reliability of our proteomic data. The transcript levels of these genes were determined by qRT-PCR, which were also consistent with the RNA-Seq results. The primers for qRT-PCR are listed in supplemental Table S1.
Comparison of Transcriptome and Proteome Data-To estimate the reproducibility of the TMT-based quantitative proteomic and RNA-Seq results, linear regression analysis based on the log 2 -transformed protein ratios or gene coverage depth was performed for pair-wise comparison of the two experiment replicates. The observed R 2 values revealed a relatively strong linear correlation between the two experiment replicates for both quantitative proteomic and RNA-Seq data (supplemental Fig. S2). These findings indicate a high level of reproducibility between replicate data sets.
We conducted a correlation analysis between the quantitative proteomic and RNA-Seq transcriptomic data. Globally, the expression levels of all the quantified proteins and their corresponding mRNAs showed limited correlation (r ϭ 0.2390) (Fig. 3A). However, a higher correlation was observed between the differentially expressed proteins (DEPs) and their corresponding mRNAs (r ϭ 0.4074) (Fig. 3A). The expression ratio of proteins and their corresponding mRNAs with the same or different direction of change (both Ͼ 1 or both Ͻ1) were also plotted, and higher positive or negative correlation was indicated ( Fig. 3B and C).
Transcriptomic analysis identified 526 differentially expressed genes (DEGs), and 293 of them have quantitative information on their respective proteins (55.7%) as detected by MS (supplemental Table S3). A total of 64 genes were detected to be regulated at both transcription (Ն 2-fold and p value Յ 0.05) and translation (Ն 1.4 fold and p value Յ 0.05) levels, of which 50 genes have the same direction of change and 16 genes have the opposite direction of change in the two levels. The 526 genes differentially expressed at the transcript level and 233 genes differentially expressed at protein level (64 genes were differentially expressed on both mRNA and protein level) under the HL condition were classified into 27 categories according to their GO function (Fig. 4A). We also classified the 233 DEPs in to different groups according to their GO function (Fig. 4B). These genes and proteins were mainly involved in photosynthesis and related pathways, transport and binding, energy metabolism, DNA replication and repair, transcription, and translation.
Our transcriptomic and proteomic analyses showed that many hypothetical proteins coding genes were differentially regulated after HL treatment (Fig. 4), 211 of the 526 (40.1%) differentially expressed transcripts were hypothetical genes, and 77 of the 233 (33.0%) DEPs were hypothetical proteins (supplemental Tables S3 and S6). We suggest that these hypothetical genes are indeed expressed and may play important roles in the HL acclimation.
We constructed a heatmap (Fig. 5A) to compare the expression patterns of the 1746 quantified proteins and their corresponding transcripts and the expression patterns of the 233 HL-induced proteins and their corresponding transcripts (Fig. 5B), the heatmap patterns also show the lack of correlation between mRNA levels and proteins.
The mRNAs/proteins were selected to represent six functional groups, of relevance to photosynthesis, protein production and chlorophyll biosynthesis (Fig. 5C). Genes encoding subunits of photosystem I (PS I and PS II were all significantly down-regulated at the protein level. However, at the mRNA level, genes encoding subunits of PS I were shown to be unchanged or down-regulated whereas genes encoding subunits PS II tended to be up-regulated. Gene and protein expression of subunits of phycobilisomes (PBS) or NADH dehydrogenases tended to change in the same direction, whereas gene and protein expression of enzymes involved in chlorophyll biosynthesis or ribosome subunits tended to change in the opposite direction. Fig. 6, The KEGG pathway enrichment analysis showed that three KEGG pathways were differentially regulated at both mRNA and protein level: "Photosynthesis-antenna proteins" (syp00196), "Photosynthesis" (syp00195), and "Oxidative phosphorylation" (syp00190) (supplemental Table S7-1). GO term enrichment analysis showed that several biological processes, including "photosynthesis", "oxidative reduction", "electron transport chain", "generation of precursor metabolites and energy" and "photosynthesis, light reaction" were significantly enriched in both DEPs and DEGs, suggesting that these processes are very active during the HL treatment (supplemental Table S7-2). GO cellular component terms, including "thylakoid," "photosynthetic membrane," "light-harvesting complex," and "thylakoid membrane" and GO molecular function terms, including "electron carrier activity" and "oxidoreductase activity, acting on NADH or NAPDH" were all significantly enriched in both DEPs and DEGs (supplemental Table S7-3 and supplemental Table S7-4).

Changes in the Transcript and Protein Abundance of Genes Encoding Thylakoid-Located Complexes
PS II-Related Genes and Proteins-The psb genes, which encode subunits of PS II, were predominantly induced at the transcript level under HL stress. The three psbA genes that encode the D1 protein in cyanobacteria are under strict regulation to guarantee the proper functioning of the PS II (32). The transcription of these genes is modulated in response to changes in light intensity and O 2 level (33)(34)(35). The transcript abundance of three psbA genes (SYNPCC7002_A1418, SYNPCC7002_A0157, SYNPCC7002_A2164) in Synechococcus 7002 increased about 3-to 5-fold when grown under the HL condition. Similar induction pattern by HL treatment was observed in the psbD (SYNPCC7002_A2199) that encodes the reaction center D2 protein and psbM. Transcripts of genes involved in the oxygen-evolving complex, psbO, psbU and many other genes that encode small subunits of PS II, including psbC, psbD1, psbB, psbP, psbL and psbJ were all slightly induced by the HL treatment. However, all the subunits of PS II that were quantified by mass spectrometry were significantly decreased at the protein level (supplemental Table S8).
PS I-Related Genes and Proteins-In contrast with the case of genes encoding subunits of PS II, the PS I genes generally decreased at the transcript level. The quantitative proteomic analysis revealed that the PS I subunits were all significantly reduced under HL stress. Declining PS I content would be expected to lower the susceptibility of the cells to HL damage particularly under prolonged exposure (9).
Phycobilisome-Related Genes and Protein-Phycobilisomes serve as the main antennae for photosynthesis in cyanobacteria, and they transfer excitation energy to both photosystems (36,37). There was an overall decrease in transcript levels for phycocyanin and phycocyanin-associated linker proteins (3-to 6.4-fold; supplemental whereas the transcript level of allophycocyanin-associated genes decreased to a lesser extent (maximally ϳ2-fold reduction; supplemental Table S8). Additionally, the protein expression levels of phycobiliproteins were all significantly decreased.
The ndh Genes and Proteins-The transcript levels of many genes encoding NADH dehydrogenase subunits were increased 1.5-to 4-fold (supplemental Table S8). Transcripts for ndhD2 were 5.7-fold higher in cells exposed to HL than in normal conditions (supplemental Table S8). However, transcripts for ndhD1, encoding a paralogous form of NdhD subunit for the Type-1 NADH dehydrogenase complex remained unchanged in HL grown cells. Almost all of the NADH dehydrogenase subunits were up-regulated (1.4-to 2-fold) at the protein level. The Type-1 NADH dehydrogenase complex is required for cyclic electron flow (38), so we predict that it has a preferential involvement of cyclic electron flow at HL intensity.

Changes in the Transcript and Protein Abundance of the Other Genes-
Calvin-Benson-Bassham Cycle (CBB Cycle)-The transcript levels of many genes involved in the CBB cycle were 2to 3.5-fold higher in cells exposed to HL (supplemental Table  S8). The protein expression of the large subunit of ribulose-1,5-bisphosphate carboxylase/oxygenase (RuBisCO), RbcL was significantly induced. Genes encoding the structural components of carboxysomes (ccmK, ccmL, ccmM, ccmN) were also significantly increased at transcript level (1.5-to 4-fold), although they were only slightly up-regulated at the protein level.
CO 2 Uptake Mechanism-Interestingly, transcript levels for the genes encoding the so-called inducible CO 2 uptake mechanism (ndhD3, ndhF3, cupA, and cupS) (39) were significantly induced (maximum increase of ϳ6-fold); however, the transcript levels for the constitutive CO 2 -concentrating mechanism (ndhD4, ndhF4, cupB) remained constant, which disagrees with results obtained in previous studies on shortterm exposure to HL in Synechocystis sp. PCC 6803 and Synechococcus 7002 (15,40).
Chlorophyll Biosynthesis-Most genes encoding the enzymes of chlorophyll biosynthesis remained constant at the transcript level when grown under HL-condition, with the exception of the transcripts for chlH (SYNPCC7002_A1000, SYNPCC7002_A1018) and chlL (SYNPCC7002_A2347) genes, encoding magnesium-chelatase and protochlorophyllide reductase, which increased ϳ2-fold upon HL treatment. However, six proteins in chlorophyll biosynthesis were found to be significantly decreased about 1.4-to 1.8-fold, which are ChlM, HemF, HemJ, Hox1, PcyA, and Por. Flavoproteins-Flavoproteins have previously been reported to act as oxygen photoreductases in Synechocystis sp. PCC 6803 (41,42). HL treatment increased the transcript level of SYNPCC7002_ A1321, a flavoprotein coding gene by ϳ2-fold. The transcript level for SYNPCC7002_A1743, another flavoprotein coding gene, remained unchanged. However, the protein expressions of these two genes were both significantly induced. This observation is in accordance with their functions as catalysts in dissipation of excess electrons via the Mehler reaction (42).
Chaperone and ROS Scavenging Enzyme-Despite the increasing energy consumption during HL, the cells could suffer damage from ROS at HL. Thus, genes that have chaperoninlike roles such as heat shock proteins were expected to be up-regulated. Our results show that genes encoding the molecular chaperones increased 2-to 6-fold at the transcript level (supplemental Table S8). However, protein products of these genes only slightly increased or remained constant (1.7-fold at maximum).
Genes that encode scavenging enzymes for ROS were also expected to be up-regulated under HL conditions in which production of ROS may be accelerated (43). However, only SYNPCC7002_A0970, which encodes glutathione peroxidase, was induced at both the transcript and protein level (ϳ5and 2-fold, respectively). Transcription of other antioxidant enzymes such as katG (SYNPCC7002_A2422, catalase), sodB (SYNPCC7002_A0242, Mn-superoxide dismutase), and SYNPCC7002_A0117, which encodes another glutathione peroxidase, was not significantly affected. These results suggest that glutathione peroxidase encoding by SYNPCC7002_A0970 may have an important function in resistance to ROS under HL stress.
High-Light-Inducible Polypeptides (HLIPs)-The hli genes, present in cyanobacteria, algae and vascular plants, encode small proteins [high-light-inducible polypeptides (HLIPs)]. In our study, three HLIPs were found to be significantly induced after exposure to HL. The protein expression of SYNPCC7002_A0858 (hliA) was significantly induced (ϳ2.5fold), whereas the transcript level remained constant. SYNPCC7002_A0186 was up-regulated at the transcript level (about 3.4-fold), but the protein expression of this gene was not detected by proteomic profiling. The expression of SYNPCC7002_A0602 (hliA) significantly increased at both the mRNA and protein levels.
Characterization of HL-Sensitive Mutants-To confirm the involvement in HL response, 12 genes differentially regulated under HL stress were selected for gene mutant construction, including three subunits of PS I (psaC, psaD, psaF), three subunits of PS II (psbC, psbO, psbU), allophycocyanin beta-18 subunit apcF and phycobilisome rod-core linker polypeptide cpcG (L-RC 28.5). Four genes encoding hypothetical proteins (SYNPCC7002_A0568, SYNPCC7002_A1479, SYNPCC7002_A1480, SYNPCC7002_F0063; hereafter A0568, A1479, A1480, and F0063) were also selected for mutant construction. These selected genes were all significantly down-regulated at the protein level upon HL stress as determined by quantitative proteomics, except for F0063, which was significantly up-regulated (1.65-fold, p Ͻ 0.05), and A0568, which was slightly up-regulated (1.33-fold, p Ͻ 0.05). Four genes (psaC, psaD, A1479 and A1480) were downregulated at the transcript level, whereas A0568 was significant induced (about 10-fold, p Ͻ 0.05). However, the mRNA expression levels of other genes were not changed.
As shown in Fig. 7, the growth of the knockout mutants ⌬A1479, ⌬A0568, ⌬psbC and ⌬psaC did not show much difference when compared with wild-type under both NL and HL conditions, suggesting that these genes were not indispensable for the normal growth and HL acclimation. The inactivation of pasD, psaF, psbO, psbU, and F0063 also showed slower growth under NL condition (p Ͻ 0.01), and these mutants could not grow at all under HL treatment, suggesting that they were HL-sensitive lethal mutants. These mutants are necessary for the normal growth of Synechococcus 7002 and are indispensable in HL acclimation. Comparative analysis showed that although there was no visible difference in terms of growth patterns between the wild type and the mutants under the NL condition, the ⌬cpcG mutant grew slower than the wild type under HL stress (p Ͻ 0.01), suggesting that it is more sensitive to HL and may be involved in HL resistance. Interestingly, the growth of ⌬apcF and ⌬A1480 showed no difference in comparison with wild type under NL, but these two mutants grew faster than wild type under the HL condition (p Ͻ 0.01).

DISCUSSION
Synechococcus 7002 is known to be extremely tolerant to HL intensity (9,10), and sunlight intensity is one of the key environmental factors in natural habitats of cyanobacteria. Global investigation on HL acclimation of cyanobacteria has been conducted at transcript level (15,40). This transcriptomic data revealed extensive changes in cellular transcript levels in response to HL stress and provided novel insights into the molecular mechanisms of response to HL stress and the genes of potential importance for the adaptation of HL in Synechococcus 7002. Although informative, transcript abundances do not necessarily reflect cellular protein levels because protein expression is influenced by an array of posttranscriptional regulatory mechanisms and the correlation between protein and mRNA levels is generally modest (44 -46). It is necessary to analyze the response of Synechococcus 7002 to HL at both the transcriptomic and proteomic levels in an effort to gain systems-level information. Therefore, we employed an integrated quantitative proteomic and transcriptomic approach to investigate the HL acclimation mechanisms of Synechococcus 7002, aiming at identifying novel gene components and regulatory mechanisms in response to HL acclimation.
In this study, the RNA-Seq combined with the quantitative proteomic analyses showed that 526 genes and 233 proteins were differentially regulated and could be related with the response and resistance of Synechococcus 7002 to HL stress. A conceptual model that summarizes the function of the key DEPs was developed to decipher the global molecular mechanisms involved in HL responses in Synechococcus 7002. As shown in Fig. 8, HL stress induced the degradation of phycobiliproteins or the reduction of phycobilisome size. Photosystem content was also reduced to avoid absorption of excess light energy. These changes may originate from the down-regulation of genes that encode enzymes for biosynthesis of photosynthetic pigments (hem and chl genes), structural components of phycobilisome (apc and cpc genes), and subunits of photosystems (psa and psb genes). Although many psb genes did not change or were only slightly induced at transcript levels, psbA genes, which encode D1, were strongly up-regulated. It is likely that the elevated level of psbA transcripts makes the increasing turnover rate of the D1 protein under HL conditions possible (33). In contrast with photochemical reactions down-regulated by HL, CO 2 fixation was accelerated. It is not surprising that the rbcL gene was significantly induced at both the transcript and protein levels, because the reaction catalyzed by RuBisCO is the primary rate-limiting factor of the CBB cycle under saturating light intensity. The ccm genes encoding components of the CO 2concentrating mechanism were also induced. Up-regulation of ndh genes, involved in high affinity CO 2 uptake (47, 48) may also help to increase the availability of CO 2 under HL. Despite the increasing energy consumption during HL, the cells could suffer damage from ROS at HL. Proteins encoding ROS scavenging enzymes (SYNPCC7002_A0970, glutathione peroxidase) and proteins that have chaperonin-like roles, GroES (co-chaperonin GroES) and HtpG (heat shock protein 90), were up-regulated. Accumulation of HLIPs under HL intensities proved to be associated with the pigment alteration (i.e. decrease in light-harvesting pigments, accumulation of the carotenoid myxoxanthophyll and decrease in PS I-associated chlorophylls) and stabilization of PS I trimers to protect cells under HL stress (49 -52). In our study, three HLIPs in Synechococcus 7002 were found to be significantly induced after HL exposure. It is likely that the HLIPs in Synechococcus 7002 are critical for survival when absorbing excess excitation energy and may allow the cells to cope more effectively under HL conditions. Enzymes in the carotenoid synthesis pathway were all significantly induced at the transcript level (about 9fold at maximum), and two enzymes, CruG (carotenoid 2-Orhamnosyltransferase) and CrtP (phytoene desaturase) were significantly up-regulated at the protein level (1.5-fold). Previous studies have shown that orange carotenoid protein (OCP)-related non-photochemical -quenching mechanism is a very important regulatory mechanism in the quenching and dissipation of excess light energy in some cyanobacteria (53,54). Therefore, we deduce that this is another protection mechanism against HL-caused damage in Synechococcus 7002.
Under conditions of HL stress, cells also systematically regulated their transcription, translation, and post-translational modification functions (supplemental Table S8). For example, Protein expression of enzymes in DNA replication, repair and modification increased (For example, ParA, GvrA, and PhrA). Transcriptional regulators (NusB, SigD, SYNPCC7002_ A2523), and ribosomal proteins (RpsE, RplB, RplU, and RplT) were also found to be significantly induced at the protein level (supplemental Table S8). We suggest that Synechococcus 7002 bacteria accelerate the key processes in genetic central dogma to cope with the increased demands of DNA damagerepairing and protein synthesis under HL exposure.
Comparative growth analysis of the knockout mutants of HL-responsive genes and the wild type under NL and HL condition revealed several important HL-sensitive mutants and HL-sensitive lethal mutants, which will be discussed below.
⌬cpcG was revealed to be a HL-sensitive mutant, which grew normally under NL, but was inhibited under HL. CpcG1 (homolog of SYNPCC7002_A0811), which connects the rods with the core of major allophycocyanins was reported to be involved in state transitions in Synechocystis 6803 (55). Our results support the previous notion that state transitions are important in acclimating to light intensities (3,56), and suggested that cpcG may contribute to the HL acclimation of Synechococcus 7002.
In Synechococcus 7002, ApcF plays an important role in energy transfer from PBS to PS II. ⌬ApcF mutant strain showed a higher tolerance to HL stress than the wild type (Fig.  7D), which is in accordance with a previous study (37). This is possibly because when apcF is inactivated, energy transfer from PBS to PS II is less efficient. The ⌬A1480 mutant strain showed the same growth pattern as the ⌬ApcF, indicating that it may function as a negative regulator in HL acclimation mechanisms (Fig. 7D).
in Synechococcus 7002 are important for normal cell growth and are indispensable in HL acclimation process.
The PsaD subunit, as a conserved peripheral protein on the reducing side of PS I, is involved in the docking of ferredoxin to PS I reaction centers and assembly of other peripheral subunits (66). A previous study in Synechocystis 6803 revealed that the photoautotrophic growth of ⌬psaD is much slower than that of wild type cells (67). The PsaF subunit of PS I also has dispensable accessory roles in the function and organization of the complex (68). Therefore, it was not surprising that ⌬psaD and ⌬psaF mutant of Synechococcus 7002 could not grow under HL and showed slower growth rate under NL compared with wild type.
F0063, with its putative role as an acetyltransferase (belongs to GCN5-related N-acetyltransferase (GNAT) superfamily) (supplemental Fig. S4), was significantly induced upon HL treatment. Previous phosphoproteomic analysis of Synechococcus 7002 indicated that post-translational modifications are deeply involved in the photosynthesis process (69). It is well established that lysine acetylation is one of the most common post-translational modifications to proteins in both eukaryotes and prokaryotes and plays important roles in many cellular physiological processes (70,71). Up-regulation of F0063 under HL conditions suggests that protein acetylation may be an important regulation mechanism in response to HL stress.
In conclusion, the integrated transcriptomic and proteomic analysis revealed multiple levels of regulation in response to HL in Synechococcus 7002, including possible post-translational regulation. Comparative growth analysis of the knockout mutants led to the identification of key genes involved in response to HL in Synechococcus 7002. Our results provide novel insights into the global response mechanisms to HL in Synechococcus 7002 and may be valuable for further studies addressing HL resistance in photosynthetic organisms in general. □ S This article contains supplemental Figs. S1 to S4 and Tables S1 to S8.