High Resolution Electron Transfer Dissociation Studies of Unfractionated Intact Histones from Murine Embryonic Stem Cells Using On-line Capillary LC Separation

Epigenetic regulation of chromatin is dependent on both the histone protein isoforms and state of their post-translational modifications. The assignment of all post-translational modification sites for each individual intact protein isoform remains an experimental challenge. We present an on-line reversed phase LC tandem mass spectrometry approach for the separation of intact, unfractionated histones and a high resolution mass analyzer, the Orbitrap, with electron transfer dissociation capabilities to detect and record accurate mass values for the molecular and fragment ions observed. From a single LC-electron transfer dissociation run, this strategy permits the identification of the most abundant intact proteins, determination of the isoforms present, and the localization of post-translational modifications.

Over the last two decades, the integration of the unanticipated discovery of new modes of internal energy deposition and rapid technological progress in mass spectrometrybased instrument platforms has revolutionized our experimental capabilities to effectively study the inherent complexity of human and other mammalian proteomes. This includes the rapid identification of proteins, the detection and unambiguous assignment of protein isoforms, and the detection and localization of post-translational modifications (PTMs). 1 The vast majority of mass spectrometry-based studies in protein biology and proteomics has used proteolytic digestion using trypsin and analysis of the resulting peptide mixtures by a variety of MS and MS/MS approaches (1)(2)(3)(4). Although of considerable use for many purposes, this common strategy has a variety of limitations especially from the view of acquiring the information needed to establish and understand biological function. These limitations include the loss of information regarding the nature of protein isoforms and a loss of knowledge of the global presence of multiple PTMs that can co-occur at differing sites on the same protein molecule. These deficiencies are exacerbated when the portions of protein sequence coverage observed in a digest are low because peptides that would reveal the modifications or isoform-specific sequence variances may not have been detected. Additionally, the sequence identification for fragment ion spectra representing peptides with modifications that are unanticipated can be difficult to assign using common search algorithms.
Hence, structural information covering an entire protein sequence has been lacking, and although needed from a protein biology viewpoint all along, the kind of methodology that would be required to obtain this knowledge has begun to emerge only recently. This has come about through the discovery of electron capture by polyprotonated peptide and protein species (5) and the more recent development of electron transfer by their reaction with suitable radical anions (6). Because these physicochemical processes deposit sufficient internal energy at the sites of charge reduction to cleave peptide backbone bonds promptly (non-ergodically), the distribution of cleavage sites reflects the potential positions of protonation along the peptide/protein backbone without the usual redistribution of vibronic energy that occurs following collisional activation. Thus, these energy deposition techniques are being exploited in Fourier transform-ion cyclotron resonance (7,8) and Orbitrap platforms (9) and are being developed further to provide access to fragmentation information on peptides significantly larger than those derived by tryptic digestion and smaller proteins even on a chromatographic time scale.
As noted above, these powerful new fragmentation techniques are able to generate a greater frequency of protein backbone cleavages even as peptide/protein size is increased by more than a factor of 100 over tryptic peptides. In addition, labile covalent modifications are generally retained because internal energy randomization is minimal (10,11). These properties of electron capture dissociation (ECD) and electron transfer dissociation (ETD) have provided the experimental basis (12)(13)(14) for the rigorous analysis of peptides/proteins bearing variable combinations of multiple PTMs for the first time, providing revolutionary advantages of particular importance for many challenges in the characterization of biological systems.
One class of small highly basic proteins seems to provide a virtually ideal test bed for further development of these methods, namely the histones. Histones are the archetypical class of chromatin regulatory and packaging substances bearing a yet to be determined complex variety of reversible post-translational modifications, including but not limited to acetylation, methylation, ubiquitination, and phosphorylation. Even the exact same protein sequence may be modified by multiple PTMs in varying combinations and levels of site occupancy. There are over 60 residues on the four core histones subject to one or more reversible modifications by the small moieties noted above (15)(16)(17)(18)(19)(20). These proteins are not only integral components of chromatin and the nature of its diverse functional repertoire but play a highly complex dynamic role in virtually all cellular processes. It is hypothesized that particular combinations of PTMs embody a histone code that is recognized by effector proteins (mostly still unknown) that are recruited specifically to modified histones to carry out essential functional processes on chromatin (21). Disruption of these sites and modifications and/or the enzymatic processes that regulate particular occupancy states is intimately associated with a wide variety of human diseases from developmental diseases to cancer (22). Thus, gaining an understanding of the processes controlling the modulation of chromatin structure and its exquisite differential accessibility by myriad enzymatic activities by the covalent code written on members of the family of histones is a major challenge of our time. Recently, even -amino methyl reversibility has emerged through the discovery of demethylases that operate with precise site and methylation state specificity (23).
Historically the chromatin field has relied on immunochemical methodology to probe structure-function studies for many years. Only recently has mass spectrometry begun to play an important role in identifying the types, combinations, specific sites, and abundance of PTMs (15,16). Despite recent contributions from several laboratories, comprehensive structural characterization of histone modification patterns remains a major analytical challenge. As a result, efforts aimed at facilitating and gaining a comprehensive characterization of multiple post-translational site occupancies are an important pursuit that will accelerate progress in revealing a fundamental knowledge of structural/functional relationships.
Here, we describe progress on intact protein analysis by on-line LC-ESI Orbitrap ETD-MS/MS of unfractionated histone mixtures. These studies provide information on protein identification, protein isoform determination, and localization of PTMs for the most abundant species with minimal sample preparation. We also show that more detailed PTM occupancy information can be obtained from longer sequences derived by Asp-N digestion.

EXPERIMENTAL PROCEDURES
Sample Preparation-Mouse embryonic stem cells were grown in Dulbecco's minimum essential medium supplemented with 5% fetal bovine serum. Cells were harvested by trypsinization and centrifugation at 1500 rpm with two PBS washes. The nuclei were collected as described previously (24) except the cells were resuspended in nuclear isolation buffer containing 20 mM HEPES (pH 7.4), 1.5 mM MgCl 2 , 10 mM KCl, 0.5 mM DTT, 0.1% Triton, 10 mM sodium butyrate, 2 mM PMSF, 1ϫ phosphatase inhibitor mixture, 1ϫ protease inhibitors (pepstatin, aprotinin, E-64, and leupeptin) (Sigma). Cells were lysed using Dounce homogenization on ice, and nuclei were pelleted at 1200 rpm for 5 min at 4°C and washed twice with nuclear isolation buffer. Nuclei were resuspended in Triton extraction buffer (PBS, 10 mM sodium butyrate, 0.5% Triton, 2 mM PMSF, 1ϫ phosphatase inhibitor mixture, 1ϫ protease inhibitors) and washed twice with Triton extraction buffer. Histones were then extracted into 0.4 N H 2 SO 4 at 4°C for an hour with constant rotation. The sample was centrifuged at 14,000 ϫ g for 15 min, the supernatant was collected, and the histones were precipitated with 20% trichloroacetic acid (v/v, final) and incubated for 1 h at Ϫ20°C. The precipitated histones were pelleted at 14,000 ϫ g for 15 min at 4°C, and the pellets were washed once with ice-cold acetone containing 0.1% HCl and once with ice-cold acetone alone. The protein pellet was then dried, resuspended in water, and stored at Ϫ20°C. Histone samples were analyzed directly by mass spectrometry (intact protein analysis) or following digestion with endoproteinase Asp-N (Roche Applied Science) (1:20) at pH 8 for 4 h.
On-line LC-ESI Orbitrap ETD of Intact Histones-Approximately 45 pmol of total, unfractionated histones were separated by reversed phase chromatography on line with a Thermo LTQ-Orbitrap mass analyzer by use of a Waters nanoACQUITY ultraperformance LC (UPLC) system with a 75-m-inner diameter ϫ 250-mm-long C 18 column (300-Å pore size) packed with 1.7-m-diameter particles (Waters). The histones were separated using a gradient of 10 -40% solvent B over 55 min (solvent A, 0.1% formic acid in water; solvent B, 0.1% formic acid in 100% acetonitrile) at a flow rate of 200 nl/min. The eluate was detected directly by mass spectrometric analysis. Mass spectrometry data were acquired on a hybrid LTQ-Orbitrap equipped with ETD (Thermo). After a precursor scan of the molecular ions was measured in the Orbitrap by scanning from m/z 400 to 1200, the two most intense multiply charged precursors were selected for ETD analysis in the Orbitrap. Both the molecular and the fragment ions were detected in the Orbitrap with a resolution of 60,000 (at m/z ϭ 400). Molecular ions for fragmentation were isolated with a 10 m/z isolation window and 50-ms ETD activation time for fragmentation. The automatic gain control target value for the fluoranthene anions was 2 ϫ 10 5 with a maximum injection time of 500 ms. The ion/ion reactions were conduction in the linear ion trap before being sent to the Orbitrap for high resolution m/z analysis. The ETD spectra were acquired with an automatic gain control target value of 3 ϫ 10 5 , and each scan included two microscans with an FTMS maximum injection time of 500 ms.
On-line LC-ESI Orbitrap ETD of Asp-N Histone Digests-Unfractionated histone Asp-N digests were analyzed by mass spectrometry as above except the peptides were separated using a two-step gradient of 2-20% solvent B over 35 min followed by a 5-min gradient from 20 to 50%. After a precursor scan of the molecular ions was measured in the Orbitrap by scanning from m/z 350 to 1200, the four most intense multiply charged precursors were selected for ETD analysis in the Orbitrap. The molecular and the fragment ions were detected in the Orbitrap with a resolution of 30,000 and 15,000 (at m/z ϭ 400), respectively. Molecular ions for fragmentation were isolated with a 3 m/z isolation window and 200-ms ETD activation time for fragmentation.
Data Analysis-Data were analyzed using a combination of Xtract in Qual Browser version 2.0 (Thermo Scientific) and two in-house softwares, Protein Prospector and Fragment Assignment by Visual Assistance (FAVA) (25). Xtract was used for the deconvolution of the molecular ion and for peak list generation for the fragment ion spectra. As ion transient data are unavailable on the Orbitrap, each fragment ion spectrum used was an average spectrum composite of multiple fragment ion spectra of the same protein species across the chromatographic peak (ranging from two to five MS/MS spectra). As detailed in the data acquisition settings above, each individual spectrum is the average of two ion transients (microscans) each. These data were then entered into Protein Prospector version 5.3.2 (26), and a search was performed against the Mus musculus entries in the UniProtKB database (downloaded on July 7, 2009 with 60,970 entries). Precursor and fragment mass tolerances of 30 ppm were allowed. Considered modifications were protein N-terminal acetylation, loss of N-terminal methionine, (mono-, di-, and tri-) methylation of lysine residues, acetylation of lysine residues, and phosphorylation of serine and threonine residues. Initial acceptance criteria required a reported expectation value of less than 0.1. All results were then manually verified. Additional undefined mass modification searches were performed; however, no unexpected proteins were identified. Determination of all protein identifications/isoforms and thorough localization of PTMs were performed using FAVA software (25) to accurately assign the fragment ions. The ion assignments, including those used for determination of sites of post-translational modifications, were manually verified. For quantitation of the variably modified histone H4 N-terminal tail peptides, the chromatographic peak areas (following Asp-N digestion) were summed, and the relative abundance of each modified peptide was determined as the average percentage of the total for three biological replicates.

RESULTS
Histones were acid-extracted from mouse embryonic stem cells. This total histone isolate was subjected to separation by high pressure RP C 18 chromatography with controlled particle stationary phase and mass spectrometric analysis using ETD on an electrospray LTQ-Orbitrap. The high mass resolution data set was analyzed using a combination of two in-house software packages, Protein Prospector (version 5.3.2) and FAVA (25) (Fig. 1).
The core histones, histones H2A, H2B, H3, and H4, eluted over ϳ15 min (Fig. 2a). The base peak chromatogram (Fig. 2b) shows that the separation achieved resolved approximately seven histone molecular species with elution time profiles in the range of 1-2 min as shown in the extracted ion chromatograms of the most abundant species in each chromatographic peak (Fig. 2c). For each chromatographic peak, the protonated monoisotopic molecular masses of the most abundant species are indicated ( Fig. 2c and Table I) using the Xtract software based on THRASH (27). Because of the highly mod- ified nature of the many histone H3 isoforms present, the ion intensity is dispersed across a multitude of variably modified histone H3 species so that the base peak intensities for the molecular ions appear low compared with those observed for the H4, H2B, and H2A isoforms. These H3 variants eluted between 54 and 58 min as indicated in the total ion chromatogram (Fig. 2a) where regions of high total ion abundance are observed centered around 54, 56, and 57.5 min. In addition, the isotopic envelopes of each of the modified species are overlapping as the modified species are not chromatographically resolved. For these reasons, the Xtract software was unable to deconvolute the molecular ion spectra with the exception of the peak at 56 min where the most abundant ion was determined to be 15,347.56 as indicated in the total ion chromatogram in Fig. 2a and listed in Table I.
Intact Histone Identification-A summary of the proteins identified by on-line RP UPLC Orbitrap ETD of unfractionated histones is provided in Table I. The specific isoform, modifications, and sites of modifications are listed where applicable. In all cases, the parent masses were detected within 7-ppm mass error. The ETD fragment ions detected were sufficient to determine the histone isoform and localize the sites of PTMs. It is important to note that although these are the most abundant ions present they are not the only species present. As described above, multiple histone H3 isoforms are present; however, the ability to determine the monoisotopic mass as well as to detect and isolate the molecular ion in a data-dependent fashion is hindered because of the large number of modifications and lack of their separation with RP LC.
Use of ETD Fragment Ions for Determination of Isobaric Histone H2B Isoform-The measured MH ϩ parent mass of 13,797.5289 Da for the third peak in the extracted ion chromatogram ( Fig. 2c) corresponds to three possible histone H2B variants, all with a mass of 13,797.5349 Da (Ϫ0.431-ppm mass error): histone H2B type 1-F with a loss of a methionine residue, histone H2B type 3-B with a combined loss of a methionine residue and addition of two methyl moieties, and histone H2B type 1-M with a loss of a methionine residue. The sequence alignment for these three histone H2B sequence variants is shown in Fig. 3a. Interpretation of the on-line ETD fragment ion spectrum enabled the conclusive assignment of histone H2B type 1-F based on the presence of 62% corresponding fragment ion coverage, which includes 30 unique c ions (Fig. 3, b and c). In addition, no ions were detected that could uniquely identify the histone H2B variants, type 1-M or type 3-B, suggesting that only the histone H2B type 1-F isoform is present.
Identification and Localization of PTMs on Intact Histone H4 -The broadband molecular ion spectrum for histone H4 (m/z 629.14, z ϭ 18) is shown in Fig. 4a. Isotopic resolution was achieved with an Orbitrap resolving power of 60,000 (at m/z 400); there is sufficient resolution to determine the charge state and monoisotopic mass of the intact histones (Fig. 4b).

On-line ETD Analysis of Intact Histones
The deconvoluted monoisotopic masses of the most intense species in the molecular ion spectrum are shown in Fig. 4c. The base peak, 11,300.3904, correlates to histone H4 with the loss of a methionine residue and the addition of one acetyl group and two methyl groups (Ϫ2.35-ppm mass error; Table  I). Using the parameters described under "Experimental Procedures," histone H4 was the top hit when searching the whole of the mouse database even when searched with the parent mass alone. Confirmation of this identification as well as the localization of the modifications, however, is derived from the fragmentation spectrum.
Software-assisted and manual data analyses of the ETD fragment ion spectrum acquired in the same run confirmed the identification of histone H4 and putative localization of modifications (Fig. 5). The peak list of the fragment ions generated by Xtract was sufficient to identify histone H4 with N-terminal acetylation and dimethylation at Lys-20 when the data were searched using Protein Prospector. However, further interpretation of the ETD fragment ion spectrum using the FAVA strategy revealed extensive coverage of sequence ion series from both the N and C termini, confirming the protein identification and sites of post-translational modification (Fig.   FIG. 3. Determination of histone H2B isoform by on-line LC Orbitrap ETD-MS/MS analysis. a, sequence alignment for mouse histone H2B types 1-M, 1-F, and 3-B. Sites of sequence variation are indicated in red. b, ETD fragment ion spectrum with only the most abundant ions labeled. The spectrum is an average spectrum composite of three MS/MS spectra. c, fragment ion assignments for histone H2B type 1-F. Although three isobaric H2B variants exist, ETD fragmentation enables the detection and confident identification of histone H2B type 1-F. 5c). A total of 129 c and z fragment ions were detected, including at least one fragment ion representing 67% of the possible fragmentation sites in this entire protein sequence (Tables I and II). A mass addition of 42.01 was found on all c ions observed, confirming the localization of the acetylation to the N terminus. The c 20 ion corresponding to the cleavage at the Lys-20 residue (and subsequent c ions C-terminal of the Lys-20 residue) was found to be shifted in mass by an additional 28.03 (total of 70.04), indicating a lysine -amino dimethylation at this site. No unmethylated or monomethylated c ions were detected C-terminal to Lys-20. In addition to the detection of fragment ions corresponding to the most abundant species isolated, we also detected a c 17 ion in the diacetylated state corresponding to N-terminal acetylation and acetylation of Lys-16, a known site of relatively abundant acetylation in mouse embryonic stem cells (14). stem cells is known to be the most heavily modified and enables combinatorial analysis of the most common histone H4 modifications: acetylation at the N terminus, Lys-5, Lys-8, Lys-12, and Lys-16 as well as mono-, di-, and trimethylation at Lys-20 (14).
When analyzed by on-line RP UPLC MS, the histone H4 N-terminal 23-mers modified with each combination of one to five acetylations and zero to three methylations were detected. These peptides are chromatographically separated according to their acetylation status as shown in Fig. 6. Thus, the isobaric trimethylated and acetylated peptides are completely resolved, enabling relative quantitation by comparing peak intensities (Table III). The most abundant species is the dimethylated species with one acetylation. As the number of acetylations increased, the relative abundance of the peptides decreased. Although the extracted ion chromatogram has low signal to noise at five acetylations, the unmodified, mono-, and dimethylated versions can clearly be detected (Fig. 6, inset). At all levels of acetylation, the dimethylated species was the most abundant species, the unmethylated and monomethylated species were nearly equivalent in intensity to each other, and the trimethylated species was of lower abundance.
The identities of the peptides were confirmed from analysis of the electrospray ETD spectra. A representative high resolution ETD mass spectrum for the histone H4 N-terminal tail [M ϩ 6H] 6ϩ is shown in Fig. 7. This particular species contains an additional three acetyl and two methyl moieties. Use of the FAVA strategy (25) permitted detection and assignment of a complete c and z fragment ion series. All members of the c ion series were detected bearing at least one acetyl moiety (42.01), indicating the acetylation of all N termini and thus indicating that the remaining two acetyl and two methyl groups were internal peptide modifications. Both methyl residues were localized to Lys-20 based upon the observation that the z 1 -z 3 fragment ions were unmodified and all z 4 ions had a mass shift of 28.31 Da, the mass of two methyl residues. Acetylation, although not stoichiometric acetylation, was detected at Lys-16 and Lys-12 as indicated by the presence of both the acetylated, dimethylated z 8 ion and the dimethylated z 8 ion as well as the acetylated, dimethylated z 12 ion and the diacetylated, dimethylated z 12 ion, respectively. Lys-8 acetylation was indicated based upon the fact that all ions N-terminal of z 16 were completely modified with two acetyl and two methyl moieties. In addition, the c ion series was used to confirm the presence of Lys-5 acetylation as both monoacetylated and diacetylated c 5 ions were detected indicating substoichiometric Lys-5 acetylation. DISCUSSION Here we present the data obtained from on-line separation, detection, and identification of intact, unfractionated histones. The core histones of mouse embryonic stem cells range from 11 to 15 kDa in size, all with relatively basic N-terminal tails. The effectiveness of RP UPLC with on-line ETD for carrying out analysis of intact histones has been presented using ϳ45 pmol of histones. Sufficient separation of the core histones enabled multiple ETD fragmentations over the elution of each protein, requiring ϳ8 s per scan. With the ETD fragmentation implemented on an Orbitrap mass analyzer (28), we were able to record high resolution data for both the molecular ions and the fragment ions that are necessary for the experiments carried out at the intact level. This study represents the first analysis of an intact bulk histone mixture taking advantage of the detailed sequence ion information provided by ETD mass analysis at high resolution. Although the dynamic range proves to be a limitation, sufficient fragmentation was obtained to identify the major species, including isoform determination and post-translational modification assignment.
Only recently have new technological advances based on ECD and ETD energy deposition processes begun to permit these studies. Gaining the ability to characterize multiple post-translational modifications on intact histone molecules provides the rigorous modification occupancy required to decipher biological functions. This was shown by early ECD work using FTICRMS on the post-translational occupancies of histone variants H2B.1 and H2B.2 isolated from Tetrahymena (12). MS/MS analysis of intact histones has been performed by direct infusion following off-line fractionation for histone H4 (13), histone H2B (29), histone H2A (30), and (to a lesser extent because of the complexity of the PTMs) histone H3 (31). Young et al. (32) have recently reported the separation and detection of large histone H3 peptides according to both their acetylation and methylation status by on-line ETD MS utilizing a "saltless" pH gradient weak cation exchangehydrophilic interaction liquid chromatography. Wu et al. (33) have integrated the analysis of intact proteins and enzymatic digests to identify proteins, modifications, and genetic variants by utilizing high resolution RP LC intact protein separations coupled on line with an FTICR mass spectrometer to profile and tentatively identify modified proteins in conjunction with bottom-up analysis of the corresponding LC fractions.
The data processing work flow for protein identification, protein isoform determination, and PTM localization, as indicated in Fig. 1, included Fig. 5 For some higher mass peaks, the monoisotopic peak was not observed; therefore, higher isotopic peaks were used.  the on-line ETD spectra. Preliminary deconvoluted, monoisotopic MH ϩ masses of the molecular and fragment ions were obtained using the Xtract software (Thermo Scientific) on both the broadband MS and the ETD fragment ion spectra. These preliminary mass values were submitted to the Protein Prospector MS-TAG search algorithm (an in-house software), and putative protein isoform and putative sites of PTM were obtained for the major positional isomer present. As a variety of protein isoforms/positional modification variants are possible within a few ppm mass cutoff, further more thorough analysis of the actual fragment ion spectra is essential. We have experienced three inherent drawbacks in using peak extraction software for complex spectrum analysis. These include the fact that this software often (a) assigns incorrect MH ϩ values, (b) fails to find low abundance ions, and (c) fails to deconvolute overlapping isotopic clusters. For these reasons, comprehensive fragment ion assignment was obtained for each ETD spectrum with the assistance of FAVA (25). These assignments establish the protein isoforms and the actual sites of PTM attachments. The FAVA strategy permits searches of raw data files for fragment ions of a particular input sequence, which may or may not include site-specific modifications, that resulted in sensitive and accurate assignment of histone H4 fragment ions. With this data processing work flow, we were able to identify eight different histone isoforms during the 15-min elution ( Table I). The advantages of on-line ETD were exemplified when the molecular weight of one histone was equivalent to three isobaric histone H2B isoforms. The ETD fragmentation enabled the determination of histone H2B type 1-F as the only H2B isoform present in the spectrum. In addition, the ETD fragmentation was essential in localizing PTMs. For the majority of the histones, the most abundant species detected were present as unmodified or only modified by acetylation at the N termini. The most abundant histone H4 species, however, had N-terminal acetylation and dimethylation at Lys-20. We also detected a further acetylated c 17 ion in the ETD spectrum. The minimum isolation window for the intact ETD experiments was 10 m/z. This isolation window encompassed more than a single precursor ion species, and this further acetylated species was also included, albeit at a lower abundance (Fig. 4), which upon fragmentation "contaminated" the fragment ion spectrum of the isolated species intended. In addition to the limitations that the 10 m/z isolation window presents when dealing with co-eluting intact proteins (namely, modified versions of the same protein), another limitation is the low dynamic range for the detection of low abundance modified species. Analysis of Asp-N digests of the same unfractionated acid-extracted histones led to the detection of a plethora of modified histone H4 N-terminal tails (Figs. 6 and 7 and Table III), which were previously undetected in the intact histone analysis. Asp-N digestion of histone H4 generates a 23-residue N-terminal peptide that permits combinatorial analysis of the most common histone H4 modifications in mouse embryonic stem cells: acetylation at the N terminus, Lys-5, Lys-8, Lys-12, and Lys-16 as well as mono-, di-, and trimethylation at Lys-20.
With a possible five acetyl and three methyl groups present, there are 15 possible N-terminal histone H4 peptides with variable levels of modification. The percentage of each species present is listed in Table III, which indicates the two most abundant species as the monoacetylated, dimethylated species and the diacetylated, dimethylated species with ϳ34 and 12% of total ion abundance, respectively. This is consistent with the intact protein analysis (Fig. 4); it illustrates the low dynamic range of the intact analysis as the less abundant modified species were not detected in the analysis conducted at the intact level.
In addition to the analysis of intact histones, the work flow used here will likely be applicable to the on-line analysis of a variety of intact protein families or intact protein subpopulations. The ability to perform fragmentation with an on-line LC FIG. 6. Extracted ion chromatograms showing typical separation of differentially modified histone H4 N-terminal tails according to level of acetylation. An increased level of acetylation increased the peptide retention time of the peptides; however, the level of methylation did not affect the retention time. Although signal to noise is high for the extracted ion chromatograms of peptides with five acetylations, the inset shows the sufficient ion intensity in the MS spectrum. 0.14 Ϯ 0.07 5Ac ϩ 2Me 0.14 Ϯ 0.08 time scale diminishes the need for complete purification of a single protein species that is often required for analysis by direct infusion experiments. In effect, sample handling and loss are reduced, and the on-line experiment requires less sample compared with the direct infusion experiment, thus making intact MS/MS analysis more accessible for difficult to purify and/or low abundance proteins. Although near complete sequence information can be obtained for small proteins, enabling the localization of post-translational modifications, this may not be the case for larger proteins. With that in mind, however, essential information can still be obtained regarding protein identification and the specific presence of modifications and proteolytic processing of the N and C termini. CONCLUSIONS We have described an on-line RP UPLC Orbitrap-ETD approach for the profiling of histone extracts for isoform and PTM variants. This approach combined the speed, sensitivity, and high resolution capabilities of the LTQ-Orbitrap with the efficiency of ETD fragmentation to characterize the genetic variants and PTMs of the most abundant species in bulk histone extracts of mouse embryonic stem cells in a single LC ETD run. However, because of the relatively wide isolation window required for intact protein fragmentation and the low dynamic range, further advances will be required to improve the on-line separation of modified intact histones and the technical methods by which the ions are isolated for ETD fragmentation. Although data were only presented for the analysis of intact histones, we suggest that this method will be directly applicable to the comprehensive analysis of proteins of similar size for identification and site-specific localization of modifications. In addition, protein identification experiments for larger proteins are likely, although localization of modifications not restricted to the N and C termini may still remain a challenge.