|
Advertisement | |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 5:265-273, 2006.
| |||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| ABSTRACT |
|---|
|
|
|---|
IUPs have so far been identified by the chance observation of the structural anomaly of proteins studied for their functional interest. We reasoned that a straightforward technique to separate IUPs from globular proteins in a cellular extract could be established by the combination of a native gel electrophoresis of heat-treated proteins followed by a second, denaturing gel containing 8 M urea. The rationale for the first dimension is that IUPs are very often heat-stable as demonstrated for Csd1 (9), MAP2 (10), NACP (11), stathmin (12), and p21Cip1 (13) for example. Heat treatment thus results in a good initial separation from globular proteins, most of which aggregate and precipitate. In the native gel, IUPs and rare heat-stable globular proteins will then be separated according to their charge/mass ratios. Combining this first dimension with an 8 M urea second step is rationalized by the usual structural indifference of IUPs to chemical denaturation by trichloroacetic acid, guanidine HCl, or urea as reported for Csd1 (9), NACP (11), ß-casein (14), stathmin (12), and p21Cip1 (13) for example. As urea is uncharged and IUPs are just as "denatured" in 8 M urea as under native conditions, they are expected to run the same distance in the second dimension and end up along the diagonal. Heat-stable globular proteins, on the other hand, will unfold in urea, slow down in the second gel, and accumulate above the diagonal. Because of this effective separation, IUPs are amenable to subsequent MS identification.
In this work we elaborated this novel separation principle. By 10 known IUPs and four globular proteins, it was shown that IUPs did occupy diagonal positions in the second gel and separate from globular proteins, and thus the gel can probe the IUP nature of a protein available in very low quantity and limited purity. By running Escherichia coli and Saccharomyces cerevisiae extracts, we demonstrated that a combination of this novel 2D technique and MS analysis can lead to the identification of novel IUPs.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
- and ß-caseins (both from ICN Biomedicals, catalog numbers 100251 and 100321); DARPP-32 (Calbiochem, catalog number 251755); and BSA (catalog number A 7638), ovalbumin (catalog number A 2512), fetuin (catalog number F 3004), and alcohol dehydrogenase (catalog number A 7011, all four from Sigma). The GF calibration kit (catalog number 17-0442-01) was from Amersham Biosciences. Thermus thermophilus IPMDH was a gift from Prof. Peter Závodszky (Institute of Enzymology, Biological Research Center, Hungarian Academy of Sciences). Csd1 and MAP2c were prepared as described previously (15), and ERD10 was cloned by PCR from an Arabidopsis thaliana cDNA library obtained from Dr. László Szabados (Institute of Plant Biology, Biological Research Center, Hungarian Academy of Sciences, Szeged, Hungary). The cDNA of NACP cloned into a pT7-7 vector was obtained from Dr. Michael J. Volles (Center for Neurological Disease, Harvard Medical School, Cambridge, MA). Mypt1-(304511) in a pET21 vector was a gift from Prof. Ferenc Erd
di (Department of Medical Chemistry, University Medical School of Debrecen, Debrecen, Hungary) and was expressed as described previously (16). Bob-1 in a pET 32 vector was obtained from Prof. Ben Luisi (Department of Biochemistry, University of Cambridge, Cambridge, UK) and purified as described previously (17). All proteins were dialyzed into a buffer of 50 mM Tris, 150 mM NaCl, 2 mM dithioerythritol, pH 7.5 and kept at 80 °C in aliquots until use.
Novel Proteins
The cDNA of novel proteins was amplified by PCR by primers encoding a 5' NdeI and a 3' XhoI restriction sites either from genomial DNA (for E. coli ORF1, ORF2, YhgI, DnaKs, and GccH) or from a cDNA library generated by RT-PCR (for S. cerevisiae tropoM, Ubi6, sRib, and calmodulin). The products were ligated into pET-22b vector (Novagen) and transformed into E. coli BL21(DE3) strain. Expression of these proteins and S. cerevisiae transcription factor IIA (TFIIA) (the expression vector of which was a gift from Dr S. Hahn, The Fred Hutchinson Cancer Research Center, Seattle, WA) was induced by 0.5 mM isopropyl 1-thio-ß-D-galactopyranoside at 30 °C for 3 h. The proteins were purified to homogeneity by heat-treating (10 min x 100 °C) cell extracts following DEAE-cellulose ion exchange chromatography of the supernatants.
Extracts of E. coli and S. cerevisiae
E. coli cells were grown overnight in NZYM medium (NZ-amine, NaCl, yeast, MgSO2) and S. cerevisiae cells for 2 days in YP medium (yeast extract, pecton) (both 500 ml). The cells were centrifuged at 4,000 rpm at 4 °C for 20 min, washed in 10 ml of 50 mM Tris/HCl, pH 7.5, and centrifuged again. E. coli cells were then suspended in 2 ml of 50 mM Tris/HCl, 150 mM NaCl, 2 mM dithioerythritol, pH 7.5 and disrupted by sonication (10 x 10 s, 24 mµ at 0 °C on an MSE sonicator with 1-min breaks between runs). S. cerevisiae cells were disrupted by three freezing/thawing cycles in liquid nitrogen, homogenizing in a Potter-Elvehjem tissue grinder, and subsequent sonication as with E. coli. In both cases, disrupted cells were centrifuged for 100,000 x g for 30 min at 4 °C, and the supernatant was heated to 100 °C for 10 min and centrifuged again for 100,000 x g for 30 min at 4 °C. The final supernatant usually contained 2 mg/ml protein and was kept at 80 °C in small aliquots for 2D runs.
2D Electrophoresis
The 2D electrophoresis combined a native gel and an 8 M urea gel, both prepared according to Laemmli (18) but without SDS. To the heat-treated samples (1 µg each of control proteins or 100 µg of E. coli or S. cerevisiae extract)
volume of native sample buffer was added, and the sample was run on a native gel of 0.7-mm thickness either in small format (8 x 6 cm, Mini-Protean II) or large format (20 x 16 cm, Protean II XL, both from Bio-Rad). The gel was briefly stained with BioSafeTM Coomassie G250 (Bio-Rad), and the lane containing protein bands was cut out. This 1015-mm-wide strip was then soaked into the upper gel buffer plus 8 M urea for 45 min and placed on top of a second gel prepared with 8 M urea. This second dimension had no upper (stacking) gel, and it was cast between 1-mm spacers to accommodate the strip swollen in urea. The second gel was run 2.5 times longer than the first at the same voltage and stained in colloidal Coomassie (19) overnight.
Mass Spectrometry
Potential IUPs to be identified were cut out of the 2D gel and analyzed at the Proteomics Research Group of the Biological Research Center of the Hungarian Academy of Sciences (Szeged, Hungary). Briefly spots were in-gel digested by trypsin and analyzed by MALDI-TOF MS first. The resulting monoisotopic peptide masses (peptide mass fingerprint) were then subjected to a database search to identify the protein(s) present. Protein identities were confirmed by postsource decay spectra, providing sequence information on selected peptides.
Gel Filtration Chromatography
The folded/unfolded nature of proteins was characterized by GF chromatography. 100 µl of each of the proteins was run on an Amersham Biosciences Superdex 200 (1 x 30-cm) column at 0.5 ml/min in 50 mM Na2HPO4, 150 mM NaCl (or occasionally 1 M NaCl to exclude aggregation), pH 7.0 buffer on an Amersham Biosciences FPLC system. Proteins were recorded at 280 nm. The column was calibrated by the following globular proteins (molecular mass): ribonuclease A (13.7 kDa), chymotrypsinogen A (25.0 kDa), ovalbumin (43.0 kDa), BSA (67.0 kDa), and alcohol dehydrogenase (146.8 kDa). The Mw of novel proteins was determined from a calibration curve constructed by plotting log Mw values versus elution volume.
Characterization of Protein Disorder
Sequence-specific assessment of protein disorder was obtained by the PONDR® (Predictor of Natural Disordered Regions) algorithm (2, 5) available at www.pondr.com. Access to PONDR was provided by Molecular Kinetics (6201 La Pas Trail, Suite 160, Indianapolis, IN 46268; E-mail: main{at}molecularkinetics.com) under license from the Washington State University Research Foundation. PONDR is copyright ©1999 by the Washington State University Research Foundation, all rights reserved. The percentage of disorder is calculated as the ratio of residues with a PONDR score above 0.5.
The global tendency of proteins for disorder was also assessed by calculating their mean net charge (R) and mean hydrophobicity (H) (see Ref. 7). R was calculated as (Glu + Asp) (Lys + Arg) divided by the length of the protein. H was calculated with the ProtScale server (us.expasy.org/tools/protscale.html) with settings as defined in Ref. 7. IUPs and globular proteins in the hydrophobicity-charge space thus defined (the CH plot) are usually separated by a line defined by the equation r = 2.785H 1.151 (7). The tendency of a given protein for disorder was characterized by its distance from this line (set as positive for IUPs and negative for ordered proteins).
Structural disorder can also be characterized by the distribution of secondary structural elements determined by CD spectroscopy. To this end, CD spectra were decomposed into
-helix, ß-sheet, turn, and coil by the CDpro software (lamar.colostate.edu/
sreeram/CDPro/main.html), and coil was considered to correspond to disordered regions of the proteins.
The hydrodynamic dimension, a very sensitive measure of globularity, was characterized by the ratio of the apparent Mw determined by GF chromatography and the absolute Mw calculated from the amino acid sequence and also by the ratio of the apparent and real Stokes radius derived from Mw,app and Mw by the equation: log(RS) = 0.369 log(Mw) 0.254 (20). Because the column was calibrated by globular proteins, these ratios are close to 1 for folded proteins. The value of Mw,app/Mw = 23 and the value of RS,app/RS = 1.41.5 are typical of the proteins in a molten globule (MG) state and the value of Mw,app/Mw = 45 and the value of RS,app/Rs = 1.71.9 are typical of the extended, premolten globule (PMG) or random coil-like (U) proteins (3).
Other Methods
CD spectra were recorded in a Jasco J-720 spectropolarimeter in a buffer of 10 mM NaH2PO4, 150 mM NaCl, pH 7.0. In the case of gels run for MS analysis, all solutions were prepared from Elix/Synergy purified water (Millipore) and filtered to sterility through a 0.2-µm filter (Whatman). All the equipment was washed and rinsed in Millipore water. The gels were handled with great care to avoid contamination. Spots cut out were transferred to 100 µl of sterile water and sealed for transport to the MS laboratory. Protein concentration was determined according to Bradford (21).
| RESULTS |
|---|
|
|
|---|
-casein) ran a little above the diagonal, and one globular protein (fetuin) ran closer to IUPs. These deviations probably result from a significant residual structure in caseins (22) and the resistance to denaturation of fetuin (6). Nevertheless this run overall demonstrated that the 2D electrophoresis separates IUPs and globular proteins as predicted.
|
Identification of E. coli and S. cerevisiae IUPs by Combining the 2D Technique with MS
The major asset of the 2D technique is that it can recognize novel IUPs not yet described as such. This point is demonstrated by running heat-treated extracts of E. coli and S. cerevisiae in the large format gel (Figs. 2 and 3) and identifying spots at, and above, the diagonal by MS (Tables I and II). It is of note that the procedure is highly reproducible as parallel runs resulted in very similar gel patterns (not shown).
|
|
|
|
and P2ß has been reported (23), whereas E. coli ribosomal L7/L12 is a heat-stable, molten globule-like protein that acquires structure only upon dimerization (24). A similar behavior was reported for GroES (25). E. coli acyl carrier protein may also be a molten globule as it is stabilized by Mg2+ binding (Protein Data Bank code 1acp), but it is extremely flexible in solution (26), and its Vibrio harveyi homologue is fully unfolded under physiological conditions (27). Some of the proteins have separate disordered and ordered regions. For example, the structure of region 80156 of the acetyl-CoA carboxylase BCCP subunit is known (Protein Data Bank code 1a6x), but the protein has a highly flexible N-terminal linker region (28). Proteins above the diagonal are heat-stable enzymes (e.g. superoxide dismutase), which typically require a well defined structure for function. Nevertheless the high flexibility of calmodulin is necessary for its regulatory function (29).
Benchmarking the 2D Technique against Other Methods
Our current base of knowledge on protein disorder relies on a few experimental techniques and derived bioinformatic predictors. The different techniques (mostly x-ray crystallography, CD, and NMR (2, 3, 7)), however, probe distinct aspects of protein disorder (4), which results in a certain level of uncertainty in the output of disorder predictors (see Ref. 30). Thus, benchmarking the 2D technique against other methods is of both theoretical and practical importance.
To this end, we generated the PONDR pattern and the CH plot for all control and newly identified proteins (Supplemental Figs. S1S6). Besides clear cut cases, there were several contradictory proteins that would be difficult to correctly classify by any single method (see also Fig. 4). For example, Bob-1 was largely disordered by PONDR (73.44%) but appeared ordered on the CH plot. Caseins and NACP (Fig. 4A) would be held mostly ordered by both approaches, although they are classical examples for disorder (11, 31), also underscored by the CD spectra here (Supplemental Fig. S1). On the other extreme, IPMDH is a thermostable globular enzyme (Protein Data Bank code 1ipd), yet its level of disorder predicted by PONDR exceeded these three IUPs, whereas its structural status was correctly identified by the CH plot (Fig. 4B). Similar inconsistencies were apparent with some of the novel proteins identified as disordered by the 2D technique. For example, DnaKs (acronyms of protein names are given in Tables I and II) had a lower level of disorder by PONDR (39.07%) than GccH (53.49%), but it was disordered by the CH plot, whereas GccH (Fig. 4C) appeared ordered. GroES and L7/L12 (Fig. 4D) had a rather high level of disorder by PONDR (43.3 and 57.02%) and are thought to be disordered in the monomer state (24, 25), but they were predicted as ordered by the CH plot. By both techniques, YhgI appeared balanced between order and disorder without a clear clue as to where it belongs.
|
-helical. GF chromatography agreed with the 2D technique in ascertaining its overall disorder. An interesting example is sRib, which showed little disorder by PONDR® but appeared to exist in an extended conformation by GF and occupied a diagonal position on the 2D gel.
|
| DISCUSSION |
|---|
|
|
|---|
Its first practical application is the rapid characterization of a single protein in terms of its IUP status. As demonstrated, a simple run of the 2D format can tell with high certainty if the protein is structured or unstructured. Given that the technique can provide information on a protein of very small quantity and limited purity, it will be a useful complement to other techniques that are more demanding on protein quantity and quality (2, 4). Actually with a good antibody at hand, it may even be performed on cell/tissue extracts without prior purification.
The other application is in the analysis of cellular extracts. This point was demonstrated by the large format gel with E. coli and S. cerevisiae extracts and the identification of a range of novel, mostly disordered proteins. The list of proteins actually identified (Tables I and II) raises intriguing points. The majority of proteins have never been described as IUPs, which is consistent with the fact that we know only a small fraction of the "disorderome" (2, 4). This is also underscored by the lack of an overlap between identified E. coli and S. cerevisiae proteins despite the evolutionary conservation of protein disorder in some protein families (2, 33). Furthermore when we compared the 2D pattern of the extracts from E. coli and S. cerevisiae, more proteins were seen in the diagonal of the latter in agreement with predictions that the frequency of protein disorder increases with increasing complexity of the organisms (2, 5).
Whereas the 2D technique offers unique applications, it has some limitations as well. Because of the application of a native gel in the first dimension, its resolving power does not match that of the conventional 2D electrophoresis. This can be partially overcome by applying mild, non-charged detergents, and this will certainly be one future direction of our studies. A related point is that some IUPs occur at very low levels not detected by conventional staining. Improving the sensitivity of staining/detection, increasing sample load in the presence of detergents, or prefractionation of samples may be of help here. An important further limitation is that under the standard conditions of native electrophoresis, proteins of net positive charge are lost. As roughly half of the IUPs have a basic isoelectric point (7), the buffer system needs to be changed to extend the analysis over these as well.
Notwithstanding these limitations, this technique has already yielded significant information on the identity of novel IUPs. With the conceived improvements implemented, its potential is expected to increase in the future. Given the ubiquity and functional importance of IUPs, this novel 2D electrophoresis technique may represent an important step toward developing a high throughput proteomic tool for these proteins.
| ACKNOWLEDGMENTS |
|---|
di, Dr. László Szabados, Dr. Steven Hahn, and Dr. Michael J. Volles for various proteins and constructs. | FOOTNOTES |
|---|
Published, MCP Papers in Press, October 13, 2005, DOI 10.1074/mcp.M500181-MCP200
1 The abbreviations used are: IUP, intrinsically unstructured protein; Bob-1, B cell-specific transcription co-activator; Csd1, calpastatin domain 1; DARPP32, dopamine- and cAMP-regulated phosphoprotein; GF, gel filtration; CH plot, mean hydrophobicity-mean net charge plot; IPMDH, 3-isopropylmalate dehydrogenase; MAP2c, microtubule-associated protein 2c; MG, molten globule; Mypt1-(304511), myosin phosphatase target subunit 1 (fragment 304511); NACP (
-synuclein), non-Aß component of Alzheimers amyloid plaque, precursor; PMG, premolten globule; PONDR®, Predictor of Natural Disordered Regions; RS, Stokes radius; 2D, two-dimensional; TFIIA, transcription factor IIA; GccH, glycine cleavage complex H; DnaKs, DnaK suppressor; BCCP, biotin carboxyl comer protein of acetyl-CoA carboxylase. ![]()
2 V. Csizmók, E. Sz
ll
si, P. Friedrich, and P. Tompa, unpublished observations. ![]()
* This work was supported in part by International Senior Research Fellowship GR067595 from the Wellcome Trust. ![]()
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. ![]()
Supported by a Bolyai János Scholarship. To whom correspondence should be addressed: Inst. of Enzymology, Biological Research Center, Hungarian Academy of Sciences, P. O. Box 7, H-1518 Budapest, Hungary. Tel.: 361-279-3143; Fax: 361-466-5465; E-mail: tompa{at}enzim.hu
| REFERENCES |
|---|
|
|
|---|
(s1)-, ß- and
-caseins.
J. Chem. Soc. Faraday Trans. 89, 2683
2692[CrossRef]This article has been cited by other articles:
![]() |
R. Ivanyi-Nagy, J.-P. Lavergne, C. Gabus, D. Ficheux, and J.-L. Darlix RNA chaperoning and intrinsic disorder in the core proteins of Flaviviridae Nucleic Acids Res., February 11, 2008; 36(3): 712 - 725. [Abstract] [Full Text] [PDF] |
||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
| HOME | HELP | FEEDBACK | SUBSCRIPTIONS | ARCHIVE | SEARCH | TABLE OF CONTENTS |
| All ASBMB Journals | Journal of Biological Chemistry |
| Journal of Lipid Research | ASBMB Today |