|
|
||||||||
| ||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||||
Molecular & Cellular Proteomics 5:1286-1299, 2006.
© 2006 by The American Society for Biochemistry and Molecular Biology, Inc.
,
,
,¶
,¶
,**
,**,

From
The Walter and Eliza Hall Institute of Medical Research, Melbourne, Victoria 3050, Australia, ¶ Department of Medical Biology, The University of Melbourne, Parkville, Victoria 3010, Australia,, and || Joint Proteomics Laboratory, Ludwig Institute for Cancer Research and the Walter and Eliza Hall Institute of Medical Research, Melbourne 3052, Australia
| ABSTRACT |
|---|
|
|
|---|
94% of the GPI-anchored schizont/merozoite proteome and constitute by far the largest validated set of GPI-anchored proteins in this organism. Moreover MSP-1 and MSP-2 were present in similar copy number, and we estimated that together these proteins comprise approximately two-thirds of the total membrane-associated surface coat. This is the first time the stoichiometry of MSPs has been examined. We observed that available software performed poorly in predicting GPI anchoring on P. falciparum proteins where such modification had been validated by proteomics. Therefore, we developed a hidden Markov model (GPI-HMM) trained on P. falciparum sequences and used this to rank all proteins encoded in the completed P. falciparum genome according to their likelihood of being GPI-anchored. GPI-HMM predicted GPI modification on all validated proteins, on several known membrane proteins, and on a number of novel, presumably surface, proteins expressed in the blood, insect, and/or pre-erythrocytic stages of the life cycle. Together this work identified 11 and predicted a further 19 GPI-anchored proteins in P. falciparum.
The GPI anchoring of proteins is one of the most ancient and widespread forms of protein glycosylation in eukaryotes, and the higher level rules governing the attachment of these glycolipids appear to be strongly conserved (5). The preprotein sequences of GPI-anchored proteins are characterized by being book-ended by an N-terminal hydrophobic signal sequence that directs co-translational insertion into the ER and a C-terminal hydrophobic region that guides the attachment of a GPI anchor. No other hydrophobic stretches that could serve as internal transmembrane domains are usually present. During co-translational insertion into the ER membrane the C-terminal hydrophobic region is recognized by a transamidase complex that simultaneously removes the hydrophobic region and replaces it with a presynthesized GPI anchor (5). The cleavage site (between
and
+ 1 amino acids) is a short distance upstream of the hydrophobic region and is usually comprised of three amino acids with small side chains (6, 7).
The surfaces of the various extracellular forms of the malaria parasite, the merozoite, gamete, ookinete, and sporozoite, are coated by different proteins that are either known or presumed to be GPI-anchored. The identities of some of these proteins are known in P. falciparum including for example, merozoite surface protein 1 (MSP-1) and MSP-2 on merozoites (8, 9), Pfs48/45 on gametes (10), Pfs25 and Pfs28 on ookinetes (11, 12), and circumsporozoite protein (CSP) on sporozoites (13, 14). Using a variety of approaches, a number of other proteins have been predicted as likely to possess GPI anchors, in our own case using proteomic analysis of detergent-resistant membrane preparations (4). A cursory examination of the C-terminal sequences of known or predicted GPI-anchored proteins reveals that they share the general characteristics of eukaryotic GPI anchor signals, namely a C-terminal stretch of 1318 non-charged amino acids enriched with hydrophobic amino acids. The cleavage sites of three P. falciparum GPI anchor signal sequences are known (MSP-1, -2, and -4), and they, like other eukaryotic GPI attachment sites, are largely comprised of the small amino acids serine and asparagine (15). Despite this knowledge, very few P. falciparum proteins have been confirmed as possessing GPI anchors.
The objective of this study was to identify all the proteins present in the currently annotated version of the P. falciparum genome (2) (www.PlasmoDB.org) that are most likely to be GPI-anchored. The paucity of validated GPI-anchored proteins in this organism, together with the evolutionary distance of this parasite from other species in which GPI anchoring has been well studied, led to uncertainty in the use of available programs that predict the presence of GPI anchors from primary sequence data. Because of this we used a combination of biochemical and bioinformatic approaches to determine the GPI proteome of P. falciparum. Proteomic analysis was used to identify GPI-anchored proteins synthesized late in the blood-stage cycle and that eventually reside on the merozoite form of the parasite. For this procedure, GPI anchors were first labeled with [3H]glucosamine, and GPI-anchored proteins were enriched by detergent-phase partitioning. Eleven different GPI-anchored proteins were assigned using this approach, and these were rank ordered according to their relative copy number. Up to another nine lower abundance [3H]glucosamine-labeled GPI-anchored proteins were also apparent at this life stage, but these remain unidentified. To predict the identity of these lower abundance proteins as well as those present at other life stages we developed a program, GPI-HMM, trained on P. falciparum sequences to search all annotated and predicted protein sequences derived from the P. falciparum genome. Using this approach we confidently predicted that the P. falciparum genome encodes 30 GPI-anchored proteins. Sixteen of these have a late blood-stage expression profile, which is in good agreement with proteomic analysis. With the emergence of genome sequences across the Plasmodium genus, these studies provide a framework for predicting the nature of surfaces of the various extracellular forms of these parasites.
| EXPERIMENTAL PROCEDURES |
|---|
|
|
|---|
1 x 1010 parasite-infected erythrocytes with 10 µCi/ml D-[6-3H]glucosamine hydrochloride in glucose-free RPMI 1640 medium supplemented with 10 mM fructose, 25 mM Hepes, 0.2 mM hypoxanthine at 37 °C for 4 h. Infected erythrocytes were pelleted by centrifugation at 2,800 rpm for 10 min (Beckman GS-6 centrifuge), lysed with saponin (0.15% saponin for 10 min on ice), and extensively washed in cold TBS supplemented with a mixture of protease inhibitors (TBS+PIs; Roche Complete protease inhibitor mixture and/or Calbiochem Protease Inhibitor Mixture Set III). Parasites were resuspended to a volume of 1 ml in TBS+PIs, snap frozen in liquid nitrogen, and stored at 80 °C until required.
Detergent Extraction and Two-phase Partitioning of GPI-anchored Proteins
Detergent extraction and enrichment of GPI-anchored proteins was achieved by Triton X-114 (TX-114) two-phase separation. Parasite pellets were equilibrated in 2.4 ml of ice-cold TBS with protease inhibitors at a final protein concentration of <4 mg/ml and extracted with 0.6 ml of precondensed TX-114 stock solution (
2% final concentration). Samples were maintained on ice for 30 min with occasional mixing and centrifuged at 4 °C at 100,000 x g for 30 min to remove TX-114-insoluble material. The supernatant was incubated at 33 °C for 10 min and centrifuged at 10,000 rpm for 10 min at room temperature to induce phase separation. Aqueous and organic phases were recovered and mixed with fresh buffer or detergent as appropriate, and the phase separation procedure was repeated twice more. Aliquots of total lysate and TX-114-insoluble and TX-114-soluble aqueous or detergent-enriched fractions were collected and quantified using BCA protein assay reagent (Pierce). The resulting TX-114-soluble detergent fractions enriched in GPI-anchored proteins were pooled, and proteins were precipitated with 34 volumes of acetone at 20 °C.
Immunodetection and Autoradiography of Detergent-enriched Proteins
The GPI-enriched fraction was resuspended in reducing sample buffer (10 mM Tris-HCl, pH 8.0, 1 mM DTT, 1% SDS, 1% sucrose, 0.005% bromphenol blue) and placed at 70 °C for 10 min. Protein samples (50 µg) were separated by 1D SDS-PAGE on 18-cm x 16-cm x 1.5-mm 515% gradient or 10% polyacrylamide gels and electrophoretically transferred to Immobilon-PSQ polyvinylidene difluoride membranes (Millipore) in Towbins transfer buffer containing 20% methanol and 0.01% SDS. Complete transfer of high and low molecular protein species (>95%) was confirmed by protein staining using Coomassie Brilliant Blue R250 and/or Ponceau S dyes. Immunoblotting against malaria antigens was performed as described previously (18). Imaging of D-[6-3H]glucosamine-labeled protein bands was achieved by direct autoradiography (15-day exposure) of dry blots using Fujifilm BAS-TR2040 tritium imaging plates and an FLA-3000 luminometric detection system. Accurate densitometric quantification of light emitted from imaging plates was performed using ImageJ and ImageQuant Version 5.0 (GE Healthcare).
Two-dimensional (2D) Gel Electrophoresis
Two-dimensional gel electrophoresis was performed using conditions required for optimal extraction and separation of P. falciparum-infected erythrocyte proteins (19). Isoelectric focusing was carried out on wide range immobilized pH gradients (11 cm long, pH 310 non-linear Immobiline DryStrip gels; Amersham Biosciences) using the Protean IEF cell system (Bio-Rad). The GPI-enriched fraction (
200 µg of acetone precipitate) was redissolved in 300 µl of rehydration/sample buffer (7 M urea, 2 M thiourea, 2% ASB-14, 100 mM DTT, 0.4% ampholytes). Samples were loaded by passive rehydration for 12 h and focused at a current limit of 50 µA/IPG strip using a fast voltage gradient (8000 V maximum, 24,000 V-h) at 20 °C. The second dimension was carried out on 10% polyacrylamide gels (18 cm x 16 cm x 1.5 mm) using a Hoefer SE 600 system at 200 V constant voltage and 10 °C until the dye front reached the bottom of the gel. Analytical gels were transferred to Immobilon-PSQ PVDF membranes, and D-[6-3H]glucosamine-labeled protein spots were visualized by autoradiography as above. To visualize proteins after 2D electrophoresis preparative gels were stained with potassium salt silver stain, Bio SafeTM Coomassie stain (Bio-Rad), or ImperialTM protein stain (Pierce) using protocols compatible with electrospray ionization-mass spectrometry (20) or the manufacturers instructions. Computer-aided 2D image analysis of digitized autoradiographs and Coomassie-stained gel images was carried out using ImageJ and ImageQuant Version 5.0. The relative amount of GPI-anchored protein was calculated from densitometric scans of autoradiographs of [3H]glucosamine-labeled total schizont proteins separated by high resolution 1D gel electrophoresis. To estimate the relative amounts of multiple GPI-anchored proteins present in individual 1D bands, densitometric scans of autoradiographs of 2D gels of the same material were used as indicated in Table I.
|
10 µl by centrifugal lyophilization (Savant model AES1010, Savant) for ESI ion trap MS/MS (LCQ-Deca, Finnigan). Protein digests (
10 µl of 1% formic acid) were transferred into 100-µl glass autosampler vials, and peptides were fractionated by capillary reversed-phase HPLC (Agilent Model 1100 capillary HPLC system) using a butyl-silica 150 x 0.15-mm-inner diameter reversed-phase capillary column (ProteColTM, C4, 3 µm, 300 Å; SGE Australia) developed with a linear 60-min gradient from 0 to 100% B where Solvent A was 0.1% (v/v) aqueous formic acid and Solvent B was 0.1% aqueous formic acid, 60% (v/v) acetonitrile with a flow rate of 0.8 µl/min. The capillary HPLC system was coupled on line to the ESI ion trap mass spectrometer for automated MS/MS analysis of individually isolated peptide ions (21). Uninterpreted CID spectra were filtered excluding spectra with less than 30 peaks using the LCQ-DTA program as part of Bioworks 3.1 sr1 (Finnigan, San Jose, CA). The parameters used to create the peak lists are as follows: minimum mass, 400 Da; maximum mass, 5000 Da; grouping tolerance, 1.5; intermediate scans, 1; minimum group count, 1; LCQ-DTA auto charge state calculation; 10 peaks minimum per spectrum; peptide charge states, 1+, 2+, and 3+; ±2-Da peptide mass tolerance; ±0.5-Da MS/MS fragment mass tolerance. Parent ion masses were determined based on the isotope cluster spacing in the zoom scan spectrum, and individual spectra files (.dta file extension) were generated. These files were then automatically searched using MascotTM Version 2.1 (Matrix Science, London, UK) against the latest LudwigNR database (21) as well as the P. falciparum strain 3D7 genomic data peptide sequence database in FASTA format downloaded from the PlasmoDB website (www.plasmodb.org). Note that the PlasmoDB RAMA sequence (MAL7P1.208) is incorrect and was replaced with another whose accuracy had been verified by cDNA sequencing (Genbank accession number Q89710). Searches were conducted with the carboxymethylation of cysteine as a fixed modification (+58 Da), variable oxidation of methionine (±16 Da), and the allowance for up to four missed tryptic cleavages (22). Peptide identities were chosen to be correct with Mascot scores of 40 and above and were also manually validated. Peptide identities with Mascot scores of less than 40 were all manually validated and deemed as positive identities according to the fragmentation principles as published previously (22).
Prediction of GPI Anchoring
A hidden Markov model was used to predict GPI anchoring of P. falciparum proteins (GPI-HMM), and the topology for the model was chosen to correspond to the known overall structure of such proteins (23). The ER signal region was chosen to be identical to that in SignalP-HMM (24); the middle region is a simple looping state with a background amino acid emission distribution; the GPI signal region was trained using the amino acid sequences of the C-terminal regions of 14 previously characterized proteins proven or thought to be GPI-anchored and aligned by predicted
-site. The training procedure for the GPI signal region used sequence weighting and a Dirichlet mixture prior similar to that used by HMMer (25). For a given protein, a score is calculated that shows how well the protein fits the model as compared with a null model, which is just a single looping state that is the same as the middle region. A positive score indicates a better fit than the null model. To calibrate the model, we used a dataset ("negative test set") of 1909 proteins known not to be GPI-anchored but that have a transmembrane domain within 50 amino acids of the C terminus and come from various organisms.
Many proteins with transmembrane domains score highly, so to increase the power of our predictions we also automatically flag any protein that is predicted to have at least three transmembrane domains by TMHMM (26). This model was then used to score each protein in the P. falciparum proteome. An estimated p value for each protein is then calculated as the proportion of the negative test set that has not been flagged and scores at least as well as the given protein. Note that this p value is conservative due to the choice of a negative test set closer to GPI-anchored proteins than the majority of P. falciparum proteins. To measure the robustness of this model, a leave-one-out cross-validation procedure was applied. For each protein in the training set, a new model was trained with that protein removed, and the resulting model was applied to the removed protein.
| RESULTS AND DISCUSSION |
|---|
|
|
|---|
The GPI anchors of schizont-stage P. falciparum proteins were specifically labeled with [3H]glucosamine because in P. falciparum little, if any, N-linked glycosylation occurs (30, 31), and virtually all amino-sugar precursor is incorporated into the glycan moiety of the GPI anchor (32). Total schizont-stage proteins were fractionated by TX-114 two-phase partitioning, a process that separates soluble proteins into aqueous and detergent-phase fractions. Densitometric analysis of protein equivalents of the ensuing fractions (total, insoluble, aqueous, and detergent) showed that TX-114 partitioning resulted in a specific 510-fold enrichment of detectable [3H]glucosamine-labeled proteins in the TX-114 detergent-soluble phase (Fig. 1, A and B). Pulse-chase analyses at various points in the 48-h blood-stage cycle revealed that [3H]glucosamine-labeled proteins were most strongly expressed late in the cycle (Fig. 1C). That the expression of most asexual blood-stage GPI-anchored proteins peaked during schizont stages is congruent with peak transcription of the genes encoding enzymes that synthesize and attach the GPI anchors (33) (see Supplemental Fig. S1 for transcriptional profiles from Ref. 34).
|
75 kDa (band 6) that is similar to its predicted and previously published size (35). Anti-RAMA antibodies labeled a major band at
55 kDa (band 8) and a lower abundance larger band of
160 kDa (band 2) that probably corresponds to the full-length protein. The 55-kDa species is the expected dominant C-terminally processed fragment of RAMA that appears shortly after the expression of full-length protein (36). MSP-2 is present as three bands: a major 50-kDa band (band 9) that corresponds to the full-length protein and lesser amounts of a faster migrating 42-kDa band (band 10) and a 110-kDa, presumably SDS-resistant dimeric, species (band 4). Co-migrating with the 42-kDa MSP-2 species is MSP-4 and MSP-5, all of which comprise band 10 (Fig. 2). These data led to a designation for nine of the 15 [3H]glucosamine-labeled bands. Of the remaining six unidentified bands five labeled strongly (bands 5, 11, 12, 13, and 14) and one labeled weakly (band 7) with [3H]glucosamine. Hence a number of unidentified GPI-anchored proteins appeared to be relatively abundantly expressed at schizont stage.
|
To generate a more comprehensive map of GPI-anchored schizont proteome we performed 2D gel electrophoresis and subjected [3H]glucosamine-labeled spots to peptide sequencing by LC-MS/MS. [3H]Glucosamine-labeled proteins were precipitated from the TX-114 detergent phase, separated on 2D gels, and blotted. Autoradiographs of the resulting protein blots produced a highly reproducible pattern of
30 [3H]glucosamine-labeled 2D spots (Fig. 3, left panel). Regions corresponding to individual radiolabeled spots were then excised from duplicate Coomassie- or silver-stained gels (e.g. Fig. 3, right panel), and their proteins were sequenced by LC-MS/MS (Supplemental Table S2 and Fig. S2). The isoelectric point, molecular weight, and approximate corresponding 1D gel band of each assigned protein spot, as measured from five separate 2D gels, is summarized in Table I. This approach proved much more conclusive than the 1D approach with known or potential GPI-anchored proteins dominating the peptide coverage in 21 of 30 analyzed 2D spots (Supplemental Table S2).
|
The acidic region of 2D gels contained multiple partially reduced forms of the abundant known GPI-anchored merozoite surface proteins MSP-2 (spots 12 and 18), MSP-4 (spots 14 and 19), and MSP-5 (spot 14), which were difficult to resolve due to their nearly identical isoelectric points and molecular weights. In agreement with 1D Western blot results for bands 2, 4, and 15 (Fig. 2), immunostaining of 2D Western blots also confirmed that the corresponding spots (spots 2, 5, and 30) were recognized by antibodies against RAMA, MSP-2, or MSP-119, respectively (Table I, data not shown). As in the 1D proteomic analysis we also detected the three newly identified surface antigens Pf92, Pf38, and Pf12. These were predicted to be GPI-anchored based on primary sequence and their presence in detergent-resistant membrane preparations (4). The 2D gel proteomic approach confirmed the posttranslational modification and TX-114-partitioning of these proteins, strongly supporting the likelihood that they do possess GPI anchors (Table I). Moreover we confirmed the presence of GPI anchors on MSP-10 and ASP, two known apical surface proteins that unlike most other GPI-anchored merozoite surface proteins were not detected in low density detergent-resistant membrane fractions (4, 38). Finally we also detected a novel putative GPI-anchored protein, PFD0955w. PFD0955w has a predicted molecular mass of
34 kDa, and we therefore have named it Pf34. In total, the presence of GPI moieties was confidently assigned to 11 different proteins in mature blood-stage P. falciparum parasites (MSP-1, -2, -4, -5, -10, RAMA, ASP, Pf92, Pf38, Pf12, and Pf34).
In addition to the 21 identified GPI-anchored protein spots, nine minor but distinct [3H]glucosamine-labeled protein spots were observed that could not be authenticated (Table I and Fig. 2). Seven of these 2D gel spots yielded no detectable parasite-related peptide sequences (spots 8, 21, 23, 26, 27, and 28), and two merely contained peptide sequences of non-GPI-anchored P. falciparum proteins (spots 20 and 29). Interestingly spot 29 yielded unique high peptide coverage for the hypothetical protein PFF0335c (formerly MAL6P1.71) in four independent experiments. Apart from apparently labeling with [3H]glucosamine, the same spot also incorporated the radioactive GPI lipid precursors [3H]palmitate and [3H]myristate (data not shown). Moreover PFF0335c partitioned in the hydrophobic TX-114 detergent phase, and proteomic analysis revealed that it is particularly abundant in a low density detergent-resistant membrane fraction of P. falciparum (4). However, peptides covering the putative C-terminal hydrophobic domain of this protein were detected by LC-MS/MS (Supplemental Fig. S2), indicating that this sequence is not removed to add a GPI anchor. Hence assuming that the current gene model for this protein is correct, it is unlikely that PFF0335c is GPI-anchored but instead appears to co-migrate with an as yet unidentified GPI-anchored protein. This is probably also the case for spot 20, which yielded significant peptides for multiple highly expressed non-GPI-anchored P. falciparum proteins (Supplemental Table S2). The nine spots that have so far eluded identification were presumably below the detection limit of our LC-MS/MS system and/or masked by other co-migrating highly abundant proteins. It is also possible that additional GPI-anchored proteins may also be present in some of the 21 spots that have an assignment but that these are masked by more abundant species. Also we cannot completely exclude the possibility that some spots appear as a result of a small degree of N- or O-linked glycosylation. Exactly how many additional GPI-anchored schizont proteins remain to be assigned is uncertain, but it is probably less than nine as it is likely that at least some of the 2D spots represent different proteolytic or oligomeric forms of the same protein.
Using the assignments from the 1D and 2D analyses (Table I), the relative copy number of GPI-anchored proteins was estimated from densitometric scans of autoradiographs of [3H]glucosamine-labeled proteins. Because each GPI-anchored protein has a single anchor, the level of radiolabel is proportional to the number of molecules of each protein. Remarkably the 11 identified GPI-anchored proteins constitute
94% of the GPI-anchored schizont proteome (Fig. 4). Hence even when considered together the nine unidentified 2D gel spots are clearly low in copy number. Individually MSP-1, MSP-2, and RAMA were the most abundant, comprising
31, 21, and 10% of all GPI-anchored proteins, respectively (Fig. 4). Of these, only MSP-1 and MSP-2 are located on the surface, and hence this analysis demonstrates that these two proteins dominate the merozoite surface, comprising approximately two-thirds of all the GPI-anchored proteins residing in this location. Four proteins (Pf38, Pf12, Pf92, and MSP-10) each comprise
5% of the GPI-anchored proteome. MSP-4 and -5 could not be clearly separated by 2D gel electrophoresis and, similar to ASP and Pf34, each represent
23% of GPI-anchored proteins at the schizont stage.
|
|
|
100 aa) were not observed in our blood-stage proteomic analysis. Proteins possessing both a GPI anchor and an internal TMD are exceedingly rare, therefore all proteins with a total of three or more TMDs were also rejected as likely false positives (Supplemental Table S4). Some proteins with high GPI-HMM scores contained two or more charged residues in the last 15 aa that would tend to disrupt the hydrophobic TMD of typical GPI anchor signals. We consider it unlikely that these proteins are anchored, and the reason they received high scores was that their ER signal sequences and processing sites for addition GPI anchors were very similar to the training set, although the potential of the C-terminal amino acids to form a TMD was low (Supplemental Table S4). In all, 26 of 60 positive scoring proteins survived the manual cull and were considered most likely to be GPI-anchored (Table II). Another four proteins that were just below the cutoff were included in the final list because they retained low p values (0.010.02) (Table II and Supplemental Table S4). This list includes all 11 biochemically validated blood-stage proteins as well as CSP and RMP-1, two proteins for which considerable evidence of GPI anchoring is available (45, 46). Note also that two biochemically validated GPI-anchored proteins (Pf92 and Pf34) were not included in the training set yet scored highly (Table II). Also present on the final GPI-HMM list but not included in the training set were several genes (PFF0620c, PFD0215c/Pfs36p, and PF10_0302/Pfs28) independently predicted to be GPI-anchored based on homology to known GPI-anchored surface proteins (47, 48). Also none of the proteins on this list have been shown not to possess a GPI anchor.
Expression Profile and Domain Structure of GPI-anchored Proteins
Transcription profiles for each of the genes encoding the 30 GPI-anchored proteins predicted by GPI-HMM were extracted from the Affymetrix (multiple life stages) and glass slide 70-mer microarray (blood stage only) databases (34, 49). Transcription of 28 of these genes was detected in the Affymetrix array analysis, and a Venn diagram displaying the expression patterns in blood, gametocyte, and sporozoite stages was generated (Fig. 6). Note that RAMA was excluded from this analysis as probes corresponding to this gene were not present on either microarray chip, whereas PFF0620c transcripts were not detected in any life stage. Expression levels (according to the Affymetrix arrays) of strictly less than 10% of the total level of transcription for a particular gene were considered background and were not taken into consideration in assigning stage specificity. Seventeen of the 28 (61%) genes were assigned to just one of the blood, gametocyte, or sporozoite stages (Fig. 6). Eight genes demonstrated a degree of expression in two stages, whereas three genes (Pf38, Pf12, and msp-4) appear to be expressed at a reasonable level in all stages (Fig. 6). More detailed transcriptional profiles are displayed in Fig. 7.
|
|
In addition to the known gamete and ookinete surface antigens, two novel proteins, PFE0220w and PFF0975c, are transcribed in gametocytes and are likely to be present on the surface of one or the other of these extracellular stages. Similarly in addition to the known sporozoite antigens, CSP and P36p, several novel GPI-anchored proteins are predicted to be present at this life stage, most notably PFC0750w and PF08_0008 (Fig. 7).
P. falciparum GPI-anchored proteins vary tremendously in size with their predicted proproteins ranging between 24 kDa (Pfs25/Pfs28) and 195 kDa (MSP-1) (Fig. 7). One feature shared by most predicted P. falciparum GPI-anchored proteins is the presence of cysteine-rich domains that probably form disulfide bonds and contribute to globular structures. In some cases these domains have homology to domains of other eukaryotic proteins. EGF-like domains are found in MSP-1, -4, -5, -8, -10, Pfs25, Pfs28, and chr10.phat_47, and a sushi domain is seen in ASP (37).
One particularly interesting group of cysteine-rich proteins are the six-cysteine (Cys6) protein family (4, 47). These proteins appear to be restricted to the phylum Apicomplexa and are present as paired domains in the GPI-anchored proteins Pf12, Pf38, Pf47, Pfs48/45, Pf52 (also known as Pf36p), and PFF0620c. In this study we assigned GPI anchors to Pf38 and Pf12, two proteins recently identified in schizont-stage detergent-resistant membranes (4). A previous study suggested that mRNA of the rodent malaria orthologue of Pf38 is present in gametocytes (48), but we showed here and in a prior study (4) that Pf38 is also expressed in asexual blood stages. Evidence for Pf12 expression was first detected in a COS cell expression library of P. falciparum genomic DNAs screened with malaria immune African serum (50). We confirmed here that Pf12 is translated in blood stages, and its protein is present at levels similar to those of Pf38; both proteins each comprise
5% of total GPI-anchored proteins in schizonts. These Cys6 proteins appear to be related to the surface antigen protein family of Toxoplasma gondii that are mostly GPI-anchored and are thought to bind the sulfated proteoglycans on host cells (51, 52). Pf12 and Pf38 are natural antigens and are localized to the plasma membrane of mature merozoites, although Pf38 also localizes to apical organelles (4). As Pf12 and Pf38 appear also to be expressed in gametocytes and pre-erythrocytic stages, there is a possibility that vaccines derived from these proteins could act during multiple points of the life cycle of the parasite (4).
Many other predicted GPI-anchored proteins possess cysteine-rich regions that remain to be characterized (Fig. 7D, green boxes). Once such protein is Pf92 that was also discovered in a proteomic analysis of detergent-resistant membrane preparations and is localized to the merozoite surface (4). The mature form of Pf92 contains 14 cysteine residues that could form up to seven disulfide bridges. We could not detect any other proteins that were homologous to Pf92 in the P. falciparum genome, but in rodent and primate Plasmodium species orthologues are clearly present (data not shown).
Another feature shared by many GPI-anchored proteins is the presence of runs of short repetitive amino acid motifs that are generally hydrophilic and probably lie on the protein surface (Fig. 7D). It has been noted previously that the length and copy number of these motifs is greater in asexual antigen proteins compared with other Plasmodium proteins (53). It has been proposed that these repetitive regions serve as a "smoke-screen" to elicit a strong but ineffective immune response from the host and have expanded recently as a result of selective pressure (54). That many of the GPI-anchored proteins expressed in asexual blood-stage merozoites and therefore exposed to the immune system contain repetitive arrays is in agreement with this proposal.
In summary, in this study we used two approaches to predict the entire GPI-anchored proteome of P. falciparum. First, as the blood stages are amenable to continuous in vitro culture we used a proteomic approach to identify [3H]glucosamine-labeled proteins at this life stage, all of which are assumed to possess GPI anchors. Consistent with a previous study (55), we showed that GPI-anchored proteins are mostly synthesized late in the blood-stage cycle and hence are expected to be present on the surface or in the secretory organelles of the invasive merozoite form of the parasite. Thirty [3H]glucosamine-labeled species were observed by 2D gel electrophoresis, and 21 of these were assigned to 11 GPI-anchored proteins. Hence a maximum of nine GPI-anchored proteins remain to be identified in schizonts. Even when considered together these unidentified proteins only represent a very small proportion (
6%) of the total GPI-anchored protein content at this life stage. In this regard, this study represents the first formal examination of the stoichiometry of P. falciparum merozoite surface proteins and confirms a widely held suspicion; just two proteins, MSP-1 and MSP-2, comprise a large proportion (approximately two-thirds) of the membrane protein surface coat. Despite this, a number of other GPI-anchored proteins, including the newly identified proteins Pf92, Pf38, Pf34, and Pf12, are present at relatively high copy number.
Second, in parallel with the proteomic approach we used predictive software to rank all proteins in the P. falciparum genome according to the likelihood that they are GPI-anchored. We observed that pre-existing GPI prediction programs performed unsatisfactorily with P. falciparum genome data and failed to predict many proteins known to bear GPI anchors. For this reason we created a new algorithm to predict GPI-anchored P. falciparum proteins. A final list of 30 GPI-anchored was derived, a list that includes a number of novel proteins predicted to be present on the surface of one or more of the extracellular life stages.
An important aspect of future studies is to establish the functions of these proteins. Thus far, significant progress has only been made in understanding the function(s) of GPI-anchored proteins expressed exclusively in sexual and pre-erythrocytic stages because their genes can be disrupted or modified by insertional mutagenesis in blood-stage cultures. Gene knock-outs have revealed that Pfs48/45 is required for fusion of the microgametocyte to the macrogametocyte possibly by acting as an adherence molecule (56). Knockouts of the ookinete-stage Pbs21 and Pbs25 implicate these proteins, particularly the former, in protecting the parasite from host defenses because oocyst maturation is curtailed in Pbs21 knock-outs (57). CSP is the dominant surface coat protein on sporozoites and is essential for sporozoite formation in the oocyst (58), and importantly in the context of the present study, recent mutational analysis of the putative CSP GPI attachment sequence demonstrated the functional importance of this mode of membrane attachment (46). Another sporozoite protein, P36p, is a GPI-anchored Cys6 domain-containing protein that appears to be involved in the recognition and/or invasion of hepatocytes (59, 60).
Insertional mutagenesis of GPI-anchored proteins expressed during late blood stages is technically difficult, although gene modifications have provided some functional insight into the double EGF domains of MSP-1 (61, 62). Of attempts to knock out genes encoding seven schizont-stage P. falciparum GPI-anchored proteins, only one (that of MSP-5) has proven successful.3 Most GPI-anchored proteins are therefore probably necessary for blood-stage growth, and other methodologies will be required to resolve their functions. Newly developed techniques in P. falciparum such as inducible gene expression may prove useful in this regard (63).
| ACKNOWLEDGMENTS |
|---|
| FOOTNOTES |
|---|
Published, MCP Papers in Press, April 7, 2006, DOI 10.1074/mcp.M600035-MCP200
1 The abbreviations used are: GPI, glycosylphosphatidylinositol; MSP, merozoite surface protein; HMM, hidden Markov model; RAMA, rhoptry-associated membrane antigen; ASP, apical sushi protein; EGF, epidermal growth factor; ER, endoplasmic reticulum; CSP, circumsporozoite protein; TX-114, Triton X-114; 1D, one-dimensional; 2D, two-dimensional; TMD, transmembrane domain; aa, amino acids. ![]()
2 J. Kronegg and D. Buloz, personal communication. ![]()
3 Sanders, P. R., Kats, L. M., Drew, D. R., ODonnell, R. A., ONeill, M., Maier, A. G., Coppel, R. L., and Crabb, B. S. (2006) A set of GPI-anchored membrane proteins of Plasmodium falciparum are refractory to genetic deletion. Infect. Immun., in press. ![]()
* This work was supported in part by the National Health and Medical Research Council of Australia and the Wellcome Trust (UK). The costs of publication of this article were defrayed in part by the payment of page charges. This article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. ![]()
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. ![]()
Both authors contributed equally to this work. ![]()
** International research scholars of the Howard Hughes Medical Institute. ![]()

To whom correspondence should be addressed: The Walter and Eliza Hall Inst. of Medical Research, 1G Royal Parade, Parkville, Victoria 3050, Australia. Tel.: 61-3-9345-2555; Fax: 61-3-9347-0852; E-mail: crabb{at}wehi.edu.au
| REFERENCES |
|---|
|
|
|---|