If you don't remember your password, you can reset it by entering your email address and clicking the Reset Password button. You will then receive an email that contains a secure link for resetting your password
If the address matches a valid account an email will be sent to __email__ with instructions for resetting your password
Gene expression in trypanosomes is mediated by noncanonical mechanisms.
•
Trypanosome mRNA nuclear export system comprises unique proteins to kinetoplastids.
•
The present work highlights an amalgam of kinetoplastid-specific and conserved components.
•
Our data support a highly coupled mRNA maturation pathway.
Abstract
In trypanosomatids, transcription is polycistronic and all mRNAs are processed by trans-splicing, with export mediated by noncanonical mechanisms. Although mRNA export is central to gene regulation and expression, few orthologs of proteins involved in mRNA export in higher eukaryotes are detectable in trypanosome genomes, necessitating direct identification of protein components. We previously described conserved mRNA export pathway components in Trypanosoma cruzi, including orthologs of Sub2, a component of the TREX complex, and eIF4AIII (previously Hel45), a core component of the exon junction complex (EJC). Here, we searched for protein interactors of both proteins using cryomilling and mass spectrometry. Significant overlap between TcSub2 and TceIF4AIII-interacting protein cohorts suggests that both proteins associate with similar machinery. We identified several interactions with conserved core components of the EJC and multiple additional complexes, together with proteins specific to trypanosomatids. Additional immunoisolations of kinetoplastid-specific proteins both validated and extended the superinteractome, which is capable of supporting RNA processing from splicing through to nuclear export and cytoplasmic events. We also suggest that only proteomics is powerful enough to uncover the high connectivity between multiple aspects of mRNA metabolism and to uncover kinetoplastid-specific components that create a unique amalgam to support trypanosome mRNA maturation.
Trypanosoma cruzi is the causative agent of Chagas disease, a significant infection of humans and animals in Latin America where 10 million people are estimated to be infected (
First WHO Report on Neglected Tropical Diseases: Working to Overcome the Global Impact of Neglected Tropical Diseases. World Health Organization,
Geneva2010: 1-184
). Climate change and human migration have both increased the range of the disease and risk for much of the Americas and Europe; recent global events will likely inflate this further, as well as compromise the ability of healthcare organizations to deliver therapeutics (
). Trypanosomatids diverged some 1 billion years ago from other eukaryotes and adapted to specific hosts and life cycle demands through multiple alterations in metabolic pathways, stage-specific gene expression, and other mechanisms (
). Novel aspects of these processes in trypanosomes have attracted considerable interest, both through providing insight into diverse mechanisms of mRNA processing and representing possible therapeutic targets (
Unlike most eukaryotes, trypanosome genes are transcribed as polycistronic RNAs, frequently encompassing tens of open reading frames, while mature mRNAs (processed by trans-splicing and polyadenylation) exhibit differential abundance as a consequence of posttranscriptional regulation (
). Cytosolic RNA granules also impact mRNA abundance by interacting with the translation and stability machinery to regulate expression during differentiation processes, mainly in response to biological stimuli such as heat shock or starvation stress (
Translation regulation and RNA granule formation after heat shock of procyclic form Trypanosoma brucei: Many heat-induced mRNAs are also increased during differentiation to mammalian-infective forms.
). Trypanosomes have multiple distinct RNA granules, and their composition is divergent from animals and fungi. For example, during translational repression, some populations of RNA granule are compositionally stable (
Inhibition of mRNA maturation in trypanosomes causes the formation of novel foci at the nuclear periphery containing cytoplasmic regulators of mRNA fate.
). However, it remains unclear how these complexes interact with the nuclear components involved in RNA export.
In animals and fungi, mRNA export is initiated by cotranscriptional association of nascent mRNAs with the THO complex. THO is a conserved multimeric complex required for mRNP biogenesis consisting of at least four components: Tho2, Hpr1, Mft1, and Thp2 (
). THO also associates with two mRNA export factors, UAP56 and ALY (Sub2 and Yra1 in budding yeast), to form the TREX complex that couples transcription with mRNA nuclear export (
). A second complex, TREX-2, links transcription and the nuclear pore complex (NPC) by facilitating transfer of mature mRNPs from the nuclear interior to the NPC (
), to interact with the NPC and export mRNA to the cytoplasm. TAP/Mex67 share a modular domain organization that includes the aminoterminal RNA recognition motif (RRM) followed by Nuclear Transport Factor 2-like (NTF2L) and the ubiquitin associated (UBA) (
Alternatively, UAP56/Aly can also be recruited by the exon-junction complex (EJC), which is composed of eIF4AIII, MLN51 (also known as BTZ/Barentsz), Y14 and Mago (or Magoh/Mago-nashi) in mammalian cells. The EJC is loaded onto immature mRNA molecules during cis-splicing at a conserved position upstream of the exon–exon junction (
) to form a stable mRNA export complex and is only removed from mRNP complexes when translation is initiated in the cytoplasm; hence, the EJC accompanies an mRNA molecule throughout nuclear export (
). For example, the EJC can bind to the mRNA export factors REF/Aly and TAP/p15 to enhance the efficiency of transport of spliced mRNAs into the cytoplasm (
). EJC complex components have diversified in individual lineages, and not all components are retained universally, for example, MLN51 is not conserved in some trypanosomes (
). Some EJC components have been identified in trypanosomes, but the direct association with the mRNA export machinery remains to be demonstrated.
Although many components of the RNA export machinery are known from animals, fungi, plants, and other organisms, in silico identification of components in trypanosomatid genomes is unreliable due to high divergence and few proteins appear conserved or recognizable (
). In fact, several nuclear complexes and nuclear-pore-associated proteins that play major roles in mRNA export in opisthokonts are lacking in trypanosomes, and a recent comparative review described the main differences in nuclear mRNA maturation and mRNA export (
), confirming the presence of unique structures and potentially parasite-specific interactions for trypanosome RNA export. Further, Mex67 associates with Ran in trypanosomes, which is distinct from animals and fungi. Trypanosomes can initiate mRNA nuclear export cotranscriptionally, and initiation of export does not depend on completion of trans-splicing, suggesting that trypanosomes regulate completion as opposed to initiation of export. This is supported by the divergence of the cytoplasmic face composition of the NPC in trypanosomes (
) together with cytoplasmic nuclear peripheral granules (NPGs), but the components of both the NPC and export machinery responsible await identification (
Progress in characterizing the trypanosome NPC, kinetochores, and nuclear lamina has been achieved mainly through proteomics, essentially the only available approach due to high primary structure divergence (
). Here we have also explored proteomics with cryomilling and immunoisolation to identify proteins associated with validated components of the mRNA export pathway in T. cruzi, TcSub2, and TceIF4AIII. TcSub2 is a TREX complex protein involved in mRNA export (
). We identify not only conserved components but also proteins specific to trypanosomatids. Among them, we identified a kinetoplastid-specific NTF2-like protein that interacts with Mex67, crucial for mRNA export. These observations indicate considerable divergence in the composition and function of trypanosome mRNA export factors.
Experimental Procedures
Parasite Culture
T. cruzi Dm28c epimastigotes were maintained in axenic culture in liver infusion tryptose (LIT) medium at 28 °C (
). Procyclic forms of T. brucei Lister 427 and RNA interference T. brucei 29-13 line were maintained in SDM-79 medium supplemented with 10% fetal bovine serum at 28 °C (
). Pullout of TcSub2::GFP was achieved using a buffer containing 20 mM HEPES pH 7.4, 10 μM CaCl2, 1 mM MgCl2, 10 mM sodium citrate, 0.1% CHAPS (w/v), and protease inhibitors (COMPLETE Mini Protease inhibitor cocktail tablet, Roche); for TceIF4AIII, TcMago, TcNTF2L, and TcMex67 a buffer with 20 mM HEPES pH 7.4, 10 μM CaCl2, 1 mM MgCl2, 50 mM sodium citrate, 0.1% Triton X-100 (v/v), and protease inhibitors (COMPLETE Mini Protease inhibitor cocktail tablet, Roche); and for TcFOP and TcAPI5, a buffer with 20 mM HEPES pH 7.4, 10 μM CaCl2, 1 mM MgCl2, 50 mM sodium citrate, 0.1% Triton X-100 (v/v), and 10% glycerol (v/v). Aliquots of purified complexes were separated on 4 to 12% NuPAGE Novex Bis-Tris precast gels (Life technologies) and stained with SilverQuest Silver Staining (Life technologies).
For TcFOP, TcAPI5, TcNTF2L, and TcHYP, the complex was analyzed by mass spectrometry. T. cruzi cell extracts were prepared using 5 × 108 parasites lysed in 1 ml of citrate buffer (50 mM sodium citrate, 20 mM Hepes pH 7.4, 1 mM MgCl2, 1 μM CaCl2, 0,5% Triton X-100, protease inhibitor (COMPLETE Mini Protease inhibitor cocktail tablet, Roche) by sonication. Pullout of TcAPI5::GFP, TcFOP::GFP, TcNTF2L::GFP, and TcHYP::GFP was achieved by affinity purification of GFP-tagged proteins (
). For TcFOP and TcAPI5 pullout, 10% glycerol was added to citrate buffer. The complexes were eluted using 15 μl of elution buffer (2% SDS, 20 mM Tris-HCl pH 8.0) at 72 °C for 20 min. Aliquots of purified complexes were separated on 13% SDS-PAGE and stained with Silver Staining protocol.
Proteomics: Experimental Design and Statistical Rationale
Initially, TcSub2 and TceIF4AIII complexes were affinity purified and, for those analyses, three different technical replicates were performed for each TcSub2::GFP or TceIF4AIII::GFP complex. In each experiment, two samples were compared: TcSub2::GFP or TceIF4AIII::GFP pullout with a wild-type pullout (control). Next, samples were processed and identified by LC-MS/MS as previously described. Proteins were compared between TcSub2 and TceIF4AIII complex with the control samples, and only proteins identified in at least two replicates and with emPAI index >0.1 were considered for analyses.
For TcMex67, TcAPI5, TcFOP, TcNTF2L, and TcHYP complexes, different replicate numbers were processed or reprocessed in different experiments to increase the reliability and the coverage. Three biological replicates were used for TcMex67 analyses. For TcAPI5, seven replicates were used: three biological replicates, but technical triplicates were performed for two biological replicates. For FOP, TcNTF2L, and TcHYP, five technical replicates were used for each protein: three biological replicates, but technical replicates were performed for one biological replicates. In each replicate, two samples were compared: target pullout and wild-type pullout (control). The TcMex67, TcAPI5, TcFOP, TcNTF2L, and TcHYP data analysis was performed based on the averages of all replicates and comparing the protein intensities between the target and control samples. The fold change ≥2 was considered to select the proteins.
Mass-Spectrometry-Based Proteomics
For TcSub2 and TceIF4AIII samples, mass spectrometry was performed at the FingerPrints Proteomics Facility at the University of Dundee (Scotland, UK). Samples were purified and desalted by electrophoresis into 4 to 12% NuPAGE Novex Bis-Tris precast gel (Life technologies), silver-stained, excised, and dehydrated in a SpeedVac (Thermo Scientific). The dehydrated gel was digested with trypsin (Modified Sequencing Grade, Roche) for 18 h at 30 °C. Tryptic peptides were extracted from the gel with 95% Acetonitrile/5% Formic acid (v/v) and captured with an Acclaim PepMap 100 system (C18, 100 μm × 2 cm) and fractionated on a C18 Easy-Spray PepMap RSLC (75 μm × 50 cm) (Thermo Scientific) columns with an Ultimate 3000 RS LCnano system (Thermo Scientific) coupled to an LTQ OrbiTrap Velos Pro (Thermo Scientific). For identification of proteins, the Mascot Search Daemon (Version 2.4.1) (Matrix Science) (http://www.matrixscience.com/) was used with a custom T. cruzi protein sequence library of 54,610 sequences from five different strains (CL Brener Esmeraldo-like, CL Brener non-Esmeraldo-like, Sylvio, Dm28c, and Marinkellei, downloaded on February 4, 2015 from Uniprot). This strategy increased coverage and hence the likelihood of identification of peptides. MS tolerance allowed up to two missed and/or nonspecific tryptic cleavages, and peptide mass tolerance and fragment mass tolerance were set as ±10 ppm and ±0.6 Da, respectively. Carbamidomethylation (C) was set as a fixed modification, and Acetyl (N-term), Dioxidation (M), Glu->pyro-Glu (N-term E), Oxidation (M) were included as variable modifications. FDR was calculated using a reverse translated proteome. This target-decoy strategy uses along with the target database a decoy database (randomized database, reverse target database used in maxquant). The number of matches to this decoy database equals the number of random matches (false positives) obtained in the target database and is used to calculate local or global FDRs. The relative abundance of proteins was estimated using the exponentially modified protein abundance index (emPAI score), determined in a peptide mixture based on the coverage of peptides from the identified protein (
Exponentially modified protein abundance index (emPAI) for estimation of absolute protein amount in proteomics by the number of sequenced peptides per protein.
TcMex67, TcAPI5, TcFOP, TcNTF2L, and TcHYP samples were processed at the mass spectrometry facility at the Carlos Chagas Institute/FIOCRUZ-PR (RPT02H PDTIS/Instituto Carlos Chagas—Fiocruz Paraná, Brazil). Samples were purified by electrophoresis into SDS–polyacrylamide gels; lanes were excised and dehydrated with 100% ethanol and dried in a vacuum centrifuge. After reduction with 10 mM DTT, 50 mM ammonium bicarbonate (ABC), samples were alkylated with 50 mM iodacetamide, 50 mM ABC. Gels were washed with 50 mM ABC and dehydrated with 100% ethanol, and this step was repeated. Then, gels were incubated with 12.5 ng/ml trypsin (Promega, V5113), 50 mM ABC at 37 °C for 18 h. After trypsinization, trifluoroacetic acid (TFA) was added to a final concentration of 0.5%. Peptides were extracted from the gel matrix through incubation twice with 30% MeCN, 3% TFA, and twice with 100% MeCN. The extracted peptide solution was concentrated in a vacuum centrifuge to remove MeCN and desalted with homemade C18 spin columns and analyzed by LC-MS/MS in a Thermo Scientific Easy-nLC 1000 system coupled to a LTQ Orbitrap XL ETD. Peptide separation was carried out in a 30 cm (75 μm inner diameter) fused silica in-house packed column with reverse-phase ReproSil-Pur C18-AQ 1.9 μm resin (Dr Maisch GmbH). Chromatography runs were performed with a flow rate of 250 nl/min from 5 to 40% MeCN in 0.1% formic acid, 5% DMSO in a 120 min gradient. The mass spectrometer operated in a data-dependent mode to automatically switch between MS and MS/MS (MS2) acquisition. Survey full-scan MS spectra (at 300–2000 m/z range) were acquired in the Orbitrap analyzer with resolution of 60,000 at m/z 400 (after accumulation to a target value of 1,000,000 in the C-trap). The 12 most intense ions were sequentially isolated and fragmented in the linear ion trap using collision-induced dissociation at a target value of 30,000. The “lock mass” option was enabled in all full scans to improve mass accuracy of precursor ions (
) for database searching. Default parameters of the software were used for all analysis steps, unless stated otherwise. Proteins were searched against a “decoy database” prepared by reversing the sequence of each entry of the T. cruzi CL Brener protein sequence database (containing 19,242 protein sequences, downloaded on August 10, 2016 from Uniprot) and appending them to the forward sequences. This database was complemented with frequently observed contaminants (porcine trypsin, Achromobacter lyticus lysyl endopeptidase, and human keratins) and their reversed sequences. Search parameters specified an MS tolerance of 20 ppm, an MS/MS tolerance of 0.5 Da, and full trypsin specificity, allowing for up to two missed cleavages. Carbamidomethylation of cysteine was set as a fixed modification, and oxidation of methionine and N-terminal acetylation (protein) were allowed as variable modifications.
For validation of the identifications, a minimum of seven amino acids for peptide length and two peptides per protein was required. In addition, an FDR threshold of 1% was applied at both peptide and protein levels. When transforming peptide identifications into protein identifications, similar protein sequences (e.g., isoforms) present in the database that could not be distinguished by the experimentally detected peptides were grouped and referred to as protein groups. Protein quantification was performed using a label-free approach, where peptides eluting from each LC run are detected as three-dimensional features—retention time versus signal intensity (extracted ion chromatogram, XIC) versus mass/charge—aligned and compared across runs, as previously described (
The mass spectrometry proteomics data have been deposited to the ProteomeXchange Consortium via the PRIDE (https://www.ebi.ac.uk/pride/) partner repository with the dataset identifier PXD028072.
Bioinformatic Analyses
For TceIF4AIII phylogenetic analysis, database searches were done using BLASTp (
) and ortholog proteins identified by reciprocal best hit (RBH). Protein sequences from representative species were obtained from the NCBI database and included proteomes from Saccharomyces cerevisiae (fungi), Homo sapiens (metazoa), Dictyostelium discoideum (amoebozoa), Arabidopsis thaliana (Viridiplantae), Plasmodium falciparum (Chromalveolata), Toxoplasma gondii (Chromalveolata), T. cruzi (Excavata), T. brucei (Excavata), and Leishmania major (Excavata). For calculating the similarity and identity among amino acid sequences (supplemental Table S1, sheet2), the Needle program from the EMBOSS package was used (
), which allows a global alignment between two protein sequences. Phylogenetic analysis was performed using maximum likelihood (ML) by FastTree 2.1 using WAG model (
). The phylogenetic tree was drawn using FigTree v1.4 program (http://tree.bio.ed.ac.uk/software/figtree/). TcHYP ortholog proteins were identified by RBH criteria and synteny evaluation. A total of 108 positions with more reliable alignment obtained by PSI-coffee (
) composed the final dataset. The tree was rooted using the Bodo saltans sequence. An evolutionary tree of TcHYP (TcCLB.506435.150) orthologs was inferred by using MrBayes 3.2.6 (
). Markov Chain Monte Carlo (MCMC) was run for 1,000,000 generations, sampling trees every 1000 generations. An ML tree (JTT +F + G + I) was also inferred using MegaX (
) with 1000 bootstrap replicates. For TcAPI5, TcFOP, and TcNTF2L, T. cruzi CL Brener Non-Esmeraldo-like sequences were obtained from TriTrypDB version 33 and used as queries for BLASTp against a database containing 121 eukaryotic predicted proteomes (supplemental Table S2) with an e-value threshold of 0.1. All hits identified were subject to reverse BLASTp using an e-value threshold of 0.1. Orthology was predicted based upon by RBH criteria being the original T. cruzi query sequence. Additional hits were identified using sequences obtained from B. saltans and Dictyostelium purpureum. Predicted orthologous sequences were aligned using MUSCLE (
), respectively. PhyML 3.0 was used with default settings, and a bootstrap of 1000 and MrBayes 3.2.6 was used with an MCMC generation of 800,000, Γ shape rate variation with four discreet categories and a sampling frequency of 1000 with the first quarter as burn-in. MrBayes trees were run using the CIPRES Science Gateway portal (
Interactome visualization was performed using Cytoscape (available in https://cytoscape.org) from selected data from Tables 1 and 2. A static representation was generated from Cytoscape and processed using Affinity Designer (available in https://affinity.serif.com/en-gb/).
Table 1Lists of filtered proteins identified in TceIF4AIII and TcSub2 isolates
Protein name according to conserved and specific domain identified using INTERPRO, HMMER, PFAM, or ortholog in NCBI nr database. Bold represents kinetoplastid-specific granules, and italic represents nuclear peripheral granules.
Location based on TrypTag, mass spectrometry, or direct experimental evidence. Restricted to either T. cruzi or T. brucei evidence, except in case of highly conserved ortholog, where evidence from animal or fungal cells is accepted.
a Protein name according to conserved and specific domain identified using INTERPRO, HMMER, PFAM, or ortholog in NCBI nr database. Bold represents kinetoplastid-specific granules, and italic represents nuclear peripheral granules.
b Location based on TrypTag, mass spectrometry, or direct experimental evidence. Restricted to either T. cruzi or T. brucei evidence, except in case of highly conserved ortholog, where evidence from animal or fungal cells is accepted.
Protein name according to conserved and specific domain identified using INTERPRO, HMMER, PFAM, or ortholog in NCBI nr database. Bold represents kinetoplastid-specific granules, and italic represents nuclear peripheral granules.
Location based on TrypTag, mass spectrometry, or direct experimental evidence. Restricted to either T. cruzi or T. brucei evidence, except in case of highly conserved ortholog, where evidence from animal or fungal cells is accepted.
Protein name according to conserved and specific domain identified using INTERPRO, HMMER, PFAM, or ortholog in NCBI nr database. Bold represents kinetoplastid-specific granules, and italic represents nuclear peripheral granules.
Location based on TrypTag, mass spectrometry, or direct experimental evidence. Restricted to either T. cruzi or T. brucei evidence, except in case of highly conserved ortholog, where evidence from animal or fungal cells is accepted.
Protein name according to conserved and specific domain identified using INTERPRO, HMMER, PFAM, or ortholog in NCBI nr database. Bold represents kinetoplastid-specific granules, and italic represents nuclear peripheral granules.
Location based on TrypTag, mass spectrometry, or direct experimental evidence. Restricted to either T. cruzi or T. brucei evidence, except in case of highly conserved ortholog, where evidence from animal or fungal cells is accepted.
Protein name according to conserved and specific domain identified using INTERPRO, HMMER, PFAM, or ortholog in NCBI nr database. Bold represents kinetoplastid-specific granules, and italic represents nuclear peripheral granules.
Location based on TrypTag, mass spectrometry, or direct experimental evidence. Restricted to either T. cruzi or T. brucei evidence, except in case of highly conserved ortholog, where evidence from animal or fungal cells is accepted.
a Protein name according to conserved and specific domain identified using INTERPRO, HMMER, PFAM, or ortholog in NCBI nr database. Bold represents kinetoplastid-specific granules, and italic represents nuclear peripheral granules.
b Location based on TrypTag, mass spectrometry, or direct experimental evidence. Restricted to either T. cruzi or T. brucei evidence, except in case of highly conserved ortholog, where evidence from animal or fungal cells is accepted.
The open reading frames of TcAPI5, TcFOP, TcNTF2L, and TcMago were amplified by PCR using specific oligonucleotide primers as shown in supplemental Table S3. T. cruzi Dm28c genomic DNA was used as template. The PCR products were cloned into the pDONR221 vector from Gateway technology (Invitrogen) and further recombination into the pDEST17 vector (Invitrogen) to produce his-tagged recombinant protein, according to the manufacturer’s protocol. Expression of recombinant protein was induced in Escherichia coli BL21 (DE3) by addition of 1 mM IPTG and incubation for 3 h at 37 °C. His-tagged protein was purified by affinity chromatography on Ni-NTA resin (Qiagen) under denaturing conditions and used to inoculate mice to produce polyclonal antibodies, according to Inoue et al. (2014) (
). Animals were immunized by intraperitoneal injection with approximately 50 μg of the his6-tagged protein in Freund's complete adjuvant (Sigma) for the first inoculation. For the three booster injections, we used 20 μg of the recombinant protein in Alu-Gel (Serva) administered at 2-week intervals. Antiserum was obtained 5 days after the last booster injection and by blood collection via cardiac puncture. The protocol was approved by Ethics Commission on Animal Use (CEUA) from FICRUZ, protocol 47/12-3, license LW15/13; project title: "Characterization of trypanosome proteins."
Immunoblotting
Proteins were separated by gel electrophoresis (SDS-PAGE) and transferred to a nitrocellulose membrane (Hybond C, Amersham Biosciences). The membrane was blocked with 0.1% Tween 20 and 5% milk in phosphate-buffered saline (PBS). Primary antibodies were diluted in blocking solution at the following concentrations: mouse anti-TceIF4AIII (diluted 1:500) (
), mouse anti-TcAPI5 (diluted 1:500), mouse anti-TcFOP (diluted 1:500), mouse anti-TcNTF2L (diluted 1:500), mouse anti-TcMago (diluted 1:500), rabbit anti-GFP (kindly provided by Dr Stenio Fragoso Perdigão, diluted 1:500), mouse anti-GFP (Roche, diluted 1:500), rat anti-HA (Roche, diluted 1:500). Antibodies were incubated with the membrane for 1 h. The membrane was washed three times in 0.1% Tween 20 in PBS. Bound antibodies were detected by fluorescence with IRDye whole IgG secondary antibody (LI-COR). Relative quantification of Western blots was performed using ImageJ software (available in https://imagej.nih.gov/ij/).
Protein Tagging
The coding sequences of TceIF4AIII, TcMago, TcAPI5, TcHYP, TcFOP, and TcFOPlΔFOP, a mutated version of TcFOP without the FoP domain, cloned previously into pDONR221 (Invitrogen) were recombined pTcGW (
) for C-terminal eGFP/PTP tagging. T. cruzi epimastigotes were transfected with these plasmids, and stable lines were selected by adding 500 μg/ml G418 to the culture medium. Cell lines carrying TcSub2 tagged at N-terminal with GFP and TcMex67 tagged at C-terminal with PTP were described in Serpeloni et al. (2011) (
), respectively. In. T. brucei, cell lines expressing HA-tagged or GFP-tagged TbMago were generated by in situ tagging with the pMOTag4H or pMOTag4G plasmid, as described previously in Oberholzer et al. (2006) (
). The parasites were analyzed by FACSAria II (BD) in the flow cytometry facility at Carlos Chagas Institute–Fiocruz/PR. GFP-positive parasites were cloned using single cell precision mode into 96-well plate containing 100 μl of LIT medium/well and the selection antibiotic.
The endogenous and tagged proteins were localized by indirect immunofluorescence assays, as described by Serpeloni et al. (2011) (
). The parasites were incubated with specific antibodies for 1 h at 37 °C and then washed with PBS and incubated with Alexa Fluor 488-conjugated goat anti-mouse IgG or Alexa Fluor 594-conjugated goat anti-rabbit IgG antibodies (Invitrogen, 1:600 dilution), as appropriate, for 1 h. DNA was stained by incubation with 5 μg/ml DAPI for 15 min. Slides were analyzed by fluorescence microscopy (Nikon 80i), and images were captured with a CoolSnap PROcf (Media Cybernetics) camera and analyzed with Image Pro-Plus v. 4.5.1.22 (Media Cybernetics). Images were also obtained by inverted microscopy (Leica DMI6000B) associated with deconvolution software Leica AF6000 (microscope facility RPT07C PDTIS/Carlos Chagas Institute–Fiocruz Paraná).
RNA Interference
The coding sequences of orthologs of TcFOP (TbFOP, Gene ID Tb927.6.1470), TcAPI5 (TbAPI5, Gene ID Tb927.7.2240), TcNTF2L (TbNTF2L, Gene ID Tb927.10.2240), and TcHYP (TbHYP, Gene ID Tb927.04.3060) from T. brucei were analyzed by RNAit (
) to identify specific target nucleotide sequences for RNA interference. The oligonucleotides predicted by RNAit were used to amplify a DNA fragment (supplemental Table S3) for cloning into the p2T7-177 vector (
). A total of 10 μg insert-containing vector was linearized with NotI and transfected into procyclic forms of T. brucei 29-13. Transfected parasites were selected by the addition of 5 μg/ml phleomycin to the medium, and RNAi was induced by adding 2 μg/ml tetracycline to log phase parasite cultures.
mRNA in Situ Hybridization
To analyze the distribution of poly(A)+ RNA in T. brucei, a digoxigenin-conjugated oligo(dT) probe was used as previously protocol described in Inoue et al. (2014) (
). Probe binding was detected by indirect immunofluorescence analysis, as described above, with mouse monoclonal anti-digoxigenin antibody (Sigma-Aldrich, 1:300 dilution) and Alexa Fluor 488-conjugated goat anti-mouse IgG secondary antibody (Invitrogen, 1:600 dilution).
Quantitative PCR Analysis
T. brucei total RNA was extracted using TRIzol Reagent (Invitrogen). To remove residual DNA, the DNase treatment was performed with RQ1 (RNA Qualified) RNase-Free DNase (Promega) according to the manufacturer's instruction. RNA quantification was performed with NanoDrop One Spectrophotometer (Thermo Fisher Scientific), and 1 μg of RNA was run on a formaldehyde gel to check the integrity of RNA. To prepare templates for quantitative PCR, cDNA was synthesized from 1 μg of RNA by ImProm-II Reverse Transcription System (Promega), using random primers (Invitrogen. Real-time PCR was performed in triplicate with the SYBERSelect Master Mix (Applied Biosystems) on LightCycle 96 system (Roche).
qPCR analysis for TbAPI5, TbFOP, TbNTF2L, and TbHYP was normalized using the data obtained by amplification of 7SL RNA. supplemental Table S3 lists the primers used for qPCR, PCR conditions were as follows: preincubation at 95 °C for 10 min, followed by 65 cycles of denaturation at 95 °C for 20 s, annealing at 55 °C for 20 s, and extension at 72 °C for 20 s, one cycle of melting at 95 °C for 60 s, 40 °C for 60 s, 65 °C for 1 s, and 97 °C for 1 s. Expression levels were calculated by Pfaffl’s method (
), was used to isolate RNA samples. The induction of RNAi was performed in triplicate by adding 2 μg/ml tetracycline to log phase parasite culture. Total RNA from 1 × 109 cells was isolated using RNeasy Mini Kit (Qiagen) at 24- and 48-h postinduction. RNA was treated with DNAse1 on-column using a RNase-Free DNase Set (Qiagen) following the manufacturer protocol. The integrity was determined by nanoelectrophoresis using an Agilent 2100 Bioanalyzer (Agilent) and quantified by spectrophotometry using a NanoDrop One Spectrophotometer (Thermo Scientific). Approximately 5 μg RNA was sent to BGI services (China) for transcriptome sequencing using Illumina HiSeq 2000 technology (Agilent). The transcriptome data have been submitted to SRA at NCBI with the accession number SUB10190938. We used https://www.bioinformatics.babraham.ac.uk/projects/fastqc and MultiQC for quality control, and analysis was done with Snakemake script. We performed differential expression between each pair of conditions, using edgeR (specifically, exactTest), selecting only coding sequences with at least ten counts in at least one condition (mean across replicates). We used sleuth for differential expression of Salmon data, which simulates technical replicates and improves accuracy. There are 3736 transcripts “differentially expressed” across all three conditions (that is changed in at least one condition) out of 8485 total transcripts analyzed.
Results
TcSub2 and TceIF4AIII Share Kinetoplastid-Specific Protein Interactions
To identify components of the T. cruzi mRNA maturation machinery, we used T. cruzi Dm28c ectopically expressing N-terminal GFP fusions of TcSub2 (
). Cell grindates were directly thawed into buffers to determine optimum conditions for preservation and isolation of complexes. Following analysis by SDS-PAGE (Fig. 1A), where we confirmed the enrichment of the tagged-proteins, TcSub2 and TceIF4AIII isolates were analyzed by mass spectrometry and protein identifications filtered according to exponentially modified protein abundance index (emPAI). Proteins from three independent experiments with emPAI >0.1, and identified in at least two of the three replicates, are listed in the supplemental material (supplemental Table S4). In addition to proteins known to be involved with RNA metabolism, many additional proteins were recovered among the significant identifications (Table 1).
Fig. 1Identification of proteins copurified with TcSub2 and TceIF4AIII.A, affinity purified TcSub2 (Sub2) and TceIF4AIII complexes were analyzed on 4 to 12% NuPAGE Novex Bis-Tris precast gel (Life technologies) and stained with SilverQuest Silver Stain (Life technologies). As a negative control, cryomilled cell extracts from wild-type (WT) parasites were treated in the same conditions. Arrowheads indicate the bands corresponding to TcSub2 and TceIF4AIII. B, schematic representation of selected proteins identified by mass spectrometry: Trypanosome EJC proteins (TcMago, TcY14 and TcBTZL) and three novel proteins (TcAPI5, TcFOP, and TcNTF2L). The motifs and domains were predicted using INTERPRO, HMMER, and PFAM databases and are shown in colors.
There was considerable coincidence in the proteins interacting with TcSub2 and TceIF4AIII, together with specific interactions (Table 1). The coincident identifications suggest that TcSub2 and TceIF4AIII associate as a supercomplex, further supported by copurification of TcSub2 and TceIF4AIII. The TcSub2/TceIF4AIII cohort includes orthologs of PRP19-like, Dbp1, Alba1-3, and orthologs of DRBD2 and 3, all of which have defined functions in mRNA metabolism (
Two splicing factors carrying serine-arginine Two splicing factors carrying serine-arginine motifs, TSR1 and TSR1IP, regulate splicing, mRNA stability, and rRNA processing in Trypanosoma brucei.
Inhibition of mRNA maturation in trypanosomes causes the formation of novel foci at the nuclear periphery containing cytoplasmic regulators of mRNA fate.
), were also detected, indicating an interplay of transcription/mRNA export with other RNA metabolism pathways. The domain architectures of several hypothetical proteins were predicted and include representatives containing an FOP domain (TcFOP), a dual API5 domain (TcAPI5), and an NTF2-like domain (TcNTF2L), while others were identified as core components of the EJC (Fig. 1B). Phylogenetic reconstructions demonstrated that these proteins are restricted to the trypanosomatids and eubodonids and hence are likely kinetoplastid-specific mRNA export factors (Fig. 2 and supplemental Fig. S1). Among them, only NTF2L has been already described in trypanosomes as a component of mRNA export, both associated to EJC complex (
Fig. 2TcNTF2L orthologs were found in trypanosomatids and Eubodonida. A list of TcNTF2L homologs and the organisms searched are shown in supplemental Table S2. Red arrow head indicates Euglena, which is outside of the Kinetoplastoda lineage but a relative. Colors represent homolog proteins from representative species of distinct eukaryotic groups, which grouped into a distinct clade from TcNTF2L. Near the nodes are Bayesian posterior probabilities and ML bootstrap values. Bar indicates the estimated number of changed sites.
For TcNTF2L, while the N-terminal NTF2-like domain is broadly conserved, the overall architecture is clearly kinetoplastid-specific. Other proteins correspond to the human Ran GTPase-activating protein-binding protein 1 and 2 (Q13283 and Q9UN86) and contain an RNA recognition motif (RRM) in the C-terminal region absent from the kinetoplastid NTF2L proteins (supplemental Table S2, sheet 3). Phylogenetic reconstructions for NTF2L (Fig. 2) recovered two major clades, one containing only kinetoplastid NTF2L proteins and the other all other sequences, consistent with unique pathways for mRNA export (
To provide orthogonal evidence for the identified interactions between TcAPI5, TcFOP, and TcNTF2L with TcSub2 and TceIF4AIII, we raised antibodies to TcAPI5, TcFOP, and TcNTF2L. Western blots confirmed TcAPI5 and TcFOP as present in the TcSub2 isolates, while TcNTF2L was present in TceIF4AIII pullouts, confirming the assignments (supplemental Fig. S2).
EJC Proteins Interact With the T. cruzi mRNA Export Pathway
We previously described the T. cruzi ortholog of eIF4AIII (
), and phylogenetic tree indicated that TceIF4AIII grouped into the eIF4AIII clade (supplemental Fig. S3). eIF4AIII is a core component of the metazoan EJC and functions in multiple pathways of mRNA metabolism (
). Additional proteins related to the mammalian EJC core were identified here; TcMago is represented in the TceIF4AIII/TcSub2 cohort, while TcY14 was identified only in TcSub2 isolations. A protein containing a Barentsz/BTZ domain, named as TcBarentsz-like (TcBTZL), was also present among the TcSub2 interactors.
To confirm the interaction between TcMago and TcSub2/TceIF4AIII, we created T. cruzi parasites expressing TcMago::GFP. The fusion protein localized mainly in the nucleus (Fig. 3A), and the T. brucei ortholog tagged with HA or GFP displayed a nuclear localization (supplemental Fig. S4), as described in Tryp Tag database (http://tryptag.org). TcMago localizes with TcSub2 in the nucleus (Fig. 3, A and B), and Western blot analysis of TcMago::GFP immunoprecipitations (Fig. 3D) identified both TcSub2 and TceIF4AIII (Fig. 3E), robustly confirming that conserved core EJC proteins interact with components of the trypanosome mRNA export pathway.
Fig. 3Localization and protein interactions of TcMago.A, cellular localization of TcSub2 (Sub2) and TcMago tagged with eGFP (Mago) by confocal microscopy using anti-GFP antibodies. Nucleus is circled in white. DAPI = DNA stained with DAPI. MERGE = merged images. N = nucleus. K = kinetoplast. The kinetoplast DNA usually displays stronger DAPI signal than nucleus. B, graphs show the quantification of fluorescence intensity detected across the dotted line in merge images. DAPI in blue, TcMago::GFP in green and TcSub2 in red. C, TcMago::GFP pullout (Mago) was analyzed in silver-stained SDS–polyacrylamide gel. As a negative control, cryomilled cell extracts from wild-type (WT) parasites were treated in the same conditions. D, TcMago::GFP isolates were analyzed by Western blot using the following antibodies: mouse anti-GFP, mouse anti- TceIF4AIII, and rabbit anti-TcSub2 and detection occurred with IRDye whole IgG secondary antibody (LI-COR).
Kinetoplastid-Specific Proteins Are Associated With mRNA Export Pathways
We selected four proteins identified here and which are restricted to kinetoplastids for further characterization. TcFOP, TcAPI5, and TcHYP are also localized to the nucleus, like TcSub2 (Fig. 4, Aand B and supplemental Fig. S7A) while TcNTF2L is cytoplasmic, like TceIF4AIII (Fig. 4C). To investigate the physical interactions of these proteins, we created lines expressing TcFOP, TcAPI5, TcNTF2L, and TcHYP tagged with GFP (supplemental Figs. S5 and S7A). We confirmed that TcSub2 is present in immunoprecipitates from both TcFOP::GFP (Fig. 5B) and TcAPI5::GFP, while TceIF4AIII is indeed present in TcNTF2L::GFP pullouts (Fig. 6B). To ensure that the GFP tag was not affecting TcNTF2L location, we used antibodies to TcNTF2L and observed a similar cytoplasmic localization to TcNTF2L::GFP (supplemental Fig. S5E). Altogether, these data also provide robust support for the presence of kinetoplastid-specific proteins TcFOP, TcAPI5, TcNTF2L, and TcHYP in association with conserved components within trypanosomatid mRNA export pathways.
Fig. 4Localizations of TcFOP, TcAPI5, and TcNTF2L. Cellular localization of (A) TcSub2 (Sub2) and TcFOP (FOP); (B) TcSub2 and TcAPI5 (API5); (C) TceIF4AIII (eIF4AIII) and TcNTF2L::GFP (NTF2L). DAPI = DNA stained with DAPI. MERGE = merged images. N = nucleus. K = kinetoplast. Bar = 10 μm (A and B) and 7.5 μm (C). At right, graphs show the quantification of fluorescence intensity detected across the dotted line in merge images.
Fig. 5Interactions of TcFOP depend on the FOP domain.A, TcFOP (FOP) and mutated TcFOP (FOPΔC) tagged with GFP were pulled out and the copurified proteins were analyzed on silver-stained SDS-polyacrylamide gels. As a negative control, cryomilled cell extracts from wild-type (WT) parasites were treated in the same conditions. B, the interaction with TcSub2 (Sub2) and TceIF4AIII was analyzed by Western blot. Isolates were analyzed by Western blot using the following antibodies: mouse anti-TceIF4AIII, rabbit anti-TcSub2, or mouse anti-TcFOP and detection were carried out with IRDye whole IgG secondary antibody (LI-COR). C, relative quantification by densitometry of TcSub2 signal, normalized against FOP and ΔFOP signal. Data are expressed as means and standard deviation of three independent experiments.
Fig. 6Protein interaction of TcAPI5 and TcNTF2L.A, copurified proteins with TcAPI5 (API5) and TcNTF2L (NTF2L) were analyzed on silver-stained SDS–polyacrylamide gels. As a negative control, cryomilled cell extracts from wild-type (WT) parasites were treated in the same conditions. B, TcAPI5 and TcNTF2L isolates were analyzed by Western blot using the following antibodies: mouse anti-TceIF4AIII, rabbit anti-TcSub2, mouse anti-TcAPI5, or mouse anti-TcNTF2L. Detection was carried out with IRDye whole IgG secondary antibody (LI-COR).
The FOP domain of TcFOP is divergent from canonical domains (supplemental Fig. S6). In metazoa, FOP contains a conserved LDXXLDAYM UAP56-binding motif (UBM) that is duplicated in the C-terminal region (X = any amino acid) (
), but in the trypanosomatid FOP, the UBMs are separated by an RGG motif. We asked if these are functional for binding TcSub2, the trypanosome ortholog of the binding partner UAP56. Deletion of the C-terminal UBM from TcFOP (FOPΔC) altered interactions with TcSub2 (Fig. 5B), decreasing TcSub2 recovery in pullouts by ∼50%, indicating that despite divergence, the FOP domain does mediate interaction with TcSub2 (Fig. 5C).
A Superinteractome Connecting Splicing, Translation, and mRNA Export
Mex67, a major mRNA export factor, has been partially characterized in T. brucei (
) and the same conditions as the TceIF4AIII pullout (Fig. 7A), we confirmed interactions with several proteins including Mtr2 and Ran previously identified in T. brucei (
) (supplemental Table S5). Although TceIF4AIII itself was not detected, several proteins common to the TceIF4AIII and TcSub2 interactomes were found, including PABP2, RNA helicase Hel67, and DRBD2, together with TcFOP and TcNTF2L (Fig. 7B). This further supports the presence of kinetoplastid-specific proteins as core components of the trypanosome mRNA export machinery.
Fig. 7Interactions of TcMex67.A, proteins copurified with TcMex67 (Mex67) were analyzed on silver-stained SDS–polyacrylamide gels. As a negative control, cryomilled cell extracts from wild-type (WT) parasites were treated in the same conditions. B, TcMex67 isolates were analyzed using the following antibodies: rabbit anti-Protein A, mouse anti-TcNTF2L, mouse anti-TceIF4AIII, mouse anti-TcFOP and detection were carried out with IRDye whole IgG secondary antibody (LI-COR).
Analysis of the kinetoplastid-specific protein interactions (supplemental Table S6) confirmed the presence of several conserved components of the processing, export, quality control, and translation systems (Table 2). These include TcSub2, ALBA, DRBD2, NOP, Mago, TceIF4AIII, TceEF1a, and TcPABP2. Most proteins identified with TcFOP, TcAPI5, TcNTF2L, and TcHYP were also identified as TcSub2 and TceIF4AIII interactors. RNA granule proteins were also identified, including TcDhh1, TcSCD6, and TcPABP2, reinforcing connectivity between mRNA export and the cytoplasmic processing machinery. Further, we identified PUMILIO proteins, RNA-binding proteins involved in controlling gene expression (
). The intersection of the interacting proteins recovered with the six isolated proteins was processed using a Venn Diagram tool (https://bioinformatics.psb.ugent.be/cgi-bin/liste/Venn/calculate_venn.htpl), and those proteins recovered with three or more protein complexes plotted (Fig. 8). Overall, these data indicate an unexpected high level of connectivity between multiple aspects of mRNA processing and transport.
Fig. 8Summary of coincident interacting proteins with three or more isolated proteins. The six isolated proteins are represented in lines and interacting proteins are in columns. Black circles indicate the presence of the protein in the isolated complex.
Phenotypic Screening by RNAi in T. brucei of New mRNA Export Factors
We analyze the roles of TcFOP, TcAPI5, and TcNTF2L by silencing their T. brucei orthologs. Quantitative PCR confirmed the efficiency of RNAi against all targets (Fig. 9A). Knockdown of TbAPI5 and TbNTF2L affected proliferation, while knockdown of TbFOP had little proliferative impact (Fig. 9B). None of the proteins appear involved in global export of mRNAs since accumulation of mRNA in the nucleus by polyA FISH was not seen (Fig. 9C). Two different controls (without probe and RNAse treatment) confirmed the specificity of probe signal (supplemental Fig. S8). Additionally, we also analyzed TbHYP knockdown and obtained a similar phenotype to TbAPI5, TbFOP, and TbNTF2L with no obvious inhibition of global mRNA export (supplemental Fig. S7G). Hence, these new proteins may be mediating the processing of a subset of RNA molecules or that their functions are redundant.
Fig. 9TbFOP, TbAPI5, and TbNTF2L are nonessential proteins.A, real-time PCR analysis using noninduced (N) and induced parasites (I) for RNAi of TbFOP (FOP), TbAPI5 (API5), and TbNTF2L (NTF2L). The relative mRNA levels were quantified at 72 h postinduction and normalized by detection of 7SL RNA. B, growth curve of cell lines after RNAi induction for TbFOP, TbAPI5 and TbNTF2L. The density of cells in the culture was determined by triplicate counting in a particle counter (Beckman Coulter, Z Series Coulter Counter Cell). Both noninduced and induced cultures were diluted daily to 5 × 105 cells/ml for all RNAi inductions. C, cellular localization of mRNA by fluorescence in situ hybridization (FISH) in noninduced (Con) and induced (Ind) parasites. Digoxigenin-conjugated oligo(dT) was used as probe. DAPI = DNA stained with DAPI. N = nucleus. K = kinetoplast. Bar = 10 μm.
). The impact of TbSub2 knockdown on the transcriptome was restricted and given a supposed central role in mRNA maturation, surprising. Only seven genes demonstrated highly significant increases in abundance (Fig. 10), of which three correspond to 28S and 16S rRNAs (Fig. 11) and the remaining four are protein-encoding. The top protein hits Tb927.7.6960, Tb927.5.680, and Tb927.4.910, with the fourth (Tb927.9.7290) corresponding to a VSG pseudogene. Tb927.7.6960 is an E2 ubiquitin ligase with a Ubc2 domain, while Tb927.5.680 and Tb927.4.910 are hypothetical proteins, both kinetoplast-restricted; based on localization at TrypTag Tb927.5.680 is likely endolysosomal. Similarly, few genes were found to have decreased associated transcript levels, and there was no obvious functional or other feature uniting the cohort (supplemental Table S7). The possibility that TbSub2 is involved in rRNA processing is intriguing as it is associated with mRNA export and quality control in other organisms, but the presence of several nucleolar proteins in our interactome would be consistent with an expanded role or close associations between proteins of the mRNA and rRNA export pathways.
Fig. 10TbSub2 is involved in a small number of RNA processing events.A, Manhattan plots for all RBNAseq datasets mapped against the T. brucei TREU927 genome. Reads are indicated as the y-axis and chromosome numbers as the x-axis. Note that overall, there is little major variation in the overall transcriptional pattern. B, Volcano plot of the same dataset. Solid red line indicates a log fold change of zero, and the dotted red line a significance of 2.0, above which data are considered robust. C, ratio of fold change (linear) against rank order; only seven transcripts demonstrate a greater than fivefold increase. Inset data for Sub2, the knockdown target. D, plots for the seven most increased transcripts, where green dots are rRNA and blue correspond to protein-coding transcripts. Identities of the transcripts are indicated, and data are available in full as supplemental Table S7. A ratio of 1.0 indicates no change.
Fig. 1128S and 16S RNA levels dramatically increase on Sub2 silencing. Screengrabs from IG Viewer showing two regions of the 28S rRNA loci reads mapped to the T. brucei TREU927 genome. Top two systems represent induced samples in each panel, and lower three are unindexed control cells. Genes in bold are rRNA genes, and those in regular font are protein coding genes. Note that the protein coding genes are unchanged in terms of reads mapped. A, region of chromosome 3. B, region of chromosome 2. Annotation taken rom TritrypDB.
Messenger RNA maturation requires successful completion of multiple processes, including splicing, polyadenylation, 5′-cap modification, and export to the cytoplasm prior to translation. These steps ensure that only mRNA molecules encoding native proteins are translated, preventing accumulation of unfolded proteins and also supporting differential expression (
). NPC components, both at the nuclear basket and the cytoplasmic face, together with the Mex67/Mtr2 transport receptor and a vast repertoire of ATP-dependent RNA helicases are critically important. Many factors are widely conserved, but distinct features between lineages are clear. In Arabidopsis, several INT complex proteins have divergent architectures compared with their animal orthologs (
). Trypanosomes exhibit even greater divergence as orthologs of many mRNA processing proteins are undetectable in silico.
Established differences in mRNA processing between trypanosomatids and other eukaryotes include cotranscriptional nuclear export and polycistronic transcription, placing additional burdens on posttranscriptional systems. Additionally, trypanosomes have two PABP paralogs with distinct functions (
Inhibition of mRNA maturation in trypanosomes causes the formation of novel foci at the nuclear periphery containing cytoplasmic regulators of mRNA fate.
). Deferment of mRNA QC, at least in part to the cytoplasm, is in agreement with the absence of several proteins involved in animal/yeast mRNA maturation (
). Remarkably, using TcSub2 and TceIF4AIII as affinity handles, we uncovered an interactome encompassing chromatin-associated proteins through to NPG components and several Zn2+-finger RNA-binding proteins (Tables 1 and 2, Fig. 12). One-third of identifications were coincident between both TcSub2 and TceIF4AIII, and neither PABP1 nor any PABP1 interactor was found, indicating our data are robust and specific (
). Identification of components across the entire journey of mRNA processing suggests a highly integrated system.
Fig. 12Schematic representation of T. cruzi mRNA export machinery.A, using a cohort of six affinity handles, we identified a large interactome that illustrates multiple interactions as well as the presence of conserved and divergent proteins. Furthermore, these data validate the absence of many components that have failed to be identified in silico, including proteins of the TREX and TREX2 complexes. Cohorts are arranged as identified in the pullouts and colored by predicted function using the scheme as in Tables 1 and 2. Large boxes represent a single interactome, with the affinity handle indicated. Dotted lines between entries indicate a detected interaction. Many potential interactions and the TcMex67 isolation have been omitted for clarity. Conserved proteins include TcSub2, TceIF4AIII and multiple RNA binding proteins and EJC components. kinetoplastid-specific proteins include TcAPI5, TcFOP and TcNTF2L, and TcFOP and TcAPI5 interact directly with TcSub2. Core components of the EJC are directly associated with the mRNA machinery. TcNTF2L seems to be a shuttling protein that interacts with TceIF4AIII and the export receptor TcMex67. Connectivity data are also available as a cytoscape file for readers who wish to explore further. B, model for interactome and interactions with the NPC and export systems. Conserved proteins, such as TcSub2, TceIF4AIII, RNA binding proteins, components of the EJC, and kinetoplastid-specific proteins, such as TcAPI5, TcFOP, and TcNTF2L are shown. The nuclear proteins TcFOP and TcAPI5 interact directly with TcSub2. Core components of EJC are directly associated to mRNA machinery. TcNTF2L seems to be a shuttling protein that interacts with TceIF4AIII and the export receptor TcMex67.
There are several functional cohorts within this TcSub2/eIF4AIII interactome, including proteins associated with chromatin, the trans-splicing machinery, and the EJC. Only a single Mex67 paralog (Mex67B) was recovered, suggesting specificity between mRNA export interactors. With TcMex67 as the affinity handle, we identified Ran, Mtr2 and PABP2 as expected, in addition to the kinetoplastid-specific proteins TcFOP and TcNTF2L, themselves also within the TcSub2/eIF4AIII interactome. A further cohort, preferentially identified with TceIF4AIII is likely involved in rRNA processing and/or ribosomal biogenesis and suggests a close connection between mRNA and rRNA pathways. Significantly, neither TcSub2 nor TceIF4AIII retrieved proteins that would constitute a canonical TREX or EJC.
In metazoa, the EJC is deposited upstream of exon–exon boundaries during cis-splicing (
). This is further supported by identification of TSR1, TSR1IP, SR34, and additional splicing machinery proteins. Another SR protein, RRM1, is associated with the core spliceosome and modulates chromatin structure (
). Interestingly, we also observed RBSR2 that is associated with RRM1, RBSR1, and other RNA-binding proteins that were also identified, such as UBP-1, UBP-2, and ALBA3 (
). Additionally, DRBD18, identified in TcSub2 and TcMex67 isolates, is involved in mRNA export in association with Mex67/Mtr2 by regulating the mobilization of mRNPs through the nuclear pore complex (
). The trypanosome FOP domain is distinct from metazoan FOP but is required for interaction with TcSub2. TcFOP interacts with TcMex67, supporting a role in mRNA export. However, knockdown of TbFOP did not drastically affect mRNA export, although this is also noted for CHTOP protein, where mRNA export is only affected by a dual knockdown of CHTOP and partner ALYREF (
) and was not identified here by proteomics, TcFOP may have both divergent architecture and distinct mechanisms in trypanosomes. Knockdown of API5 also did not significantly impair mRNA export, despite interacting with mRNA export factors. Moreover, API5 interacts directly with Sub2 in mammalian cells and depletion results in accumulation of poly(A)+ RNA in the nucleus, which was not found here (
). Although TcNTF2L RNAi did not cause clear nuclear mRNA accumulation, this result cannot exclude the involvement of TcNTF2L in mRNA export pathway since the T. brucei ortholog copurifies with NPG proteins (
Inhibition of mRNA maturation in trypanosomes causes the formation of novel foci at the nuclear periphery containing cytoplasmic regulators of mRNA fate.
). These results and the very mild impact on mRNA abundance from Sub2 silencing may indicate a level of redundancy and/or additional mechanisms remaining to be uncovered.
Cytoplasmic proteins PABP2, DHH1, and multiple Zn-finger RNA helicases that associate with NPGs were identified (
Inhibition of mRNA maturation in trypanosomes causes the formation of novel foci at the nuclear periphery containing cytoplasmic regulators of mRNA fate.
), even though it also represses translation and significantly we identified TcSCD6 with both TcSub2 and TceIF4AIII.
In conclusion, we report both novel and conserved components of the mRNA export pathway in trypanosomes identified through immunoisolation and proteomics (Fig. 12). Among many interactions identified, we highlight the kinetoplastid-specific proteins TcFOP, TcAPI5, and TcHYP that interact directly with TcSub2, while TcNTF2L, a shuttling protein, interacts with the export receptor TcMex67. TcNTFL2 and TcFOP are kinetoplastid-specific but connect transcription/splicing components with the conserved export factor Mex67 and other components of mRNA export. We illustrated the comparison between major proteins described in metazoa and those identified in T. cruzi (Fig. 13). Overall, we propose that trypanosome mRNA export comprises a core of pan-eukaryotic proteins together with a significant number of proteins unique to kinetoplastids and a further example of the power of proteomics to uncover evolutionary divergent mechanisms.
Fig. 13Comparision between S. cerevisae and T. cruzi components of mRNA export coupled to transcription/splicing. The figure contains the major components of S. cerevisiae as described on Kohler and Hurt (2007) (
All mass spectrometry data have been deposited with the ProteomeXchange Consortium via the PRIDE partner repository (https://www.ebi.ac.uk/pride/) with the dataset identifier PXD028072. The TbSub2 RNAi transcriptome data have been submitted to SRA at NCBI with the accession number SUB10190938.
We thank the Program for Technological Development in Tools for Health–RPT FIOCRUZ for Technological Platforms of Microscopy, Flow Cytometry and Mass spectrometry, and the FingerPrints proteomics facility at the University of Dundee, supported by the “Wellcome Trust Technology Platform” award [097945/B/11/Z]. We would like to thank Wagner Nagib de Souza Birbeire for assistance in producing Figures 11B and 13 and James Abbott for assistance in producing Figure 11A.
Funding and additional information
This work was supported by the Wellcome Trust (204697/Z/16/Z to M. C. F.), the Medical Research Council (MR/N010558/1 to M. C. F.), Coordenação De Aperfeiçoamento De Pessoal De Nível Superior (CAPES to A. R. Á., A. H. I., P. F. D., P. M. H. and M. S.), Fundação Oswaldo Cruz (FIOCRUZ to A. R. Á.), and Conselho Nacional de Desenvolvimento Científico e Tecnológico (421990/2017-1 CNPq to A. R. Á.).
Author contributions
A. H. I., M. C. F., and A. R. Á. conceptualization; M. S., P. M. H., N. M. V., E. R. B., R. C. d. P., and A. L. data curation; M. C. F. and A. R. Á. funding acquisition; A. H. I. and P. F. D. investigation; A. H. I. and P. F. D. methodology; A. R. Á. project administration; P. M. H., R. C. d. P., and C. B. resources; C. B. and M. C. F. supervision; M. S., N. M. V., and A. L. validation; M. C. F. and A. R. Á. visualization; A. H. I. and P. F. D. writing—original draft; M. C. F. and A. R. Á. writing—review and editing.
First WHO Report on Neglected Tropical Diseases: Working to Overcome the Global Impact of Neglected Tropical Diseases. World Health Organization,
Geneva2010: 1-184
Translation regulation and RNA granule formation after heat shock of procyclic form Trypanosoma brucei: Many heat-induced mRNAs are also increased during differentiation to mammalian-infective forms.
Inhibition of mRNA maturation in trypanosomes causes the formation of novel foci at the nuclear periphery containing cytoplasmic regulators of mRNA fate.
Exponentially modified protein abundance index (emPAI) for estimation of absolute protein amount in proteomics by the number of sequenced peptides per protein.