Originally published In Press as doi:10.1074/mcp.M400114-MCP200 on November 16, 2005.
Molecular & Cellular Proteomics 5:510-522, 2006.
© 2006 by The American Society for Biochemistry and Molecular Biology, Inc.
Research
In Silico Identification of New Secretory Peptide Genes in Drosophila melanogaster*,S
Feng Liu
,
,
Geert Baggerman
,¶,||,
Wannes DHertog¶,
Peter Verleyen¶,**,
Liliane Schoofs¶ and
Geert Wets
From the ¶ Laboratory for Developmental Physiology, Genomics, and Proteomics, Katholieke Universiteit Leuven, Naamsestraat 59, B-3000 Leuven, Belgium and
Data Analysis and Modeling Group, Faculty of Applied Economics, Limburg University, University Campus-Building D, 3590 Diepenbeek, Belgium
 |
ABSTRACT
|
|---|
Bioactive peptides play critical roles in regulating most biological processes in animals. The elucidation of the amino acid sequence of these regulatory peptides is crucial for our understanding of animal physiology. Most of the (neuro)peptides currently known were identified by purification and subsequent amino acid sequencing. With the entire genome sequence of some animals now available, it has become possible to predict novel putative peptides. In this way, BLAST (Basic Local Alignment Searching Tool) analysis of the Drosophila melanogaster genome has allowed annotation of 36 secretory peptide genes so far. Peptide precursor genes are, however, poorly predicted by this algorithm, thus prompting an alternative approach described here. With the described searching program we scanned the Drosophila genome for predicted proteins with the structural hallmarks of neuropeptide precursors. As a result, 76 additional putative secretory peptide genes were predicted in addition to the 43 annotated ones. These putative (neuro)peptide genes contain conserved motifs reminiscent of known neuropeptides from other animal species. Peptides that display sequence similarities to the mammalian vasopressin, atrial natriuretic peptide, and prolactin precursors and the invertebrate peptides orcokinin, prothoracicotropic hormones, trypsin modulating oostatic factor, and Drosophila immune induced peptides (DIMs) among others were discovered. Our data hence provide further evidence that many neuropeptide genes were already present in the ancestor of Protostomia and Deuterostomia prior to their divergence. This bioinformatic study opens perspectives for the genome-wide analysis of peptide genes in other eukaryotic model organisms.
Peptides (neuropeptides) are integral to behavior, immune, and neuroendocrine systems, and they play critical roles in regulating most biological processes in metazoan species. Many dozens of peptides have already been discovered usually based on their biological activity. Although bioactive peptides are of considerable biological, medical, and industrial importance, so far a method is lacking for the systematic identification of all candidate bioactive peptides in an organism. Computational methods have become especially important since the advent of genome projects. By means of Basic Local Alignment Searching Tool (BLAST)1 analysis, the genome of an organism can be screened for peptide-coding genes based on sequence similarity to known peptide genes from other organisms. Using BLAST, 36 peptide genes have already been found in Drosophila melanogaster (14). Likewise in Anopheles gambiae, 35 peptide-encoding genes were discovered using the same sequence similarity-based mining approach (5). Although certain of these peptides have been studied in detail (611), more data on the entire peptidome of insects are needed for integrated functional analyses.
However, for in silico prediction of peptide precursor genes in large sequence datasets, the performance of the BLAST tool is limited because putative peptide sequences for which no orthologous biologically active peptide has been identified as yet (for instance, because of lack of suitable detection methods) will not be revealed.
BLAST programs are very suitable for scanning databases for conserved proteins. They are appropriate to find two kinds of sequence similarities in the following scenarios (12): 1) similarity is expected along the whole or most of the sequence and 2) local alignments. However, BLAST programs are far less efficient at finding similarity to short peptides when they are scanned against the whole genome sequence. Indeed in most cases, only a short conserved motif is responsible for the function of a particular peptide, and often only this short sequence motif, which can be 5 amino acids or less in length, is conserved. For instance, members of the invertebrate FMRFamide peptide family can share the carboxyl-terminal tetramer FMRFamide or the MRFamide tripeptide motif or RFamide motif. The detection of a conserved motif may yield valuable structural and functional insights, whereas the remainder of the peptide precursor sequence, i.e. outside the sequence of the actual bioactive peptide, may be essentially irrelevant.
If one or two of the sequences are long, for example in the case of two proteins, it is quite possible that they display very weak sequence similarities overall, but there are some different local alignments having a high score. However, it is difficult for BLAST to find a short alignment containing a motif with a high score within a long amino acid sequence. The local alignment with a high score may not include a specific motif and may only be a noise generated as the result of random mutation. At the same time, the real significant alignment that includes a motif may be masked by long irrelevant sequences because many peptide genes are between 50 and 500 amino acids in length, and in general only a small part of the precursor consists of the actual active peptide(s). In this study we describe a new in silico searching program that uses additional hallmarks of biological peptides and their precursors that are not used by current predictive algorithms.
 |
EXPERIMENTAL PROCEDURES
|
|---|
Rationale
First we examined the structural hallmarks of peptide precursor proteins. Regulatory peptides are synthesized as part of larger precursor proteins that are subsequently processed into smaller active substances. All peptide precursors contain a signal peptide that directs them into the secretory pathway of the cell. After cleavage of the signal peptide, further processing by endoproteases occurs predominantly at basic processing sites, typically mono- and dibasic amino acid residues (13). Many peptide precursors encode multiple bioactive peptides that are often highly related, for example the tachykinin precursor, the allatostatin precursors, and the neuropeptide F precursor in Drosophila and other insects (1418). However, in Drosophila as in other insects, peptide genes encoding multiple, unrelated bioactive peptides or genes encoding just a single bioactive peptide also occur (19). Based on the common structural characteristics of known invertebrate peptide precursors we built a sensitive searching procedure to identify peptide genes in the Drosophila database.
Two types of programs were constructed. The first one was built to find those peptide precursors that encode multiple highly related peptides. The second one searches for precursors containing a single peptide or multiple unrelated peptides that share conserved motifs with known peptide precursor proteins from other animal species. In both cases, the putative peptide sequence was defined by the presence of characteristic proteolytic cleavage sites flanking the peptide sequences.
To avoid the shortcomings of BLAST programs in searching long sequences for short similarity (see the Introduction), we split the protein sequences from D. melanogaster into short subsequences and then applied BLAST to compare the subsequences within each protein sequence to fish for those precursor sequences that have at least two similar subsequences. Additionally the subsequences were also compared with subsequences derived from known peptide precursors in the Swiss-Prot database. These subsequences were obtained by splitting the known peptide precursors from other metazoan organisms obtained from the Swiss-Prot database. Because each Drosophila protein sequence was split into a number of subsequences and because all of these subsequences were subsequently compared with all known peptide precursor subsequences, a very large number of alignments with a high score were obtained. Because similarity does not imply homology, only the alignments with sequence motifs from actual bioactive peptides were considered significant, and the obtained subsequences were considered as possible peptides. A FASTA protein database containing all identified putative peptide precursors was constructed. This database was loaded on an in-house Mascot server and used for the identification of peptides in a peptidomic analysis of Drosophila hemolymph. Our results showed that this technique is very efficient to find novel peptide genes.
The Program
The aim of the program is to mine for putative peptide precursors according to the rules and the techniques described above. The program is implemented in SAS, a powerful integrated software to access, manage, analyze, and present data. External tools such as SignalP and BLAST need to be run independently. They communicate with the program by text files. The program includes a few subprograms listed below.
Protein.SAS
The first part of the program, named Protein.SAS, serves to pick up all proteins from a specific species, in this case D. melanogaster. The input of the subprogram consists of the Swiss-Prot protein database files and additional Drosophila genes at GenBankTM identified by Hild et al. (20). The relevant information for each of the Drosophila proteins, such as accession number, protein name, gene name, protein sequence, signal peptide information, length, and mass, is written into an SAS dataset. The first 70 amino acids of every protein sequence serve as output to a text file in FASTA format, which is used as the input of SignalP. SignalP (www.cbs.dtu.dk/services/SignalP) for eukaryotes is then run to predict the presence and location of a signal peptide in each protein sequence. Next the subprogram reads the output file by SignalP, and another SAS dataset is created that includes the predicted signal information of every Drosophila protein. The dataset is compared with the dataset of all Drosophila proteins, and the proteins are retained if they are either annotated to have signal peptides in the Swiss-Prot protein database files or predicted to have signal peptides by SignalP. The comparison result is a dataset of Drosophila proteins having amino-terminal signal peptides. From this dataset, only the proteins that are less than 500 amino acids in length are retained because previous analysis has shown that all known secretory peptide precursors are shorter than 500 amino acids (11). In total, 5096 proteins made up the final Drosophila protein dataset, which was analyzed further. The logic of Protein.SAS is illustrated in Fig. 1.

View larger version (25K):
[in this window]
[in a new window]
|
FIG. 1. Construction of database containing all Drosophila proteins that are less than 500 residues in length and that have a signal peptide.
|
|
Peptide.SAS
The subprogram filters out from Swiss-Prot protein databases all the peptides or their precursors that are known in Metazoa today. The peptide precursors are identified by the keywords in each protein data file. If a protein is picked up, its relevant information, like the information collected in Drosophila proteins, is written into an SAS dataset. Fig. 2 describes the process.
Cleavage.SAS
The objective of the subprogram is to split protein sequences into subsequences after removal of the signal peptide sequence. The protein sequences are split in silico at cleavage sites typical for peptide precursors. Conventional amino acid motifs that are required for cleavage of neuropeptides from their protein precursors in insects have been described as GKR, GRR, GR, GRK, GKK, KR, RR, GK, RK, KK, and Arg (21). From our statistical analyses on all known peptide precursors in all organisms (data not shown), it is clear that the processing of peptide precursors does not occur at every conventional cleavage site in the precursor. Cleavage also depends on the amino acids that are at the proximity of the cleavage site. For example, proteolytic processing at GKK followed by Arg always occurs. However, if GKK is followed by Ala, Asn, Ser, or Lys, the processing may or may not occur. For other amino acids at this position, it has not been demonstrated whether processing occurs. As a second example, proteolytic processing at a single Arg residue only occurs when there is a basic amino acid residue in position 4, 6, or 8 with respect to the single Arg. The basic amino acid is usually an Arg, but Lys or His residues work as well (19).
BLAST Analysis
The output of Cleavage.SAS consists of two database files "Drosophila subsequence" and "peptide." BLAST analysis is then conducted on these two databases. The score matrix "PAM30" is used, and the expectation value (e-value) as well as the parameter "word size" are set to 6 and 2, respectively, to find short but strong similarities. Fig. 3 explains the process.

View larger version (36K):
[in this window]
[in a new window]
|
FIG. 3. Cleavage of protein sequences in the two datasets and BLAST analysis of the obtained subsequences.
|
|
Extract.SAS, Shift.SAS, and Motif.SAS
These programs are used to screen the result output by BLAST and determine the biologically significant matches. The subprogram Extract.SAS extracts the Drosophila proteins that have at least two similar subsequences within the protein. The subprogram Shift.SAS reads the comparison result from the BLAST analysis and computes the shift value. The shift value is the minimal distance between the amino or carboxyl terminus of a subsequence and the matching amino acids in the subsequence. From the statistical analysis of the known peptide precursors, these shift values should be low. This means that the motif should be close to a cleavage site. The shift value is set to be no larger than 3 in the program. The subprogram Motif.SAS reads the comparison results between Drosophila subsequences and peptide subsequences as well as the comparison results among peptide subsequences and identifies the Drosophila subsequences that contain motifs.
TMpred and SOSUI
Finally on-line software TMpred at www.ch.embnet.org/software/TMPRED_form.html (22) and SOSUI sosui.proteome.bio.tuat.ac.jp/sosuimenu.html are used to determine whether a protein has a single transmembrane region at its amino terminus (Fig. 4). The minimum and maximum length of the hydrophobic part of the transmembrane region was set at 17 and 33 residues, respectively. For the TMpred program a score above 500 for both inside to outside as well as outside to inside helixes was considered to be significant for the presence of the amino-terminal transmembrane region. A score of 250 was considered to be significant for the presence of an inside to outside helix of any second or third transmembrane region. A putative peptide precursor was retained if one program predicts a single transmembrane region at the amino terminus. When both programs predict the absence of an amino-terminal transmembrane region, the protein was deleted from the list.
Mass Spectrometry
D. melanogaster were reared in 250-ml bottles containing 70 ml of water, 17 g of sucrose, 0.45 g of yeast, 0.9 g of agar, 0.5 ml of 8% Nipagin, and 0.36 ml of propionic acid. The bacteria species Micrococcus luteus and Escherichia coli were precultured in TSB medium. They were used as typical organisms for infection with Gram-positive and Gram-negative bacteria, respectively. Pellets taken when the cultures were in the log phase of growth were resuspended in a small amount of culture medium. Septic injury was performed by pricking third instar larvae of D. melanogaster with a fine insect needle dipped in a mixed solution of M. luteus and E. coli. Then larvae were incubated on tissue saturated with a 5% sucrose solution. After 12 h, the hemolymph was collected using microcapillaries under a binocular microscope. In the control experiment, third instar larvae were collected and incubated for 12 h on the 5% sucrose solution before hemolymph was taken.
Sample Preparation
For each experiment, 25 µl of hemolymph were collected in 100 µl of extraction solution of methanol/water/formic acid (90:9:1, v/v/v). The sample was then sonicated for 15 min. All cell debris and large proteins were removed by centrifugation. The supernatant was dried and resuspended in 100 µl of MilliQ water, 0.1% TFA. The lipids in the sample were removed by adding 100 µl of ethyl acetate to the sample and then vortexing it. After centrifugation, the ethyl acetate with the dissolved lipids was separated from the aqueous solution with the peptides. This solution was dried and resuspended in 15 µl of MilliQ water/acetonitrile/formic acid (94.9:5:0.1) and next filtered through a Millipore spin down filter. As control experiments, hemolymph of non-infected larvae was collected and treated the same way as the test samples.
Capillary LC-tandem MS experiments were conducted using an Ultimate HPLC pump, a column-switching device (Switchos), and a Famos autosampler (all LC Packings) coupled to a Q-TOF mass spectrometer (Micromass). Chromatography was performed using a guard column (µ-guard column MGU-30 C18, LC Packings) acting as a reverse phase support to trap the peptides. Ten microliters of the sample were loaded on the precolumn with an isocratic flow of 2% acetonitrile in MilliQ water with 0.1% formic acid at a flow rate of 10 µl/min. After 2 min, the column-switching valve was switched, placing the precolumn on line with the analytical capillary column, a Pepmap C18, 3-µm, 75-µm x 150-mm nanocolumn (LC Packings). Separation was conducted using a linear gradient from 95% solvent A, 5% solvent B to 80% solvent A, 20% solvent B in 90 min followed by a linear gradient from 80% solvent A, 20% solvent B to 50% solvent A, 50% solvent B in 60 min (solvent A: water, formic acid (99.9:0.1, v/v); solvent B: acetonitrile, formic acid (99.9:0.1, v/v)). The flow rate was set at 150 nl/min.
The LC system was connected in series to the electrospray interface of the Q-TOF device. The column eluent was directed through a stainless steel emitter (Proteon). Needle voltage was set at 1650 V, and cone voltage was set at 35 V. Nitrogen was used as nebulizing gas. Parent ions with two, three, or four charges of sufficient ion intensity (threshold was set at 15 counts/s) were automatically recognized by the charge state recognition software (MassLynx 3.5, Micromass) and selected for fragmentation as they eluted from the column. Argon was used as a collision gas; collision energy was set at 2540 eV depending on the mass and charge state of the selected ion. The detection window in the survey scan was set from 400 to 1400 mass to charge (m/z). Fragmentation spectra were acquired from m/z 50 to 2000.
 |
RESULTS
|
|---|
Construction of Two Databases
First we constructed two databases. The first database was generated as follows. Drosophila proteins that are less than 500 amino acids in length and that start with a signal peptide were assembled from Swiss-Prot and TrEMBL databases as well as from a collection of additional Drosophila genes identified by Hild et al. (20). The program SignalP for eukaryotes was used to predict the occurrence of a signal peptide for a protein sequence (23). As a result, 5096 Drosophila protein sequences were retained. Then all these protein sequences were split into short subsequences at the conventional cleavage sites, taking into account the nature of the amino acids in the proximity of each cleavage site. These subsequences formed the first database, which we named Drosophila subsequence.
The second database is a peptide database that comprises all known peptide precursor subsequences from all metazoan organisms. These annotated peptides or peptide precursors were filtered from the Swiss-Prot (release 42.11) and TrEMBL (release 25.11) databases as follows. A protein was retained when it is annotated as (neuro)peptide precursor or when its name contains the word "neuropeptide." Proteins of which the corresponding protein file contains keywords such as peptide, neuropeptide, hormone, or neurotransmitter were also retained. But if these proteins have a subcellular location as membrane protein (as indicated in the protein file) or if they are characterized by key words such as receptor, signal-anchor, transmembrane, binding protein, DNA binding, nuclear protein, nuclear transport, or enzyme or words ending in "ase," they were excluded. In total, 2858 proteins met these criteria. These peptide precursors were subsequently split into short subsequences at the conventional cleavage sites, also taking into account the character of the amino acids in the proximity of each cleavage site. This collection of peptide precursor subsequences constituted the peptide database.
Setup of Data Mining Analysis
Stand-alone BLAST was used to compare the two above mentioned databases. Interpretation of the results generated by BLAST involved evaluation of the matches to determine whether they are significant. Therefore, genuine and biologically meaningful similarities need to be distinguished from the irrelevant and essentially random ones. If the alignment was similar to a motif, it was considered significant, and the subsequence was considered a putative peptide. To find the conserved motifs, all known peptide precursor subsequences were compared by BLAST.
Four types of analysis were performed. 1) The Drosophila subsequence database was compared with itself, and those protein sequences that have at least two similar subsequences within the same protein sequence were retained (first screening method). 2) The peptide precursor subsequences in the peptide database were compared with each other, and the obtained similar amino acid sequence tags were considered as possible motifs. 3) The Drosophila subsequence database was compared with the peptide database, and those Drosophila subsequences that displayed sequence similarities to such a conserved motif were retained (second screening method). 4) The retained proteins from 1 and 3 were then analyzed by a transmembrane region prediction method. Our analysis of the annotated Drosophila neuropeptide precursors indicates that almost all have a single transmembrane region, which is located at the amino terminus and which corresponds to the signal peptide. Therefore, in a final step we fine tuned the generated list of putative peptide precursors based on this hallmark. The list was curated by the deletion of (i) all soluble proteins (lacking membrane-spanning regions), (ii) proteins having more than one transmembrane region, and (iii) proteins having one transmembrane region that is not located in the amino-terminal region (the cut of the start of the transmembrane region was set at the 20th residue; transmembrane regions that started at or after this point were not considered to be at the amino-terminal side).
Screening Method 1
The first screening method was based on the principle that multiple peptides encoded by a single invertebrate peptide precursor gene often are highly related. Therefore, proteins were only selected if they had at least two similar subsequences and if the matching amino acid sequence was at or close to the amino or carboxyl terminus of at least one subsequence.
Therefore, the structural pattern of a putative peptide precursor is: . . . . [cleavage1]-X1(3, 60)-[cleavage2]- . . . . [cleavage3]-X2(3, 60)-[cleavage4] . . . . where X1(3,60) and X2(3,60) are two similar subsequences that are between 3 and 60 amino acids long. [cleavage1(4)] can be any conventional cleavage site. The subsequences do not need to be adjacent within the precursor.
Using this screening method we found 58 peptide precursors in Drosophila, 10 of which are well known peptide precursor genes that encode at least two related bioactive peptides, drosulfakinin (dsk), FMRFamide, short neuropeptide F, tachykinin, capa or mt-cap2b, and diuretic hormone.
For example, the protein identified by Swiss-Prot/TrEMBL accession number Q9V808 is a putative peptide precursor (Fig. 5). By comparing the Drosophila subsequence database with itself, we obtained three similar subsequences in Q9V808. The number following the accession number represents a different subsequence within a protein sequence by its position. The results of this screening method is depicted in Supplemental Table 1.

View larger version (32K):
[in this window]
[in a new window]
|
FIG. 5. Screening method 1. The protein identified by accession number Q9V808 was retained as a putative peptide precursor. By comparing the Drosophila subsequence database with itself, we obtained three similar subsequences in Q9V808 (underlined): Q9V808_6 (IPYEVKVDVPQPYIVE), Q9V808_8 (IPYEVKVPVDKPYEVKVPVPQPYEVI), and Q9V808_9 (IPYEVKVPVPQPYEVI). The number following the accession number represents a different subsequence within a protein sequence by its position.
|
|
View this table:
[in this window]
[in a new window]
|
TABLE I Summary of the peptides identified from the hemolymph of Drosophila larvae using the predicted peptide precursor database (NP-DB) and the NCBInr database
Identifications for which the score was not high enough for significant identification are marked with "ns" (not significant) (individual ion scores are lower than the value needed for identity or extensive homology (p < 0.05). pQ, pyroglutamic acid.
|
|
Screening Method 2
The fact that only nine of the 44 known neuropeptide precursors as well as one immune induced peptide in Drosophila were listed by the first screening method indicated that the catalogue of putative regulatory peptide precursors obtained by the first screening method is doubtless incomplete. Therefore, we carried out a second screening method that screens for Drosophila proteins having a signal peptide and of which at least one subsequence has at least 3/5 amino acids at or close to the amino or carboxyl terminus identical to a known peptide. In addition, the identical 3/5 amino acids should be similar to a motif. The retained proteins were then further filtered by the transmembrane prediction analysis as in the first method.
By means of the second method we found 70 Drosophila peptide precursor genes in total, 42 of which are known peptide precursors and 28 of which are novel. Each of these putative peptide precursor genes encodes multiple non-related peptides or only a single putative peptide. For example, protein Q8MS86 was identified as a putative peptide precursor (Fig. 6). The similar subsequence is Q8MS86_2 (WKILTAGSHFRWL). The similar known peptides are P11885_2 (YVMSHFRWNKF) from Rana catesbeiana and P06298_8 (NGNYRMHHFRWGSPPKD) from Xenopus laevis.

View larger version (42K):
[in this window]
[in a new window]
|
FIG. 6. Screening method 2. Q8MS86 was identified as a putative peptide precursor. The subsequence iQ8MS86_2 (WKILTAGSHFRWL) is similar to the subsequences P11885_2 (YVMSHFRWNKF) and P06298_8 (NGNYRMHHFRWGSPPKD) derived from the corticotropin-lipotropin precursors of R. catesbeiana and X. laevis, respectively.
|
|
The total output of this screening method is shown in Supplemental Table 2. The combined computational methods generated in total 75 novel putative peptide precursors in D. melanogaster in addition to the 43 known ones.
View this table:
[in this window]
[in a new window]
|
TABLE II Comparison of -MSH sequences of different metazoan species with the putative -MSH-like sequence in D. melanogaster
|
|
Peptidomic Analysis
The nano-LC-tandem MS method allowed us to select and fragment the peptide ions as they elute from the column even when co-eluting with other peptides. Peptides were identified by subjecting their fragmentation spectra to a Mascot search on an in-house server. This bioinformatic tool (www.matrixscience.com) allows the identification of proteins and peptides by matching MS/MS data against any FASTA format protein or (translated) nucleic acid sequence database. In a typical MS/MS ion search, we combined all MS/MS data of every peptide selected for fragmentation during a LC-MS run in a comprehensive peak list. This type of file contains the centroided mass values and associated intensity values of all the parent ions selected and corresponding fragmentation peaks and can be submitted to Mascot for identification of the peptides. Settings used were as follows: variable modifications; carboxyl-terminal amidation, oxidation of methionine, and pyro-Glu (N-terminal Glu) were selected; enzyme was set to "none"; peptide mass error tolerance was set to 0.6 Da; and MS/MS mass error tolerance was set at 0.3 Da.
In total more than 300 ions were automatically selected for fragmentation. Twenty peptides were identified, most of which are known to occur in the hemolymph such as the Attacins, defensins, and Drosophila immune induced peptides (DIMs) (24). The identified peptides and their Mascot scores are summarized in Table I. In an earlier study these peptides could already be identified in the hemolymph after immune challenge (24). In addition however, we identified two new peptides, LDDSENNDQVVGLLDVADQGANHANDGAREA and a truncated form of this peptide, LLDVADQGANHANDGAREA (Fig. 7). These peptides originate from protein CG7738 that was identified as a putative peptide precursor by screening method 1 (Fig. 8).

View larger version (34K):
[in this window]
[in a new window]
|
FIG. 7. Fragmentation spectra and annotation of two peptides identified in the Drosophila hemolymph. A, LDDSENNDQVVGLLDVADQGANHANDGAREA. B, LLDVADQGANHANDGAREA Both peptides originate from the CG7738 that was picked up by screening method 1. The most important fragment ions (mainly b- and y-type ions) are indicated on the spectra.
|
|

View larger version (19K):
[in this window]
[in a new window]
|
FIG. 8. Amino acid sequence of CG7738. The peptides that were identified in the hemolymph are indicated in bold. The signal peptide predicted by SignalP is underlined.
|
|
For comparison, the same data file was used in a Mascot search on a larger database. The database used was the National Center for Biotechnology non-redundant (NCBInr) database, Drosophila was selected for taxonomy, and other settings were kept identical to those used for the restricted database. Results of both searches are summarized in Table I. Peptides identified with the NCBInr database are essentially the same as those identified with our restricted database. No additional peptides could be identified with the full NCBInr database.
 |
DISCUSSION
|
|---|
Because of the availability of its complete genome sequence, Drosophila becomes a model insect for peptide research. We identified in total 118 putative secretory peptide precursor genes in D. melanogaster by applying the database searching programs presented here. 43 of them are annotated peptide precursors. All predicted peptide precursors met following criteria. (i) Each putative peptide precursor is less than 500 amino acids in length and has a signal peptide. (ii) Each precursor contains one or several putative peptides that are flanked by conventional cleavage sites. There are two possibilities: the precursor contains two or more peptides that share sequence similarities, or alternatively the precursor contains a single peptide that shares conserved motifs with known peptide precursor subsequences from other organisms. (iii) All predicted peptide precursors have one amino-terminal transmembrane region. Several of the genes mined by our method encode peptides that display significant sequence similarities to known vertebrate or invertebrate neuropeptides. These similarities could not be discovered by BLAST scanning of the whole Drosophila genome. For instance, a putative peptide encoded by CG3868, mined by the first method, displays sequence similarities with an antifreeze glycopeptide precursor identified in Antarctic fish (25). The salivary gland glue protein (CG18087) contains a putative peptide sequence that displays significant similarities to vertebrate neurophysins. Neurophysins are a group of small, soluble proteins secreted by the hypothalamus. They serve as binding proteins for oxytocin and vasopressin during their transport to the posterior pituitary. They are secreted with the hormones but have no known functions other than serving as a carrier. In vertebrates neurophysins originate from the vasopressin precursor. The salivary gland glue protein (CG18087) does not contain a vasopressin/oxytocin-like peptide.
Putative peptides from two genes, CG9358 and BK003312, display sequence similarities to conserved parts of the prolactin precursor. Prolactin and growth hormone are two distinct neuropeptide hormones that have been found in all vertebrate groups but not in cyclostomes (26), although prolactinergic neurons that were detected immunochemically occur in a protochordate (27). The growth hormone/prolactin superfamily is likely to have a prevertebrate origin, but a putative invertebrate member has not been found yet in contrast to other neuropeptide superfamilies that are highly conserved in vertebrates and invertebrates. Examples are tachykinins, gastrin, insulin, neuropeptide Y, corticotropin-releasing factor, and calcitonin gene-related peptide (28).
Drosophila BK002187 encodes a peptide with sequence similarities to atrial natriuretic peptide. Natriuretic peptides are vertebrate hormones that play a pivotal role in cardiovascular and body fluid homeostasis in vertebrates (29). Although a novel natriuretic peptide has been found recently in the heart and brain of the hagfish, the most primitive vertebrate (30), no member of this family has been described yet in invertebrates. Finally a peptide encoded by the Drosophila LP04693 displays sequence similarities to
-MSH, a pituitary hormone derived from the pro-opiomelanocortin precursor, the function of which has remained elusive (31) (Table II).
When we consider similarities to invertebrate neuropeptides, the CG1565 protein contains a putative peptide that has an amino-terminal hexamer contained within orcokinin, a myotropic neuropeptide discovered in crustaceans (32, 33) and very recently for the first time in insects (34, 35) (Table III). A putative peptide sequence within the Trunk protein precursor displays striking similarities with prothoracicotropic hormone, a neuropeptide that has so far only been identified in lepidopteran species in which it stimulates ecdysone biosynthesis in the prothoracic glands (36). Interestingly the sex-specific gene male specific opa containing gene (MSOPA), as identified by Jin et al. (37), encodes a putative peptide that shows sequence similarities to a male accessory gland-specific 57-kDa peptide precursor. Next the putative Drosophila peptide encoded by CG8087 displays more than 60% sequence identities with a neuropeptide derived from a neurospecific peptide precursor in the terrestrial snail Helix lucorum (38). Finally some mined genes (CG16882, CG11131, CG7465, CG1221, Argos, and Trunk) have been predicted or shown to encode for ligands of membrane receptors, such as epidermal growth factor, Toll, or Torso receptors (39), a function in line with the peptidergic nature of their products.
View this table:
[in this window]
[in a new window]
|
TABLE III Comparison of orcokinin sequences of different invertebrate species to the predicted orcokinin in D. melanogaster
|
|
Since the publication of the Drosophila genome sequence, several microarray studies have been performed, and we observed that some of the mined peptide precursor genes are up-regulated by ecdysone (CG7608, CG1807, and CG 7350) (40, 41). Ecdysone is an ecdysteroid involved in insect metamorphosis and reproduction. It is the precursor of 20-OH-ecdysone, the functional counterpart of vertebrate estrogen (42). In this way, our data are in accordance with the reported interactions of peptide and steroid hormone signaling cascades in vertebrates (43, 44). Other mined genes are up-regulated after infection (45) and encode for peptides that are secreted into the hemolymph such as Attacin, diptericin, drosocin, and various DIMs (24). With the currently established program, several additional putative peptide precursor genes display sequence similarities to known DIMs but have yet not been annotated as such. Three of them (CG32851, CG5791, and CG15065) form part of the Toll pathway (46), and one (CG18107) is rhythmically expressed in the head (47) (Table IV).
Former microarray studies revealed that regulation of transcription of known neuropeptide genes as well as other putative peptide precursor genes established in this study is circadian clock-dependent (i.e. capa, corazonin, CG4784, and CG1807) (47, 48), nutrient-dependent (CG10918 and CG15225) (49), or sex-specific (accessory gland peptides, CG7738, and CG11458) (37, 50).
Our program also picked up the drosocrystallin gene (51) as well as other annotated cuticular proteins. In Tenebrio molitor, biologically active peptides display strong sequence similarities to parts of cuticle proteins, and therefore they might be processed from them (52). Given the fact that proteolytic processing does not always occur at every conventional cleavage site (53), our established catalogue of predicted peptide precursors is doubtless incomplete, and it will be a difficult challenge to consider the existence of these unconventional cleavage sites in the further refinement of our method.
Only two of the characterized peptide precursors were not mined by our method, i.e. the diuretic hormone precursor or CG8348 because it has four transmembrane regions and the proctolin precursor "Q8MMJ7" because its sequence is too short (5 amino acids) to be filtered by the program. Only a few cases could be false positives: CG5559 has been annotated to encode a conserved protein involved in synaptic vesicle fusion, CG6409 has been predicted to be a component of the endoplasmic reticulum, CG11577 has been predicted to permanently reside in the lumen based on its carboxyl-terminal sequence (54), and CG6357 encodes a putative cysteine protease.
The database of predicted and known peptide precursors in Drosophila as established in this study will serve several applications in experimental research. Mass spectrometric data will become much easier to read and interpret if the database against which they are scanned is much narrower than the Swiss-Prot database.
In insects, as in all larger organisms the hemolymph is an important part of the defense mechanism against infections (24, 55). Many of the peptides identified so far in the hemolymph are involved in the response of the fly to an infection (24), and most of these peptides are released into the hemolymph after infection. In addition all peptides (and proteins) presented in the hemolymph are secreted by other tissues or by blood cells. Therefore we chose to perform a peptidomic analysis on the hemolymph of fruit fly larvae after infection with bacterial material.
The peptidomic analysis of the Drosophila hemolymph resulted in the identification of 22 peptides; all of these peptides originate from peptide precursor genes known to be involved in the immune response of the fruit fly. In addition we were able to identify two novel peptides originating from the CG7738. The first peptide, LDDSENNDQVVGLLDVADQGANHANDGAREA, is 31 amino acids in length and is flanked at the amino-terminal side by the cleavage site of the signal peptide (Fig. 8). At the carboxyl terminus the peptide is flanked by an arginine residue that could act as a monobasic cleavage site. The second peptide, LLDVADQGANHANDGAREA, is a truncated homologue of the first one.
For comparison, the same data file was used in a Mascot search both on the restricted peptide precursor database and on the NCBInr database (taxonomy, Drosophila). The peptides identified with the NCBInr database are essentially the same as those identified with our restricted database. No additional peptides could be identified with the full NCBInr database.
However, the threshold scores for a significant identification increases with the size of the database used (larger number of possible peptides). The minimum score for a significance threshold of 0.05 was 27 for the restricted peptide precursor database and 56 for the full NCBInr (Drosophila) database. Therefore, a large number of peptides (13 of 22) that had a significant score with the restricted database fell below the significance threshold with the full NCBInr database. This includes the larger form of the identified novel peptide; the truncated peptides fell within the set threshold limits of both searches.
Similar to most proteomic identification tools, Mascot is designed to identify a protein from the MS/MS spectra of several individual peptides cleaved from the same protein. The protein score in a peptide summary is derived from the ion scores of the individual peptides. For a search that contains a small number of queries, the protein score is the sum of the unique ion scores. Many (neuro)peptide precursors give rise to only one or a very limited number of bioactive peptides, and because a peptidomic experiment focuses on the peptides themselves rather than on the complete precursor, only the individual scores of the peptides can be taken into account for an identification. In addition, because the processing of the peptide from the precursor is unknown, no cleavage enzyme can be selected for identification. All these features of naturally occurring peptides are detrimental for easy identification in an automated fashion. Our study demonstrates the improved success rate of identification of a secretory peptide if one is able to use a restricted database.
The lack of identification of more predicted peptides may be due to several reasons. First, their concentration in the hemolymph may be below the sensitivity of the instrumental setup we used. Second, not all peptides are extracted or ionize with the same efficiency. Consequently instrument sensitivity varies between peptides depending on their amino acid composition. Third, in contrast to peptides obtained after tryptic digestion of a protein, naturally occurring peptides do not necessarily have a basic amino acid at their carboxyl terminus and will not necessarily yield nice series of y or b ions upon fragmentation. As a consequence many natural peptides yield fragmentation spectra that are hard to interpret or are of insufficient quality for Mascot to allow identification with a sufficiently high score. Fourth, these peptides were predicted from the genomic database, which is no guarantee that they are present in the hemolymph. Fifth, if they are released into the hemolymph, their secretion and/or synthesis might be dependent on the physiological condition of the animal. Further analysis of other tissues in different physiological conditions will, without doubt, result in the identification of more of the predicted peptides.
Also in mammalian models, genome-wide analysis of peptides by mass spectrometry has increased recently (56). Construction of a peptide database, like the one presented here for Drosophila, will be of high value to support these studies. As the structural hallmarks of peptide precursor sequences are highly conserved across phyla, we foresee that the established search program can be adapted for the genome-wide analysis for peptide precursor genes in other animal model systems that have a sequenced genome (24, 32, 34, 35, 5766).
 |
FOOTNOTES |
|---|
Received, August 27, 2004, and in revised form, September 20, 2005.
Published, MCP Papers in Press, November 16, 2005, DOI 10.1074/mcp.M400114-MCP200
1 The abbreviations used are: BLAST, Basic Local Alignment Searching Tool; DIM, Drosophila immune induced peptide; MSH, melanocyte-stimulating hormone. 
* This work was supported by Fonds voor Wetenschappelijk Onderzoek Vlaanderen (FWO) Grant G0146.03. The costs of publication of this article were defrayed in part by the payment of page charges. The article must therefore be hereby marked "advertisement" in accordance with 18 U.S.C. Section 1734 solely to indicate this fact. 
S The on-line version of this article (available at http://www.mcponline.org) contains supplemental material. 
Both authors contributed equally to this work. 
** Postdoctoral fellow of the FWO. 
|| Postdoctoral fellow of the FWO. To whom correspondence should be addressed. E-mail: geert.baggerman{at}bio.kuleuven.ac.be
 |
REFERENCES
|
|---|
- Vanden Broeck, J.
(2001) Neuropeptides and their precursors in the fruitfly, Drosophila melanogaster.
Peptides
22, 241
254[CrossRef][Medline]
- Hewes, R. S., and Taghert, P. H.
(2001) Neuropeptides and neuropeptide receptors in the Drosophila melanogaster genome.
Genome Res.
11, 1126
1142[Abstract/Free Full Text]
- De Loof, A., Baggerman, G., Breuer, M., Claeys, I., Cerstiaens, A., Clynen, E., Janssen, T., Schoofs, L., and Vanden Broeck, J.
(2001) Gonadotropins in insects: an overview.
Arch. Insect Biochem. Physiol.
47, 129
138[CrossRef][Medline]
- Brogiolo, W., Stocker, H., Ikeya, T., Rintelen, F., Fernandez, R., and Hafen, E.
(2001) An evolutionarily conserved function of the Drosophila insulin receptor and insulin-like peptides in growth control.
Curr. Biol.
11, 213
221[CrossRef][Medline]
- Riehle, M. A., Garczynski, S. F., Crim, J. W., Hill, C. A., and Brown, M. R.
(2002) Neuropeptides and peptide hormones in Anopheles gambiae.
Science
298, 172
175[Abstract/Free Full Text]
- Gade, G., Hoffmann, K. H., and Spring, J. H.
(1997) Hormonal regulation in insects: facts, gaps, and future directions.
Physiol. Rev.
77, 963
1032[Abstract/Free Full Text]
- Gade, G.
(2004) Regulation of intermediary metabolism and water balance of insects by neuropeptides.
Annu. Rev. Entomol.
49, 93
113[CrossRef][Medline]
- Nassel, D. R.
(2002) Neuropeptides in the nervous system of Drosophila and other insects: multiple roles as neuromodulators and neurohormones.
Prog. Neurobiol.
68, 1
84[CrossRef][Medline]
- Baggerman, G., Cerstiaens, A., De Loof, A., and Schoofs, L.
(2002) Peptidomics of the larval Drosophila melanogaster central nervous system.
J. Biol. Chem.
277, 40368
40374[Abstract/Free Full Text]
- Baggerman, G., Boonen, K., Verleyen, P., De Loof, A., and Schoofs, L.
(2005) Peptidomic analysis of the larval Drosophila melanogaster central nervous system by two-dimensional capillary liquid chromatography quadrupole time-of-flight mass spectrometry.
J. Mass Spectrom.
40, 250
260[CrossRef][Medline]
- Baggerman, G., Liu, F., Wets, G., and Schoofs, L.
(2005) Bioinformatic analysis of peptide precursor proteins.
Ann. N. Y. Acad. Sci.
1040, 59
65[Abstract/Free Full Text]
- Yona, G., and Brenner, S.
(2000) Comparison of protein sequences and practical database searching, in
Bioinformatics: Sequence, Structure and Databanks (Higgins, D., and Taylor, W., eds) pp.167
190, Oxford University Press, Oxford, UK
- Canaff, L., Bennett, H. P., and Hendy, G. N.
(1999) Peptide hormone precursor processing: getting sorted?
Mol. Cell. Endocrinol.
156, 1
6[CrossRef][Medline]
- Nichols, R., Schneuwly, S. A., and Dixon, J. E.
(1988) Identification and characterization of a Drosophila homologue to the vertebrate neuropeptide cholecystokinin.
J. Biol. Chem.
263, 12167
12170[Abstract/Free Full Text]
- Nichols, R., Bendena, W. G., and Tobe, S. S.
(2002) Myotropic peptides in Drosophila melanogaster and the genes that encode them.
J. Neurogenet.
16, 1
28[Medline]
- Schneider, L. E., and Taghert, P. H.
(1988) Isolation and characterization of a Drosophilagene that encodes multiple neuropeptides related to Phe-Met-Arg-Phe-NH2 (FMRFamide).
Proc. Natl. Acad. Sci. U. S. A.
85, 1993
1997[Abstract/Free Full Text]
- Siviter, R. J., Coast, G. M., Winther, A. M., Nachman, R. J., Taylor, C. A., Shirras, A. D., Coates, D., Isaac, R. E., and Nassel, D. R.
(2000) Expression and functional characterization of a Drosophila neuropeptide precursor with homology to mammalian preprotachykinin A.
J. Biol. Chem.
275, 23273
23280[Abstract/Free Full Text]
- Vanden Broeck, J., Veelaert, D., Bendena, W. G., Tobe, S. S., and De Loof, A.
(1996) Molecular cloning of the precursor cDNA for schistostatins, locust allatostatin-like peptides with myoinhibiting properties.
Mol. Cell. Endocrinol.
122, 191
198[CrossRef][Medline]
- Taylor, C. A., Winther, A. M., Siviter, R. J., Shirras, A. D., Isaac, R. E., and Nassel, D. R.
(2004) Identification of a proctolin preprohormone gene (Proct) of Drosophila melanogaster: expression and predicted prohormone processing.
J. Neurobiol.
58, 379
391[CrossRef][Medline]
- Hild, M., Beckman, B., Haas, S., Koch, B., Solvyev, V., Busold, C., Feelenberg, K., Boutros, M., Vingron, M., Sauer, F., Hoheisel, J., and Paro, R.
(2003) An integrated gene annotation and transcriptional profiling approach towards the full gene content of the Drosophila genome.
Genome Biol.
5, R3[CrossRef][Medline]
- Veenstra, J. A.
(2000) Mono- and dibasic proteolytic cleavage sites in insect neuroendocrine peptide precursors.
Arch. Insect Biochem. Physiol.
43, 49
63[CrossRef][Medline]
- Hofmann, K., and Stoffel, W.
(1993) Tmbasea database of membrane spanning protein segments.
Biol. Chem. Hoppe-Seyler
374, 166
- Nielsen, H., Brunak, S., and von Heijne, G.
(1999) Machine learning approaches for the prediction of signal peptides and other protein sorting signals.
Protein Eng.
12, 3
9[Abstract/Free Full Text]
- Uttenweiler-Joseph, S., Moniatte, M., Lagueux, M., Van Dorsselaer, A., Hoffmann, J. A., and Bulet, P.
(1998) Differential display of peptides induced during the immune response of Drosophila: a matrix-assisted laser desorption ionization time-of-flight mass spectrometry study.
Proc. Natl. Acad. Sci. U. S. A.
95, 11342
11347[Abstract/Free Full Text]
- Osuga, D. T., and Feeney, R. E.
(1978) Antifreeze glycoproteins from Arctic fish.
J. Biol. Chem.
253, 5338
5343[Free Full Text]
- Forsyth, I. A., and Wallis, M.
(2002) Growth hormone and prolactinmolecular and functional evolution.
J. Mammary Gland Biol. Neoplasia
7, 291
312[CrossRef][Medline]
- Pestarino, M.
(1983) Prolactinergic neurons in a protochordate.
Cell Tissue Res.
233, 471
474[CrossRef][Medline]
- De Loof, A., and Schoofs, L.
(1990) Homologies between the amino acid sequences of some vertebrate peptide hormones and peptides isolated from invertebrate sources.
Comp. Biochem. Physiol. B
95, 459
468[CrossRef][Medline]
- Takei, Y.
(2000) Structural and functional evolution of the natriuretic peptide system in vertebrates.
Int. Rev. Cytol.
194, 1
66[Medline]
- Kawakoshi, A., Hyodo, S., Yasuda, A., and Takei, Y.
(2003) A single and novel natriuretic peptide is expressed in the heart and brain of the most primitive vertebrate, the hagfish (Eptatretus burgeri).
J. Mol. Endocrinol.
31, 209
220[Abstract]
- Humphreys, M. H.
(2004)
-MSH, sodium metabolism, and salt-sensitive hypertension
Am. J. Physiol.
286, R417
R430
- Stangier, J., Hilbich, C., Burdzik, S., and Keller, R.
(1992) Orcokinin: a novel myotropic peptide from the nervous system of the crayfish, Orconectes limosus.
Peptides
13, 859
864[CrossRef][Medline]
- Huybrechts, J., Nusbaum, M. P., Bosch, L. V., Baggerman, G., De Loof, A., and Schoofs, L.
(2003) Neuropeptidomic analysis of the brain and thoracic ganglion from the Jonah crab, Cancer borealis.
Biochem. Biophys. Res. Commun.
308, 535
544[CrossRef][Medline]
- Pascual, N., Castresana, J., Valero, M. L., Andreu, D., and Belles, X.
(2004) Orcokinins in insects and other invertebrates.
Insect Biochem. Mol. Biol.
34, 1141
1146[CrossRef][Medline]
- Hofer, S., Dircksen, H., Tollback, P., and Homberg, U.
(2005) Novel insect orcokinins: characterization and neuronal distribution in the brains of selected dicondylian insects.
J. Comp. Neurol.
490, 57
71[CrossRef][Medline]
- Xu, W. H., Rinehart, J. P., and Denlinger, D. L.
(2003) Structural characterization and expression analysis of prothoracicotropic hormone in the corn earworm, Helicoverpa zea.
Peptides
24, 1319
1325[CrossRef][Medline]
- Jin, W., Riley, R. M., Wolfinger, R. D., White, K. P., Passador-Gurgel, G., and Gibson, G.
(2001) The contributions of sex, genotype and age to transcriptional variance in Drosophila melanogaster.
Nat. Genet.
29, 389
395[CrossRef][Medline]
- Bogdanov, Y., Balaban, P. M., Zakharov, I. S., Poteryaev, D. A., and Belyavsky, A. V.
(1996) Identification of two novel genes specifically expressed in the D-group neurons of the terrestrial snail CNS.
Invert. Neurosci.
2, 61
69[CrossRef][Medline]
- Casali, A., and Casanova, J.
(2001) The spatial control of Torso RTK activation: a C-terminal fragment of the Trunk protein acts as a signal for Torso receptor in the Drosophila embryo.
Development
128, 1709
1715[Abstract]
- Fletcher, J. C., and Thummel, C. S.
(1995) The Drosophila E74 gene is required for the proper stage- and tissue-specific transcription of ecdysone-regulated genes at the onset of metamorphosis.
Development
121, 1411
1421[Abstract]
- Lehman, M.
(2003) Drosophila Sgs genes: stage and tissue specificity of hormone responsiveness.
Bioessays
18, 47
54
- De Loof, A., and Huybrechts, R.
(1998) "Insects do not have sex hormones": a myth?
Gen. Comp. Endocrinol.
111, 245
260[CrossRef][Medline]
- Richards, J. S., Russell, D. L., Ochsner, S., Hsieh, M., Doyle, K. H., Falender, A. E., Lo, Y. K., and Sharma, S. C.
(2002) Novel signaling pathways that control ovarian follicular development, ovulation, and luteinization.
Recent Prog. Horm. Res.
57, 195
220[Abstract/Free Full Text]
- Cardona-Gomez, G. P., Mendez, P., DonCarlos, L. L., Azcoitia, I., and Garcia-Segura, L. M.
(2002) Interactions of estrogen and insulin-like growth factor-I in the brain: molecular mechanisms and functional implications.
J. Steroid Biochem. Mol. Biol.
83, 211
217[CrossRef][Medline]
- De Gregorio, E., Spellman, P. T., Rubin, G. M., and Lemaitre, B.
(2001) Genome-wide analysis of the Drosophila immune response by using oligonucleotide microarrays.
Proc. Natl. Acad. Sci. U. S. A.
98, 12590
12595[Abstract/Free Full Text]
- Boutros, M., Agaisse, H., and Perrimon, N.
(2002) Sequential activation of signaling pathways during innate immune responses in Drosophila.
Dev. Cell
3, 711
722[CrossRef][Medline]
- McDonald, M. J., and Rosbash, M.
(2001) Microarray analysis and organization of circadian gene expression in Drosophila.
Cell
107, 567
578[CrossRef][Medline]
- Claridge-Chang, A., Wijnen, H., Naef, F., Boothroyd, C., Rajewsky, N., and Young, M. W.
(2001) Circadian regulation of gene expression systems in the Drosophila head.
Neuron
32, 657
671[CrossRef][Medline]
- Zinke, I., Schutz, C. S., Katzenberger, J. D., Bauer, M., and Pankratz, M. J.
(2002) Nutrient control of gene expression in Drosophila: microarray analysis of starvation and sugar-dependent response.
EMBO J.
21, 6162
6173[CrossRef][Medline]
- Fujii, S., and Amrein, H.
(2002) Genes expressed in the Drosophila head reveal a role for fat cells in sex-specific physiology.
EMBO J.
21, 5353
5363[CrossRef][Medline]
- Janssens, H., and Gehring, W. J.
(1999) Isolation and characterization of drosocrystallin, a lens crystallin gene of Drosophila melanogaster.
Dev. Biol.
207, 204
214[CrossRef][Medline]
- Eigenheer, R. A., Nicolson, S. W., Schegg, K. M., Hull, J. J., and Schooley, D. A.
(2002) Identification of a potent antidiuretic factor acting on beetle Malpighian tubules.
Proc. Natl. Acad. Sci. U. S. A.
99, 84
89[Abstract/Free Full Text]
- Baggerman, G., Huybrechts, J., Clynen, E., Hens, K., Harthoorn, L., Van der, H. D., Poulos, C., De Loof, A., and Schoofs, L.
(2002) New insights in adipokinetic hormone (AKH) precursor processing in Locusta migratoria obtained by capillary liquid chromatography-tandem mass spectrometry.
Peptides
23, 635
644[CrossRef][Medline]
- Pelham, H. R.
(1990) The retention signal for soluble proteins of the endoplasmic reticulum.
Trends Biochem. Sci.
15, 483
486[CrossRef][Medline]
- Vierstraete, E., Verleyen, P., Baggerman, G., DHertog, W., Van den Bergh, G., Arckens, L., De Loof, A., and Schoofs, L.
(2004) A proteomic approach for the analysis of instantly released wound and immune proteins in Drosophila melanogaster hemolymph.
Proc. Natl. Acad. Sci. U. S. A.
101, 470
475[Abstract/Free Full Text]
- Svensson, M., Skold, K., Svenningsson, P., and Andren, P. E.
(2003) Peptidomics-based discovery of novel neuropeptides.
J. Proteome Res.
2, 213
219[CrossRef][Medline]
- Uhler, M., and Herbert, E.
(1983) Complete amino acid sequence of mouse pro-opiomelanocortin derived from the nucleotide sequence of pro-opiomelanocortin cDNA.
J. Biol. Chem.
258, 257
261[Abstract/Free Full Text]
- Pan, F. M., and Chang, W. C.
(1989) Nucleotide sequence of bullfrog pro-opiomelanocortin cDNA.
Nucleic Acids Res.
17, 5843[Free Full Text]
- Martens, G. J., Civelli, O., and Herbert, E.
(1985) Nucleotide sequence of cloned cDNA for pro-opiomelanocortin in the amphibian Xenopus laevis.
J. Biol. Chem.
260, 13685
13689[Abstract/Free Full Text]
- Kitahara, N., Nishizawa, T., Iida, K., Okazaki, H., Andoh, T., and Soma, G. I.
(1988) Absence of a
-melanocyte-stimulating hormone sequence in proopiomelanocortin mRNA of chum salmon. Oncorhynchus keta.
Comp. Biochem. Physiol. B
91, 365
370[CrossRef][Medline]
- Salzet, M., Wattez, C., Bulet, P., and Malecha, J.
(1994) Isolation and structural characterization of a novel peptide related to
-melanocyte stimulating hormone from the brain of the leech Theromyzon tessulatum.
FEBS Lett.
348, 102
106[CrossRef][Medline]
- Dircksen, H., Burdzik, S., Sauter, A., and Keller, R.
(2000) Two orcokinins and the novel octapeptide orcomyotropin in the hindgut of the crayfish Orconectes limosus: identified myostimulatory neuropeptides originating together in neurones of the terminal abdominal ganglion.
J. Exp. Biol.
203, 2807
2818[Abstract]
- Bungart, D., Hilbich, C., Dircksen, H., and Keller, R.
(1995) Occurrence of analogues of the myotropic neuropeptide orcokinin in the shore crab, Carcinus maenas: evidence for a novel neuropeptide family.
Peptides
16, 67
72[CrossRef][Medline]
- Yasuda-Kamatani, Y., and Yasuda, A.
(2000) Identification of orcokinin gene-related peptides in the brain of the crayfish Procambarus clarkii by the combination of MALDI-TOF and on-line capillary HPLC/Q-Tof mass spectrometries and molecular cloning.
Gen. Comp Endocrinol.
118, 161
172[CrossRef][Medline]
- Skiebe, P., Dreger, M., Meseke, M., Evers, J. F., and Hucho, F.
(2002) Identification of orcokinins in single neurons in the stomatogastric nervous system of the crayfish, Cherax destructor.
J. Comp Neurol.
444, 245
259