Article | Open | Published:

Nezara viridula (Hemiptera: Pentatomidae) transcriptomic analysis and neuropeptidomics

Scientific Reportsvolume 8, Article number: 17244 (2018) | Download Citation

Abstract

Stinkbugs (Hemiptera: Pentatomidae) are of major economic importance as pest of crops. Among the species composing the stinkbug complex, Nezara viridula is one of the most abundant in Brazil, Argentina and the Southern USA. However, this species has been poorly characterized at the genetic and physiological level. Here we sequenced and analyzed the complete transcriptome of N. viridula male and female adults. We identified neuropeptide precursor genes and G-protein coupled receptors for neuropeptides in this transcriptome. Mature neuropeptides were identified in N. viridula brain extracts by liquid chromatography-tandem mass spectrometry. We also analyzed the neuropeptide precursor complement in the genome sequence of Halyomorpha halys, another pentatomid of economic relevance. We compared the results in both pentatomids with the well-characterized neuropeptide repertoire from the kissing bug Rhodnius prolixus (Hemiptera: Reduviidae). We identified both group-specific features (which could be related to the different feeding habits) and similarities that could be characteristic of Heteroptera. This work contributes to a deeper knowledge of the genetic information of these pests, with a focus on neuroendocrine system characterization.

Introduction

Stinkbugs (Hemiptera: Pentatomidae) are of major economic importance as crop pests in wide regions of the world, being the most important agricultural insect pests. They feed on plants and seeds by means of their piercing-sucking mouthparts causing deformations, seed abortion, decrease in germination and survival, and transmission of plant pathogens1,2. The lower quality of seeds caused by stinkbugs is the main reason of yield reduction in crops, and results in economic losses1. Among the species of the stinkbug complex, Nezara viridula is one of the most abundant in Brazil, Argentina and Southern USA2. This species feeds on a variety of plants, the damage caused to soybean (Glycine max), sunflower (Helianthus annuus) and corn (Zea mays) being of particular importance. Despite the economic relevance of N. viridula, this species has been poorly characterized at the genetic level; only a small number of gene sequences were reported3,4,5. The lack of genetic information makes it difficult to use N. viridula for many physiological experiments. Besides, Halyomorpha halys is an invasive pest that has rapidly spread across North America, causing severe damage to fruit and vegetable crops1. H. halys genome has been sequenced (GeneBank accession number GCA_000696795.1) and annotated, but a comprehensive analysis of the most relevant gene families has not been reported to date.

Stinkbugs are controlled by means of neurotoxic insecticides6. In Argentina, for example, the insecticide market has tripled in twenty years, stinkbugs being one of its main targets7. However, neurotoxic chemicals present serious disadvantages, such as high environmental impact and negative effects on human health and on beneficent species (i.e., pollinators and natural enemies of pests). Moreover, the continuous use of insecticides results in a high selection pressure that leads to the emergence of resistant populations. Hemipterans, in particular, are capable of developing extremely high levels of resistance to neurotoxics (see8,9,10,11,12). Cases of failures in the control of stinkbugs with insecticides have already been reported13. Hence, the identification of targets for alternative insecticides to complement or replace neurotoxics is an important goal in applied entomology. In order to protect food sources, human health and natural environment, an ideal insecticide should be species-specific, biodegradable and less prone to generate resistant populations.

Neuropeptides and their receptors (mainly G-protein coupled receptors; GPCRs), but also tyrosine-kinase and guanylate-ciyclase receptors) regulate critical physiological processes in insects (see14). Therefore, a growing interest in the study of neuroendocrine molecules as potential insecticidal targets does exist, given that they would satisfy the characteristics of new alternative compounds15,16. However, the use of neuropeptides and their receptors as pesticides remains underexploited.

The identification and characterization of neuroendocrine-related genes in genomes and transcriptomes have been proposed as the first step in the “genome-to-lead” strategy for achieving new insecticidal solutions17. This strategy is harnessed by the use of next-generation sequencing techniques in the study of harmful insects. Here we sequenced and analyzed the complete transcriptome of N. viridula male and female adults, and identified neuropeptide precursor genes and their receptors. We used a liquid chromatography/tandem mass spectrometry approach in order to confirm the presence of mature neuropeptides encoded in some of these precursors in brain extracts. We also analyzed neuropeptide precursors in H. halys genome, and compared the results in both pentatomids with the well-characterized neuropeptide repertoire from the kissing bug R. prolixus (Hemiptera: Reduviidae)14,18,19,20,21. Interestingly, we identified group-specific features (which could be related to the different feeding habits), and similarities that could be characteristic of Heteroptera. This work contributes with genomic information and structural knowledge on the neuroendocrine system, a necessary step for advancing in molecular, biochemical and physiological studies with these species.

Results and Discussion

N. viridula transcriptome characterization and completeness analysis

A total of 280.4 megabases of raw sequence data was generated, resulting in 299,148 assembled transcripts (Supplementary Information (Supp. Info.) 1). We used different bioinformatics tools to estimate the coverage (see Methods), obtaining values ≥94.4% (Supp. Info. 1). These coverage metrics indicated that the assembled transcriptome is sufficient for a meaningful analysis and the characterization of gene families.

Orthologue gene analysis among four Hemipteran species: N. viridula, H. halys, R. prolixus (Reduviidae) and Oncopeltus fasciatus (Lygaeidae) was performed (Supp. Info. 2). In order to conduct a transcriptome-composition representation analysis, we carried out a functional annotation of N. viridula transcriptome using BLAST2GO (Supp. Info. Dataset 3). Using a Kyoto Encyclopedia of Genes and Genomes (KEGG) database we did a pathway analysis and could identify 125 total pathways (Supp. Info. Dataset 3). The functional annotation for the 415 transcripts shared only by the phytophagous N. viridula, O. fasciatus and H. halys resulted in the assignment of 771 GO terms and the annotation of 294 transcripts (Supp. Info. Dataset 4). KEGG pathway analysis allowed for the detection of a short list of 36 enzymatic pathways, including 39 enzymes related to central metabolism and only present in these three phytophagous insects but not in R. prolixus (Supp. Info. Dataset 4). The BLAST2GO terms and KEGG pathway representation found in N. viridula transcriptome correlate with the ones observed in H. halys, in agreement with their close evolutive origin.

Neuropeptide precursor genes in N. viridula and H. halys

Despite their economic importance and the potential of the neuroendocrine system as a source of targets for insect pest control15, the available information on the Pentatomidae neuroendocrine system is very limited. To our knowledge, the identification of products of only 8 neuropeptide precursors has been reported (pyrokinin (PK), myosuppressin (MS), corazonin (CRZ), adipokinetic hormone (AKH), short neuropeptide F (sNPF), PVK/CAPA peptides, allatotropin (AT) and tachykinins (TK))5,22,23,24,25. A comprehensive analysis of neuropeptide precursor genes has not been reported yet.

In the N. viridula adult transcriptome we identified a total of 55 neuropeptide precursor and protein hormone transcripts belonging to 38 conserved families (Table 1; Figs 1, 2; Supp. Info. 5). In H. halys genome we identified 55 neuropeptide precursor and protein hormone genes, belonging to 42 families (Table 1; Figs 1, 2; Supp. Info. 5). These numbers are within the range of precursor genes annotated in the genomes of the hemipterans R. prolixus and Cimex lectularius, 42 and 46 respectively14,26,27,28. Table 1 lists the neuropeptide precursor and protein hormone genes found in N. viridula, H. halys, R. prolixus, Drosophila melanogaster (Diptera), Bombyx mori (Lepidoptera) and Tribolium castaneum (Coleoptera).

Table 1 Neuropeptide precursors and protein hormones detected in N. viridula, H. halys, R. prolixus, B. mori, D. melanogaster and T. castaneum.
Figure 1
Figure 1

Multiple sequence alignment of neuropeptide precursors that are specific to N. viridula and H. halys. The sequences of R. prolixus were used as a reference. Predicted convertase cleavage sites, according to the rules proposed by Veenstra84, are shadowed in red. Glycine residues shadowed in pink indicate predicted amidation sites. The green shadows indicate the predicted signal peptides. The peptides found by mass spectrometry and deduced from transcriptomic sequences are underlined. Black background indicates a fully conserved residue, gray background indicates a conservative substitution. Conserved cysteine residues in neuroparsin A and insulin-like peptides are shadowed in yellow. In Insulin-like peptide, B chain, C peptide and A chain are boxed.

Figure 2
Figure 2

Multiple sequence alignment of neuropeptide precursors that are specific to the heteropteran species analyzed here. The sequences of R. prolixus were used as a reference. Predicted convertase cleavage sites, according to the rules proposed by Veenstra70, are shadowed in red. Glycine residues shadowed in pink indicate predicted amidation sites. The green shadows indicate the predicted signal peptides. The peptides found by mass spectrometry and deduced from transcriptomic sequences are underlined. Black background indicates a fully conserved residue, gray background indicates a conservative substitution.

Below we describe and discuss the particular characteristics of neuropeptide precursor genes and protein hormone identified in N. viridula transcriptome and H. halys genome, in comparison to the hematophagous heteropteran R. prolixus, whose neuroendocrine system has been extensively studied (for a review see14). A detailed description of the known physiological role of neuropeptides in hemipterans has been discussed in recent publications14,29,30,31. For the analysis, we grouped genes according to their structural particularities: (a) those with unique features in the pentatomids (Fig. 1); (b) those that are specific to R. prolixus, N. viridula and H. halys with respect to other insect species, probably reflecting conserved features in Heteroptera (Fig. 2); and (c) those that are highly conserved compared to most insect species (Supp. Info. 5).

Neuropeptide precursor genes with specific characteristics in H. halys and N. viridula

AKH

NezviAKH and HalhaAKH precursors encode the core peptide pQLNFSPGW-amide, which was previously sequenced by mass spectrometry22,24, and has been reported as characteristic of Pentatomidae22, different to RhoprAKH with the sequence pQLTFSTDW-amide (Fig. 1).

AST triple C

The arthropod genomes have one to three genes encoding AST-C paralogue genes, and some chelicerates have even more32. The existence of three AST-C type paralogues was first detected in arthropods by J.A. Veenstra in 201632. The author classified the three paralogues according to their conserved core peptide in AST-C (or PISC-AST), AST double C and AST triple C32. According to this classification, the characteristic of AST triple C is the core peptide SYWKQCAFNAVSCFamide32. These molecules have been detected in hemipterans, even though in some cases (such as RhoprASTCCC, TriinASTCCC and NilluASTCCC) they are not predicted to be amidated20,28,31,33,34.

For most insect genomes and transcriptomes, except for L. migratoria that has all the three35, two ASTC paralogues were reported32. Whereas PISCF-AST and double C paralogues have been described in Diptera, Coleoptera, and Lepidoptera32, double and triple C-type paralogues were reported in most Hymenoptera32,36,37 and in Hemiptera20,28,31,33,34. It is interesting to note that AST triple Cs in hemipterans have been classified as PISCF-ASTs in the literature, given that the detection of the three types of AST-C paralogues was reported later than the neuropeptidomic analysis for some species. However, according to the taxonomy proposed with the detection of three paralogues in arthropods32, they would be better categorized as AST triple C.

As was described in other Hemiptera20,28,31,33,34, we found AST double C and AST triple C in N. viridula transcriptome and H. halys genome. RhopASTCCC, HalhaASTCCC and NezviASTCCC are conserved throughout the sequence of the precursor. However, the core peptide of pentatomids differs in the C-terminus (the most conserved region) from all the AST triple C reported to date32 (Fig. 1). As in other hemipterans, this peptide is not predicted to be amidated. Furthermore, in the pentatomids the precursors themselves are extended in the C-terminus. AST-Cs and somatostatin (its orthologue in vertebrates) precursors contain a single copy of the core peptide at C-terminal end32. Both NezviASTCCC and HalhaASTCCC precursors are C-terminal extended after the core peptide, with a sequence EGGNSGRTLYAPKNHGKWSFFTHLFSF (Fig. 1), which is not conserved in insect genomes. Hence, ASTCCC in pentatomids have significant differences with respect to their orthologues.

Ecdysis triggering hormone (ETH)

In most insect species, including R. prolixus, the ETH precursor encodes one or two paracopies of the core peptide38. A remarkable characteristic in HalhaETH and NezviETH is that they encode three core ETH neuropeptides (Fig. 1).

Neuroparsins (NPA) and insulin-like peptides (ILPs)

Both in R. prolixus genome and in most insect genomes sequenced to date, only one NPA gene exists, even though the recently published C. lectularius genome encodes four NPA paralogues27. An interesting finding of our analysis is that the genome of H. halys possesses twelve paralogue NPA genes (Fig. 1). Nine of them are encoded in the same contig (number 1180), indicating that this expansion is due to gene duplications. Besides, an expansion in NPA gene was also suggested by N. viridula transcriptomic analysis, indicating that this could be a particularity of Pentatomidae. For N. viridula, 11 complete and 1 partial open reading frames (ORFs) encoding NPA precursors were detected (Fig. 1). One of them has been previously cloned and sequenced5. Phylogenetic analysis indicated orthologies among the different members of this gene family (Fig. 3). The expansion of several neuropeptide gene families, such as AST-C and AKH32,39, occurs in insect genomes, but the expansion in NPA gene family is unusual given the number of paralogue genes detected (with 12 members/genome). The size of these expansions is comparable to the expansion of ILP gene family in B. mori40,41 (32 gene copies in the genome). Conversely, only one gene encoding ILPs was detected as both in N. viridula transcriptome and in H. halys genome (Fig. 1). This represents a reduction, compared with most insect genomes.

Figure 3
Figure 3

Bayesian phylogenetic analysis of neuroparsin A precursors from R. prolixus, H. halys and N. viridula. N. viridula transcripts are indicated with a black circle. The scale bar represents genetic distance. The number at each node indicates the posterior probabilities.

In a recent paper, J.A. Veenstra42 observed an inverse correlation in the numbers of ILP and NPA paralogues in a given genome. Those species having several insulin paralogues (such as D. melanogaster) usually possess zero or one NPA gene in their genome. Conversely, those species having several NPA transcripts, such as decapods and Locust spp., have a small number of ILPs. The results presented here support this observation, which suggests a complementation between both hormonal systems. Interestingly, ILPs and NPAs are one of the few neuropeptides in insects acting on tyrosine kinase receptors43,44.

NPLP1

HalhaNPLP1 and NezviNPLP1 are highly conserved, encoding 22 putative mature neuropeptides, from which three are predicted to be amidated. Two of the amidated peptides could be detected in N. viridula brain extracts by mass spectrometry (Fig. 4), suggesting that they would be more stable than the unmodified ones. As in other insect species, NPLP1 paracopies do not present a conserved motif among them. Furthermore, no high conservation with their orthologue in R. prolixus was observed, with the exception of NPLP1-4 and NPLP1-10 (Fig. 1).

Figure 4
Figure 4

Mass spectrometry spectra verifying the presence of neuropeptides encoded in the precursors described, in brain extracts from male and female adult N. viridula.

NVPamide

The conservation between RhoprNVP-like precursor and its orthologues in N. viridula and H. halys is low, even in the region of the core peptides (Fig. 1). We detected two peptides encoded in NezviNPV-like precursor by mass spectrometry. One of them was found either complete or truncated in one C-terminal residue (Fig. 4).

Neuropeptide precursor features that are specific to R. prolixus, N. viridula and H. halys

Previous neuropeptidomic studies in R. prolixus indicated that several neuropeptide precursor genes are unique in their conserved domains with respect to other insect species14. Here, we found that many of these particular domains are also present in N. viridula and H. halys, suggesting that they could be characteristic of Heteroptera.

Elevenin

Different to R. prolixus, which has two, H. halys possess one gene encoding elevenin. Besides, only one NezviElevenin gene was detected (Fig. 2).

FMRDFamide

NezviFMRFamide seems to be incomplete in the transcriptome, when compared to HalhaFMRFamide and RhoprFMRFamide (Fig. 2). Most of the peptides encoded in these precursors have the usual sequence FMRF-amide. Four peptides encoded in RhoprFRMFamide precursor have an infrequent FIRF-amide C-terminal domain25,28. This domain is also present in one of the peptides encoded in HalhaFIRF-amide precursor, and in some of the peptides encoded in CimleFMRFamide precursor37, indicating that this domain is extended in Heteroptera.

FGL-AST

NezviFGL-AST and HalhaFGL-AST encode six core peptides each. As in RhoprFGL-AST25, the first HalhaFGL-AST in the precursor has the unusual C-terminal sequence LGL-amide (LTELGL-amide in R. prolixus; DLGKDSQPKSLYDLGL-amide in H. halys), instead of the conserved FGL-amide, which is present in all the paracopies of NezviFGL-AST and in most insect FGL-ASTs (http://neurostresspep.eu/diner/seqlogopage?neuropeptideID=11). Besides, NezviFGL-AST and HalhaFGL-AST have a putative amidated peptide with sequence DGAKQELP-amide, which is not conserved in other species (Fig. 4). We confirmed the expression of two core NezviFGL-AST/ASTs in the brain by mass spectrometry (Fig. 4).

Kinin

For most insect kinin neuropeptides, the C-terminal FXXWG-amide is well conserved (http://neurostresspep.eu/diner/seqlogopage?neuropeptideID=7)45. RhoprKinin precursor encodes the conserved core peptides, and five peptides with the unusual C-terminal pentapeptide FSXWA-amide20,46. NezviKinin as HalhaKinin precursors encode both kinds of core peptides, suggesting that the FSXWA-amide could be extended in Heteroptera (Fig. 2).

Myoinhibitory peptide (MIP)

MIPs were originally characterized by the core C-terminal motif W(6×)W-amide. However, MIP precursors in R. prolixus and other hemipterans encode both peptides with the conserved motif, and other paracopies with the unusual motif W(7×)W-amide20,28,31,34. Our analysis revealed that NezviMIP and HalhaMIP have six W(6×)W-amide and other six W(7×)W-amide each (Fig. 2).

Myosuppressin (MS)

In the triatomines R. prolixus and Triatoma infestans, the MS precursor is unusual among their orthologues. It possesses an Ile in the third position, instead of the usual Val, and the sequence FMRF-amide in the C-terminus instead of the more conserved FLRF-amide20,31. In N. viridula and H. halys, the third residue of MS neuropeptide is also particular (Leu), but the C- terminal is FLRF-amide (Fig. 2). This indicates that FMRF-amide C terminal could be restricted to triatomine MSs. Using mass spectrometry, we detected the NezviMS core peptide with two kinds of post-translational modifications: C-terminal amidation and N-terminal pyroglutamic and C-terminal amidation (Fig. 4). MS was also detected in triatomines in these two modified forms19,20,47; this fact could indicate that both forms have a physiological role, or that a stable intermediate form is being detected by tandem mass spectrometry. Until now, the reported functional analysis with RhoprMS has been performed using the synthetic pyroglutamated peptide48,49,50; bioassays using the non-pyroglutamated MSs would be necessary in order to clarify this point.

Orcokinins (OK)

Like RhoprOK, NezviOK gene is expressed in three splicing variants: two of them encoding OKB neuropeptides (NezviOKB and NezviOKC) and another encoding OKA (NezviOKA) neuropeptides (Fig. 2). RhoprOK, HalhaOK and NezviOK precursors are highly conserved. Two mature neuropeptides encoded in NezviOKA gene were detected by mass spectrometry (Fig. 4).

RYamide

RYamides are conserved neuropeptides in insects, first reported in the parasitic wasp Nasonia vitripennis51. In R. prolixus, the presence of a RhoprRYa precursor has been recently described14. NezviRYa and HalhaRYa encode a highly conserved core peptide with the sequence GSDNFFMGSRYamide. Pentatomids also encode another amidated conserved peptide with the sequence FY3(X)RY-amide (Fig. 2) that is less usual when compared with the RYa sequence in most insect species (http://neurostresspep.eu/diner/seqlogopage?neuropeptideID=41). Besides, the three precursors have a conserved non-amidated peptide with the sequence SGIFWTGSRYN, which is also present in Triatoma dimidiata31. The phylogenetic conservation could suggest a relevant physiological role.

Sulfakinin

Like RhoprSK precursor, NezviSK and HalhaSK precursors encode two conserved sulfakinin (SK) neuropeptides. In the three species, the SK precursor encodes the frequent C-terminal sequence GHMRF-amide, and another peptide with the unusual GYMRF-amide. The presence of a conserved and unusual SK was also observed in C. lectularius28, indicating that this could be a characteristic of Heteroptera. In several insect species, including C. lectularius, SK wasfound to be sulfated in a Y residue located N-terminal to the core sequence28,52. Hence, Heteroptera SKs would have two potential targets for sulfation. The particular Y in the GYMRF-amide core SK from Heteroptera has not been found to be sulfated in tandem mass spectrometry studies reported to date19,28. However, Predel et al.28 observed weak ion signals supporting sulfation of both Y residues in C. lectularius by MALDI-TOF mass spectrometry.

Short neuropeptide F

RhoprSNPF, NezviSNPF and HalhaSNPF precursors are shorter than most of their orthologues in insects described to date. The mature peptides predicted in RhoprSNF, NezviSNF and HalhaSNF are highly conserved (Fig. 2). We were able to detect the core NezviSNF by mass spectrometry (Fig. 4).

Highly conserved neuropeptide precursor genes

ACP

NezviACP and HalhaACP precursors are probably incomplete in our predictions, given that a signal peptide in their N-terminal region is absent (Supp. Info. 5). However, the core peptides are complete and highly conserved compared to RhoprACP53 (Supp. Info. 5).

Calcitonin-like diuretic hormone (CT-DH)

Only one isoform of CT-DH was detected in N. viridula transcriptome and predicted in H. halys genome, even though three isoforms were reported in R. prolixus20,54. These precursors encode an identical 31 residue bioactive peptide, which is highly conserved among insect species.

CAPA and pyrokinin

Unlike R. prolixus genome, which encodes two CAPA paralogue genes55, N. viridula and H. halys seem to have a unique CAPA peptide precursor. In the three species analyzed, these precursors encode two CAPA and one PK peptide (Supp. Info. 5). Although we could not identify the PK precursor in the transcriptome of N. viridula, Predel et al.24 reported a NezviPK mature peptide that is not the one encoded in NezviCAPA precursor, indicating that NezviPK would be present in the genome. Furthermore, a HalhaPK precursor gene was detected in the genomic sequence (Supp. Info. 5).

CCHamide

Like other species, R. prolixus possesses two CCHamide precursor genes in its genome (RhoprCCHamide1 and RhoprCCHamide2)56, even though only RhoprCCHamide2 has been identified in its whole lenght20. The automatic prediction revealed one HalhaCCHamide gene (CCHAa 1), but we were able to detect also a fragment of HalhaCCHa2 in the contig number 454. As much NezviCCHa1 as NezviCCHa2 transcripts are present in the transcriptome; both possessing a highly conserved core peptide (Supp. Info. 5).

CNMamide

NezviCNMa and HalhaCNMa were found in the databases. The precursors identified lack signal peptide, suggesting that the sequences are incomplete in their N-terminal region. The conservation between RhoprCNMa and the pentatomids is low, except for the ASYMSLCHFKICNM-amide core peptide, which is identical in the three species analyzed here.

CRF-like diuretic hormone (CRF-DH)

In most insect species, a single isoform of CRF-DH gene exists. However, in moth, beetles and Schistocerca gregaria, this gene presents splicing variants57. Our analysis detected two variants for NezviCRF-DH (Supp. Info. 5), a fact that was not reported previously in hemipterans. The isoforms differ in their 3′ region; the ORF of one isoform is 303 bp shorter than the other, not affecting the predicted bioactive peptide (Supp. Info. 5). The predicted mature peptides encoded in NezviCRF-DH and HalhaCRF-DH are both 44 amino acids in length, whereas the one in RhoprCRF-DH is 46 amino acids long20,58.

Tachykinin (TK) and natalisin (NTL)

NezviTK and HalhaTK precursors encode seven core TK peptides (Fig. 4). HalhaTK precursor is around 50 amino acids longer in its N terminal region compared with NezviTK and RhoprTK (Supp. Info. 5). This does not seem to be due to an error in the predictions or in the assemblies, given that the three precursors encode a signal peptide.

Natalisin was not detected in N. viridula transcriptome, and could only partially be reconstructed from H. halys genome (Supp. Info. 5). The structure of the core neuropeptides in HalhaNTL is conserved compared with R. prolixus and other species59.

Other neuropeptide precursor genes identified in N. viridula transcriptome and H. halys genome were ASTCC, allatotropin, crustacean cardioactive peptide, corazonin, eclosion hormone (only in H. halys), IDLSRF-like peptide, ITG-like, long neuropeptide F, SIFamide, pigment dispersing factor and proctolin (Supp. Info. 5). Ion transport peptide (ITP) is usually processed in different splicing variants in insects, including R. prolixus20. We detected only one isoform (ITPB) in N. viridula transcriptome (Supp. Info. 5). Likewise, the orthologue of RhoprITPA seems to be absent in the H. halys genome. We also detected the glycoprotein hormones NezviGPA2, HalhaGPA2, HalhaGPB5 and the protein hormones NezviBurs alfa, NezviBurs beta, HalhaBurs alfa and HalhaBurs beta (Supp. Info. 5). All these genes seem to be highly conserved in pentatomids compared to their orthologues in other insect species (http://neurostresspep.eu/diner/insectneuropeptides).

Neuropeptides not detected

EH, PK and GPA2 are highly conserved in insect genomes. The three of them were detected in H. halys genome but not in N. viridula transcriptome, probably due to the incompleteness characteristic of transcriptomes when compared to genomes.

The existence of a basal and variable set of neuropeptides in insects has been proposed51. The basal set would reflect the involvement in survival-related processes, whereas the variable set would regulate specialized events related to particular adaptations. Even though NTL does not belong to the basal set, its presence in H. halys genome suggests that it should also be present in N. viridula, although it was not detected in the transcriptome. The remaining components of the variable set (inotosin, NPLPs 2–4, sex peptide, trissin and PTTH) have not been identified in triatomine databases to date14,31, even though PTTH has been reported in other hemipteran species such as N. lugens33 and A. pisum60, and inotosin was detected in N. lugens33. We did not detect these precursors in our genomic, transcriptomic or peptidomic approaches.

Genes encoding G-protein coupled receptors for neuropeptides in N. viridula

A total of 34 transcripts encoding putative GPCRs for neuropeptides and protein hormones were identified in the N. viridula transcriptome (see sequences in Supp. Info. 6). From them, 24 belong to family A and 10 belong to family B GPCRs (Fig. 5). Possible ligands of these GPCRs were deduced by phylogenetic analysis and sequence similarity, comparing with the GPCRs for neuropeptides and protein hormones in two species were this gene family has been well-studied (for reviews see14,18,61,62): D. melanogaster and R. prolixus. Most of these receptors have been functionally deorphanized and characterized in both species53,56,63,64,65,66,67,68,69,70,71,72,73,74,75, whereas RhoprASTCR, RhoprATR, RhoprPDFR, RhoprBursR, RhoprFalpsR, RhoprPrlR, RhoprSKR and RhoprRFaR are phylogeny-based predictions14,18. A clear orthologue could be assigned for most of the GPCR transcripts identified in N. viridula transcriptome (Fig. 5).

Figure 5
Figure 5

Bayesian phylogenetic analysis of GPCRs for neuropeptides and protein hormones from D. melanogaster (Dm), R. prolixus (Rp) and N. viridula (Nv). N. viridula transcripts are indicated with a black circle. The scale bar represents genetic distance. The accession number in FlyBase is indicated for D. melanogaster GPCRs; for R. prolixus, either GeneBank accession number, contig number or transcript number in vectorbase/VectorBase (www.vectorbase.org) are indicated (for reconstructed R. prolixus GPCRs see predicted sequences in14,18,14,18). Family A GPCRs are indicated by red branches; family B GPCRs are indicated by violet branches. Each GPCR family is indicated by different color shadows. The names of the receptors are indicated in the base of each clade. The scale bar represents genetic distance. The number at each node indicates the posterior probabilities.

For family A GPCRs, phylogenetic analysis allowed the identification of CNMamide-R (NvA1), MS-R (NvA2), Proctolin-R (NvA3), MIP-R (NvA4) CRZ-R (NvA7), AKH-R (NvA8), CCAP-R (NvA9 and NvA10), CCHamide-R (NvA11), ETH-R (NvA12), CAPA-R (NvA13), PK-R (NvA14 and NvA15), SK-R (NvA16), RFamide-R (NvA17), kinin-R (NvA18), SIFamide-R (NvA19), AT-R (NvA20), TK-R (NvA22 and NvA23), SNF-R (NvA24) (Fig. 5). The transcripts NvA5 and NvA6 encode family-A GPCRs that are grouped with orphan receptors from R. prolixus and D. melanogaster (Fig. 5). The transcript NvA21 is closely related to DromeTrissin-R; considering that trissin neuropeptide has not been detected in Hemiptera to date (including the present results), we propose that NvA21 could be a TK-R, given its sequence and configuration in the phylogenetic analysis (Fig. 5).

Regarding family B GPCRs, 2 orthologues were identified for CTDH-R (NvB1 and NvB2; closely related to RhoprCTDH-R1 and RhoprCTDH-R2 respectively), 1 for PDF-R (NvB3) and two were related to CRFDH-R (NvB4 and NvB5) (Fig. 5). One N. viridula GPCR (NvB9) seems to be the orthologue of DromeGPA2/GPB5 GPCR (Fig. 5). Four family B GPCRs from N. viridula were classified as orphan receptors (NvB8 and Nv10), or have no clear orthologue in D. melanogaster nor in R. prolixus (NvB6 and NvB7) (Fig. 5).

Concluding remarks

The combination of transcriptomics, bioinformatics and peptidomics has produced relevant advances in entomology, particularly regarding neuroendocrine system characterization. Nevertheless, these combined studies have not been used for a comprehensive characterization of the neuropeptidergic complement in pentatomids to date, despite their economic relevance. For N. viridula, neither genomic nor transcriptomic information has been made publically available until the present work. Even though H. halys has a sequenced genome, neuropeptide precursors were only automatically annotated. Here, we present the analysis of a highly complete N. viridula transcriptome de novo generated in our laboratory. Furthermore, we performed comprehensive comparisons and analyses of the neuropeptide precursor complement in N. viridula, H. halys and R. prolixus, and identified GPCRs for neuropeptides in N. viridula. Several neuropeptide systems are well conserved among the three species analyzed, and many of them are also conserved throughout the class Insecta. However, a few neuroendocrine systems possess characteristics that could be particular for/specific to pentatomids. The most remarkable is the case of NPA, a gene family that was expanded by gene duplications, both in N. viridula and in H. halys, with 12 NPA-encoding transcripts identified in each database.

The data presented here provide necessary tools for functional studies involving RNA interference, tissue specific gene expression, quantitative peptidomics, etc. Our study provides a promising starting point for physiological studies in pentatomids, oriented to basic entomology as much as to the design of next-generation insecticides based on neuroendocrine targets, which are expected to be species-specific and environmentally friendly.

Materials and Methods

Insect rearing

Adult male and female specimens of southern green stinkbug N. viridula were obtained from an established colony at Centro de Bioinvestigaciones, Universidad Nacional del Noroeste de Buenos Aires (UNNOBA), Argentina. This colony was originated with insects collected from soy and pea fields in the environs of Pergamino City (33°53′S: 60°34′W; Buenos Aires, Argentina). Insects were reared under controlled temperature (27 ± 2 °C), with a 16 h light:8 h dark photoperiod, and supplied with food (pea or soy beans) and water (soaked cotton wool) ad libitum. Tissue paper was provided for egg deposition. Deposited eggs were separated from the adults, and nymphs were divided according to their stage.


Transcriptome preparation and sequencing

Total RNA was isolated from 10 adult insects-both females and males in the same proportion-using Trizol Reagent (Invitrogen, Carlsbad, CA, USA). A cDNA library was constructed with 1 µg of total RNA and it was barcoded and subjected to the 100 pb pair-end shotgun sequencing using HiSeq. 2000 platform (Illumina) at the Novogene sequencing facility (California, USA). The raw sequence dataset is available at the NCBI-SRA SRR7184294.


Data filtering, trimming and assembly

Before de novo assembly raw reads were processed with FASTX-toolkit software (http://hannonlab.cshl.edu/fastx_toolkit/) to remove those with quality scores lower than 30 and to trim adaptor sequences. In order to avoid contaminants, we tested the persistence of adaptor sequences using BLASTn and the UniVec database (ftp://ftp.ncbi.nlm.nih.gov/pub/UniVec) from NCBI with the following command options: -reward 1, -penalty −5, -gapopen 3, -gapextend 3, -dust yes, -soft_masking true, -evalue 700 and -searchsp 1750000000000.

N. viridula dataset was assembled with Trinity-V2.3.2 software package76 using a pair-end assembly strategy and 25 bp long kmers. To avoid the inclusion of unique kmers that could possess sequencing errors we used the min_kmer_cov = 2 as minimum coverage parameter. The assembled sequence dataset is available at the NCBI-TSA GGPJ00000000.

Related assembled genes (paralogues or gene fragments) have the same cluster number (_c# coordinate in the transcript identifier) when transcripts are generated in Trinity, where all reads corresponding to a gene would end up in such cluster. In this way, to avoid the potential effects of transcript redundancy in the complete dataset, we built an additional transcriptome dataset for the assessment of statistical representation in later analysis. In this way, a non-redundant dataset (nr_dataset) was created, discarding alternative contigs belonging to the same cluster, and keeping only the largest contig (transcript) per cluster.


Transcriptome completeness analysis

The assembled transcriptome dataset was used to identify the proportion of the core eukaryotic genome coverage using HMM profiles for 458 core eukaryotic proteins77 and HMMER3 searches with the hmmscan command and the -T 40 and–domT 40 filters, as described previously18. At the same time, a BUSCO set for arthropod78 was used to evaluate transcriptome completeness. To estimate the proportion of reads coded by the mitochondrial genome and to identify mitochondrial-encoded genes, the software Bowtie279 was employed to map raw reads of N. viridula transcriptome generated to the N. viridula reference mitochondrial genome (GenBank accession: EF208087.1).

As another transcriptomic completeness metric, we used NCBI-BLASTX (-e 1.0E-05) and the non-redundant UNIPROT database (UNIPROTnr db) to evaluate the number of proteins we could identify in the transcriptome comparing with other organisms.

To estimate the proportion of the generated database that is homologous to R. prolixus (httpd://www.vectorbase.org/organisms/rhodnius-prolixus/cdc/rproc3;26), H. halys (https://i5k.nal.usda.gov/Halyomorpha_halys), and O. fasciatus (https://data.nal.usda.gov/dataset/oncopeltus-fasciatus-genome-assembly-10) predicted proteomes, we used NCBI-BLASTX (–e 1.0E-05). Putative 1:1 orthologue identification among these datasets was done using the BLAST RBH strategy as described80. The orthologues shared by three or more species were calculated using a bash script to simplify this operation (bash script is provided in Supp. Info. 6).


GO analysis

Only the transcripts shared among all the hemipterans analyzed above and the orthologues identified in phytophagous insects–O. fasciatus, N. viridula and H. halys–were annotated using Blast2GO (B2G) platform81. The annotation was performed using NCBI-BLASTx (e value < 1.0E-3) program inbuilt in the B2G program and InterProScan. Both BLASTx and InterPro searches were merged and mapped with gene ontology (GO) terms, and only the GO terms with an e value lower than 1.0E−6, annotation score lower than 55, and GO weight more than 5 were finally annotated. We searched for KEGG database using Blast2GO to distinguish the core pathways in which N. viridula presumed proteins are involved.


Neuropeptide precursor and GPCR gene identification

In order to identify GPCRs and neuropeptide precursors, we performed iterative tBLASTn searches in the N. viridula transcriptome using local BLAST82. As queries, we used a database including all the family A and Family B GPCRs known in R. prolixus and D. melanogaster (see Fig. 5) and a database including orthologues from R. prolixus, D. melanogaster, T. castaneum, B. mori and/or Plautia stali for all the insect neuropeptides. For H. halys neuropeptide identification, tBLASTn searches were performed online in the NCBI server (https://blast.ncbi.nlm.nih.gov/Blast.cgi), in H. halys whole genome sequence and Nr databases. Aminoacidic sequences of neuropeptide precursors from were used as queries. For the structural analysis of the neuropeptide precursors SignalP3 (identification of signal peptide)83 and the rules previously proposed for the prediction of convertase cleavage sites84 were used.

For GPCRs, the resulting aminoacidic sequences were used to perform an InterProScan85 search using the Gene3d, PfamA and SuperFamily applications to obtain the proteins of interest, which were further analyzed to confirm their identity.


Phylogenetic analysis

Phylogeny for NPA precursor and GPCR family was based on sequence alignments generated by CLUSTAL Ω86, using the software BEAST v1.8.350 in the CIPRES Science Gateway87. Beauti v1.8.350 was used to generate the BEAST input files. We used 30 million generations for each run, combined with LogCombiner v1.8.350 discarding the first 10% of each chain as a burn-in. The maximum clade credibility tree was generated using TreeAnnotator v1.8.388. The result was visualized with Figtree and Itol tools89.


Peptide extraction, liquid chromatography-tandem mass spectrometry and peptide identification

Brains from adult male and female N. viridula (N = 13) were dissected and immediately pooled in 100 μl cold methanol/water/acetic acid (90, 9, 1, v/v/v). Peptide extractions and desalting were performed as described previously19. The resulting peptides were separated by reverse phase nanoflow liquid chromatography (ProXeon EASY-nLC II, Bruker, Bremen, Germany) and spotted onto AnchorChip matrix-assisted laser desorption/ionization targets and analyzed on an UltrafleXtreme MALDI-TOF/TOF mass spectrometer (Bruker, Bremen, Germany). Tandem mass spectrometry spectra were processed using the software msconvert (http://proteowizard.sourceforge.net). For peptide identification we used Peaks Studio 7.0 (Bioinformatic Solutions, Waterloo, ON, Canada) with the same specification used previously31. Searches in a database containing the neuropeptide precursor sequences of R. prolixus, H. halys and N. viridula were performed.

Data Availability Statement

All the data presented are publically available. We provide the GeneBank accession number for each sequence. The raw sequence dataset is available at the NCBI-SRA SRR7184294.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

  1. 1.

    Leskey, T. et al. Pest status of the brown marmorated stink bug, Halyomorpha halys in the USA. Outlooks pest management 23, 218–226 (2012).

  2. 2.

    Fuentes, F. et al. In Actualizacion 2016 (ed. INTA) (2016).

  3. 3.

    Tohidi-Esfahani, D. et al. Isoforms of the heteropteran Nezara viridula ecdysone receptor: protein characterisation, RH5992 insecticide binding and homology modelling. Pest management science 67, 1457–1467, https://doi.org/10.1002/ps.2200 (2011).

  4. 4.

    Kavar, T., Pavlovcic, P., Susnik, S., Meglic, V. & Virant-Doberlet, M. Genetic differentiation of geographically separated populations of the southern green stink bug Nezara viridula (Hemiptera: Pentatomidae). Bulletin of entomological research 96, 117–128 (2006).

  5. 5.

    Marco, H. G., Anders, L. & Gade, G. cDNA cloning and transcript distribution of two novel members of the neuroparsin peptide family in a hemipteran insect (Nezara viridula) and a decapod crustacean (Jasus lalandii). Peptides 53, 97–105, https://doi.org/10.1016/j.peptides.2013.10.013 (2014).

  6. 6.

    Gamundi, J. & Sosa, M. In Chinches fitofagas en soja. Revision y avances en el estudio de su ecologia y manejo (eds Trumpeter, E. V. & Edelstein, J. D.) (INTA, 2007).

  7. 7.

    Perotti, E., Gamundi, J. & Russo, R. Control de Piezodorus guildinii (Westwood) en el cultivo de soja. Para mejorar la producción 45, 109–115 (2007).

  8. 8.

    Fabro, J. et al. Identification of a point mutation associated with pyrethroid resistance in the para-type sodium channel of Triatoma infestans, a vector of Chagas’ disease. Infection, genetics and evolution: journal of molecular epidemiology and evolutionary genetics in infectious diseases 12, 487–491, https://doi.org/10.1016/j.meegid.2011.12.006 (2012).

  9. 9.

    Capriotti, N., Mougabure-Cueto, G., Rivera-Pomar, R. & Ons, S. L925I mutation in the Para-type sodium channel is associated with pyrethroid resistance in Triatoma infestans from the Gran Chaco region. PLoS neglected tropical diseases 8, e2659, https://doi.org/10.1371/journal.pntd.0002659 (2014).

  10. 10.

    Dang, K., Doggett, S. L., Veera Singham, G. & Lee, C. Y. Insecticide resistance and resistance mechanisms in bed bugs, Cimex spp. (Hemiptera: Cimicidae). Parasites & vectors 10, 318, https://doi.org/10.1186/s13071-017-2232-3 (2017).

  11. 11.

    Picollo, M. I. et al. High resistance to pyrethroid insecticides associated with ineffective field treatments in Triatoma infestans (Hemiptera: Reduviidae) from Northern Argentina. Journal of medical entomology 42, 637–642 (2005).

  12. 12.

    Sierra, I., Capriotti, N., Fronza, G., Mougabure-Cueto, G. & Ons, S. Kdr mutations in Triatoma infestans from the Gran Chaco are distributed in two differentiated foci: Implications for pyrethroid resistance management. Acta tropica 158, 208–213, https://doi.org/10.1016/j.actatropica.2016.03.014 (2016).

  13. 13.

    Toledo, C., Anglada, M. & Salusso, A. Productos fitosanitarios utilizados en las últimas campañas agrícolaspara el control de plagas insectiles en soja. INTA EEA Parana. Actualizacion Tecnica SOJA. 34, 4 (2005).

  14. 14.

    Ons, S. Neuropeptides in the regulation of Rhodnius prolixus physiology. Journal of insect physiology 97, 77–92, https://doi.org/10.1016/j.jinsphys.2016.05.003 (2017).

  15. 15.

    Verlinden, H. et al. In Advances in Insects Physiology Vol. 46 (ed. Elsevier) (Elsevier, 2014).

  16. 16.

    Audsley, N. & Down, R. E. G protein coupled receptors as targets for next generation pesticides. Insect biochemistry and molecular biology 67, 27–37, https://doi.org/10.1016/j.ibmb.2015.07.014 (2015).

  17. 17.

    Meyer, J. M. et al. A “genome-to-lead” approach for insecticide discovery: pharmacological characterization and screening of Aedes aegypti D(1)-like dopamine receptors. PLoS neglected tropical diseases 6, e1478, https://doi.org/10.1371/journal.pntd.0001478 (2012).

  18. 18.

    Ons, S. et al. Identification of G protein coupled receptors for opsines and neurohormones in Rhodnius prolixus. Genomic and transcriptomic analysis. Insect biochemistry and molecular biology 69, 34–50, https://doi.org/10.1016/j.ibmb.2015.05.003 (2016).

  19. 19.

    Ons, S., Richter, F., Urlaub, H. & Pomar, R. R. The neuropeptidome of Rhodnius prolixus brain. Proteomics 9, 788–792, https://doi.org/10.1002/pmic.200800499 (2009).

  20. 20.

    Ons, S., Sterkel, M., Diambra, L., Urlaub, H. & Rivera-Pomar, R. Neuropeptide precursor gene discovery in the Chagas disease vector Rhodnius prolixus. Insect molecular biology 20, 29–44, https://doi.org/10.1111/j.1365-2583.2010.01050.x (2011).

  21. 21.

    Sterkel, M. et al. OKB, a novel family of brain-gut neuropeptides from insects. Insect biochemistry and molecular biology 42, 466–473, https://doi.org/10.1016/j.ibmb.2012.03.003 (2012).

  22. 22.

    Kodrik, D. et al. The adipokinetic hormones of Heteroptera: a comparative study. Physiological Entomology 35, 117–127 (2010).

  23. 23.

    Neupert, S. et al. Neuropeptides in Heteroptera: identification of allatotropin-related peptide and tachykinin-related peptides using MALDI-TOF mass spectrometry. Peptides 30, 483–488, https://doi.org/10.1016/j.peptides.2008.11.009 (2009).

  24. 24.

    Predel, R. et al. Comparative peptidomics of four related hemipteran species: pyrokinins, myosuppressin, corazonin, adipokinetic hormone, sNPF, and periviscerokinins. Peptides 29, 162–167, https://doi.org/10.1016/j.peptides.2007.08.034 (2008).

  25. 25.

    Ahn, S. J. & Choi, M. Y. Identification and characterization of capa and pyrokinin genes in the brown marmorated stink bug, Halyomorpha halys (Hemiptera): Gene structure, immunocytochemistry, and differential expression. Archives of insect biochemistry and physiology, e21500, https://doi.org/10.1002/arch.21500 (2018).

  26. 26.

    Mesquita, R. D. et al. Genome of Rhodnius prolixus, an insect vector of Chagas disease, reveals unique adaptations to hematophagy and parasite infection. Proceedings of the National Academy of Sciences of the United States of America 112, 14936–14941, https://doi.org/10.1073/pnas.1506226112 (2015).

  27. 27.

    Benoit, J. B. et al. Unique features of a global human ectoparasite identified through sequencing of the bed bug genome. Nature communications 7, 10165, https://doi.org/10.1038/ncomms10165 (2016).

  28. 28.

    Predel, R., Neupert, S., Derst, C., Reinhardt, K. & Wegener, C. Neuropeptidomics of the Bed Bug Cimex lectularius. Journal of proteome research 17, 440–454, https://doi.org/10.1021/acs.jproteome.7b00630 (2018).

  29. 29.

    Wulff, J. P., Capriotti, N. & Ons, S. Orcokinins regulate the expression of neuropeptide precursor genes related to ecdysis in the hemimetabolous insect Rhodnius prolixus. Journal of insect physiology 108, 31–39, https://doi.org/10.1016/j.jinsphys.2018.05.006 (2018).

  30. 30.

    Wulff, J. P. et al. Orcokinin neuropeptides regulate ecdysis in the hemimetabolous insect Rhodnius prolixus. Insect biochemistry and molecular biology 81, 91–102, https://doi.org/10.1016/j.ibmb.2017.01.003 (2017).

  31. 31.

    Traverso, L., Sierra, I., Sterkel, M., Francini, F. & Ons, S. Neuropeptidomics in Triatoma infestans. Comparative transcriptomic analysis among triatomines. Journal of physiology, Paris 110, 83–98, https://doi.org/10.1016/j.jphysparis.2016.12.005 (2016).

  32. 32.

    Veenstra, J. A. Allatostatins C double C and triple C, the result of a local gene triplication in an ancestral arthropod. General and comparative endocrinology 230-231, 153–157, https://doi.org/10.1016/j.ygcen.2016.04.013 (2016).

  33. 33.

    Tanaka, Y., Suetsugu, Y., Yamamoto, K., Noda, H. & Shinoda, T. Transcriptome analysis of neuropeptides and G-protein coupled receptors (GPCRs) for neuropeptides in the brown planthopper Nilaparvata lugens. Peptides 53, 125–133, https://doi.org/10.1016/j.peptides.2013.07.027 (2014).

  34. 34.

    Huybrechts, J. et al. Neuropeptide and neurohormone precursors in the pea aphid, Acyrthosiphon pisum. Insect molecular biology 19(Suppl 2), 87–95, https://doi.org/10.1111/j.1365-2583.2009.00951.x (2010).

  35. 35.

    Hou, L., Jiang, F., Yang, P., Wang, X. & Kang, L. Molecular characterization and expression profiles of neuropeptide precursors in the migratory locust. Insect biochemistry and molecular biology 63, 63–71, https://doi.org/10.1016/j.ibmb.2015.05.014 (2015).

  36. 36.

    Hummon, A. B. et al. From the genome to the proteome: uncovering peptides in the Apis brain. Science 314, 647–649, https://doi.org/10.1126/science.1124128 (2006).

  37. 37.

    Chang, J., Zhao, J. & Tian, X. In silico prediction of neuropeptides in Hymenoptera parasitoid wasps. PloS one 13, e0193561, https://doi.org/10.1371/journal.pone.0193561 (2018).

  38. 38.

    Zitnan, D. A. M.E. In Insect Endocrinology (ed. Gilbert, L. I.) 253–309 (Academic Press, 2012).

  39. 39.

    Roller, L. et al. The unique evolution of neuropeptide genes in the silkworm Bombyx mori. Insect biochemistry and molecular biology 38, 1147–1157 (2008).

  40. 40.

    Kondo, H., Ino, M., Suzuki, A., Ishizaki, H. & Iwami, M. Multiple gene copies for bombyxin, an insulin-related peptide of the silkmoth Bombyx mori: structural signs for gene rearrangement and duplication responsible for generation of multiple molecular forms of bombyxin. Journal of molecular biology 259, 926–937, https://doi.org/10.1006/jmbi.1996.0370 (1996).

  41. 41.

    Yoshida, I., Moto, K., Sakurai, S. & Iwami, M. A novel member of the bombyxin gene family: structure and expression of bombyxin G1 gene, an insulin-related peptide gene of the silkmoth Bombyx mori. Development genes and evolution 208, 407–410 (1998).

  42. 42.

    Veenstra, J. A. Similarities between decapod and insect neuropeptidomes. PeerJ 4, e2043, https://doi.org/10.7717/peerj.2043 (2016).

  43. 43.

    Nassel, D. R. & Vanden Broeck, J. Insulin/IGF signaling in Drosophila and other insects: factors that regulate production, release and post-release action of the insulin-like peptides. Cellular and molecular life sciences: CMLS 73, 271–290, https://doi.org/10.1007/s00018-015-2063-3 (2016).

  44. 44.

    Vogel, K. J., Brown, M. R. & Strand, M. R. Ovary ecdysteroidogenic hormone requires a receptor tyrosine kinase to activate egg formation in the mosquito Aedes aegypti. Proceedings of the National Academy of Sciences of the United States of America 112, 5057–5062, https://doi.org/10.1073/pnas.1501814112 (2015).

  45. 45.

    Nachman, R. J., Pietrantonio, P. V. & Coast, G. M. Toward the development of novel pest management agents based upon insect kinin neuropeptide analogues. Annals of the New York Academy of Sciences 1163, 251–261, https://doi.org/10.1111/j.1749-6632.2008.03633.x (2009).

  46. 46.

    Bhatt, G., da Silva, R., Nachman, R. J. & Orchard, I. The molecular characterization of the kinin transcript and the physiological effects of kinins in the blood-gorging insect, Rhodnius prolixus. Peptides 53, 148–158, https://doi.org/10.1016/j.peptides.2013.04.009 (2014).

  47. 47.

    Sterkel, M., Urlaub, H., Rivera-Pomar, R. & Ons, S. Functional proteomics of neuropeptidome dynamics during the feeding process of Rhodnius prolixus. Journal of proteome research 10, 3363–3371, https://doi.org/10.1021/pr2001012 (2011).

  48. 48.

    Leander, M. et al. Cardiac contractility structure-activity relationship and ligand-receptor interactions; the discovery of unique and novel molecular switches in myosuppressin signaling. PloS one 10, e0120492, https://doi.org/10.1371/journal.pone.0120492 (2015).

  49. 49.

    Sedra, L., Haddad, A. S. & Lange, A. B. Myoinhibitors controlling oviduct contraction within the female blood-gorging insect, Rhodnius prolixus. General and comparative endocrinology 211, 62–68, https://doi.org/10.1016/j.ygcen.2014.11.019 (2015).

  50. 50.

    Lee, D., Taufique, H., da Silva, R. & Lange, A. B. An unusual myosuppressin from the blood-feeding bug Rhodnius prolixus. The Journal of experimental biology 215, 2088–2095, https://doi.org/10.1242/jeb.067447 (2012).

  51. 51.

    Hauser, F. et al. Genomics and peptidomics of neuropeptides and protein hormones present in the parasitic wasp Nasonia vitripennis. Journal of proteome research 9, 5296–5310, https://doi.org/10.1021/pr100570j (2010).

  52. 52.

    Nassel, D. R. & Winther, A. M. Drosophila neuropeptides in regulation of physiology and behavior. Progress in neurobiology 92, 42–104, https://doi.org/10.1016/j.pneurobio.2010.04.010 (2010).

  53. 53.

    Zandawala, M., Haddad, A. S., Hamoudi, Z. & Orchard, I. Identification and characterization of the adipokinetic hormone/corazonin-related peptide signaling system in Rhodnius prolixus. The FEBS journal 282, 3603–3617, https://doi.org/10.1111/febs.13366 (2015).

  54. 54.

    Zandawala, M., Paluzzi, J. P. & Orchard, I. Isolation and characterization of the cDNA encoding DH(31) in the kissing bug, Rhodnius prolixus. Molecular and cellular endocrinology 331, 79–88, https://doi.org/10.1016/j.mce.2010.08.012 (2011).

  55. 55.

    Paluzzi, J. P. & Orchard, I. A second gene encodes the anti-diuretic hormone in the insect, Rhodnius prolixus. Molecular and cellular endocrinology 317, 53–63, https://doi.org/10.1016/j.mce.2009.11.004 (2010).

  56. 56.

    Hansen, K. K., Hauser, F., Williamson, M., Weber, S. B. & Grimmelikhuijzen, C. J. The Drosophila genes CG14593 and CG30106 code for G-protein-coupled receptors specifically activated by the neuropeptides CCHamide-1 and CCHamide-2. Biochemical and biophysical research communications 404, 184–189, https://doi.org/10.1016/j.bbrc.2010.11.089 (2011).

  57. 57.

    Li, B. et al. Genomics, transcriptomics, and peptidomics of neuropeptides and protein hormones in the red flour beetle Tribolium castaneum. Genome research 18, 113–122, https://doi.org/10.1101/gr.6714008 (2008).

  58. 58.

    Te Brugge, V., Paluzzi, J. P., Schooley, D. A. & Orchard, I. Identification of the elusive peptidergic diuretic hormone in the blood-feeding bug Rhodnius prolixus: a CRF-related peptide. The Journal of experimental biology 214, 371–381, https://doi.org/10.1242/jeb.046292 (2011).

  59. 59.

    Jiang, H. et al. Natalisin, a tachykinin-like signaling system, regulates sexual activity and fecundity in insects. Proceedings of the National Academy of Sciences of the United States of America 110, E3526–3534, https://doi.org/10.1073/pnas.1310676110 (2013).

  60. 60.

    Barbera, M. & Martinez-Torres, D. Identification of the prothoracicotropic hormone (Ptth) coding gene and localization of its site of expression in the pea aphid Acyrthosiphon pisum. Insect molecular biology 26, 654–664, https://doi.org/10.1111/imb.12326 (2017).

  61. 61.

    Hewes, R. S. & Taghert, P. H. Neuropeptides and neuropeptide receptors in the Drosophila melanogaster genome. Genome research 11, 1126–1142, https://doi.org/10.1101/gr.169901 (2001).

  62. 62.

    Hauser, F., Cazzamali, G., Williamson, M., Blenau, W. & Grimmelikhuijzen, C. J. A review of neurohormone GPCRs present in the fruitfly Drosophila melanogaster and the honey bee Apis mellifera. Progress in neurobiology 80, 1–19, https://doi.org/10.1016/j.pneurobio.2006.07.005 (2006).

  63. 63.

    Hauser, F., Williamson, M., Cazzamali, G. & Grimmelikhuijzen, C. J. Identifying neuropeptide and protein hormone receptors in Drosophila melanogaster by exploiting genomic data. Briefings in functional genomics & proteomics 4, 321–330, https://doi.org/10.1093/bfgp/eli003 (2006).

  64. 64.

    Zandawala, M., Hamoudi, Z., Lange, A. B. & Orchard, I. Adipokinetic hormone signalling system in the Chagas disease vector, Rhodnius prolixus. Insect molecular biology 24, 264–276, https://doi.org/10.1111/imb.12157 (2015).

  65. 65.

    Zandawala, M. & Orchard, I. Identification and functional characterization of FGLamide-related allatostatin receptor in Rhodnius prolixus. Insect biochemistry and molecular biology 57, 1–10, https://doi.org/10.1016/j.ibmb.2014.12.001 (2015).

  66. 66.

    Zandawala, M., Li, S., Hauser, F., Grimmelikhuijzen, C. J. & Orchard, I. Isolation and functional characterization of calcitonin-like diuretic hormone receptors in Rhodnius prolixus. PloS one 8, e82466, https://doi.org/10.1371/journal.pone.0082466 (2013).

  67. 67.

    Jung, S. H. et al. Identification of a novel insect neuropeptide, CNMa and its receptor. FEBS letters 588, 2037–2041, https://doi.org/10.1016/j.febslet.2014.04.028 (2014).

  68. 68.

    Paluzzi, J. P., Haddad, A. S., Sedra, L., Orchard, I. & Lange, A. B. Functional characterization and expression analysis of the myoinhibiting peptide receptor in the Chagas disease vector, Rhodnius prolixus. Molecular and cellular endocrinology 399, 143–153, https://doi.org/10.1016/j.mce.2014.09.004 (2015).

  69. 69.

    Hamoudi, Z., Lange, A. B. & Orchard, I. Identification and Characterization of the Corazonin Receptor and Possible Physiological Roles of the Corazonin-Signaling Pathway in Rhodnius prolixus. Frontiers in neuroscience 10, 357, https://doi.org/10.3389/fnins.2016.00357 (2016).

  70. 70.

    Lee, D., Vanden Broeck, J. & Lange, A. B. Identification and expression of the CCAP receptor in the Chagas’ disease vector, Rhodnius prolixus, and its involvement in cardiac control. PloS one 8, e68897, https://doi.org/10.1371/journal.pone.0068897 (2013).

  71. 71.

    Paluzzi, J. P., Park, Y., Nachman, R. J. & Orchard, I. Isolation, expression analysis, and functional characterization of the first antidiuretic hormone receptor in insects. Proceedings of the National Academy of Sciences of the United States of America 107, 10290–10295, https://doi.org/10.1073/pnas.1003666107 (2010).

  72. 72.

    Paluzzi, J. P. & O’Donnell, M. J. Identification, spatial expression analysis and functional characterization of a pyrokinin-1 receptor in the Chagas’ disease vector, Rhodnius prolixus. Molecular and cellular endocrinology 363, 36–45, https://doi.org/10.1016/j.mce.2012.07.007 (2012).

  73. 73.

    Collin, C. et al. Identification of the Drosophila and Tribolium receptors for the recently discovered insect RYamide neuropeptides. Biochemical and biophysical research communications 412, 578–583, https://doi.org/10.1016/j.bbrc.2011.07.131 (2011).

  74. 74.

    Lee, D., James, T. E. & Lange, A. B. Identification, characterization and expression of a receptor for the unusual myosuppressin in the blood-feeding bug, Rhodnius prolixus. Insect molecular biology 24, 129–137, https://doi.org/10.1111/imb.12141 (2015).

  75. 75.

    Lee, H. R., Zandawala, M., Lange, A. B. & Orchard, I. Isolation and characterization of the corticotropin-releasing factor-related diuretic hormone receptor in Rhodnius prolixus. Cellular signalling 28, 1152–1162, https://doi.org/10.1016/j.cellsig.2016.05.020 (2016).

  76. 76.

    Grabherr, M. G. et al. Full-length transcriptome assembly from RNA-Seq data without a reference genome. Nature biotechnology 29, 644–652, https://doi.org/10.1038/nbt.1883 (2011).

  77. 77.

    Parra, G., Bradnam, K. & Korf, I. CEGMA: a pipeline to accurately annotate core genes in eukaryotic genomes. Bioinformatics 23, 1061–1067, https://doi.org/10.1093/bioinformatics/btm071 (2007).

  78. 78.

    Simao, F. A., Waterhouse, R. M., Ioannidis, P., Kriventseva, E. V. & Zdobnov, E. M. BUSCO: assessing genome assembly and annotation completeness with single-copy orthologs. Bioinformatics 31, 3210–3212, https://doi.org/10.1093/bioinformatics/btv351 (2015).

  79. 79.

    Langmead, B. & Salzberg, S. L. Fast gapped-read alignment with Bowtie 2. Nature methods 9, 357–359, https://doi.org/10.1038/nmeth.1923 (2012).

  80. 80.

    Martinez-Barnetche, J. et al. Adaptations in energy metabolism and gene family expansions revealed by comparative transcriptomics of three Chagas disease triatomine vectors. BMC genomics 19, 296, https://doi.org/10.1186/s12864-018-4696-8 (2018).

  81. 81.

    Conesa, A. et al. Blast2GO: a universal tool for annotation, visualization and analysis in functional genomics research. Bioinformatics 21, 3674–3676, https://doi.org/10.1093/bioinformatics/bti610 (2005).

  82. 82.

    Altschul, S. F., Gish, W., Miller, W., Myers, E. W. & Lipman, D. J. Basic local alignment search tool. Journal of molecular biology 215, 403–410, https://doi.org/10.1016/S0022-2836(05)80360-2 (1990).

  83. 83.

    Bendtsen, J. D., Nielsen, H., von Heijne, G. & Brunak, S. Improved prediction of signal peptides: SignalP 3.0. Journal of molecular biology 340, 783–795, https://doi.org/10.1016/j.jmb.2004.05.028 (2004).

  84. 84.

    Veenstra, J. A. Mono- and dibasic proteolytic cleavage sites in insect neuroendocrine peptide precursors. Archives of insect biochemistry and physiology 43, 49–63, https://doi.org/10.1002/(SICI)1520-6327(200002)43:2<49::AID-ARCH1>3.0.CO;2-M (2000).

  85. 85.

    Jones, P. et al. InterProScan 5: genome-scale protein function classification. Bioinformatics 30, 1236–1240, https://doi.org/10.1093/bioinformatics/btu031 (2014).

  86. 86.

    Sievers, F. & Higgins, D. G. Clustal Omega, accurate alignment of very large numbers of sequences. Methods Mol Biol 1079, 105–116, https://doi.org/10.1007/978-1-62703-646-7_6 (2014).

  87. 87.

    Miller, M. A. et al. A RESTful API for Access to Phylogenetic Tools via the CIPRES Science Gateway. Evolutionary bioinformatics online 11, 43–48, https://doi.org/10.4137/EBO.S21501 (2015).

  88. 88.

    Drummond, A. J., Suchard, M. A., Xie, D. & Rambaut, A. Bayesian phylogenetics with BEAUti and the BEAST 1.7. Molecular biology and evolution 29, 1969–1973, https://doi.org/10.1093/molbev/mss075 (2012).

  89. 89.

    Letunic, I. & Bork, P. Interactive tree of life (iTOL)v3: an online tool for the display and annotation of phylogenetic and other trees. Nucleic acids research 44, W242–245, https://doi.org/10.1093/nar/gkw290 (2016).

Download references

Acknowledgements

This work was supported by grants from the Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET Grant PIP 2015 076 to S.O), Agencia Nacional de Ciencia y Tecnología (ANPCyT Grant PICT2015 0468 to S.O.). R.R.-P. thanks Dr. B. Schmidt (Dept. Biochemistry, University of Göttingen) for generously allowing the use of his LC-MS system. S.O., A.L, M.I.C and R.R.P are investigators of CONICET. N.E.B is researcher of Universidad Nacional de La Plata and associate researcher of Comisión de Investigaciones Científicas de la Provincia de Buenos Aires (CICBA). V.P is recipient of research fellowships from ANPCyT. L.P.G was recipient of a fellowship from Consejo Interuniversitario Nacional.

Author information

Affiliations

  1. Centro de Bioinvestigaciones, Universidad Nacional del Noroeste de Buenos Aires, Pergamino, Argentina

    • Andrés Lavore
    • , Lucila Perez-Gianmarco
    • , Victorio Palacio
    • , Maria Ines Catalano
    •  & Rolando Rivera-Pomar
  2. Centro Regional de Estudios Genomicos, Facultad de Ciencias Exactas, Universidad Nacional de La Plata, Buenos Aires, Argentina

    • Natalia Esponda-Behrens
    • , Rolando Rivera-Pomar
    •  & Sheila Ons

Authors

  1. Search for Andrés Lavore in:

  2. Search for Lucila Perez-Gianmarco in:

  3. Search for Natalia Esponda-Behrens in:

  4. Search for Victorio Palacio in:

  5. Search for Maria Ines Catalano in:

  6. Search for Rolando Rivera-Pomar in:

  7. Search for Sheila Ons in:

Contributions

Andres Lavore: Sample preparation; Data curation; Bioinformatic analysis; Data visualization; Writing of the manuscript Natalia Irene Esponda-Behrens: Data curation; Bioinformatic analysis; Data visualization. Lucila Perez-Gianmarco: Sample preparation; Data curation; Bioinformatic analysis Victorio Palacio: Data curation; Bioinformatic analysis Maria Ines Catalano: Conceptualization; Funding acquisition; Supervision Rolando Rivera-Pomar: Sample processing; Mass spectrometry analysis Sheila Ons: Conceptualization; Funding acquisition; Data curation; Bioinformatic analysis; Supervision; Writing of the manuscript.

Competing Interests

The authors declare no competing interests.

Corresponding author

Correspondence to Sheila Ons.

Electronic supplementary material

About this article

Publication history

Received

Accepted

Published

DOI

https://doi.org/10.1038/s41598-018-35386-4

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.