Introduction

Sensory genes play a key role in the life of parasitoids, such as foraging, oviposition site selection, and mating partners1. There are two major chemosensory mechanisms through olfaction and taste in which chemical signals are detected by one of the large multigene families that encode chemosensory proteins (CSPs), gustatory receptors (GRs), ionotropic receptors (IRs), odorant-binding proteins (OBPs), sensory receptors (ORs) and sensory neuron membrane proteins (SNMPs)2. The function of CSPs and OBPs is the first step in the recognition of chemical stimuli from the outside environment3. Chemoreceptors (such as GRs, IRs and ORs) are involved in the recognition and identification of various chemical signals and environmental odors to modulate chemical perception4. SNMPs are involved in cell signal transduction4.

Some sensory genes of parasitoids in Hymenoptera have been identified, including Bethylidae, e.g., Sclerodermus sp.2; Braconidae, e.g., Cotesia vestalis Haliday5, Cot. chilonis Matsumura6, Microplitis demolitor Wilkinson7, M. mediator Haliday8, Microcentrus cingulum Brischke9, Aphidius gifuensis Ashmead10, Ap. ervi Haliday11 and Meteorus pulchricornis Wesmael12; Encyrtidae, e.g., Anastatus japonicus Ashmead13 and Aenasius bambawalei Hayat14; Eupelmidae, e.g., Copidosoma floridanum Ashmead15; Ichneumonidae, e.g., Campoletis chlorideae Uchida1; Trichogrammatidae, e.g., Trichogramma dendrolimi Matsumura16 and Tric. japonicun Ashmead3; Eulophidae, e.g., Asecodes hispinarum Boucek and Chouioia cunea Yang4,17. Previous studies have revealed that the sensory genes of parasitoids are involved in searching and locating wood-boring pests and leaf-mining pests18,19. For example, Scleroderma sichuanensis Xiao can accurately find the location of their hidden host Monochamus alternatue Hope and then parasitize them. SsicOBP1 and SsicOBP2 are the basis for the behavior of the odor, which have shown a strong reaction with (−)-α-pinene, (+)-β-pinene, camphene, and (+)-3-carene20. However, the sensory genes of parasitoids used to locate gall-making pests have not yet been solved, which has aroused great interest. Understanding this information can provide potential molecular targets for research based on reverse chemical ecology.

Quadrastichus mendeli Kim & La Salle (Hymenoptera: Eulophidae) is one of the most important larval ectoparasitoids of Leptocybe invasa (Hymenoptera: Eulophidae), which is a global gall-making pest in eucalyptus plantations21,22. Q. mendeli is a uniparental parasitoid, and no males have been found23,24. Female Q. mendeli prefers to parasitize young and mature larvae of L. invasa with a percentage of parasitism of 84.20 ± 11.4023. Q. mendeli has now been successfully established and effectively controls L. invasa populations in fields in Australia, China, Cambodia, India, Israel, Italy, Kenya, Laos, South Africa, Thailand and Vietnam25,26,27,28,29. Previous studies revealed that sensory genes of parasitoids play important roles in locating hosts8. Therefore, Q. mendeliL. invasa is an ideal system to study the way that parasitoids use sensory genes in gall-making pests.

High-throughput sequencing is massively parallel, high throughput DNA sequencing, which is rapidly changing methodologies of molecular genetic studies. Through high-throughput sequencing, the view for the impact of gene translation can be expand efficiency, and more unknown proteins of parasitoids were found, such as Sclerodermus sp.2 and A. bambawalei14. In this study, we performed high-throughput sequencing of the transcriptome and identified members of the major sensory genes that are crucial for Q. mendeli to locate L. invasa. Comparative analysis of the sensory genes in Q. mendeli with those in other species was also examined, and it provided valuable information regarding the molecular basis of Q. mendeli perception.

Materials and methods

Insects

Branches of saplings damaged by L. invasa were collected from Guangxi University (108°29′ E, 22°85′ N), Nanning City, Guangxi Zhuang Autonomous Region, in October 2018. Specimens were placed in a glass container filled with water to retain freshness and transferred to a sealed net cage (length × width × height = 40 cm × 40 cm × 80 cm) with 70–80% relative humidity and a natural light photoperiod maintained at 27 ± 1 °C. The water in the glass container was replaced daily until the end of the emergence of Q. mendeli. The emerged Q. mendeli were collected daily using 50-mL plastic tubes. One day later, the tubes were immediately placed into liquid nitrogen and stored at − 80 °C. Six groups of female Q. mendeli adults (a group of twenty) were prepared for RNA extraction.

RNA sequencing

A NanoPhotometer spectrophotometer (Thermo Fisher Scientific, Massachusetts, USA) and the Nano6000 Assay Kit for the Agilent Bioanalyzer 2000 system (Agilent Technologies, California, USA) were applied to check the purity and integrity of the total RNA, respectively. After total RNA extraction, magnetic beads with Oligo dT (Thermo Fisher Scientific, Hampton, USA) were used to enrich mRNA, and then, a fragmentation buffer was added to make it a short fragment. The fragments were sequenced on an Illumina HiSeq4000 (Illumina, California, USA).

Transcriptome data analysis

Reads obtained from the sequencing machines included dirty reads containing adapters or low-quality bases, which affected the subsequent assembly and analysis. De novo transcriptome assembly was carried out with the short read assembly program Trinity v3.030. Basic annotation of unigenes includes protein functional annotation, pathway annotation, COG/KOG functional annotation and Gene Ontology (GO) annotation. To annotate the unigenes, we used the BLASTx program (http://www.ncbi.nlm.nih.gov/BLAST/) with an E-value threshold of 1e−5 for the NCBI nonredundant protein (Nr) database (http://www.ncbi.nlm.nih.gov), the Swiss-Prot protein database (http://www.expasy.ch/sprot), the Kyoto Encyclopedia of Genes and Genomes (KEGG) database (http://www.genome.jp/kegg)31,32, and the COG/KOG database (http://www.ncbi.nlm.nih.gov/COG). Protein functional annotations could then be obtained according to the best alignment results33.

Sequence alignment and phylogenetic analysis

TransMembrane prediction using Hidden Markov Models 2.0 (http://www.cbs.dtu.dk/services/TMHMM)34. The signal peptides were predicted using SignalP 4.1 (http://www.cbs.dtu.dk/services/SignalP/)35. Amino acid sequence alignment was performed using the ClustalW method implemented in Mega v7.036. The Q. mendeli CSP, GR, IR, OR, OBP and SNMP nucleotide sequences were used as queries (BLASTx) in the GenBank database, and sequences from different insect species (i.e., Apis mellifera Ligustica, Bombyx mori Silk, C. floridanum, D. melanogaster, M. mediator, N. vitripennis, Solenopsis invicta Buren, Tribolium castaneum Herbst and Tric. Pretiosum Riley) and their amino acids were retrieved in the GenBank database and used to construct a phylogenetic tree. A. mellifera, B. mori, D. melanogaster and Trib. castaneum are model insects; S. invicta is a kind of social insect; C. floridanum, M. mediator and Tric. pretiosum were parasitoid from Hymenoptera. Amino acid sequences were aligned using the Muscle method implemented in Mega v7.037. The resulting alignment was manually curated to remove gap-rich regions. Maximum-likelihood trees (for CSP, GR, IR, OR, OBP and SNMP) were constructed using IQ-TREE with the best-fitting substitution model38. Subsequently, trees were viewed and graphically edited in FigTree v1.4.339 and Adobe Illustrator CS6. Branch support was assessed using the bootstrap method based on 1000 replicates.

Results

Transcriptome assembly and annotation

To obtain high-quality clean reads, raw reads with adapters, low quality, and an N content greater than 10% were removed. The number of clean reads in female adults of Q. mendeli ranged from 18,733,252 to 25,259,462, and the sample GC content ranged from 45.06 to 53.01% (Table 1). At the same time, Q20 and Q30 ranged from 92.75 to 94.36% and 88.45 to 90.08% respectively (Table 1). In total, 31,820 transcripts were obtained and assembled into 26,925 unigenes (Additional file 1). A total of 42.10% of unigenes had a length greater than 2000 bp and an average length of 1369 bp (Table 2). The NR database (15,543, 57.73%) had the largest match. In general, the sequences had E-values between 0 and 1E−150 (Table 3). The e-value is an indication of the degree of similarity between the initial sequence used for searches and the sequence retrieved, and the higher the score, the greater the degree of similarity between them. The transcripts of Q. mendeli were most similar to the sequences of Nasonia vitripennis Walker (26.82%), followed by the sequences of Ceratosolen solmsi marchali Mayr (7.49%), Cop. floridanum (3.77%), Tric. pretiosum (2.92%), and other species (44.27%) (Additional file 2). SwissProt (11,644, 43.25%) and KOG (10,924, 40.57%) shared similar quantities; KO (7524, 27.94%) showed the least match (Table 3).

Table 1 Sequencing summary of the Quadrastichus mendeli transcriptome.
Table 2 Number and length of transcripts and unigenes.
Table 3 Unigenes annotated in different databases.

In total, 14,735 were annotated into 52 subcategories belonging to three main GO categories: a ‘biological process’, ‘cellular component’ and ‘molecular function’ (Fig. 1a). There were 22 subcategories in the ‘biological process’, 18 subcategories in the ‘cellular component’, and 12 subcategories in the ‘molecular function’. The top ten subcategories were ‘catalytic activity’ (1580), ‘metabolic process’ (1552), ‘binding’ (1552), ‘cellular process’ (1539), ‘single-organism process’ (1312), ‘cell’ (932), ‘cell part’ (932), ‘membrane’ (639), ‘biological regulation’ (615) and ‘organelle’ (576) (Additional file 3). By KOG classifications, 4689 unigenes were classified functionally into 25 categories (Fig. 1b). The cluster of ‘general fractional prediction only’ was the largest group, which had 4855 unigenes. The ‘signal transduction mechanisms’ group was second with 3998 unigenes. The top 2 categories had 36.64% unigenes annotated to the KOG database (Additional file 4). In total, 5160 unigenes were functionally classified into 5 KEGG categories (Fig. 1c). They were ‘cellular processes’ (587 unigenes, 7.66% of the unigenes annotated to the KEGG database), ‘environmental information processing’ (739, 9.64%), ‘genetic information processing’ (1612, 21.03%), ‘metabolism’ (4504, 58.77%) and ‘organismal systems’ (222, 2.90%) (Additional file 4). Among the 31 subcategories, ‘Global and Overview’ (2123, 27.70%), ‘translation’ (664, 8.66%) and ‘Signal transduction’ (599, 7.82%) were the top 3 (Additional file 5).

Figure 1
figure 1

Distribution of transcriptome contigs from Quadrastichus mendeli adults. (a) Gene Ontology (GO) analysis. (b) Eukaryotic Orthologous Groups of protein (KOG) classification. (c) Kyoto Encyclopedia of Genes and Genomes (KEGG) classification. (n = 15,623).

Identification of candidate chemosensory genes

In this study, 3 putative unigenes encoding CSPs were identified, named QM_comp07737, QM_comp08732 and QM_comp26540 (Additional file 6). The lengths of these unigenes were 509 bp, 608 bp and 280 bp, respectively (Additional file 7). Among these unigenes, QM_comp07737 and QM_comp26540 were incomplete due to a lack of a 5′ or 3′ terminus (Additional file 7). QM_comp08732 sequences were full-length putative CSP genes because they had complete ORFs and 4 cysteines, which are characteristic of typical insect CSPs. QM_comp08732 with a molecular weight of 17 kDa had a signal peptide sequence of approximately 22 amino acids at the N-terminus (Additional file 8). Through a homology search with known proteins, the results showed that 73% of QM_comp26540 was orthologs of the proteins in Tenebrio molitor L., and the orthologs of other CSP sequences were also above 60% (Additional file 8). A phylogenetic tree based on the maximum likelihood method was constructed used the 3 CSP sequences of Q. mendeli along with 67 CSP sequences from 6 other species (i.e., A. mellifera, B. mori, D. melanogaster, M. mediator, S. invicta and Trib. castaneum) (Fig. 2 and Additional file 9). The phylogenetic tree showed that QM_comp26540 shares a high homology and is closely clustered with MmedCSP1, which has been functionally characterized, and QM_comp07737 and QM_comp08732 did not branch clusters with any other insects; they may be specific CSPs of Q. mendeli (Fig. 2).

Figure 2
figure 2

Phylogenetic tree of chemosensory proteins (CSPs) from Quadrastichus mendeli and other insects based on the maximum likelihood method. Included are CSPs from Apis mellifera (Amel), Bombyx mori (Bmor), Drosophila melanogaster (Dmel), Microplitis mediator (Mmed), Quadrastichus mendeli (Qmen), and Tribolium castaneum (Tcas). The specific clades are marked. Node support was assessed with 1000 bootstrap replicates.

Identification of candidate gustatory receptors

Ten candidate GR proteins were identified from the data sets (Additional file 6). Among these unigenes, QM_comp03300, QM_comp23544, QM_comp24536 and QM_comp26507 were incomplete due to the lack of a 5′ or 3′ terminus (Additional file 7). QM_comp00164, QM_comp03333, QM_comp11847, QM_comp15910, QM_comp22611 and QM_comp22814 sequences were full-length putative GR genes because they had complete ORFs. These unigenes had molecular weights that ranged between 4 and 56 kDa and had a signal peptide sequence that ranged between 15 and 41 amino acids at the N-terminus (Additional file 8). Through a homology search with known proteins, the results showed that 79% of QM_comp15910 were orthologs of the proteins in Trichomalopsis sarcophagae Gahan. A phylogenetic tree based on the maximum likelihood method was constructed used the 10 GR sequences of Q. mendeli along with 191 GR sequences from 9 other species (i.e., A. mellifera, B. mori, C. floridanum, D. melanogaster, M. mediator, N. vitripennis, S. invict, Trib. castaneum and Tric. pretiosum) (Fig. 3 and Additional file 9). The phylogenetic tree showed that most GRs of Q. mendeli shared high homology and closely clustered with the proteins in B. mori (Fig. 3).

Figure 3
figure 3

Phylogenetic tree of gustatory receptors (GRs) from Quadrastichus mendeli and other insects based on the maximum likelihood method. Included are OBPs from Apis mellifera (Amel), Bombyx mori (Bmor), Copidosoma floridanum (Cflo), Drosophila melanogaster (Dmel), Microplitis mediator (Mmed), Nasonia vitripennis (Nvit), Quadrastichus mendeli (Qmen), Tribolium castaneum (Tcas), and Trichogramma pretiosum (Tpre). The specific clades are marked. Node support was assessed with 1000 bootstrap replicates.

Identification of candidate ionotropic receptors

Twenty-one candidate IR proteins were identified from the data sets (Additional file 6). Among these unigenes, 13 unigenes were incomplete due to the lack of a 5′ or 3′ terminus (Additional file 7). Eight unigenes were full-length putative IR genes because they had complete ORFs. These unigenes had molecular weights that ranged between 5 and 12 kDa (Additional file 8). Through a homology search with known proteins, the results showed that 100% of the IRs in Q. mendeli were orthologs of the proteins in Asbolus verrucosus LeConte, and the orthologs of other IR sequences were also above 37% (Additional file 8). A phylogenetic tree based on the maximum likelihood method was constructed used the 21 IR sequences of Q. mendeli along with 131 IR sequences from 9 other species (i.e., A. mellifera, B. mori, C. floridanum, D. melanogaster, M. mediator, N. vitripennis, S. invict, Trib. castaneum and Tric. pretiosum) (Fig. 4 and Additional file 9). The 21 IRs of Q. mendeli along with 131 IRs from 9 other species (i.e., A. mellifera, B. mori, C. floridanum, D. melanogaster, M. mediator, N. vitripennis, S. invict, Trib. castaneum and Tric. pretiosum) were chosen to construct a phylogenetic tree based on the amino acid sequences (Additional file 9). The phylogenetic tree showed that all candidate IR proteins were clustered with at least one Hymenoptera ortholog (Fig. 4).

Figure 4
figure 4

Phylogenetic tree of ionotropic receptors (IRs) from Quadrastichus mendeli and other insects based on the maximum likelihood method. Included are IRs from Apis mellifera (Amel), Bombyx mori (Bmor), Copidosoma floridanum (Cflo), Drosophila melanogaster (Dmel), Microplitis mediator (Mmed), Nasonia vitripennis (Nvit), Quadrastichus mendeli (Qmen), Tribolium castaneum (Tcas), and Trichogramma pretiosum (Tpre). The specific clades are marked. Node support was assessed with 1000 bootstrap replicates.

Identification of candidate odorant binding proteins

Fifty-six candidate OBP proteins were identified from the data sets (Table 4; Additional file 6). Among these unigenes, 10 unigenes were incomplete due to the lack of a 5′ or 3′ terminus (Additional file 7). Forty-six unigenes were full-length putative OBP genes because they had complete ORFs. These unigenes had molecular weights ranging between 10 and 17 kDa and had a signal peptide sequence ranging between 16 and 23 amino acids at the N-terminus (Additional file 8). Insect OBPs can be classified into classical OBPs (six-cysteine conserved signature) and Minus-C (missing C2 and C5) and Plus-C (carries an additional conserved cysteine located between C1 and C2 and after C6). QM_comp02388, QM_comp07285, QM_comp08846, QM_comp10855, QM_comp21133, QM_comp21238 and QM_comp24139 had four conserved cysteines, which were minus-COBPs. Other OBPs had six conserved cysteines, which are classic OBPs. Plus-COBPs were not found in hymenopteran species. Through a homology search with known proteins, the results showed that 88% of QM_comp19239 were orthologs of the proteins in N. vitripennis, and the orthologs of other OBP sequences were also above 42% (Additional file 8). A phylogenetic tree based on the maximum likelihood method was constructed used the 56 OBP sequences of Q. mendeli along with 209 OBP sequences from 9 other species (i.e., A. mellifera, B. mori, C. floridanum, D. melanogaster, M. mediator, N. vitripennis, S. invict, Trib. castaneum and Tric. pretiosum) (Fig. 5 and Additional file 9). The phylogenetic tree showed that all candidate OBP proteins were clustered with at least one Hymenoptera ortholog (Fig. 5).

Table 4 Detailed information on the OBP ungenes of Quadrastichus mendeli.
Figure 5
figure 5

Phylogenetic tree of odorant binding proteins (OBPs) from Quadrastichus mendeli and other insects based on the maximum likelihood method. Included are OBPs from Apis mellifera (Amel), Bombyx mori (Bmor), Copidosoma floridanum (Cflo), Drosophila melanogaster (Dmel), Microplitis mediator (Mmed), Nasonia vitripennis (Nvit), Quadrastichus mendeli (Qmen), Tribolium castaneum (Tcas), and Trichogramma pretiosum (Tpre). The specific clades are marked. Node support was assessed with 1000 bootstrap replicates.

Identification of candidate odorant receptors

Thirty candidate OR proteins were identified from the data sets (Additional file 6). Among these unigenes, 21 unigenes were incomplete due to the lack of a 5′ or 3′ terminus (Additional file 7). Nine unigenes were full-length putative OR genes because they had complete ORFs. These unigenes had molecular weights ranging between 4 and 53 kDa and had a signal peptide sequence ranging between 16 and 23 amino acids at the N-terminus (Additional file 8). Through a homology search with known proteins, the results showed that 95% of QM_comp14333 were orthologs of the proteins in C. cunea, and the orthologs of other OR sequences were also above 32% (Additional file 8). A phylogenetic tree based on the maximum likelihood method was constructed used the 30 OR sequences of Q. mendeli along with 235 OR sequences from 9 other species (i.e., A. mellifera, B. mori, C. floridanum, D. melanogaster, M. mediator, N. vitripennis, S. invict, Trib. castaneum and Tric. pretiosum) (Fig. 6 and Additional file 9). The phylogenetic tree showed that all candidate OR proteins were clustered with at least one Hymenoptera ortholog (Fig. 6).

Figure 6
figure 6

Phylogenetic tree of odorant receptors (ORs) from Quadrastichus mendeli and other insects based on the maximum likelihood method. Included are ORs from Apis mellifera (Amel), Bombyx mori (Bmor), Copidosoma floridanum (Cflo), Drosophila melanogaster (Dmel), Microplitis mediator (Mmed), Nasonia vitripennis (Nvit), Quadrastichus mendeli (Qmen), Tribolium castaneum (Tcas), and Trichogramma pretiosum (Tpre). The specific clades are marked. Node support was assessed with 1000 bootstrap replicates.

Identification of candidate sensory neuron membrane receptors

Two candidate SNMP proteins were identified from the data sets (Additional file 6). QM_comp21591 was incomplete due to the lack of a 3′ terminus (Additional file 7). QM_comp09081 was a full-length putative SNMP gene because it had complete ORFs. QM_comp09081 had a signal peptide sequence of 13 amino acids at the N-terminus, and the molecular weight of QM_comp09081 was 59 kDa (Additional file 8). According to sequence similarity, SNMP is divided into two SNMP subtypes, SNMP1 and SNMP2. Through a homology search with known proteins, the results showed that 67% of QM_comp09081 were orthologs of the proteins in N. vitripennis (Additional file 8). A phylogenetic tree based on the maximum likelihood method was constructed used the 2 SNMP sequences along with 25 SNMP sequences from 9 other species (i.e., A. mellifera, B. mori, C. floridanum, D. melanogaster, M. mediator, N. vitripennis, S. invict, Trib. castaneum and Tric. pretiosum) (Fig. 7 and Additional file 9). The phylogenetic tree showed that QM_comp21591 fell into the same clade as the insect SNMP1 group, and QM_comp09081 fell into the same clade as the insect SNMP2 group (Fig. 7).

Figure 7
figure 7

Phylogenetic tree of sensory neuron membrane proteins (SNMPs) from Quadrastichus mendeli and other insects based on the maximum likelihood method. Included are SNMPs from Apis mellifera (Amel), Bombyx mori (Bmor), Copidosoma floridanum (Cflo), Drosophila melanogaster (Dmel), Microplitis mediator (Mmed), Nasonia vitripennis (Nvit), Quadrastichus mendeli (Qmen), Tribolium castaneum (Tcas), and Trichogramma pretiosum (Tpre). The specific clades are marked. Node support was assessed with 1000 bootstrap replicates.

Discussion

In this study, the major sensory genes (i.e., CSPs, GRs, IRs, OBPs, ORs and SNMPs), which perhaps regulate Q. mendeli to locate its host L. invasa, are first reported, providing valuable information for exploring how parasitoids use sensory genes to locate gall-making pests.

CSPs are widespread in the antenna and other chemical sensory organs of insects40 and are involved in the chemical perception and related behavior of insects41. Compared to the total number of insects in the world, CSPs have been identified in only a few species of insects to different degrees, such as Coleoptera42, Hemiptera43, Hymenoptera12, Lepidoptera44 and Orthoptera45, whose numbers show interspecific diversity. For example, the number of CSPs varies from 4 CSPs in Drosophila melanogaster to 22 in B. mori46. In this study, 3 candidate CSPs were identified, which are less than those in the parasitoids Ch. cunea (11), Sclerodermus sp. (10), Me. pulchricornis (8) and Tric. dendrolimi (7)2,4,12,16 and are greater than those in the parasitoids Cot. chilonis (2) and Ap. ervi (2)6,11. Previous research confirmed that generalists seem to be specifically suited for the processing of odorant mixtures, and they respond in a similar manner to plant volatiles47. For example, Ch. cunea is a generalist and has multiple hosts (e.g., Stilpnotia salicis L., Ivela ochropoda Eversmann, Clostera anachoreta Fabricius, Semiothisa cinerearia Bremer & Gray and Clania variegeta Snellen), while Cot. chilonis mainly parasitizes larvae of the genus Chilo Zincken48,49. It can be deduced that the number of CSPs relates to their host range. For the specialist Q. mendeli, the use of CSPs is expected to cope with the variability in host availability46. Previous studies also revealed that CSPs could be involved in the solubilization of hydrocarbons in the stratum corneum to recognize its companion50. Therefore, CSPs in Q. mendeli should be associated with its obligate parasitic characteristics, indicating that QmenCSPs may function in the chemical sensing of L. invasa and its shelter host eucalyptus trees. A similar story has been confirmed for M. mediator, which can accurately find and then parasitize its hidden host Agrotis segetum Denis and Schiffermüller51. The phylogenetic tree showed that QM_comp07737 and QM_comp08732 did not branch clusters with any other insects and QM_comp26540 was in the same clade as MmedCSP1 of M. mediator. The result showed that QM_comp07737 and QM_comp08732 may be specific CSPs of Q. mendeli4, and their functions needed to be further explored. QM_comp26540 had the closest relationship with MmedCSP1, which has a strong reaction with methyl salicylate, pentane, ocimene, β-ionone, 3,4-dimethylbenzaldehyde, 2-hexanone and cis-3-hexe-1-ol51. Previous research suggested that MmedCSP1 can function in chemical sensing of the plant volatiles of M. mediator’s host A. segetum51. Encouragingly, significant GC-EAD responses of Q. mendeli antenna to eucalyptus volatile α-phellandrene and 1,8-cineole were observed52. Thus, QM_comp26540 may function as a chemosensor, which is involved in the process of recognizing plant volatiles from eucalyptus when Q. mendeli searches its host, L. invasa.

GRs are widespread in gustatory organs of insects that respond to various taste-related soluble compounds, and cuticular hydrocarbons and odorants, such as sugars, amino acids, salts, bitter compounds, CO2 and pheromones, can be recognized and combined by GRs53,54. To date, GRs in some model insects with genome reports have been identified, such as A. gambiae (76), B. mori (69), D. melanogaster (68) and N. vitripennis (58)55,56. In this study, 10 candidate GRs were identified, which was similar to other parasitoids, such as An. japonicus (8)13, Sclerodermus sp. (6)2 and M. mediator (6)8. This could be attributed to the sequencing depth and species-functional specificity of GRs53. DmelGR5 and DmelGR64 in D. melanogaster are receptor proteins for sweet taste and are used to detect glucose, sucrose, maltose, maltitol and cottonseed sugar57. For Q. mendeli, females that were fed a honey solution or honey solution + young eucalyptus leaves lived for a longer time than those who underwent other treatments, including flowers, gall leaves, water, galled leaves + honey solution, no food and young leaves23. Therefore, GRs in Q. mendeli should play a key role in recognizing sugar and fresh eucalyptus leaves via various soluble compounds58. The phylogenetic tree showed that QM_comp00164 and QM_comp22611 were the same clade as the proteins in Trib. castaneum, QM_comp03300, QM_comp22814, and QM_comp26507 were the same clade as the proteins in D. melanogaster, and other proteins in Q. mendeli were the same clade as the proteins in N. vitripennis. Thus, GRs of Q. mendeli may share high homology and closely cluster with the proteins in D. melanogaster, N. vitripennis, and Trib. castaneum. Interestingly, QM_comp11847 was in the same clade as the sugar receptor NvitGR1, which was used to recognize the only source of nutrients from host Lucilia caesar L. for the offspring of N. vitripennis59,60. Thus, QM_comp11847 may be involved in recognizing host organisms and sugars, which helps Q. mendeli to quickly access energy from these molecules.

IRs, which evolve from the ionotropic glutamate receptor (iGluR), are a new class of sensory proteins mainly in taste organs/sensilla that respond to food components, such as sugars, salts, water and bitter compounds, and detect small temperature differences61,62,63. In this study, 21 candidate IRs were identified, which was more than in the parasitoids Me. pulchricornis (19)12, Ch. cunea (10)4, M. mediator (6)8, Sclerodermus sp. (3)2, Mi. cingulum (3)9 and An. japonicus (3)13. Physiological recordings from taste sensilla in D. Melanogaster and other insects have revealed responses of taste neurons to salts, sugars, water, bitter compounds and a large diversity of other tastants61,62. Taste sensilla are widely distributed on the antennae of Q. mendeli28, suggesting that QmenIRs may function as taste receptors. The phylogenetic tree showed that QmenIRs were spread across the tree branches and clustered with homologous IRs from other species, which suggested that QmenIRs may be functionally conserved. QM_comp21031 was located in the same clade as BmorIR21a of B. mori, DmelIR21a of D. melanogaster and TcasIR21a of Trib. castaneum, indicating that QM_comp21031 has the closest relationship with insect IR21a, which can mediate cool sensing in Drosophila64. Thus, QM_comp21031 may perceive changes in temperature since insect IR21a can achieve both heat avoidance and heating65,66. Taking the oviposition features into consideration, we deduced that female Q. mendeli may be capable of sensing surface heat on galls related to L. invasa damage, which requires further exploration.

OBPs are crucial in insect olfactory perception and are the first step in the recognition of chemical stimuli from the outside environment3. In some model insects, OBPs have been identified to different degrees, such as B. mori (57)67, D. melanogaster (51)68, Trib. castaneum (46)69 and A. gambiae (44)70. In this study, 56 candidate OBPs were identified, which was more than in the parasitoids Ae. bambawalei (54)14, Ch. cunea (25)4, Tric. dendrolimi (24)16, M. mediator (20)8, Me. pulchricornis (16)12, T. japonicum (15)3, Ap. ervi (15)11, Sclerodermus sp. (10)2, Cop. floridanum (8)15, Cot. chilonis (8)6 and As. hispinarum (8)17. The number of Q. mendeli OBPs identified was less than that in the parasitoid Cot. vestalis (74)5. Previous studies revealed that OBPs in parasitoids play a key role in binding and transporting hydrophobic odorants from the environment to sensory receptors71. In Q. mendeli, a significant behavioral response to the gall volatiles d-limonene and decanal was observed (unpublished data). Relevant QmenOBPs function in chemical sensing of these volatiles characterizing L. invasa and its shelter host eucalyptus trees, which bioinformatics analysis could help to target. The phylogenetic tree showed that 15 QmenOBPs were the same clade as 5 DmelOBPs of D. melanogaster, and 15 QmenOBPs were the same clade as NvitOBP1 of N. vitripennis, and 3 CfloOBPs of C. floridanum, which suggested that most OBPs of Q. mendeli may share high homology and closely cluster with the proteins in C. floridanum, D. melanogaster, and N. vitripennis. The evolutionary relationship of Q. mendeli OBPs, as inferred in the phylogenetic tree, indicated that they are orthologous sequences due to the absence of monophyletic groups. Our results showed that QM_comp21238 is the same clade as MmedOBP10 of M. mediator, which is involved in the process of recognizing β-ionone and Nonanal when they find the location of their hidden host A. segetu51,72. Thus, QM_comp21238 may also be involved in the process of recognizing similar odorants or ligands when Q. mendeli locates its shelter host, L. invasa.

ORs are thought to play critical roles in the perception of chemosensory stimuli by insects54. The number of ORs in parasitoids vary greatly1,4. In this study, 30 candidate ORs were identified, which was more than in the parasitoid Tric. dendrolimi (9)16, Mi. cingulum (9)9 and Sclerodermus sp. (8)2. Previous studies revealed that the OR of M. mediator play an important role in recognizing plant volatiles, such as nonanal and farnesene, which provided a key start to manipulate and develop ORs in wasps to find hosts and use them as biological tools for pest control73. The phylogenetic tree showed that QmenORs were spread across the tree branches and clustered with homologous ORs from other species, which suggested that QmenORs may be functionally conserved. The RNAi investigation of the role of MmedOrco, the M. mediator ortholog of Drosophila Or83b, supported the assumption that this highly conserved gene plays a similar role in insects73,74. QmenORs may function as chemoreceptors to recognize plant volatiles from eucalyptus. Our results showed that QM_comp20892 is in the same clade as DmelOR10a of D. melanogaster, which plays a role in responding to odorants such as methylsalicylate and acetophenone75,76. For Q. mendeli, QM_comp20892 may be involved in the process of recognizing similar odorants or ligands.

SNMPs are involved in cellular signal transduction and play a role in odor detection5. Two SNMPs are normally broadly identified in different insects, e.g., parasitoids Ch. cunea and Sclerodermus sp.2,4. It has been reported that SNMP1 and SNMP2 are both expressed in antennae sensilla and have different expression patterns4,77. In Ch. cunea, CcunSNMP1 is a morphine receptor of neurons, and CcunSNMP2 is mainly expressed in supporting cells and the lymph of antennal sensilla4, while the location and expression patterns of SNMPs in Q. mendeli should be further studied since this information should be associated with their functions. SNMP1 of D. melanogaster is involved in pheromone detection and enhances the Ca2+ responses served in signal transduction78. SNMP1 of M. mediator was determined to participate in both pheromone and general odor detection79. In contrast, the general functional mechanism of SNMP2 in parasitoids is still poorly understood. The phylogenetic tree showed that QM_comp21591 was the same clade as TcasSNMP1 of Trib. castaneum and QM_comp09081 was the same clade as CfloSNMP2 of C. floridanum, suggesting that QM_comp21591 and QM_comp09081 shall be SNMP1 and SNMP2 in Q. mendeli respectively. In addition, Q. mendeli is a uniparental parasitoid that is not required in the male search for mating22. Thus, the function of QM_comp21591 and QM_comp09081 may include an oviposition pheromone receptor rather than a sex receptor and a membrane protein with unknown functions, which needs to be further explored.

Chemical detection involves a series of complicated processes that require participation and interactions by multiple cascades of sensory proteins. Insect sensory proteins are capable of functional cooperation and division. Firstly, OBPs and CSPs are both chemically binding proteins to various odorants and can also respond to the same chemicals, e.g., MmedOBP10 and MmedCSP1 are involved in the process of recognizing β-ionone and nonanal when they find the location of their hidden host A. segetu51,72. Secondly, ORs and IRs are both chemoreceptors, while there are differences in the process of recognizing odor substances80. For example, IRs are better at detecting long-lasting odor pulses, and they are less sensitive, suggesting that they are better at close-range odor detection. In contrast, ORs are more sensitive and better at resolving brief (low molecular flux) pulsed stimuli80,81. Moreover, features of functional organization have emerged between behavioral response profiles of OBPs and electrophysiological response profiles of ORs75. Therefore, the sensory genes in Q. mendeli should systematically act on the process of locating their gall-making host, and the biological functions of these genes and their products are still poorly known. Overall the sensory genes of the wasp reported here provide valuable insight into the molecular mechanisms of olfaction, which help pave the way for the host location of Q. mendeli in gall-making pests.