Introduction

Communication is one of the keystones of the functioning of societies, which is supported by a diversity of signals1. Chemical signals such as pheromones are the most widely used communication system among taxa thanks to the ease of production of a high diversity of compounds2. Protein products tend to have high hydrophilicity, which enables dispersion over long distances, making them relevant communication mediators for aquatic species3. However, proteins could also have evolved for terrestrial species adaptation, where physical contact with a substrate or between individuals are frequent3. Contrary to vertebrates or aquatic species, the implication of proteins in insect social interactions has been fairly neglected by the scientific community so far, despite their potential adaptation for insect societies. In contrast, the role of non-polar compounds in insect has been extensively studied.

Data on protein-based chemical cues for mammals and aquatic species is relatively abundant. In amphibians4 and fishes5,6, individuals produce protein signals involved in female attraction or sexual behavior. Proteins are also known to be involved in social interactions (mate recognition and attraction, aggressive behavior) in several species of molluscs7,8,9 and crustaceans10. In a terrestrial environment, various species, especially rodents and amphibians, use these large molecules as chemical signals during interactions between individuals. In mice, several peptides (i.e., very short proteins) serve as mediators for sexual behaviors. Males can facilitate copulation by directly transferring a peptide (ESP1) to the female vomeronasal organ11. On the contrary, sexually immature mice can inhibit the mating behavior of adult males by producing a peptide (ESP22) via their lacrimal glands12. Several major urinary proteins (MUP) produced by males are also known to be implicated in individual recognition and aggressive behaviors between males, in the context of territorial markings13,14. In amphibians, males produce peptidic pheromones via abdominal, rostral, or mental glands which attract females and enhance their receptivity15,16. In terrestrial environments, the lack of volatility and the large molecular weight of proteins limit their dispersion and therefore their perception. So, to increase their efficiency, proteins and peptides are either deposited on a substrate (e.g., at the territorial entrance) or directly transferred by contact to the recipients. This close-contact chemical communication may be promoted in social insects where physical contacts between individuals are frequent within closed structured habitats.

Eusocial insects live in closed environments where individuals constantly interact with the substrate and with their nestmates. They are characterized by a strong division of labor, and especially a reproductive division of labor. Reproductives communicate on their fertility status to maintain this reproductive division of labor17. Volatile chemical signals, and also cuticular hydrocarbons compounds (CHCs), are well known to be major components of this reproductive specific signaling, either as a pheromone to induce specific behaviors increasing the colonial inclusive fitness, or as an honest signal announcing the presence of fertile individuals to prevent the development of competitors18,19,20,21,22,23,24. For decades, the scientific community uncovered a lot of evidence on the CHCs in insect interactions including but not limited to courtship, mating, discrimination, recognition, aggregation, and dominance25,26,27,28,29,30,31,32. However, the cuticular waxes of insects also contain proteins which have been identified in several species of cockroaches33,34,35, honeybees36, paper wasps37, and termites38.

Like CHCs, cuticular polar compounds present differences in their compositions according to the sexes or castes which have been revealed in very few species. In the paper wasp Polistes dominulus, cuticular polar extracts, which are partly composed of proteins, differ between foundresses and their first emerged daughters39. Moreover, the social context seems to influence the protein profiles, i.e. orphaned workers have intermediate profiles between queenless and queenright colonies35. These specific compositions suggest a role of Cuticular Protein Compounds (CPCs) in social communication such as mate recognition or fertility signaling. Turillazzi et al. observed that the cuticular peptides dominulins can be detected and used by foundresses to locate suitable hibernation sites40. However, their implication in social interactions still needs to be demonstrated. In this perspective, Bruschini et al.41 observed that the cuticular polar fraction of Polistes dominulus was composed of medium molecular weight compounds ranging from 900 to 3000 Da. They observed that these chemical profiles do not strongly differ between colonies and concluded that these compounds cannot be used alone in nestmate recognition. In the termite Reticulitermes speratus, workers recognize eggs by their shape, size, and by a 'Termite Egg Recognition Pheromone' (TERP) composed of an antibacterial protein called lysozyme and a digestive β-glycosidase enzyme42. It has been suggested that these proteins could be directly produced by eggs or could be transferred from the reproductives. Variations in CPCs according to castes, sexes, and stages has been identified in three termites species (Reticulitermes flavipes (santonensis), Kalotermes flavicollis and Prorhinotermes simplex)38. CPC profiles from reproductives and non-reproductives differed for the three species but only P. simplex showed sex variations. These results clearly suggest a potential implication of CPCs in social interactions and particularly in caste and fertility signaling. We therefore hypothesize that polar compounds such as CPCs represent an alternative to CHCs to support reproductives' signals and we have initiated a first characterization of CPCs along with behavioral experiments.

To explore the potential role of CPCs in reproductive recognition, we have extracted cuticular polar fractions from reproductives and workers. These extracts were used for molecular profiling and tested in behavioral bioassays to measure body-shaking events displayed by the workers. Indeed, it has been recently shown that in the termite Reticulitermes flavipes, body-shaking is a vibratory behavior involved in reproductives recognition18,43,44. As controls, we also tested live reproductives (no extracts) and polar extracts from workers. The protein profile of the different extracts was determined by mass spectrometry to detect caste- and sex- specific compounds in reproductives and workers. While still quite preliminary in nature, this work combining protein profiling and behavioral assays is the first step in exploring whether polar compounds, and in particular proteins, may contribute to cues or signals in termites, as they do in the animal kingdom.

Methods

Study species and laboratory conditions

Stock colonies of R. flavipes were collected from 2014 to 2017 in the Oléron island (Charente-Maritime, France) with a distance of at least 300 m between colonies to ensure their independencies45. In the laboratory, colonies were maintained in the dark at 26 ± 1 °C with 95 ± 5% of relative humidity. Each colony was kept in a separate plastic box (Star-pack) containing cellulosic ultrapure papers (47 mm diameter; grade 42 Ashless, Whatman)32 supplied with wood sawdust. Papers and sawdust were humidified using microfiltered water.

Behavioral experiments

For each treatment, thirty workers were isolated from 10 stock colonies containing individuals of every caste. They were sorted on a CO2 pad and randomly distributed in 4 types of experimental micro-nests containing 30 workers (Fig. 1): without reproductives (R−), with reproductives (R+), with aqueous polar extracts of workers (WE) or with aqueous polar extracts of reproductives (RE). These four combinations were set up using workers and brachypterous reproductives (i.e., secondary reproductives from nymphal lines46) originating from the same 10 colonies. Micro-nests were made with plastic boxes (50 mm diameter; Star-pack Cat#04913) with a cellulosic ultrapure paper (47 mm diameter; Whatman, grade 42 Ashless) humidified with 250 µl of microfiltered water and 0.5 g of Fontainebleau sand (Carlo Erba Reagents, granulometry: 180–500 µm).

Figure 1
figure 1

Experimental micro-nests were composed of 30 workers of both sexes in absence of reproductives (R−), in the presence of reproductives (R+), with workers’ extract (WE) or with reproductives’ extract (RE). Each micro-nest has been video recorded twice at 1 h and 24 h after they had been made. Water was used as a solvent and was added to all the treatments. Living individuals and extracts are represented in different grey tones.

For the R+ treatments, reproductives were sexed on a CO2 pad and then randomly paired (one male with one female) in the corresponding micro-nests. For R− and R+ treatments, 95 µL of ultrapure water was deposited on the sand of each micro-nest, whereas for extract treatments (WE and RE), 95 µL of polar extracts made from either workers or reproductives originated from the same colonies (i.e., each extract was done on the same colony as the tested workers; see details in the “polar extracts” section) were deposited on the sand. Note that the extracts were deposited to the sand contained in an Eppendorf tube. After the deposit of the extract, the sand samples were dropped off into the micro-nest, then the reproductives were added if any, and at the end, the 30 workers were added to the micro-nest (Fig. 1).

Video recordings were processed for each micro-nest and treatment for the 10 colonies to assess body-shaking behavior of workers in a room at 26 ± 1 °C with 55 ± 5% of relative humidity. Each micro-nest was transferred to a video recording chamber where individuals could settle down for 5 min before being distributed in groups according to treatments and recorded for 5 min (Panasonic HC-VXF990) under infrared lights. Two series of recordings were done 1 h after the micro-nest setup and 24 h later. Videos were recorded in random order under double-blind conditions47. The total number of body-shaking events expressed by all the workers during the 5 min of recordings were analyzed using the BORIS freeware v.6.0.548. Note that a body-shaking event lasts on average 0.7 s, and the quantification of their occurrence was done according to previous publications43,44. At the beginning of each video recording, the humidity and temperature of the room were measured. At the end of the experiment, the mortality and sex ratio (number of males over females) of each box were calculated.

Polar extracts

Workers and brachypterous reproductives used for polar extracts (WE and RE treatments respectively) were isolated from their stock colony and then sexed on a CO2 pad. For each colony (n = 10), a pool of workers (5 males and 5 females) was extracted for the WE treatments, and a pool of reproductives (1 male and 1 female) was extracted for the RE treatments. We used this ratio of 5 workers for 1 reproductive to reduce quantitative differences during extractions due to body size differences between workers and reproductives, and so to maximize behavioral responses during experiments. Indeed, based on fresh body mean weight (n = 15 colonies), one reproductive (7.93 ± 0.25 mg) is equivalent to 2.88 workers (2.75 ± 0.15 mg). According to preliminary tests and literature data38, ultrapure water was selected as the best polar solvent for the extraction of CPCs. Individuals were washed alive in 100 µL of ultrapure water with a vortexing period of 10 s (LP vortex mixer, 1200 rpm) in a 650 µL plastic Eppendorf. Individuals were incubated at room temperature (20 °C) for 5 min. At the end of the incubation time, an additional 10 s of vortexing was done before transferring the extracts into new vials. Over the 100 µL of polar extracts, 95 µL were used for behavioral experiments and 5 µL were analyzed by spectrophotometry for quality control. Three µL of each extract were measured by spectrophotometry (Varian Cary 50-scan®) at 215 nm (with 3 consecutive readings). Extracts with optical density values lower than the mean value of the water used as control were replaced.

Mass spectrometry analyses

Compound profiling by MALDI-TOF MS

The protein profile of the cuticular polar extracts was analyzed using MALDI-TOF mass spectrometry for workers and brachypterous reproductives of each sex. The protocol described above in the “polar extracts” section was used to extract compounds, except that extractions were done on single individuals and not on a pool of individuals, per caste and sex for the 15 different colonies. An analysis of protein content in extracts was done using spectrophotometric protein quantification (as described above) and MALDI-TOF MS analysis on 5 individual extracts per sex and caste. These analyses showed that worker extracts present a lower concentration of compounds compared to reproductives. To limit detection bias in mass spectrometry by leveling up concentrations between workers and reproductives, for subsequent analyses 10 individual extracts of worker per sex were pooled for each colony (n = 10 pools per sex). Since statistical analysis of the worker samples either from individuals (n = 5 per sex) or from pools (n = 10 per sex) did not present significant differences, both datasets were treated as a single dataset. For reproductives, as mentioned above, analyses were always done individually (n = 15 per sex).

The 100 µL of cuticular extracts were stored at − 80 °C until treatment for MALDI-TOF MS analysis. Samples were vacuum concentrated 20-fold using a SpeedVac system (Thermo Scientific). The matrix solution and the MALDI deposition method were chosen to optimize the detection of peptides and proteins. Concentrated samples were diluted 5-fold in a solution of 4-hydroxy-α-cyano-cinnamic acid saturated in a solution of 33.3% acetonitrile, 66.6% water, and 0.1% trifluoroacetic acid. Matrix-sample solutions were spotted onto a gold-plated sample probe using the ultrathin layer method49,50. The excess liquid was vacuum-aspirated, and the spots were washed with cold 0.1% TFA to remove contaminants. MALDI-TOF–MS spectra were acquired on an UltraFlex I or an UltrafleXtreme mass spectrometer (Bruker Daltonics) in the linear positive mode in the 1000 to 25000 m/z range. Calibrations were performed externally using apomyoglobin and cytochrome c ion peaks, using the near-neighbor spot method. MALDI-TOF–MS spectra were processed using FlexAnalysis 3.3 software (Bruker). Spectra were smoothed using the gaussian algorithm with a width of 1 Da and 5 cycles. Peaks were annotated using the Sum algorithm with a S/N ≥ 4 and a peak width of 5 Da.

For protein/peptide profile comparisons, spectra were exported from the FlexAnalysis software into the mMass open-source software v5.5.0 (Strohalm et al. 2008) for further treatment. Spectra were averaged when necessary and negative spectra were generated by subtraction from zero.

Identification of proteins/petides by top-down MS/MS sequencing

To confirm that the profiled extracted cuticular compounds were proteins and/or peptides, top-down MS sequencing of species was performed using MALDI-TOF/TOF LIFT or ESI-QTOF CID fragmentation method. For MALDI-TOF/TOF LIFT, the same sample preparation and deposition method as for the profiling approach were used. For ESI-QTOF CID analysis, samples were diluted in the micromolar range in a solution of 50% acetonitrile, 49.8% water and 0.2% formic acid and were analyzed by direct infusion in a 4-GHz MaXis ultra-high-resolution quadrupole-TOF mass spectrometer (Bruker Daltonics) equipped with an electrospray ion source. Spectra were acquired in positive ion MS mode over a 115–3000 m/z range and were externally calibrated with the ESI-L Low Concentration Tuning Mix (Agilent Technologies). ESI mass spectra were processed using DataAnalysis v3.1 software (Bruker Daltonics). The PEAKS software was used for de novo sequencing of the compounds and spectra interpreted manually. To identify proteins based on the de novo sequence tags, we used the UniProt Peptide search tool to interrogate the TrEMBL and SwissProt Eukaryotes databases. MS/MS data were exploited using a Mascot search, performed from top-down spectra in the two databases: Termitoidae protein database and Termitoidae nucleotide database.

Statistical analyses

The number of body-shaking events counted on each video recording was analyzed using a general linear mixed model (LMM) with several explanatory variables: treatments, time, sex-ratios of workers, mortality, temperature, and humidity. Non-significant variables and interactions were removed following a step-by-step procedure which include sex-ratios, mortality, temperature, and humidity. Note that the average temperature and humidity during video recordings were respectively 25.59 °C ± 0.01 (mean ± SEM) and 57.50%RH ± 0.12. No worker mortality was observed after 1 h, and 3 out of 1200 workers were dead after 24 h. The sex-ratio (male over female) were on average 1.47 ± 0.07. Therefore, the explanatory factors used in the final model were the treatments and the time (with their interactions). To fit with homoscedasticity and normal distribution of model residuals, the numbers of body-shaking were log-transformed. Colonies were included as a random factor in all statistical models. To explore the effects of treatments and time separetly (no-significant interaction were observed), the dataset was split and two additional LMMs were conducted on either 1 h or 24 h with treatments as an explanatory factor. Post-hoc pairwise comparisons were conducted using Tuckey HSD. Time effect comparisons (i.e., 1 h vs 24 h) was analyzed for each treatment separately with non-parametric Wilcoxon signed-rank tests. Outliers were calculated with the default interquartile range criterion of 1.5.

For each caste and sex sample, MALDI-TOF mass spectra were treated in FlexAnalysis as described above to calculate peak areas. Peaks with a standard deviation below 1 Da were assigned to the same compound up to m/z 9800. For m/z > 9800, a standard deviation less than 1.7 Da was used as threshold due to the lower resolution and intensity of the peaks. Compounds that were observed only once among the samples were not considered and deleted from the mass list. Although some peaks may correspond to doubly charge species, all m/z values were treated as separate species in statistical analyses. Then peak areas were analyzed using two multivariate analyses. Centered and autoscaled data were tested with and without quadratic transformation. Since no improvement was shown with transformation, further analyses were done without51. To discriminate groups based on their protein profiles, Partial Least Squares Discriminant Analyses (PLS-DA) were done on peak areas with the plsda function (mixOmixs package) on 4 pre-defined groups (female reproductives, male reproductives, female workers, and male workers). Then VIP values were calculated to determine the importance of each CPC in the projection. The most representative CPCs with a VIP value greater than 1 were thereafter noted as major compounds52. Comparisons between sexes within each caste were done on the relative peak areas (calculated by dividing a peak area by the total area of all the peaks in the spectrum) with non-parametric Wilcoxon signed-rank tests.

All statistical analyses were conducted using the software R v4.1.2 (www.r-ptoject.org) loaded with the packages lme4, emmeans, car, multcompView, Hotelling, ade4, mixOmics, RVAideMemoire, and ggplot2. Multiple comparisons were corrected with FDR as a p-value adjustement method.

Results

Behavioral experiments

The number of body-shaking events was influenced significantly by the treatment factor within each time point (1 h or 24 h). Therefore, the dataset was split and analyzed separately for each time point. Indeed, the number of body-shaking events was dependent on the time after the setup of the micro-nests (F(3,17.6) = 31.66, p = 0.002) with no interaction with the treatments (F(3,17.6) = 1.62, p = 0.18).

At 1 h, there was a significant effect of the treatments on the number of body-shaking (F(3,17.6) = 32.47, p < 0.001; Fig. 2A). Post-hoc tests showed that workers displayed more body-shaking events when exposed to live reproductives or their extracts (R+: 46.4 ± 8.5 ; RE: 16.6 ± 3.5 ; values are mean ± SEM) than when exposed to a workers-only environment (R−: 6.1 ± 1.9) (R+ vs R−: t(27) = 8.96, p < 0.001; RE vs R−: t(27) = 4.61, p < 0.001) or than when exposed to extracts from workers (WE: 9.7 ± 3.2) (R+ vs WE: t(27) = 7.82, p < 0.001; RE vs WE: t(27) = 3.47, p = 0.009). Worker's extract did not significantly influence the number of body-shaking (WE vs R−: t(27) = 1.14, p = 0.668).

Figure 2
figure 2

Number of body-shaking events of workers after 1 h (A) or 24 h (B) for the different treatments (n = 10). In absence of reproductives (R−), in the presence of reproductives (R+), with extracts of workers (WE) or reproductives (RE). Box plots show the median, upper and lower quartiles, and range; dots, outliers; crosses, means; different letters indicate significant differences (LMMs, α = 0.05).

At 24 h after the setup of the micro-nests (Fig. 2B), the effect of the treatments was still significant (F(3,22.8) = 6.65, p < 0.001). However, post-hoc tests showed that only the presence of live reproductives (R+: 93.7 ± 8.5) was significantly different from the other treatments (R+ vs R−: 31.6 ± 8.6, t(27) = 3.67, p = 0.006; R+ vs WE: 29.3 ± 9.3, t(27) = 4.00, p = 0.002; R+ vs RE: 34.6 ± 7.9, t(27) = 2.96, p = 0.030).

Direct comparisons of the number of body-shaking events at 1 h and 24 h for each treatment (Fig. 3) showed a time-dependent increase of events for 3 treatments: with a workers-only environment (R−: Z = 4.5, p = 0.022), with live reproductives (R+: Z = 6, p = 0.032), and with extracts from workers (WE: Z = 0.5, p = 0.022). Indeed, body-shaking events increased over time only marginally significantly in the presence of extracts from reproductives (RE: Z = 8.5, p = 0.059).

Figure 3
figure 3

Number of body-shaking events of workers after 1 h and 24 h for the different treatments (n = 10). In absence of reproductives (R−), in the presence of reproductives (R+), with extracts of workers (WE) or reproductives (RE). Box plots show the median, upper and lower quartiles, and range; dots, outliers; crosses, means; * indicates significant differences (Wilcoxon signed-rank test, p < 0.05).

Protein/peptide profiling by MALDI-TOF MS

Analyses performed on cuticular polar extracts of reproductives and workers (females and males), using a MALDI-TOF MS method designed for protein detection, revealed 197 CPCs ranging from m/z 1000 to 25000 (Fig. 4). PLS-DA analyses based on the peak areas of CPCs were performed to provide a visual assessment of the repartition of individuals according to caste and sexe (Fig. 5). PLS-DA discriminated extracts according to their caste only with a Number of MissClassification (NMC) of 51.0% (Fig. 5A). The Number of MissClassification drops to 28.5% if workers of both sexes are pooled together, and to 10.2% if individuals are regrouped according to their reproductive status whatever their sex (Fig. 5B). In a permutation test with 999 repetitions, the MVA test was statistically significant (all groups p = 0.003; caste only p = 0.001). Pairwise comparisons using permutation tests based on cross model validation, separated reproductives (RF and RM) from workers (WF and WM) independently of their sexes (p = 0.001).

Figure 4
figure 4

Averaged mass spectra of cuticular polar extracts from MALDI-TOF analyses. Spectra are shown in positive/negative display for reproductives versus workers for both (A) females and (B) males. The spectra are based on a sum up of 15 spectra for each caste and sex. Annotated peaks represented in bold are the first-quartile of the 87 major CPCs identified as discriminant by the PLS-DA analyses with VIP values greater than 1. Note that the 3 major CPCs 4428, 10056 and 10196 are also represented in the figure to support the discussion even if they are not in the first-quartile (see the result section for details). See Table 1 for the entire list of compounds.

Figure 5
figure 5

PLS-DA plot based on the peak areas of the 197 CPCs identified from cuticular polar extracts (n = 15 per group; peaks were selected using the Sum algorithm with a S/N ≥ 4 and a peak width of 5 Da). (A) Score plot of female reproductives (RF, light turquoise), male reproductives (RM, dark green), female workers (WF, light orange), and male workers (WM, dark orange). (B) Score plot of reproductives (dark green) and workers (light orange). Score plot display centroids and dispersion ellipses (cellipse = 1.5).

According to VIP values, there are 87 major CPCs ranging from m/z 1000 to 25000 (Table 1). Profile comparisons of the 87 major CPCs of the 4 groups (RF, RM, WF, WM) revealed that 85 CPCs were specific to reproductives and 2 CPCs were specific to workers. Among all 7 CPCs detected in workers (i.e., CPCs specific to workers or CPCs shared with reproductives), 5 were common to females and males, and 2 were specific to females. Among the 85 CPCs detected in the reproductives, 48 were common to female and male reproductives, 19 were female-specific, and 18 male-specific. The mean relative areas of the major compounds common to both sexes were compared within each caste. These comparisons revealed that three compounds present higher mean relative areas in female reproductives compared to male reproductives: CPC-4963 (W = 153, p = 0.0252), CPC-10196 (W = 156, p = 0.022), and CPC-10416 (W = 166, p = 0.013). No significant sexe differences where found for workers.

Table 1 Compounds identified from the MALDI-TOF analyses for female reproductives (RF), male reproductives (RM), female workers (WF), and male workers (WM). CPCs average m/z values are also reported with their presence/absence in each group.

Over the first quartile of the 87 majors CPCs (Fig. 6), CPC-4139 presents the highest mean relative area in reproductives, whereas CPC-1330 is the highest peak in workers (see the full list in Table 1). Interestingly, two CPCs are specific to female reproductives (CPC-4284 and CPC-9926) and four CPCs are specific to male reproductives (CPC-223, CPC-2229, CPC-2279, CPC-4969). A visual comparison of the compounds repartition across castes and sexes of the first-quatile of the 87 major CPCs is shown in the supplementary Fig. 1 (CPCs were attributed to a group if more than one individual present the compound). Overall, in R. flavipes the diversity in workers' proteins is very low compared to reproductives (Sup. Figure 2).

Figure 6
figure 6

Relative areas of the first-quartile of the 87 major CPCs identified as discriminant by PLS-DA analyses for reproductives (A) and workers (B) of both sexes: female reproductives (RF), male reproductives (RM), female workers (WF) and male workers (MW). Relative areas are represented as mean (± SEM) and are calculated on all the spectra (see Table 1 for CPCs numbers). * indicates significant differences (Wilcoxon signed-rank test, p < 0.05; Multiple comparisons were corrected with FDR as the p-value adjustement method).

Identification of protein/peptide components by top-down MS sequencing

To confirm that the profiled cuticular extracts correspond to proteins and/or peptides, we performed mass spectrometry fragmentation on CPC-4428 and CPC-10056. Clear fragmentation spectra were produced for these two VIP CPCs. Manual interpretation of MS/MS spectra showed that inter-peak distances correspond to amino acids, as shown for CPC-10056 (Fig. 7).

Figure 7
figure 7

Fragmentation mass spectrum of the CPC-10056 using UHR-ESI-QTOF-MS/MS CID. The precursor ion m/z 914.4 (11+) is represented with a blue lozenge.

Further interpretation of the spectra allowed for the determination of sequence tags for these two CPCs (Table 2). Search with the UniProt Peptide search tool in the TrEMBL and SwissProt protein databases for the sequence tag from CPC-10056 (I/LI/LDDVI/LAVI/L) generated seven hits in Eukaryotes (Table 3). Within the Insecta taxon, the protein G12 precursor from Culex quinquefasciatus (southern house mosquito) was found under the B0WED5 Uniprot entry, while two uncharacterized proteins from Coptotermes formosanus (Formosan subterranean termite) were found under the A0A6L2P7Q1 and A0A6L2PEY0 entries. A Mascot search in the NCBI databases on MS/MS data for CPC-10056 (m/z 10055.8) identified the JK369675.1 sequence from Coptotermes formosanus in the Termitoidae cDNA library, with a score of 81 (Table 4). This hit contains the manually annotated de novo tag I/LI/LDDVI/LAVI/L from CPC-10056.

Table 2 Sequence tags obtained by de novo sequencing for CPC-4428 and CPC-10056.
Table 3 Eukaryota hits identified with the sequence tag I/LI/LDDVI/LAVI/L.
Table 4 Results of protein- and nucleotide- sequence based TDS MASCOT search in Termitoidae NCBI databases. All matches with Mascot score lower than 53 were considered insignificant.

BLAST alignements on the sequence of the G12 precursor from Culex quinquefasciatus and the JK369675.1 sequence from Coptotermes formosanus showed similarities varying from ~ 39% to ~ 72% with protein G12 from termite species Cryptotermes secundus and Zootermopsis nevadensis. The protein G12 precursor from Culex quinquefasciatus, together with its homolog AEG12 induced in the midgut of adult female mosquitoes (Aedes aegypti) after a blood meal, is thought to have originated from a common ancestor and share a close similarity to a digestion-related cockroach allergen proteins also secreted in the midgut53,54. Altogether, these results suggest that CPC-10056 is an ortholog of G12 protein, which is present in cuticular polar extracts and specific to reproductives in our studied species R. flavipes.

Discussion

In this preliminary work, we explore whether there is a potential basis for polar compounds, in particular proteins, to act on reproductive recognition in termites. For this purpose, we used aqueous extraction of polar compounds for both protein profiling and for behavioral assays. It has been previously demonstrated that body-shaking of workers (a vibratory behavior) was involved in reproductives recognition in the termite R. flavipes, since workers expressed this vibratory behavior in presence of reproductives or eggs18,43,44. Here we show that cuticular polar extracts from reproductives contain compounds that have the ability to trigger body-shaking, while extracts from workers do not. This effect is however lower than that observed with live termites. We also show that body-shaking behavior varies over time, with an increase in the number of events after 24 h, except for reproductive extracts. Using a MALDI-TOF MS method designed for protein detection, we have profiled proteins in the polar extracts of different castes and sexes, and obtained by ESI–MS/MS sequence tags on a sample of differentially observed peaks that confirm that the profiled compounds are essentially proteins. Henceforth, we have referred to the compounds detected in polar extracts using this method as Cuticular Protein Compounds (CPCs). Statistical analyses reveal a diverse mixture of CPCs, which is caste- and sex- dependent. Mass spectrometry fragmentation showed at least one example of an ortholog of G12, a digestive protein in termites which could be related to regurgitation by the insect. CPCs composition was more diverse in reproductives (171 CPCs in total) than workers (41 CPCs), with sex-specific CPCs for both male and female reproductives. The presence of a molecular signature of CPCs specific for a sex and/or caste raises the intriguing possibility that proteins could encode for social signals, as was shown for cuticular hydrocarbons (CHCs) or volatile compounds, which are well known to be involved in several social interactions18,19,20,21,22,23,24.

Chemical profiles with sex- or caste-specific compounds are well-known in numerous insect species, mainly supported in social insects by the cuticular hydrocarbons (CHCs)25,55. Indeed, CHCs are the most studied chemical cues since decades, and represent a key element of the division of labor in social insects. CHCs constitute a major part of the lipids covering the cuticle of insects, both in relative and absolute abundance56. The CHC composition vary between sexes and castes which has been shown to be a central factor involved in social interactions, social organization, recognition, sex/fertility signaling and mate choice, as well as in reproductive regulation in insects22,25,56,57,58, and more specifically in termites21,22,23,42. However, CHCs are not the only chemical components of insects’ cuticle59. While cuticular polar compounds diversity could encode information in addition to the well-known CHCs, their implication in social interactions has been largely neglected until now. The aqueous extracts prepared for the present study are not expected to contain non-polar compounds. However, we cannot rule out that at least part of the observed effect is due to polar compounds which are not proteins. Still, among polar compounds, proteins have physico-chemical characteristics that make them good candidates to hold non-volatile information, independently of or in concert with CHCs or non-polar compounds. Moreover, proteins are known to evolve to carry specific functions in organisms, and would be good candidates as specific mediators, whereas polysaccharides in the cuticle are likely to play mostly a structural role, with few variations between castes and sexes. In vertebrates and aquatic species, proteins are well-known to be major mediators for inter-individual interactions3. In insect societies, with structured nests, we could hypothesize that large biomolecules like CPCs could be deposited at nest’s hubs (royal chamber or nest entrance), allowing the transfer of contact information to walking nestmates, for example through gustatory perception. This is especially true for social insects living in close environments where contacts between individuals are numerous. Polar compounds such as CPCs could also be exchanged during social interactions like grooming or food transfer (social trophallaxis).

Globally in insects, investigations on proteins as signals are scarce and do not concern cuticular proteins3. Only few examples of proteins were described on the cuticle in bees, wasps and termites but no functional study was done36,37,38. In Drosophila melanogaster, the proteins contained in the seminal fluid are transferred to the female during mating, and they trigger various physiological and behavioral post-mating modifications of the female60. In honeybees, the proteins deposited by nurses in the royal jelly induce the development of larvae into future queens61. To our knowledge, the only studies investigating the behavioral role of proteins in termites showed the presence on the egg surface of an antibacterial lysozyme and a digestive β-glucosidase enzyme62,63. This protein complex called TERP (Termite Egg Recognition Pheromone) allow workers to recognize eggs. Here, we show evidence that CPCs could be as diverse and specific as CHCs. Indeed, the diversity and richness of reproductives' CPCs, and the behavioral response of workers exposed to polar extracts from reproductives, seems to point to their involvement in reproductive recognition. Whether CPCs are cues (i.e., unselected by-products of the metabolism) or signals (evolved for the benefit to both the sender and the receiver) remain to be determined2. Further work is needed to show if a set of CPCs induces behavioral answer in the subterranean termite R. flavipes, and to identify these proteins.

Reproductive signaling (or fertility signaling) in eusocial insects is one of the essential mechanism involved in the maintenance of the reproductive division of labor and thus it represents a major functional key of insect societies17. We observed for the first time that cuticular polar compounds of reproductives trigger body-shaking behavior in workers. Workers display more body-shaking in presence of live reproductives and extracts of reproductives than in presence of workers' extracts. In the termite R. flavipes, this behavior is a proximate of the presence of reproductives18,43,44 (also shown in this study). Thus, we bring behavioral evidence that reproductives recognition is mediated by cuticular polar compounds, with proteins and peptides good candidates as active compounds. Interestingly, the hydrocarbon heneicosane, a non-polar compound, has also been identified on the cuticle of male and female reproductives of R. flavipes, inducing body-shaking and antennation of workers18. However, some reproductives and some populations of R. flavipes do not produce heneicosane64,65,66. Therefore, more studies are needed to disentangle the potential complementary roles of heneicosane and polar compounds such as proteins on reproductives recognition. The presence of non-polar compounds is one of a few non-mutually exclusive explanations for the difference in effect between live reproductives and extracts we observed. Other chemical/physical cues, yet to be discovered could also be involved, which again could explain why live reproductives induce more body-shaking events than their cuticular polar extracts. For example, the cuticle structure itself could act as a carrier matrix with potentiation of the effect. Several other phenomena, alone or in concert, could account for the lower response elicited by a sand/cellulose paper support. First, proteins can adsorb on surfaces, and such adsorption on silica and cellulose material tends to be irreversible, which would decrease the apparent concentration, and thus the availability of the compounds for detection by workers. Second, some proteins may act synergistically42, so that an alteration of the ratio of some compounds might decrease their combined effect on workers. Taking all these considerations into account, future experiments aimed at identifying active proteins in behavioral assays should include the use of termite dummies, preferably foregoing the use of glass which bears the greatest adsorption risk. It should be noted that protein profiling was done on cuticular extracts directly, so that they more directly reflect the actual set of compounds present on the cuticle than extracts deposited on supports. Non-volatile compounds such as CPCs imply direct contact between the receptors and the substrate. When workers are exposed to a live reproductive or to an extract, in our setup it is not easy to determine if the workers exhibit body-shaking behavior immediately after contact, or if there is a delay in the behavior. The behavior could also be expressed after a short time-lapse and is not necessarily expressed directly during the contact event. Moreover, the receptor could be localized on the antennae or other body parts like the legs. It will be interesting to determine which receptors are involved in this behavior and what are their localization.

Time dependent comparisons showed an increase of body-shaking events at 24 h compared to 1 h in all treatments, except for reproductives' extracts where it was only marginally significant (Z = 8.5, p = 0.059; Fig. 3). This general increase, for both the mean values and also the data variances, may reflect a behavioral instability due to social isolation. Indeed, tested micro-nests are fragments of larger mother colonies displaying rich interactions. Therefore, workers might be more behaviorally disturbed after 24 h of isolation than after 1 h. At 24 h (Fig. 2B), the presence of reproductives still induce body-shaking but the reproductives extracts do not anymore. Alternatively, the short-lived effect of the extracts might be due to a lack of stability of the polar compounds over time. For proteins, this is a credible explanation as they would be susceptible to proteolysis by regurgitated gastric enzymes. On the other hand, some compounds may be long-lived: Turillazzi et al. reported long-lasting peptides on the cuticle and in the venom of Polistes: dominulin A (1854 Da) and dominulin B (1909 Da)40. It is possible that CPCs must be deposited repeatedly on substrates to maintain their effect, or that the cuticle is needed as a stabilizing carrier. Since extracts and focused workers were set up at once at the beginning of the experiments, we could not test the "stability over time of the extracts" separately from the "isolation time effect on workers", preventing conclusion on these hypotheses. It also would be interesting to determine the speed at which proteins/peptides in the extracts degrade or lose their structural integrities to the point of not be able to carry information anymore.

Cuticular polar extracts from all caste and sex are composed of a diverse mixture of proteins and peptides up to the 25 kDa range according to our mass spectrometry analyses. CPCs from reproductives are more diverse than those from workers' cuticle, with CPC-1330, CPC-4139 and CPC-4998 as main compounds separating both castes. In the paper wasp P. dominulus, foundress and their first emerged workers present clear specific CPCs (918 to 2679 Da)37. This caste specificity of CPCs was also observed in 3 termites species including our studied species R. flavipes38 (note that R. santonensis is a synonym of R. flavipes67). However, the compounds observed in female reproductives seems to be different from compounds in our study, except for the compound at m/z of 4141, which may correspond to CPC-4139 (m/z 4138.6). CPCs are also sex-specific for reproductives and workers in our study. Sex-specific CPCs are less common in the literature, but they have been described in a species phylogenetically close to termites, the cockroach Leucophaea maderae. In this species, males present two specific cuticular proteins (LMA-P18 and LMA-P22) progressively accumulated from ecdysis to sexual maturity35. The protein LMA-P22 is supposed to be involved in sexual behavior and belong to the lipocalin family known to bind hydrophobic compounds like pheromones68. In our study, among the 87 major CPCs we identified in reproductives 19 CPCs specific to females and 18 to males, including 3 CPCs more abundant in females than males, suggesting a function of CPCs in sex signaling and caste regulation. Indeed, the production of reproductives in R. flavipes is under a dual sex-specific regulation22. While the presence of female reproductives inhibits the production of new female reproductives, it also stimulates the molting of male workers into new male reproductives. A reciprocal phenomenon is observed for male reproductives. In workers, only females present sex-specific CPCs with 2 compounds (CPC-4139 and CPC-4158). Interestingly, CPC-4139 is the most abundant CPC in reproductives, whereas in workers, it is found in very low abundance, in only 2 samples out of 30. This low abundance and frequency may reflect the presence of female workers starting to develop into reproductives. Removing those individuals from the dataset did not change the results on group separations and on the identification of caste/sex specific CPCs. We also noticed that a few reproductives (particularly males) present CPC profiles close to workers (Fig. 5), suggesting that these males may be new immature reproductives. It would be interesting to monitor the maturation of the CPC profiles for individuals engaged into reproductive differentiation, such as workers and nymphs, during the different molting phases, as well as to track body-shaking behaviors according to the sex of the reproductives. Overall, these results are consistent with the possible role of CPCs in reproductive recognition, sex signaling and caste regulation.

In insects, secreted and cuticular proteins/peptides have been mainly identified to possess antimicrobial activities37,62,69. Mass spectrometry proteomics explorations allowed us to confirm that polar extracts contain proteins and may bear post-translational modifications such as glycosylations. Our preliminary fragmentation study shows that CPC-10056 is an ortholog of the midgut enzyme protein G12, also identified in several termite species. Interestingly, at least one digestive enzyme is known to be involved in the reproductive division of labor in the termite C. secundus70: the Neofem2 gene encodes a wood digestion enzyme (β-glucosidase). Indeed, silencing Neofem2 in queens induce fights between workers for the royal succession in queenless colonies. Moreover, this β-glucosidase is involved in egg recognition in the termite R. speratus63, and in the mating behavior of the cockroach Leucophaea maderae71. Altogether, these observations suggest that the function of the β-glucosidase may have evolved from wood digestion to pheromonal caste regulation in cockroaches and termites, even though other chemicals could contribute to these pheromonal functions18,63,70,71.

Future studies will be needed to sequence and identify reproductive-specific CPCs and to test their behavioral functions, either individually or in synergy, to determine if they play a role in the regulation of social interactions in insects, as observed in vertebrates or aquatic organisms3. Sex-specific variations observed in reproductives also open new fields of inquiry for the potential role of proteins in the dual sex-specific regulation of the reproductives which is occurring in subterranean termites22,72. The scope of our study is limited by the fact that sand-deposited polar extracts may be depleted in some proteins and other polar molecules that may act as synergistic compounds. Despite these limitations, the combination of protein profiling with behavioral bioassays represents a powerful tool to explore unknown aspects of insect communications, which when further correlated through the identification of the active proteins, could help to answer unresolved questions about social insect communication.