Introduction

Environmental molecules contain information that regulates a series of important insect behaviors, such as mating1,2, oviposition3, foraging4,5 and host-seeking6,7. Insects have evolved a highly acute and sensitive olfactory system that can selectively detect environmental molecules. Several types of olfactory proteins play crucial roles in determining or helping to complete the selective detection process for odorants, including odorant-binding proteins (OBPs), odorant receptors (ORs), ionotropic receptors (IRs) and sensory neuron membrane proteins (SNMPs), along with odorant-degrading enzymes (ODEs)8. It is now commonly accepted that insect OBPs exhibit the function of solubilizing odorants to transport hydrophobic molecules through the aqueous sensilla lymph and contribute to the sensitivity of the olfactory system9,10.

Recently, several structures of insect OBPs, e.g., Bombyx mori PBP1 (BmorPBP1), Anopheles gambiae OBP1 (AgamOBP1), AgamOBP7, Culex quinquefasciatus OBP1 (CquiOBP1) and Locusta migratoria OBP1 (LmigOBP1), have been solved based on crystal structure, NMR and X-ray studies1,11,12,13,14,15. Structural analyses of BmorPBP1 led to the conclusion that residues Asp-132 and Glu-141 form a molecular switch that, at low pH, triggers the formation of the C-terminal α-helix upon protonation16 to aid in the binding and unlocking of the sex pheromone bombykol. This process is slightly different in mosquitoes, in which AgamOBP1 and CquiOBP1 lack an extended α-helix at the C terminus, and the hydrogen bond triad might be a pH-sensing lock that clamps the C terminus onto a bound odorant. At low pH, the hydrogen bond disrupts and unlocks the bound ligands9,13. In LmigOBP1, the seventh α-helix forms a wall in the form of an “L”-shaped internal hydrophobic cavity that accommodates linear ligands15. Most studies suggest that a pH-dependent conformational change is crucial for binding and releasing ligands13,17,18, although the binding pocket and conformational changes differ between different OBP-odorant complexes. For example, the C-terminal extensions of the Anopheles gambiae OBP1 (AgamOBP1), AgamOBP7 and Culex quinquefasciatus OBP1 (CquiOBP1) proteins are locked by a hydrogen bond triad composed of the last residue and two other residues (Tyr and His). The hydrogen bond triad is disrupted at lower pH levels, causing the C-terminal loop to shift away from the binding pocket and the to “lid” open. In the silkworm, Bombyx mori, the C-terminal dodecapeptide segment of PBP (BmorPBP) forms an additional helix in the protein core, occupies the corresponding pheromone-binding site and releases ligands at lower pH levels11,13,14.

Many studies show that OBPs exhibit a preliminary selection function for odorants, acting as a first step in olfactory sensing in insects19,20,21. Considering this role as the first level of olfactory ligand recognition, structural and functional studies of OBPs will not only give us a better understanding of insect peripheral olfaction detection but also help us design more effective structure-based pest management compounds. The most elegant method for calculating the binding affinities of active odorants with OBPs is to express and purify recombinant proteins in an E. coli system and test in vitro fluorescence quenching resulting from the replacement of a reporter (usually 1-NPN)22,23. However, due to the large amount of work required to obtain pure proteins, the capacity to screen for insect OBPs is still limited. Furthermore, when testing binding affinities, odorants are usually randomly selected, or physiologically relevant compounds are used as a reference24; thus, there is no preliminary screening method for the massive pool of candidate compounds. Therefore, an effective and reliable guiding tool is needed for large-scale compound screening.

The beetle Ambrostoma quadriimpressum (Coleoptera: Chrysomelidae), a major forest pest, is a monophagous species found in the East Asian region, particularly in northeast China, where it feeds exclusively on the shoots and leaves of elms (Ulmus pumila, Ulmus macrocarpa and Ulmus japonica), leading the tree to suffer irregular death before it is fully grown and causing great damage to the timber industry as well as city afforestation25. In previous RNA sequencing (RNA-Seq) studies, we identified fifteen candidate OBPs with complete ORFs in this species and determined that AquaOBP4 was particularly abundant in the antennae compared with other AquaOBPs as well as highly antenna specific26. To explore the function and selectivity of AquaOBP4, we attempted to express and purify this protein and to screen for active odorants associated with it. Furthermore, we aimed to establish a reliable molecular screening model using the obtained data to provide a better understanding of the interaction between OBP structure and active odorants, which will be useful for large-scale ligand screening in the future.

Materials and Methods

Insects

The A. quadriimpressum population used in this study was initially obtained from elm (Ulmus pumila) trees in Jilin province of China in 2014 (E125.3 N43.9, E124.2 N 44.5). To maintain the maximum population of insects of the same age, we collected the insects within a small range in early June, which is the peak period for newly emerged adults every year. The adults of A. quadriimpressum were maintained at 28 °C and 85% RH under a 12:12 h light:dark cycle in a BIC-300 artificial climate chest (Boxun, Shanghai, China) immediately after collection. The population was fed fresh elm leaves. The population size of adults was greater than three hundred, with a male/female ratio of approximately 1:1. The sample collection was authorized by the forestry bureau of Jilin province. All experiments (RNA extraction, Y-tube and EAG assays) were performed three to five days after the collection of adults from the field, which were maintained as described above.

RNA extraction and cDNA synthesis

The antennae of A. quadriimpressum females were carefully separated with diethyl pyrocarbonate (DEPC)-treated forceps under a stereomicroscope (Motic, Hong Kong, China). Tissues were collected from 60 individuals (at a male/female ratio of 1:1). The collected tissues were stored on ice in a DEPC-treated 1:1 water/ethanol solution until use. The total RNA from homogenized antennae was isolated using the TRIzol reagent (Invitrogen, Carlsbad, CA, USA) according to the manufacturer’s protocol and then subjected to ethanol/isopropanol precipitation in a final solution with 30 μl of RNase-free water. After extraction, the total RNA was quantified with a NanoDrop 2000 (Thermo Fisher Scientific, Waltham, MA, USA) and in a 1% agarose gel. The UV absorption values at 230 nm/260 nm and 260 nm/280 nm were recorded to monitor the purity of the RNA products, and the mRNA smear above the 28S rRNA band was checked to verify RNA integrity. The total antenna RNA was transcribed into cDNA in a reaction (total volume of 20 μl) containing 4 μl of first-strand buffer (250 mM Tris pH 8.3, 375 mM KCl, and 15 mM MgCl2), 1 μl of 10 mM dNTP mix, 1 μl of RNaseOUT, 1 μl of DTT (0.1 M), 1 μl of an oligo-(dT) 20 primer (50 μM) and 1 μl of Superscript III reverse transcriptase (200 units/μl) (Invitrogen, Carlsbad, CA, USA). The cDNA synthesis program consisted of 45 min at 50 °C followed by 15 min at 70 °C.

AquaOBP4 cloning

Based on previous RNA-Seq analysis, fifteen candidate OBPs with complete ORFs were identified in A. quadriimpressum. AquaOBP4 was shown to be the most abundant and antenna-specific OBP among all of the AquaOBPs identified (Table 1). To explore the function and selectivity of AquaOBP4, we attempted to express and purify this protein and screen for active odorants that associated with it. Primers were designed based on RNA-Seq data, and the designed primers amplified complete 145-aa AquaOBP4 sequences from the antennal cDNA of A. quadriimpressum using PrimeSTAR Max DNA Polymerase (TaKaRa, Japan), which resulted in removal of the signal peptide sequence (AquaOBP4 sense: 5′-ATGAACGAGAAACAAATGGA-3′; AquaOBP4 antisense: 5′-TCAATGCAGAAAAAATTTCTC-3′). The PCR thermocycling conditions were as follows: 1 min 30 s at 94 °C followed by 32 cycles of 94 °C for 1 min 30 s, 53 °C for 30 s and 68 °C for 30 s and a final extension of 7 min at 68 °C. The PCR products were run in 1% agarose gels and visualized with Goldview Nucleic Acid Stain (Dingguo, China). The DNA bands with the expected lengths were cut, gel purified with the MiniPrep Gel Extraction Kit (Sangon, China) and cloned into the cloning vector (pUCm-T vector system; Sangon, China). The cloned PCR products were analyzed in an ABI 3730xl DNA Analyzer sequencing system using vector primers. After verifying that the sequence was correct, a 6X-His tag was added to the sense primer for purification when it was subcloned to the pET28a vector using the restriction sites Nco I and BamH I.

Table 1 Detailed information on the candidate AquaOBPs of A. quadriimpressum.

AquaOBP4 expression and purification

The positive clone verified by DNA sequencing was inoculated in 5 mL of LB medium with ampicillin (50 μg/mL) and incubated at 225 rpm at 37 °C for 4 h. Thus, culture was then diluted into 2 L of LB medium, and the cells were grown until the optical density at 600 nm (OD600) reached 0.6. Next, 0.06 mM isopropyl-beta D-thiogalactopyranoside (IPTG) was added, and expression was induced for 2 h at 37 °C, after which the cultures were grown at 15 °C for 12 h. The cells were subsequently collected, and proteins in the periplasmic fraction were extracted with buffer A (20 mM Tris-HCl, 300 mM NaCl, and 1% Triton-100, pH 8.0) using ultrasonication. After centrifugation of the sample at 16,000 × g, the precipitate was dissolved in buffer B (20 mM Tris-HCl, 300 mM NaCl, 1% Triton-100, and 8 M urea, pH 8.0) and loaded onto a Ni-IDA column (BioVision, USA). The column was then washed using 100 ml of buffer C (20 mM Tris-HCl pH 8.0, 2 M NaCl, 0.1% TritonX-100, and 8 M urea) and 50 ml of buffer D (20 mM Tris-HCl pH 8.0, 50 mM NaCl, 0.1% TritonX-100, 10 mM imidazolium, and 8 M urea) to remove impurities, enzymes and non-target proteins. The target proteins were subsequently washed and collected using 50 ml buffer E (20 mM Tris-HCl pH 8.0, 50 mM NaCl, 0.1% TritonX-100, 250 mM imidazolium, and 8 M urea), and the denatured proteins were renatured via dialysis in a gradient of buffer F (50 mM Tris, 50 mM NaCl, 0–6 M urea, 5 mM GSSG, 2 mM GSH, and 1 mM DTT, pH 8.0) (Fig. S1). The 6X-His tag was subsequently cut using thrombin (Ye Sen, China), and the protein was reloaded on a Ni-IDA column to further remove His tag-labeled proteins. The final protein product was dissolved in 20 mM Tris-HCl (pH 8) at a concentration of 1.2 mg/ml for further testing.

Fluorescence binding assays

Binding affinity was primarily tested with a panel of 40 A. quadriimpressum habitat-related compounds, most of which are abundantly distributed in elms and A. quadriimpressum. Additionally, one newly identified repellent, cinnamaldehyde, was included25,27,28. After the molecular model of AquaOBP4 was established, we selected two additional compounds, which also came from elms and A. quadriimpressum, for model validation. All of the compounds were purchased from Sigma-Aldrich (Millipore Sigma, USA) with purities of greater than 95%. The CAS numbers, structures and sources of the tested compounds are listed in Table 2, and detailed information (including content) for each odorant is provided in Table S1.

Table 2 Detailed information for the compounds for testing against AquaOBP4.

N-phenyl-1-naphthylamine (1-NPN) was used as a reporter to measure the affinity of the fluorescent 1-NPN ligand to proteins29. The fluorescence spectra were recorded using a Fluoromax-4 spectrofluorometer (HORIBA Jobin Yvon, New York, USA) with 2 µM protein in 50 mM ammonium acetate (pH 7.2) at room temperature; the compounds and 1-NPN were dissolved in high-performance liquid chromatography (HPLC)-grade methanol at a concentration of 1 mM. Parameter selection was such that the slit widths for both excitation and emission were 10 nm. The reporter was excited at 337 nm, and the emission spectra were recorded between 380 and 500 nm. The AquaOBP4 saturation curve was tested using 1-NPN at 2 to 14 µM, and the affinities of the compounds were estimated using 2 µM 1-NPN and 10 to 70 µM concentrations of the compounds. The curves were then linearized using Scatchard plots. IC50 and Kd values were analyzed using Prism 6 (GraphPad Software, USA), and Ki values were calculated using the following equation: Ki = [IC50]/1 + [1-NPN]/Kd, where [1-NPN] is the concentration of dissociative 1-NPN and Kd is the dissociation constant of the protein/1-NPN complex. To investigate the response of AquaOBP4 at low pH, the fluorescence binding affinity between AquaOBP4 and one of the best ligands was measured at pH 5. Then, 50 mM ammonium acetate (pH 5) was used as a buffer to dissolve the AquaOBP4 protein.

Molecular modeling of AquaOBP4

The modeled structures of AquaOBP4 were obtained using a template of DmelOBP LUSH (1OOH) employing on-line Swiss-model software30. The molecular conformations of all ligands were constructed in Sketch mode and optimized using the Tripos force field and Gasteiger-Hückel charge. The Surflex-Dock algorithm of SYBYL 7.3 was employed for the molecular docking study31. The binding cavity was set as “Auto”, and the Total Score was used to evaluate the binding affinity between the ligand and protein32. All molecular modeling predictions between the putative AquaOBP4 protein and ligand were conducted using the Silicon Graphics® (SGI) Fuel Workstation (Silicon Graphics International Corp., CA, USA).

Electroantennogram responses of binding odorants

To measure the electrophysiological responses of the three best-binding ligands, including dibutyl phthalate, diisobutyl phthalate and butyl octyl phthalate, we used EAG assays33. In brief, glass electrodes fabricated with a PC-10 micropipette puller (Narishige, To kyo, Japan) were filled with a solution of 1 M potassium chloride and 1% polyvinylpyrrolidone. The reference electrode was then inserted into the eye of an A. quadriimpressum individual, and the recording electrode was placed at the truncated tips of the antennae using an MP-12 micromanipulator (Syntech, Kirchzarten, Germany). EAG signals were acquired with a high-impedance AC/DC pre-amplifier, and the data were analyzed with EAG Pro version 2.0 software (Syntech, Kirchzarten, Germany). A syringe cylinder was used as a cartridge to deliver 20 μl of odorant solution in paraffin oil at the desired dose (from lowest to highest). The odorant was loaded on a filter paper strip, which was then placed in a disposable syringe cylinder that served as the cartridge. The odorants were delivered into a consistent airstream (the flow rate was 500 ml/min, and humidity was 60–70%) at an approximate 1-cm distance from the antenna through a 20-G needle. The pulse duration was 0.1 s, and the recording time was set for 4 s, with a 2-min gap between stimuli for the recovery of EAG sensitivity. At least six individuals (three males and three females) were tested for each replicate at concentrations of 0.1, 1, and 10 µg/µl. Negative controls (paraffin oil) were performed first for each preparation, and the EAG peaks of each individual were normalized to the negative control.

Y-tube olfactometer

The responses of A. quadriimpressum to the three best ligands were assessed in a Y-tube olfactometer. The compounds were tested at an intermediate concentration of 10 µg/µl in paraffin oil. In these assays, 20 μl of the odorant was applied to a piece of filter paper (25 × 25 mm), which was then placed in one of the olfactometer arms. In the other arm, 20 μl of paraffin oil was used as a negative control. Thirty beetles (15 males and 15 females) were individually tested, with three replicates for each individual (for a total of 90 replicates). A red light source was used in the darkroom to avoid light interference. For the Y-tube (1-cm diameter, 100-cm base length, 10-cm arm length), one small hole was cut into the base of the Y-tube at 10 cm for releasing beetles. The source of the air that passed through the Y-tube was a pressurized tank of pure air. The air was first filtered through active carbon and then humidified by bubbling it through a bottle filled with double-distilled (dd) water before entering the Y-tube. The air flow traveled through both arms of the Y-tube olfactometer at a speed of 300 ml/min. Each individual beetle was placed at the entrance of the olfactometer (Fig. 1), and their “choice” was recorded when the beetle entered an arm and remained there for 30 s. If an insect made no choice within 3 min, the result was recorded as “no-choice”. The Y-tube olfactometer was cleaned with dehydrated alcohol and allowed to air-dry between trials involving different treatments or odorant dilutions. The positions of the odor sources were exchanged after every 10 beetles. The number of choice insects was used to show the repellency/attraction of odorants in different treatments, and significant differences were tested via two-way ANOVA.

Figure 1
figure 1

Experimental setup for the Y-tube olfactometer. Y-tube (1 cm diameter, 100 cm base length, 10 cm arm length). One small hole was cut into the base of the Y-tube at 10 cm for beetle release. For this experiment, a piece of filter paper (25 × 25 mm) was placed in both arms to test the odorants, and the beetles were then released from the hole at a 10 cm distance.

Results

AquaOBP4 cloning and expression

Based on previous RNA-Seq data, AquaOBP4 showed the highest expression level among all of the 15 identified candidate AquaOBPs. Additionally, AquaOBP4 exhibited the highest antenna-specific value, with an antenna/leg expression ratio of 680.82 (Table 1). Thus, we selected this protein for further binding and modeling studies. AquaOBP4 cloning yielded a 139-aa protein product, including start and stop codons, with six conserved cysteines and typical characteristics of classical OBPs. The results of recombinant pET28a-AquaOBP4 vector sequencing were consistent with the RNA-Seq data, which confirmed our previous results. The predicted MW of AquaOBP4 was 16.06 kDa, and the theoretical pI was 8.31, with a 21-aa signal peptide (http://web.expasy.org/compute_pi and http://www.cbs.dtu.dk/services/SignalP). Based on the results of the first attempt at small-scale expression, we optimized the induced expression conditions to 0.06 mM IPTG and 15 °C for 12 h. After ultrasonication, most of the target proteins were found in the form of inclusion body. Therefore, we decided to purify the AquaOBP4 protein under denaturing conditions. To maintain the natural structure of AquaOBP4 after expression and purification, the target protein was renatured, and the 6X-His tag was cut off. Finally, we obtained 18 mg of the target protein, with a purity of 95% for further study (Grab-IT version 2.5, Japan).

AquaOBP4 was observed to be the most abundant OBP in the antenna of A. quadriimpressum, and its expression level was notably higher than that of any other AquaOBP, with a fragments per kilobase of transcript per million mapped reads (FPKM) value of 1225.47. Predictions revealed that AquaOBP4 is an alkaline protein with a pI of 8.31 and a 21-aa signal peptide.

Binding assays

The binding affinity of the 1-NPN fluorescence reporter to AquaOBP4 was tested first, and the Kd value of AquaOBP4 was shown to equal 2.27 ± 0.16 μM at pH 7.2 (Fig. 2A). Further tests on 40 A. quadriimpressum habitat-related compounds showed that most of the candidate ligands presented no 1-NPN displacement whatsoever. However, five compounds showed 1-NPN displacement. After preliminary screening, five compounds with low binding levels were chosen to evaluate their dissociation constants, including dibutyl phthalate, diisobutyl phthalate, β-ionone, cis-3-hexenyl benzoate and cinnamaldehyde. Dibutyl phthalate and diisobutyl phthalate occur in the leaves of elm trees, and β-ionone and cis-3-hexenyl benzoate came from the feces of adult A. quadriimpressum. Cinnamaldehyde, which was recently identified as a strong repellent of A. quadriimpressum, was also detected. These compounds were tested at final concentrations of 10 to 70 μM to evaluate their dose-dependent effect. The results indicated that the repellent, cinnamaldehyde, showed weak binding to AquaOBP4, with a dissociation constant of 79.26 μM. The other four compounds were ranked as follows in order of the highest binding affinity: dibutyl phthalate (Ki = 24.22 μM), diisobutyl phthalate (Ki = 26.77 μM), β-ionone (Ki = 30.06 μM) and cis-3-hexenyl benzoate (Ki = 35.00 μM) (Table 3 and Fig. 2B).

Figure 2
figure 2

Binding curves of AquaOBP4 with the fluorescence reporter and active odorants. (A) Saturation curve of AquaOBP4 with the fluorescence reporter 1-NPN at pH = 7.2; The X axis ranged from 0–15 μM; a Scatchard plot is inserted under the curves. (B) Saturation curve of AquaOBP4 with the fluorescence reporter 1-NPN at pH = 5.0; The X axis ranged from 0–8 μM; a Scatchard plot is inserted under the curves. (C) Binding curves of five active odorants that bind to AquaOBP4; red: dibutyl phthalate, purple: diisobutyl phthalate, green: β-ionone, yellow: cis-3-hexenyl benzoate; gray: cinnamaldehyde.

Table 3 Binding affinities of AquaOBP4 to tested compounds.

The fluorescence binding assays showed that most of the tested compounds did not bind to AquaOBP4 in the first-round tests, although five ligands presented weak binding. Compared with the binding affinities of compounds from insects of other taxa, such as CquiOBP1 and AgamOBP1, the binding affinity of AquaOBP4 was relatively low9,13, and the results suggested that the best ligands for AquaOBP4 might still not have been identified at this stage. Therefore, we decided to establish a reliable homologous model of AquaOBP4 for predicting the active odorants.

3D model of AquaOBP4

Because the sequence identity between the target protein AquaOBP4 and the template protein DmelOBP LUSH was 31.82%, a 3D model of AquaOBP4 was reasonably constructed based on the crystal structure of DmelOBP LUSH. The AquaOBP4 structure exhibited the overall folding of “classical OBPs” and was mostly helical (Fig. 3A). The 3D structure of AquaOBP4 included six α-helices located between residues Ser23-Lys46 (α1), Thr48-Asp59 (α2), Ala66-Leu79 (α3), Asn87-Pro99 (α4), Asp100-Lys114 (α5), and Asp115-Met134 (α6). Additionally, the 3D structures of the targeted protein AquaOBP4 and the templated protein DmelOBP LUSH aligned well with each other (Fig. 3B); hence, the putative model of AquaOBP4 was successfully used for exploring the binding conformation and binding affinities between AquaOBP4 and ligands.

Figure 3
figure 3

(A) The predicted 3D model of AquaOBP4 based on the crystal structure of DmelOBP LUSH. (B) The alignment plot of the targeted protein AquaOBP4 and the template protein LUSH (yellow, ID: 1OOH).

Molecular docking

As shown in Fig. 4, the docking results for all ligands indicated that there were two different sites in the binding cavity of AquaOBP4. Site 1 was mainly a central binding region between helices α1, α2 and α3 and was surrounded by the unique, long, narrow hydrophobic helix α3. However, Site 2, located to the left of Site 1, far from the center region, was closely positioned behind helices α4 and α5 in AquaOBP4.

Figure 4
figure 4

Two different binding sites, Site 1 (cyan) and Site 2 (pink), for ligands in AquaOBP4. A ligand is indicated in CPK mode with different colors of Dibutyl phthalate (cyan) and cinnamaldehyde (pink). H-bond residues of putative AquaOBP4 are indicated as yellow sticks. Hydrophobic residues of putative AquaOBP4 are indicated as a brown helix.

Four ligands, including dibutyl phthalate, diisobutyl phthalate, β-ionone, and cis-3-hexenyl benzoate, aligned well according to the docking results and were almost located at the same position as Site 1 of the AquaOBP4 model. Another ligand, cinnamaldehyde, was positioned at Site 2 (Fig. 4), far from Site 1 in AquaOBP4. As shown in Table 4, the docking scores of these four ligands were much higher than that of cinnamaldehyde, which agreed with the results of the binding affinity experiment.

Table 4 Structures, binding affinities and binding scores of five ligands in the training set and two ligands in the test set for AquaOBP4.

To further clarify the binding conformation of ligands in AquaOBP4, the binding conformations of all ligands are shown in Fig. 5. The results showed almost the same H-bonding interactions between different O atoms of four ligands with high binding affinities and the key residue Trp60 of helix α2, with distances of 1.785, 2.140, 1.877, and 3.835 Å. It was clear that the predicted binding cavity for the four ligands in Site 1 was a visible hydrophobic region surrounded by three helixes, namely, α1, α2, and α3 (shown in brown), in AquaOBP4. Trp60 was indicated as an important residue for the location and orientation of these ligands in the binding pocket of AquaOBP4.

Figure 5
figure 5

The binding modes of seven active odorants. The binding modes of (A) dibutyl phthalate (cyan), (B) diisobutyl phthalate (purple), (C) β-ionone (orange), (D) cis-3-hexenyl benzoate (blue), (E) Cinnamaldehyde (pink), (F) butyl octyl phthalate (dark green) and (G) isoeugenol (light green) in AquaOBP4.

However, the low binding capacity of cinnamaldehyde alone revealed another H-bond with the Tyr77 residue in helix α3 of Site 2, with a distance of 1.945 Å, indicating that its location was far from the central binding region at Site 1 of AquaOBP4 and yielding a low binding score of 3.85.

Further validation of the docking model

To further verify the prediction capacity of the 3D model of AquaOBP4, two new ligands, butyl octyl phthalate and isoeugenol, were selected based on their structural similarities and were also observed to dock in the same binding cavity of AquaOBP4. The docking results for butyl octyl phthalate showed the highest binding score among all ligands of 8.02. This ligand was also verified to exhibit the best binding capacity to AquaOBP4, with a Ki of 12.70 μM in the binding assay (Fig. 6). Surprisingly, one of the long ester chains of butyl octyl phthalate was located at the central Site 1, whereas another long ester chain stretched to Site 2, with an H-bond of 2.658 Å to Tyr77 in helix α3 of AquaOBP4. Butyl octyl phthalate clearly presented a better match with the whole binding cavity of AquaOBP4 and showed a better binding capacity to AquaOBP4. Furthermore, the docking score of isoeugenol was quite low, at only 3.86, and its binding conformation was positioned only in Site 2 of AquaOBP4. The binding affinity assay revealed a low Ki of 170.54 μM (Fig. 5). We also chose butyl octyl phthalate to perform a binding assay at pH 5. The Kd value of AquaOBP4 with 1-NPN was 1.488 ± 0.08 μM, and the Ki of butyl octyl phthalate was 16.81 μM at pH 5. Therefore, butyl octyl phthalate showed a lower binding affinity at pH 5 than at pH 7, suggesting that a lower pH could decrease its binding affinity for AquaOBP4 (Fig. 6).

Figure 6
figure 6

Binding curves of AquaOBP4 to the chosen compounds from the prediction model. (A) Fluorescence competition curves of butyl octyl phthalate to AquaOBP4 with the fluorescence reporter 1-NPN. The concentration of tested compound is 10 μM, and the final amount of 1-NPN is 2 μM. Ki = 12.7. (B) Binding curves of two compounds from the prediction model.

EAG and behavioral responses of binding ligands

To further test the biological functions of the three best ligands in terms of binding to AquaOBP4, EAG and Y-tube olfactometer tests were performed. Dibutyl phthalate, diisobutyl phthalate and butyl octyl phthalate all elicited strong EAG responses (Fig. 7B). However, no obvious changes in the responses were obtained when different concentrations of three ligands were tested (Fig. 7A), suggesting that even lower concentrations of these analogs could elicit saturated antennal responses. Additionally, no differences in the responses of the three ligands were observed in either the amplitude- or dose-dependent curves. The Y-tube olfactometer results revealed that the three tested odorants could all attract A. quadriimpressum adults (1 µg/µl) because the number of insects was significantly greater on the odorant side than on the control side (p < 0.01). Additionally, no large differences were observed among the three tested odorants, although dibutyl phthalate exhibited slightly higher attractiveness (Fig. 7C).

Figure 7
figure 7

EAG and the behavioral responses of the ligands showing the best binding to AquaOBP4. (A) Dose-dependent curves of the tested ligands (0.1, 1 and 10 µg/µl). (B) Relative EAG responses of the three tested odorants at a concentration of 1 µg/µl. (C) Y-tube olfactometer results for the three tested odorants. The error bars indicate the SEM, and **indicates p < 0.01. N = 90.

Discussion

The fluorescence binding results showed that AquaOBP4 could easily bind phthalic acid derivatives, and the top three binding compounds all contained phthalic acid groups. However, AquaOBP4 did not bind to most phenylacetic acid single-lipid derivatives, such as methyl salicylate, methyl benzoate, acetic acid, and phenyl ester, with one exception, cis-3-hexenyl benzoate. No strict rules were observed, and only the references from the chemical structures themselves were used. Furthermore, AquaOBP4 showed no binding to methyl benzoate or leaf acetate, which are the precursors in the production of cis-3-hexenyl benzoate, but the binding of AquaOBP4 to cis-3-hexenyl benzoate suggested that the OBP-odorant complex was not merely a simple interaction between a specific chemical group and an amino acid residue; rather, the entire structure of the compound interacted with the conformation of OBP to influence the binding dynamics. Additionally, dibutyl phthalate and diisobutyl phthalate are isomers that show approximately the same binding affinity to AquaOBP4; some PRs have been reported to have the capacity to distinguish isomers34,35. Our results suggested that the selectivity of AquaOBP4 was not very strict, and the discriminate isomers were not initiated with a single OBP, which relies more upon ORs or higher olfactory neural pathways. Finally, butyl octyl phthalate is the analog of the isomers dibutyl phthalate and diisobutyl phthalate, although butyl octyl phthalate has four more carbons in one of its carbon chains than the two analogs. However, much stronger binding to AquaOBP4 was observed, indicating that the lengths of the carbon chains around analog chemicals could have an enormous impact on the binding affinity of AquaOBP4. Furthermore, a fluorescence binding assay conducted at pH 5 showed that a lower pH decreases the binding affinity of AquaOBP4, possibly due to a conformational change of AquaOBP4. This result has already been confirmed by structural studies in other insect species13,18. Most of the tested compounds did not bind to AquaOBP4 in the first round of tests, with only five ligands showing weak binding. Compared with the binding affinities of OBPs from other insect taxa, such as CquiOBP1 and AgamOBP1, the binding affinity of AquaOBP4 is relatively low9,13, and the results suggested that the best ligands for AquaOBP4 might not yet have been identified at this stage. Therefore, we decided to establish a reliable homologous model of AquaOBP4 for predicting the active odorants.

Molecular modeling has become one of the most important methods for studying the binding affinities between bioactive molecules and bio-macromolecules. To better understand the binding conformations of different ligands and AquaOBP4, the ligand–putative AquaOBP4 protein complexes were investigated through molecular docking. Two different sites in the binding cavity of the AquaOBP4 model were identified, and these ligand binding sites in the AquaOBP4 modeling were markedly different from the alcohol group from helix α3, helix α6 and the C-terminal strand in the LUSH template36. Additionally, the Trp60 residue of AquaOBP4 was observed to be important for the location and orientation of these ligands, whereas in the template of DmelOBP LUSH with alcohol, a group of the Thr57, Thr52 and Thr48 residues in helix α3 has been reported to form a network of concerted hydrogen bonds between the protein and the alcohol to provide a structural motif to increase the alcohol binding affinity at this site36,37. Invertebrate OBPs generally show a low isoelectric point, but AquaOBP4 is an alkaline protein, which caused us more difficulty in expressing and purifying this protein. As an alkaline OBP, DmelOBP LUSH has been better studied from both neurologic and biochemical aspects. In the present study, we expressed and purified another alkaline protein in a species unrelated to Drosophila. Binding assay and molecular docking results showed that although AquaOBP4 exhibited a high similarity to DmelOBP LUSH at the amino acid level, the selectivity and binding pocket of AquaOBP4 were quite different from those of DmelOBP LUSH38, which might indicate completely different biofunctionalities of these two OBPs.

Notably, AquaOBP4 also showed weak binding to cinnamaldehyde (79.26 μM), which is an antifeedant against A. quadriimpressum. Our previous results showed that cinnamaldehyde could elicit a strong physiological response on the antennae of A. quadriimpressum, and further behavioral tests confirmed the antifeedant function of cinnamaldehyde. This suggest that AquaOBP4 might participates in helping to transport cinnamaldehyde to olfactory receptors at the molecular level, however, it need more experiments to confirm such as by using RNAi technique, for that CSPs could participate in the transport or it could be transported in an unbound state (cinnamaldehyde is slightly soluble in aqueous solutions). Both β-ionone (Ki = 30.06 μM) and cis-3-hexenyl benzoate (Ki = 35.00 μM), which are proteins from the feces of adult A. quadriimpressum, also showed binding to AquaOBP4. Additional experiments are required to investigate the biological functions of these two odorants. The compound used for modeling validation, isoeugenol, showed a very low binding affinity to AquaOBP4, with a Ki of 170.54 μM, whereas another predicted compound, butyl octyl phthalate, showed the best binding affinity to AquaOBP4 detected to date, with a Ki of 12.70 μM. The results of fluorescence binding assays for the two new ligands coincided well with our docking model because the long ester chain of butyl octyl phthalate was located in the central Site 1, whereas another long ester chain stretched to Site 2. In addition, these results indicate that the predicted binding site of our docking model is reliable and can potentially be used in future large-scale compound screening for AquaOBP4. The top three binding compounds, dibutyl phthalate (Ki = 24.22 μM), diisobutyl phthalate (Ki = 26.77 μM) and butyl octyl phthalate (Ki = 12.70 μM), which originate from the leaves of elms, were selected for further electrophysiological and behavioral tests. Other studies have shown that these compounds also occur among the volatiles of some other plants, such as Calycopteris floribunda 39; these three odorants could have all elicited strong EAG responses and had significant attractive effects on adult A. quadriimpressum. Because AquaOBP4 also binds to cinnamaldehyde, which is an antifeedant, AquaOBP4 binding might be correlated with host specificity. Even at this stage, we still cannot firmly conclude that AquaOBP4 is not an aggregation pheromone-related OBP in A. quadriimpressum, similarly to D. melanogaster, because the aggregation pheromone component in A. quadriimpressum remains unclear, and the additional functions of AquaOBP4 require further exploration. However, our results suggest that the phthalic acid derivatives from elm leaves can be detected by A. quadriimpressum and can attract this pest via a mechanism that involves AquaOBP4. Additionally, AquaOBP4 is more likely linked with the foraging behavior of A. quadriimpressum, and the variety of functions of the insect OBP LUSH family is worth investigating in future studies.