Automated differentiation of mixed populations of free-flying female mosquitoes under semi-field conditions

Johnson, Brian J.; Weber, Michael; Al-Amin, Hasan Mohammad; Geier, Martin; Devine, Gregor J.

doi:10.1038/s41598-024-54233-3

Article
Open access
Published: 12 February 2024

Scientific Reports volume 14, Article number: 3494 (2024)

Abstract

Great advances in automated identification systems, or ‘smart traps’, that differentiate insect species have been made in recent years, yet demonstrations of field-ready devices under free-flight conditions remain rare. Here, we describe the results of mixed-species identification of female mosquitoes using an advanced optoacoustic smart trap design under free-flying conditions. Point-of-capture classification was assessed using mixed populations of congeneric (Aedes albopictus and Aedes aegypti) and non-congeneric (Ae. aegypti and Anopheles stephensi) container-inhabiting species of medical importance. Culex quinquefasciatus, also common in container habitats, was included as a third species in all assessments. At the aggregate level, mixed collections of non-congeneric species (Ae. aegypti, Cx. quinquefasciatus, and An. stephensi) could be classified at accuracies exceeding 90% (% error = 3.7–7.1%). Conversely, error rates increased when analysing individual replicates (mean % error = 48.6; 95% CI 8.1–68.6) representative of daily trap captures and at the aggregate level when Ae. albopictus was released in the presence of Ae. aegypti and Cx. quinquefasciatus (% error = 7.8–31.2%). These findings highlight the many challenges yet to be overcome but also the potential operational utility of optoacoustic surveillance in low diversity settings typical of urban environments.

Introduction

Accurate and timely mosquito surveillance is crucial for improving the effectiveness and evaluation of vector control measures. Unfortunately, traditional surveillance methods are often hindered by a lack of expert human resources and logistical difficulties^1,2. There is therefore considerable utility in the development of robust and reliable automated trapping systems, or “smart” traps. By differentiating between insect species and transmitting counts remotely and in real time³, these traps offer a promising solution for mosquito surveillance^4,5,6.

Most smart trap prototypes rely on either image or acoustic data acquisition, but it is the optoacoustic capture of mosquito wingbeat frequencies (WBF) that has historically received greatest attention^7,8,9,10,11. The greater focus of WBF is likely due to the inherent difficulties in remotely imaging insects to sufficiently control for variations in color, detail, focus, and angle¹². These variations pose a significant challenge when it comes to picturing mosquitoes in a way that reliably reveals their distinguishing morphological features. In response, image-based surveillance has been most widely adopted by citizen science campaigns wherein publicly captured images of mosquitoes are sent to a central repository for analysis by trained medical entomologists or taxonomists^13,14,15. Such campaigns are great public engagement tools with the potential to track broad species distributions and exotic incursions, but they are not yet suitable for traditional surveillance operations. However, substantial progress has been made recently in overcoming historic limitations^{16,17,18,19,20}, and it is expected that the number of field-tested photographic smart traps²¹ will increase significantly in the coming years.

Despite the heightened attention that optoacoustic surveillance has received, it is important to acknowledge that reliance on WBF as a diagnostic marker for species separation presents its own unique challenges. While WBF may differ markedly between species due to sexual selection^22,23,24,25, the range of female WBF is narrow and variations may occur in response to environmental, physiological, and behavioral factors^{26,27,28,29,30}. The resulting frequency overlap amongst closely and distantly related species can greatly impede accurate species differentiation^7,8. While simple logical extensions to classification algorithms, such as the time and place of capture, may help to reduce potential confusion between species, some confusion is likely to remain. In spite of these challenges, WBF-based classification remains promising when there is a clear distinction between the genera being observed and when the number of species being identified is relatively small.

Here, we describe the development, performance and limitations of an innovative optoacoustic smart trap design that enabled us to reliably differentiate congeneric and non-congeneric species under free-flight conditions. We focus on the differentiation of medically and economically important mosquito species inhabiting low-diversity urban environments^31,32. Critically, each release scenario is designed to represent a current real-world surveillance challenge, including:

Scenario 1: Differentiation of the globally invasive, container-breeding mosquitoes Aedes albopictus and Aedes aegypti in the presence of Culex quinquefasciatus. Improving the differentiation of these species is essential for exotic species monitoring, particularly in first ports^33,34, and public health surveillance. Ae. aegypti and Ae. albopictus are globally invasive and are competent vectors of dengue, chikungunya, and other important diseases³⁵ and both are commonly collected together with Cx. quinquefasciatus^{36,37,38,39,40}.
Scenario 2: Differentiation of Anopheles stephensi and Ae. aegypti in the presence of Cx. quinquefasciatus. Discriminating these species is critical to improving our understanding of the relative abundance, distribution, and host-seeking activity of the introduced malaria vector An. stephensi⁴¹ in urban environments in the Horn of Africa. An. stephensi is currently known to share larval habitats with Ae. aegypti in Africa⁴² and it is collected with both Ae. aegypti and Cx. quinquefasciatus, a widely distributed species in Africa⁴³, elsewhere in its range⁴⁴.

Results

Wingbeat frequency datasets

The developed BG-I trap system (Fig. 1) enabled us to generate substantial WBF datasets (Table 1) for each species tested quickly and without manipulation of insects (i.e., tethering) prior to recording. In total, 5,092 individual recordings were produced from 4,500 released mosquitoes, suggesting a ca. 12% recapture rate. The mean WBFs of mosquitoes assayed spanned 302.6 Hz (Table 1), with the lowest WBF recorded from Cx. quinquefasciatus (302.6 Hz) and the highest recorded from Ae. albopictus (741.3 Hz). Overlap in WBF distributions, quantified using the overlapping coefficient (OVL)⁴⁵, occurred for all species (Table 2, Fig. 2a,b). Notable overlaps occurred between Ae. aegypti and Ae. albopictus (OVL = 0.67) and between Cx. quinquefasciatus and An. stephensi (OVL = 0.68).
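The overlapping coefficient can be read as the area shared by two normalised WBF distributions. The following is a minimal sketch only: it assumes a histogram-based estimate with a hypothetical 10 Hz bin width (the estimator and binning used in the study are not described in this section), and the frequencies are synthetic, not the study's data.

```python
import random

def overlap_coefficient(a, b, bin_width=10.0):
    """Histogram estimate of OVL: sum over shared bins of min(p_a, p_b)."""
    lo = min(min(a), min(b))

    def proportions(values):
        counts = {}
        for v in values:
            k = int((v - lo) // bin_width)      # bin index on a common grid
            counts[k] = counts.get(k, 0) + 1
        return {k: c / len(values) for k, c in counts.items()}

    pa, pb = proportions(a), proportions(b)
    return sum(min(pa[k], pb[k]) for k in pa.keys() & pb.keys())

rng = random.Random(0)
# Synthetic wingbeat frequencies in Hz (illustrative assumption only).
species_a = [rng.gauss(550, 40) for _ in range(2000)]
species_b = [rng.gauss(600, 40) for _ in range(2000)]

ovl_ab = overlap_coefficient(species_a, species_b)   # partial overlap
ovl_aa = overlap_coefficient(species_a, species_a)   # identical samples
```

An OVL of 1 means the two distributions are indistinguishable on frequency alone, while 0 means they share no bins. The estimate depends on the chosen bin width, so a kernel density estimate may be preferable for small samples.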

Table 1 Sample size, signal length, and mean female wingbeat frequency observed for each species recorded in the present study.

Table 2 Overlapping coefficients (OVL) among all female wing beat frequency distribution pairs.

Model selection

We found that the tested algorithms, XGBoost and Multilayer Perceptron (MLP), generally performed better in the absence of principal component analysis (PCA), whereas the use of data scaling did not improve general model performance (Tables S1, S2). In both sets of species comparisons, data feature type, i.e., Mel Frequency Cepstral Coefficient (MFCC), Power Spectral Density (PSD), and fundamental frequency, was the only parameter to contribute significantly to classification accuracy (Tables S3, S4). The use of MFCC, in combination with the XGBoost algorithm, produced the most accurate classification models, with few exceptions. The use of PSD with XGBoost generally produced more accurate predictions than the use of MFCC for the classification of Ae. aegypti, Ae. albopictus, and Cx. quinquefasciatus, but this model was found to underperform relative to MFCC-based models when classifying replicate data. Based on the chosen models, Ae. aegypti was predicted to be identified with the greatest sensitivity, whereas Ae. albopictus and An. stephensi were predicted to be identified with the lowest sensitivity (Table 3). The final models chosen for each combination of species used MFCC features without data scaling or cleaning, in combination with the XGBoost classification algorithm.

Table 3 Summary of model classification performance (confusion matrix) for each release scenario.

Application of classification models to free-flight data

Both levels of data cleaning, aimed at removing weak or incomplete recordings, resulted in the loss of true observations and the underreporting of trap totals in both release scenarios (Table 4), with significant losses of true observations when full data cleaning was employed for Scenario 1 (F_{2,33} = 8.7, P < 0.01). The high agreement between raw remote and physical counts indicates a low occurrence of false recordings with the current trap configuration. As a result, raw remote recordings were used for all subsequent analyses.
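The degrees of freedom reported above are what a one-way analysis of variance gives for three cleaning levels across twelve releases (3 − 1 = 2 and 36 − 3 = 33). As a hedged illustration of how such an F statistic is formed, the sketch below computes it from scratch; the input values are randomly generated placeholders, not the study's measurements, and the function name is hypothetical.

```python
import random
import statistics

def one_way_anova(groups):
    """Return (F, df_between, df_within) for a list of samples."""
    all_values = [x for g in groups for x in g]
    grand_mean = statistics.fmean(all_values)
    ss_between = sum(len(g) * (statistics.fmean(g) - grand_mean) ** 2
                     for g in groups)
    ss_within = sum((x - statistics.fmean(g)) ** 2 for g in groups for x in g)
    df_between = len(groups) - 1
    df_within = len(all_values) - len(groups)
    f_stat = (ss_between / df_between) / (ss_within / df_within)
    return f_stat, df_between, df_within

rng = random.Random(0)
# Placeholder % departures for raw, low-level and full cleaning (12 releases each).
levels = [[rng.gauss(mu, 5.0) for _ in range(12)] for mu in (5.0, 10.0, 20.0)]
f_stat, df_between, df_within = one_way_anova(levels)
```

Converting the statistic to a P value requires the F distribution, which is left to a statistics package here.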

Table 4 Departure of BG-I trap counts from physical trap counts at different levels of data cleaning for each free-flight release scenario.

At the aggregate level (all collections summed per release scenario), automated classification of all species released in Scenario 2 was highly accurate, with mean classification accuracy exceeding 90% (% error = 3.74–7.08%; Table 5; Fig. 2c,d). Classification error increased in Scenario 1 (% error = 7.79–31.23%) but was again highly accurate for Ae. aegypti (% error = 7.79, 95% CI 3.0–12.56%). In contrast, classification error increased markedly across individual replicates (mean % error = 44.7; 95% CI 30.0–59.5%; Fig. 2c,d) for both release scenarios. Mean species classification error across all replicates ranged from 29.6 to 63.4% (Table 5).

Table 5 Percent (%) error of individual species classifications at the replicate and aggregate levels for both release scenarios.

Discussion

To our knowledge, we report findings from the first mixed-species test of a field-ready optoacoustic smart trap design under free-flight conditions. While others have achieved automatic classification of mosquito genera⁴⁶, sex^47,48, or both⁴⁹, none have attempted point-of-capture classification of mixed-species populations of free-flying mosquitoes. The work presented highlights both the challenges and opportunities presented by optoacoustic surveillance. For instance, we estimate that, for aggregate collections, non-congeneric species present in release scenario 2 (i.e., Ae. aegypti, Cx. quinquefasciatus, and An. stephensi) could be quantified with accuracies exceeding 90%. However, error rates increased when analysing individual (replicate) collections in both release scenarios and for aggregate collections containing Ae. albopictus in the presence of both Ae. aegypti and Cx. quinquefasciatus. These findings show the challenges yet to be overcome as well as the potential operational utility of optoacoustic surveillance in low diversity settings typical of urban habitats⁵⁰.

Importantly, the study introduces a novel optoacoustic smart trap design that increased the depth of acquired optoacoustic signals (signal length = 86–121 ms) relative to existing designs^49,51. The success of the tested design is attributed to its above-trap sensor placement that allows for prolonged signal acquisition. Acquired signal depth was sufficient for accurate quantification of aggregate collections of mixed-species under free-flight conditions in the presence of significant WBF overlap. The tested design was further found to be robust to false data acquisition, or captures, as supported by the low divergence of physical and remote trap counts across both release scenarios. However, it is important to acknowledge the absence of negative samples, or non-mosquito bycatch, in the present study. Although bycatch in non-light traps, such as the BG-Sentinel and the tested trap design, is low in comparison to traditional light traps (e.g., CDC-miniature light trap)⁵², bycatch collections can still exceed mosquito captures⁵³. Further research is needed to better understand how the presence of different groups of non-target insects may influence classification performance.

The study further presents a rare test of laboratory-trained classification algorithms against small, mixed-species collections typical of daily trap captures. The accuracy of our classification algorithms decreased significantly when classifying low replicate sample sizes relative to larger trap aggregates. This observation is critical since model validation has not been successfully extended beyond large WBF databases containing one or more separately recorded datasets^47,49,54. The lower accuracies observed across individual replicates suggest that the best approach for mixed-species classification is to aggregate trap collections over time or space. For low density species, such as Ae. aegypti^55,56, this may cause significant delays in analysis in the absence of extended trap networks, which may not be suitable for certain surveillance operations.

It is worth noting that the decrease in model performance when applied to individual replicates relative to trap aggregates was unexpected. Although small datasets tend to cause problems of model overfitting or underfitting in machine learning, these problems typically refer to the initial training of the chosen machine learning algorithm^57,58. As reasonably large training data sets were employed, discrepancies in classification performance are harder to explain considering the low expected error rates for the majority of species analyzed. Deviations in classification performance may be related to cohort-to-cohort differences (e.g., body size), which may lead to significant over- or underreporting of species counts when analyzing small sample sizes, such as was observed for Cx. quinquefasciatus in both release scenarios. Such differences may occur despite the use of single colonies reared under controlled rearing conditions, although these differences are not as large as those observed in field populations⁵⁹. Further research into the performance of classification models against cohorts reared under different environmental and resource conditions is warranted.

The study had further limitations that should be noted. Primary among these is the use of established mosquito colonies reared and released under controlled laboratory settings that likely improved classification accuracy relative to that expected under natural field conditions. This limitation is not unique to this study as it is shared by the majority of previous reports^{47,48,49,54,60}. Mosquito WBF fluctuates in response to environmental, physiological, and behavioral factors^{26,27,28,29,30}, and this variability has yet to be adequately accounted for in the field or laboratory by those attempting WBF-based classification. Some of the confusion among species created by this variability may be accounted for by simple logical extensions to classification algorithms, such as the time and place of recording⁷. For instance, separating collections by time of capture may significantly reduce classification errors for the more crepuscular Cx. quinquefasciatus⁶¹ in the presence of day-active mosquitoes like Ae. aegypti⁶² and Ae. albopictus⁶³, but some overlap will persist. Future research should prioritize testing and developing trap designs and classification models under natural field conditions or within semi-field systems situated within the natural environment and exposed to ambient environmental conditions^64,65.
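One way to picture the "time of recording" extension mentioned above is to re-weight a classifier's per-species probabilities by how active each species is expected to be at the hour of capture. The sketch below is an assumption-laden illustration, not part of the study's pipeline: the function name, the probabilities and the activity weights are all hypothetical.

```python
def reweight_by_capture_time(class_probs, activity_prior):
    """Multiply classifier probabilities by an activity prior and renormalise."""
    weighted = {sp: p * activity_prior[sp] for sp, p in class_probs.items()}
    total = sum(weighted.values())
    return {sp: w / total for sp, w in weighted.items()}

# Hypothetical classifier output for one ambiguous midday recording.
classifier_output = {"Ae. aegypti": 0.45, "Cx. quinquefasciatus": 0.55}
# Hypothetical relative host-seeking activity at midday.
midday_prior = {"Ae. aegypti": 0.8, "Cx. quinquefasciatus": 0.2}

posterior = reweight_by_capture_time(classifier_output, midday_prior)
best_guess = max(posterior, key=posterior.get)
```

With these placeholder numbers the day-active species becomes the more likely label, which is the intended effect. Real priors would have to be estimated from local activity data, and, as noted in the text, some overlap between species would still persist.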

In conclusion, WBF-focused smart trap designs find their most obvious application in environments that have low species diversity, such as those that were simulated. However, limitations remain, and an emphasis on field-based studies is needed before integration into traditional surveillance operations. Despite these limitations, the potential operational utility of WBF-focused smart traps remains high.

Methods

Trap and sensor design

The BG-I trap system (Fig. 1) consists of a bespoke infrared-LED array (192 × SFH-4641-Z, 940 nm, ams-OSRAM AG, Munich, Germany) with embedded light sensor (Hamamatsu Photonics K.K., Shizuoka, Japan) mounted 20 cm above a BG-Counter 2® (Biogents AG, Regensburg, Germany). The IR-LED and light sensor array (10 cm in diameter) is supported and operated by a Raspberry Pi 3 (Model B, Raspberry Pi Ltd, Cambridge, UK) and SAMD21G microprocessor (Atmel® Corporation, San Jose, CA, USA). Suction is provided by a 12-V (3.6 W) fan positioned 30 cm below the BG-Counter 2 funnel and at a 90-degree angle from the funnel opening to reduce background light scatter.

During operation, the BG-I system continuously measures the intensity of reflected light from the observation volume, i.e., the distance between the sensor array and trap funnel. An individual observation is initiated by the registration of a mosquito capture by the BG-Counter 2 attached to the trap opening. Once a mosquito is registered, the BG-Counter 2 provides a simultaneous trigger signal to the BG-I recorder to retrieve the data collected from the preceding 200 ms, or just before capture. This data corresponds to the small portion of the light scattered and reflected by the insect during capture and transit through the observation volume. The light sensor, or photodiode, output is then amplified using proprietary electronics and then digitized by the 12-bit Analog-to-Digital Converter on the SAMD21G at the rate of 8 kHz. The digitized signal is transmitted to a Raspberry Pi for intermediate storage and transmission to a cloud server. The signals are converted to .wav files prior to analysis.
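The acquisition logic described above (continuous sampling, a capture trigger, retrieval of the preceding 200 ms, and conversion to .wav) can be sketched with a ring buffer. This is an illustrative reconstruction under stated assumptions, not the BG-I firmware: the class and function names are hypothetical, the ADC readings are assumed to be unsigned 12-bit values, and storage as 16-bit mono PCM is an assumption.

```python
import io
import struct
import wave
from collections import deque

SAMPLE_RATE_HZ = 8_000      # ADC rate stated in the text
WINDOW_MS = 200             # pre-trigger window stated in the text
ADC_BITS = 12               # ADC resolution stated in the text
WINDOW_SAMPLES = SAMPLE_RATE_HZ * WINDOW_MS // 1000   # 1600 samples

class PreTriggerBuffer:
    """Keeps only the most recent WINDOW_SAMPLES readings."""

    def __init__(self):
        self._ring = deque(maxlen=WINDOW_SAMPLES)

    def push(self, adc_value):
        self._ring.append(adc_value)     # oldest reading is dropped when full

    def snapshot(self):
        """Return the buffered window; called when a capture is registered."""
        return list(self._ring)

def adc_to_wav_bytes(samples):
    """Centre unsigned 12-bit readings and store them as 16-bit mono PCM."""
    midpoint = 1 << (ADC_BITS - 1)                     # 2048
    shift = 16 - ADC_BITS                              # scale up to 16-bit range
    pcm = [(s - midpoint) << shift for s in samples]
    out = io.BytesIO()
    with wave.open(out, "wb") as wav:
        wav.setnchannels(1)
        wav.setsampwidth(2)
        wav.setframerate(SAMPLE_RATE_HZ)
        wav.writeframes(struct.pack(f"<{len(pcm)}h", *pcm))
    return out.getvalue()

buf = PreTriggerBuffer()
for i in range(5_000):          # simulated stream of ADC readings
    buf.push(i % 4096)
recording = buf.snapshot()      # the counter's trigger would arrive here
wav_bytes = adc_to_wav_bytes(recording)
```

At 8 kHz, a 200 ms window holds 1,600 samples, so the reported signal lengths of 86–121 ms correspond to only part of each retrieved window.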

Mosquito rearing

A wingbeat database was constructed for each species from established laboratory colonies. All colonies were maintained in environmental chambers at standard conditions (27.0 ± 0.5 °C, 80.0 ± 5.0% RH, and 12:12 (L:D) h photo regime). In general, larvae were reared in 45 cm × 32 cm × 5.5 cm pans containing 1.0 L of reverse osmosis purified water. Aedes sp. and Cx. quinquefasciatus larvae were reared on a diet of Tetramin fish food (Tetra, Melle, Germany). Anopheles stephensi were fed on a standardized diet of Vipan and Micron Nature fry foods (Sera®, Heinsberg, Germany). Emergent adults were placed in 30 × 30 × 30 cm mesh cages (BugDorm, Taichung, Taiwan) and provided 10% sucrose solution ad libitum. Adults were aged 5–7 days at the time of recording.

Data collection and trap operation

Training datasets were generated for each species in single-species, large cohort capture experiments. A minimum of 3–5 cohorts (releases) reared at different times and from different egg stock were used for each species. Cohort sizes varied from 50 to 300 individuals, depending on adult availability. Adults were released into a 1.5 × 2.0 × 1 m mosquito mesh tent containing the BG-I trap station. CO₂ was supplied to the trap station from a 6 kg cylinder at a rate of 300 mL/min. Released mosquitoes were captured for a period of 24 h or until no free-flying mosquitoes could be observed in the tent. No collection bag was attached to the end of the trap body to allow for the possible re-capture of mosquitoes. Training datasets were generated for females only.

Data processing

The BG-I signal was analysed and processed using the librosa⁶⁶, scipy⁶⁷ and peakutils⁶⁸ Python libraries. Three different data feature classes were generated from the BG-I signal, including Mel Frequency Cepstral Coefficients (MFCC)⁶⁹, Power Spectral Density (PSD)⁷⁰, and fundamental frequency⁴⁶. MFCC is derived from the Fourier transformation of the signal and is a representation of the short-term power spectrum of a sound. PSD describes how the power of a signal is distributed over frequency⁷⁰. Fundamental frequency is defined as the lowest frequency of a periodic waveform and is equivalent to the fundamental WBF. All three feature classes have been used in previous optoacoustic studies^3,46,71,72. In general, fundamental and harmonic frequencies were determined by harmonic analysis of the Fourier series, which also enabled us to reliably identify PSD peaks based on multiples of the fundamental frequency estimate.
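To make the link between the fundamental frequency and the PSD peaks concrete, the sketch below estimates a fundamental from a power spectrum and then reads the power at its integer multiples. It is a dependency-free illustration and not the authors' implementation: a naive discrete Fourier transform stands in for the librosa, scipy and peakutils routines, the 200–1000 Hz search band is an assumption, and the test signal is synthetic.

```python
import cmath
import math

SAMPLE_RATE_HZ = 8_000

def power_spectrum(signal, sample_rate=SAMPLE_RATE_HZ):
    """Naive one-sided DFT power spectrum; returns (freqs_hz, power)."""
    n = len(signal)
    mean = sum(signal) / n
    centred = [x - mean for x in signal]              # remove DC offset
    freqs, power = [], []
    for k in range(n // 2 + 1):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(centred))
        freqs.append(k * sample_rate / n)
        power.append(abs(coeff) ** 2 / n)
    return freqs, power

def fundamental_frequency(freqs, power, band=(200.0, 1000.0)):
    """Strongest spectral peak inside an assumed wingbeat search band."""
    in_band = [(p, f) for f, p in zip(freqs, power) if band[0] <= f <= band[1]]
    return max(in_band)[1]

def harmonic_powers(freqs, power, f0, n_harmonics=3):
    """Power at the bins closest to integer multiples of f0."""
    step = freqs[1] - freqs[0]
    return [power[round(h * f0 / step)] for h in range(1, n_harmonics + 1)]

# Synthetic 50 ms signal: 500 Hz fundamental plus two weaker harmonics.
n_samples = 400
signal = [math.sin(2 * math.pi * 500 * t / SAMPLE_RATE_HZ)
          + 0.5 * math.sin(2 * math.pi * 1000 * t / SAMPLE_RATE_HZ)
          + 0.25 * math.sin(2 * math.pi * 1500 * t / SAMPLE_RATE_HZ)
          for t in range(n_samples)]

freqs, power = power_spectrum(signal)
f0 = fundamental_frequency(freqs, power)
harmonics = harmonic_powers(freqs, power, f0)
```

Frequency resolution is the sample rate divided by the number of samples (20 Hz here), so short recordings limit how finely species with close WBFs can be separated on the fundamental alone.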

After initial signal processing, feature data was preprocessed in two ways prior to model training. First, various levels of data scaling, or standardization, were tested by applying different scalers (e.g., standard scaler, normalizer, and robust scaler) to each dataset prior to classification. Second, each data transformation method was tested without and in combination with principal component analysis (PCA). The use of PCA can reduce the number of input variables which can result in a simpler predictive model with better performance⁷³.
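The three scaler families named above differ in what they standardise against. The following sketch shows the underlying arithmetic in plain Python as an illustration only; it is not the study's preprocessing code, the function names are hypothetical, the quartile interpolation rule is an assumption, and PCA is omitted for brevity.

```python
import math
import statistics

def standard_scale(column):
    """Centre on the mean and divide by the (population) standard deviation."""
    mu = statistics.fmean(column)
    sigma = statistics.pstdev(column)
    return [(x - mu) / sigma for x in column]

def robust_scale(column):
    """Centre on the median and divide by the interquartile range."""
    q1, median, q3 = statistics.quantiles(column, n=4, method="inclusive")
    return [(x - median) / (q3 - q1) for x in column]

def normalize_row(row):
    """Rescale one sample (row of features) to unit Euclidean length."""
    norm = math.sqrt(sum(x * x for x in row))
    return [x / norm for x in row]

# One hypothetical feature column containing an outlier.
feature = [480.0, 500.0, 510.0, 520.0, 900.0]
standardised = standard_scale(feature)
robust = robust_scale(feature)
```

Standard and robust scaling act on each feature column, whereas normalisation acts on each sample. The robust variant is less affected by the outlier because the median and interquartile range ignore extreme values.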

Free-flight capture experiments

The ability of the BG-I station to accurately differentiate and record sympatric species was tested under free-flight conditions using two different combinations of species, or release scenarios. The first scenario consisted of the urban, container-inhabiting congeneric species Ae. aegypti and Ae. albopictus in the presence of the commonly co-collected Cx. quinquefasciatus. The second scenario included the non-congeneric species Ae. aegypti, An. stephensi, and Cx. quinquefasciatus. Twelve releases of each grouping were performed during which captured mosquitoes were collected, counted manually, and results compared to the classification results of the preferred classification algorithm. During each individual experiment, 20 females of each species in the necessary combinations were released into a mosquito mesh tent (1.5 × 2.0 × 1 m) containing the BG-I trap station. The one exception was Cx. quinquefasciatus, of which 40 females were released during each replicate. The larger number of Cx. quinquefasciatus was used to ensure adequate captures of this species, as our colonized Cx. quinquefasciatus population maintains natural crepuscular peaks in host-seeking activity and collections at other times of the day can be low. Captured mosquitoes were collected in 120 min intervals and all remaining mosquitoes removed by aspiration. Only female mosquitoes were released during free-flight experiments. Experiments occurred between the hours of 0900–1900, with the majority of releases occurring between 1200 and 1300. All free-flight experiments occurred under controlled environmental conditions (27.0 ± 0.5 °C, 80.0 ± 5.0% RH, and 12:12 (L:D) h photo regime). A collection bag was attached to the end of the trap body for all free-flight experiments.

Post-capture analysis

Captured mosquitoes were chilled at − 20 °C for 30 min before being counted and morphologically identified. Manual species counts were then compared to model predictions of BG-I recordings for each capture interval to determine the difference (%) between remote and manual counts and classification error rates. Three levels of post-capture data cleaning were compared to determine which produced the greatest consensus between remote and manual counts. The three levels are (1) no data cleaning (i.e., raw files analysed), (2) low-level data cleaning to remove recordings with low signal power (< 0.2 × 1000), and (3) full data cleaning to remove recordings with low signal power, those with low harmonic detection (< 3 defined harmonics), and those with low-frequency to power ratios (< 1.0). Signal power was calculated in the time domain using the root mean square (RMS) method⁷⁴ or in the frequency domain (including frequency bands) using Parseval’s theorem⁷⁵. The calculation of RMS power and frequency band power ratios allows for the identification of signals that have low power or a large low-frequency content obscuring the true wingbeat signal.
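The two power calculations and the three cleaning levels described above can be sketched as follows. This is an illustration under stated assumptions rather than the study's code: the function names are hypothetical, the power threshold is read as 0.2 × 1000 = 200 in the same units as the computed RMS, and the band power ratio is treated as a precomputed input because its exact definition is not given here.

```python
import cmath
import math

def rms_power(signal):
    """Root mean square of the samples (time domain)."""
    return math.sqrt(sum(x * x for x in signal) / len(signal))

def spectral_energy(signal):
    """Energy via Parseval's theorem: sum(|X_k|^2) / N equals sum(x_t^2)."""
    n = len(signal)
    total = 0.0
    for k in range(n):
        coeff = sum(x * cmath.exp(-2j * math.pi * k * t / n)
                    for t, x in enumerate(signal))
        total += abs(coeff) ** 2
    return total / n

def keep_recording(rms, n_harmonics, band_power_ratio, level,
                   min_power=0.2 * 1000, min_harmonics=3, min_ratio=1.0):
    """level 0 = raw files, 1 = low-level cleaning, 2 = full cleaning."""
    if level >= 1 and rms < min_power:
        return False
    if level >= 2 and (n_harmonics < min_harmonics
                       or band_power_ratio < min_ratio):
        return False
    return True

# Synthetic tone: amplitude 1000, four full cycles in 64 samples.
tone = [1000 * math.sin(2 * math.pi * 4 * t / 64) for t in range(64)]
tone_rms = rms_power(tone)
```

Because the time-domain and frequency-domain energies agree, the same threshold logic can be applied to the whole signal or to individual frequency bands. Each additional filter can only remove recordings, which is consistent with the underreporting of trap totals observed after cleaning.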

Model selection, training and validation

During the training and validation phase, we evaluated the performance of a particular model for each release scenario based on classification accuracy as determined from cross-validation using 80–20% data splitting (training-validation sets). Validation was performed using the scikit-learn Python package⁷⁶. The number of training iterations exceeded 30 in all cases. The two parent algorithms tested included XGBoost, a decision-tree-based ensemble machine learning algorithm that uses a gradient boosting framework⁷⁷, and Multilayer Perceptron (MLP), a class of feedforward artificial neural network⁷⁸. Each parent algorithm was trained at all levels of data scaling and transformation (i.e., PCA) for an individual feature type. A summary of tested models is presented in Table S1.
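The repeated 80–20 validation loop can be illustrated without the original models. In the sketch below a nearest-centroid rule on a single synthetic feature is a deliberately simple stand-in for XGBoost and MLP, so only the splitting and scoring logic mirrors the description above; all names, the species labels and the data are hypothetical.

```python
import random
import statistics

def split_80_20(records, rng):
    """Shuffle a copy of the records and cut it into 80% / 20%."""
    shuffled = records[:]
    rng.shuffle(shuffled)
    cut = int(0.8 * len(shuffled))
    return shuffled[:cut], shuffled[cut:]

def fit_centroids(train):
    """Stand-in model: mean feature value per label."""
    by_label = {}
    for x, label in train:
        by_label.setdefault(label, []).append(x)
    return {label: statistics.fmean(xs) for label, xs in by_label.items()}

def predict(centroids, x):
    return min(centroids, key=lambda label: abs(x - centroids[label]))

def repeated_validation(records, n_iter=31, seed=0):
    """Accuracy on the held-out 20% for each of n_iter random splits."""
    rng = random.Random(seed)
    scores = []
    for _ in range(n_iter):
        train, valid = split_80_20(records, rng)
        model = fit_centroids(train)
        hits = sum(predict(model, x) == label for x, label in valid)
        scores.append(hits / len(valid))
    return scores

data_rng = random.Random(1)
# Synthetic single-feature records (Hz, label); not the study's data.
records = ([(data_rng.gauss(450, 30), "species_a") for _ in range(200)]
           + [(data_rng.gauss(600, 30), "species_b") for _ in range(200)])
scores = repeated_validation(records)
mean_accuracy = statistics.fmean(scores)
```

Repeating the split gives a distribution of validation accuracies rather than a single figure, which guards against an unusually easy or hard hold-out set.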

Performance of each validated model was assessed against experimental replicates as well as experimental aggregates (i.e., the sum of collections per release scenario). Due to the lack of replication of experimental aggregates, variance in model performance and classification accuracy was assessed using multiple validated models trained from randomly sampled subsets of the training data (n = 5, 75% data retention).

Statistical analyses

Linear mixed-effects models were used to determine the importance of model type, feature type, and data scaler on classification performance. Species was included as a random effect in the models to account for species-level differences in classification accuracy. Analysis of variance was used to determine if the percent difference of physical and BG-I species counts was influenced by the level of data cleaning employed. Prior to analysis, assumptions of normality and homogeneity of variance were assessed using the Shapiro–Wilk test⁷⁹ and Bartlett’s test⁸⁰, respectively. Distributions of test results across replicates were found to satisfy assumptions of normality (W = 0.87–0.98, p = 0.07–0.98) and homogeneity of variance (χ² = 1.39–2.08, p = 0.36–0.50). The null hypothesis for all significance tests was one of no difference among the variables, models, and/or parameters being assessed. The absolute departure of automated and physical species counts was calculated as Percent Error using the formula \(\%e = \left|\frac{p - a}{a}\right| \times 100\), in which a represents the physical (actual) collection total and p represents the predicted collection total for each replicate or aggregate collection being analysed. All statistical analyses were performed using R version 4.0.4⁸¹. Data visualization and cleaning, model training, and post-capture analyses were performed using a custom Python-based (Python Software Foundation, version 3.9) data visualization and analysis platform.
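The Percent Error formula above can be applied at either the replicate or the aggregate level. The short sketch below uses made-up counts for a single species to show why the two levels can diverge; the numbers are illustrative assumptions and not taken from Table 5.

```python
def percent_error(predicted, actual):
    """|(p - a) / a| * 100; undefined when the actual count is zero."""
    return abs((predicted - actual) / actual) * 100

# Hypothetical counts for one species over four replicates.
actual_counts = [10, 8, 12, 9]
predicted_counts = [14, 5, 9, 13]

replicate_errors = [percent_error(p, a)
                    for p, a in zip(predicted_counts, actual_counts)]
mean_replicate_error = sum(replicate_errors) / len(replicate_errors)

# Aggregate level: sum the collections first, then compute the error once.
aggregate_error = percent_error(sum(predicted_counts), sum(actual_counts))
```

With these placeholder counts the mean replicate error is roughly 37%, while the aggregate error is roughly 5%, because over- and under-counts in individual replicates partly cancel once collections are summed. Small replicate totals also make each miscount a large fraction of the denominator.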

Data availability

Data supporting the findings of this study are available in Supplementary Data 1. The Python code file is available as Supplementary Data 2 and may also be found at https://github.com/mweber-bg/BGWingbeatAnalyzerCC.

References

Moise, I., Zulu, L., Fuller, D. & Beier, J. Persistent barriers to implementing efficacious mosquito control activities in the continental United States: Insights from vector control experts. In Current Topics in Neglected Tropical Diseases (ed. Rodriguez-Morales, A.) (IntechOpen, 2018).
Akogbéto, M. C. et al. Six years of experience in entomological surveillance of indoor residual spraying against malaria transmission in Benin: Lessons learned, challenges and outlooks. Malaria J. 14, 1–12 (2015).
Potamitis, I., Eliopoulos, P. & Rigakis, I. Automated remote insect surveillance at a global scale and the internet of things. Robotics 6, 19 (2017).
Kim, D., DeBriere, T. J., Cherukumalli, S., White, G. S. & Burkett-Cadena, N. D. Infrared light sensors permit rapid recording of wingbeat frequency and bioacoustic species identification of mosquitoes. Sci. Rep. 11, 1–9 (2021).
Rydhmer, K. et al. Automating insect monitoring using unsupervised near-infrared sensors. Sci. Rep. 12, 1–11 (2022).
Ong, S.-Q., Ahmad, H., Nair, G., Isawasan, P. & Majid, A. H. A. Implementation of a deep learning model for automated classification of Aedes aegypti (Linnaeus) and Aedes albopictus (Skuse) in real time. Sci. Rep. 11, 1–12 (2021).
Mukundarajan, H., Hol, F. J. H., Castillo, E. A., Newby, C. & Prakash, M. Using mobile phones as acoustic sensors for high-throughput mosquito surveillance. Elife 6, e27854 (2017).
Sinka, M. E. et al. HumBug—An acoustic mosquito monitoring tool for use on budget smartphones. Methods Ecol. Evol. 12, 1848–1859 (2021).
Johnson, B. J. & Ritchie, S. A. The siren’s song: Exploitation of female flight tones to passively capture male Aedes aegypti (Diptera: Culicidae). J. Med. Entomol. 53, 245–248 (2016).
Jakhete, S., Allan, S. & Mankin, R. Wingbeat frequency-sweep and visual stimuli for trapping male Aedes aegypti (Diptera: Culicidae). J. Med. Entomol. 54, 1415–1419 (2017).
Staunton, K. M. et al. A low-powered and highly selective trap for male Aedes (Diptera: Culicidae) surveillance: The male Aedes sound trap. J. Med. Entomol. 58, 408–415 (2021).
Suzuki-Ohno, Y. et al. Deep learning increases the availability of organism photographs taken by citizens in citizen science programs. Sci. Rep. 12, 1–10 (2022).
Sousa, L. B. et al. Citizen science and smartphone e-entomology enables low-cost upscaling of mosquito surveillance. Sci. Total Environ. 704, 135349 (2020).
Bartumeus, F., Oltra, A. & Palmer, J. R. Citizen science: A gateway for innovation in disease-carrying mosquito management? Trends Parasitol. 34, 727–729 (2018).
Kampen, H. et al. Approaches to passive mosquito surveillance in the EU. Parasite Vector 8, 1–13 (2015).
Park, J., Kim, D. I., Choi, B., Kang, W. & Kwon, H. W. Classification and morphological analysis of vector mosquitoes using deep convolutional neural networks. Sci. Rep. 10, 1012. https://doi.org/10.1038/s41598-020-57875-1 (2020).
Couret, J. et al. Delimiting cryptic morphological variation among human malaria vector species using convolutional neural networks. PLoS Negl. Trop. Dis. 14, e0008904. https://doi.org/10.1371/journal.pntd.0008904 (2020).
Brey, J. et al. Modified mosquito programs’ surveillance needs and an image-based identification tool to address them. Front. Trop. Dis. 2, 62. https://doi.org/10.3389/fitd.2021.810062 (2022).
Goodwin, A. et al. Mosquito species identification using convolutional neural networks with a multitiered ensemble model for novel species detection. Sci. Rep. 11, 13656. https://doi.org/10.1038/s41598-021-92891-9 (2021).
Lee, S., Kim, H. & Cho, B.-K. Deep learning-based image classification for major mosquito species inhabiting Korea. Insects 14, 526 (2023).
Article PubMed PubMed Central Google Scholar
Liu, W.-L. et al. An IoT-based smart mosquito trap system embedded with real-time mosquito image processing by neural networks for mosquito surveillance. Front. Bioeng. Biotechnol. 11, 968 (2023).
Google Scholar
Warren, B., Gibson, G. & Russell, I. J. Sex recognition through midflight mating duets in Culex mosquitoes is mediated by acoustic distortion. Curr. Biol. 19, 485–491 (2009).
Article CAS PubMed Google Scholar
Gibson, G. & Russell, I. Flying in tune: Sexual recognition in mosquitoes. Curr. Biol. 16, 1311–1316 (2006).
Article CAS PubMed Google Scholar
Pennetier, C., Warren, B., Dabiré, K. R., Russell, I. J. & Gibson, G. “Singing on the wing” as a mechanism for species recognition in the malarial mosquito Anopheles gambiae. Curr. Biol. 20, 131–136 (2010).
Article CAS PubMed Google Scholar
Cator, L. J., Arthur, B. J., Harrington, L. C. & Hoy, R. R. Harmonic convergence in the love songs of the dengue vector mosquito. Science 323, 1077–1079 (2009).
Article CAS PubMed PubMed Central ADS Google Scholar
Staunton, K. M. et al. A novel methodology for recording wing beat frequencies of untethered male and female Aedes aegypti. J. Am. Mosq. Control Assoc. 35, 169–177 (2019).
Article PubMed Google Scholar
de Nadai, B., Maletzke, A., Corbi, J., Batista, G. & Reiskind, M. The impact of body size on Aedes [Stegomyia] aegypti wingbeat frequency: Implications for mosquito identification. Med. Vet. Entomol. 35, 617–624 (2021).
Article PubMed Google Scholar
Villarreal, S. M., Winokur, O. & Harrington, L. The impact of temperature and body size on fundamental flight tone variation in the mosquito vector Aedes aegypti (Diptera: Culicidae): Implications for acoustic lures. J. Med. Entomol. 54, 1116–1121 (2017).
Article PubMed PubMed Central Google Scholar
Brogdon, W. G. Measurement of flight tone differences between female Aedes aegypti and A. albopictus (Diptera: Culicidae). J. Med. Entomol. 31, 700–703 (1994).
Article CAS PubMed Google Scholar
Pantoja-Sánchez, H., Gomez, S., Velez, V., Avila, F. W. & Alfonso-Parra, C. Precopulatory acoustic interactions of the New World malaria vector Anopheles albimanus (Diptera: Culicidae). Parasite Vector 12, 1–12 (2019).
Article Google Scholar
Perrin, A., Glaizot, O. & Christe, P. Worldwide impacts of landscape anthropization on mosquito abundance and diversity: A meta-analysis. Glob. Change Biol. 28, 6857–6871 (2022).
Article CAS Google Scholar
Ferraguti, M. et al. Effects of landscape anthropization on mosquito community composition and abundance. Sci. Rep. 6, 1–9 (2016).
Article MathSciNet Google Scholar
Schmidt, T. L. et al. Tracking genetic invasions: Genome-wide single nucleotide polymorphisms reveal the source of pyrethroid-resistant Aedes aegypti (yellow fever mosquito) incursions at international ports. Evol. Appl. 12, 1136–1146 (2019).
Article CAS PubMed PubMed Central Google Scholar
Benedict, M. Q., Levine, R. S., Hawley, W. A. & Lounibos, L. P. Spread of the tiger: Global risk of invasion by the mosquito Aedes albopictus. Vector-Borne Zoonot. Dis. 7, 76–85 (2007).
Article Google Scholar
Leta, S. et al. Global risk mapping for major diseases transmitted by Aedes aegypti and Aedes albopictus. Int. J. Infect. Dis. 67, 25–35 (2018).
Article PubMed Google Scholar
Higa, Y. et al. Geographic distribution of Aedes aegypti and Aedes albopictus collected from used tires in Vietnam. J. Am. Mosq. Control Assoc. 26, 1–9 (2010).
Article PubMed Google Scholar
Lopez-Solis, A. D. et al. Aedes aegypti, Ae. albopictus and Culex quinquefasciatus adults found coexisting in urban and semiurban dwellings of Southern Chiapas, Mexico. Insects 14, 565 (2023).
Article PubMed PubMed Central Google Scholar
Leisnham, P. T., LaDeau, S. L. & Juliano, S. A. Spatial and temporal habitat segregation of mosquitoes in urban Florida. PLoS ONE 9(3), e91655 (2014).
Article PubMed PubMed Central ADS Google Scholar
Wilke, A. B. B. et al. Community composition and year-round abundance of vector species of mosquitoes make Miami-Dade County, Florida a receptive gateway for arbovirus entry to the United States. Sci. Rep. 9, 8732. https://doi.org/10.1038/s41598-019-45337-2 (2019).
Article CAS PubMed PubMed Central ADS Google Scholar
Dhimal, M. et al. Risk factors for the presence of chikungunya and dengue vectors (Aedes aegypti and Aedes albopictus), their altitudinal distribution and climatic determinants of their abundance in central Nepal. PLoS Negl. Trop. Dis. 9, e0003545 (2015).
Article PubMed PubMed Central ADS Google Scholar
Sinka, M. E. et al. A new malaria vector in Africa: Predicting the expansion range of Anopheles stephensi and identifying the urban populations at risk. Proc. Natl. Acad. Sci. U.S.A. 117, 24900–24908. https://doi.org/10.1073/pnas.2003976117 (2020).
Article CAS PubMed PubMed Central ADS Google Scholar
Balkew, M. et al. Geographical distribution of Anopheles stephensi in eastern Ethiopia. Parasit. Vector 13, 1–8 (2020).
Article Google Scholar
Samy, A. M. et al. Climate change influences on the global potential distribution of the mosquito Culex quinquefasciatus, vector of West Nile virus and lymphatic filariasis. PLoS ONE 11, e0163863 (2016).
Article PubMed PubMed Central Google Scholar
Mariappan, T., Thenmozhi, V., Udayakumar, P., Bhavaniumadevi, V. & Tyagi, B. An observation on breeding behaviour of three different vector species (Aedes aegypti Linnaeus 1762, Anopheles stephensi Liston 1901 and Culex quinquefasciatus Say 1823) in wells in the coastal region of Ramanathapuram district, Tamil Nadu, India. Int. J. Mosq. Res. 2, 42–44 (2015).
Google Scholar
Inman, H. F. & Bradley, E. L. The overlapping coefficient as a measure of agreement between probability distributions and point estimation of the overlap of two normal densities. Commun. Stat. Theor. Methods 18, 3851–3874 (1989).
Article MathSciNet Google Scholar
Potamitis, I. & Rigakis, I. Measuring the fundamental frequency and the harmonic properties of the wingbeat of a large number of mosquitoes in flight using 2D optoacoustic sensors. Appl. Acoust. 109, 54–60 (2016).
Article Google Scholar
Genoud, A. P., Basistyy, R., Williams, G. M. & Thomas, B. P. Optical remote sensing for monitoring flying mosquitoes, gender identification and discussion on species identification. Appl. Phys. B 124, 1–11 (2018).
Google Scholar
Ouyang, T.-H., Yang, E.-C., Jiang, J.-A. & Lin, T.-T. Mosquito vector monitoring system based on optical wingbeat classification. Comput. Electron. Agric. 118, 47–55 (2015).
Article Google Scholar
González-Pérez, M. I. et al. A novel optical sensor system for the automatic classification of mosquitoes by genus and sex with high levels of accuracy. Parasite Vector 15, 190. https://doi.org/10.1186/s13071-022-05324-5 (2022).
Article Google Scholar
Thongsripong, P. et al. Mosquito vector diversity across habitats in central Thailand endemic for dengue and other arthropod-borne diseases. PLoS Negl. Trop. Dis. 7, e2507 (2013).
Article PubMed PubMed Central Google Scholar
Geier, M. et al. The BG-counter: A smart Internet of Things (IoT) device for monitoring mosquito trap counts in the field while drinking coffee at your desk. In AMCA 82nd Annual Meeting (2016).
Johnson, B. J. et al. Development and field evaluation of the sentinel mosquito arbovirus capture kit (SMACK). Parasite Vector 8, 1–10 (2015).
Article CAS Google Scholar
Staunton, K. M. et al. Outcomes from international field trials with Male Aedes sound traps: Frequency-dependent effectiveness in capturing target species in relation to bycatch abundance. PLoS Negl. Trop. Dis. 15, e0009061 (2021).
Article PubMed PubMed Central Google Scholar
Kirkeby, C. et al. Advances in automatic identification of flying insects using optical sensors and machine learning. Sci. Rep. 11, 1–8 (2021).
Article Google Scholar
Villela, D. A. et al. A Bayesian hierarchical model for estimation of abundance and spatial density of Aedes aegypti. PLoS ONE 10, e0123794 (2015).
Article PubMed PubMed Central Google Scholar
Degener, C. M. et al. Temporal abundance of Aedes aegypti in Manaus, Brazil, measured by two trap types for adult mosquitoes. Mem. Inst. Oswaldo Cruz 109, 1030–1040 (2014).
Article CAS PubMed PubMed Central Google Scholar
Vabalas, A., Gowen, E., Poliakoff, E. & Casson, A. J. Machine learning algorithm validation with a limited sample size. PLoS ONE 14, e0224365 (2019).
Article CAS PubMed PubMed Central Google Scholar
Zhang, Y. & Ling, C. A strategy to apply machine learning to small datasets in materials science. NPJ Comput. Mater. 4, 25 (2018).
Article CAS ADS Google Scholar
Yeap, H. L., Endersby, N. M., Johnson, P. H., Ritchie, S. A. & Hoffmann, A. A. Body size and wing shape measurements as quality indicators of Aedes aegypti mosquitoes destined for field release. Am. J. Trop. Med. Hyg. 89, 78 (2013).
Article PubMed PubMed Central Google Scholar
Genoud, A. P., Gao, Y., Williams, G. M. & Thomas, B. P. A comparison of supervised machine learning algorithms for mosquito identification from backscattered optical signals. Ecol. Inform. 58, 101090 (2020).
Article Google Scholar
Mahanta, B., Handique, R., Dutta, P., Narain, K. & Mahanta, J. Temporal variations in biting density and rhythm of Culex quinquefasciatus in tea agro-ecosystem of Assam, India. Southeast Asian J. Trop. Med. Public Health 30, 804–809 (1999).
CAS PubMed Google Scholar
Trpis, M., McClelland, G., Gillett, J., Teesdale, C. & Rao, T. Diel periodicity in the landing of Aedes aegypti on man. Bull. World Health Organ. 48, 623 (1973).
CAS PubMed PubMed Central Google Scholar
Hawley, W. A. The biology of Aedes albopictus. J. Am. Mosq. Control Assoc. Suppl. 1, 1–39 (1988).
CAS PubMed Google Scholar
Ritchie, S. A. et al. A secure semi-field system for the study of Aedes aegypti. PLoS Negl. Trop. Dis. 5, e988 (2011).
Article PubMed PubMed Central Google Scholar
Ferguson, H. M. et al. Establishment of a large semi-field system for experimental study of African malaria vector ecology and control in Tanzania. Malaria J. 7, 1–15 (2008).
Article Google Scholar
McFee, B. et al. librosa: Audio and music signal analysis in python. In Proc. 14th Python in Science Conference, Vol. 8, 18–25 (2015).
Virtanen, P. et al. SciPy 1.0: Fundamental algorithms for scientific computing in Python. Nat. Methods 17, 261–272 (2020).
Article CAS PubMed PubMed Central Google Scholar
Negri, L. H. & Vestri, C. lucashn/peakutils: v1. 1.0. Zenodo. 10.5281/zenodo.887917 (2017).
Ayvaz, U. et al. Automatic speaker recognition using mel-frequency cepstral coefficients through machine learning. Comput. Mater. Contin. 71, 5511 (2022).
Google Scholar
Martin, R. Noise power spectral density estimation based on optimal smoothing and minimum statistics. IEEE Trans. Speech Audio Process. 9, 504–512 (2001).
Article Google Scholar
Potamitis, I. & Schäfer, P. On classifying insects from their wing-beat: New results. In Ecology and Acoustics: Emergent Properties from Community to Landscape 16–18 (2014).
Vasconcelos, D., Nunes, N. J. & Gomes, J. An annotated dataset of bioacoustic sensing and features of mosquitoes. Sci. Data 7, 382 (2020).
Article PubMed PubMed Central Google Scholar
Howley, T., Madden, M. G., O’Connell, M.-L. & Ryder, A. G. The effect of principal component analysis on machine learning accuracy with high dimensional spectral data. In Applications and Innovations in Intelligent Systems XIII (eds Macintosh, A. et al.) (Springer, 2005).
Google Scholar
Cartwright, K. V. Determining the effective or RMS voltage of various waveforms without calculus. Tech. Interface 8, 1–20 (2007).
Google Scholar
Lu, S.-D., Sian, H.-W., Wang, M.-H. & Liao, R.-M. Application of extension neural network with discrete wavelet transform and Parseval’s theorem for power quality analysis. Appl. Sci. 9, 2228 (2019).
Article Google Scholar
Pedregosa, F. et al. Scikit-learn: Machine learning in Python. J. Mach. Learn. Res. 12, 2825–2830 (2011).
MathSciNet Google Scholar
Chen, T. et al. Xgboost: Extreme gradient boosting. R Package Version 0.4-2, Vol. 1, 1–4 (2015).
Haykin, S. & Lippmann, R. Neural networks, a comprehensive foundation. Int. J. Neural Syst. 5, 363–364 (1994).
Article Google Scholar
Shapiro, S. S. & Wilk, M. B. An analysis of variance test for normality (complete samples). Biometrika 52, 591–611 (1965).
Article MathSciNet Google Scholar
Tobias, S. & Carlson, J. E. Brief report: Bartlett’s test of sphericity and chance findings in factor analysis. Multivar. Behav. Res. 4, 375–377 (1969).
Article CAS Google Scholar
R Core Team. R: A Language and Environment for Statistical Computing (R Foundation for Statistical Computing, 2013).

Acknowledgements

The authors thank Dr. Cameron Webb of The University of Sydney for providing the starting material for the Culex quinquefasciatus colony used in this study. They would also like to thank the Clinical Malaria Group at QIMR-Berghofer for providing egg stock of Anopheles stephensi for the duration of the study.

Author information

These authors contributed equally: Brian J. Johnson and Michael Weber.

Authors and Affiliations

Mosquito Control Laboratory, QIMR Berghofer Medical Research Institute, Brisbane, QLD, 4006, Australia
Brian J. Johnson, Hasan Mohammad Al-Amin & Gregor J. Devine
Biogents AG, Weissenburgstr. 22, 93055, Regensburg, Germany
Michael Weber & Martin Geier

Contributions

B.J.J., M.W., G.J.D. and M.G. conceived the experiments. B.J.J., H.M.A., and M.W. conducted the experiments, and B.J.J. performed statistical analysis and figure generation. B.J.J., G.J.D. and M.W. drafted the manuscript. All authors reviewed and contributed to the final manuscript.

Corresponding author

Correspondence to Brian J. Johnson.

Ethics declarations

Competing interests

MW and MG are employed by Biogents AG (Regensburg, Germany), the commercial supplier and manufacturer of the BG-Counter 2 used in this study. BJJ, GJD, and HMA declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information 1.

Supplementary Information 2.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

About this article

Cite this article

Johnson, B.J., Weber, M., Al-Amin, H.M. et al. Automated differentiation of mixed populations of free-flying female mosquitoes under semi-field conditions. Sci Rep 14, 3494 (2024). https://doi.org/10.1038/s41598-024-54233-3

Received: 09 June 2023
Accepted: 10 February 2024
Published: 12 February 2024
DOI: https://doi.org/10.1038/s41598-024-54233-3
