Introduction

The common cuckoo Cuculus canorus (henceforth cuckoo) has fascinated humans since millennia1. It is an obligate brood parasite occurring throughout Europe and Asia, from England to Japan2,3,4 with a very large population size5. Still, its population trend is decreasing6. However, the species does not reach the threshold for it to be considered Vulnerable in the Red List categories5. The cuckoo has parasitized more than 200 different bird species3. The occurrence of cuckoos reflects overall species richness of their hosts. For this reason, the cuckoo has recently been suggested to reliably indicate biodiversity7,8,9. However, most importantly, the call of the cuckoo is recognized from its voice by the general public10. As a consequence, the cuckoo’s name evokes its typical call in numerous languages since thousands of years11,12.

Several studies have determined the relevance of auditory and visual signals in intraspecific and interspecific interactions in birds. Generally, at the intraspecific level, birds use calls and plumage to compete for a territory or acquire a mate13,14,15. Birds also use auditory and visual signals during predator–prey interactions16. The male cuckoo is much more vocal than the female, and it produces a very recognizable “cuck-ooo” call during the breeding season, consisting of two notes that are repeated many times17. Recent studies revealed that cuckoo calls vary more among than within individuals18,19. Furthermore, Møller et al. (2016) highlighted that the number of syllables in male cuckoo calls differs depending on environmental characteristics such as soil type, habitat, level of environmental radioactivity and the presence or the absence of male or female conspecifics.

Collection of data at continental and global scales produces hundreds of millions of observations of thousands of species every year20. From websites and social media to smartphones, apps, low-cost sensors and search engines data capture become easier and faster21,22. The recent availability of ‘big data’ associated with predictive analytics have profound implications for biological recording activities. An example of globally disseminated resources is www.xeno-canto.org (Xeno-canto, XC), a non-profit website set up to share recordings of bird sounds worldwide23,24. Its main goal is to improve knowledge of bird sounds by increasing accessibility and diffusion of bird sound recordings from across the globe25. Thereby, Xeno-canto data can help improve our knowledge of the complexity of cuckoo calls, and more specifically, the factors that cause such variation on a broader global scale.

The objectives of this study were to use a dedicated media and citizen science sources to explore at a wider spatial scale (1) the factors that influence the number of syllables in cuckoo calls, (2) whether the number of syllables of cuckoo calls reflects bird species richness, and (3) if cuckoo call activity is determined by environmental factors. Finally, we discuss how dedicated media sources can provide useful scientific data and thereby contribute to ecology and conservation.

Results

From the Xeno-canto database source, we identified and analysed 484 call recordings of the common cuckoo from 29 different countries in Europe, Asia and Africa (Fig. 1). Most records were from Germany (21.0%), Bulgaria (12.4%) and France (9.1%). Call recordings were registered in 22 different years during 1982–2016. More than 60% of call recordings were registered from 2014 to 2016.

Figure 1
figure 1

Geographical distribution of the number of syllables in male cuckoo calls obtained from the Xeno-canto website. The map was produced with GIS soſtware (ArcGIS 10.1)58 with free geographic background data. The geographical distribution of cuckoo Cuculus canorus calls was mapped using freely available data from http://www.xeno-canto.org.

Cuckoo call activity patterns

Cuckoo calls were recorded practically throughout the day (ESM, Fig. 1). The time of the day with more recordings was between 04:00 to 08:00 (27.5% of recordings). Conversely, from 12:00 to 16:00 cuckoo calls reached a minimum at 8.9%. Cuckoo calls were also registered during the night between 20:00 and 04:00 (12.8%) (ESM, Fig. 1). The number of syllables of cuckoo calls was not significantly associated with the date, time of day and duration of records.

Number of syllables of cuckoo calls and bird species richness

Each call recorded was composed on average of 16.99 syllables (max: 123, min: 1, SD: 16.7), and the duration of these records was on average 150.1 seconds (max: 1202.7, min: 3.2, SD: 236.23). A total of 99 bird species were heard in the background when cuckoo calls were recorded of which 49.5% were cuckoo hosts (ESM, Table 1). The number of syllables in cuckoo calls was positively correlated with bird species richness (Table 1).

Table 1 Results of fixed-effect parameters in GLMM, accounting for variation in variables in relation to female recording, date, hour, duration of recording, number of host races, population cuckoo density, species richness and latitude.

Intraspecific influence on number of syllables in cuckoo calls

Female cuckoos produced calls in 7.85% of cases when male calls were recorded, while the number of syllables was negatively correlated with the presence of female cuckoo calls (Fig. 2, Table 1). The number of syllables was positively correlated with the interaction between female calls and the number of host races, but it was not significantly correlated with the other variables (number of host races and population density) (Table 1).

Figure 2
figure 2

Number of syllables in male cuckoo calls in relation to the presence of a female call. The box plots show medians, quartiles, 5- and 95-percentiles and extreme values.

Discussion

The opportunity for access to a huge amount of data from social media and citizen science projects is changing the way in which scientific studies and analyses of biological records are performed. Such studies allow for analyses at broad spatial and temporal scales, as well as promising to connect the efforts of experts and non-experts in science20. We explored the characteristics of the calls of male cuckoos based on long-term recordings and calls from across a wide geographical range from the online website Xeno-canto.org.

Although the calls of the common cuckoo from the Xeno-canto database derive across Europe, Asia and Africa, most recordings were uploaded from Germany, Bulgaria and France. A similar heterogeneous source of data was observed in a project studying the song dialects of the yellowhammer Emberiza citronella26. These findings suggest that big data sources from citizen science projects can be influenced by particular amateur preferences (for species and/or habitats). Such preferences can potentially cause a bias if statistical analyses do not explicitly pay attention in order to minimize such bias.

The characteristics of the cuckoo call have for a long time played an important role in popular folklore. Thus, its common name is very similar among numerous languages, often associated with the particular vocalization of this bird11. The number of syllables in cuckoo calls is correlated with environmental variables and intraspecific interactions27,28. The log-normal distribution of the number of syllables reported here matches that of previous studies. This indicates that most recordings have calls with few syllables. However, the range in the number of syllables per call reported here was much more variable than in long-term studies in Ukraine and Denmark27,28.

It is often assumed that the timing of bird calls is related to weather and the function of calls. Therefore, passerines with calls having a function in the context of sexual selection sing mainly in early morning and at dusk14. The calls of the common cuckoo have not only a function in sexual selection, but may also affect potential hosts29,30. We hypothesize that the timing of calls fits the activity of hosts. The Xeno-canto database consisted of cuckoo calls recorded throughout the day. However, most calls were from early morning to midday, while cuckoo calls reached a minimum in the afternoon similar to the diurnal pattern of singing in other bird species14. This global pattern is also similar to the diurnal pattern of cuckoo calls in the Chernobyl area27. Interestingly, cuckoo calls were also recorded during the night and early morning, as reported in the literature31,32,33.

Cuckoo males produced calls with more syllables when a female cuckoo was present, but also when another male was present27. In contrast, the number of syllables in male cuckoo calls was strongly negatively correlated with the number of syllables in recordings of female cuckoos. However, it is possible that the recordings mainly focused on female calls, but that the calls of males from the vicinity were also heard. However, we cannot exclude the possibility that females are present without producing any calls.

Host-race formation or speciation takes place when a new host is exploited by a parasite, and the phenotype of cuckoo eggs eventually is adapted to those of the host34,35. Therefore, a brood parasite as the common cuckoo is divided into host races, each characterized by egg mimicry of different host species35,36,37,38. In our study, the number of syllables in cuckoo calls was positively correlated with the interaction between female recordings and the number of host races. This fact suggests that the number of syllables in cuckoo calls may reflects the already known relationship between host races and calls of female and male cuckoos, where female cuckoos show strong host preferences, while individual males mate with females that lay in the nests of different hosts34.

Previous studies have already shown that the presence of the cuckoo and its population density are reliable predictors of overall bird diversity, especially of cuckoo hosts7,8. Here we have shown that the number of syllables in cuckoo calls is positively correlated with bird species richness. Moreover, Møller et al. (2017) suggested that cuckoo calls are reliable signals of habitat quality expressed as the number of breeding bird species. Here we extended this pattern to a broad geographical scale, across the entire geographical range of the cuckoo.

In conclusion, a citizen science project using data from Xeno-canto.org allowed collection of a large number of cuckoo calls from Europe, Asia and Africa. When we analyzed Xeno-canto data we found that cuckoo call characteristics were related to the number of syllables of cuckoo calls and species richness (i.e. a positive correlation with bird species richness). There was also an intra-specific positive relationship between the interaction between female calls and the number of hosts races. Previous ecological studies used big data from citizen science and social media projects39,40, and these studies suggest that social media are changing the approach used in ecological and conservation research.

Methods

Data collection

We used Xeno-canto (http://www.xeno-canto.org/) as a data source for common cuckoo call recordings. Even if Xeno-canto mainly focuses on male cuckoos, females are occasionally recorded. Each recording was analysed and the following data were extracted from the dataset: sex (male; female if any; however we have to emphasize that normally recordings were dedicated to males, and that only sometimes were female calls recorded, confirming that the female was present close to the male); number of syllables; time of day (min:sec); date (year-month-day); geographical location (country, GIS coordinates: latitude, longitude); bird species in background; observer/recorder name and catalogue number.

Definition of variables

Number of syllables

We explored the number of syllables that a cuckoo male produces in a single call for each record27,28 (see Fig. 3). The number of syllables defined as each syllable being a single “cu-coo” was counted by waiting for a call to be finished in order to avoid inclusion of overlapping calls. A call was considered to be finished when at least a period of time similar to that of a syllable had passed without a new syllable being produced. We also counted the number of syllables in the subsequent two calls.

Figure 3
figure 3

An example of a sonogram with the count of the number of syllables in a male cuckoo song. The sonogram was prepared by Møller28.

Bird species richness

Bird species richness was the number of bird species singing in the background in each cuckoo call recording.

Cuckoo host species

The list of host species of the cuckoo was recorded from the following sources3,4,36,37,41

Population density of cuckoo

We estimated the population sizes of cuckoos for different countries as reported by Burfield and van Bommel42. Population sizes were reported as a range, and we used the mean for the analyses.

Cuckoo host races and their distribution

The common cuckoo has several host races, each classified by egg mimicry of different host species34,43,44. Moksnes and Røskaft4 categorized all cuckoo eggs produced by 17 different host races based on color and spottiness of the egg (location, color, density and size of spots). Besides, Møller et al.38 added to the compilation of host races by Moksnes and Røskaft five additional host races recently described38,45,46,47. Thus, we classified the distribution of all 22 host races of the cuckoo in the 28 European countries.

Statistical analysis

We tested for spatial auto-correlation (SAC) of the data by using Mantel tests48, based on Monte Carlo permutations (999 randomizations) to test for statistical significance49. In this study, the Mantel statistic rM evaluates the similarity between the number of syllables among sample sites and the geographical distance among these sites50. Study sites were treated as statistically independent observations because the spatial autocorrelation in the data was not significant (rM = −0.033, p = 0.843, n randomizations = 999).

We performed a principal components analysis (PCA) in order to avoid redundancy among variables, as a potential cause of multicollinearity, thereby removing strongly correlated predictors51. From PCA analysis we selected the following eight descriptors: female recording, date, hour of day, duration of record, number of host races, cuckoo population density, species richness and latitude, adequate to describe the effects of the number of syllables of cuckoo calls.

We used Generalized Linear Mixed Models (GLMM), using the package ‘lme4’52,53 to explore variation in cuckoo calls in relation to environmental variables and inter- and intraspecific characteristics. The number of syllables was log10-transformed and models were fitted supposing a log-normal distribution after exploration of the distribution of this variable, as suggested by Box and Cox54 using the package ‘MASS’55. The number of syllables in cuckoo calls was used as the response variable, while female call, date, hour of day, number of host races, cuckoo population density, bird species richness and latitude and longitude were used as predictors. The interaction between year and country were added as a random factor in the statistical model. The ‘best’ model explaining greater variation in data was assessed using Akaike’s Information Criterion (AIC)56. Confidence intervals for the significant variables were calculated by the Wald method, using the package ‘MASS’55. All statistical tests were performed with R software57.