Acoustic differentiation and classification of wild belugas and narwhals using echolocation clicks

Zahn, Marie J.; Rankin, Shannon; McCullough, Jennifer L. K.; Koblitz, Jens C.; Archer, Frederick; Rasmussen, Marianne H.; Laidre, Kristin L.

doi:10.1038/s41598-021-01441-w

Download PDF

Article
Open access
Published: 12 November 2021

Acoustic differentiation and classification of wild belugas and narwhals using echolocation clicks

Scientific Reports volume 11, Article number: 22141 (2021) Cite this article

3918 Accesses
7 Citations
7 Altmetric
Metrics details

Subjects

Abstract

Belugas (Delphinapterus leucas) and narwhals (Monodon monoceros) are highly social Arctic toothed whales with large vocal repertoires and similar acoustic profiles. Passive Acoustic Monitoring (PAM) that uses multiple hydrophones over large spatiotemporal scales has been a primary method to study their populations, particularly in response to rapid climate change and increasing underwater noise. This study marks the first acoustic comparison between wild belugas and narwhals from the same location and reveals that they can be acoustically differentiated and classified solely by echolocation clicks. Acoustic recordings were made in the pack ice of Baffin Bay, West Greenland, during 2013. Multivariate analyses and Random Forests classification models were applied to eighty-one single-species acoustic events comprised of numerous echolocation clicks. Results demonstrate a significant difference between species’ acoustic parameters where beluga echolocation was distinguished by higher frequency content, evidenced by higher peak frequencies, center frequencies, and frequency minimums and maximums. Spectral peaks, troughs, and center frequencies for beluga clicks were generally > 60 kHz and narwhal clicks < 60 kHz with overlap between 40–60 kHz. Classification model predictive performance was strong with an overall correct classification rate of 97.5% for the best model. The most important predictors for species assignment were defined by peaks and notches in frequency spectra. Our results provide strong support for the use of echolocation in PAM efforts to differentiate belugas and narwhals acoustically.

Allopatric humpback whales of differing generations share call types between foraging and wintering grounds

Article Open access 11 August 2021

Differentiation of two swim bladdered fish species using next generation wideband hydroacoustics

Article Open access 18 May 2021

Baleen whale acoustic presence and behaviour at a Mid-Atlantic migratory habitat, the Azores Archipelago

Article Open access 16 March 2020

Introduction

Only three species of cetaceans occupy the Arctic year-round: the beluga (Delphinapterus leucas), narwhal (Monodon monoceros), and bowhead whale (Balaena mysticetus). As toothed whales (odontocetes), the beluga and narwhal are closely related and are the only two members of the Monodontidae family. They use echolocation to identify objects and locate prey, unlike the bowhead whale, a baleen whale, that has not evolved this sense^1,2. Belugas are circumpolar in their distribution with approximately 22 subpopulations, or stocks, some of which are highly migratory and others resident in both Arctic and sub-Arctic waters^{3,4,5,6,7,8,9}. Most populations migrate from wintering regions among the pack ice and return to the same estuarine summering areas to feed, molt, and give birth^8,10,11. In contrast, narwhals occur in approximately 12 stocks and have a more restricted distribution occupying waters of the Canadian Arctic, West and East Greenland, and western Russia within the Atlantic Arctic^12,13,14. Narwhals in the Canadian Arctic and West Greenland undergo extensive annual migrations with high site fidelity from their summer ranges in fjords of Greenland and Baffin Island to their wintering grounds in Baffin Bay and northern Davis Strait^3,13,15. Across their respective distributions, belugas and most of the world’s narwhals overlap for much of the year in the waters of the Canadian Arctic and Baffin Bay, West Greenland, during their annual migrations (Fig. 1a).

In addition to the use of visual surveys and telemetry to study wild beluga and narwhal populations, Passive Acoustic Monitoring (PAM) uses hydrophones over large spatiotemporal scales to localize regions of ecological importance^{12,16,17,18,19,20}. Despite this, the acoustic profiles of Arctic odontocetes remain largely understudied, particularly during non-summer months, because they reside under sea ice much of the year and the deployment and recovery of acoustic equipment in the Arctic is challenging and costly¹². Further, PAM relies on the ability to distinguish between species solely based on their characteristic sounds (i.e., calls or echolocation clicks), which means individual or a combination of specific sonic identifiers must be known for each species. Call types typically used in acoustic classification of odontocetes include echolocation clicks, burst pulses, whistles, and combined signals^{20,21,22,23,24,25,26}. Echolocation, sometimes referred to as biosonar, is characterized by the emission of high frequency, relatively broadband clicks of high directionality and listening for returning echoes; it is a dominant sense for odontocetes, much like vision is for humans^1,2. Burst pulses, or “buzzes,” are short bursts or a series of broadband pulses (clicks) with a high repetition rate^21,25,27. Whistles are narrow-band and frequency-modulated tonal vocalizations^21,28. Finally, combined signals, or “mixed calls,” include overlaid or paired pulsed and tonal sounds^23,29,30. Acoustic classifiers may use a single call type or multiple call types to differentiate species³¹.

Belugas and narwhals have large vocal repertoires that feature similar acoustic profiles^17,23,27,32, which can make it difficult to distinguish between them acoustically without visual confirmation of species identity. To date, only one study has differentiated between belugas and narwhals acoustically¹⁷ since other PAM studies have focused on regions where they do not co-occur^12,33,34. Frouin-Mouy et al.¹⁷ identified narwhals when they observed whistles or buzzes combined with low-frequency echolocation clicks, and belugas were detected when bird-like whistles were combined with high-frequency clicks. When whistles and buzzes were absent, they used an increase in spectral power around the 20 kHz frequency band to identify narwhal mid-frequency clicks¹⁷. However, their comparison was narrow in scope as it was not their primary research objective. Thorough analyses of echolocation clicks have been conducted separately on belugas^{21,36,37,38,39,40} and narwhals^25,27,41,42, but no comparative study with data from the same region exists. The broadband signals of beluga and narwhal echolocation extend frequencies between 2–150 kHz, but high-frequency clicks have been reported for both species with energies up to 200 kHz^25,39. Measurements of beluga click peak frequencies in captive and wild environments vary widely between 40–120 kHz^{16,36,37,38,39}. Unlike belugas, narwhals have not been held in captivity where controlled experiments can occur, and as a result, narwhal phonation is poorly described compared to belugas and other delphinids. Narwhal echolocation has been characterized by clicks with frequency maxima between 30–70 kHz^25,30,43,44, with some as low as 2–10 kHz and 7–14 kHz³². Still, among these studies, differences in study design, sampling equipment, and location make direct comparisons between beluga and narwhal click characteristics challenging.

As the Arctic continues to experience profound environmental changes due to climate warming^45,46,47, monitoring changes to the seasonal presence of belugas and narwhals is a growing research priority. Projections suggest that trans-Arctic shipping routes like the Northwest Passage and Northern Sea Route will be ice-free by midcentury^48,49, leading to concerns surrounding the effects of increased human activities on Arctic cetaceans^50,51. Importantly, the spatiotemporal overlap of belugas and narwhals also overlaps with the vessel corridor for the Mary River Iron Ore Mine on Baffin Island and the Northwest Passage (Fig. 1a) where potentially large increases in underwater noise are expected in the coming decades^3,52,53,54. Arctic odontocetes are especially at risk to northward human expansion as hydrocarbon development and commercial shipping pose substantial threats via ship strikes, hearing damage, vessel disturbance, and increased underwater noise^3,50,51. Underwater anthropogenic sounds from seismic surveys, drilling and oil production, military sonar, and motorized vessels can substantially disturb the acoustic environment which they depend on^{53,55,56,57,58,59}.

Here, we use data collected from an offshore region in Baffin Bay, West Greenland to compare beluga and narwhal echolocation clicks. Our research objectives were twofold: (1) determine whether spectral properties between beluga and narwhal echolocation are significantly different, and (2) build an acoustic classifier to determine whether belugas and narwhals can be classified using only echolocation parameters. Employing methods to differentiate belugas and narwhals acoustically will equip scientists with the necessary tools to monitor their distribution and habitat-use changes. Further, using PAM to study Arctic odontocetes year-round will help managers mitigate any negative consequences of climate change and vessel traffic on these sentinel species.

Methods

Data collection

Aerial searches for narwhals and belugas were conducted out of Niaqornat, West Greenland, from an Air Greenland AS350 helicopter in spring 2013 (Fig. 1b). All field operations were in accordance with IACUC procedures as approved by the University of Washington (#4155-01, PI Laidre) and the US Office of Naval Research. The Government of Greenland and Greenland Institute of Natural Resources, Nuuk provided K. L. Laidre permission to conduct research in Greenland waters. All methods were carried out in accordance with relevant guidelines and regulations. Weather conditions allowed seven days between 21 and 31 March to search for whales 100–150 km offshore in leads and cracks in the pack ice > 98% concentration. When narwhals or belugas were observed, an aerial search radius of at least 5 km was required to ensure that the ice conditions were safe for landing, during which sightings for other whales and species in the vicinity were made. On the ice, a hydrophone array was deployed at the edge of a lead and all recordings were paired with visual identification of each species. Only one species was observed and recorded at each sampling location, so there were no occurrences in which both narwhals and belugas were present. Since the detection range for echolocation is < 1 km³⁹ and our search radius was > 5 km, we assume our data are single-species recordings.

The hydrophone array was composed of 16 individual Reson TC4013-5 receivers (sensitivity − 215 dB ± 2 dB re 1 V/µPa; flat ± 2 dB frequency response 1–150 kHz) positioned in a vertical, linear orientation. Prior to deployment, each hydrophone was calibrated and its frequency response determined. Each receiver was located 1 m apart along a 2 mm diameter line with the topmost hydrophone at 3 m below the surface and the bottom hydrophone at 18 m. A 4-kg weight attached to the bottom maintained a vertical orientation of the array. While recording, a custom software, MALTA (Microphone Array Localization Tool for Animals by CAE Software and Systems), was used to visually examine recordings of all 16 receivers in real-time. Hydrophone signals were amplified by 35 dB using a custom amplifier, and recordings were converted from analog to digital (500 kHz sampling rate; 16-bit resolution) using two eight-channel National Instruments PXI-6123 A/D converters. No high pass filter was applied, and the hydrophones served as a low pass filter (150 kHz, 1 pole). The clipping level was at 206 dB pp re 1 µPa at 100 kHz^39,41. To safeguard against potential file corruption and facilitate data post-processing, whale recordings were partitioned, loss-less, in 5-s-long sound files.

Event selection and acoustic parameter estimation

Individual echolocation clicks were detected using the open source passive acoustic analysis software PAMGuard (v. 2.01.03f)⁶⁰. Based on localization analyses using the same data^39,41, the highest amplitude signals were recorded on hydrophones 10 and 11 (positioned at 12 and 13 m deep, respectively) and thus provided the highest quality recordings. For species differentiation analyses and classification models, only data from clicks recorded on hydrophone 10 (12 m depth) were used to avoid pseudo-replication. Using the Click Detector module in PAMGuard, clicks were detected from hydrophone 10 recordings with a 14 dB signal-to-noise minimum threshold. To minimize false detections on low-frequency sounds, the detector included a 4th order IIR Butterworth high-pass filter with a 4 kHz corner frequency. No additional frequency filters were used. Click detections were labeled using PAMGuard click classifiers that were defined by specified frequency bins according to the peak frequency within each click. Each classifier was given a unique numeric code that corresponded to the target detection frequency range: 1 (4–20 kHz), 2 (20–50 kHz), 3 (50–70 kHz), 4 (70–100 kHz), 5 (100–150 kHz), 6 (150–250 kHz), and 0 (unclassified).

Groups of clicks (i.e., click trains) were then manually assigned to individual detection “events” using the bearing/time display in PAMGuard’s Viewer Mode (v. 2.01.03f). Although data from only one hydrophone were necessary for our analyses, bearing angles produced from two channels (hydrophones 10 and 11) were needed to effectively visualize and isolate click trains. Since the classification models employed in downstream analyses utilized subsamples of the entire dataset to evaluate model performance—much like a testing and training dataset—we assigned clicks to acoustic events using predetermined time windows to facilitate random partitioning of the data across all recordings. Two-minute intervals were used for narwhal recordings and 1-min intervals were used for beluga recordings in an effort to obtain a similar number of clicks per acoustic event between species, because the beluga recordings had more overlapping click trains. For all events, we included clicks where the maximal intensity of the beam was both centered (i.e., on-axis) and not centered (i.e., off-axis) on the recording system to reflect the nature of most PAM data. Once all clicks were assigned to events, a suite of acoustic parameters was calculated (Table 1). Using the PAMpal package (v. 0.12.6)⁶¹ in R (v. 4.1.0)⁶², echolocation parameter values were calculated for the click sequences selected in PAMGuard using standard measurement criteria^63,64,65. Default PAMpal settings were used that included a 10 kHz Butterworth high-pass filter and a FFT window length of 2.5 ms to use in calculations. Parameter values were calculated for individual clicks and then the mean parameter values determined for a given event, except for inter-click-interval (ICI) where the mode value was approximated for each acoustic event.

Table 1 Descriptions of echolocation acoustic parameters used as variables in species differentiation analyses and classification models. All parameters outlined here were used for the standard Random Forests classification model and BANTER’s first stage call classifier. Variable codes marked with an asterisk (*) indicate which acoustic parameters were used for multivariate differentiation analyses. A full list and the process used by the PAMpal package⁶¹ for calculating click parameters can be found in the R package documentation⁶⁵.

Full size table

Species differentiation

Multivariate analyses were used to assess potential differences in beluga and narwhal echolocation. When both the − 3 dB and − 10 dB measurement was calculated for acoustic parameters, only − 3 dB parameters were considered for differentiation analyses to remove redundant variables (see Table 1). To ensure − 10 dB measurements did not have a substantial contribution, we ran our analyses with both − 3 dB and − 10 dB measurements and our results were unaffected. Each acoustic parameter across all acoustic events was z-score transformed and a Euclidean distance dissimilarity matrix generated. A permutational multivariate analysis of variance (PerMANOVA; 999 permutations) was performed to test for differences between species among mean acoustic parameter values⁶⁶. However, a significant result from a perMANOVA can result from differences in the mean positions of each group in multivariate space, differences in within-group variance in multivariate space, or a combination of the two. Therefore, a permutation test of multivariate homogeneity of dispersions (PERMDISP; 999 permutations) was used to analyze whether within-group variance in multivariate space between beluga and narwhal acoustic parameter values was different^67,68. Differences between beluga and narwhal acoustic parameter values were visualized using principal component analysis (PCA) performed on a correlation matrix⁶⁹. As an ordination technique, PCA reduces a high-dimensional dataset with correlated variables into fewer, uncorrelated dimensions called principal components (PCs)⁷⁰. PCA variable loadings, or eigenvectors, on each PC were used to explore differences among groups. PCA eigenvalues provide the amount of variance explained by each PC and are used to determine which PCs are statistically significant. By applying the broken-stick rule and examining a scree plot, eigenvalues that are higher than what is expected by chance are considered to explain a significant amount of the variance in the original data⁷¹. All statistical analyses were conducted in R using the vegan package (v. 2.5-7)⁷².

Species classification using Random Forests

We used a Random Forests (RF) classification model⁷³ to classify belugas and narwhals using echolocation click parameters. RF has demonstrated to be an effective approach for bioacoustic studies^74,75,76 as it is unaffected by nonparametric data and can accommodate many correlated variables. An RF model consists of many individual decision trees; each of these trees uses a random subset of samples and predictors. By combining thousands of trees, the final aggregated ensemble tree explores differences among species across the entire space of predictors in the original data and maximizes predictive power. RF models do not use a separate testing dataset to assess model accuracy. The random subsample drawn for each individual tree is termed the “in-bag” sample, while the remainder of the data not included are referred to as “out-of-bag” (OOB). OOB samples are used to test model performance by calculating an OOB classification error rate.

The two primary parameters for RF classification are the number of randomly selected predictors to choose from at each node, mtry, and the number of randomly selected samples to classify in each tree, sampsize. We conducted a sensitivity analysis for mtry and sampsize to ensure we used parameter values that prevented overfitting and maximized classification accuracy. The model was fit over all possible combinations of both parameters within possible ranges (mtry: 2–19; sampsize: 2–18) and the model accuracy determined. For all combinations, the correct classification rate did not vary by more than 0.05%, and therefore the model result was not sensitive to mtry and sampsize. Thus, we used the default value for mtry (square root of the number of predictors) and used half of the total sample size for the smallest species class for sampsize.

For our final model, species were assigned a priori to acoustic events in PAMGuard and 20 acoustic parameters were used as predictors to classify belugas and narwhals (Table 1). Each predictor (i.e., acoustic parameter) was the calculated mean across all individual clicks for each acoustic event. A randomized subset of four acoustic parameters (mtry) was used to split observations at each node. Each tree was grown using a randomized subset of nine acoustic events (sampsize) from each species group. The model was structured such that equal subsamples were drawn from each species class without replacement to account for our unbalanced dataset and to capture as much variation in the acoustic data as possible. For passive acoustics, there are occasions when individual whales may dominate the recordings with stereotyped calls; therefore, if we sampled with replacement, our model may underestimate the variation in the data. Model stability was visualized by plotting the trace of cumulative OOB error rate by number of trees and plotting the distribution of the fraction of trees that objects were in-bag. Ten thousand trees were constructed in the forest (ntree) to achieve model stability. To determine whether our model’s correct classification rates were significantly greater than what was expected by chance alone (50%), a binomial test for statistical significance of model performance was conducted. Our classification model was built using the randomForest (v. 4.6-14)⁷⁷ and rfPermute (v. 2.5)⁷⁸ packages in R.

Species classification using BANTER

For comparison with the singular RF model above, we also employed BANTER (Bio-Acoustic eveNT classifiER), a supervised acoustic classification method that can utilize multiple call types²⁸, to classify belugas and narwhals. BANTER consists of two stages of RF models²⁸. The first stage is referred to as the “call classifier” in which individual classification models are built for each call type or detector. From the call classifier, BANTER produces a distribution of classification probabilities for each call type or detector. The second stage is the “event classifier” and uses the results from the first stage to classify acoustic events (collections of calls) to each species.

Although BANTER has the potential to use information from multiple call types (e.g., whistles or burst pulses), our objective was to determine whether belugas and narwhals can be classified using echolocation signals alone. Therefore, our call classifier consisted of separate RF models for each of the echolocation click detectors used in PAMGuard. Two click detectors (1 and 6) from the PAMGuard classification were removed due to insufficient data. For each detector, an RF model was built using 20 acoustic parameters (Table 1) to classify all individual echolocation clicks from that detector. RF models using default settings draw bootstrap samples from one pool containing the entire training dataset from all classes. Therefore, when an RF model is applied to an unbalanced dataset, the classifier will tend to correctly classify the dominant class. To offset this effect, BANTER draws equal random subsamples without replacement from each species group. Each model randomly selected 50 clicks from each species (sampsize) without replacement and each forest contained 20,000 trees. The number of parameters randomly selected to choose from at each node was set to the default (mtry = square root of the total number of parameters). Results from the call classifier provide mean assignment probabilities for each click detector. For example, the click detector 2 model produces the mean probability that clicks from detector 2 will be assigned to a beluga or narwhal. The call classifier also determines the proportion of each detector present for a given event. For example, all events that do not have clicks detected by click detector 2 will have a proportion of zero, and for events that do have clicks from detector 2 will have a proportion value between 0 and 1.

Results from the call classifier (i.e., mean assignment probabilities for each click detector and proportions of each detector per event) were then applied to the second stage event classifier as variables. The mode inter-click interval for each event was added as an additional event-level variable. The event classifier assigned all acoustic events to either a beluga or narwhal using a single RF model. Parameterization of this model was the same as the call classifier (mtry = default; ntree = 20,000), except the number of events randomly selected (sampsize) was set to nine. Relative variable importance was examined for both the call and event classifiers. To determine whether the correct classification rates of our BANTER call and event classifiers were significantly greater than what was expected by chance alone (50%), binomial tests for statistical significance were conducted for each model. Our BANTER classification model was built using the banter (v. 0.9.4)²⁸ and rfPermute⁷⁸ packages in R.

For the previous analyses, recordings were subdivided into 1- and 2-min acoustic events for belugas and narwhals, respectively. As an alternate approach to using standardized temporal intervals, we examined BANTER performance using events assigned to independent acoustic encounters. Acoustic encounters are defined by separate sightings of groups of whales and can vary in recording duration, ambient noise, and environmental and recording conditions (e.g., greater distance or different orientation between whales and receivers). Here, individual encounters were comprised of several acoustic events. If sounds from independent sightings were substantially different, encounters that have a larger number of events will have a greater representation in the species classification model used above. Using the same BANTER framework, we tested whether unique encounters for each species were similar. To accomplish this, a series of BANTER classification training models were created using all events from one encounter for each species (mtry = default; ntree = 10,000; sampsize = half the smallest class sample size) and then used to predict events from encounters that were not included. All unique pairwise combinations of acoustic encounters between species were used to build separate BANTER models. The correlation between the training confusion matrices and the prediction (validation) confusion matrices were then calculated. The presence of correlation (> 0.5) indicates that events from separate encounters are similar because the prediction classification scores are similar to the training scores.

Results

A total of 1:03 h of beluga recordings and 7:38 h of narwhal recordings were made 100 km or more offshore in the pack ice of Baffin Bay (Fig. 1b). Belugas were observed and recorded at two unique locations and narwhals at nine unique locations, in pods of approximately six to thirty individuals. Narwhal recordings used for analyses originated from five independent sightings (i.e., encounters) and beluga recordings from two. We estimate there were approximately 22–36 individual belugas and 63–120 narwhals sampled across all seven encounters. Recordings were paired with visual confirmation of species, so all echolocation clicks were labeled as originating from belugas or narwhals. Species groups were assigned to events created in PAMGuard. A total of 11,319 clicks were assigned to 81 separate events with a median of 71 clicks per event (Supplementary Table S1). Out of the 81 events, 19 were belugas (2537 clicks) and 62 were narwhals (8782 clicks). Twenty acoustic parameters were used to compare beluga and narwhal echolocation (Table 1). Signal parameter measurements for each species across all acoustic events are summarized in Table 2. Beluga acoustic events had a higher mean peak frequency (68.7 ± 10.1 kHz), − 3 dB center frequency (69.6 ± 10.6 kHz), and − 10 dB center frequency (70.2 ± 9.2 kHz) than narwhal events (43.7 ± 7.8 kHz, 43.8 ± 7.8 kHz, and 46.2 ± 8.5 kHz, respectively). Spectral peaks, troughs, and center frequencies (variables: peak, peak2, peak3, trough, trough2, centerHz_10dB, and centerHz_3dB) for beluga clicks were generally > 60 kHz and narwhal clicks < 60 kHz with overlap between 40–60 kHz (Table 2).

Table 2 Summary of signal parameter measurements for beluga and narwhal echolocation for all 81 acoustic events (beluga: n = 19; narwhal: n = 62). Mean ± standard deviation (s.d.) and range (min–max) are provided.

Full size table

Species acoustic differentiation

Results from the perMANOVA reveal significant differences between beluga and narwhal echolocation characteristics (F_1,79 = 42.35, p = 0.001; Supplementary Table S2). Beluga acoustic events also demonstrated a higher dispersion in multivariate space than narwhal acoustic events (PERMDISP, F_1,79 = 8.37, p = 0.004; Supplementary Table S2). The PCA further displayed strong evidence for beluga and narwhal acoustic differentiation; beluga events were more dispersed than narwhal events (Fig. 2a). Within the two-dimensional space of PC1 and PC2, 72.9% of the original trait variation was explained where most was captured by the first dimension (58.4%; Fig. 2a). According to the broken-stick model, PC1 was the only dimension found to explain a significant amount of the variation in the original data. Beluga acoustic events are largely located on the right side of the PCA ordination space, representing higher echolocation click parameter values as compared to narwhal acoustic events that are on the left side (Fig. 2b; Supplementary Fig. S1). These findings are consistent with average spectra, acoustic event spectrograms, and mean parameter values for each species where beluga spectra present more energy at higher frequencies than narwhal spectra (Fig. 3; Table 2).

Species acoustic classification

Both the RF and BANTER species classification models performed well with high OOB correct classification rates (p < 0.001 for all models; Table 3). The overall correct classification rate for the RF model was 92.6% (95% CI: 84.6–97.2%); 94.7% correct classification for beluga acoustic events and 91.9% for narwhal events. One beluga and five narwhal acoustic events were misclassified (Fig. 4a) and the most important predictors for species classification were − 3 dB and − 10 dB frequency minimums and peak frequency (Fig. 4b). Average spectra and waveforms for misclassified events presented signal patterns characteristic of off-axis clicks (i.e., clicks recorded away from the whale’s longitudinal acoustic axis) when compared to those that were correctly classified (Fig. 5).

Table 3 Confusion matrices for the (a) standard Random Forest model, (b) BANTER call classifiers (Detectors 2, 3, 4, and 5), and (c) BANTER event classifier. The total number of correct and incorrect classifications are shown between the groups identified a-priori (rows) and predictions by the classifier (columns). Out-of-bag (OOB) percent correct classification rates (95% confidence interval) demonstrate model performance. Binomial test p-values using a prior probability of 0.5 are provided where an asterisk (*) represents statistical significance (p < 0.05).

Full size table

BANTER’s two-stage approach achieved a higher accuracy than the standard RF model. The BANTER first stage call classifier used here consisted of four separate models for click detectors 2, 3, 4, and 5 (Table 3), each producing mean species assignment probabilities and click detector proportions that were subsequently used as variables for the second stage event classifier (Supplementary Table S3). BANTER’s event classifier achieved a correct classification score of 97.5% (95% CI: 91.4–99.7%); 100% correct classification for beluga acoustic events and 96.8% for narwhal events (Table 3). Among the event-level variables, detectors 2 (20–50 kHz) and 3 (50–70 kHz) were the most important classifiers (Fig. 4e). By examining the relative importance of the predictors (i.e., acoustic parameters; Table 1) for detector 2 and 3 call classifiers, the second peak frequency, the frequency difference between the second and third peaks, and the frequency trough were the most important for classification of individual clicks (Fig. 4d). However, since the event classifier uses mean assignment probabilities from each detector classifier as predictors, the most important variables for the detector 2 and 3 classifiers are not necessarily important variables for the event classifier. Examination of the vote distributions for the detector 2 and 3 call classifiers revealed that the detector 2 call classifier produced confident species classifications (i.e., the majority of samples were classified at high probability) compared to the detector 3 classifier (Supplementary Fig. S2). Overall, the BANTER model demonstrated strong predictive power with only two narwhal event misclassifications (Table 3; Fig. 4c). For correctly classified events, the model had high confidence for its species assignment evidenced by its vote distribution (Supplementary Fig. S3).

The correlation test to examine similarity between acoustic encounters revealed that events among different encounters were similar. A positive correlation between training confusion matrices (from models built using one encounter per species) and validation confusion matrices (from training models that predicted events from encounters not used in training model) indicated similarity between encounters. With the exception of one pair, all encounter pairs had a positive correlation and 6 of the 10 pairs had a correlation greater than 0.5 (Supplementary Table S4).

For both the single RF and BANTER models, the majority of the predictions for beluga and narwhal acoustic events were tightly clustered (Fig. 4a,c). These tightly clustered, correctly classified events were marked by clear differences between beluga and narwhal spectral characteristics. Beluga average spectra tended to be smoother, have a larger bandwidth, and higher -3 and -10 dB frequency minimums and maximums (Figs. 3a and 5a). Conversely, narwhal average spectra presented peaks and notches, a lower bandwidth, and lower frequency minimums and maximums (Figs. 3b and 5c). Average spectra from events that were misclassified and showed properties of both beluga and narwhal spectral characteristics generally had lower signal magnitudes (dB), smaller click sample sizes, and spectral variations that suggest a higher proportion of off-axis clicks (i.e., clicks not centered on the recording equipment; Figs. 3 and 5).

Discussion

Passive acoustics has been a primary method to monitor beluga and narwhal populations year-round in remote regions of the Arctic, providing insight into their seasonal distribution and migratory routes^{16,17,33,38,44}. Yet, prior to this study, the acoustic profiles of these two species were examined separately, despite their shared habitat ranges and acoustic features. Our results provide the first acoustic comparison between belugas and narwhals from the same region and reveal that they can be acoustically differentiated and classified using echolocation clicks alone. Belugas and narwhals have considerable acoustic overlap and variability in their social calls which makes species classification using acoustic parameters challenging^17,23,27,32. Our analysis of beluga and narwhal recordings from the same region, during the same season, and acquired using the same equipment delivers a rare in situ comparison that fills a critical knowledge gap in the acoustic ecology for each species. We also provide a robust BANTER classification model that sets a precedent and foundation for echolocation parameters to be used in future PAM classification efforts, whether for belugas and narwhals or other odontocete species.

Our multivariate analyses showed significant differences between beluga and narwhal echolocation characteristics (perMANOVA: p = 0.001) where beluga acoustic parameters were more dispersed and distinguished by higher frequency content (Figs. 2 and 3). Spectral peaks, troughs, and center frequencies for beluga clicks tended to be > 60 kHz and narwhal clicks < 60 kHz with overlap between 40–60 kHz (Table 2). The greater variation observed among beluga acoustic events may be due to the sample size being smaller (n = 19) than the narwhal dataset (n = 62) or may reflect the true variation in beluga echolocation. Additionally, the lower frequency content observed in narwhal echolocation may indicate that narwhals have a longer echolocation detection range for prey and PAM receivers than belugas. Previous work to quantify beluga echolocation parameters reveals high variability and evidence of biosonar adaptability^{16,36,37,40,79}. For example, estimates of beluga echolocation peak frequencies range widely between 40–120 kHz across captive and wild environments^{34,36,37,38,40,79} and have been shown to change depending on the ambient sound levels of the environment³⁶. Narwhal echolocation has been largely characterized by lower frequency clicks with maxima between 30–70 kHz^30,32,43,44, but high-frequency clicks have been reported where the entire bandwidth can extend up to 200 kHz²⁵. Frouin-Mouy et al.¹⁷ used a difference in spectral power around the 20 kHz frequency band to distinguish between species for mid-frequency clicks with peaks within 30–60 kHz¹⁷. However, their sample sizes were small with 20 beluga and 17 narwhal click trains (162 and 186 individual clicks, respectively), and their recordings came from five different locations and three different receivers which can introduce classification errors³⁵. Our findings are consistent with their preliminary comparison in which narwhal spectra contain more energy between 20–50 kHz than beluga spectra (Fig. 3). Yet, it remains unknown to what degree the variation observed in beluga and narwhal echolocation is due to context-specific active biosonar adjustments made by the whales, the orientation of whales relative to the receiver, differences in sound propagation in the water, population- or individual-specific characteristics, and/or differences in sampling design, recording equipment, or data processing.

Between the two classification models presented in this study, the BANTER model proved to be the strongest classifier with a correct classification rate of 97.5%. The BANTER event classifier accurately predicted all beluga events (100%) and misclassified only two narwhal events (3.2%; Table 3). As a balanced design, the BANTER classifier does not bias towards either species, but it reveals that beluga events are more diagnosable evidenced by the correct prediction of all beluga events by the event classifier. This may occur because a higher proportion of the beluga events—the smaller sample size group—was randomly selected for the training dataset compared to the narwhal group across all n trees. Results from BANTER’s two-stage approach identifies which frequency ranges contain the most useful information for species classification. The call classifiers for detectors 2 (20–50 kHz) and 3 (50–70 kHz) were found to be the most important for BANTER’s event classifier, suggesting clicks with peak frequencies between 20–70 kHz largely contained the information necessary for correct species assignment to acoustic events (Fig. 5a,c). Higher frequencies attenuate faster than lower frequencies¹; therefore, it is possible that the lower frequency classifiers (detectors 2 and 3) were the most important because lower frequency signals were detected more than higher frequency signals. These results also indicate that the BANTER model will be less affected by variations in detection distances between species that would more largely influence the higher frequency detector 4 and 5 classifiers.

While BANTER’s call classifiers used acoustic parameters for predictors, the event classifier used mean assignment probabilities and detector proportions for predictors. The most important predictors for the event classifier were the mean assignment probabilities for detector 2 (Fig. 4e). Given the reasonably high overall classification rate (85.4%) and vote distribution for the detector 2 call classifier, it is likely that the most important variables for this classifier were also important for the event classifier. For the detector 2 call classifier, the second peak frequency, frequency difference between the second and third peaks, and − 3 dB center frequency were the most important variables for the classification of clicks (Fig. 4d). Soldevilla et al.⁸⁰ demonstrated that Risso’s (Grampus griseus) and Pacific white-sided dolphins (Lagenorhynchus obliquidens) could be classified using spectral peak and notch patterns from echolocation clicks. Here, we similarly demonstrate the efficacy of using unique spectral properties to classify belugas and narwhals.

Our results from the single RF model demonstrate how a simple RF classifier using only echolocation parameters can still achieve a high correct species classification rate (92.6%). The RF model accurately predicted beluga events more than narwhal events, with one beluga misclassification (5.3%) and five narwhal misclassifications (8.1%; Table 3). Much like the BANTER model, the lower misclassification rate of beluga events may be in part due to the unbalanced dataset; a higher proportion of the beluga events were randomly selected in the RF model because it was the smaller sample size group. However, it is also possible that beluga events contain more diagnosable features and thus yield higher classification scores. The most important acoustic parameters for species classification in the standard RF model differed from those in BANTER’s detectors 2 and 3 call classifiers. Among all 20 acoustic parameters used in the RF model, frequency minimum (− 3 dB and − 10 dB) and peak frequency were the most important variables for species assignment (Fig. 4b).

Using echolocation predictors for species classification is an effective approach, particularly given its primary sensory role for odontocetes. There is evidence for individual and group-specific call types among belugas^81,82,83,84 and narwhals⁸⁵, which contributes to the variation in their vocal repertoire but also makes acoustic classification challenging. However, it is likely that echolocation clicks are characterized by more stable features due to their sensory function. There are occasions where whales are only echolocating and not producing other call types (e.g., whistles, burst pulses)¹⁷, and as such, a classifier that uses echolocation may offer more reliable year-round detections. While it is likely that the addition of other call types would improve classification^28,35, the BANTER classifier presented here demonstrates belugas and narwhals can be classified by clicks alone. Echolocation is increasingly being incorporated into PAM classification programs^28,74,80,86 and is specifically gaining traction for monitoring belugas^16,19,33. Noise from ice floes, vessels, or industrial activities below 40 kHz can mask lower frequency social calls (e.g., Lammers et al.³⁴ and Halliday et al.⁸⁷). The broadband, high frequency nature of echolocation clicks is largely protected from masking by lower frequency anthropogenic sounds. As vessel traffic is expected to increase in the Arctic, echolocation may be a stronger metric for PAM classification.

Yet, the directionality and high-frequency nature of echolocation clicks pose their own limitations to species detectability and classification. Acoustic receivers will record echolocation sounds when the whale is facing the receiver but are unlikely to detect whales that are pointing their sonar beam away from the recorder. Narwhals are known to forage for Greenland halibut (Reinhardtius hippoglossoides) and Gonatus squid species at depths often > 1000 m in the winter^88,89, so they may be detected by seafloor recorders during these months. While belugas can dive to depths > 500 m and reach the seafloor, they do not prey upon benthic species like narwhals; instead, they target fish species like the Arctic cod (Arctogadus glacialis) and polar cod (Boreogadus saida) that occupy shallower depths^90,91. Therefore, due to the differences in prey selection by belugas and narwhals, they may not be detected by seafloor recorders with the same confidence depending on the recording depth of the instrument. Additionally, the rapid attenuation of ultrasonic echolocation signals¹ results in a smaller detection range for echolocation clicks (< 1 km³⁹) compared to lower-frequency social calls. Multiple receivers and the inclusion of other call types (e.g., whistles) in the classifier may be required to detect whales at a sufficiently large spatial resolution. In addition to deploying more receivers, the instruments required to record echolocation clicks must record higher frequencies with higher sampling rates, which typically depend on more battery power and memory storage. Ongoing development of acoustic receivers to be both economical and reliable will support monitoring populations using echolocation clicks at large scales.

This study provides a unique case in which the methods employed were identical for each species studied, so confounding variables that can substantially affect parameterization, like differences in the frequency responses of recording systems used³⁵, are absent. However, the data used to build the classifiers originated from a limited number of independent acoustic encounters: two beluga and five narwhal. When testing for the similarity between independent acoustic encounters using a BANTER framework, our results indicate that there is likely not a strong effect of encounter on species classification. Additional data from multiple, independent groups of whales are needed to more accurately reflect the true variation in beluga and narwhal echolocation. While we show it is possible to differentiate and classify belugas and narwhals using echolocation clicks, incorporating data from additional independent encounters and other call types will strengthen the classifier against any individual- or pod-specific acoustic characteristics that may be present.

Future work to test the BANTER classification model presented here will elucidate its efficacy for PAM applications. Acoustic recordings used in this study were collected with a hydrophone deployed off the ice-edge positioned at a depth of 12 m. Assuming the new data follow the methodology employed here—including the recording equipment and sampling rate (500 kHz sampling rate; 16-bit resolution)—our model has the potential to be used on novel data. However, it remains unclear how effective the model presented here will be when data collected at depth, such as seafloor recorders where detectability of echolocation may decrease and recordings are expected to have a higher proportion of off-axis clicks, are applied. Additional examination is needed to assess the degree to which differences in recording equipment (e.g., Scripps Institution of Oceanography High-frequency Acoustic Recording Package [HARPs] or Ocean Instruments’ SoundTrap recorders) or receiver depth would affect the outcome of the presented classifier. Likewise, further consideration of which acoustic parameters are more stable across various equipment may mitigate effects of differing platforms^28,35.

Insufficient data on beluga and narwhal populations has made it difficult to determine major threats to these species and thus hindered specific conservation efforts⁹². Narwhals have been identified as one of the most sensitive Arctic marine mammals to climate change due to their specialized habitat niche and restricted distribution range⁹³. Both belugas and narwhals have been shown to have high site fidelity to their summering and wintering grounds, but little is known about what factors (e.g., prey availability or ice conditions) cause these behavior^88,94,95. Furthermore, global climate warming is causing a lengthened open-water season in the Arctic that has galvanized opportunities for increased hydrocarbon exploration and commercial and recreational shipping development^3,48,92,96. Some of the largest unexploited hydrocarbon reserves are found in the Arctic, particularly Baffin Bay^97,98, and most global trade depends on maritime transport. Over half of the distribution ranges of the three endemic Arctic whale species overlaps with known or anticipated offshore hydrocarbon deposits³. Stakeholders of international trade and hydrocarbon exploration need to be informed of what regions are critical to belugas and narwhals for vital processes (e.g., calving, feeding, migrating) in order to mitigate industrial activities¹⁷. Strict regulations on vessel speed, appropriate shipping lanes, and types of permissible activities will be needed to avoid harmful consequences⁹².

As advances in PAM increase efficiency and robustness of data collection and analyses in the Arctic, present and future distributions of belugas and narwhals can be recorded with greater confidence and resolution, thereby improving efforts to monitor their changes in response to climate change and increased underwater noise. The multivariate nature of acoustic parameters lends itself to the interactive effects between variables, making methods like RF classifiers that can handle correlated variables robust for PAM species detection. BANTER increases classification accuracy beyond a standard RF model by using information from misclassifications at the call classification stage to inform the overall event classifier. Results from this study fill a critical data gap in the acoustic ecology of belugas and narwhals and provide a promising approach for future PAM programs. As recording systems advance and training datasets expand, classification programs such as the one presented here will develop greater predictive power and allow acousticians to reliably monitor Arctic odontocete populations long-term. Despite the variation in beluga and narwhal acoustic repertoires, our study suggests that the use of echolocation parameters for species classification may provide the most reliable method to differentiate these iconic species.

Data availability

All code used for model design and implementation are freely available via Github: https://github.com/mjzahn/beluga_narwhal_classifier.

References

Madsen, P. T. & Wahlberg, M. Recording and quantification of ultrasonic echolocation clicks from free-ranging toothed whales. Deep. Res. Part I(54), 1421–1444 (2007).
Google Scholar
Au, W. W. L. Sonar of Dolphins (Springer, 1993).
Google Scholar
Reeves, R. R. et al. Distribution of endemic cetaceans in relation to hydrocarbon development and commercial shipping in a warming Arctic. Mar. Policy 44, 375–389 (2014).
Google Scholar
Hauser, D. D. W. et al. Habitat selection by two beluga whale populations in the Chukchi and Beaufort seas. PLoS One 12, e0172755 (2017).
PubMed PubMed Central Google Scholar
Vacquié-Garcia, J., Lydersen, C., Ims, R. A. & Kovacs, K. M. Habitats and movement patterns of white whales Delphinapterus leucas in Svalbard, Norway in a changing climate. Mov. Ecol. 6, 21 (2018).
PubMed PubMed Central Google Scholar
Lydersen, C., Martin, A. R., Kovacs, K. M. & Gjertz, I. Summer and autumn movements of white whales Delphinapterus leucas in Svalbard, Norway. Mar. Ecol. Prog. Ser. 219, 265–274 (2001).
ADS Google Scholar
Innes, S. et al. Surveys of belugas and narwhals in the Canadian High Arctic in 1996. NAMMCO Sci. Publ. 4, 169–190 (2002).
Google Scholar
Smith, T. G. & Martin, A. R. Distribution and movements of belugas, Delphinapterus leucas, in the Canadian High Arctic. Can. J. Fish. Aquat. Sci. 51, 1653–1663 (1994).
Google Scholar
Hobbs, R. et al. Global review of the conservation status of Monodontid stocks. Mar. Fish. Rev. 81, 1–53 (2019).
ADS Google Scholar
Frost, K. J. & Lowry, L. F. Distribution, abundance, and movements of beluga whales, Delphinapterus leucas, in coastal waters of western Alaska. In Advances in Research on the Beluga Whale, Delphinapterus leucas Vol. 224 (eds Smith, T. G. et al.) 39–57 (Canadian Bulletin of Fisheries and Aquatic Sciences, 1990).
Google Scholar
Lewis, A. E., Hammill, M. O., Power, M., Doidge, D. W. & Lesage, V. Movement and aggregation of eastern Hudson Bay beluga whales (Delphinapterus leucas): A comparison of patterns found through satellite telemetry and Nunavik Traditional Ecological Knowledge. Arctic 62, 13–24 (2009).
Google Scholar
Ahonen, H., Stafford, K. M., Lydersen, C., Steur, L. D. & Kovacs, K. M. A multi-year study of narwhal occurrence in the western Fram Strait—detected via passive acoustic monitoring. Polar Res. 38, 1–14 (2019).
Google Scholar
Heide-Jørgensen, M. P. et al. The migratory behaviour of narwhals (Monodon monoceros). Can. J. Zool. 81, 1298–1305 (2003).
Google Scholar
Richard, P. R. et al. Baffin Bay narwhal population distribution and numbers: Aerial surveys in the Canadian High Arctic, 2002–04. Arctic 63, 85–99 (2010).
Google Scholar
Dietz, R., Heide-Jørgensen, M. P., Richard, P. R. & Acquarone, M. Summer and fall movements of narwhals (Monodon monoceros) from northeastern Baffin Island towards northern Davis Strait. Arctic 54, 244–261 (2001).
Google Scholar
Castellote, M. et al. Monitoring white whales (Delphinapterus leucas) with echolocation loggers. Polar Biol. 36, 493–509 (2013).
Google Scholar
Frouin-Mouy, H., Kowarski, K., Martin, B. & Bröker, K. Seasonal trends in acoustic detection of marine mammals in Baffin Bay and Melville Bay, Northwest Greenland. Arctic 70, 59–76 (2017).
Google Scholar
Sousa-Lima, R. S., Norris, T. F., Oswald, J. N. & Fernandes, D. P. A review and inventory of fixed autonomous recorders for passive acoustic monitoring of marine mammals. Aquat. Mamm. 39, 23–53 (2013).
Google Scholar
Zhong, M. et al. Beluga whale acoustic signal classification using deep learning neural network models. J. Acoust. Soc. Am. 147, 1834–1841 (2020).
ADS PubMed Google Scholar
Castellote, M. et al. Seasonal distribution and foraging occurrence of Cook Inlet beluga whales based on passive acoustic monitoring. Endanger. Species Res. 41, 225–243 (2020).
Google Scholar
Sjare, B. L. & Smith, T. G. The vocal repertoire of white whales, Delphinapterus leucas, summering in Cunningham Inlet, Northwest Territories. Can. J. Zool. 64, 407–415 (1986).
Google Scholar
Chmelnitsky, E. G. & Ferguson, S. H. Beluga whale, Delphinapterus leucas, vocalizations from the Churchill River, Manitoba, Canada. J. Acoust. Soc. Am. 131, 4821–4835 (2012).
ADS PubMed Google Scholar
Marcoux, M., Auger-Méthé, M. & Humphries, M. M. Variability and context specificity of narwhal (Monodon monoceros) whistles and pulsed calls. Mar. Mammal Sci. 28, 649–665 (2012).
Google Scholar
Garland, E. C., Castellote, M. & Berchok, C. L. Beluga whale (Delphinapterus leucas) vocalizations and call classification from the eastern Beaufort Sea population. J. Acoust. Soc. Am. 137, 3054–3067 (2015).
ADS PubMed Google Scholar
Rasmussen, M. H., Koblitz, J. C. & Laidre, K. L. Buzzes and high-frequency clicks recorded from narwhals (Monodon monoceros) at their wintering ground. Aquat. Mamm. 41, 256–264 (2015).
Google Scholar
McCullough, J. L. K., Simonis, A. E., Sakai, T. & Oleson, E. M. Acoustic classification of false killer whales in the Hawaiian islands based on comprehensive vocal repertoire. JASA Express Lett. 1, 071201 (2021).
Google Scholar
Ford, J. K. B. & Fisher, H. D. Underwater acoustic signals of the narwhal (Monodon monoceros). Can. J. Zool. 56, 552–560 (1978).
Google Scholar
Rankin, S. et al. Acoustic classification of dolphins in the California Current using whistles, echolocation clicks, and burst pulses. Mar. Mammal Sci. 33, 520–540 (2017).
Google Scholar
Walmsley, S. F., Rendell, L., Hussey, N. E. & Marcoux, M. Vocal sequences in narwhals (Monodon monoceros). J. Acoust. Soc. Am. 147, 1078–1091 (2020).
ADS PubMed Google Scholar
Shapiro, A. D. Preliminary evidence for signature vocalizations among free-ranging narwhals (Monodon monceros). J. Acoust. Soc. Am. 120, 1695–1705 (2006).
ADS PubMed Google Scholar
Simões Amorim, T. O. et al. Integrative bioacoustics discrimination of eight delphinid species in the western South Atlantic Ocean. PLoS One 14, e0217977 (2019).
PubMed PubMed Central Google Scholar
Stafford, K. M., Laidre, K. L. & Heide-Jørgensen, M. P. First acoustic recordings of narwhals (Monodon monoceros) in winter. Mar. Mammal Sci. 28, 197–207 (2012).
Google Scholar
Castellote, M. et al. Dual instrument passive acoustic monitoring of belugas in Cook Inlet, Alaska. J. Acoust. Soc. Am. 139, 2697–2707 (2016).
ADS PubMed Google Scholar
Lammers, M. O. et al. Passive acoustic monitoring of Cook Inlet beluga whales (Delphinapterus leucas). J. Acoust. Soc. Am. 134, 2497–2504 (2013).
ADS PubMed Google Scholar
Roch, M. A., Stinner-Sloan, J., Baumann-Pickering, S. & Wiggins, S. M. Compensating for the effects of site and equipment variation on delphinid species identification from their echolocation clicks. J. Acoust. Soc. Am. 137, 22–29 (2015).
ADS PubMed Google Scholar
Au, W. W., Penner, R. H., Carder, D. A. & Scronce, B. Demonstration of adaptation in beluga whale echolocation signals. J. Acoust. Soc. Am. 77, 726–730 (1985).
ADS CAS PubMed Google Scholar
Au, W. W. L., Penner, R. H. & Turl, C. W. Propagation of beluga echolocation signals. J. Acoust. Soc. Am. 82, 807–813 (1987).
ADS CAS PubMed Google Scholar
Roy, N., Simard, Y., Gervaise, C. & Dtn, E. 3D tracking of foraging belugas from their clicks: Experiment from a coastal hydrophone array. Appl. Acoust. 71, 1050–1056 (2010).
Google Scholar
Zahn, M. J., Laidre, K. L., Stilz, P., Rasmussen, M. H. & Koblitz, J. C. Vertical sonar beam width of wild belugas (Delphinapterus leucas) in West Greenland. PLoS One 16, e0257054 (2021).
CAS PubMed PubMed Central Google Scholar
Rutenko, A. N. & Vishnyakov, A. A. Time sequences of sonar signals generated by a beluga whale when locating underwater objects. Acoust. Phys. 52, 314–323 (2006).
ADS Google Scholar
Koblitz, J. C., Stilz, P., Rasmussen, M. H. & Laidre, K. L. Highly directional sonar beam of narwhals (Monodon monoceros) measured with a vertical 16 hydrophone array. PLoS One 11, e0162069 (2016).
PubMed PubMed Central Google Scholar
Podolskiy, E. A. & Sugiyama, S. Soundscape of a narwhal summering ground in a glacier fjord (Inglefield Bredning, Greenland). J. Geophys. Res. Ocean. 125, e2020JC016116 (2020).
ADS Google Scholar
Miller, L. A., Pristed, J., Mohl, B. & Surlykke, A. The click-sounds of narwhals (Monodon monoceros) in Inglefield Bay, Northwest Greenland. Mar. Mammal Sci. 11, 491–502 (1995).
Google Scholar
Marcoux, M., Auger-Methe, M., Chmelnitsky, E., Ferguson, S. H. & Humphries, M. M. Local passive acoustic monitoring of narwhal presence in the Canadian Arctic: A pilot project. Arctic 64, 307–316 (2011).
Google Scholar
Overland, J. et al. The urgency of Arctic change. Polar Sci. 21, 6–13 (2019).
ADS Google Scholar
Comiso, J. C. & Hall, D. K. Climate trends in the Arctic as observed from space. WIREs Clim. Change 5, 389–409 (2014).
Google Scholar
Kwok, R. Arctic sea ice thickness, volume, and multiyear ice coverage: Losses and coupled variability (1958–2018). Environ. Res. Lett. 13, 105005 (2018).
Google Scholar
Overland, J. E. & Wang, M. When will the summer Arctic be nearly sea ice free?. Geophys. Res. Lett. 40, 2097–2101 (2013).
ADS Google Scholar
Smith, L. C. & Stephenson, S. R. New Trans-Arctic shipping routes navigable by midcentury. Proc. Natl. Acad. Sci. U.S.A. 110, E1191–E1195 (2013).
ADS CAS PubMed PubMed Central Google Scholar
Hauser, D. D. W., Laidre, K. L. & Stern, H. L. Vulnerability of Arctic marine mammals to vessel traffic in the increasingly ice-free Northwest Passage and Northern Sea Route. Proc. Natl. Acad. Sci. U.S.A. 115, 7617–7622 (2018).
CAS PubMed PubMed Central Google Scholar
Halliday, W. D., Pine, M. K. & Insley, S. J. Underwater noise and Arctic marine mammals: Review and policy recommendations. Environ. Rev. 28, 438–448 (2020).
Google Scholar
Halliday, W. D. et al. Underwater sound levels in the Canadian Arctic, 2014–2019. Mar. Pollut. Bull. 168, 112437 (2021).
CAS PubMed Google Scholar
Kochanowicz, Z. et al. Using western science and Inuit knowledge to model ship-source noise exposure for cetaceans (marine mammals) in Tallurutiup Imanga (Lancaster Sound), Nunavut, Canada. Mar. Policy 130, 104557 (2021).
Google Scholar
Stewart, R. E. A., Lesage, V., Lawson, J. W., Cleator, H. & Martin, K. A. Science technical review of the draft Environmental Impact Statement (EIS) for Baffinland’s Mary River Project (Canadian Science Advisory Secretariat, Fisheries and Oceans Canada, 2011).
Google Scholar
Heide-Jørgensen, M. P., Hansen, R. G., Westdal, K., Reeves, R. R. & Mosbech, A. Narwhals and seismic exploration: Is seismic noise increasing the risk of ice entrapments?. Biol. Conserv. 158, 50–54 (2013).
Google Scholar
Blackwell, S. B., Greene, C. R. & Richardson, W. J. Drilling and operational sounds from an oil production island in the ice-covered Beaufort Sea. J. Acoust. Soc. Am. 116, 3199–3211 (2004).
ADS PubMed Google Scholar
Yang, W. et al. Anthropogenic sound exposure-induced stress in captive dolphins and implications for cetacean health. Front. Mar. Sci. 8, 606736 (2021).
Google Scholar
Erbe, C. & Farmer, D. M. Zones of impact around icebreakers affecting beluga whales in the Beaufort Sea. J. Acoust. Soc. Am. 108, 1332–1340 (2000).
ADS CAS PubMed Google Scholar
Heide-Jørgensen, M. P. et al. Behavioral response study on seismic airgun and vessel exposures in narwhals. Front. Mar. Sci. 8, 658173 (2021).
Google Scholar
Gillespie, D., Mellinger, D. K., Gordon, J. & Al, E. PAMGUARD: Semiautomated, open source software for real-time acoustic detection and localization of cetaceans. Proc. Inst. Acoust. 30, 54–62 (2008).
Google Scholar
Sakai, T. PAMpal: Load and process passive acoustic data. R package version 0.12.6. http://cran.r-project.org/package=PAMpal (2021).
R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing http://www.r-project.org/ (2021).
Griffiths, E. T. et al. Detection and classification of narrow-band high frequency echolocation clicks from drifting recorders. J. Acoust. Soc. Am. 147, 3511–3522 (2020).
ADS PubMed Google Scholar
Baumann-Pickering, S., Wiggins, S. M., Hildebrand, J. A., Roch, M. A. & Schnitzler, H. Discriminating features of echolocation clicks of melon-headed whales (Peponocephala electra), bottlenose dolphins (Tursiops truncatus), and Gray’s spinner dolphins (Stenella longirostris longirostris). J. Acoust. Soc. Am. 128, 2212–2224 (2010).
ADS PubMed Google Scholar
Sakai, T. PAMpal standardClickCalcs. https://taikisan21.github.io/PAMpal/StandardCalcs.html (2021).
Anderson, M. J. A new method for non-parametric multivariate analysis of variance. Austral Ecol. 26, 32–46 (2001).
Google Scholar
Anderson, M. J. Distance-based tests for homogeneity of multivariate dispersions. Biometrics 62, 245–253 (2006).
MathSciNet PubMed MATH Google Scholar
Anderson, M. J. Permutational Multivariate Analysis of Variance (PERMANOVA). Wiley StatsRef Stat. Ref. Online https://doi.org/10.1002/9781118445112.stat07841 (2017).
Article Google Scholar
Pearson, K. On lines and planes of closest fit to systems of points in space. Philos. Mag. 2, 559–572 (1901).
MATH Google Scholar
Lever, J., Krzywinski, M. & Altman, N. Principal component analysis. Nat. Methods 14, 641–642 (2017).
CAS Google Scholar
Jackson, D. A. Stopping rules in principal components analysis: A comparison of heuristical and statistical approaches. Ecology 74, 2204–2214 (1993).
Google Scholar
Oksanen, J. et al. Vegan: Community ecology package. R package version 2.5-7. https://cran.r-project.org/package=vegan (2020).
Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).
MATH Google Scholar
Yang, L. et al. Description and classification of echolocation clicks of Indian Ocean humpback (Sousa plumbea) and Indo-Pacific bottlenose (Tursiops aduncus) dolphins from Menai Bay, Zanzibar, East Africa. PLoS One 15, e0230319 (2020).
CAS PubMed PubMed Central Google Scholar
Archer, F. I., Rankin, S., Stafford, K. M., Castellote, M. & Delarue, J. Quantifying spatial and temporal variation of North Pacific fin whale (Balaenoptera physalus) acoustic behavior. Mar. Mammal Sci. 36, 224–245 (2020).
Google Scholar
Ross, J. C. & Allen, P. E. Random Forest for improved analysis efficiency in passive acoustic monitoring. Ecol. Inform. 21, 34–39 (2014).
Google Scholar
Liaw, A. & Wiener, M. Classification and regression by randomForest. R News 2, 18–22 (2002).
Google Scholar
Archer, E. rfPermute: Estimate permutation p-values for Random Forest importance metrics. R package version 2.5. https://github.com/EricArcher/rfPermute (2021).
Gurevich, V. S. & Evans, W. E. Echolocation discrimination of complex planar targets by the Beluga whale (Delphinapterus leucas). J. Acoust. Soc. Am. 60, S5 (1976).
ADS Google Scholar
Soldevilla, M. S. et al. Classification of Risso’s and Pacific white-sided dolphins using spectral properties of echolocation clicks. J. Acoust. Soc. Am. 124, 609–624 (2008).
ADS PubMed Google Scholar
Morisaka, T., Yoshida, Y., Akune, Y., Mishima, H. & Nishimoto, S. Exchange of ‘signature’ calls in captive belugas (Delphinapterus leucas). J. Ethol. 31, 141–149 (2013).
Google Scholar
Vergara, V., Michaud, R. & Barrett-Lennard, L. G. What can captive whales tell us about their wild counterparts? Identification, usage, and ontogeny of contact calls in belugas (Delphinapterus leucas). Int. J. Comp. Psychol. 23, 278–309 (2010).
Google Scholar
Vergara, V. & Mikus, M. A. Contact call diversity in natural beluga entrapments in an Arctic estuary: Preliminary evidence of vocal signatures in wild belugas. Mar. Mammal Sci. 35, 434–465 (2019).
Google Scholar
Panova, E. M. et al. Intraspecific variability in the ‘vowel’-like sounds of beluga whales (Delphinapterus leucas): Intra- and interpopulation comparisons. Mar. Mammal Sci. 32, 452–465 (2016).
Google Scholar
Ames, A. E., Blackwell, S. B., Tervo, O. M. & Heide-Jørgensen, M. P. Evidence of stereotyped contact call use in narwhal (Monodon monoceros) mother-calf communication. PLoS One 16, e0254393 (2021).
CAS PubMed PubMed Central Google Scholar
Baumann-Pickering, S. et al. False killer whale and short-finned pilot whale acoustic identification. Endanger. Species Res. 28, 97–108 (2015).
Google Scholar
Halliday, W. D. et al. Potential exposure of beluga and bowhead whales to underwater noise from ship traffic in the Beaufort and Chukchi Seas. Ocean Coast. Manag. 204, 105473 (2021).
Google Scholar
Laidre, K. L., Jørgensen, O. A. & Treble, M. A. Deep-ocean predation by a high Arctic cetacean. ICES J. Mar. Sci. 61, 430–440 (2004).
Google Scholar
Laidre, K. L., Heide-Jørgensen, M. P., Dietz, R., Hobbs, R. C. & Jørgensen, O. A. Deep-diving by narwhals Monodon monoceros: Differences in foraging behavior between wintering areas?. Mar. Ecol. Prog. Ser. 261, 269–281 (2003).
ADS Google Scholar
Lydersen, C. & Kovacs, K. M. A review of the ecology and status of white whales (Delphinapterus leucas) in Svalbard, Norway. Polar Res. 40, 5509 (2021).
Google Scholar
Hauser, D. D. W. et al. Regional diving behavior of Pacific Arctic beluga whales Delphinapterus leucas and possible associations with prey. Mar. Ecol. Prog. Ser. 541, 245–264 (2015).
ADS Google Scholar
Ragen, T. J., Huntington, H. P. & Hovelsrud, G. K. Conservation of Arctic marine mammals faced with climate change. Ecol. Appl. 18, S166–S174 (2008).
PubMed Google Scholar
Laidre, K. L. et al. Quantifying the sensitivity of Arctic marine mammals to climate-induced habitat change. Ecol. Appl. 18, S97–S125 (2008).
PubMed Google Scholar
Heide-Jørgensen, M. P., Dietz, R., Laidre, K. L. & Richard, P. Autumn movements, home ranges, and winter density of narwhals (Monodon monoceros) tagged in Tremblay Sound, Baffin Island. Polar Biol. 25, 331–341 (2002).
Google Scholar
Hauser, D. D. W., Laidre, K. L., Suydam, R. S. & Richard, P. R. Population-specific home ranges and migration timing of Pacific Arctic beluga whales (Delphinapterus leucas). Polar Biol. 37, 1171–1183 (2014).
Google Scholar
Huntington, H. P. A preliminary assessment of threats to Arctic marine mammals and their conservation in the coming decades. Mar. Policy 33, 77–82 (2009).
Google Scholar
Gregersen, U., Hopper, J. R. & Knutz, P. C. Basin seismic stratigraphy and aspects of prospectivity in the NE Baffin Bay, Northwest Greenland. Mar. Pet. Geol. 46, 1–18 (2013).
Google Scholar
McCauley, R. D. et al. Widely used marine seismic survey air gun operations negatively impact zooplankton. Nat. Ecol. Evol. 1, 0195 (2017).
Google Scholar

Download references

Acknowledgements

We are grateful to Niaqornat’s community members and Greenland Institute of Natural Resources for their local support, Mikkel Villum Jensen for his help in the field, Air Greenland pilot Geir Akse for safe field operations, and the University of Tübingen and the German Oceanographic Museum Stralsund for providing recording equipment. We thank Jessica Crance, Taiki Sakai, and Dan Ovando for their support in implementing our analyses. We also thank Jay Barlow and Angela Szesciorka for providing R script and Pina Gruden for providing Matlab code. Funding was provided by the U.S. Office of Naval Research on K. L. Laidre’s grant (N00014-11-1-0201). The School of Aquatic and Fishery Sciences, University of Washington, and the Vetlesen Foundation supported M. J. Zahn.

Author information

Authors and Affiliations

School of Aquatic and Fishery Sciences, University of Washington, 1122 NE Boat Street, Seattle, WA, 98105, USA
Marie J. Zahn & Kristin L. Laidre
Southwest Fisheries Science Center, NOAA, 8901 La Jolla Shores Drive, La Jolla, CA, 92037, USA
Shannon Rankin & Frederick Archer
Pacific Islands Fisheries Science Center, NOAA, 1845 Wasp Boulevard, Building 176, Honolulu, HI, 96818, USA
Jennifer L. K. McCullough
Max Planck Institute of Animal Behavior, Advanced Research Technology Unit, Konstanz, Germany
Jens C. Koblitz
Centre for the Advanced Study of Collective Behaviour, University of Konstanz, Konstanz, Germany
Jens C. Koblitz
Department of Biology, University of Konstanz, Konstanz, Germany
Jens C. Koblitz
The University of Iceland’s Research Center in Húsavík, Húsavík, Iceland
Marianne H. Rasmussen
Polar Science Center, Applied Physics Laboratory, University of Washington, 1013 NE 40th Street, Seattle, WA, 98105, USA
Kristin L. Laidre

Authors

Marie J. Zahn
View author publications
You can also search for this author in PubMed Google Scholar
Shannon Rankin
View author publications
You can also search for this author in PubMed Google Scholar
Jennifer L. K. McCullough
View author publications
You can also search for this author in PubMed Google Scholar
Jens C. Koblitz
View author publications
You can also search for this author in PubMed Google Scholar
Frederick Archer
View author publications
You can also search for this author in PubMed Google Scholar
Marianne H. Rasmussen
View author publications
You can also search for this author in PubMed Google Scholar
Kristin L. Laidre
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

K.L.L. conceived the project and lead all fieldwork operations. K.L.L. and M.H.R. acquired funding for data acquisition and J.C.K. provided acoustic recording equipment. K.L.L., M.H.R., and J.C.K. collected data. Acoustic data were analyzed by M.J.Z with input from S.R., J.L.K.M., and F.A. The first draft of the manuscript was written by M.J.Z. All coauthors improved the manuscript.

Corresponding author

Correspondence to Marie J. Zahn.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Information.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zahn, M.J., Rankin, S., McCullough, J.L.K. et al. Acoustic differentiation and classification of wild belugas and narwhals using echolocation clicks. Sci Rep 11, 22141 (2021). https://doi.org/10.1038/s41598-021-01441-w

Download citation

Received: 22 July 2021
Accepted: 21 October 2021
Published: 12 November 2021
DOI: https://doi.org/10.1038/s41598-021-01441-w

This article is cited by

Spatial and temporal variability of the acoustic repertoire of Antarctic minke whales (Balaenoptera bonaerensis) in the Weddell Sea
- Diego Filún
- Ilse van Opzeeland
Scientific Reports (2023)
Beluga (Delphinapterus leucas) and narwhal (Monodon monoceros) echolocation click detection and differentiation from long-term Arctic acoustic recordings
- Joshua M. Jones
- Kaitlin E. Frasier
- John A. Hildebrand
Polar Biology (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.