Skip to main content

Thank you for visiting You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

Acoustic differentiation and classification of wild belugas and narwhals using echolocation clicks


Belugas (Delphinapterus leucas) and narwhals (Monodon monoceros) are highly social Arctic toothed whales with large vocal repertoires and similar acoustic profiles. Passive Acoustic Monitoring (PAM) that uses multiple hydrophones over large spatiotemporal scales has been a primary method to study their populations, particularly in response to rapid climate change and increasing underwater noise. This study marks the first acoustic comparison between wild belugas and narwhals from the same location and reveals that they can be acoustically differentiated and classified solely by echolocation clicks. Acoustic recordings were made in the pack ice of Baffin Bay, West Greenland, during 2013. Multivariate analyses and Random Forests classification models were applied to eighty-one single-species acoustic events comprised of numerous echolocation clicks. Results demonstrate a significant difference between species’ acoustic parameters where beluga echolocation was distinguished by higher frequency content, evidenced by higher peak frequencies, center frequencies, and frequency minimums and maximums. Spectral peaks, troughs, and center frequencies for beluga clicks were generally > 60 kHz and narwhal clicks < 60 kHz with overlap between 40–60 kHz. Classification model predictive performance was strong with an overall correct classification rate of 97.5% for the best model. The most important predictors for species assignment were defined by peaks and notches in frequency spectra. Our results provide strong support for the use of echolocation in PAM efforts to differentiate belugas and narwhals acoustically.


Only three species of cetaceans occupy the Arctic year-round: the beluga (Delphinapterus leucas), narwhal (Monodon monoceros), and bowhead whale (Balaena mysticetus). As toothed whales (odontocetes), the beluga and narwhal are closely related and are the only two members of the Monodontidae family. They use echolocation to identify objects and locate prey, unlike the bowhead whale, a baleen whale, that has not evolved this sense1,2. Belugas are circumpolar in their distribution with approximately 22 subpopulations, or stocks, some of which are highly migratory and others resident in both Arctic and sub-Arctic waters3,4,5,6,7,8,9. Most populations migrate from wintering regions among the pack ice and return to the same estuarine summering areas to feed, molt, and give birth8,10,11. In contrast, narwhals occur in approximately 12 stocks and have a more restricted distribution occupying waters of the Canadian Arctic, West and East Greenland, and western Russia within the Atlantic Arctic12,13,14. Narwhals in the Canadian Arctic and West Greenland undergo extensive annual migrations with high site fidelity from their summer ranges in fjords of Greenland and Baffin Island to their wintering grounds in Baffin Bay and northern Davis Strait3,13,15. Across their respective distributions, belugas and most of the world’s narwhals overlap for much of the year in the waters of the Canadian Arctic and Baffin Bay, West Greenland, during their annual migrations (Fig. 1a).

Figure 1
figure 1

Top panel: map of narwhal (purple) and beluga (green) annual ranges (a). Overlapping beluga and narwhal annual ranges is visible in dark purple. The Northwest Passage sea route between the Atlantic and Pacific oceans is shown in red. Bottom panel: map showing track lines of search effort, fuel depots, and recording sites for narwhals and belugas in Baffin Bay, West Greenland, between March 21st and 31st, 2013 (b). The black inset box in (a) demarcates the area shown in (b). Maps were created using ArcGIS’s ArcMap software (v. 10.8;

In addition to the use of visual surveys and telemetry to study wild beluga and narwhal populations, Passive Acoustic Monitoring (PAM) uses hydrophones over large spatiotemporal scales to localize regions of ecological importance12,16,17,18,19,20. Despite this, the acoustic profiles of Arctic odontocetes remain largely understudied, particularly during non-summer months, because they reside under sea ice much of the year and the deployment and recovery of acoustic equipment in the Arctic is challenging and costly12. Further, PAM relies on the ability to distinguish between species solely based on their characteristic sounds (i.e., calls or echolocation clicks), which means individual or a combination of specific sonic identifiers must be known for each species. Call types typically used in acoustic classification of odontocetes include echolocation clicks, burst pulses, whistles, and combined signals20,21,22,23,24,25,26. Echolocation, sometimes referred to as biosonar, is characterized by the emission of high frequency, relatively broadband clicks of high directionality and listening for returning echoes; it is a dominant sense for odontocetes, much like vision is for humans1,2. Burst pulses, or “buzzes,” are short bursts or a series of broadband pulses (clicks) with a high repetition rate21,25,27. Whistles are narrow-band and frequency-modulated tonal vocalizations21,28. Finally, combined signals, or “mixed calls,” include overlaid or paired pulsed and tonal sounds23,29,30. Acoustic classifiers may use a single call type or multiple call types to differentiate species31.

Belugas and narwhals have large vocal repertoires that feature similar acoustic profiles17,23,27,32, which can make it difficult to distinguish between them acoustically without visual confirmation of species identity. To date, only one study has differentiated between belugas and narwhals acoustically17 since other PAM studies have focused on regions where they do not co-occur12,33,34. Frouin-Mouy et al.17 identified narwhals when they observed whistles or buzzes combined with low-frequency echolocation clicks, and belugas were detected when bird-like whistles were combined with high-frequency clicks. When whistles and buzzes were absent, they used an increase in spectral power around the 20 kHz frequency band to identify narwhal mid-frequency clicks17. However, their comparison was narrow in scope as it was not their primary research objective. Thorough analyses of echolocation clicks have been conducted separately on belugas21,36,37,38,39,40 and narwhals25,27,41,42, but no comparative study with data from the same region exists. The broadband signals of beluga and narwhal echolocation extend frequencies between 2–150 kHz, but high-frequency clicks have been reported for both species with energies up to 200 kHz25,39. Measurements of beluga click peak frequencies in captive and wild environments vary widely between 40–120 kHz16,36,37,38,39. Unlike belugas, narwhals have not been held in captivity where controlled experiments can occur, and as a result, narwhal phonation is poorly described compared to belugas and other delphinids. Narwhal echolocation has been characterized by clicks with frequency maxima between 30–70 kHz25,30,43,44, with some as low as 2–10 kHz and 7–14 kHz32. Still, among these studies, differences in study design, sampling equipment, and location make direct comparisons between beluga and narwhal click characteristics challenging.

As the Arctic continues to experience profound environmental changes due to climate warming45,46,47, monitoring changes to the seasonal presence of belugas and narwhals is a growing research priority. Projections suggest that trans-Arctic shipping routes like the Northwest Passage and Northern Sea Route will be ice-free by midcentury48,49, leading to concerns surrounding the effects of increased human activities on Arctic cetaceans50,51. Importantly, the spatiotemporal overlap of belugas and narwhals also overlaps with the vessel corridor for the Mary River Iron Ore Mine on Baffin Island and the Northwest Passage (Fig. 1a) where potentially large increases in underwater noise are expected in the coming decades3,52,53,54. Arctic odontocetes are especially at risk to northward human expansion as hydrocarbon development and commercial shipping pose substantial threats via ship strikes, hearing damage, vessel disturbance, and increased underwater noise3,50,51. Underwater anthropogenic sounds from seismic surveys, drilling and oil production, military sonar, and motorized vessels can substantially disturb the acoustic environment which they depend on53,55,56,57,58,59.

Here, we use data collected from an offshore region in Baffin Bay, West Greenland to compare beluga and narwhal echolocation clicks. Our research objectives were twofold: (1) determine whether spectral properties between beluga and narwhal echolocation are significantly different, and (2) build an acoustic classifier to determine whether belugas and narwhals can be classified using only echolocation parameters. Employing methods to differentiate belugas and narwhals acoustically will equip scientists with the necessary tools to monitor their distribution and habitat-use changes. Further, using PAM to study Arctic odontocetes year-round will help managers mitigate any negative consequences of climate change and vessel traffic on these sentinel species.


Data collection

Aerial searches for narwhals and belugas were conducted out of Niaqornat, West Greenland, from an Air Greenland AS350 helicopter in spring 2013 (Fig. 1b). All field operations were in accordance with IACUC procedures as approved by the University of Washington (#4155-01, PI Laidre) and the US Office of Naval Research. The Government of Greenland and Greenland Institute of Natural Resources, Nuuk provided K. L. Laidre permission to conduct research in Greenland waters. All methods were carried out in accordance with relevant guidelines and regulations. Weather conditions allowed seven days between 21 and 31 March to search for whales 100–150 km offshore in leads and cracks in the pack ice > 98% concentration. When narwhals or belugas were observed, an aerial search radius of at least 5 km was required to ensure that the ice conditions were safe for landing, during which sightings for other whales and species in the vicinity were made. On the ice, a hydrophone array was deployed at the edge of a lead and all recordings were paired with visual identification of each species. Only one species was observed and recorded at each sampling location, so there were no occurrences in which both narwhals and belugas were present. Since the detection range for echolocation is < 1 km39 and our search radius was > 5 km, we assume our data are single-species recordings.

The hydrophone array was composed of 16 individual Reson TC4013-5 receivers (sensitivity − 215 dB ± 2 dB re 1 V/µPa; flat ± 2 dB frequency response 1–150 kHz) positioned in a vertical, linear orientation. Prior to deployment, each hydrophone was calibrated and its frequency response determined. Each receiver was located 1 m apart along a 2 mm diameter line with the topmost hydrophone at 3 m below the surface and the bottom hydrophone at 18 m. A 4-kg weight attached to the bottom maintained a vertical orientation of the array. While recording, a custom software, MALTA (Microphone Array Localization Tool for Animals by CAE Software and Systems), was used to visually examine recordings of all 16 receivers in real-time. Hydrophone signals were amplified by 35 dB using a custom amplifier, and recordings were converted from analog to digital (500 kHz sampling rate; 16-bit resolution) using two eight-channel National Instruments PXI-6123 A/D converters. No high pass filter was applied, and the hydrophones served as a low pass filter (150 kHz, 1 pole). The clipping level was at 206 dB pp re 1 µPa at 100 kHz39,41. To safeguard against potential file corruption and facilitate data post-processing, whale recordings were partitioned, loss-less, in 5-s-long sound files.

Event selection and acoustic parameter estimation

Individual echolocation clicks were detected using the open source passive acoustic analysis software PAMGuard (v. 2.01.03f)60. Based on localization analyses using the same data39,41, the highest amplitude signals were recorded on hydrophones 10 and 11 (positioned at 12 and 13 m deep, respectively) and thus provided the highest quality recordings. For species differentiation analyses and classification models, only data from clicks recorded on hydrophone 10 (12 m depth) were used to avoid pseudo-replication. Using the Click Detector module in PAMGuard, clicks were detected from hydrophone 10 recordings with a 14 dB signal-to-noise minimum threshold. To minimize false detections on low-frequency sounds, the detector included a 4th order IIR Butterworth high-pass filter with a 4 kHz corner frequency. No additional frequency filters were used. Click detections were labeled using PAMGuard click classifiers that were defined by specified frequency bins according to the peak frequency within each click. Each classifier was given a unique numeric code that corresponded to the target detection frequency range: 1 (4–20 kHz), 2 (20–50 kHz), 3 (50–70 kHz), 4 (70–100 kHz), 5 (100–150 kHz), 6 (150–250 kHz), and 0 (unclassified).

Groups of clicks (i.e., click trains) were then manually assigned to individual detection “events” using the bearing/time display in PAMGuard’s Viewer Mode (v. 2.01.03f). Although data from only one hydrophone were necessary for our analyses, bearing angles produced from two channels (hydrophones 10 and 11) were needed to effectively visualize and isolate click trains. Since the classification models employed in downstream analyses utilized subsamples of the entire dataset to evaluate model performance—much like a testing and training dataset—we assigned clicks to acoustic events using predetermined time windows to facilitate random partitioning of the data across all recordings. Two-minute intervals were used for narwhal recordings and 1-min intervals were used for beluga recordings in an effort to obtain a similar number of clicks per acoustic event between species, because the beluga recordings had more overlapping click trains. For all events, we included clicks where the maximal intensity of the beam was both centered (i.e., on-axis) and not centered (i.e., off-axis) on the recording system to reflect the nature of most PAM data. Once all clicks were assigned to events, a suite of acoustic parameters was calculated (Table 1). Using the PAMpal package (v. 0.12.6)61 in R (v. 4.1.0)62, echolocation parameter values were calculated for the click sequences selected in PAMGuard using standard measurement criteria63,64,65. Default PAMpal settings were used that included a 10 kHz Butterworth high-pass filter and a FFT window length of 2.5 ms to use in calculations. Parameter values were calculated for individual clicks and then the mean parameter values determined for a given event, except for inter-click-interval (ICI) where the mode value was approximated for each acoustic event.

Table 1 Descriptions of echolocation acoustic parameters used as variables in species differentiation analyses and classification models. All parameters outlined here were used for the standard Random Forests classification model and BANTER’s first stage call classifier. Variable codes marked with an asterisk (*) indicate which acoustic parameters were used for multivariate differentiation analyses. A full list and the process used by the PAMpal package61 for calculating click parameters can be found in the R package documentation65.

Species differentiation

Multivariate analyses were used to assess potential differences in beluga and narwhal echolocation. When both the − 3 dB and − 10 dB measurement was calculated for acoustic parameters, only − 3 dB parameters were considered for differentiation analyses to remove redundant variables (see Table 1). To ensure − 10 dB measurements did not have a substantial contribution, we ran our analyses with both − 3 dB and − 10 dB measurements and our results were unaffected. Each acoustic parameter across all acoustic events was z-score transformed and a Euclidean distance dissimilarity matrix generated. A permutational multivariate analysis of variance (PerMANOVA; 999 permutations) was performed to test for differences between species among mean acoustic parameter values66. However, a significant result from a perMANOVA can result from differences in the mean positions of each group in multivariate space, differences in within-group variance in multivariate space, or a combination of the two. Therefore, a permutation test of multivariate homogeneity of dispersions (PERMDISP; 999 permutations) was used to analyze whether within-group variance in multivariate space between beluga and narwhal acoustic parameter values was different67,68. Differences between beluga and narwhal acoustic parameter values were visualized using principal component analysis (PCA) performed on a correlation matrix69. As an ordination technique, PCA reduces a high-dimensional dataset with correlated variables into fewer, uncorrelated dimensions called principal components (PCs)70. PCA variable loadings, or eigenvectors, on each PC were used to explore differences among groups. PCA eigenvalues provide the amount of variance explained by each PC and are used to determine which PCs are statistically significant. By applying the broken-stick rule and examining a scree plot, eigenvalues that are higher than what is expected by chance are considered to explain a significant amount of the variance in the original data71. All statistical analyses were conducted in R using the vegan package (v. 2.5-7)72.

Species classification using Random Forests

We used a Random Forests (RF) classification model73 to classify belugas and narwhals using echolocation click parameters. RF has demonstrated to be an effective approach for bioacoustic studies74,75,76 as it is unaffected by nonparametric data and can accommodate many correlated variables. An RF model consists of many individual decision trees; each of these trees uses a random subset of samples and predictors. By combining thousands of trees, the final aggregated ensemble tree explores differences among species across the entire space of predictors in the original data and maximizes predictive power. RF models do not use a separate testing dataset to assess model accuracy. The random subsample drawn for each individual tree is termed the “in-bag” sample, while the remainder of the data not included are referred to as “out-of-bag” (OOB). OOB samples are used to test model performance by calculating an OOB classification error rate.

The two primary parameters for RF classification are the number of randomly selected predictors to choose from at each node, mtry, and the number of randomly selected samples to classify in each tree, sampsize. We conducted a sensitivity analysis for mtry and sampsize to ensure we used parameter values that prevented overfitting and maximized classification accuracy. The model was fit over all possible combinations of both parameters within possible ranges (mtry: 2–19; sampsize: 2–18) and the model accuracy determined. For all combinations, the correct classification rate did not vary by more than 0.05%, and therefore the model result was not sensitive to mtry and sampsize. Thus, we used the default value for mtry (square root of the number of predictors) and used half of the total sample size for the smallest species class for sampsize.

For our final model, species were assigned a priori to acoustic events in PAMGuard and 20 acoustic parameters were used as predictors to classify belugas and narwhals (Table 1). Each predictor (i.e., acoustic parameter) was the calculated mean across all individual clicks for each acoustic event. A randomized subset of four acoustic parameters (mtry) was used to split observations at each node. Each tree was grown using a randomized subset of nine acoustic events (sampsize) from each species group. The model was structured such that equal subsamples were drawn from each species class without replacement to account for our unbalanced dataset and to capture as much variation in the acoustic data as possible. For passive acoustics, there are occasions when individual whales may dominate the recordings with stereotyped calls; therefore, if we sampled with replacement, our model may underestimate the variation in the data. Model stability was visualized by plotting the trace of cumulative OOB error rate by number of trees and plotting the distribution of the fraction of trees that objects were in-bag. Ten thousand trees were constructed in the forest (ntree) to achieve model stability. To determine whether our model’s correct classification rates were significantly greater than what was expected by chance alone (50%), a binomial test for statistical significance of model performance was conducted. Our classification model was built using the randomForest (v. 4.6-14)77 and rfPermute (v. 2.5)78 packages in R.

Species classification using BANTER

For comparison with the singular RF model above, we also employed BANTER (Bio-Acoustic eveNT classifiER), a supervised acoustic classification method that can utilize multiple call types28, to classify belugas and narwhals. BANTER consists of two stages of RF models28. The first stage is referred to as the “call classifier” in which individual classification models are built for each call type or detector. From the call classifier, BANTER produces a distribution of classification probabilities for each call type or detector. The second stage is the “event classifier” and uses the results from the first stage to classify acoustic events (collections of calls) to each species.

Although BANTER has the potential to use information from multiple call types (e.g., whistles or burst pulses), our objective was to determine whether belugas and narwhals can be classified using echolocation signals alone. Therefore, our call classifier consisted of separate RF models for each of the echolocation click detectors used in PAMGuard. Two click detectors (1 and 6) from the PAMGuard classification were removed due to insufficient data. For each detector, an RF model was built using 20 acoustic parameters (Table 1) to classify all individual echolocation clicks from that detector. RF models using default settings draw bootstrap samples from one pool containing the entire training dataset from all classes. Therefore, when an RF model is applied to an unbalanced dataset, the classifier will tend to correctly classify the dominant class. To offset this effect, BANTER draws equal random subsamples without replacement from each species group. Each model randomly selected 50 clicks from each species (sampsize) without replacement and each forest contained 20,000 trees. The number of parameters randomly selected to choose from at each node was set to the default (mtry = square root of the total number of parameters). Results from the call classifier provide mean assignment probabilities for each click detector. For example, the click detector 2 model produces the mean probability that clicks from detector 2 will be assigned to a beluga or narwhal. The call classifier also determines the proportion of each detector present for a given event. For example, all events that do not have clicks detected by click detector 2 will have a proportion of zero, and for events that do have clicks from detector 2 will have a proportion value between 0 and 1.

Results from the call classifier (i.e., mean assignment probabilities for each click detector and proportions of each detector per event) were then applied to the second stage event classifier as variables. The mode inter-click interval for each event was added as an additional event-level variable. The event classifier assigned all acoustic events to either a beluga or narwhal using a single RF model. Parameterization of this model was the same as the call classifier (mtry = default; ntree = 20,000), except the number of events randomly selected (sampsize) was set to nine. Relative variable importance was examined for both the call and event classifiers. To determine whether the correct classification rates of our BANTER call and event classifiers were significantly greater than what was expected by chance alone (50%), binomial tests for statistical significance were conducted for each model. Our BANTER classification model was built using the banter (v. 0.9.4)28 and rfPermute78 packages in R.

For the previous analyses, recordings were subdivided into 1- and 2-min acoustic events for belugas and narwhals, respectively. As an alternate approach to using standardized temporal intervals, we examined BANTER performance using events assigned to independent acoustic encounters. Acoustic encounters are defined by separate sightings of groups of whales and can vary in recording duration, ambient noise, and environmental and recording conditions (e.g., greater distance or different orientation between whales and receivers). Here, individual encounters were comprised of several acoustic events. If sounds from independent sightings were substantially different, encounters that have a larger number of events will have a greater representation in the species classification model used above. Using the same BANTER framework, we tested whether unique encounters for each species were similar. To accomplish this, a series of BANTER classification training models were created using all events from one encounter for each species (mtry = default; ntree = 10,000; sampsize = half the smallest class sample size) and then used to predict events from encounters that were not included. All unique pairwise combinations of acoustic encounters between species were used to build separate BANTER models. The correlation between the training confusion matrices and the prediction (validation) confusion matrices were then calculated. The presence of correlation (> 0.5) indicates that events from separate encounters are similar because the prediction classification scores are similar to the training scores.


A total of 1:03 h of beluga recordings and 7:38 h of narwhal recordings were made 100 km or more offshore in the pack ice of Baffin Bay (Fig. 1b). Belugas were observed and recorded at two unique locations and narwhals at nine unique locations, in pods of approximately six to thirty individuals. Narwhal recordings used for analyses originated from five independent sightings (i.e., encounters) and beluga recordings from two. We estimate there were approximately 22–36 individual belugas and 63–120 narwhals sampled across all seven encounters. Recordings were paired with visual confirmation of species, so all echolocation clicks were labeled as originating from belugas or narwhals. Species groups were assigned to events created in PAMGuard. A total of 11,319 clicks were assigned to 81 separate events with a median of 71 clicks per event (Supplementary Table S1). Out of the 81 events, 19 were belugas (2537 clicks) and 62 were narwhals (8782 clicks). Twenty acoustic parameters were used to compare beluga and narwhal echolocation (Table 1). Signal parameter measurements for each species across all acoustic events are summarized in Table 2. Beluga acoustic events had a higher mean peak frequency (68.7 ± 10.1 kHz), − 3 dB center frequency (69.6 ± 10.6 kHz), and − 10 dB center frequency (70.2 ± 9.2 kHz) than narwhal events (43.7 ± 7.8 kHz, 43.8 ± 7.8 kHz, and 46.2 ± 8.5 kHz, respectively). Spectral peaks, troughs, and center frequencies (variables: peak, peak2, peak3, trough, trough2, centerHz_10dB, and centerHz_3dB) for beluga clicks were generally > 60 kHz and narwhal clicks < 60 kHz with overlap between 40–60 kHz (Table 2).

Table 2 Summary of signal parameter measurements for beluga and narwhal echolocation for all 81 acoustic events (beluga: n = 19; narwhal: n = 62). Mean ± standard deviation (s.d.) and range (min–max) are provided.

Species acoustic differentiation

Results from the perMANOVA reveal significant differences between beluga and narwhal echolocation characteristics (F1,79 = 42.35, p = 0.001; Supplementary Table S2). Beluga acoustic events also demonstrated a higher dispersion in multivariate space than narwhal acoustic events (PERMDISP, F1,79 = 8.37, p = 0.004; Supplementary Table S2). The PCA further displayed strong evidence for beluga and narwhal acoustic differentiation; beluga events were more dispersed than narwhal events (Fig. 2a). Within the two-dimensional space of PC1 and PC2, 72.9% of the original trait variation was explained where most was captured by the first dimension (58.4%; Fig. 2a). According to the broken-stick model, PC1 was the only dimension found to explain a significant amount of the variation in the original data. Beluga acoustic events are largely located on the right side of the PCA ordination space, representing higher echolocation click parameter values as compared to narwhal acoustic events that are on the left side (Fig. 2b; Supplementary Fig. S1). These findings are consistent with average spectra, acoustic event spectrograms, and mean parameter values for each species where beluga spectra present more energy at higher frequencies than narwhal spectra (Fig. 3; Table 2).

Figure 2
figure 2

Multivariate PCA ordination plot for beluga or narwhal acoustic events (a) and acoustic parameter eigenvectors (b). Principle component (PC) 1 explains 58.4% of the variation and PC2 explains 14.5%. The total variable contribution is shown in vector length and color, where purple is a low contribution and green is high. Ellipses in (a) show 95% confidence intervals. (a) and (b) represent the same multidimensional space.

Figure 3
figure 3

Average spectra and example concatenated spectrograms for beluga (a,c) and narwhal (b,d) acoustic events. Gray lines in (a) and (b) show average spectra for individual acoustic events: 19 beluga events and 62 narwhal events. Black lines in (a) and (b) represent the mean spectra for all echolocation clicks assigned to events for each species: 2537 beluga clicks and 8782 narwhal clicks. Example concatenated spectrograms in (c) and (d) show one acoustic event for each species (1024 point FFT; Hanning window).

Species acoustic classification

Both the RF and BANTER species classification models performed well with high OOB correct classification rates (p < 0.001 for all models; Table 3). The overall correct classification rate for the RF model was 92.6% (95% CI: 84.6–97.2%); 94.7% correct classification for beluga acoustic events and 91.9% for narwhal events. One beluga and five narwhal acoustic events were misclassified (Fig. 4a) and the most important predictors for species classification were − 3 dB and − 10 dB frequency minimums and peak frequency (Fig. 4b). Average spectra and waveforms for misclassified events presented signal patterns characteristic of off-axis clicks (i.e., clicks recorded away from the whale’s longitudinal acoustic axis) when compared to those that were correctly classified (Fig. 5).

Table 3 Confusion matrices for the (a) standard Random Forest model, (b) BANTER call classifiers (Detectors 2, 3, 4, and 5), and (c) BANTER event classifier. The total number of correct and incorrect classifications are shown between the groups identified a-priori (rows) and predictions by the classifier (columns). Out-of-bag (OOB) percent correct classification rates (95% confidence interval) demonstrate model performance. Binomial test p-values using a prior probability of 0.5 are provided where an asterisk (*) represents statistical significance (p < 0.05).
Figure 4
figure 4

Ordination plots (a,c) from a multidimensional scale (MDS) conversion of Random Forest (RF) proximity scores to visualize species classification predictions. The top panel (a,b) corresponds to the single RF classification model and the bottom panel (ce) are the results from the BANTER classification model. RF proximity scores are calculated for each pair of objects (i.e., acoustic events) to produce a N × N matrix, where N is the total number of objects. If objects occupy the same terminal node for one tree, their proximity score increases by one. All proximities are normalized by dividing by the total number of trees in the forest. Shaded regions show a-priori species groups with points colored according to their original (inner) and predicted (outer) species. Dimension 1 in (a) and (c) explains 98.3% and 98.4% of the total variation for the RF and BANTER models, respectively, and dimension 2 explains 0.8% and 0.6% of the variation, respectively. Relative variable importance is shown for both models (b,d,e). The heatmaps show ranked variable importance from the least important (light gray) to most important (dark gray). Descriptions of predictors are provided in Table 1 and Supplementary Table S3.

Figure 5
figure 5

Example average spectra for beluga (a,b) and narwhal (c,d) acoustic events that were correctly classified (a,c) and misclassified (b,d). The waveforms of the highest amplitude click for each event are provided. The beluga acoustic event in (b) was misclassified by only the RF model and the narwhal acoustic event in (d) was misclassified by both the RF and BANTER classification models.

BANTER’s two-stage approach achieved a higher accuracy than the standard RF model. The BANTER first stage call classifier used here consisted of four separate models for click detectors 2, 3, 4, and 5 (Table 3), each producing mean species assignment probabilities and click detector proportions that were subsequently used as variables for the second stage event classifier (Supplementary Table S3). BANTER’s event classifier achieved a correct classification score of 97.5% (95% CI: 91.4–99.7%); 100% correct classification for beluga acoustic events and 96.8% for narwhal events (Table 3). Among the event-level variables, detectors 2 (20–50 kHz) and 3 (50–70 kHz) were the most important classifiers (Fig. 4e). By examining the relative importance of the predictors (i.e., acoustic parameters; Table 1) for detector 2 and 3 call classifiers, the second peak frequency, the frequency difference between the second and third peaks, and the frequency trough were the most important for classification of individual clicks (Fig. 4d). However, since the event classifier uses mean assignment probabilities from each detector classifier as predictors, the most important variables for the detector 2 and 3 classifiers are not necessarily important variables for the event classifier. Examination of the vote distributions for the detector 2 and 3 call classifiers revealed that the detector 2 call classifier produced confident species classifications (i.e., the majority of samples were classified at high probability) compared to the detector 3 classifier (Supplementary Fig. S2). Overall, the BANTER model demonstrated strong predictive power with only two narwhal event misclassifications (Table 3; Fig. 4c). For correctly classified events, the model had high confidence for its species assignment evidenced by its vote distribution (Supplementary Fig. S3).

The correlation test to examine similarity between acoustic encounters revealed that events among different encounters were similar. A positive correlation between training confusion matrices (from models built using one encounter per species) and validation confusion matrices (from training models that predicted events from encounters not used in training model) indicated similarity between encounters. With the exception of one pair, all encounter pairs had a positive correlation and 6 of the 10 pairs had a correlation greater than 0.5 (Supplementary Table S4).

For both the single RF and BANTER models, the majority of the predictions for beluga and narwhal acoustic events were tightly clustered (Fig. 4a,c). These tightly clustered, correctly classified events were marked by clear differences between beluga and narwhal spectral characteristics. Beluga average spectra tended to be smoother, have a larger bandwidth, and higher -3 and -10 dB frequency minimums and maximums (Figs. 3a and 5a). Conversely, narwhal average spectra presented peaks and notches, a lower bandwidth, and lower frequency minimums and maximums (Figs. 3b and 5c). Average spectra from events that were misclassified and showed properties of both beluga and narwhal spectral characteristics generally had lower signal magnitudes (dB), smaller click sample sizes, and spectral variations that suggest a higher proportion of off-axis clicks (i.e., clicks not centered on the recording equipment; Figs. 3 and 5).


Passive acoustics has been a primary method to monitor beluga and narwhal populations year-round in remote regions of the Arctic, providing insight into their seasonal distribution and migratory routes16,17,33,38,44. Yet, prior to this study, the acoustic profiles of these two species were examined separately, despite their shared habitat ranges and acoustic features. Our results provide the first acoustic comparison between belugas and narwhals from the same region and reveal that they can be acoustically differentiated and classified using echolocation clicks alone. Belugas and narwhals have considerable acoustic overlap and variability in their social calls which makes species classification using acoustic parameters challenging17,23,27,32. Our analysis of beluga and narwhal recordings from the same region, during the same season, and acquired using the same equipment delivers a rare in situ comparison that fills a critical knowledge gap in the acoustic ecology for each species. We also provide a robust BANTER classification model that sets a precedent and foundation for echolocation parameters to be used in future PAM classification efforts, whether for belugas and narwhals or other odontocete species.

Our multivariate analyses showed significant differences between beluga and narwhal echolocation characteristics (perMANOVA: p = 0.001) where beluga acoustic parameters were more dispersed and distinguished by higher frequency content (Figs. 2 and 3). Spectral peaks, troughs, and center frequencies for beluga clicks tended to be > 60 kHz and narwhal clicks < 60 kHz with overlap between 40–60 kHz (Table 2). The greater variation observed among beluga acoustic events may be due to the sample size being smaller (n = 19) than the narwhal dataset (n = 62) or may reflect the true variation in beluga echolocation. Additionally, the lower frequency content observed in narwhal echolocation may indicate that narwhals have a longer echolocation detection range for prey and PAM receivers than belugas. Previous work to quantify beluga echolocation parameters reveals high variability and evidence of biosonar adaptability16,36,37,40,79. For example, estimates of beluga echolocation peak frequencies range widely between 40–120 kHz across captive and wild environments34,36,37,38,40,79 and have been shown to change depending on the ambient sound levels of the environment36. Narwhal echolocation has been largely characterized by lower frequency clicks with maxima between 30–70 kHz30,32,43,44, but high-frequency clicks have been reported where the entire bandwidth can extend up to 200 kHz25. Frouin-Mouy et al.17 used a difference in spectral power around the 20 kHz frequency band to distinguish between species for mid-frequency clicks with peaks within 30–60 kHz17. However, their sample sizes were small with 20 beluga and 17 narwhal click trains (162 and 186 individual clicks, respectively), and their recordings came from five different locations and three different receivers which can introduce classification errors35. Our findings are consistent with their preliminary comparison in which narwhal spectra contain more energy between 20–50 kHz than beluga spectra (Fig. 3). Yet, it remains unknown to what degree the variation observed in beluga and narwhal echolocation is due to context-specific active biosonar adjustments made by the whales, the orientation of whales relative to the receiver, differences in sound propagation in the water, population- or individual-specific characteristics, and/or differences in sampling design, recording equipment, or data processing.

Between the two classification models presented in this study, the BANTER model proved to be the strongest classifier with a correct classification rate of 97.5%. The BANTER event classifier accurately predicted all beluga events (100%) and misclassified only two narwhal events (3.2%; Table 3). As a balanced design, the BANTER classifier does not bias towards either species, but it reveals that beluga events are more diagnosable evidenced by the correct prediction of all beluga events by the event classifier. This may occur because a higher proportion of the beluga events—the smaller sample size group—was randomly selected for the training dataset compared to the narwhal group across all n trees. Results from BANTER’s two-stage approach identifies which frequency ranges contain the most useful information for species classification. The call classifiers for detectors 2 (20–50 kHz) and 3 (50–70 kHz) were found to be the most important for BANTER’s event classifier, suggesting clicks with peak frequencies between 20–70 kHz largely contained the information necessary for correct species assignment to acoustic events (Fig. 5a,c). Higher frequencies attenuate faster than lower frequencies1; therefore, it is possible that the lower frequency classifiers (detectors 2 and 3) were the most important because lower frequency signals were detected more than higher frequency signals. These results also indicate that the BANTER model will be less affected by variations in detection distances between species that would more largely influence the higher frequency detector 4 and 5 classifiers.

While BANTER’s call classifiers used acoustic parameters for predictors, the event classifier used mean assignment probabilities and detector proportions for predictors. The most important predictors for the event classifier were the mean assignment probabilities for detector 2 (Fig. 4e). Given the reasonably high overall classification rate (85.4%) and vote distribution for the detector 2 call classifier, it is likely that the most important variables for this classifier were also important for the event classifier. For the detector 2 call classifier, the second peak frequency, frequency difference between the second and third peaks, and − 3 dB center frequency were the most important variables for the classification of clicks (Fig. 4d). Soldevilla et al.80 demonstrated that Risso’s (Grampus griseus) and Pacific white-sided dolphins (Lagenorhynchus obliquidens) could be classified using spectral peak and notch patterns from echolocation clicks. Here, we similarly demonstrate the efficacy of using unique spectral properties to classify belugas and narwhals.

Our results from the single RF model demonstrate how a simple RF classifier using only echolocation parameters can still achieve a high correct species classification rate (92.6%). The RF model accurately predicted beluga events more than narwhal events, with one beluga misclassification (5.3%) and five narwhal misclassifications (8.1%; Table 3). Much like the BANTER model, the lower misclassification rate of beluga events may be in part due to the unbalanced dataset; a higher proportion of the beluga events were randomly selected in the RF model because it was the smaller sample size group. However, it is also possible that beluga events contain more diagnosable features and thus yield higher classification scores. The most important acoustic parameters for species classification in the standard RF model differed from those in BANTER’s detectors 2 and 3 call classifiers. Among all 20 acoustic parameters used in the RF model, frequency minimum (− 3 dB and − 10 dB) and peak frequency were the most important variables for species assignment (Fig. 4b).

Using echolocation predictors for species classification is an effective approach, particularly given its primary sensory role for odontocetes. There is evidence for individual and group-specific call types among belugas81,82,83,84 and narwhals85, which contributes to the variation in their vocal repertoire but also makes acoustic classification challenging. However, it is likely that echolocation clicks are characterized by more stable features due to their sensory function. There are occasions where whales are only echolocating and not producing other call types (e.g., whistles, burst pulses)17, and as such, a classifier that uses echolocation may offer more reliable year-round detections. While it is likely that the addition of other call types would improve classification28,35, the BANTER classifier presented here demonstrates belugas and narwhals can be classified by clicks alone. Echolocation is increasingly being incorporated into PAM classification programs28,74,80,86 and is specifically gaining traction for monitoring belugas16,19,33. Noise from ice floes, vessels, or industrial activities below 40 kHz can mask lower frequency social calls (e.g., Lammers et al.34 and Halliday et al.87). The broadband, high frequency nature of echolocation clicks is largely protected from masking by lower frequency anthropogenic sounds. As vessel traffic is expected to increase in the Arctic, echolocation may be a stronger metric for PAM classification.

Yet, the directionality and high-frequency nature of echolocation clicks pose their own limitations to species detectability and classification. Acoustic receivers will record echolocation sounds when the whale is facing the receiver but are unlikely to detect whales that are pointing their sonar beam away from the recorder. Narwhals are known to forage for Greenland halibut (Reinhardtius hippoglossoides) and Gonatus squid species at depths often > 1000 m in the winter88,89, so they may be detected by seafloor recorders during these months. While belugas can dive to depths > 500 m and reach the seafloor, they do not prey upon benthic species like narwhals; instead, they target fish species like the Arctic cod (Arctogadus glacialis) and polar cod (Boreogadus saida) that occupy shallower depths90,91. Therefore, due to the differences in prey selection by belugas and narwhals, they may not be detected by seafloor recorders with the same confidence depending on the recording depth of the instrument. Additionally, the rapid attenuation of ultrasonic echolocation signals1 results in a smaller detection range for echolocation clicks (< 1 km39) compared to lower-frequency social calls. Multiple receivers and the inclusion of other call types (e.g., whistles) in the classifier may be required to detect whales at a sufficiently large spatial resolution. In addition to deploying more receivers, the instruments required to record echolocation clicks must record higher frequencies with higher sampling rates, which typically depend on more battery power and memory storage. Ongoing development of acoustic receivers to be both economical and reliable will support monitoring populations using echolocation clicks at large scales.

This study provides a unique case in which the methods employed were identical for each species studied, so confounding variables that can substantially affect parameterization, like differences in the frequency responses of recording systems used35, are absent. However, the data used to build the classifiers originated from a limited number of independent acoustic encounters: two beluga and five narwhal. When testing for the similarity between independent acoustic encounters using a BANTER framework, our results indicate that there is likely not a strong effect of encounter on species classification. Additional data from multiple, independent groups of whales are needed to more accurately reflect the true variation in beluga and narwhal echolocation. While we show it is possible to differentiate and classify belugas and narwhals using echolocation clicks, incorporating data from additional independent encounters and other call types will strengthen the classifier against any individual- or pod-specific acoustic characteristics that may be present.

Future work to test the BANTER classification model presented here will elucidate its efficacy for PAM applications. Acoustic recordings used in this study were collected with a hydrophone deployed off the ice-edge positioned at a depth of 12 m. Assuming the new data follow the methodology employed here—including the recording equipment and sampling rate (500 kHz sampling rate; 16-bit resolution)—our model has the potential to be used on novel data. However, it remains unclear how effective the model presented here will be when data collected at depth, such as seafloor recorders where detectability of echolocation may decrease and recordings are expected to have a higher proportion of off-axis clicks, are applied. Additional examination is needed to assess the degree to which differences in recording equipment (e.g., Scripps Institution of Oceanography High-frequency Acoustic Recording Package [HARPs] or Ocean Instruments’ SoundTrap recorders) or receiver depth would affect the outcome of the presented classifier. Likewise, further consideration of which acoustic parameters are more stable across various equipment may mitigate effects of differing platforms28,35.

Insufficient data on beluga and narwhal populations has made it difficult to determine major threats to these species and thus hindered specific conservation efforts92. Narwhals have been identified as one of the most sensitive Arctic marine mammals to climate change due to their specialized habitat niche and restricted distribution range93. Both belugas and narwhals have been shown to have high site fidelity to their summering and wintering grounds, but little is known about what factors (e.g., prey availability or ice conditions) cause these behavior88,94,95. Furthermore, global climate warming is causing a lengthened open-water season in the Arctic that has galvanized opportunities for increased hydrocarbon exploration and commercial and recreational shipping development3,48,92,96. Some of the largest unexploited hydrocarbon reserves are found in the Arctic, particularly Baffin Bay97,98, and most global trade depends on maritime transport. Over half of the distribution ranges of the three endemic Arctic whale species overlaps with known or anticipated offshore hydrocarbon deposits3. Stakeholders of international trade and hydrocarbon exploration need to be informed of what regions are critical to belugas and narwhals for vital processes (e.g., calving, feeding, migrating) in order to mitigate industrial activities17. Strict regulations on vessel speed, appropriate shipping lanes, and types of permissible activities will be needed to avoid harmful consequences92.

As advances in PAM increase efficiency and robustness of data collection and analyses in the Arctic, present and future distributions of belugas and narwhals can be recorded with greater confidence and resolution, thereby improving efforts to monitor their changes in response to climate change and increased underwater noise. The multivariate nature of acoustic parameters lends itself to the interactive effects between variables, making methods like RF classifiers that can handle correlated variables robust for PAM species detection. BANTER increases classification accuracy beyond a standard RF model by using information from misclassifications at the call classification stage to inform the overall event classifier. Results from this study fill a critical data gap in the acoustic ecology of belugas and narwhals and provide a promising approach for future PAM programs. As recording systems advance and training datasets expand, classification programs such as the one presented here will develop greater predictive power and allow acousticians to reliably monitor Arctic odontocete populations long-term. Despite the variation in beluga and narwhal acoustic repertoires, our study suggests that the use of echolocation parameters for species classification may provide the most reliable method to differentiate these iconic species.

Data availability

All code used for model design and implementation are freely available via Github:


  1. 1.

    Madsen, P. T. & Wahlberg, M. Recording and quantification of ultrasonic echolocation clicks from free-ranging toothed whales. Deep. Res. Part I(54), 1421–1444 (2007).

    Google Scholar 

  2. 2.

    Au, W. W. L. Sonar of Dolphins (Springer, 1993).

    Google Scholar 

  3. 3.

    Reeves, R. R. et al. Distribution of endemic cetaceans in relation to hydrocarbon development and commercial shipping in a warming Arctic. Mar. Policy 44, 375–389 (2014).

    Google Scholar 

  4. 4.

    Hauser, D. D. W. et al. Habitat selection by two beluga whale populations in the Chukchi and Beaufort seas. PLoS One 12, e0172755 (2017).

    PubMed  PubMed Central  Google Scholar 

  5. 5.

    Vacquié-Garcia, J., Lydersen, C., Ims, R. A. & Kovacs, K. M. Habitats and movement patterns of white whales Delphinapterus leucas in Svalbard, Norway in a changing climate. Mov. Ecol. 6, 21 (2018).

    PubMed  PubMed Central  Google Scholar 

  6. 6.

    Lydersen, C., Martin, A. R., Kovacs, K. M. & Gjertz, I. Summer and autumn movements of white whales Delphinapterus leucas in Svalbard, Norway. Mar. Ecol. Prog. Ser. 219, 265–274 (2001).

    ADS  Google Scholar 

  7. 7.

    Innes, S. et al. Surveys of belugas and narwhals in the Canadian High Arctic in 1996. NAMMCO Sci. Publ. 4, 169–190 (2002).

    Google Scholar 

  8. 8.

    Smith, T. G. & Martin, A. R. Distribution and movements of belugas, Delphinapterus leucas, in the Canadian High Arctic. Can. J. Fish. Aquat. Sci. 51, 1653–1663 (1994).

    Google Scholar 

  9. 9.

    Hobbs, R. et al. Global review of the conservation status of Monodontid stocks. Mar. Fish. Rev. 81, 1–53 (2019).

    ADS  Google Scholar 

  10. 10.

    Frost, K. J. & Lowry, L. F. Distribution, abundance, and movements of beluga whales, Delphinapterus leucas, in coastal waters of western Alaska. In Advances in Research on the Beluga Whale, Delphinapterus leucas Vol. 224 (eds Smith, T. G. et al.) 39–57 (Canadian Bulletin of Fisheries and Aquatic Sciences, 1990).

    Google Scholar 

  11. 11.

    Lewis, A. E., Hammill, M. O., Power, M., Doidge, D. W. & Lesage, V. Movement and aggregation of eastern Hudson Bay beluga whales (Delphinapterus leucas): A comparison of patterns found through satellite telemetry and Nunavik Traditional Ecological Knowledge. Arctic 62, 13–24 (2009).

    Google Scholar 

  12. 12.

    Ahonen, H., Stafford, K. M., Lydersen, C., Steur, L. D. & Kovacs, K. M. A multi-year study of narwhal occurrence in the western Fram Strait—detected via passive acoustic monitoring. Polar Res. 38, 1–14 (2019).

    Google Scholar 

  13. 13.

    Heide-Jørgensen, M. P. et al. The migratory behaviour of narwhals (Monodon monoceros). Can. J. Zool. 81, 1298–1305 (2003).

    Google Scholar 

  14. 14.

    Richard, P. R. et al. Baffin Bay narwhal population distribution and numbers: Aerial surveys in the Canadian High Arctic, 2002–04. Arctic 63, 85–99 (2010).

    Google Scholar 

  15. 15.

    Dietz, R., Heide-Jørgensen, M. P., Richard, P. R. & Acquarone, M. Summer and fall movements of narwhals (Monodon monoceros) from northeastern Baffin Island towards northern Davis Strait. Arctic 54, 244–261 (2001).

    Google Scholar 

  16. 16.

    Castellote, M. et al. Monitoring white whales (Delphinapterus leucas) with echolocation loggers. Polar Biol. 36, 493–509 (2013).

    Google Scholar 

  17. 17.

    Frouin-Mouy, H., Kowarski, K., Martin, B. & Bröker, K. Seasonal trends in acoustic detection of marine mammals in Baffin Bay and Melville Bay, Northwest Greenland. Arctic 70, 59–76 (2017).

    Google Scholar 

  18. 18.

    Sousa-Lima, R. S., Norris, T. F., Oswald, J. N. & Fernandes, D. P. A review and inventory of fixed autonomous recorders for passive acoustic monitoring of marine mammals. Aquat. Mamm. 39, 23–53 (2013).

    Google Scholar 

  19. 19.

    Zhong, M. et al. Beluga whale acoustic signal classification using deep learning neural network models. J. Acoust. Soc. Am. 147, 1834–1841 (2020).

    ADS  PubMed  Google Scholar 

  20. 20.

    Castellote, M. et al. Seasonal distribution and foraging occurrence of Cook Inlet beluga whales based on passive acoustic monitoring. Endanger. Species Res. 41, 225–243 (2020).

    Google Scholar 

  21. 21.

    Sjare, B. L. & Smith, T. G. The vocal repertoire of white whales, Delphinapterus leucas, summering in Cunningham Inlet, Northwest Territories. Can. J. Zool. 64, 407–415 (1986).

    Google Scholar 

  22. 22.

    Chmelnitsky, E. G. & Ferguson, S. H. Beluga whale, Delphinapterus leucas, vocalizations from the Churchill River, Manitoba, Canada. J. Acoust. Soc. Am. 131, 4821–4835 (2012).

    ADS  PubMed  Google Scholar 

  23. 23.

    Marcoux, M., Auger-Méthé, M. & Humphries, M. M. Variability and context specificity of narwhal (Monodon monoceros) whistles and pulsed calls. Mar. Mammal Sci. 28, 649–665 (2012).

    Google Scholar 

  24. 24.

    Garland, E. C., Castellote, M. & Berchok, C. L. Beluga whale (Delphinapterus leucas) vocalizations and call classification from the eastern Beaufort Sea population. J. Acoust. Soc. Am. 137, 3054–3067 (2015).

    ADS  PubMed  Google Scholar 

  25. 25.

    Rasmussen, M. H., Koblitz, J. C. & Laidre, K. L. Buzzes and high-frequency clicks recorded from narwhals (Monodon monoceros) at their wintering ground. Aquat. Mamm. 41, 256–264 (2015).

    Google Scholar 

  26. 26.

    McCullough, J. L. K., Simonis, A. E., Sakai, T. & Oleson, E. M. Acoustic classification of false killer whales in the Hawaiian islands based on comprehensive vocal repertoire. JASA Express Lett. 1, 071201 (2021).

    Google Scholar 

  27. 27.

    Ford, J. K. B. & Fisher, H. D. Underwater acoustic signals of the narwhal (Monodon monoceros). Can. J. Zool. 56, 552–560 (1978).

    Google Scholar 

  28. 28.

    Rankin, S. et al. Acoustic classification of dolphins in the California Current using whistles, echolocation clicks, and burst pulses. Mar. Mammal Sci. 33, 520–540 (2017).

    Google Scholar 

  29. 29.

    Walmsley, S. F., Rendell, L., Hussey, N. E. & Marcoux, M. Vocal sequences in narwhals (Monodon monoceros). J. Acoust. Soc. Am. 147, 1078–1091 (2020).

    ADS  PubMed  Google Scholar 

  30. 30.

    Shapiro, A. D. Preliminary evidence for signature vocalizations among free-ranging narwhals (Monodon monceros). J. Acoust. Soc. Am. 120, 1695–1705 (2006).

    ADS  PubMed  Google Scholar 

  31. 31.

    Simões Amorim, T. O. et al. Integrative bioacoustics discrimination of eight delphinid species in the western South Atlantic Ocean. PLoS One 14, e0217977 (2019).

    PubMed  PubMed Central  Google Scholar 

  32. 32.

    Stafford, K. M., Laidre, K. L. & Heide-Jørgensen, M. P. First acoustic recordings of narwhals (Monodon monoceros) in winter. Mar. Mammal Sci. 28, 197–207 (2012).

    Google Scholar 

  33. 33.

    Castellote, M. et al. Dual instrument passive acoustic monitoring of belugas in Cook Inlet, Alaska. J. Acoust. Soc. Am. 139, 2697–2707 (2016).

    ADS  PubMed  Google Scholar 

  34. 34.

    Lammers, M. O. et al. Passive acoustic monitoring of Cook Inlet beluga whales (Delphinapterus leucas). J. Acoust. Soc. Am. 134, 2497–2504 (2013).

    ADS  PubMed  Google Scholar 

  35. 35.

    Roch, M. A., Stinner-Sloan, J., Baumann-Pickering, S. & Wiggins, S. M. Compensating for the effects of site and equipment variation on delphinid species identification from their echolocation clicks. J. Acoust. Soc. Am. 137, 22–29 (2015).

    ADS  PubMed  Google Scholar 

  36. 36.

    Au, W. W., Penner, R. H., Carder, D. A. & Scronce, B. Demonstration of adaptation in beluga whale echolocation signals. J. Acoust. Soc. Am. 77, 726–730 (1985).

    ADS  CAS  PubMed  Google Scholar 

  37. 37.

    Au, W. W. L., Penner, R. H. & Turl, C. W. Propagation of beluga echolocation signals. J. Acoust. Soc. Am. 82, 807–813 (1987).

    ADS  CAS  PubMed  Google Scholar 

  38. 38.

    Roy, N., Simard, Y., Gervaise, C. & Dtn, E. 3D tracking of foraging belugas from their clicks: Experiment from a coastal hydrophone array. Appl. Acoust. 71, 1050–1056 (2010).

    Google Scholar 

  39. 39.

    Zahn, M. J., Laidre, K. L., Stilz, P., Rasmussen, M. H. & Koblitz, J. C. Vertical sonar beam width of wild belugas (Delphinapterus leucas) in West Greenland. PLoS One 16, e0257054 (2021).

    CAS  PubMed  PubMed Central  Google Scholar 

  40. 40.

    Rutenko, A. N. & Vishnyakov, A. A. Time sequences of sonar signals generated by a beluga whale when locating underwater objects. Acoust. Phys. 52, 314–323 (2006).

    ADS  Google Scholar 

  41. 41.

    Koblitz, J. C., Stilz, P., Rasmussen, M. H. & Laidre, K. L. Highly directional sonar beam of narwhals (Monodon monoceros) measured with a vertical 16 hydrophone array. PLoS One 11, e0162069 (2016).

    PubMed  PubMed Central  Google Scholar 

  42. 42.

    Podolskiy, E. A. & Sugiyama, S. Soundscape of a narwhal summering ground in a glacier fjord (Inglefield Bredning, Greenland). J. Geophys. Res. Ocean. 125, e2020JC016116 (2020).

    ADS  Google Scholar 

  43. 43.

    Miller, L. A., Pristed, J., Mohl, B. & Surlykke, A. The click-sounds of narwhals (Monodon monoceros) in Inglefield Bay, Northwest Greenland. Mar. Mammal Sci. 11, 491–502 (1995).

    Google Scholar 

  44. 44.

    Marcoux, M., Auger-Methe, M., Chmelnitsky, E., Ferguson, S. H. & Humphries, M. M. Local passive acoustic monitoring of narwhal presence in the Canadian Arctic: A pilot project. Arctic 64, 307–316 (2011).

    Google Scholar 

  45. 45.

    Overland, J. et al. The urgency of Arctic change. Polar Sci. 21, 6–13 (2019).

    ADS  Google Scholar 

  46. 46.

    Comiso, J. C. & Hall, D. K. Climate trends in the Arctic as observed from space. WIREs Clim. Change 5, 389–409 (2014).

    Google Scholar 

  47. 47.

    Kwok, R. Arctic sea ice thickness, volume, and multiyear ice coverage: Losses and coupled variability (1958–2018). Environ. Res. Lett. 13, 105005 (2018).

    Google Scholar 

  48. 48.

    Overland, J. E. & Wang, M. When will the summer Arctic be nearly sea ice free?. Geophys. Res. Lett. 40, 2097–2101 (2013).

    ADS  Google Scholar 

  49. 49.

    Smith, L. C. & Stephenson, S. R. New Trans-Arctic shipping routes navigable by midcentury. Proc. Natl. Acad. Sci. U.S.A. 110, E1191–E1195 (2013).

    ADS  CAS  PubMed  PubMed Central  Google Scholar 

  50. 50.

    Hauser, D. D. W., Laidre, K. L. & Stern, H. L. Vulnerability of Arctic marine mammals to vessel traffic in the increasingly ice-free Northwest Passage and Northern Sea Route. Proc. Natl. Acad. Sci. U.S.A. 115, 7617–7622 (2018).

    CAS  PubMed  PubMed Central  Google Scholar 

  51. 51.

    Halliday, W. D., Pine, M. K. & Insley, S. J. Underwater noise and Arctic marine mammals: Review and policy recommendations. Environ. Rev. 28, 438–448 (2020).

    Google Scholar 

  52. 52.

    Halliday, W. D. et al. Underwater sound levels in the Canadian Arctic, 2014–2019. Mar. Pollut. Bull. 168, 112437 (2021).

    CAS  PubMed  Google Scholar 

  53. 53.

    Kochanowicz, Z. et al. Using western science and Inuit knowledge to model ship-source noise exposure for cetaceans (marine mammals) in Tallurutiup Imanga (Lancaster Sound), Nunavut, Canada. Mar. Policy 130, 104557 (2021).

    Google Scholar 

  54. 54.

    Stewart, R. E. A., Lesage, V., Lawson, J. W., Cleator, H. & Martin, K. A. Science technical review of the draft Environmental Impact Statement (EIS) for Baffinland’s Mary River Project (Canadian Science Advisory Secretariat, Fisheries and Oceans Canada, 2011).

    Google Scholar 

  55. 55.

    Heide-Jørgensen, M. P., Hansen, R. G., Westdal, K., Reeves, R. R. & Mosbech, A. Narwhals and seismic exploration: Is seismic noise increasing the risk of ice entrapments?. Biol. Conserv. 158, 50–54 (2013).

    Google Scholar 

  56. 56.

    Blackwell, S. B., Greene, C. R. & Richardson, W. J. Drilling and operational sounds from an oil production island in the ice-covered Beaufort Sea. J. Acoust. Soc. Am. 116, 3199–3211 (2004).

    ADS  PubMed  Google Scholar 

  57. 57.

    Yang, W. et al. Anthropogenic sound exposure-induced stress in captive dolphins and implications for cetacean health. Front. Mar. Sci. 8, 606736 (2021).

    Google Scholar 

  58. 58.

    Erbe, C. & Farmer, D. M. Zones of impact around icebreakers affecting beluga whales in the Beaufort Sea. J. Acoust. Soc. Am. 108, 1332–1340 (2000).

    ADS  CAS  PubMed  Google Scholar 

  59. 59.

    Heide-Jørgensen, M. P. et al. Behavioral response study on seismic airgun and vessel exposures in narwhals. Front. Mar. Sci. 8, 658173 (2021).

    Google Scholar 

  60. 60.

    Gillespie, D., Mellinger, D. K., Gordon, J. & Al, E. PAMGUARD: Semiautomated, open source software for real-time acoustic detection and localization of cetaceans. Proc. Inst. Acoust. 30, 54–62 (2008).

    Google Scholar 

  61. 61.

    Sakai, T. PAMpal: Load and process passive acoustic data. R package version 0.12.6. (2021).

  62. 62.

    R Core Team. R: A language and environment for statistical computing. R Foundation for Statistical Computing (2021).

  63. 63.

    Griffiths, E. T. et al. Detection and classification of narrow-band high frequency echolocation clicks from drifting recorders. J. Acoust. Soc. Am. 147, 3511–3522 (2020).

    ADS  PubMed  Google Scholar 

  64. 64.

    Baumann-Pickering, S., Wiggins, S. M., Hildebrand, J. A., Roch, M. A. & Schnitzler, H. Discriminating features of echolocation clicks of melon-headed whales (Peponocephala electra), bottlenose dolphins (Tursiops truncatus), and Gray’s spinner dolphins (Stenella longirostris longirostris). J. Acoust. Soc. Am. 128, 2212–2224 (2010).

    ADS  PubMed  Google Scholar 

  65. 65.

    Sakai, T. PAMpal standardClickCalcs. (2021).

  66. 66.

    Anderson, M. J. A new method for non-parametric multivariate analysis of variance. Austral Ecol. 26, 32–46 (2001).

    Google Scholar 

  67. 67.

    Anderson, M. J. Distance-based tests for homogeneity of multivariate dispersions. Biometrics 62, 245–253 (2006).

    MathSciNet  PubMed  MATH  Google Scholar 

  68. 68.

    Anderson, M. J. Permutational Multivariate Analysis of Variance (PERMANOVA). Wiley StatsRef Stat. Ref. Online (2017).

    Article  Google Scholar 

  69. 69.

    Pearson, K. On lines and planes of closest fit to systems of points in space. Philos. Mag. 2, 559–572 (1901).

    MATH  Google Scholar 

  70. 70.

    Lever, J., Krzywinski, M. & Altman, N. Principal component analysis. Nat. Methods 14, 641–642 (2017).

    CAS  Google Scholar 

  71. 71.

    Jackson, D. A. Stopping rules in principal components analysis: A comparison of heuristical and statistical approaches. Ecology 74, 2204–2214 (1993).

    Google Scholar 

  72. 72.

    Oksanen, J. et al. Vegan: Community ecology package. R package version 2.5-7. (2020).

  73. 73.

    Breiman, L. Random forests. Mach. Learn. 45, 5–32 (2001).

    MATH  Google Scholar 

  74. 74.

    Yang, L. et al. Description and classification of echolocation clicks of Indian Ocean humpback (Sousa plumbea) and Indo-Pacific bottlenose (Tursiops aduncus) dolphins from Menai Bay, Zanzibar, East Africa. PLoS One 15, e0230319 (2020).

    CAS  PubMed  PubMed Central  Google Scholar 

  75. 75.

    Archer, F. I., Rankin, S., Stafford, K. M., Castellote, M. & Delarue, J. Quantifying spatial and temporal variation of North Pacific fin whale (Balaenoptera physalus) acoustic behavior. Mar. Mammal Sci. 36, 224–245 (2020).

    Google Scholar 

  76. 76.

    Ross, J. C. & Allen, P. E. Random Forest for improved analysis efficiency in passive acoustic monitoring. Ecol. Inform. 21, 34–39 (2014).

    Google Scholar 

  77. 77.

    Liaw, A. & Wiener, M. Classification and regression by randomForest. R News 2, 18–22 (2002).

    Google Scholar 

  78. 78.

    Archer, E. rfPermute: Estimate permutation p-values for Random Forest importance metrics. R package version 2.5. (2021).

  79. 79.

    Gurevich, V. S. & Evans, W. E. Echolocation discrimination of complex planar targets by the Beluga whale (Delphinapterus leucas). J. Acoust. Soc. Am. 60, S5 (1976).

    ADS  Google Scholar 

  80. 80.

    Soldevilla, M. S. et al. Classification of Risso’s and Pacific white-sided dolphins using spectral properties of echolocation clicks. J. Acoust. Soc. Am. 124, 609–624 (2008).

    ADS  PubMed  Google Scholar 

  81. 81.

    Morisaka, T., Yoshida, Y., Akune, Y., Mishima, H. & Nishimoto, S. Exchange of ‘signature’ calls in captive belugas (Delphinapterus leucas). J. Ethol. 31, 141–149 (2013).

    Google Scholar 

  82. 82.

    Vergara, V., Michaud, R. & Barrett-Lennard, L. G. What can captive whales tell us about their wild counterparts? Identification, usage, and ontogeny of contact calls in belugas (Delphinapterus leucas). Int. J. Comp. Psychol. 23, 278–309 (2010).

    Google Scholar 

  83. 83.

    Vergara, V. & Mikus, M. A. Contact call diversity in natural beluga entrapments in an Arctic estuary: Preliminary evidence of vocal signatures in wild belugas. Mar. Mammal Sci. 35, 434–465 (2019).

    Google Scholar 

  84. 84.

    Panova, E. M. et al. Intraspecific variability in the ‘vowel’-like sounds of beluga whales (Delphinapterus leucas): Intra- and interpopulation comparisons. Mar. Mammal Sci. 32, 452–465 (2016).

    Google Scholar 

  85. 85.

    Ames, A. E., Blackwell, S. B., Tervo, O. M. & Heide-Jørgensen, M. P. Evidence of stereotyped contact call use in narwhal (Monodon monoceros) mother-calf communication. PLoS One 16, e0254393 (2021).

    CAS  PubMed  PubMed Central  Google Scholar 

  86. 86.

    Baumann-Pickering, S. et al. False killer whale and short-finned pilot whale acoustic identification. Endanger. Species Res. 28, 97–108 (2015).

    Google Scholar 

  87. 87.

    Halliday, W. D. et al. Potential exposure of beluga and bowhead whales to underwater noise from ship traffic in the Beaufort and Chukchi Seas. Ocean Coast. Manag. 204, 105473 (2021).

    Google Scholar 

  88. 88.

    Laidre, K. L., Jørgensen, O. A. & Treble, M. A. Deep-ocean predation by a high Arctic cetacean. ICES J. Mar. Sci. 61, 430–440 (2004).

    Google Scholar 

  89. 89.

    Laidre, K. L., Heide-Jørgensen, M. P., Dietz, R., Hobbs, R. C. & Jørgensen, O. A. Deep-diving by narwhals Monodon monoceros: Differences in foraging behavior between wintering areas?. Mar. Ecol. Prog. Ser. 261, 269–281 (2003).

    ADS  Google Scholar 

  90. 90.

    Lydersen, C. & Kovacs, K. M. A review of the ecology and status of white whales (Delphinapterus leucas) in Svalbard, Norway. Polar Res. 40, 5509 (2021).

    Google Scholar 

  91. 91.

    Hauser, D. D. W. et al. Regional diving behavior of Pacific Arctic beluga whales Delphinapterus leucas and possible associations with prey. Mar. Ecol. Prog. Ser. 541, 245–264 (2015).

    ADS  Google Scholar 

  92. 92.

    Ragen, T. J., Huntington, H. P. & Hovelsrud, G. K. Conservation of Arctic marine mammals faced with climate change. Ecol. Appl. 18, S166–S174 (2008).

    PubMed  Google Scholar 

  93. 93.

    Laidre, K. L. et al. Quantifying the sensitivity of Arctic marine mammals to climate-induced habitat change. Ecol. Appl. 18, S97–S125 (2008).

    PubMed  Google Scholar 

  94. 94.

    Heide-Jørgensen, M. P., Dietz, R., Laidre, K. L. & Richard, P. Autumn movements, home ranges, and winter density of narwhals (Monodon monoceros) tagged in Tremblay Sound, Baffin Island. Polar Biol. 25, 331–341 (2002).

    Google Scholar 

  95. 95.

    Hauser, D. D. W., Laidre, K. L., Suydam, R. S. & Richard, P. R. Population-specific home ranges and migration timing of Pacific Arctic beluga whales (Delphinapterus leucas). Polar Biol. 37, 1171–1183 (2014).

    Google Scholar 

  96. 96.

    Huntington, H. P. A preliminary assessment of threats to Arctic marine mammals and their conservation in the coming decades. Mar. Policy 33, 77–82 (2009).

    Google Scholar 

  97. 97.

    Gregersen, U., Hopper, J. R. & Knutz, P. C. Basin seismic stratigraphy and aspects of prospectivity in the NE Baffin Bay, Northwest Greenland. Mar. Pet. Geol. 46, 1–18 (2013).

    Google Scholar 

  98. 98.

    McCauley, R. D. et al. Widely used marine seismic survey air gun operations negatively impact zooplankton. Nat. Ecol. Evol. 1, 0195 (2017).

    Google Scholar 

Download references


We are grateful to Niaqornat’s community members and Greenland Institute of Natural Resources for their local support, Mikkel Villum Jensen for his help in the field, Air Greenland pilot Geir Akse for safe field operations, and the University of Tübingen and the German Oceanographic Museum Stralsund for providing recording equipment. We thank Jessica Crance, Taiki Sakai, and Dan Ovando for their support in implementing our analyses. We also thank Jay Barlow and Angela Szesciorka for providing R script and Pina Gruden for providing Matlab code. Funding was provided by the U.S. Office of Naval Research on K. L. Laidre’s grant (N00014-11-1-0201). The School of Aquatic and Fishery Sciences, University of Washington, and the Vetlesen Foundation supported M. J. Zahn.

Author information




K.L.L. conceived the project and lead all fieldwork operations. K.L.L. and M.H.R. acquired funding for data acquisition and J.C.K. provided acoustic recording equipment. K.L.L., M.H.R., and J.C.K. collected data. Acoustic data were analyzed by M.J.Z with input from S.R., J.L.K.M., and F.A. The first draft of the manuscript was written by M.J.Z. All coauthors improved the manuscript.

Corresponding author

Correspondence to Marie J. Zahn.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit

Reprints and Permissions

About this article

Verify currency and authenticity via CrossMark

Cite this article

Zahn, M.J., Rankin, S., McCullough, J.L.K. et al. Acoustic differentiation and classification of wild belugas and narwhals using echolocation clicks. Sci Rep 11, 22141 (2021).

Download citation

  • Received:

  • Accepted:

  • Published:

  • DOI:

Further reading


By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.


Quick links

Nature Briefing

Sign up for the Nature Briefing newsletter — what matters in science, free to your inbox daily.

Get the most important science stories of the day, free in your inbox. Sign up for Nature Briefing