An underwater observation dataset for fish classification and fishery assessment

McCann, Erin; Li, Liling; Pangle, Kevin; Johnson, Nicholas; Eickholt, Jesse

doi:10.1038/sdata.2018.190


Data Descriptor
Open access
Published: 09 October 2018

Erin McCann¹, Liling Li², Kevin Pangle¹, Nicholas Johnson³ & Jesse Eickholt¹,²

Scientific Data volume 5, Article number: 180190 (2018)

Abstract

Using Dual-Frequency Identification Sonar (DIDSON), fishery acoustic observation data were collected from the Ocqueoc River, a tributary of Lake Huron in northern Michigan, USA. Data were collected March through July 2013 and 2016 and included the identification, via technology or expert analysis, of eight fish species as they passed through the DIDSON’s field of view. A set of short DIDSON clips containing identified fish was curated. Additionally, two other datasets were created that include visualizations of the acoustic data and longer DIDSON clips. These datasets could complement future research characterizing the abundance and behavior of valued fishes such as walleye (Sander vitreus) or white sucker (Catostomus commersonii) or invasive fishes such as sea lamprey (Petromyzon marinus) or European carp (Cyprinus carpio). Given the abundance of DIDSON data and the fact that a portion of it is labeled, these data could aid in the creation of machine learning tools from DIDSON data, particularly for invasive sea lamprey, which are amply represented in the data and are a destructive invader of the Laurentian Great Lakes.

Design Type(s)	observation design • time series design • biodiversity assessment objective
Measurement Type(s)	Species
Technology Type(s)	acoustic camera
Factor Type(s)	temporal_interval
Sample Characteristic(s)	Catostomus commersonii • Cyprinus carpio • Esox lucius • Micropterus dolomieu • Micropterus salmoides • Oncorhynchus mykiss • Petromyzon marinus • Sander vitreus • Ocqueoc River • freshwater river biome

Machine-accessible metadata file describing the reported data (ISA-Tab format)

Background & Summary

Dual-frequency Identification Sonar (DIDSON) is an underwater acoustic camera that has become increasingly popular in fisheries science for monitoring fish abundance and behavior in rivers and lakes¹. For example, DIDSON has been used to determine migration timing², quantify fish length^3,4 and estimate abundance of Chinook salmon (Oncorhynchus tshawytscha)^5,6, sockeye salmon (Oncorhynchus nerka)^2,6 and American eel (Anguilla rostrata)⁷. These studies used DIDSON instead of, or in conjunction with, traditional fisheries assessment tools and benefited in two main ways. First, DIDSON captures video-like imagery in a variety of environments, including those with high turbidity and low light. Second, DIDSON allows continuous monitoring without manipulation of the studied organism. This is in contrast to other fisheries assessment methods such as telemetry, which require the capture and subsequent release of fish; because tagged fish were manipulated, inferences about their natural behavior are limited.

DIDSON generates a significant amount of data, and processing these data efficiently has posed challenges when DIDSON is used to assess fish abundance and behavior. Manual processing of the data with traditional tools is often impractical, and the development of algorithms for semi-automated processing has made it easier to handle these large amounts of data^3,4,8,9. Nevertheless, fully automated fish identification remains very challenging because many fish have similar body shapes and sizes and are difficult to distinguish in a DIDSON image. Presented here is a DIDSON dataset, along with the methods for collecting and processing DIDSON data, that could be used to create automated fish identification systems from DIDSON images. The purpose of releasing these DIDSON data is to provide the community with data that can be used to create and evaluate tools. The data are presented in their rawest form (i.e., acoustic data as collected from the DIDSON device), as well as in a binary format that contains images of the visualized acoustic data. Having the data in multiple formats makes them readily available for use with either existing software such as Sound Metrics or with community-developed custom tools. This dataset could be especially useful for helping future researchers and managers characterize the migration timing and abundance of valued and invasive fishes present in the Great Lakes and throughout North America.

Methods

Collecting DIDSON Data

DIDSON data were collected near the mouth of the Ocqueoc River, a tributary to Lake Huron located in Presque Isle County in northern Michigan, USA. The river is approximately 55 km long and has a 412 km² drainage basin with an average daily discharge of 2.8 m³/s. A DIDSON camera (Standard Version 300 m) was deployed in high frequency mode (1.8 MHz) from March 20 to July 2, 2013 and March 11 to July 22, 2016. During these times, a sampling window of 10 m was used to ensure relatively high image quality for accurate fish identification when manually viewed using the Sound Metrics (V5) software. In both 2013 and 2016, the DIDSON was mounted within a welded aluminum frame to protect it from river debris. Steel chain and stainless steel hose clamps were used to secure the aluminum frame.

In 2013, the DIDSON was positioned at the river mouth on the right bank (referenced looking downstream) where the channel was 14 m wide and averaged 0.9 m deep at baseflow. The field of view was positioned horizontally and perpendicular to stream flow at a downward tilt of −7.8 degrees with a viewing window from 0.5 m to 10.5 m. The channel morphology within the field of view was flat with a primarily sandy substrate and the water velocity ranged from 0.20 to 0.50 m³/s throughout the field season.

In 2016, to avoid issues with rising water levels in Lake Huron, the DIDSON camera was positioned approximately 0.28 km upstream of the river mouth, 0.5 m from the left bank (referenced looking downstream). The channel at this location was approximately 23 m wide and averaged 2 m deep at baseflow. The field of view was positioned horizontally and perpendicular to stream flow at a downward tilt of −9.3 degrees with a viewing window from 2.5 m to 12.5 m. The channel morphology within the viewing window was flat with a primarily sandy substrate and the water velocity ranged from 0.10 to 0.30 m³/s throughout the field season.

Extracting and Labeling Targets

Identifying and extracting targets for a variety of fish species within the DIDSON images is an important step towards gathering training data. Such training data could be used to develop algorithms and classifiers that automate the processing of DIDSON data. Target images were collected primarily for three abundant fish species known to inhabit tributaries of northern Michigan, namely invasive sea lamprey (Petromyzon marinus), invasive European carp (Cyprinus carpio) and native white sucker (Catostomus commersonii). To obtain correct identification of these species in the DIDSON data, a combination of passive integrated transponder (PIT) tag systems and underwater video cameras was used. PIT technology can detect fish at any time of day, but it can only detect fish that were previously captured, tagged and released. PIT technology is commonly used to track sea lamprey movement and behavior in streams^10,11. Video cameras imaged all fishes swimming in front of the camera regardless of whether they carried a tag, but video observation was limited to daylight hours and was therefore selective for diurnal and crepuscular fishes.

During the 2013 DIDSON deployment in the Ocqueoc River, a PIT system (Oregon RFID, Portland, Oregon) was installed within the DIDSON’s field of view. Specifically, two PIT antennas were deployed in the field of view: the first 0 m to 7 m from the DIDSON and the second 7 m to 14 m from the DIDSON. Deploying two antennas allowed us to determine approximately where the tagged sea lamprey were located in the DIDSON image. PIT-tagged sea lamprey were released in the river plume, approximately 30 m downstream of the deployment site, by placing them in a cage and then opening the cage door after 4 h of acclimation. 50 males and 50 females were released during May 29, May 31, June 5, June 6, June 10, and June 11. When tagged fish swam through the DIDSON view, they were detected by the PIT antenna. Because the PIT system and the DIDSON were time synchronized, these detections were then cross-referenced with the DIDSON footage using the Sound Metrics (V5) software to observe known targets. Once known targets were identified within the DIDSON image, a short clip of 5 to 20 s containing the known target was created. To supplement the number of examples of sea lamprey, additional DIDSON data were viewed using the Sound Metrics software and additional clips were generated based on expert analysis (see Technical Validation section for full details).
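The way two antennas narrow down a target's position can be illustrated with a short sketch. It intersects each antenna's stated coverage with the 2013 viewing window (0.5 m to 10.5 m) to give the range band in which a detected fish would be expected in the DIDSON image. The antenna identifiers 1 and 2 and the function name are hypothetical; the source does not describe how the PIT system labels its antennas.

```python
# Coverage of each PIT antenna, in metres from the DIDSON (from the text).
# The antenna ids 1 and 2 are hypothetical labels.
ANTENNA_RANGE_M = {1: (0.0, 7.0), 2: (7.0, 14.0)}

# Viewing window used for the 2013 deployment, in metres.
VIEW_WINDOW_M = (0.5, 10.5)


def expected_range(antenna_id):
    """Return the (near, far) band of the image where a detection should appear.

    The band is the overlap between the antenna coverage and the viewing
    window; None means the antenna lies entirely outside the window.
    """
    lo, hi = ANTENNA_RANGE_M[antenna_id]
    lo = max(lo, VIEW_WINDOW_M[0])
    hi = min(hi, VIEW_WINDOW_M[1])
    if lo >= hi:
        return None
    return lo, hi


near_band = expected_range(1)
far_band = expected_range(2)
```

Under these assumptions, a detection on the second antenna only constrains the target to the part of its coverage that falls inside the viewing window.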

The 2016 Ocqueoc River DIDSON deployment site was deeper and wider than the 2013 deployment site, prohibiting the use of a PIT system to obtain known targets. Therefore, during this field season, a different technique for obtaining known fish targets was used. Two video cameras were mounted approximately 1 m above the river bottom within the DIDSON field of view to ensure complete overlap of the video camera views with the DIDSON view. The cameras were positioned approximately 3 m apart (at 6 m and 9 m from the DIDSON) to minimize their overlap, which maximized the combined video camera viewing area. Both video cameras faced the same direction as the DIDSON and collected continuous video data for 2 months. Figure 1 shows the placement of the DIDSON and video cameras. These video data were manually inspected using VLC video monitoring software. Known targets identified within the video data were then cross referenced with the DIDSON data (video and DIDSON were time synchronized). Once known targets were identified within the DIDSON image, a short clip of 5 to 20 seconds containing the known target was created. Figure 2 is a sample of the visualized DIDSON acoustic data.

**Figure 1: Placement of the DIDSON and video cameras for the 2016 Ocqueoc River DIDSON deployment.**

**Figure 2: Image extracted from DIDSON data showing a sea lamprey at approximately 9.5 m from the camera.**

Converting DIDSON Acoustic Data to Video

The DIDSON is an acoustic recording device and as a result the raw data cannot be viewed directly. The data are in a binary format that starts with 512 bytes of metadata and is followed by the values received from an acoustic array. These data are segmented by frames (i.e., a set of readings of the acoustic array at a particular time) and each frame starts with a header that contains information such as frame number, a time stamp, transmission mode, etc. Details on the DIDSON data file structure (i.e., how to interpret the binary data produced by a DIDSON device) were obtained by contacting the Sound Metrics corporation. The acoustic data represent reflectance of acoustic waves at particular headings and times and can be visualized by transforming them to Cartesian coordinates and pixel intensities¹². This conversion process can be completed using proprietary software such as Sound Metrics, but such software limits how the underlying data can be accessed. These limitations are generally two-fold. First, existing software is typically limited in terms of extensibility. This is to say that it can be difficult to incorporate new processing algorithms (e.g., different filters, automated object trackers, classifiers). Second, these existing tools were not designed for parallel processing or batch processing. Analysis of large amounts of data necessitates tools that can scale and, as a result, support parallel processing.

To convert the acoustic data to visual data, software such as Sound Metrics can be used to export the data as a video. Alternatively, the acoustic data can be converted to visual data (i.e., a time series of grayscale 2D images) using available Matlab code (see https://github.com/nilsolav/ARISreader). This software was released by Handegard and Williams¹³ and is capable of converting the acoustic data into Cartesian coordinates¹². This existing code can be adapted to save the image data in a raw, binary form by representing it as a three-dimensional array with the third dimension being time. To facilitate the development of custom tools amenable to distributed storage and processing, the binary data were stored in a SequenceFile (i.e., an object in Hadoop for storing key-value pairs). With the image data on hand, standard image and video processing procedures can readily be applied. A Java program is provided to view the SequenceFiles and can be modified to store the data in other formats.
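The transformation from acoustic readings to an image can be sketched as follows. This is a minimal nearest-neighbour illustration, not the method used by Sound Metrics or ARISreader. It assumes a frame is available as a grid of intensities indexed by range sample and beam, and that the beams are evenly spread over the field of view. The 29 degree field of view is an assumed value commonly quoted for DIDSON high frequency mode and is not taken from this article; the toy frame is far smaller than a real one.

```python
import math


def polar_to_cartesian(frame, r_min, r_max, fov_deg, out_w, out_h):
    """Map a range-by-beam frame onto a Cartesian grayscale image.

    frame[s][b] is the intensity (0-255) for range sample s and beam b.
    Each output pixel takes the value of the nearest sample; pixels that
    fall outside the sonar fan are left at 0.
    """
    n_samples = len(frame)
    n_beams = len(frame[0])
    half_fov = math.radians(fov_deg) / 2.0
    x_extent = r_max * math.sin(half_fov)  # half-width of the fan, metres
    image = [[0] * out_w for _ in range(out_h)]
    for row in range(out_h):
        # Row 0 is the far edge of the window; the last row is nearest the sonar.
        y = r_max * (1.0 - (row + 0.5) / out_h)
        for col in range(out_w):
            x = x_extent * (2.0 * (col + 0.5) / out_w - 1.0)
            r = math.hypot(x, y)       # distance from the sonar
            theta = math.atan2(x, y)   # bearing from the sonar axis
            if not (r_min <= r < r_max) or abs(theta) >= half_fov:
                continue
            s = int((r - r_min) / (r_max - r_min) * n_samples)
            b = int((theta + half_fov) / (2.0 * half_fov) * n_beams)
            image[row][col] = frame[min(s, n_samples - 1)][min(b, n_beams - 1)]
    return image


# Toy frame: 4 range samples, 3 beams with intensities 0, 100 and 200.
frame = [[0, 100, 200] for _ in range(4)]
image = polar_to_cartesian(frame, r_min=0.5, r_max=10.5, fov_deg=29.0,
                           out_w=21, out_h=21)
```

Stacking the images produced for successive frames gives the three-dimensional array described above, with time as the third dimension. A production converter would typically interpolate between neighbouring samples rather than take the nearest one.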

Code Availability

Danner, T., Li, L., & Eickholt, J., Source code for viewing SequenceFiles, available at Open Science Framework (Data Citation 1).

Source code is available on the Open Science Framework to view the SequenceFiles created from the raw DIDSON data. The code is packaged with instructions for use. The intent of the code is to illustrate how the visualized DIDSON data is stored in the SequenceFiles and how it can be accessed. As a utility the code is licensed freely for individual, academic or commercial research. It may not be repacked or sold without written permission. Full licensing details are included with the source code.

Data Records

The data are provided in two formats: the raw acoustic data and the binary visualizations. The rationale for the multiple formats is that in some use cases it may be preferable to work with existing software (e.g., Sound Metrics) that is capable of viewing DIDSON data. In this case the acoustic data are needed. In other use cases, it may be desirable to work with the data using visual processing tools and this would require image data (i.e., the visualized acoustic data). Having the data readily available in a binary format ensures that they can be accessed without the need to convert them from their acoustic form.

The raw DIDSON data are available at Figshare under the collection name DIDSONRawFishDatasets.zip (Data Citation 2). Binary visualization data are available at the Open Science Framework under the names SEQVisualizedFishDatsets-PartI.zip, SEQVisualizedFishDatsets-PartII.zip, SEQVisualizedFishDatsets-PartIII.zip and SEQVisualizedFishDatsets-PartIV.zip (Data Citation 3). In sum, the dataset includes three subsets and supporting files.

Raw DIDSON Dataset

The raw DIDSON dataset contains the original data collected by the DIDSON device on the Ocqueoc River (denoted as the “raw_DIDSON” directory). This directory contains two subdirectories that separate the data by year (i.e., OC13 and OC16 for collection in 2013 and 2016, respectively). The naming of the raw DIDSON data follows the pattern yyyy-mm-dd_hhmmss_HF.ddf, which encodes the year, month, day and start time of collection. Each file is 30 min in duration. The extension on these files is “ddf” and they may be viewed with Sound Metrics software or with existing, community licensed software (e.g., https://github.com/nilsolav/ARISreader). Note that not all of the data collected through the DIDSON deployments are contained here. The raw data presented are limited to the files that contained identifiable targets and from which subsequent clips were generated. In total, the raw DIDSON dataset contains 105 raw DIDSON files from 2013 and 95 from 2016. These data represent approximately 100 h of data collection (i.e., around 4 days of continuous collection), much less than was collected over the multiple month deployments. From these 100 h of DIDSON data, 524 clips with known targets were extracted.
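Because the start time is encoded in each filename and every file spans 30 min, a time-stamped observation (for example a PIT detection) can be matched to the file that covers it. The sketch below illustrates this under the stated naming pattern; the filenames and the detection time are made up for illustration, and the function names are not part of any released tool.

```python
from datetime import datetime, timedelta

FILE_DURATION = timedelta(minutes=30)  # stated duration of each raw file


def parse_start(filename):
    """Return the collection start time encoded in a raw DIDSON filename."""
    stamp = filename[:len("yyyy-mm-dd_hhmmss")]  # e.g. 2013-05-29_213000
    return datetime.strptime(stamp, "%Y-%m-%d_%H%M%S")


def locate(event_time, filenames):
    """Find the raw file whose 30 min window contains event_time.

    Returns (filename, offset_in_seconds), or None if no file covers it.
    """
    for name in filenames:
        start = parse_start(name)
        if start <= event_time < start + FILE_DURATION:
            return name, (event_time - start).total_seconds()
    return None


# Hypothetical filenames that follow the documented pattern.
files = ["2013-05-29_213000_HF.ddf", "2013-05-29_220000_HF.ddf"]
hit = locate(datetime(2013, 5, 29, 22, 4, 30), files)

# 105 files from 2013 plus 95 from 2016, each 30 min long.
total_hours = (105 + 95) * FILE_DURATION.total_seconds() / 3600
```

The last line reproduces the arithmetic behind the figure of approximately 100 h: 200 files of 30 min each.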

This folder also contains spreadsheets that describe the location of known fish in the raw DIDSON data. In 2013, the focus was on sea lamprey and in 2016 other species were also identified. As a result, all targets listed in the spreadsheets for 2013 correspond to sea lamprey. The spreadsheet for 2016 states the species of the target. All spreadsheets link the name of the longer source file (i.e., the raw 30 min data file) to a smaller clip and provide details about the temporal and spatial location of the target. Spreadsheet entries for sea lamprey targets that were identified by expert analysis of the DIDSON data also include a certainty rating. This value was chosen by the expert to express confidence in the target’s identification, with 3 indicating the highest level of confidence in the identification of a sea lamprey and 1 indicating the lowest.

Raw DIDSON Clips

The raw DIDSON clips dataset (denoted as “raw_DIDSON_clips”) contains smaller clips of raw acoustic data that contain identified fish. These fish were identified by PIT tags, video surveillance or an expert, and the clips are separated by both year of collection and identification method (i.e., the subdirectories are named OC13-by-Expert, OC13-by-PIT and OC16-by-VIDEO). Each subdirectory is further broken down by the species of fish that the clips contain. The naming of the raw DIDSON clips follows the template yyyy-mm-dd_hhmmss_HF-S###.ddf, which again encodes the year, month, day and start time of the source file. The ### is an added clip identifier that is unique within each source file and can be used with the accompanying spreadsheets to reference the location of known fish in the clip. These clips were generated using the Sound Metrics software and, as raw DIDSON data, can be viewed using the aforementioned tools. Table 1 summarizes the number of DIDSON clips with known fish by species.
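The directory and file naming conventions carry enough information to label a clip without opening it. The following sketch extracts the year, identification method, species, source file and clip identifier from a clip path. The species directory name used in the example is hypothetical, as the article does not list the directory names, and the number of digits in the clip identifier is assumed to vary.

```python
import re
from pathlib import PurePosixPath

# yyyy-mm-dd_hhmmss_HF-S###.ddf, where ### is the clip identifier.
CLIP_NAME = re.compile(
    r"^(?P<source>\d{4}-\d{2}-\d{2}_\d{6}_HF)-S(?P<clip>\d+)\.ddf$"
)
# Subdirectories such as OC13-by-PIT or OC16-by-VIDEO.
SUBDIR = re.compile(r"^OC(?P<yy>\d{2})-by-(?P<method>\w+)$")


def describe_clip(path):
    """Derive labels for a clip from its location in raw_DIDSON_clips."""
    parts = PurePosixPath(path).parts
    subdir, species, name = parts[-3], parts[-2], parts[-1]
    d = SUBDIR.match(subdir)
    c = CLIP_NAME.match(name)
    if d is None or c is None:
        raise ValueError(f"unexpected clip path: {path}")
    return {
        "year": 2000 + int(d.group("yy")),
        "method": d.group("method"),
        "species": species,
        "source_file": c.group("source") + ".ddf",
        "clip_id": c.group("clip"),
    }


# Hypothetical path; "sea_lamprey" is an assumed directory name.
info = describe_clip(
    "raw_DIDSON_clips/OC13-by-PIT/sea_lamprey/2013-06-05_230000_HF-S012.ddf"
)
```

The recovered source filename and clip identifier are the two values needed to look up a clip in the accompanying spreadsheets.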

Table 1 The number of raw DIDSON clips and SequenceFiles by fish species contained in the datasets.

SequenceFile Dataset

The SequenceFile dataset contains a binary representation of the visualized raw acoustic data. As the raw DIDSON data represent sets of readings of the acoustic array at a particular time, they must be converted to pixel intensities on a Cartesian plane to form an image, and several images in a series form a video. One means of representing a video is with a three-dimensional array of unsigned bytes. Two of the dimensions represent an image and the third dimension represents the frame. Each element in the array represents a pixel’s intensity. The images here are grayscale and do not contain color information.

The container chosen to hold the converted video data was a SequenceFile. A SequenceFile is a data structure that is part of the Hadoop application programming interface (API) for storing binary files containing a series of key-value pairs. For the converted DIDSON data, the key is a String (i.e., text) that contains the source filename and the range of corresponding frames, and the value is a 1D array of bytes (i.e., a flattened representation of the 3D array prepended with a 4 byte header containing the width and height of each frame). Each raw DIDSON file was segmented into sets of up to 200 frames and each set became a key-value pair (i.e., a record) in a SequenceFile. Consecutive records overlap by 15 frames. This overlap allows video processing algorithms that operate over several frames to work in a distributed setting. In some applications the overlapping frames of each segment may need to be removed (i.e., ignore the first 15 frames). Figure 3 illustrates the order in which the raw DIDSON data were converted to a SequenceFile. The pixels in the first frame were encoded row by row before moving to the subsequent frames.
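The record layout described above can be unpacked with a few lines of code once the value bytes of a record are in hand. The sketch below is an illustration only: it assumes the 4 byte header holds two big-endian unsigned 16-bit integers (width, then height). The article states only that the header contains the width and height, so the exact layout should be confirmed against the SequenceViewer source before relying on it.

```python
import struct

OVERLAP = 15  # frames shared between consecutive records (from the text)


def decode_record(value, drop_overlap=False):
    """Split the value bytes of one record into a list of frames.

    ASSUMPTION: the 4 byte header is two big-endian unsigned 16-bit
    integers, width followed by height. Each returned frame is a list of
    rows, and each row is a bytes object of pixel intensities.
    """
    width, height = struct.unpack(">HH", value[:4])
    pixels = value[4:]
    frame_size = width * height
    if frame_size == 0 or len(pixels) % frame_size != 0:
        raise ValueError("payload is not a whole number of frames")
    frames = []
    for f in range(len(pixels) // frame_size):
        start = f * frame_size
        # Pixels are stored row by row within a frame, frame after frame.
        rows = [pixels[start + r * width:start + (r + 1) * width]
                for r in range(height)]
        frames.append(rows)
    return frames[OVERLAP:] if drop_overlap else frames


# Synthetic record: two frames, each 3 pixels wide and 2 pixels high.
record = struct.pack(">HH", 3, 2) + bytes(range(12))
frames = decode_record(record)
```

Passing `drop_overlap=True` discards the first 15 frames of a record, which is the removal step mentioned above. Whether the first record of each source file should also be trimmed is not stated in the text and is left to the caller.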

The SequenceFile dataset was split into 4 parts: SEQVisualizedFishDatsets-PartI.zip, SEQVisualizedFishDatsets-PartII.zip, SEQVisualizedFishDatsets-PartIII.zip and SEQVisualizedFishDatsets-PartIV.zip (Data Citation 3). PartI contains the visualized data from 2013 and PartsII-IV contain the visualized data from 2016. Collectively, the contents of these 4 archives form the SequenceFile directory. As it was directly converted from the raw DIDSON clips dataset, it follows the same hierarchy. A SequenceFile can be displayed using the source code for viewing SequenceFiles. A SequenceFile and its original ddf file should yield the same video display. Note that while the SequenceFile format is part of the Hadoop API, a Hadoop cluster is not required to access the data. The viewer program referenced above can easily be modified to convert the data to other formats.

Supporting Files

This folder contains two videos in AVI format of raw DIDSON files that have been converted to video. These can be played by any AVI compatible player. The videos are around 20 minutes in length, contain a number of fish and illustrate the quality of the raw data when visualized.

Technical Validation

To collect known fish targets within the DIDSON field of view, DIDSON data were cross-referenced with PIT system data in 2013 and video camera data in 2016. A limitation of using video cameras to identify fish rather than a PIT system is that fish were only visible during daylight hours, even when infrared lights on the video cameras were activated at night. While use of video cameras provided a great opportunity to collect many known targets of white suckers and carp, they did not allow for the collection of very many sea lamprey targets because sea lamprey are nocturnal, and therefore were rarely observed during the day.

To supplement the known sea lamprey targets gathered using PIT detections (n = 65) and video data (n = 3), a trained expert manually reviewed a subset of the DIDSON data using the Sound Metrics (V5) software. Prior to manual processing, known sea lamprey targets that were gathered using the PIT system validation technique were used to train the expert reviewer, as well as test the reviewer’s accuracy at identifying sea lamprey during manual inspection of the DIDSON data. The blind reviewer watched 50 clips of DIDSON footage containing a combination of known sea lamprey images, non-target fish images, and blank images that contained only background noise. These clips were randomly selected from all days of the 2013 DIDSON deployment as well as from any time of day throughout a 24-hour period. This allowed us to evaluate whether the expert reviewer’s ability to detect sea lamprey changed over time. Of the 20 clips that contained known sea lamprey images, the expert reviewer correctly identified 19 (95% accuracy) of them, and none of the remaining 30 clips were falsely identified as containing sea lamprey.
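The reviewer test can be summarized as a two-by-two table: 19 of 20 lamprey clips identified, 1 missed, and 0 of 30 other clips falsely identified. The short sketch below works through the rates that follow from those counts. The function and variable names are illustrative.

```python
def rates(true_pos, false_neg, false_pos, true_neg):
    """Return (sensitivity, specificity, overall accuracy) from clip counts."""
    total = true_pos + false_neg + false_pos + true_neg
    sensitivity = true_pos / (true_pos + false_neg)
    specificity = true_neg / (true_neg + false_pos)
    accuracy = (true_pos + true_neg) / total
    return sensitivity, specificity, accuracy


# Counts taken from the reviewer test described above.
sens, spec, acc = rates(true_pos=19, false_neg=1, false_pos=0, true_neg=30)
```

With these counts the 95% figure reported above is the proportion of known sea lamprey clips that were identified (19 of 20), while the proportion of all 50 clips judged correctly is 49 of 50.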

Usage Notes

In general, the DIDSON data can be used to develop or evaluate tools that characterize the abundance or behavior of fish. Large amounts of data from extended deployments are provided that can be used to develop tools for unsupervised tasks such as signal filtering and object tracking. The data also contain label information with the position of known targets (e.g., sea lamprey). This label information can be used to develop supervised machine learning tools such as species-specific classifiers. By providing the data in multiple formats (i.e., raw acoustic data or binary visualizations contained in SequenceFiles), it is possible to work with the data through existing software in their raw format or to work immediately with the visualized data using image and video processing tools. Existing software is ill-suited to handling large amounts of DIDSON data or to programmatically supporting custom user analyses. The purpose of releasing these DIDSON data is to provide the community with data that can be used to create and evaluate tools that can handle large amounts of data or perform custom analyses. Classifiers could be used on the existing dataset or future DIDSON datasets to better characterize the abundance and behavior of many fish species in the Great Lakes or elsewhere.

Standard DIDSON software suites such as Sound Metrics or community licensed software such as the ARISreader can be used to view and access the raw DIDSON data. The visualized data created from the raw DIDSON clips can be viewed and accessed using the SequenceViewer program (Data Citation 1). This program is written in Java and can readily be modified to export the underlying image data to other formats. The SequenceFiles can also be directly accessed through the Hadoop API. To develop training and evaluation data for classifiers, the included spreadsheets can be used to pinpoint the location of specific species of fish.

The amount of labeled DIDSON data presented here may not be sufficient to develop machine learning tools without the aid of newer techniques that leverage commonalities among image classification tasks. Transfer learning reuses feature extractors developed on much larger labeled image sets and then repurposes them through a refining process¹⁴. This effectively allows large, accurate classifiers to be developed even when only a small number of labeled images are available¹⁵. Additionally, through random perturbations of images (e.g., shifts, rotations, amplifications), it is possible to create additional presentations of labeled data that can be used for model construction¹⁶.
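Random perturbation can be illustrated on a single grayscale frame. The sketch below applies a random shift and a random amplification, two of the perturbations named above, and omits rotation for brevity. The shift limits and gain range are arbitrary illustrative values, not recommendations from the authors.

```python
import random


def shift(frame, dx, dy, fill=0):
    """Translate a frame by (dx, dy) pixels, padding exposed pixels with fill."""
    h, w = len(frame), len(frame[0])
    out = [[fill] * w for _ in range(h)]
    for y in range(h):
        for x in range(w):
            sy, sx = y - dy, x - dx
            if 0 <= sy < h and 0 <= sx < w:
                out[y][x] = frame[sy][sx]
    return out


def amplify(frame, gain):
    """Scale intensities by gain, clipping to the 0-255 range of unsigned bytes."""
    return [[max(0, min(255, round(p * gain))) for p in row] for row in frame]


def augment(frame, rng, max_shift=2, gain_range=(0.8, 1.2)):
    """Produce one randomly shifted and amplified copy of a labeled frame."""
    dx = rng.randint(-max_shift, max_shift)
    dy = rng.randint(-max_shift, max_shift)
    gain = rng.uniform(*gain_range)
    return amplify(shift(frame, dx, dy), gain)


frame = [[0, 50, 100], [150, 200, 250]]
variants = [augment(frame, random.Random(seed)) for seed in range(4)]
```

Each variant keeps the label of the frame it was derived from, so a small set of labeled clips can yield a larger training set. Seeding the generator makes the perturbations reproducible.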

The SequenceViewer program (Data Citation 1) is a Java program that illustrates how the records in a SequenceFile (i.e., the individual images in a video stream) can be accessed. Each image is stored as a series of bytes that represent the intensity of a grayscale image and the SequenceViewer program simply uses this data to display an image on the screen. Instead of displaying the image, the data could be saved in a different binary format, encoded into video or used as the input to a classifier or other tool.
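As one example of saving the data in a different binary format, a single grayscale frame can be written as a binary PGM (Netpbm "P5") image, which many image tools read directly. The sketch assumes a frame is already available as a list of equal-length rows of byte values; how the frame is obtained from a SequenceFile record is outside its scope, and PGM is an illustrative choice rather than a format used by the authors.

```python
def frame_to_pgm(rows):
    """Encode one grayscale frame as the bytes of a binary PGM (P5) image.

    rows is a list of equal-length sequences of intensities in 0-255.
    """
    height = len(rows)
    width = len(rows[0])
    if any(len(row) != width for row in rows):
        raise ValueError("all rows must have the same length")
    header = f"P5\n{width} {height}\n255\n".encode("ascii")
    return header + b"".join(bytes(row) for row in rows)


# A tiny 3 x 2 frame used only to show the encoding.
frame = [bytes([0, 128, 255]), bytes([10, 20, 30])]
pgm = frame_to_pgm(frame)
```

The returned bytes can be written to a file opened in binary mode, one file per frame, after which standard image and video tools can take over.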

Additional information

How to cite this article: McCann, E. et al. An underwater observation dataset for fish classification and fishery assessment. Sci. Data. 5:180190 doi: 10.1038/sdata.2018.190 (2018).

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

References

1. Moursund, R. A., Carlson, T. J. & Peters, R. D. A fisheries application of a dual-frequency identification sonar acoustic camera. ICES Journal of Marine Science 60 (3), 678–683 (2003).
2. Holmes, J. A., Cronkite, G. M., Enzenhofer, H. J. & Mulligan, T. J. Accuracy and precision of fish-count data from a “dual-frequency identification sonar” (DIDSON) imaging system. ICES Journal of Marine Science 63 (3), 543–555 (2006).
3. Boswell, K. M., Wilson, M. P. & Cowan, J. H. Jr. A semiautomated approach to estimating fish size, abundance, and behavior from dual-frequency identification sonar (DIDSON) data. North American Journal of Fisheries Management 28 (3), 799–807 (2008).
4. Burwen, D. L., Fleischman, S. J. & Miller, J. D. Accuracy and precision of salmon length estimates taken from DIDSON sonar images. Transactions of the American Fisheries Society 139 (5), 1306–1314 (2010).
5. Tiffan, K. F., Rondorf, D. W. & Skalicky, J. J. Imaging fall chinook salmon redds in the Columbia river with a dual-frequency identification sonar. North American Journal of Fisheries Management 24 (4), 1421–1426 (2004).
6. Mueller, A., Burwen, D. L., Boswell, K. M. & Mulligan, T. Tail-beat patterns in dual-frequency identification sonar echograms and their potential use for species identification and bioenergetics studies. Transactions of the American Fisheries Society 139 (3), 900–910 (2010).
7. Mueller, A., Mulligan, T. & Withler, P. K. Classifying sonar images: can a computer-driven process identify eels? North American Journal of Fisheries Management 28 (6), 1876–1886 (2008).
8. Kang, M. Semiautomated analysis of data from an imaging sonar for fish counting, sizing, and tracking in a post-processing application. Fisheries and Aquatic Sciences 14 (3), 218–225 (2011).
9. Li, L., Danner, T., Eickholt, J., McCann, E., Pangle, K. & Johnson, N. A distributed pipeline for DIDSON data processing. In Proceedings of the 2017 IEEE International Conference on Big Data (Big Data) 4301–4306 (2017).
10. Johnson, N. S., Yun, S., Thompson, H. T., Brant, C. O. & Li, W. A synthesized pheromone induces upstream movement in female sea lamprey and summons them into traps. Proceedings of the National Academy of Sciences 106 (4), 1021–1026 (2009).
11. Johnson, N. S. et al. A portable trap with electric lead catches up to 75% of an invasive fish species. Scientific Reports 6, 28430 (2016).
12. Negahdaripour, S. Calibration of DIDSON forward-scan acoustic video camera. In Proceedings of MTS/IEEE OCEANS 1287–1294 (2005).
13. Handegard, N. O. & Williams, K. Automated tracking of fish in trawls using the DIDSON (Dual frequency IDentification SONar). ICES Journal of Marine Science 65 (4), 636–644 (2008).
14. Shin, H. C. et al. Deep convolution neural networks for computer-aided detection: CNN architectures, dataset characteristics, and transfer learning. IEEE Transactions on Medical Imaging 35 (5) (2017).
15. Chollet, F. Deep Learning with Python. Manning Publications (2018).
16. Krizhevsky, A., Sutskever, I. & Hinton, G. In Advances in Neural Information Processing Systems Vol. 25 eds Periera F., Burges C., Bottou L. & Weinberger Q. (Curran Associates, 2012).

Data Citations

1. Danner, T., Li, L., & Eickholt, J. Open Science Framework https://doi.org/10.17605/OSF.IO/XY32D (2018)
2. McCann, E., Li, L., Pangle, K., Johnson, N., & Eickholt, J. FigShare https://doi.org/10.6084/m9.figshare.c.4039202 (2018)
3. McCann, E., Li, L., Pangle, K., Johnson, N., & Eickholt, J. Open Science Framework https://doi.org/10.17605/OSF.IO/SXEK6 (2018)

Acknowledgements

The authors would like to acknowledge Michigan Sea Grant, which supported Erin McCann during her time at Central Michigan University and the Great Lakes Fishery Commission for funding the collection of the DIDSON data. We thank Peter Hrodey (USFWS) and Samantha Nellis for helping to deploy and maintain the DIDSON unit during 2013. The authors would also like to thank Tyler Danner for his work on the SequenceViewer program. Any use of trade, product, or firm names is for descriptive purposes only and does not imply endorsement by the U.S. Government.

Author information

Erin McCann
Present address: Pacific Northwest National Laboratory, Richland, WA, 99352, USA.

Authors and Affiliations

Department of Biology, Central Michigan University, Mt. Pleasant, 48859, MI, USA
Erin McCann, Kevin Pangle & Jesse Eickholt
Department of Computer Science, Central Michigan University, Mt. Pleasant, 48859, MI, USA
Liling Li & Jesse Eickholt
Great Lakes Science Center, U.S. Geological Survey, Hammond Bay Biological Station, Millersburg, 49759, MI, USA
Nicholas Johnson

Contributions

E.M. and N.J. deployed the DIDSON cameras, and E.M. supervised the collection of data and generated clips for target fish species. L.L. and J.E. extracted the image data from the raw DIDSON data and curated the dataset. E.M., L.L. and J.E. drafted the manuscript. K.P. and N.J. conceived the design for data collection and all authors revised and approved the manuscript.

Corresponding authors

Correspondence to Nicholas Johnson or Jesse Eickholt.

Ethics declarations

Competing interests

The authors declare no competing interests.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/ The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files made available in this article.

About this article

Received: 28 March 2018
Accepted: 31 July 2018
Published: 09 October 2018
DOI: https://doi.org/10.1038/sdata.2018.190
