UNI-EM: An Environment for Deep Neural Network-Based Automated Segmentation of Neuronal Electron Microscopic Images

Urakubo, Hidetoshi; Bullmann, Torsten; Kubota, Yoshiyuki; Oba, Shigeyuki; Ishii, Shin

doi:10.1038/s41598-019-55431-0

Download PDF

Article
Open access
Published: 19 December 2019

UNI-EM: An Environment for Deep Neural Network-Based Automated Segmentation of Neuronal Electron Microscopic Images

Scientific Reports volume 9, Article number: 19413 (2019) Cite this article

4557 Accesses
22 Citations
2 Altmetric
Metrics details

Subjects

Abstract

Recently, there has been rapid expansion in the field of micro-connectomics, which targets the three-dimensional (3D) reconstruction of neuronal networks from stacks of two-dimensional (2D) electron microscopy (EM) images. The spatial scale of the 3D reconstruction increases rapidly owing to deep convolutional neural networks (CNNs) that enable automated image segmentation. Several research teams have developed their own software pipelines for CNN-based segmentation. However, the complexity of such pipelines makes their use difficult even for computer experts and impossible for non-experts. In this study, we developed a new software program, called UNI-EM, for 2D and 3D CNN-based segmentation. UNI-EM is a software collection for CNN-based EM image segmentation, including ground truth generation, training, inference, postprocessing, proofreading, and visualization. UNI-EM incorporates a set of 2D CNNs, i.e., U-Net, ResNet, HighwayNet, and DenseNet. We further wrapped flood-filling networks (FFNs) as a representative 3D CNN-based neuron segmentation algorithm. The 2D- and 3D-CNNs are known to demonstrate state-of-the-art level segmentation performance. We then provided two example workflows: mitochondria segmentation using a 2D CNN and neuron segmentation using FFNs. By following these example workflows, users can benefit from CNN-based segmentation without possessing knowledge of Python programming or CNN frameworks.

DeepACSON automated segmentation of white matter in 3D electron microscopy

Article Open access 10 February 2021

Modular segmentation, spatial analysis and visualization of volume electron microscopy datasets

Article 29 February 2024

Dense cellular segmentation for EM using 2D–3D neural network ensembles

Article Open access 28 January 2021

Introduction

In recent years, there has been a rapid expansion in the field of micro-connectomics, which targets the three-dimensional (3D) reconstruction of neuronal networks from stacks of two-dimensional (2D) electron microscopy (EM) images^1,2,3. Neuroscientists have successfully reconstructed large-scale neural circuits from species, such as mice⁴, fruit flies⁵, and zebrafish⁶. Such large-scale reconstructions require neuronal boundary detection (or neuron segmentation) of large numbers of EM images, and automation is critical even for smaller-scale segmentation.

For automated neuron segmentation, studies have validated the effectiveness of deep convolutional neural networks (CNNs)⁷. In particular, U-Net, which is a type of CNN, showed the highest accuracy in a neuron segmentation contest⁸, and similar CNNs also proved effective^9,10,11. Three-dimensional CNNs have also been developed for higher segmentation accuracy. Januszewski et al. developed a type of recursive 3D CNN called flood filling networks (FFNs)¹², which showed the highest segmentation accuracy in a public 3D EM dataset (FIB-25)¹³ and the second highest in another public 3D EM dataset (3D segmentation of neurites in EM images, SNEMI3D)¹⁴. Therefore, the use of such CNNs has become critical for accurate neuron segmentation.

Most CNN source codes are publicly available; however, it is not easy to perform segmentation even with these source codes. Users are required to prepare ground truth segmentation for their own EM images first and then to conduct preprocessing tasks, such as data conversion. The preprocessing and use of CNNs often require users to learn the underlying programming language, which is generally Python. After performing CNN-based segmentation, users need to conduct postprocessing, including proofreading, annotation, and visualization. In short, CNN-based neuron segmentation, although an important task, constitutes only a portion of the entire segmentation procedure.

Advanced connectomics laboratories have developed their own software pipelines to employ CNN-based segmentation, including Rhoana^15,16, Eyewire¹⁷, and the FFN segmentation pipeline⁵. The main objective of these pipelines is large-scale 3D reconstructions that are conducted by large teams including computer experts for setup and maintenance. They are too complicated to be used by smaller teams. EM segmentation is also handled by sophisticated standalone software packages, such as Reconstruct¹⁸, Ilastik¹⁹, Knossos²⁰, Microscopy Image Browser²¹, and VAST lite²². However, most software packages only target manual segmentation^18,20,22, and others currently do not support CNN-based segmentation^19,21. Recently, a plug-in for the widely used ImageJ software was developed to handle CNN-based segmentation²³. The use of this plug-in is advantageous; however, it currently provides only four types of U-Net models, and users need to launch a server on a Linux computer to train the U-Nets.

We therefore developed a unified environment for CNN-based automated segmentation of EM images (UNI-EM) for researchers with limited programming skills. UNI-EM implements several 2D CNNs^8,9,10,11 and 3D FFNs¹² on the widely used Tensorflow framework/Python²⁴. It also includes the proofreading software Dojo²⁵ as well as a series of 2D/3D filters for classic image processing. Those features enable users to follow the procedure of CNN-based segmentation, i.e., ground truth generation, training, inference, postprocessing, proofreading, and visualization. UNI-EM currently supports two major operating systems (OSs): Microsoft Windows 10 (64 bit) and Linux. We also provide Python installation-free versions of UNI-EM (Pyinstaller version). Thus, users do not need to install Python or any modules for CNN-based segmentation.

Results

Outline of software

UNI-EM is a software collection for CNN-based EM image segmentation that includes ground truth generation, training, inference, postprocessing, proofreading, and visualization (Fig. 1). UNI-EM is written in Python 3.6 and runs on Microsoft Windows 10 (64 bit) and Linux. We also built UNI-EM on the Python application bundler called Pyinstaller on Windows 10; thus, users can employ UNI-EM without installing the Python programming environment. CPU and GPU versions are available, and users can maximize the performance using the GPU version if the computer is equipped with an NVIDIA GPU card that has a NVIDIA compute capability over 3.5. The developed Python source code with an online manual is available at the public repository GitHub (https://github.com/urakubo/UNI-EM).

The main component of UNI-EM is a web-based proofreading software, Dojo (Fig. 1A)²⁵. Dojo provides a graphical user interface (GUI) for users to correct mis-segmentation arising from automated EM segmentation. We extended Dojo to have file import/export functions (png/tiff files), a more sophisticated GUI, and multiscale paint functions. With these extensions, users can employ Dojo not only for proofreading, but also for ground truth generation, both of which are important manual operation procedures for CNN-based segmentation. Dojo consists of a Python-based web/database server and an HTML5/JavaScript-based client interface. The server–client system allows multiple users to access it simultaneously through web browsers in an OS-independent manner. UNI-EM equips its own web browser called Chromium for the standalone use of Dojo with either a mouse or a stylus.

We also developed a new 3D annotator to visualize the proofread objects in a 3D space as well as to annotate these segmented objects (Fig. 1B). This annotator is a surface mesh-based 3D viewer with a table that shows segmented objects. Users can change the color and brightness of target objects and export the visualization results as png image files, as well as assign a name to each object and put marker points on the object surface. The results of these annotations can be exported as csv files for further analyses.

We then implemented a U-Net equipped with a GUI as a representative 2D CNN for EM-image segmentation⁸. U-Net has characteristic contracting and expansive convolution layers with skip connections, which showed the highest segmentation accuracy in the EM Segmentation Challenge in the International Symposium on Biomedical Imaging 2012 Conference (ISBI 2012) at the time of publication⁸. We similarly implemented ResNet⁹, Highway-Net¹⁰, and Dense-Net¹¹. All of the CNNs accept single-channel (gray-scale) or three-channel (RGB) images. Users can choose any combination of these CNNs, loss functions, training times, and data augmentation methods, through a command panel.

We further wrapped FFNs as a representative algorithm of 3D CNN-based neuron segmentation¹². FFNs are a recurrent CNN that infers a volume mask indicating whether target voxels belong to the centered object, and the inference program obtains an overall volume mask for each object using a flood filling algorithm. FFNs have outperformed many other algorithms in the segmentation accuracies of FIB-25¹³ and SNEMI3D¹⁴. Users can conduct a series of FFN processes, i.e., preprocessing, training, inference, and postprocessing, through a command panel.

The 2D CNNs and 3D FFNs were implemented on the Tensorflow framework²⁴. Its resource monitor Tensorboard can be conveniently accessed from UNI-EM, so users can easily check the status of a target CNN, such as the network topology and loss function. UNI-EM also has a GUI for 2D/3D classic image filters. Users can apply multiple image filters simultaneously to a stack of 2D images in a single execution. The target images of the CNNs and classic filters are opened/closed through a folder manager. Further, users can implement new CNN models through the “Plugin” dropdown menu. Details on how to implement a new CNN are outlined in the online manual (see Data availability).

Example workflows

In this section, we demonstrate how users can benefit from UNI-EM by introducing two example workflows. The first one is mitochondria segmentation using 2D CNNs, and the second one is neuron segmentation using 3D FFNs. In both cases, we targeted an EM image stack that was prepared for SNEMI3D²⁶. The target brain region is the mouse somatosensory cortex, and the EM images were obtained using scanning electron microscopy (SEM) in combination with an automatic tape-collecting ultra-microtome system (ATUM/SEM)¹⁴. The spatial resolution of the EM images was 6 nm per pixel (xy-plane) and 30 nm per Z slice, and the overall image volume was 6.1 × 6.1 × 3 μm. The images were passed through a contrast-limited adaptive histogram equalization filter (CLAHE; block size 127, histogram bins 256, max slope 1.50) before segmentation.

Case 1: Mitochondria segmentation using 2D CNN

Mitochondria are abundant where the metabolic demand is high, such as in synapses and active axons^27,28, and their detection and quantification are important for treating neuronal diseases²⁹. Because mitochondria possess characteristic oval shapes³⁰, their segmentation is a good target for 2D CNN-based segmentation³¹. However, it is not accessible to inexperienced users (Fig. 2A). Firstly, inexperienced users need to learn how to use Python, install a CNN framework, and download an implementation of the target CNN from a public repository. The other software packages need to be installed for ground truth generation, post-processing, and proofreading (Fig. 2A). These steps can be learned, but a major hurdle is the transfer of data, especially to a CNN, when the users must convert EM/segmentation images into HDF5 or npz format files. To confirm that UNI-EM decreases the arduousness of these tasks (Fig. 2B), two test users (H.K. and Y.F.) who were not skilled in Python programming were requested to perform the following procedure (Fig. 2C):

1.
Ground truth generation. The test users painted the mitochondrial regions of a single EM image using UNI-EM (Dojo). The generated ground truth was exported as an 8-bit grayscale PNG file (~20 min).
2.
Training. A 16-layer ResNet with a least-square loss function was trained using the ground truth (~10 min computation time).
3.
Inference. The trained ResNet was applied to test the EM images to obtain inferred 2D segmentation (~1 min).
4.
Postprocessing. The inferred 2D segmentation images were binarized, and then each isolated region in 3D space was labeled with a specific ID number (~10 min).
5.
Proofreading, annotation, and visualization. The test users proofread it with Dojo and visualized it with the 3D annotator (~30 min).

The test users successfully conducted the above procedure within the time indicated in parentheses and obtained the instance segmentation of mitochondria. The segmentation accuracy was sufficiently high without any proofreading (Fig. 2C, bottom and right panel; RAND score: 0.85; see Methods), as expected from published results on 2D CNN-based segmentation^31,32. The detailed instructions for the mitochondria segmentation task can be found at the public repository GitHub (see Data availability).

In the above process, we requested the test users to use a 16-layer ResNet with a least-square loss function for mitochondrial segmentation. This request was determined based on the following quantitative survey on the segmentation of mitochondria, synapses, and neurons (Fig. 3A). Here we utilized the RAND score as a measure of segmentation accuracy (see Methods). The larger RAND score denotes higher accuracy. We first confirmed that only one ground truth image was sufficient for the segmentation of mitochondria (Fig. 3B), and 10 ground truth images were sufficient for neurons and synaptic segmentations. We then confirmed that the square, dice, and logistic loss functions were appropriate for segmentation (Fig. 3C). All of the 2D CNN types showed high accuracy in mitochondria segmentation (Fig. 3D, green lines; >0.9 RAND score). In addition, U-Net was not appropriate for membrane segmentation (Fig. 3D, red line; ~0.3 RAND score), and the segmentation accuracies in synapses are not high regardless of the type of CNN (Fig. 3D; ~0.3 RAND score). The accuracy of mitochondria segmentation in a standard CNN (network topology: ResNet; loss function: least square; number of layers: 9; training epochs: 2000; number of training images: 5) was indeed comparable with the accuracy in a recent 3D CNN-based, state-of-the-art algorithm³². The segmentation accuracy of the 3D CNN was quantified as Jaccard 0.92, Dice 0.96, and conformity 0.91 (semantic segmentation; ATUM/SEM data), whereas that of our standard 2D CNN was quantified as Jaccard 0.91, Dice 0.95, conformity 0.90 (semantic segmentation). Here, the larger scores of Jaccard, Dice, and conformity indicate higher accuracy³². Their 3D CNN requires 77 h of training time on a NVIDIA K40 GPU, whereas our standard CNN required only 5 min on a NIVDIA GTX1070 GPU. In addition, the 3D CNN was trained using the 3D ground truth, which requires excessive and tedious manual labeling. Overall, the implemented 2D CNN-based segmentations showed a sufficiently high and competitive accuracy compared to the current state-of-the-art mitochondrial segmentation algorithm³².

Case 2: Neuron segmentation using 3D FFNs

We next asked a test user (N.Y.) to conduct neuron segmentation using 3D FFNs¹², which is a primary topic in micro-connectomics. Various 2D and 3D CNNs have been proposed for accurate neuron segmentation^33,34. FFNs currently show some of the highest segmentation accuracies in neuron segmentation¹², although they require laborious work to generate the 3D ground truth. Users can generate the 3D ground truth using Dojo, but we recommend VAST lite for this purpose²². In the present case, we used the ground truth included in the SNEMI3D dataset. The test user successfully conducted the following procedure through the command panel (Fig. 4A):

1.
Preprocessing. Stacks of target EM images and ground truth images were converted into FFN-specialized style files (~1 h computation time; Fig. 4B).
2.
Training. FFNs were trained with the preprocessed EM-image/segmentation files (~2 weeks computation time on a NIVDIA GTX1080Ti GPU; Fig. 4B).
3.
Inference. The trained FFNs were applied to a stack of test EM images for the inference of 3D segmentation (~1 h computation time on a NIVDIA GTX1080Ti GPU; Fig. 4B).
4.
Postprocessing. The output segmentation files were converted into a PNG file stack (~10 min computation time; Fig. 4B).
5.
Proofreading and visualization. The converted PNG files and EM images were imported into Dojo for proofreading as well as the 3D annotator for visualization (Fig. 4B).

Note that the trained FFNs directly inferred a 3D instance segmentation from a stack of 2D EM images. The FFNs gave a reasonably accurate neuron segmentation (Fig. 4B, right), whose RAND score was 0.84 (after 7 million training epochs; see Methods)¹². This score was obtained without any postprocessing and specific parameter turning for the SNEMI3D dataset, and the topological structure of the neurites was well preserved in the segmentation results. Januszewski et al. reported a RAND score of 0.975 in the case of the SNEMI3D dataset¹². This score was obtained with two additional processes: automated agglomeration of oversegmentation and a 2D watershed¹². Thus, there is room for further improvement. Although FFNs require long training time (~2 weeks), users can benefit from their precise inference, which drastically decreases the subsequent proofreading work.

System design

UNI-EM was developed under the Python development environment and Python bindings for v5 of the Qt application framework for GUI (PyQt5). The combination of Python and PyQt5 is typical for Python GUI desktop applications (e.g., Sommer et al.¹⁹), and UNI-EM utilizes this combination for GUI-equipped 2D CNNs and 3D FFNs (Fig. 5). The desktop application style is appropriate for CNN computing because CNN training/inference often occupies all of the GPU resources of a desktop computer, and the shared usage of a single GPU is ineffective. On the other hand, Dojo, the 3D annotator, and Tensorboard are web applications. The web application style provides remote accessibility to these applications; hence, multiple users can simultaneously use them (remote users in Fig. 5). Tensorboard enables the remote inspection of CNN training, Dojo enables multiple users to correct mis-segmentation simultaneously, and the 3D annotator enables multiuser annotation. Together, UNI-EM is comprised of desktop and web application systems, and this heterogeneity enables a wide range of applications from individual to shared use.

Discussion

We presented a software package called UNI-EM for CNN-based automated EM segmentation. UNI-EM unifies pieces of software for CNN-based segmentation. We validated its effectiveness using two example workflows: mitochondria segmentation using a 2D CNN and neuron segmentation using 3D FFNs. Test users who did not possess Python programming skills were able to perform the overall procedure successfully, and the resulting segmentation accuracies were comparable to those of state-of-the-art methods. Therefore, UNI-EM is a beneficial tool for researchers with limited programming skills.

In recent years, the popularity of CNNs in generic image segmentation as well as EM image segmentation has greatly increased⁷. Numerous CNN-based segmentation algorithms have been proposed, and their source codes are often released along with journal publication. However, it is difficult to use such CNN source code as doing so often requires knowledge of Python and a CNN framework. In such situations, UNI-EM provides an opportunity for researchers to examine the effectiveness of multiple CNNs based on their own EM images, without knowledge of Python. Based on the results, they can decide if they want to use these CNNs professionally for large-scale segmentation. UNI-EM therefore functions as a testing platform.

Two-dimensional CNN-based segmentation combined with subsequent Z-slice connection into 3D objects is effective if the target objects have simple shapes like that of mitochondria. In the example workflow, the test users successfully extracted the oval-shaped mitochondria within 2 h, and the segmentation accuracy was higher than those of conventional machine learning methods such as AdaBoost¹⁵. The proposed approach is also effective for neuron segmentation if the users can utilize high-performance Z slice connectors, such as rule-based connectors¹⁵, multicut algorithms³⁵, and the graph-based active learning of agglomeration³⁶. Incorporation of these connectors into UNI-EM is an important future direction because the current UNI-EM only provides 3D labeling and 3D watersheds to connect the 2D segments.

Many 3D CNNs have been proposed for highly accurate neuron segmentation^12,34,37,38. FFNs are one such 3D CNNs¹², but we have to acknowledge two remaining barriers from its common use. First, FFNs require a long training period over one week. Second, they require a certain amount of 3D ground truth segmentation. In our experience, two-week labor was required to manually draw 3D ground truth using a sophisticated paint tool²². FFNs are of course still an excellent selection if we consider the time for manual correction of mis-segmentation arising from other segmentation methods.

The proofreading software Dojo with extensions is one of the main components of UNI-EM²⁵. Similar to Dojo, numerous excellent proofreading and manual segmentation tools are available, e.g., Reconstruct¹⁸, Ilastik¹⁹, TrakEM2³⁹, VAST lite²², Knossos²⁰, webKnossos⁴⁰, Microscopy Image Browser²¹, CATMAID⁴¹, NeuTu⁴², and Neuroglancer⁴³. The primary advantage of Dojo is its web application architecture. A web application has numerous advantages; there is no need for the end users to install any software except for the web browser, OS independency, and cloud resource accessibility, and multiuser access is typically included. However, a distinct web/database server needs to be launched. To avoid this task, UNI-EM itself contains the backend web/database server of Dojo. Users can employ UNI-EM as both single-user and collaborative applications, without launching any distinct servers.

Almost all of UNI-EM programs are written in high-level interpreter languages, i.e., Python, JavaScript, HTML, and CSS, and only the matching cube mesh generator is currently written in a C++ compiler language. The interpreter languages generally have lesser abilities to manage CPU and memory resources and show reduced performance. On the other hand, CNN frameworks such as TensorFlow and PyTorch provide application programming interfaces on high-level languages, such as Python. Thus, users can easily incorporate new CNN models into UNI-EM. The instructions for extending UNI-EM are provided in an online manual (see Data availability).

Methods

RAND score

We utilized the foreground-restricted RAND score as a metric of segmentation performance⁷. The RAND score is defined as follows. Suppose p_ij is the joint probability that a target pixel belongs to object i of inferred segmentation and object j of ground truth segmentation (Σ_ij p_ij = 1). Subsequently, s_i = Σ_j p_ij is the marginal probability for the inferred segmentation, and t_j = Σ_i p_ij is the marginal probability for the ground truth segmentation. Subsequently, the RAND score, ${V}_{\alpha }^{{\rm{Rand}}}$, can be defined as follows:

$${V}_{\alpha }^{{\rm{Rand}}}=\frac{{\sum }_{ij}{p}_{ij}^{2}}{\alpha {\sum }_{k}{s}_{k}^{2}+(1-\alpha ){\sum }_{k}{t}_{k}^{2}},$$

where the RAND F-score α is set to be 0.5. The split score (α → 0) can be interpreted as the precision in the classification of pixel pairs as belonging to the same (positive class) or different objects (negative class). The merge score (α → 1) can be interpreted as recall. Generally, ${V}_{\alpha }^{{\rm{Rand}}}$ becomes equal to 1 if the segmentation is accurate. Note that, as utilized in a neuron segmentation contest⁷, the RAND scores of instance segmentation were obtained in the case of neuron segmentation in the 2D CNNs and FFNs (Figs. 2 and 4), i.e., isolated neurons were counted as independent objects. On the other hand, the RAND scores of semantic segmentation were obtained in the cases of synapses and mitochondria in the 2D CNNs (Fig. 2) to compare the scores with those in a 3D CNN³².

Data availability

The datasets generated and/or analyzed during the current study are available from the corresponding author upon reasonable request. UNI-EM is available at https://github.com/urakubo/UNI-EM.

References

Briggman, K. L. & Bock, D. D. Volume electron microscopy for neuronal circuit reconstruction. Curr. Opin. Neurobiol. 22, 154–161 (2012).
Article CAS Google Scholar
Helmstaedter, M. Cellular-resolution connectomics: challenges of dense neural circuit reconstruction. Nat. Methods 10, 501–507 (2013).
Article CAS Google Scholar
Morgan, J. L. & Lichtman, J. W. Why not connectomics? Nat. Methods 10, 494–500 (2013).
Article CAS Google Scholar
Lee, W. C. et al. Anatomy and function of an excitatory network in the visual cortex. Nature 532, 370–374 (2016).
Article ADS CAS Google Scholar
Li, P. H. et al. Automated reconstruction of a serial-section EM Drosophila brain with flood-filling networks and local realignment. bioRxiv, 605634 (2019).
Hildebrand, D. G. C. et al. Whole-brain serial-section electron microscopy in larval zebrafish. Nature 545, 345–349 (2017).
Article ADS CAS Google Scholar
Arganda-Carreras, I. et al. Crowdsourcing the creation of image segmentation algorithms for connectomics. Front. Neuroanat. 9, 142 (2015).
Article Google Scholar
Ronneberger, O., Fischer, P. & Brox, T. U-Net: Convolutional networks for biomedical image segmentation. In Medical Image Computing and Computer-Assisted Intervention—MICCAI 2015 9351, 234–241 (2015).
Article Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. Identity mappings in deep residual networks. European Conf. Comput. Vision 630–645 (2016).
Srivastava, R. K., Greff, K. & Schmidhuber, J. Highway networks. arXiv preprint, arXiv 1505, 00387 (2015).
Google Scholar
Huang, G., Liu, Z. & Weinberger, K. Q. Densely connected convolutional networks. arXiv preprint, arXiv:1608.06993 (2016).
Januszewski, M. et al. High-precision automated reconstruction of neurons with flood-filling networks. Nat. Methods 15, 605–610 (2018).
Article CAS Google Scholar
Takemura, S. Y. et al. Synaptic circuits and their variations within different columns in the visual system of Drosophila. Proc. Natl. Acad. Sci. USA 112, 13711–13716 (2015).
Article ADS CAS Google Scholar
Kasthuri, N. et al. Saturated reconstruction of a volume of neocortex. Cell 162, 648–661 (2015).
Article CAS Google Scholar
Kaynig, V. et al. Large-scale automatic reconstruction of neuronal processes from electron microscopy images. Med. Image Anal. 22, 77–88 (2015).
Article Google Scholar
Haehn, D. et al. Scalable interactive visualization for connectomics. Informatics 4 (2017).
Bae, J. A. et al. Digital museum of retinal ganglion cells with dense anatomy and physiology. Cell 173, 1293–1306 e1219 (2018).
Article CAS Google Scholar
Fiala, J. C. Reconstruct: A free editor for serial section microscopy. J. Microsc. 218, 52–61 (2005).
Article MathSciNet CAS Google Scholar
Sommer, C., Straehle, C., Kothe, U. & Hamprecht, F. A. Ilastik: Interactive learning and segmentation toolkit. 2011 8th IEEE International Symposium on Biomedical Imaging: From Nano to Macro, 230–233 (2011).
Helmstaedter, M., Briggman, K. L. & Denk, W. High-accuracy neurite reconstruction for high-throughput neuroanatomy. Nat. Neurosci. 14, 1081–1088 (2011).
Article CAS Google Scholar
Belevich, I., Joensuu, M., Kumar, D., Vihinen, H. & Jokitalo, E. Microscopy image browser: A platform for segmentation and analysis of multidimensional datasets. PLoS Biol. 14, e1002340 (2016).
Article Google Scholar
Berger, D. R., Seung, H. S. & Lichtman, J. W. VAST (Volume Annotation and Segmentation Tool): Efficient manual and semi-automatic labeling of large 3D image stacks. Front. Neural Circuits 12, 88 (2018).
Article Google Scholar
Falk, T. et al. U-Net: Deep learning for cell counting, detection, and morphometry. Nat. Methods 16, 67–70 (2019).
Article CAS Google Scholar
Abadi, M. et al. In Proceedings of the 12th USENIX conference on Operating Systems Design and Implementation 265–283 (USENIX Association, Savannah, GA, USA, 2016).
Haehn, D. et al. Design and evaluation of interactive proofreading tools for connectomics. Proceedings IEEE SciVis 20, 2466–2475 (2014).
Google Scholar
Arganda-Carreras, I., Seung, H. S., Vishwanathan, A. & Berger, D. R. SNEMI3D: 3D Segmentation of neurites in EM images, http://brainiac2.mit.edu/SNEMI3D/ (2013).
Saxton, W. M. & Hollenbeck, P. J. The axonal transport of mitochondria. J. Cell Sci. 125, 2095–2104 (2012).
Article CAS Google Scholar
Ohno, N. et al. Mitochondrial immobilization mediated by syntaphilin facilitates survival of demyelinated axons. Proc. Natl. Acad. Sci. USA 111, 9953–9958 (2014).
Article ADS CAS Google Scholar
Nunnari, J. & Suomalainen, A. Mitochondria: in sickness and in health. Cell 148, 1145–1159 (2012).
Article CAS Google Scholar
Frey, T. G. & Mannella, C. A. The internal structure of mitochondria. Trends Biochem. Sci. 25, 319–324 (2000).
Article CAS Google Scholar
Oztel, I., Yolcu, G., Ersoy, I., White, T. & Bunyak, F. Mitochondria segmentation in electron microscopy volumes using deep convolutional neural network. IEEE Int. C. Bioinform. 1195–1200 (2017).
Xiao, C. et al. Automatic mitochondria segmentation for EM data using a 3D supervised convolutional network. Front. Neuroanat. 12, 92 (2018).
Article CAS Google Scholar
Ciresan, D., Giusti, A., Gambardella, L. M. & Jurgen, S. Deep neural networks segment neuronal membranes in electron microscopy images. Adv. Neural Inf. Process. Syst. 25, 2843–2851 (2012).
Google Scholar
Lee, K., Zung, J., Li, P., Jain, V. & Seung, H. S. Superhuman accuracy on the SNEMI3D connectomics challenge. arXiv preprint, arXiv:1706.00120 (2017).
Beier, T. et al. Multicut brings automated neurite segmentation closer to human performance. Nat. Methods 14, 101–102 (2017).
Article ADS CAS Google Scholar
Nunez-Iglesias, J., Kennedy, R., Plaza, S. M., Chakraborty, A. & Katz, W. T. Graph-based active learning of agglomeration (GALA): A Python library to segment 2D and 3D neuroimages. Front. Neuroinform. 8, 34 (2014).
Article Google Scholar
Lee, K., Zlateski, A., Vishwanathan, A. & Seung, H. S. In Proceedings of the 28th International Conference on Neural Information Processing Systems - Volume 2 3573-3581 (MIT Press, Montreal, Canada, 2015).
Zeng, T., Wu, B. & Ji, S. DeepEM3D: Approaching human-level performance on 3D anisotropic EM image segmentation. Bioinformatics 33, 2555–2562 (2017).
Article CAS Google Scholar
Cardona, A. et al. TrakEM2 software for neural circuit reconstruction. PLoS One 7, e38011 (2012).
Article ADS CAS Google Scholar
Boergens, K. M. et al. WebKnossos: Efficient online 3D data annotation for connectomics. Nat. Methods 14, 691–694 (2017).
Article CAS Google Scholar
Saalfeld, S., Cardona, A., Hartenstein, V. & Tomancak, P. CATMAID: Collaborative annotation toolkit for massive amounts of image data. Bioinformatics 25, 1984–1986 (2009).
Article CAS Google Scholar
Zhao, T., Olbris, D. J., Yu, Y. & Plaza, S. M. NeuTu: Software for collaborative, large-scale, segmentation-based connectome reconstruction. Front. Neural Circuits 12, 101 (2018).
Article CAS Google Scholar
Neuroglancer: webGL-based viewer for volumetric data, https://github.com/google/neuroglancer (2016).

Download references

Acknowledgements

We would like to thank Ryoji Miyamoto, Noboru Yamaguchi, Hiroko Kita, and Yoshihisa Fujita for their technical assistance. This work was partly supported by AMED Brain/MINDS (JP19dm0207001 to SI and JP19dm0207084 to YK), JST CREST (JPMJCR1652), MEXT KAKENHI on Innovative Areas “Brain information dynamics underlying multi-area interconnectivity and parallel processing” (17H06310 to SI and 17H06311 to YK), RIKEN CBS on the Collaboration Research for Development of Techniques in Brain Science Database Field, and JSPS KAKENHI (17K00404 to HU and 19H03336 to YK).

Author information

Authors and Affiliations

Integrated Systems Biology Laboratory, Department of Systems Science, Graduate School of Informatics, Kyoto University, Yoshida-Honmachi 36-1, Sakyo-ku, Kyoto, 606-8501, Japan
Hidetoshi Urakubo, Torsten Bullmann, Shigeyuki Oba & Shin Ishii
Carl-Ludwig-Institute for Physiology, University Leipzig, Liebigstr. 27, 04103, Leipzig, Germany
Torsten Bullmann
Division of Cerebral Circuitry, National Institute for Physiological Sciences, 5-1 Myodaiji-Higashiyama, Okazaki, Aichi, 444-8787, Japan
Yoshiyuki Kubota
Department of Physiological Sciences, The Graduate University for Advanced Studies (SOKENDAI), 5-1 Myodaiji-Higashiyama, Okazaki, Aichi, 444-8787, Japan
Yoshiyuki Kubota

Authors

Hidetoshi Urakubo
View author publications
You can also search for this author in PubMed Google Scholar
Torsten Bullmann
View author publications
You can also search for this author in PubMed Google Scholar
Yoshiyuki Kubota
View author publications
You can also search for this author in PubMed Google Scholar
Shigeyuki Oba
View author publications
You can also search for this author in PubMed Google Scholar
Shin Ishii
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

H.U. and S.I. designed the study. H.U. designed the overall software architecture. T.B. wrote the 2D CNN-based segmentation programs. Y.K. tested the software package and provided an example workflow. H.U., T.B., Y.K., S.O. and S.I. wrote the manuscript.

Corresponding author

Correspondence to Hidetoshi Urakubo.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Urakubo, H., Bullmann, T., Kubota, Y. et al. UNI-EM: An Environment for Deep Neural Network-Based Automated Segmentation of Neuronal Electron Microscopic Images. Sci Rep 9, 19413 (2019). https://doi.org/10.1038/s41598-019-55431-0

Download citation

Received: 16 April 2019
Accepted: 28 November 2019
Published: 19 December 2019
DOI: https://doi.org/10.1038/s41598-019-55431-0

This article is cited by

Seg2Link: an efficient and versatile solution for semi-automatic cell segmentation in 3D image stacks
- Chentao Wen
- Mami Matsumoto
- Koutarou D. Kimura
Scientific Reports (2023)
Seasonal Arctic sea ice forecasting with probabilistic deep learning
- Tom R. Andersson
- J. Scott Hosking
- Emily Shuckburgh
Nature Communications (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.