Neuron type classification in rat brain based on integrative convolutional and tree-based recurrent neural networks

Zhang, Tielin; Zeng, Yi; Zhang, Yue; Zhang, Xinhe; Shi, Mengting; Tang, Likai; Zhang, Duzhen; Xu, Bo

doi:10.1038/s41598-021-86780-4

Download PDF

Article
Open access
Published: 31 March 2021

Neuron type classification in rat brain based on integrative convolutional and tree-based recurrent neural networks

Tielin Zhang¹,
Yi Zeng^1,2,3,
Yue Zhang⁴,
Xinhe Zhang¹,
Mengting Shi^1,2,
Likai Tang⁵,
Duzhen Zhang^1,2 &
…
Bo Xu^1,2,3

Scientific Reports volume 11, Article number: 7291 (2021) Cite this article

4096 Accesses
16 Citations
1 Altmetric
Metrics details

Subjects

Abstract

The study of cellular complexity in the nervous system based on anatomy has shown more practical and objective advantages in morphology than other perspectives on molecular, physiological, and evolutionary aspects. However, morphology-based neuron type classification in the whole rat brain is challenging, given the significant number of neuron types, limited reconstructed neuron samples, and diverse data formats. Here, we report that different types of deep neural network modules may well process different kinds of features and that the integration of these submodules will show power on the representation and classification of neuron types. For SWC-format data, which are compressed but unstructured, we construct a tree-based recurrent neural network (Tree-RNN) module. For 2D or 3D slice-format data, which are structured but with large volumes of pixels, we construct a convolutional neural network (CNN) module. We also generate a virtually simulated dataset with two classes, reconstruct a CASIA rat-neuron dataset with 2.6 million neurons without labels, and select the NeuroMorpho-rat dataset with 35,000 neurons containing hierarchical labels. In the twelve-class classification task, the proposed model achieves state-of-the-art performance compared with other models, e.g., the CNN, RNN, and support vector machine based on hand-designed features.

Learning cellular morphology with neural networks

Article Open access 21 June 2019

Machine learning-based classification of mitochondrial morphology in primary neurons and brain

Article Open access 04 March 2021

DeNeRD: high-throughput detection of neurons for brain-wide analysis with deep learning

Article Open access 25 September 2019

Introduction

It has been more than 100 years since Santiago Ramón y Cajal first proposed the neuron doctrine by staining sparse cell populations in 1906. After that, more research came and joined the ranks, which have promoted this research area's development. Currently, we have more elaborate and efficient equipment for a better understanding of the neuron doctrine from different scales, disciplines, and perspectives.

Like Mendeleev in 1869, who developed the periodic table, many researchers have made efforts to conduct neuron type classification and multi granularity neural identity measurements from different brain regions¹. Many improvements have been achieved in neural classification based on the molecules, electrophysiology transcriptome, genome, biophysics, and morphology, to understand biological structures and functions better systematically and reproducibly. However, there is currently no broadly accepted and generally agreed-upon approach for neuron type classification until now.

For a finer neuron categorization, one smart way is to incorporate the features from different perspectives as much as possible. For example, Zeng et al. took advantage of quantitative features (e.g., structural, functional, and molecular criteria) measured from the same tissue^2,3. However, limited by experimental procedures or experimental equipment, for most cases, only one or two specific criteria can be obtained from them. From the transcriptomics perspective, 23,822 cells were categorized into 133 cell types with the single-cell RNA sequencing method in the primary visual cortex and the anterior lateral motor cortex⁴. From the molecular criteria perspective, the relationships between molecular cell types and the morphological, physiological, and behavioral correlations measured by spatially resolved transcriptomic methods have been well identified⁵. In addition, the developmental and evolutionary definition of a cell type based on the molecular basis is also conducted within and between different species⁶. From the genome's perspective, the Ivy Glioblastoma Atlas is constructed based on different features of genomic alterations and gene expression patterns⁷. From the perspective of electrophysiology, the Allen Cell Types Database has generated 170 individual biophysically detailed neuron models and identified the distinctiveness of the intrinsic properties between subsets of these cells in the cerebral cortex⁸.

The prevailing schemes of neuron-type nomenclature contain the morphology definition, anatomical definition, molecular definition, physiological definition, evolutionary definition⁶, and developmental definition (multiple species comparison or lineage tracing on a larger time scale an animal). Hence, neuron types could be glutamatergic neurons, GABAergic neurons, parvalbumin-expressing neurons, somatostatin-expressing neurons, or glial cells. The study of cellular complexity in the nervous system based on anatomy shows more practical and objective advantages in morphology than other perspectives, such as the molecular, physiological, and evolutionary aspects. New progress in neuron staining, slicing, and reconstruction methods in mouse and rat brains^9,10,11 have also contributed to neuronal morphology identification.

NeuroMorpho.org is one of the largest public neuronal morphology databases of digitally reconstructed neurons by species, brain regions, and cell types¹² and currently contains 721 cell types, 112,244 neurons, and 317 brain regions. Hippocampome.org is another morphology dataset with detailed axonal and dendritic patterns for the mouse hippocampus, containing 122 neuron types categorized by integrating morphology and neurotransmitters, synaptic specificity, electrophysiology, and molecular biomarkers¹³. The 3D morphologies and somatic electrophysiological responses combine for 170 individual neuron identifications in the primary visual cortex⁸. Anatomical methods have also been well integrated with genomic methods for validating the neuronal cell types and their connections in multiple species in the BRAIN Initiative Cell Census Consortium¹⁴.

However, the exact class numbers of rat neurons from taxonomy are still unknown¹⁵. This dilemma possibly comes from two main reasons: one is that the definition of different types of neurons is still not clear; the other is that one specific neuron type usually contains many diverse subtypes⁴, making their identification difficult, especially for some inhibitory neurons.

Traditional methods depend mainly on specific hand-designed features (e.g., the size of the soma, the number of branches, the angle of bifurcation, the branch level, the branching order, the neuron length, the dendritic and axonal shapes, and branching patterns, and the spine density), which are labor-intensive and possibly failed for some neurons with very complex dendritic or axonal arbors. Hence, the neuron-type classification automatically is becoming more necessary and urgent for the further analysis of multiscale neural circuitry and the functional simulation of biological networks.

The morphologies of neurons are usually described in two types of neuronal formats: one type is the SWC-format file; the other is the 2D- or 3D-image format, as shown in Fig. 1a. However, the SWC-format file is low dimensional and unstructured (e.g., compressed as the structure of the non-strict binary tree, and invariant with exchanging neighborhood structures under the same bifurcate; moreover, the node list length is closely connected with the sampling rate). The 3D-image format is structured but with too high information dimensions (e.g., fewer samples than the complexity of the morphology, similar local features, invariance to translation rotation, and large scale of diameters). This paper focuses on feature detection from different data formats and tries to find some practical solutions for neuron type classification by morphology.

Deep neural networks (DNNs) are powerful in both structured and unstructured information processing. They possess many different kinds of subtypes, for example, the feed-forward type convolutional neural network (CNN) or the loop-based recurrent neural network (RNN). These different types of architectures make DNNs powerful for both spatial and temporal information processing. For example, DNNs have been successfully applied to spatial information abstraction tasks (e.g., ImageNet classification^16,17,18,19), sequential information prediction tasks (e.g., video-to-text transformation^20,21), long-term planning tasks (e.g., go games²² and Atari 2600 games²³), and so on. The development of neuroscience and machine learning has also contributed to the development of automatic neuron type classification and categorization¹, including expert knowledge²⁴, the Bayesian classifier²⁵, Bayesian clustering¹⁵, K-means clustering²⁶, affinity propagation clustering²⁷, harmonic co-clustering with the L-measure²⁸ and hierarchical clustering²⁹.

In this paper, we obtain inspiration from the morphology of neurons, which are more similar to tree-type structures, and then construct an integrative DNN model that contains a Tree-RNN³⁰ and ResNet CNN for a better neuron-type classification. We design a Tree-RNN for feature learning from the unstructured SWC-format file data and construct a ResNet CNN for feature detection from structured 2D neuron images. We combine these two models, i.e., integrating the high-level features (at the last full-connection layers for each model) together by concatenating two feature vectors together as a longer vector then classified it with a new three-layer neural network. The model is then verified on the virtually simulated dataset (with generated two-class neuron data, relatively simple), CASIA rat-brain dataset (with 2.6 million reconstructed neurons without labels), and standard NeuroMorpho-rat dataset (with 35,000 labeled neurons with two main classes and twelve subclasses downloaded from the NeuroMorpho website, relatively hard).

Results

Virtually simulated dataset

Usually, it is hard to identify which reason has caused the lousy performance of classification when both the complexity of datasets and the complexity of methodologies are not measured. Here, we try to lock the dataset's complexity first by generating some virtual simulated samples first and then testing them with the proposed classification methods. By making this effort, we can quickly filter out some designed but relatively weak classifiers.

Z-jump in the SWC-format file is a commonly occurred problem, especially for some samples without well tracing, where the values of z axes of some feature points jumped (Fig. 1b, left). Some post-processing methods such as smoothing or filtering can handle this type of problem at some scale. For example, a demo of automatic repair of the Z-jump problem (as shown in Fig. 1b, right) is given with the “zcorr” function from Trees toolbox²⁴. Then, point resampling is performed on the SWC-format files to obtain a better point distribution. This process includes diameter alignment and morphology smoothing. Finally, we use the Trees toolbox²⁴ for the virtual simulated sample generation. For example, the 2-class virtual simulated dataset is generated with different neuron properties. The 2D images, which can be considered the projection of raw 3D images, are also generated with different hyperparameters. In order to keep the training samples balanced, we also use this Trees toolbox to expand the dataset (including both SWC files and images) by generating similar samples (with little pruning and addition of samples in the target class, 90%) and carry out the transformation of images by rotating, zooming in or zooming out.

The virtually simulated dataset contains two basic types of neurons: the basket cell as shown in Fig. 1c and the pyramidal cell as shown in Fig. 1d. The source code is available at https://github.com/thomasaimondy/ treestoolbox/tree/master/casia. The two types of neurons have 500 samples for each class, with both the SWC-format and 2D-image-format data converted by the Trees toolbox.

NeuroMorpho-rat dataset

Several (approximately 20% by calculation) of the neuronal morphology files at NeuroMorpho.org present the Z-jump problem. The Trees toolbox has a “zcorr” function designed for SWC file repairment. Figure 2 shows neurons after being repaired and processed by the Houdini software based on different hyperparameters (e.g., “zcorr = 10”). The data contains two main classes (principle cell and interneuron cell) and 12 subclasses, including six types of principal cells (e.g., ganglion, granule, medium spiny, parachromaffin, Purkinje, and pyramidal cells) in Fig. 2a, three types of interneuron neurons (e.g., basket, GABAergic, and nitrergic) in Fig. 2b, two types of glial cells (e.g., microglia and astrocytes) in Fig. 2c and one type of sensory receptor cell in Fig. 2d. The number of selected rat samples (e.g., the rat neurons) with labels is 35,000.

CASIA rat-neuron dataset

Most of the state-of-the-art single-cell reconstruction algorithms^10,11,31 use the tracing-based method for the identification of neuron structures. However, it is time-consuming and difficult to achieve a full-brain-size reconstruction atlas in this way. The raw slice data in the CASIA rat-neuron dataset are from the lab of Qingming Luo⁹. Here, we reconstruct neurons with classification methods, which classifies (or separates) all of the biological structures from the background. The activation spreading method is then used for the single-cell morphology identification (e.g., SWC-format file). This method is efficient and can obtain 2.6 million neurons in a much shorter time, even though some connections with much longer distances are missing. The detailed introduction of biological tissue recognition, reconstruction, and segmentation will be further described in our next paper.

We use most of the well-preserved relatively local information (e.g., soma and the locally connected synapse located within 500 µm) as the essential information for neuron type classification. However, this dataset does not contain neuron labels. This paper uses it as the pretraining dataset before neuron classification of the labeled data (e.g., NeuroMorpho-rat dataset) in the CNN-based and RNN-based models.

The pretraining is commonly used in many DNN architectures. First, two DNN architectures with one ordinary and one mirror network are designed. Then, the two networks are connected for the information encoding and decoding. As an input image X, for example, the network encoded it into a Y with smaller dimensions and then decoded it back into X (represented as $\widehat{X}$), where the loss of network is designed as the minimal square error of X and $\widehat{X}$. We reconstruct both the CASIA rat-neuron dataset and the virtually simulated dataset, and the download link is https://github.com/thomasaimondy/neuromorpho_neuron_type_classification.

Figure 3 shows the reconstruction procedure of the neuronal morphology. Figure 3a shows the reconstructed structures include somas and synapses, presented in gray, where each blue point represents the center position of soma, and the number nearby is the index of the soma. Both somas and synapses have a corresponding 3D position (e.g., X, Y, Z positions) and 3D size (e.g., the soma's radius in different directions). Figure 3b describes the reconstructed somas clustered based on the soma positions and their neighborhood pixels. Figure 3c shows the reconstruction and separation of each neuron in the area of the prefrontal cortex. The region size is 1300 × 1000 × 100 pixels and contains 100 neurons of different types. Some standard pyramidal cells and basket cells with morphological characteristics can be easily identified. Figure 3d shows some samples of the reconstructed CASIA rat neurons randomly selected from 2.6 million cells in a single rat brain. The morphologies of the neurons vary greatly, which shows the great challenge of their classification. Figure 3e shows the reconstructed whole rat brain, in which different colors represent different densities of the number of neurons.

Neuron type classification

We tested the different classification models on three datasets: the virtually simulated dataset, NeuroMorpho-rat dataset, and CASIA rat-neuron dataset. Each sample in each dataset contains both three 2D images and a matched SWC-format file. The hand-designed features have also been calculated for these samples. There is a total of 31,501 raw images and 34,296 SWC-format files with labels. The sample ratio of training, validation, and test samples is 8:1:1. For the better training of DNNs, we expanded only the training samples, including 37,510 images (by translation, rotation, and scaling) and 40,196 SWC-format files (by adding or deleting some branches with Trees toolbox) in the 2-class task; and also 98,700 images and 99,996 SWC-format files in the 12-class task.

We tested the CNN model on 2D image samples and tested the Tree-RNN and standard RNN on the SWC-format dataset. The features in the hidden layers of the CNN and RNN are considered stable features from different data sources, and they are then integrated (by connecting two feature vectors) for the classification of the DNN model. The hand-designed features are classified with the support vector machine (SVM) model, which serves as the comparison experiment. The loss of SVM is L2-norm, and the stop-learning error is 1e-3.

For the virtually simulated dataset, basket and pyramidal cells are selected out with obvious morphology differences as the two classes of simple samples. The proposed models show 100% test classification accuracy on the virtually simulated dataset with two classes, including the proposed CNN, RNN, and standard SVM models.

For the NeuroMorpho-rat dataset, during the learning procedure in the DNN for the 12-class neuron type classification, the test accuracy is calculated as in Fig. 4a. The x-axis is the number of iterations. Here, we set this number to 100 for simplicity. The model in this figure integrates both the CNN and RNN features well for better classification performance. As the training time goes by, the test accuracy increases quickly, and the curve finally stops at approximately 90% accuracy.

The loss cost is another indicator of the learning procedure of the DNN model. As shown in Fig. 4b, the loss curve will decrease as the number of learning iterations increases, and at approximately 40 iterations, it obtains the smallest test loss; however, this is not the global minimal tuning point, and the test accuracy is still low. Hence, the network will continue training until it stops at the maximum number of iterations (100) with the dropout principle for anti-overfitting. All of the algorithms are run in a standard desktop computer with 4 GPUs (Tesla K40). The computation time for SVM is around 10 min; for RNN is around 2–6 h; for CNN is around 6 h; for Tree-RNN, it is around 24 h.

Figure 4c shows the IDs and the selected 12-type neuron names in the NeuroMorpho-rat dataset. As shown in Fig. 4d, in a two-class classification task, in which the two neuron types are principle cells and intern cells, for the single classification module, the RNN-based method obtains the best performances (90.24% accuracy for the Tree-RNN, and 86.11% and 86.63% accuracies for the standard RNN on raw SWC and fixed SWC data, respectively). Compared with them, the CNN methods obtain accuracies of approximately 84.04% to 87.87% based on the different qualities of the preprocessed image datasets (e.g., the raw SWC file, the resampled SWC files, and the XY-aligned SWC files). The SVM method for the classification of the hand-designed features achieves the lowest accuracy (84.78%). Then, we integrate the feature vectors of the CNN and RNN and obtain a 90.57% accuracy (with the “Multi” label in the figure).

There are six types of principal cells in the twelve-class classification task, three types of interneuron neurons, 2 types of glial cells, and 1 type of sensory receptor cell. Here, we can obtain a similar conclusion as for the two classes. For the single classification module, the best performance is that of the Tree-RNN on the SWC-format files, with a 90.33% accuracy; the CNN-based methods are approximately 83.24% to 86% accurate, and the SVM method is 81.64% accurate. Compared with these results, the proposed integrative method reaches a 91.90% accuracy, which is the highest and state-of-the-art performance on the NeuroMorpho-rat dataset.

Figure 4e shows the classification matrix for the CNN model on the 12-class neuron images. Each value in the matrix represents the number of target-classified labels ($N$) after the conversion of $log_{10} N$. The results show that some of the neuron types are more highly correlated; for example, the neuron morphology in class “1” is closer to that in classes “4”, “6”, “9” and “11”, and class “4” is highly connected to class “6”. Hence, misclassification usually occurs in these specific neuron types. This result shows that the different classes of neurons may share similar features, which shows the characteristics of the data's inner class and provides inspiration for better neuron-type assignments.

t-SNE analysis on DNN-based features

DNN features for most of the neurons have hundreds of dimensions, for example, 512 for CNN and 128 for Tree-RNN, even the hand-designed features have 19 dimensions, which is much higher than that of the human beings might well perceive and understand. Here, we select t-SNE³² for the in-depth analysis of the distributions of high-dimensional features. The t-SNE can decrease the dimensions of the features to two or three dimensions, by which humans might easily obtain a better understanding.

Figure 4f shows the t-SNE analysis resulted from the 2-class DNN-based features from both the 2D-images and SWC-format files from the NeuroMorpho-rat dataset. There are two classes: “0” is for the principal cell, and “1” is for the interneuron cell. The initial data vector before t-SNE is 640 dimensions (with the integration of 512 from the CNN and 128 from the RNN). In this figure, the different cell types are separate, and only a few interneurons are miscategorized as principal cells. This result shows that CNN is powerful in feature detection. Figure 4g shows the t-SNE distribution of 35,000 rat neurons in 12 classes. Different classes of neurons can be categorized into different groups of samples. This means that the categorization of neuron types fits the characteristics of features learned from both the CNN and RNN.

Analysis of hand-designed features

It is a long history for the hand-designed neuronal features that might contribute to the neuron-type classification to some scale. However, even the neurons in the same class are largely diverse, making it a hard problem for identifying neuron types given some limited features. Table 1 shows a collection of hand-designed neuronal features defined by the previous studies, including 19 features from morphology. For example, it contains “TotalLength” that indicates the summation of the length of all parts of neuronal structures (e.g., length of soma, dendrites, and axons), “MaxBranchOrder” introduces the maximum number of branch orders from the soma to terminals (axons or dendrites), and “SomaSurface” shows the total surface area of the soma.

Table 1 The hand-designed features related to the neuron morphology after determining the mathematical statistics.

Full size table

SVM is one of the most commonly used pattern-recognition algorithms. Here, we select it as the baseline classifier to further analyze these hand-designed features in Table 1. Not all of the features contribute to identifying the selected 12 neuron types with the SVM classifier. This phenomenon is not surprised for the limited numbers of features and neuron types.

Further analysis is given in Fig. 5a, where contributions of different hand-designed features for the neuron-type classification are tested. From the figure, with the SVM classifier, we find that some features positively contribute to the classification performance, for example, the average diameter and overall width. In contrast, some others harm the accuracy, for example, the number of branches, the total surface, and the max branch order. This result indicates that some definitions of neuronal features by humans might be ineffective for these 12-type neurons. Additionally, the distributions of features in Fig. 5, to some extent, show the importance of different features.

Figure 5d shows the total-length feature, in which the minimum length in it is tens of micrometers, and the maximum length is 36,000 µm. The y-axis is the count of the neurons at different length scales. The curve's distribution is the power-law distribution, which shows that more of the neurons have local-type connections instead of long-term-type connections.

Figure 5e shows the max path distance feature, and the range of the size is from 0 to 10,000. Most neurons are in two ranges: from 0 to 100 µm and from 200 to 300 µm. The count number of the max path distance exhibits linear attenuation with the distance.

Figure 5f shows the number of branches feature, in which the count number of neurons dramatically decreases with the increase of the number of branches. The total number of branches ranges from 0 to 1,700, which means that the maximum number of branches in a neuron can reach 1,700, i.e., a large number for most traditional neurons. However, most neurons stay in a range of branch orders from 0 to 300, following a power-law-like distribution.

Figure 5g shows the max branch order, which is another positive hand-designed feature for neuron type classification. However, Fig. 5h shows a negative feature, i.e., the soma surface, ranging from 0 to 10,000. The soma surface contributes little to the neuron classification instead of well accepted as an important neuron type feature. It also follows a normal-like distribution, very different from the power-law distribution of other features. Further analysis of what kind of feature distribution will contribute to the classification is a next-step candidate key research point.

t-SNE analysis of hand-designed features

Figures 5b and c show the t-SNE analysis results for the hand-designed features with a 19-dimensional vector on two classes and 12 classes, respectively. The hand-designed features are mostly from anatomical experiments or the simple statistics of neuron characteristics, which may not be the best indicator for neuron types. The comparisons between DNN-based features (Figs. 4f and g) and hand-designed features (Figs. 5b and c) show that the DNN-based features have better clustering performance. Hence, they will be better for the neuron type indicator in neuron type classification and categorization.

Discussion

Designing the proper morphological features for neuron categorization and classifying them are the two main challenges. Some traditional methods use hand-designed features and shallow network models for the neuron type classification. However, the morphologies of neurons are complicated and are usually described as raw 3D images with billions of voxels or SWC-format files with unstructured data lengths. These data characteristics cause the traditional machine learning methods to fail. Different types of DNN-based efforts have solved these problems to some scale, especially in brain-tissue segmentation³³, tracing³⁴, and classification³⁵.

This paper, similar but different from other DNN-based algorithms, proposed an integrative deep learning model for better neuron-feature generation and neuron-type classification. The model contains a CNN module for feature detection in structured 2D images via the axes' projection from 3D raw images. We also self-designed a proper tree-based RNN for unstructured feature learning from SWC-format files.

Compared with the traditional hand-designed neuron features (e.g., the size of the soma and the number of branches), the DNN-based features have a more extended dimensional representation. For example, they contain a 512-dimensional CNN feature vector and a 128-dimensional Tree-RNN feature vector. Moreover, the DNN-based feature has better clustering characteristics in the t-SNE distribution. These neuron morphology representations may contribute to a finer neuron categorization and classification for the whole rat brain. The CNN model is pre-trained on the CASIA-rat dataset and then integrates with Tree-RNN to retrain and test virtual simulated and NeuroMorpho-rat datasets. The integrated model achieves the best performance compared with traditional SVM, CNN, and RNN models on the standard NeuroMorpho-rat dataset.

The structural and functional identifications of different neural types and related network topologies are important for the next-step research on biology-inspired artificial intelligence. Yin et al. have analyzed degree centrality, closeness centrality, and betweenness centrality of conventional and clustered networks to better understand the biological efforts to optimize the network information transfer³⁶. Hence, deeper integration of these multi-scale biologically-plausible inspirations with artificial or spiking neural networks is necessary towards brain-inspired artificial intelligence.

Methods

The overall processing procedure is shown in Fig. 6a, which is designed to answer the following three basic questions related to neuron classification and categorization:

Which are the most proper models for different neural data formats? The neuron morphology community usually uses tree-like SWC-format data and raw 3D pixel data for the morphology description. However, SWC-format data have variable lengths related to the complexity of structures, and 3D data may contain millions of pixels, which is a big data problem for the next-step classification. Based on these two challenges, we select the RNN and Tree-RNN to classify variable-length SWC-format data and select the ResNet CNN to classify 2D images (converted from 3D raw images by the projection of the X, Y, and Z axes), as shown in the models for learning features in Fig. 6a.

What level of granularity of a cell type definition is necessary? Different scales of neuron classification are considered in the classification in Fig. 6a. For the large scale, two neuron types are classified, including principle cells and intern cells. For the small scale, 12 sub-neuron types are classified, including six types of principal cells (e.g., ganglion, granule, medium spiny, parachromaffin, Purkinje, and pyramidal cells), three types of interneurons (e.g., basket, GABAergic, and nitrergic cells), two types of glial cells (e.g., microglia and astrocyte cells) and 1 type of sensory receptor cell.

What are the fundamental features guiding the categorization of various cell types? Usually, people use hand-designed features after a careful selection, for example, the size of the soma, the number of branches, the angle of bifurcation, the branch level, the branching order, the neuron length, the dendritic shapes, or the spine density. However, these features are too simple compared with the far more complex neuron types. The DNN-based feature learned from both SWC-format and image-format data may give us more hints about the proper definition of features, as shown in the categorization in Fig. 6a.

ResNet CNN for the 2D-morphology image classification

As shown in Fig. 6b (a more detailed version is shown in Appendix Fig. 1), for the classification of neuron images, we construct a ResNet (residual neural network)³⁷. The residual blocks in the ResNet (i.e., the dotted box in Fig. 6d) are carefully designed for the vanishing gradient problem. Compared with traditional DNNs, the ResNet protects the integrity of information and avoids gradient disappearance and gradient explosion, which makes it more powerful for 2D image classification. In addition, taking the complicated structural features of neuron morphology into consideration, we set the layer number of the ResNet to 18 and the learning rate to $10^{ - 5}$ for previous layers and $10^{ - 4}$ for the last fully connected layers. The remaining computational units are similar to those of the traditional CNN, and the detailed parameters of the CNN can be found in Table 2.

Table 2 The hyperparameters for the CNN, RNN, and Tree-RNN.

Full size table

Tree-RNN for unstructured SWC-format data classification

SWC-format files (i.e., the initials of the last names of E.W. Stockley, H.V. Wheal, and H.M. Cole³⁸) present the tree structure and describe the order of root nodes or branch nodes of neurons. They are non-structured and difficult to analyze further by traditional machine learning methods. In addition, the data in an SWC-format file also usually contain significant morphological errors, e.g., false mergers and false splits, which make the neuron type classification difficult.

Long short-term memory (LSTM)³⁹ networks have gate mechanisms and are more powerful for variable spatial or temporal information processing. Here, we select both LSTM and traditional RNN units as the basic RNN module for morphology's unstructured feature learning. As shown in Fig. 6e, we use standard 2-layer LSTM³⁹ and RNN modules (and LSTM) to classify unstructured SWC-format data.

To achieve a better classification, we design a convolutional layer for data dimension reduction and then integrate it into the recurrent module with the full connection. The number of hidden units of the RNN and LSTM is 128 for both of them. The output layer has 2 or 12 classes according to the experiments.

Furthermore, the neuron morphology in an SWC-format file describes the tree-type architectures of the neuron structures; hence, the “tree-like structure” will be valuable in the data organization of the SWC-format file. We then hand-design the “tree-structure-based RNN” (Tree-RNN), as shown in Fig. 6c, which connects different RNN modules by a basic tree structure. Each RNN module has a 2-layer LSTM (or RNN) and consists of two hidden layers (each with 128 neurons). The features in the Tree-RNN before the final fully-connected layers can also be combined with feature vectors in the CNN for integrative feature representation and classification (e.g., the “Multi” model). Detailed parameters of the RNN and Tree-RNN are given in Table 2. In addition, we also give further performance comparisons between different DNN models, as shown in Table 3, where accuracy and the area under the curve are tested^30,40,41,42.

Table 3 The performance of neuron-type classification with different DNN architectures, including accuracy and the area under the curve (AUC).

Full size table

The hand-designed features

The hand-designed features include both the features related to morphology (e.g., the length, surface size, width, depth, or the number of branches) and the features obtained after completing the mathematical calculations (e.g., the fractal dimension, the calculated surface area of the soma, and average bifurcation angles at remote or local scales), as shown in Table 1. The samples' labels cover different scales, from neuron types and brain regions to species and developmental characteristics. These features will be used for the feature-based SVM classification and the comparison analysis with DNN-based features in categorization.

Trees toolbox for preprocessing SWC-format files

We use the Trees toolbox for neuron morphology generation, conversion, feature calculation, and the repair of SWC-format files²⁴.

$$ \begin{array}{*{20}c} {\delta z = z_{min} } & {if(\delta z > Z_{th} )} \\ \end{array} $$

(1)

As shown in Eq. (1), ${\updelta }_{z}$ is the difference between the pre- and post-nodes at the Z-axis of an SWC-format file. $Z_{th}$ is the predefined hyperparameter (e.g., shown in Fig. 1b). Here, we set $z_{min}$ to zero.

t-SNE configuration for feature analysis

The t-SNE (t-distributed stochastic neighbor embedding) is a nonlinear dimension-reduction algorithm for mining high-dimensional data³². Compared with the traditional linear dimension-reduction algorithm, i.e., PCA, which cannot explain complex polynomial relations between features and performs poorly when they focus on different data points in lower-dimensional regions, t-SNE preserves both the local and global structure of the data and is a very suitable dimension-reduction algorithm for the process of feature clustering.

$$ p_{j/i} = \frac{{exp( - ||x_{i}^{640d} - x_{j}^{640d} ||^{2} /2\sigma_{i}^{2} )}}{{\sum\limits_{k \ne i} e xp( - ||x_{i}^{640d} - x_{k}^{640d} ||^{2} /2\sigma_{i}^{2} )}} $$

(2)

For the DNN-based features, we use two feature points $x_{i}^{640d}$ and $x_{j}^{640d}$ (both of which have 640 dimensions) in the high-dimensional space and calculate the Gaussian distribution centered on $x_{i}^{640d}$ by $p_{ij} = \frac{{p_{j/i} + p_{i/j} }}{2n}$. $\sigma_{i}$ represents the variance and $n$ is the number of candidate points.

When mapping the data to low-dimensional space, we select the 2D space. We still need to reflect the similarity between high-dimensional and low-dimensional data points in the form of conditional probability $q_{ij}$.

$$ q_{ij} = \frac{{\left( {1 + ||y_{i}^{2d} - y_{j}^{2d} ||^{2} } \right)^{ - 1} }}{{\sum\limits_{k \ne l} {\left( {1 + ||y_{k}^{2d} - y_{l}^{2d} ||^{2} } \right)^{ - 1} } }} $$

(3)

We use the conditional probability distribution $P_{i}$ (in the low-dimensional space) and $Q_{i}$ (high-dimensional space) with the conditional probability between every two points to measure the similarity (Kullback–Leibler divergence) between these two distributions (as shown in Eq. (4)).

$$ C = KL(P||Q) = \sum\limits_{i} {\sum\limits_{j} {p_{ij} } } log\frac{{p_{ij} }}{{q_{ij} }} $$

(4)

The gradient is calculated by:

$$ \frac{\partial C}{{\partial y_{i}^{2d} }} = 4\sum\limits_{j} {\left( {p_{ij} - q_{ij} } \right)} \left( {y_{i}^{2d} - y_{j}^{2d} } \right)\left( {1 + \left\| {y_{i}^{2d} - y_{j}^{2d} } \right\|^{2} } \right)^{ - 1} $$

(5)

Then, the stochastic gradient descent algorithm is trained, and the best 2D clustering samples are obtained, which could be considered the best 2D information representation for the original 640D distribution.

Data availability

The source code, three types of datasets (for the raw data of neuromorpho-rat, please downloading it directly from neuromorpho.org), and the trained DNN-based models are available https://github.com/thomasaimondy/neuromorpho_neuron_type_classification.

References

Armañanzas, R. & Ascoli, G. A. Towards the automatic classification of neurons. Trends Neurosci. 38, 307–318 (2015).
Article PubMed PubMed Central Google Scholar
Zeng, H. & Sanes, J. R. Neuronal cell-type classification: challenges, opportunities and the path forward. Nat. Rev. Neurosci. 18, 530–546 (2017).
Article CAS PubMed Google Scholar
Gouwens, N. W. et al. Classification of electrophysiological and morphological neuron types in the mouse visual cortex. Nat. Neurosci. 22, 1182–1195 (2019).
Article CAS PubMed PubMed Central Google Scholar
Tasic, B. et al. Shared and distinct transcriptomic cell types across neocortical areas. Nature 563, 72–78 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Lein, E., Borm, L. E. & Linnarsson, S. The promise of spatial transcriptomics for neuroscience in the era of molecular cell typing. Science 358, 64–69 (2017).
Article ADS CAS PubMed Google Scholar
Arendt, D. et al. The origin and evolution of cell types. Nat. Rev. Genet. 17, 744–757 (2016).
Article CAS PubMed Google Scholar
Puchalski, R. B. et al. An anatomic transcriptional atlas of human glioblastoma. Science 360, 660–663 (2018).
Article ADS CAS PubMed PubMed Central Google Scholar
Gouwens, N. W. et al. Systematic generation of biophysically detailed models for diverse cortical neuron types. Nat. Commun. 9, 710 (2018).
Article ADS PubMed PubMed Central Google Scholar
Li, A. et al. Micro-optical sectioning tomography to obtain a high-resolution atlas of the mouse brain. Science 330, 1404–1408 (2010).
Article ADS CAS PubMed Google Scholar
Quan, T. et al. NeuroGPS-Tree: automatic reconstruction of large-scale neuronal populations with dense neurites. Nat. Methods 13, 51–54 (2016).
Article CAS PubMed Google Scholar
Akram, M. A., Nanda, S., Maraver, P., Armañanzas, R. & Ascoli, G. A. An open repository for single-cell reconstructions of the brain forest. Sci. Data 5, 180006 (2018).
Article PubMed PubMed Central Google Scholar
Ascoli, G. A., Donohue, D. E. & Halavi, M. NeuroMorpho.Org: a central resource for neuronal morphologies. J. Neurosci. 27, 9247–9251 (2007).
Article CAS PubMed PubMed Central Google Scholar
Wheeler, D. W. et al. Hippocampome.org: a knowledge base of neuron types in the rodent hippocampus. Elife 4, e09960 (2015).
Article PubMed PubMed Central Google Scholar
Ecker, J. R. et al. The BRAIN initiative cell census consortium: lessons learned toward generating a comprehensive brain cell atlas. Neuron 96, 542–557 (2017).
Article CAS PubMed PubMed Central Google Scholar
DeFelipe, J. et al. New insights into the classification and nomenclature of cortical GABAergic interneurons. Nat. Rev. Neurosci. 14, 202–216 (2013).
Article CAS PubMed PubMed Central Google Scholar
LeCun, Y., Bengio, Y. & Hinton, G. Deep learning. Nature 521, 436–444 (2015).
Article ADS CAS PubMed Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. in Proceedings of the IEEE international conference on computer vision. 1026–1034.
Deng, J. et al. in 2009 IEEE conference on computer vision and pattern recognition. 248–255 (IEEE, 2009).
Zhang, T., Zeng, Y. & Xu, B. HCNN: a neural network model for combining local and global features towards human-like classification. Int. J. Pattern Recognit. Artif. Intell. 30, 1655004 (2016).
Article MathSciNet Google Scholar
Venugopalan, S. et al. in Proceedings of the IEEE international conference on computer vision. 4534–4542 (2015).
Mnih, V. et al. Human-level control through deep reinforcement learning. Nature 518, 529–533 (2015).
Article ADS CAS PubMed Google Scholar
Silver, D. et al. Mastering the game of Go with deep neural networks and tree search. Nature 529, 484–489 (2016).
Article ADS CAS PubMed Google Scholar
Oh, J., Guo, X., Lee, H., Lewis, R. L. & Singh, S. in Advances in neural information processing systems. 2863–2871 (2015).
Cuntz, H., Forstner, F., Borst, A. & Häusser, M. One rule to grow them all: a general theory of neuronal branching and its practical application. PLoS Comput. Biol. 6, e1000877 (2010).
Article ADS MathSciNet PubMed PubMed Central Google Scholar
Mihaljević, B., Benavides-Piccione, R., Bielza, C., DeFelipe, J. & Larrañaga, P. Bayesian network classifiers for categorizing cortical GABAergic interneurons. Neuroinformatics 13, 193–208 (2015).
Article PubMed Google Scholar
Karagiannis, A. et al. Classification of NPY-expressing neocortical interneurons. J. Neurosci. 29, 3642–3659 (2009).
Article CAS PubMed PubMed Central Google Scholar
Santana, R., McGarry, L., Bielza, C., Larrañaga, P. & Yuste, R. Classification of neocortical interneurons using affinity propagation. Front. Neural Circuits 7, 185 (2013).
Article PubMed PubMed Central Google Scholar
Lu, Y., Carin, L., Coifman, R., Shain, W. & Roysam, B. Quantitative arbor analytics: unsupervised harmonic co-clustering of populations of brain cell arbors based on L-measure. Neuroinformatics 13, 47–63 (2015).
Article PubMed Google Scholar
Hosp, J. A. et al. Morpho-physiological criteria divide dentate gyrus interneurons into classes. Hippocampus 24, 189–203 (2014).
Article CAS PubMed Google Scholar
Tai, K. S., Socher, R. & Manning, C. D. Improved semantic representations from tree-structured long short-term memory networks. Arxiv 1503(00075), 1556–1566. https://doi.org/10.3115/v1/P15-1150 (2015).
Article Google Scholar
Peng, H. et al. BigNeuron: large-scale 3D neuron reconstruction from optical microscopy images. Neuron 87, 252–256 (2015).
Article CAS PubMed PubMed Central Google Scholar
van der Maaten, L. & Hinton, G. Visualizing data using t-SNE. J. Mach. Learn. Res. 9, 2579–2605 (2008).
MATH Google Scholar
Tan, C. et al. deepbrainseg: automated brain region segmentation for micro-optical images with a convolutional neural network. Front. Neurosci. 14, 179. https://doi.org/10.3389/fnins.2020.00179 (2020).
Article PubMed PubMed Central Google Scholar
Zhou, Z., Kuo, H. C., Peng, H. & Long, F. DeepNeuron: an open deep learning toolbox for neuron tracing. Brain Inform 5, 3. https://doi.org/10.1186/s40708-018-0081-2 (2018).
Article PubMed PubMed Central Google Scholar
Lin, X. & Zheng, J. A neuronal morphology classification approach based on locally cumulative connected deep neural networks. Appl. Sci. 9, 3876. https://doi.org/10.3390/app9183876 (2019).
Article Google Scholar
Yin, C. et al. Network science characteristics of brain-derived neuronal cultures deciphered from quantitative phase imaging data. Sci. Rep. 10, 15078. https://doi.org/10.1038/s41598-020-72013-7 (2020).
Article CAS PubMed PubMed Central Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. in Proceedings of the IEEE conference on computer vision and pattern recognition. 770–778 (IEEE, 2016).
Stockley, E. W., Cole, H. M., Brown, A. D. & Wheal, H. V. A system for quantitative morphological measurement and electrotonic modelling of neurons: three-dimensional reconstruction. J. Neurosci. Methods 47, 39–51 (1993).
Article CAS PubMed Google Scholar
Hochreiter, S. & Schmidhuber, J. Long short-term memory. Neural Comput. 9, 1735–1780 (1997).
Article CAS PubMed Google Scholar
He, K., Zhang, X., Ren, S. & Sun, J. in 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR). 770–778 (IEEE).
Ronneberger, O., Fischer, P. & Brox, T. U-Net: convolutional networks for biomedical image segmentation. Lect. Notes Comput. Sci. 9351, 234–241. https://doi.org/10.1007/978-3-319-24574-4_28 (2015).
Article Google Scholar
Zhou, C., Sun, C., Liu, Z. & Lau, F. C. M. A C-LSTM Neural Network for Text Classification. CoRR abs/1511.08630, 1511.08630 (2015).

Download references

Acknowledgements

All of the images in this paper are generated by ourselves. The neuron-related images (i.e., Figs. 1a–d, 2a, b, 3a–e, 6a) are processed with the software Houdini16.5.350 (Non-commercial license, https://www.sidefx.com/docs/). Figures. 4 and 5 are generated with Python codes. Images in Fig. 6 are processed with Office Visio. This study is supported by the National Key Research and Development Program of China (Grant No. 2020AAA0104305), the National Natural Science Foundation of China (Grant No. 61806195), the Strategic Priority Research Program of the Chinese Academy of Sciences (Grant No. XDB32070100, XDA27010404), and the Beijing Brain Science Project (Z181100001518006).

Author information

Authors and Affiliations

Institute of Automation, Chinese Academy of Sciences, Beijing, China
Tielin Zhang, Yi Zeng, Xinhe Zhang, Mengting Shi, Duzhen Zhang & Bo Xu
University of Chinese Academy of Sciences, Beijing, China
Yi Zeng, Mengting Shi, Duzhen Zhang & Bo Xu
Center for Excellence in Brain Science and Intelligence Technology, Chinese Academy of Sciences, Shanghai, China
Yi Zeng & Bo Xu
Electronics and Communication Engineering, Peking University, Beijing, China
Yue Zhang
Department of Automation, Tsinghua University, Beijing, China
Likai Tang

Authors

Tielin Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Yi Zeng
View author publications
You can also search for this author in PubMed Google Scholar
Yue Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Xinhe Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Mengting Shi
View author publications
You can also search for this author in PubMed Google Scholar
Likai Tang
View author publications
You can also search for this author in PubMed Google Scholar
Duzhen Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Bo Xu
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

T.Z. and Y.Z. designed architectures of models. T.Z., L.T., Y.Z., and D.Z. performed the experiments. X.Z. prepared the datasets. M.S. performed the t-SNE analysis. T.Z., Y.Z., Y.Z., and B.X. wrote/refined the paper.

Corresponding authors

Correspondence to Tielin Zhang or Yi Zeng.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhang, T., Zeng, Y., Zhang, Y. et al. Neuron type classification in rat brain based on integrative convolutional and tree-based recurrent neural networks. Sci Rep 11, 7291 (2021). https://doi.org/10.1038/s41598-021-86780-4

Download citation

Received: 08 November 2019
Accepted: 17 March 2021
Published: 31 March 2021
DOI: https://doi.org/10.1038/s41598-021-86780-4

This article is cited by

Moderate white light exposure enhanced spatial memory retrieval by activating a central amygdala-involved circuit in mice
- MengJuan Shang
- MeiLun Shen
- JunLing Xing
Communications Biology (2023)
Application of quantum machine learning using quantum kernel algorithms on multiclass neuron M-type classification
- Xavier Vasques
- Hanhee Paik
- Laura Cif
Scientific Reports (2023)
Evolutionarily conservative and non-conservative regulatory networks during primate interneuron development revealed by single-cell RNA and ATAC sequencing
- Ziqi Zhao
- Dan Zhang
- Zhiheng Xu
Cell Research (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Learning cellular morphology with neural networks

Machine learning-based classification of mitochondrial morphology in primary neurons and brain

DeNeRD: high-throughput detection of neurons for brain-wide analysis with deep learning

Introduction

Results

Virtually simulated dataset

NeuroMorpho-rat dataset

CASIA rat-neuron dataset

Neuron type classification

t-SNE analysis on DNN-based features

Analysis of hand-designed features

t-SNE analysis of hand-designed features

Discussion

Methods

ResNet CNN for the 2D-morphology image classification

Tree-RNN for unstructured SWC-format data classification

The hand-designed features

Trees toolbox for preprocessing SWC-format files

t-SNE configuration for feature analysis

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary Information

Supplementary Information

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Moderate white light exposure enhanced spatial memory retrieval by activating a central amygdala-involved circuit in mice

Application of quantum machine learning using quantum kernel algorithms on multiclass neuron M-type classification

Evolutionarily conservative and non-conservative regulatory networks during primate interneuron development revealed by single-cell RNA and ATAC sequencing

Comments

Search

Quick links