Interpretable machine learning approach for neuron-centric analysis of human cortical cytoarchitecture

Štajduhar, Andrija; Lipić, Tomislav; Lončarić, Sven; Judaš, Miloš; Sedmak, Goran

doi:10.1038/s41598-023-32154-x

Download PDF

Article
Open access
Published: 05 April 2023

Interpretable machine learning approach for neuron-centric analysis of human cortical cytoarchitecture

Andrija Štajduhar^1,2,
Tomislav Lipić³^na1,
Sven Lončarić⁴,
Miloš Judaš² &
…
Goran Sedmak²

Scientific Reports volume 13, Article number: 5567 (2023) Cite this article

1721 Accesses
2 Altmetric
Metrics details

Subjects

Abstract

The complexity of the cerebral cortex underlies its function and distinguishes us as humans. Here, we present a principled veridical data science methodology for quantitative histology that shifts focus from image-level investigations towards neuron-level representations of cortical regions, with the neurons in the image as a subject of study, rather than pixel-wise image content. Our methodology relies on the automatic segmentation of neurons across whole histological sections and an extensive set of engineered features, which reflect the neuronal phenotype of individual neurons and the properties of neurons’ neighborhoods. The neuron-level representations are used in an interpretable machine learning pipeline for mapping the phenotype to cortical layers. To validate our approach, we created a unique dataset of cortical layers manually annotated by three experts in neuroanatomy and histology. The presented methodology offers high interpretability of the results, providing a deeper understanding of human cortex organization, which may help formulate new scientific hypotheses, as well as to cope with systematic uncertainty in data and model predictions.

Multi-layered maps of neuropil with segmentation-guided contrastive learning

Article Open access 20 November 2023

Discrimination of the hierarchical structure of cortical layers in 2-photon microscopy data by combined unsupervised and supervised machine learning

Article Open access 15 May 2019

Cellular 3D-reconstruction and analysis in the human cerebral cortex using automatic serial sections

Article Open access 02 September 2021

Introduction

The human cerebral cortex is a highly organized, complex structure composed of billions of neurons. One of the most prominent features of the human cerebral cortex are cortical layers—laminar structures parallel to the surface of the cerebral hemisphere and superimposed one on top of the other. This layered structure is caused by variations in cell density, size, and shape of neurons, specific for each cortical layer. The entire cerebral cortex can be subdivided, based on the number of layers, into a six-layered neocortex (or isocortex) and allocortex which can further be subdivided into a two-layered paleocortex, three-layered archicortex and usually a five-layered mesocortex. Today, the most used classification of the neocortical layers is one based on the concept developed by Korbinian Brodmann at the beginning of the 20th century¹. In this classification, the neocortex is composed of six layers differentiated by neuronal features such as neuronal type, number, size, shape, density, etc. In his seminal work, Brodmann also summarized previous work done on the composition of the neocortex showing that in the generic neocortex, researchers differed significantly in describing the number of layers ranging from four to seven. Thus, we can infer that cortical layers, although biological features of the cerebral cortex, are delineated by arbitrary criteria developed by human observers. Furthermore, the composition, size, and number of layers are not constant throughout the cerebral cortex. Based on variations of these cytoarchitectonic features cerebral cortex can be divided into smaller cortical, cytoarchitectonic areas. At the beginning of the 20th-century researchers in the field of cytoarchitectonics (i.e. study of the cortical building plan) developed several cytoarchitectonic maps which divided the cerebral cortex into smaller structural units with two of the most influential being one developed by Brodmann^1,2 and the other by von Economo and Koskinas³. For every cytoarchitectonic area, a clear set of features can be defined which distinguishes them from other areas. However, borders between two areas are not always clear cut, but are, rather, an area of transition with gradual changes from one to the other. In these transitioning parts it is often difficult for the human observer to precisely and consistently delineate both cortical areas and laminas within areas. Interest in the analysis of these structures is driven by the evidence of the relationship between features of cytoarchitectonic structure and cortical functions. Today, it is believed that the way neurons are distributed in the brain determines its function. The subtleties in this fine structure of the brain underlying its function can be characterized in great detail by studying the organization of cells across the cortex⁴. However, investigations in this field are mostly done manually, require a significant amount of researchers’ time, introduce observer-dependent bias and hinder the reproducibility of the research⁵. As technology advances, more and more digitized histological data becomes available. Computer-aided methods provide means for faster, more objective, and higher-throughput investigations of the cortical structures through automatized processing of histological sections of the cortex. This enables researchers to answer various scientific questions by better understanding the anatomical and functional organization of the brain, as well as observing subtle changes in the brain structures caused by neurological and psychiatric diseases.

Ever since the first methods which introduced automation in the analysis of cortical layers, the central idea was the sampling of different tissue measures along transverse lines drawn either manually or semi-automatically across the cortex, perpendicular to the laminar structure and spanning the full width of the cortex^6,7,8,9. An important step was the development of gray-level index (GLI)¹⁰, a method that measures areal fractions of darkly stained cell bodies across the cortex, yielding different neuron density profiles depending on the location of interest and identifies positions at the cortical ribbon where cytoarchitectonic features change¹¹. Blocks of adjacent profiles may be represented by feature vectors and compared between areas of the cortex. Profile features were used for the estimation of cell counts between cortical areas, providing realistic information about neuron density¹². Higher levels of automation enabled faster analysis of larger datasets. The BigBrain is a high-resolution 3D digital atlas of the whole human brain, providing huge amounts of high-resolution histological data for neuroanatomical studies¹³. The GLI profiles were combined with machine learning methods to create laminar segmentation on the BigBrain dataset, creating parcellations across large brain areas^14,15. Cortical layers were also segmented using convolutional neural networks¹⁶, and a comparison with GLI approach is given¹⁷. The first article that uses features and statistics of individual neurons appeared in 2017, where the authors used automatic segmentation of cells in the mouse brain and analyze cellular shape statistics without machine learning models¹⁸. In the year 2018, the first approach that does not use profiles across the cortex was proposed¹⁹. A combined approach of unsupervised and supervised machine learning was used on a dataset of 2-photon microscopic images of the rat cortex. One can observe the transition towards automation, analysis of larger datasets, and usage of machine learning methods in the field that was until recently dominated by the usage of classical image processing techniques that use various filtering, image-wide pixel transformations, thresholding, and similar operations.

Machine learning-based methods rely on a training dataset to develop their predictive capabilities, allowing them to generalize and make predictions on unseen data. In this context, the unseen data refers to portions of tissue that have not been manually delineated or labeled by human researchers. In machine learning, such an approach is known as supervised learning, sometimes also referred to as predictive modeling. Having adequate learning data is essential in developing successful models, and human labels are considered the gold standard. However, over the years, the impact of human bias in brain parcellation has been increasingly recognized and many methods sought to overcome this issue by developing objective quantitative measures and usage of statistics to distinguish between different layers and brain areas²⁰. In a recent paper²¹, authors use neuronal density estimates to infer local neuronal connectivity while addressing the issue of human bias in the manual segmentation of cortical layers and use an unsupervised clustering approach to identify and represent the laminar structure. An important aspect to consider in the analysis of images of the human brain, or biomedical images in general, is to what extent automated systems should recreate the work of human investigators. Computer vision systems can give access to underlying image contents that are not visible, process every image equally, and provide partial or complete automation of the process²², especially concerning the recently available massive amounts of high-resolution and multimodal data, far beyond the capabilities of any kind of manual analysis. Such systems can derive and analyze detailed anatomical and biologically meaningful information on a large scale and reveal currently neglected structuring principles and provide a deeper understanding of the laminar structure. This suggests that there is a need to move beyond the limitations of manually created parcellation in conventional atlases towards data-driven analysis. Ideally, a representation of a histological section that would contain information to enable objective and unsupervised classification or parcellation of layers and even reveal sub-layering would help resolve many unanswered issues in the study of brain anatomy and physiology.

In this paper, we hypothesize that the current methods do not offer such capabilities due to their inability to capture subtle cytoarchitectonic features. Consequently, they express low explainability and interpretability of their results. Methods based on deep that operate on striding windows across the image may provide convincing parcellations, but we cannot ask what tissue characteristics lead to these results. Here, we investigate the possibility of developing neuron phenotyping²³ that captures cytoarchitectonic details and can be used for inference about the brain structure using only local tissue information at the cellular level. The usability of this approach is demonstrated through the task of distinguishing cortical layers by classifying each neuron within the six cortical layers and white matter using a supervised machine learning method. The method uses phenotype characterization as an input and predicts the individual neuron’s layer, natively offering a high level of interpretability. We also demonstrate the human ability to distinguish between the cortical layers and explore how learning-based methods can generalize from such noisy labels. The developed framework provides a capability to investigate which neuronal features are characteristic for different areas and presents a prospect for future investigations in the field of cytoarchitectonics.

Materials and methods

Histological data were obtained from the Zagreb Neuroembryological Collection²⁴. Samples used in this study were taken from the prefrontal cortex of two brains (brain 1 55 years old female, post-mortem delay 24h; brain 2 age not available; male, post-mortem delay 4h). Sections were taken from the dorsal and ventral part of the typical six-layered homotypic isocortex of the prefrontal cortex^3,4. Brains were fixed in $4\%$ PFA for two weeks. Following sampling, sections were dehydrated through a series of ethanol and embedded in paraffin. Sections were cut using rotating microtome at thicknesses of $10\;\upmu \hbox {m}$ and $20\;\upmu \hbox {m}$. The tissue was stained using the NeuN immunohistochemistry method according to standard protocol²⁵. NeuN is an RNA-binding nuclear protein, derived from the RBFOX3 gene, which regulates alternative splicing in neurons and is expressed explicitly in all neurons of used tissue specimens. In the experiments, $10\;\upmu \hbox {m}$ and $20\;\upmu \hbox {m}$ sections were used in order to test whether tissue thickness would impact the results. Histological sections were digitized using the Hamamatsu Nanozoomer 2.0 scanner (Hamamatsu Photonics, Japan) at 40x magnification, corresponding to $0.226\mu \hbox {m/pixel}$ resolution. Example histological sections in Fig. 1 show varying neuronal morphology and cellular distribution across the cortical layers. Computational experiments were performed using custom scripts written in Python 3.8 and standard publicly available libraries.

Neuron-level features

In manual delineation, the density and size of neurons are the most important characterizations of the laminar structure. Based on anatomical descriptions and kernel density estimates, three populations of similar densities are assumed (layers II and IV as dense, layers III, V, and VI as average, and Layer I and white matter as sparse), two populations of similar sizes (layers III, V and VI containing on average larger neurons, and layers I, II, IV and white matter containing on average smaller neurons). By plotting a histogram of neuron densities and sizes, one can observe that the features express a multimodal distribution, which can be separated using the minimization of intraclass variance²⁶. Figure 2 shows a visualization of separating the neuron populations across the histological section, revealing the laminar structure. In contrast to the classical pixel-based approach to cortical layer segmentation, we used neuron-level tissue descriptors to characterize and examine the underlying tissue properties. We develop several feature classes that describe each neuron in the tissue and use a machine learning model to determine the layer of individual neurons. By classifying all neurons in the tissue, we obtain the parcellation of the laminar structure. To the best knowledge of the authors, this is the first bottom-up approach in the analysis of brain cytoarchitectonics that builds from the cellular level and infers about larger structures based on morphological and textural features of individual neurons. Here, we discuss the development of those features, as well as some rationale behind the choices made in their development and selection.

The first step in obtaining the neuron characterization is the segmentation of neurons from the background tissue. The segmentations were obtained using automated methods^27,28 which use grayscale-guided watershed on anisotropically diffused images to separate neurons, rather than often used distance maps obtained from the grey-level threshold, providing a binary image of segmented, non-overlapping neuron areas. As the goal here is to create consistent results across the tissue, other segmentation methods may be used as well, such as a recently proposed instance segmentation via contour proposals²⁹, especially for different staining methods. This step yielded the locations and segmentations of neurons, from which other neuronal characteristics are developed.

Secondly, neuron segmentations were analyzed using ImageJ particle analysis pipeline³⁰. An overlap of binary segmentations and the original image was made, and ImageJ’s function analyze particles was used to produce measurements of neurons’ bodies. Those were the area, perimeter, circularity, roundness, and Feret’s diameter as well as the mean, median, skewness, and kurtosis of the gray values. More details on particle measurement can be found in ImageJ’s documentation³¹. These features form the basis for investigations in brain microanatomy, as they are often, although not at this level of precision, perceived by the eye of neuroanatomists. By this, we obtain the first neuronal characteristics, which may be visualized to reveal patterns of their appearance across the cortical layers, as shown in Fig. 2. It should be noted that values based on image intensity were not used in the further analysis as it was concluded that these were not usable generally, as they may be heavily influenced by uneven staining across the section and exhibit different values for different staining procedures. These simple measurements do not possess a discriminative power to create clear classifications of neurons within the layers. Therefore, richer descriptors that incorporate neuron neighborhoods were computed, as described below.

It is worth mentioning that density-based clustering algorithms are often used to segment the areas of similar point densities^32,33,34, which may be brought into relation with cortical layers having a roughly uniform density within each layer. However, it seems that the neuron distribution in the cortex is such that their intrinsic structure may not be clustered by a single set of global density parameters, as used in clustering methods. Nevertheless, these methods provided insight into some cortical properties. Meaningful clusters were created when considering neurons within the radius between $100\;\upmu \hbox {m}$ and $300\;\upmu \hbox {m}$, containing between 300 and 800 neurons. This lead to the conclusion that the changing nature of neuron distribution in the brain is best characterized when performing measurements in this range. This range approximately also corresponds to the biological limits of interlayer distances. It is important to emphasize that although choosing a predefined radius or a number of neighbors may seem equivalent, analysis of nearest neighbors is preferred over the fixed radius approach. A predefined radius may be interpreted differently, depending on the image resolution. The specified range around a neuron allows for a more detailed and precise analysis of the microstructure of the tissue in that specific area by balancing between obtaining enough information to capture local tissue properties and not being confounded by reaching too far from the neuron toward other layers and incorporating information which is not in the neuron’s vicinity and is, therefore, less relevant for neuron’s phenotype. Also, if a fixed maximum number of neighbors for each neuron is used, efficient data structures like kd-trees^35,36 may be precomputed. Considering the large number of neurons found in a histological section, efficiency may be of critical importance.

To measure properties of neurons’ neighborhoods, nearest k neighbors were considered, for $k \in [50,100,250,500,1000]$. The distances to a neuron’s k-th nearest neighbor were used as a feature, as well as their mean, max, min, skewness, kurtosis, and entropy. Basic measures of individual neurons were computed in a similar fashion to produce, for instance, the average area of neighboring 100 neurons, as shown at the right in Fig. 2. A convex hull of neurons’ k-neighbours gives information about the area around a neuron and a number of its neighbors and is described using hull area, perimeter, average nearest distance for neurons found in the hull, and standard deviation of nearest distances. Dispersion of neurons may be quantified using nearest neighbor index (NNI), a measure that describes whether points follow usually subjective patterns of regular, clustered, or random distribution. The NNI measures the distance between each point and its nearest neighbor’s location. All the nearest neighbor distances are averaged, and if the average distance is less than the average for a random distribution, the distribution of the features being analyzed is considered clustered. If the average distance is greater than a random distribution, the features are considered regularly dispersed. The index is expressed as the ratio of the mean observed distance divided by the expected distance, which is based on a random distribution with the same number of points covering the same total area,

$$\begin{aligned} NNI_i = \displaystyle \frac{\frac{1}{n} \sum _{j=1}^n d(i,j)}{0.5 \sqrt{HullArea(i)/n}}. \end{aligned}$$

(1)

Neurons in all layers except layer I and white matter tend more towards uniformly dispersed distribution, especially neurons of layer IV which tend more towards random distribution.

Depending on its position in the cortex, a neuron may be placed more toward the middle or more toward the edge of its layer. The computation of properties of its neighborhood may be confounded by reaching into adjacent layers and using neurons with different properties for the computation of statistics. To identify this case, measurements may be taken only from neurons found within the range of angle, or slices. Features measured in several directions can identify border neurons and changes in neuronal properties in different directions. Slices may be regarded as measurement units reaching from a single neuron, each unit representing a population of neighboring neurons found in a given direction from the central neuron. The relationship of different populations within an area has been extensively studied in the frame of biological diversity of species, landscapes and other^37,38. Considering the neurons in a slice as members of a single species, and the k neighbors of a neuron as the population of all species in their habitat, biodiversity measures evaluate the relationship between the species. In this context, the number of slices is the number of different species, or richness, and the relative abundance of the different species in an area as evenness. The two most often used such measures are the Shannon index³⁹ and Simpson index⁴⁰. The Shannon index gives a quantitative measure of the uncertainty in predicting the species of an individual chosen randomly from the population. The Simpson index measures the probability that the two individuals who are randomly chosen (with replacement) from the total population will be of the same species.

$$\begin{aligned} Shannon = - \sum _{i=1}^{R} p_i \ln p_i = \ln \left( \frac{1}{\prod _{i=1}^{R} p_i^{p_i}} \right) , \quad Simpson = \sum _{i=1}^{R} p_i^2, \end{aligned}$$

(2)

where R is the number of different species or, here, slices, and $p_i$ is the proportion of species of the ith type in the population or proportion of neurons in ith slice to the number of the neurons in k-neighbourhood. If all slices have an equal number of neurons, $p_i$ values equal 1/R, and the Shannon index takes the maximum value of $\ln R$. If the numbers are unequal, the weighted geometric mean of the $p_i$ values is larger, which results in the index having smaller values. The index equals zero if the neurons from only one slice are present since there is no uncertainty in predicting the slice they are in. The index gives information about the relation between the number of types and the presence of the dominant type. The mean proportional abundance of the slices increases with decreasing number of slices and with the increasing abundance of the slice with the largest number of neurons, the index obtains small values in regions of high diversity like neurons on borders between the layers, thin layers, and especially layer I neurons. The index is large in homogeneous areas like the middle of layer III, where slices reaching from a neuron remain in the area of the layer.

Experimental subjects statement

All specimens were collected during regular autopsies at pathology departments of the University of Zagreb, School of Medicine, approved by the Ethics Committee of the University of Zagreb, School of Medicine and in accordance with the Declaration of Helsinki, and informed consent was obtained from the next of kin.

Results

The distribution of neurons’ features across the cortex provides insight into different aspects of the cytoarchitectonic organization. This detailed, neuron-level approach allows for tissue inspection following known cytoarchitectonic principles like, for instance, the distribution of the largest neurons. Those with the largest area were found in layer III of the cortex and were followed by neurons of layer V and layer VI. Out of the 50 largest neurons, $43 (86\%)$ were found in layer III, $5 (10\%)$ in layer V, and $2 (4\%)$ in layer VI. Out of the 500 largest neurons, $268 (54\%)$ were found in layer III, $142 (28\%)$ in layer V, $87 (17\%)$ in layer VI, and only $3 (1\%)$ in layer IV. This comparison confirms that the computed features yield meaningful results and follow neuroanatomical observations. Visualization of the ratio of the distribution of largest and smallest 500 neurons among the layers is shown in Fig. 3. Neuron circularity and roundness were found the lowest in layer VI which is known to consist of multipolar neurons with dendrites reaching in different directions. Variations in grayscale intensity were expressed differentially in the cortical layers. Neurons with the highest mean grayscale values were mostly found in layer I, showing low NeuN dye intake. Neurons with the lowest median were predominantly found in layer VI, in layer IV and the middle of layer III, sometimes referred to as layer IIIb. No conclusion was made or the reason found for neurons of layer VI having such large NeuN uptake properties that resulted in lower individual grayscale intensities. Measures regarding neuron shape such as area, circularity or perimeter were shown to provide more discriminative power, which is not unexpected since the findings in neuroanatomical research rely to large extent on the shape and size of neurons.

Using local neuron density, layer I and white matter can be distinguished by having small neuron density, thus identifying sparse regions of the section, or dense regions containing layers II and IV, as shown in Fig. 2. The sparse region may further be split using the hull area feature—neurons in the white matter will have a large hull area, in contrast to the neurons of layer I, whose hull is bound between the border of the tissue and the dense layer II. By computing distances to layer I and white matter, cortical thickness and depth of each neuron are derived, as shown in Fig. 4.

Machine learning pipeline

Although the developed neuron feature sets provide quantitative descriptors of cortical organization, they are not sufficient to provide a clear classification of the correct cortical layer. While some features may be more expressed in certain layers than in others, it is not straightforward to determine what exactly is changing between layers, or the impact and interconnection of different features. This led to an assumption that there is information contained in the developed features that can be analyzed, combined, and used to produce a precise classification of neurons concerning their location within the cortical layers through more complex and more expressive models. A supervised machine learning approach is used on a dataset of manually segmented layers in order to accurately predict the layer of each neuron in the histological section. Thus, layer segmentations can be obtained throughout the section. Feature attributions for the model are investigated to identify informative tissue features.

To obtain the training dataset from which the machine learning method will learn to classify neurons according to their layers, portions of both digitized histological sections were given to three human experts in histology and cytoarchitectonics who manually delineated borders between the layers of the cortex. The apparent inconsistencies and mutual disagreement between the experts, as seen in Fig. 5, show the presence of experts’ bias. The experts disagreed on the boundaries of all layers, except on the very apparent layer I/layer II boundary. The manually labeled dataset contained 12,647 neurons in the $10\;\upmu \hbox {m}$ section and 9821 neurons in the $10\;\upmu \hbox {m}$ section.

Boosted decision trees, a state-of-the-art supervised learning method on tabular input data such as the computed neuron features^41,42 were chosen for prediction and interpretation of cortical lamination for its several advantages. Decision trees mirror human decision-making more closely than other approaches⁴³, which is especially useful when modeling human activities, such as the manual delineation of cortical layers, a decision-making process based on a combination of information about neurons’ characteristics. We used CatBoost⁴⁴, a method based on gradient boosting over decision trees which is one of the most successful models for dealing with tabular data. The model was trained for 100 iterations with a learning rate of 0.1 and default other parameters. The best generalizations were obtained by combining the manual labels of all three raters in an ensemble. Three separate models were trained, one for each rater, and using softmax objective output probabilities were summed, and the final prediction was made using a maximum over all classes for each rater. Results of this approach are shown in the right of Fig. 6. Classes of neurons are predicted, and neurons are accurately classified in a way that follows the laminar pattern of the cortex.

Experiments with different sets of features have shown that although some combinations of features do achieve high accuracy on training data, that itself does not guarantee that the model will perform well on the whole histological section. The introduction of features based on the distance to sparse or dense regions has significantly improved the model’s ability to separate regions of the sections into parcels following the laminar layout of the cortex.

Performance analysis

Without the existence of single ground truth for reference, the measurement of the model’s performance is considered in the context of inter-rater variability. Training data was split into $75\%$ training and $25\%$ test subsets, and predictions of the model were compared with the experts’ manual labels. Comparing neuron layer predictions, average agreement between two experts was $0.755 \pm 0.049$ for $10\;\upmu \hbox {m}$ and $0.809 \pm 0.049$ for $20\;\upmu \hbox {m}$ histological section. The average accuracy of the model, when compared to the three experts, was $0.872\pm 0.042$ and $0.897 \pm 0.047$. One could relate this to the accuracy in Wagstyl’s segmentation approach on the BigBrain, where the cross-validation, average per-point accuracy on the test fold was $0.83 \pm 0.02$.

Discussion

Analysis of individual feature attribution

In this study, we proposed a novel approach to analyzing cortical features in order to facilitate more detailed and specific scientific investigations. Currently, a gold standard is a manual annotation by trained experts. However, human experts are often biased, and results obtained in the such analysis are often inconsistent. An important feature in delineating cortical layers and areas is neuronal size. Our findings demonstrated that in analyzed regions, which belong to the homotypic isocortex of the prefrontal cortex, neurons in layer III are in general larger than neurons in layer V. Although a general sense is that neurons in the layer V are larger than neurons in layer III that is not true for the prefrontal cortex. The largest pyramidal neurons in the cerebral cortex are indeed found in layer V (Betz’s cells of the motor cortex), however, in most cortical areas pyramidal neurons in layer V are smaller than neurons in layer III³. This finding confirms that the computed features yield meaningful results and follow neuroanatomical observations.

For a deeper understanding of both the model and features being used in the pipeline, an investigation of the impact of features on the prediction of the neuron’s class was performed on both global (model) and instance (individual neuron) levels. A recent approach for measuring feature attributions in learning models, the SHAP measure⁴⁵, was used for the estimation of neuron features that contribute the most to neuron classification within the layers. Details on SHAP values and their influence on model outputs for both $10\;\upmu \hbox {m}$ and $20\;\upmu \hbox {m}$ datasets are presented in Fig. 7, and in detail for each cortical layer in both sections in the Supplementary Information file, Fig. S.1 and Fig. S.2.

An important aspect of this approach is the ability to identify features that contribute to predicting a single instance of the data, for each neuron. For instance, Fig. 8 shows which neuron features for a neuron of layer VI contributed to the increase of the base SHAP value and making the prediction. The figure also shows the impact of features that decreased the output value for the prediction of the same neuron as a white matter neuron.

The cortical depth feature had a large impact on the model’s output. This is because it helps integrate simpler features like local densities with anatomical observations regarding the position of a neuron within the cortex. Features based on oriented measurements that measure the change of cytoarchitectonic properties in different directions also had considerable feature importance, being able to identify neurons on the border of cortical layers. It was shown that building on lower-level features yields features that have greater discriminative power and thus greater importance. This is due to their capacity to overcome local variations in neuron features and take, for instance, the mean of those features. In contrast, using features of a neuron such as an area, there is no increase in the accuracy of prediction, which is reflected in the low importance of these features. This is probably why different methods for local pattern analysis and classical image feature extraction methods are not very successful in cortical layer segmentation. Range of variability radius was established, giving an estimate of the size of the neuron’s neighborhood in which measurements should be made, so it is large enough to overcome local variations in neuron distribution and recognize its location within the cortical structure on one hand, and on the other narrow enough so that measurements are not confounded by reaching too far into adjacent layers.

To investigate the effects of section thickness, we have analyzed the number of nearest neighbors in several fixed ranges and established that in $10\;\upmu \hbox {m}$ section $55.6\%\pm 0.7\%$ fewer neurons are found, compared to the $20\;\upmu \hbox {m}$ sections. On the other hand, one can observe the number of nearest neighbors in the top most informative features selected by the models. Observing the top 20 features in models trained on each section, an increase in the average number of neighbors in features can be noted in the $10\;\upmu \hbox {m}$ section (630 compared to 580), however, the difference is not statistically significant. Both results support the reasoning in the Materials and Methods section about the hypothetical range in which nearest-neighbor values should be computed, and that one should prefer the number of neighbors over predefined ranges.

During experiments with different models, it was noticed that parts of layer III and layer VI are sometimes divided into sub-layers that follow the direction of the laminar structure, although being very short and not extending through a significant portion of the slice. Further investigations using the developed methodology may provide more detailed insight into the sub-layering of cortical structure.

Limitations

The proposed method has demonstrated the ability to generalize across sections with a very limited training dataset, showing promising results and indicating that it could be transferable to other brains. However, due to the complexity of the brain, further research using larger amounts of histological data of greater variability is needed to demonstrate with certainty the degree to which these results can be generalized across different brains. This will also allow for further testing of the approach in different brain areas, tissue staining, and cutting planes. In oblique slicing, where the angle of the slicing plane may influence the shape of the neurons, for instance, the pyramidal neurons will not appear as triangles if the slicing was perpendicular to the neuronal columns. This limitation may be overcome by using the 3D representation of neurons.

Conclusion

We have proposed a new methodology for modeling brain cytoarchitectonics which builds from the cellular level and infers about larger structures by creating data-driven, neuron-level tissue descriptors based on features of individual neurons, or neuronal phenotyping. This is in contrast to today’s other approaches in neuroscience, which are mostly based on pixel data. The movement from pixel-wise towards neuron-centric analysis, in which the structure of the brain is studied through the lens of the relationship between the neurons, in contrast to relying solely on changing values of pixels in the histological image, conveys a new paradigm in the field and enables methods from other disciplines to be introduced. Here, we refer to the shift in the way histological data is gathered, examined, and comprehended, and the introduction of machine learning methods that operate on tabular data, for which the neuron representations had to be made first. By leaning more on data-driven methods, our approach lowers the need for human-dependent interventions and interpretations, which allows for more objective and reproducible quantification on a large scale. These settings enable novel insights into the organization of cortical microstructure and subtle differences in neuropathology. By enabling the emergence of new, better descriptions and understanding of the brain structure in different areas and stages of development, our work facilitates movement towards fully automatic, high-throughput, objective investigations, allowing the processing of ever-larger amounts of histological data available globally in research centers today.

The scenario of using the proposed methodology was demonstrated on a particular brain region and validated using a set of data manually labeled by three experts. Our methodology is easily extensible with novel neuronal features such as different stainings or receptor maps and allows the use of other machine learning-based computational methods, such as graph neural networks, which will entice future research initiatives in the field of computational neuroscience.

Data availability

The datasets generated during and/or analyzed during the current study are available from the corresponding author upon reasonable request.

References

Brodmann, K. Vergleichende Lokalisationslehre der Grosshirnrinde in ihren Prinzipien dargestellt auf Grund des Zellenbaues (Barth, 1909).
Google Scholar
Judaš, M., Cepanec, M. & Sedmak, G. Brodmann’s map of the human cerebral cortex-or Brodmann’s maps?. Transl. Neurosci. 3, 67–74 (2012).
Article Google Scholar
von Economo, C. F. & Koskinas, G. N. Die cytoarchitektonik der hirnrinde des erwachsenen menschen (Springer, 1925).
Google Scholar
Kaas, J. H. The functional organization of somatosensory cortex in primates. Ann. Anat. Anatomischer Anzeiger 175, 509–518 (1993).
Article CAS PubMed Google Scholar
Lutnick, B. et al. An integrated iterative annotation technique for easing neural network training in medical image analysis. Nat. Mach. Intell. 1, 112–119 (2019).
Article PubMed PubMed Central Google Scholar
Hudspeth, A., Ruark, J. & Kelly, J. Cytoarchitectonic mapping by microdensitometry. Proc. Natl. Acad. Sci. 73, 2928–2931 (1976).
Article ADS CAS PubMed PubMed Central Google Scholar
Hopf, A. Registration of the myeloarchitecture of the human frontal lobe with an extinction method. J. Hirnforsch. 10, 259 (1968).
CAS PubMed Google Scholar
Schleicher, A., Amunts, K., Geyer, S., Morosan, P. & Zilles, K. Observer-independent method for microstructural parcellation of cerebral cortex: A quantitative approach to cytoarchitectonics. Neuroimage 9, 165–177. https://doi.org/10.1006/nimg.1998.0385 (1999).
Article CAS PubMed Google Scholar
Zilles, K., Schleicher, A., Palomero-Gallagher, N. & Amunts, K. Quantitative analysis of cyto-and receptor architecture of the human brain. In Brain Mapping: The Methods (Second Edition) 573–602 (Elsevier, 2002).
Schleicher, A., Zilles, K. & Kretschmann, H. Automatische registrierung und auswertung eines grauwertindex in histologischen schnitten. Verh Anat Ges 72, 413–415 (1978).
Google Scholar
Amunts, K. & Zilles, K. Architectonic mapping of the human brain beyond Brodmann. Neuron 88, 1086–1107 (2015).
Article CAS PubMed Google Scholar
Meyer, H. S. et al. Number and laminar distribution of neurons in a thalamocortical projection column of rat vibrissal cortex. Cereb. Cortex 20, 2277–2286 (2010).
Article PubMed PubMed Central Google Scholar
Amunts, K. et al. Bigbrain: An ultrahigh-resolution 3d human brain model. Science 340, 1472–1475 (2013).
Article ADS CAS PubMed Google Scholar
Wagstyl, K. et al. Mapping cortical laminar structure in the 3d bigbrain. Cereb. Cortex 28, 2551–2562 (2018).
Article PubMed PubMed Central Google Scholar
Quabs, J. et al. Cytoarchitecture, probability maps and segregation of the human insula. Neuroimage 260, 119453 (2022).
Article PubMed Google Scholar
Wagstyl, K. et al. Automated segmentation of cortical layers in bigbrain reveals divergent cortical and laminar thickness gradients in sensory and motor cortices. PLoS Biol. 18, e3000678 (2020).
Article CAS PubMed PubMed Central Google Scholar
Kiwitz, K., Schiffer, C., Spitzer, H., Dickscheid, T. & Amunts, K. Deep learning networks reflect cytoarchitectonic features used in brain mapping. Sci. Rep. 10, 1–15 (2020).
Article Google Scholar
Nosova, S., Snopova, L. & Turlapov, V. Automatic detection of neurons, astrocytes, and layers for nissl-stained mouse cortex. J. WSCG 25, 143–150 (2017).
Google Scholar
Li, D. et al. Discrimination of the hierarchical structure of cortical layers in 2-photon microscopy data by combined unsupervised and supervised machine learning. Sci. Rep. 9, 7424 (2019).
Article ADS PubMed PubMed Central Google Scholar
Tizhoosh, H. R. et al. Searching images for consensus: Can AI remove observer variability in pathology?. Am. J. Pathol. 191, 1702–1708 (2021).
Article PubMed Google Scholar
van Albada, S. J. et al. Bringing anatomical information into neuronal network models. arXiv preprint arXiv:2007.00031 (2020).
Danuser, G. Computer vision in cell biology. Cell 147, 973–978 (2011).
Article CAS PubMed Google Scholar
Grys, B. T. et al. Machine learning and computer vision approaches for phenotypic profiling. J. Cell Biol. 216, 65–71 (2017).
Article CAS PubMed PubMed Central Google Scholar
Judaš, M. et al. The Zagreb Collection of human brains: A unique, versatile, but underexploited resource for the neuroscience community. Ann. N. Y. Acad. Sci. 1225, E101–E130 (2011).
Article Google Scholar
Hsu, S.-M., Raine, L. & Fanger, H. The use of antiavidin antibody and avidin-biotin-peroxidase complex in immunoperoxidase technics. Am. J. Clin. Pathol. 75, 816–821 (1981).
Article CAS PubMed Google Scholar
Otsu, N. A threshold selection method from gray-level histograms. IEEE Trans. Syst. Man Cybern. 9, 62–66 (1979).
Article Google Scholar
Štajduhar, A., Džaja, D., Judaš, M. & Lončarić, S. Automatic detection of neurons in neun-stained histological images of human brain. Physica A 519, 237–246 (2019).
Article ADS MathSciNet MATH Google Scholar
Štajduhar, A., Lepage, C., Judaš, M., Lončarić, S. & Evans, A. C. 3d localization of neurons in bright-field histological images. In ELMAR (ELMAR), 2018 60th International Symposium 75–78 (IEEE, 2018).
Upschulte, E., Harmeling, S., Amunts, K. & Dickscheid, T. Contour proposal networks for biomedical instance segmentation. Med. Image Anal. 77, 102371 (2022).
Article PubMed Google Scholar
Rueden, C. T. et al. Imagej 2: Imagej for the next generation of scientific image data. BMC Bioinform. 18, 529 (2017).
Article Google Scholar
ImageJ analyze menu. https://imagej.nih.gov/ij/docs/menus/analyze.html. Accessed: 2022-09-30.
Rodriguez, A. & Laio, A. Clustering by fast search and find of density peaks. Science 344, 1492–1496 (2014).
Article ADS CAS PubMed Google Scholar
Ankerst, M., Breunig, M. M., Kriegel, H.-P. & Sander, J. Optics: ordering points to identify the clustering structure. In ACM Sigmod Record Vol. 28/2 49–60 (ACM, 1999).
Campello, R. J., Moulavi, D., Zimek, A. & Sander, J. Hierarchical density estimates for data clustering, visualization, and outlier detection. ACM Trans. Knowl. Discov. Data (TKDD) 10, 5 (2015).
Google Scholar
Bentley, J. L. Multidimensional binary search trees used for associative searching. Commun. ACM 18, 509–517 (1975).
Article MATH Google Scholar
Maneewongvatana, S. & Mount, D. M. It’s okay to be skinny, if your friends are fat. In Center for Geometric Computing 4th Annual Workshop on Computational Geometry Vol. 2 1–8 (1999).
Magurran, A. E. Measuring Biological Diversity (Wiley, 2013).
Google Scholar
Nagendra, H. Opposite trends in response for the Shannon and Simpson indices of landscape diversity. Appl. Geogr. 22, 175–186 (2002).
Article Google Scholar
Shannon, C. E. A mathematical theory of communication. Bell Syst. Tech. J. 27, 379–423 (1948).
Article MathSciNet MATH Google Scholar
Simpson, E. H. Measurement of diversity. Nature 163, 688 (1949).
Article ADS MATH Google Scholar
Bramer, M. Principles of Data Mining Vol. 180 (Springer, 2007).
MATH Google Scholar
Liu, S., Wang, X., Liu, M. & Zhu, J. Towards better analysis of machine learning models: A visual analytics perspective. Visual Informatics 1, 48–56 (2017).
Article Google Scholar
James, G., Witten, D., Hastie, T. & Tibshirani, R. An Introduction to Statistical Learning Vol. 112 (Springer, 2013).
Book MATH Google Scholar
Dorogush, A. V., Ershov, V. & Gulin, A. Catboost: Gradient boosting with categorical features support. arXiv preprint arXiv:1810.11363 (2018).
Lundberg, S. M. & Lee, S.-I. A unified approach to interpreting model predictions. In Advances in Neural Information Processing Systems Vol. 30 (eds Guyon, I. et al.) 4765–4774 (Curran Associates, Inc., 2017).
Google Scholar

Download references

Acknowledgements

This publication was supported by the European Union through the European Regional Development Fund, Operational Programme Competitiveness, and Cohesion Operational Programme, Grant Agreements No. KK.01.1.1.01.0007, CoRE - Neuro and KK.01.1.1.01.0009, “DATACROSS”; and the Canada First Research Excellence Fund, awarded to McGill University for the Healthy Brains for Healthy Lives initiative. The authors extend their gratitude to Dora Sedmak from the Croatian Institute for Brain Research (CIBR), School of Medicine, University of Zagreb, and Jennifer Novek from Montreal Neurological Institute (MNI), McGill University, for their effort in neuron labeling and helpful discussions. Special thanks to Claude Lepage from the MNI, McGill University, for reading the paper thoroughly and providing constructive feedback. The authors dedicate this paper to the memory of Tomislav Lipić.

Author information

Tomislav Lipić is deceased.

Authors and Affiliations

School of Public Health “Andrija Štampar”, School of Medicine, University of Zagreb, 10000, Zagreb, Croatia
Andrija Štajduhar
Croatian Institute for Brain Research, School of Medicine, University of Zagreb, 10000, Zagreb, Croatia
Andrija Štajduhar, Miloš Judaš & Goran Sedmak
Laboratory for Machine Learning and Knowledge Representation, Ruder Bošković Institute, 10000, Zagreb, Croatia
Tomislav Lipić
Faculty of Electrical Engineering and Computing, University of Zagreb, 10000, Zagreb, Croatia
Sven Lončarić

Authors

Andrija Štajduhar
View author publications
You can also search for this author in PubMed Google Scholar
Tomislav Lipić
View author publications
You can also search for this author in PubMed Google Scholar
Sven Lončarić
View author publications
You can also search for this author in PubMed Google Scholar
Miloš Judaš
View author publications
You can also search for this author in PubMed Google Scholar
Goran Sedmak
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

A.Š. and T.L. conceived the paper and ran experiments. S.L. and M.J. initiated and supervised the research. M.J and G.S. provided data and designed experiments. All authors collaboratively analyzed and interpreted the results. A.Š., T.L. and G.S. prepared the manuscript. All authors revised and verified the final version of the manuscript.

Corresponding author

Correspondence to Andrija Štajduhar.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary Figures.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Štajduhar, A., Lipić, T., Lončarić, S. et al. Interpretable machine learning approach for neuron-centric analysis of human cortical cytoarchitecture. Sci Rep 13, 5567 (2023). https://doi.org/10.1038/s41598-023-32154-x

Download citation

Received: 11 October 2022
Accepted: 23 March 2023
Published: 05 April 2023
DOI: https://doi.org/10.1038/s41598-023-32154-x

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.