Database of literature derived cellular measurements from the murine basal ganglia

Bjerke, Ingvild E.; Puchades, Maja A.; Bjaalie, Jan G.; Leergaard, Trygve B.

doi:10.1038/s41597-020-0550-3

Data Descriptor
Open access
Published: 06 July 2020

Scientific Data volume 7, Article number: 211 (2020)

Abstract

Quantitative measurements and descriptive statistics of different cellular elements in the brain are typically published in journal articles as text, tables, and example figures, and represent an important basis for the creation of biologically constrained computational models, design of intervention studies, and comparison of subject groups. Such data can be challenging to extract from publications and difficult to normalise and compare across studies, and few studies have so far attempted to integrate quantitative information available in journal articles. We here present a database of quantitative information about cellular parameters in the frequently studied murine basal ganglia. The database holds a curated and normalised selection of currently available data collected from the literature and public repositories, providing the most comprehensive collection of quantitative neuroanatomical data from the basal ganglia to date. The database is shared as a downloadable resource from the EBRAINS Knowledge Graph (https://kg.ebrains.eu), together with a workflow that allows interested researchers to update and expand the database with data from future reports.

Measurement(s)	Cell number • synapse • dendritic spine • Cell Density • Synapse density • dendritic spine density • Distribution • neuron morphology trait • basal ganglion morphology trait
Technology Type(s)	digital curation
Sample Characteristic - Organism	Mus musculus • Rattus norvegicus

Machine-accessible metadata file describing the reported data: https://doi.org/10.6084/m9.figshare.12199088

Background & Summary

Quantitative knowledge about the number and normal variation of different cell types of the brain and their subcellular elements, such as synapses and dendritic spines, is of broad interest for neuroscientists. Such knowledge is important for several purposes, including building and constraining computational models^1,2,3, guiding new experimental research, and comparing data from individual or groups of subjects. The need for quantitative measurements of neural architecture has led to the development of numerous experimental methods for unbiased quantification of neuroanatomical features. Examples include cell counting methods⁴, stereological approaches to obtain numbers, areas or volumes^5,6, and point pattern analyses to characterise spatial distributions of cells or cellular elements^7,8. The results of such studies are typically published in original papers, reporting e.g. estimates of total numbers or densities of cells^9,10 or relative amounts of cells, synapses, spines, or other parameters in different experimental groups^11,12. While individual papers may be easily interpreted, it is becoming increasingly challenging to keep an overview of the steadily growing number of publications¹³ and to evaluate the consistency and comparability of information. The traditional research paper format is not particularly well suited to making comparisons, as data may be distributed across text, tables and figures, with units of measurement and nomenclatures that vary across papers. Although units of measurement can be converted and nomenclature differences may be possible to resolve, this requires significant time and effort from the reader. In some cases, findings are reported in non-standard units (e.g. as percentage of control, number per section), which may make them impossible to compare to other results. Researchers investigating brain structure and function in animal models may find it difficult to answer relatively simple questions, such as: What is the average number of cells or subcellular structures in my brain region of interest, and how much do these numbers vary? Which parameters have been quantified before and what were the methods used to do so? Can data from two studies be compared? Are the results reported in the literature within the same range?

Neuroanatomical information is available from several databases. The temporal-lobe database (www.temporal-lobe.com) presents connections in the rat hippocampal region schematically in an interactive PDF, allowing the user to quickly view aggregated information¹⁴. In the Brain Architecture Management System (BAMS, https://bams1.org/) project, Bota and colleagues compiled reports and scored the strength of connections between regions across the brain on a semi-quantitative scale¹⁵. The hippocampome (www.hippocampome.org), a database of neuronal cell types in the hippocampus, contains interactive matrices showing the location, cytochemistry, electrophysiology, and connectivity of different cell types¹⁶. The NeuroMorpho database¹⁷ (www.neuromorpho.org) is an extensive collection of published neuron morphologies, with useful quantitative information about the features of individual neurons. In addition, several efforts have been made to estimate brain cell numbers in histological material using computational image analysis^18,19,20. To our knowledge, no systematic effort has been made to collect and normalise information from several sources about the number and distribution of cells, synapses and spines in different brain regions in a database.

We here present a database of publicly available quantitative measurements of cells, synapses and dendritic spines of the frequently investigated murine basal ganglia. These are regions of high interest for basic experimental studies of voluntary movements, procedural learning and neurodegenerative diseases such as Parkinson’s and Huntington’s disease^21,22. Quantitative information about the cellular architecture of these regions in normal animals is needed for computational modelling efforts^1,2, and represents an important benchmark for interpretation of results from experimental studies in different animal disease models^23,24. The database holds > 1200 quantitative estimates derived from the literature and public repositories, normalised to standard units of measurement and mapped to common anatomical reference atlases. To our knowledge, this is the most extensive collection of available information on cellular basal ganglia parameters to date. The database is publicly shared via EBRAINS²⁵, together with a workflow for updating it with results from future analyses. We believe this can be a valuable benchmark resource for anatomical studies or efforts to model the murine basal ganglia.

Methods

Overview of study design

We created a database of data derived from the literature and public repositories. We here use the term derived data to describe the specific analytic results of a study, e.g. the number of cells in a given region, as opposed to the raw data that were used to generate this number. We limited our scope to quantitative information about number, distribution and morphology of cells and subcellular elements of the normal, rodent basal ganglia. We here consider the concept of the basal ganglia to include dorsal and ventral parts of the striatum (caudoputamen and nucleus accumbens) and pallidum (external globus pallidus, entopeduncular nucleus and ventral pallidum), as well as the subthalamic nucleus and substantia nigra^26,27. We designed a database using Microsoft Access, and set up a search string to query the literature. Specific inclusion criteria were used to narrow the number of papers to include. For each paper, the methods and results sections were carefully read and annotated, and relevant metadata elements were integrated in the database. We also searched for repositories with relevant data. Wherever necessary and possible, we standardised terms and units used to describe data, and novel workflows were developed to map data to common schemes for regions and cell types of interest. Lastly, in order to make our database accessible and usable to the community, we shared it as a dataset through the EBRAINS Knowledge Graph (RRID:SCR_017612). Each main part of the study design (database design, search strategy, data / metadata standardisation, and data sharing) will be elaborated in the following.

Database design

We organised data derived from the literature in a Microsoft Access database with 45 tables, the most important of which are summarised in Fig. 1. All fields in all tables of the database are listed and explained in Supplementary File 1.

Search strategy

Literature search strategy

PubMed was queried via Ovid Medline for papers from 1946-present. This search string included key words related to (1) species of interest, i.e. rat and mouse; (2) brain regions of interest, i.e. basal ganglia regions; (3) methods of interest, i.e. typical methods employed in anatomical and morphological studies; and (4) parameters of interest, i.e. numbers, densities or distributions of cells, synapses, axonal boutons, or dendritic spines. The papers needed to contain one key word from each of the categories in either the title or the abstract (the parts were combined with AND operators).
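The four-category logic described above (any keyword within a category, all categories required) can be sketched in a few lines of Python. This is only an illustration: the keyword lists are shortened, hypothetical stand-ins for the strings in Supplementary File 2, the query uses generic boolean notation without attempting to reproduce Ovid syntax, and `build_query` and `matches` are names introduced here.

```python
# Hypothetical, shortened keyword lists; the strings actually used
# are given in Supplementary File 2.
CATEGORIES = {
    "species": ["rat", "mouse"],
    "region": ["basal ganglia", "striatum", "substantia nigra"],
    "method": ["immunohistochemistry", "stereolog*"],
    "parameter": ["number", "density", "distribution"],
}


def build_query(categories):
    """Join keywords within a category with OR and categories with AND."""
    groups = []
    for keywords in categories.values():
        groups.append("(" + " OR ".join(keywords) + ")")
    return " AND ".join(groups)


def matches(text, categories):
    """Return True if the text contains a keyword from every category.

    Uses crude substring matching; a trailing '*' is treated as truncation.
    """
    lowered = text.lower()
    return all(
        any(keyword.rstrip("*").lower() in lowered for keyword in keywords)
        for keywords in categories.values()
    )


if __name__ == "__main__":
    print(build_query(CATEGORIES))
    title = "Stereological estimate of neuron number in the rat striatum"
    print(matches(title, CATEGORIES))
```

Plain substring matching is used only to keep the sketch short; a bibliographic search engine applies its own field and truncation rules.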

Three searches were performed, and the search strings used are included in Supplementary File 2. In the first search, we included all number, density and distribution data from the basal ganglia of adult, naïve rats or mice. However, the derived data were quite heterogeneous and few numbers could be compared. In the second and third searches, we opted to include more data representing similar parameters. To this end, we narrowed the scope to data from the substantia nigra (second search) and caudoputamen (third search), but broadened the inclusion criteria to include all control animals of all (postnatal) ages.

The first search was performed on January 3^rd, 2018, and a total of 2246 papers were returned. All of these papers were manually screened, and included or excluded based on a set of predefined criteria. The data had to fit with the overall criteria specified in the search string (e.g. neuroscience related, murine data, and original article format) and be available in English. Furthermore, we only included papers with data related to adult, naïve animals, that is, animals that had not been subject to any experimental or control manipulations, behavioural tests or training, or any other experimental intervention. The only exception to this criterion was made where pooled data from two control groups were given (e.g. sham operation and naïve control) and individual measurements had been statistically compared and proven similar. Animals with genetic manipulations, e.g. fluorescent expression in certain cells, were also excluded. Non-naïve animals were excluded to reduce the number of included publications in this first iteration of the search. However, in later and more specific queries, studies of non-naïve animals were included in order to avoid missing clearly relevant data (see below). Papers had to present quantitative data of interest in text or tabular format, excluding papers presenting data in graphs only. Lastly, it had to be possible to normalise the data to a common unit of measurement. This generally meant that data had to be presented as numbers representing either a region of interest or a standard unit (square or cubic nano-, micro-, or millimetre). In contrast, we excluded data that were presented as relative measures such as percentage of control or numbers per section. After manual screening, we included 65 publications with data of interest from the normal adult rat or mouse basal ganglia. An additional eight papers were included by tracking the references of particularly relevant papers, so that 72 papers were ultimately included in the first search.

The search string employed was a compromise between sensitivity and specificity, with the use of keywords related to tissue preparation method (e.g. immunohistochemistry, immunofluorescence, histology) reducing the number of search entries considerably. Since a substantial proportion of papers found with the initial search were excluded during manual screening, these keywords were included to narrow the number of papers returned. Nevertheless, to avoid missing clearly relevant papers, we performed an additional, targeted search for papers specifically reporting stereological counting in the next iterations of the search. Thus, two separate search strings were used for the second search: 1) the same string as in the first search, but with substantia nigra keywords only (performed on August 14^th, 2018); and 2) an additional search string including only (stereolog*) and the keywords related to substantia nigra (performed on August 22^nd, 2018). In the third search, we repeated both parts of the second search, but with the striatum (caudoputamen) as the region of interest. The two parts of the last search were performed 1) on November 30^th, 2018 and 2) on January 17^th, 2019. All the papers were screened manually, according to essentially the same criteria as for the first search, except that we included all control animals of all (postnatal) ages, including those genetically altered to express fluorescence in certain cells. We also included studies using animals that had been treated according to standard neuroanatomical protocols, e.g. axonal tract tracing experiments. The second search returned 1168 papers of which 84 were included, while the third search yielded 1806 papers of which 91 were included. Because some of the papers appeared in more than one of the search rounds, the total number of publications ultimately included in the database was 239. The search strategy, inclusion criteria and results for each iteration of the search are summarised in Fig. 2.
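The bookkeeping behind the final count can be illustrated with a set union over the three rounds. The sketch below is not the procedure used for the database; the function names are introduced here for illustration. The only figures taken from the text are the per-round inclusion counts (72, 84 and 91) and the reported total of 239.

```python
def merge_rounds(*rounds):
    """Return the set of unique paper identifiers across search rounds."""
    unique = set()
    for papers in rounds:
        unique.update(papers)
    return unique


def repeated_inclusions(*rounds):
    """Count inclusions that duplicate a paper already found in another round."""
    total = sum(len(set(papers)) for papers in rounds)
    return total - len(merge_rounds(*rounds))


# Counts reported in the text for the three search rounds.
included_per_round = {"first": 72, "second": 84, "third": 91}
reported_total = 239

# 72 + 84 + 91 = 247, so 247 - 239 = 8 inclusions were repeats.
implied_repeats = sum(included_per_round.values()) - reported_total

if __name__ == "__main__":
    print(implied_repeats)
```

A paper found in all three rounds counts as two repeated inclusions in this scheme, so the figure of 8 is an upper bound on the number of distinct papers that appeared more than once.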

To limit the selection of papers and thus the scope of the survey, we excluded papers presenting data in graphs only. However, for studies presenting some material in text and some in graphs, we digitised graphs to extract all relevant data from the paper. We used a web-based plot digitiser (https://apps.automeris.io/wpd/) to import graph images, added reference points, and extracted the relevant means and error measurements. As this approach was quite time consuming, we used it for selected papers in the first and second search only. We included a field in the database to specify whether an estimate was extracted from text or from a graph.

Repository search strategy

Several data and metadata repositories exist with various types of neuroscience information, and the Neuroscience Information Framework (NIF, www.neuinfo.org; RRID:SCR_002894)²⁸ catalogues these resources. We therefore searched the NIF for portals or databases related to rat or mouse, which returned 281 public repositories with information from rats or mice. From these, we selected nine resources that appeared to be relevant to the current project. To be included, a repository had to include relevant derived data in addition to appropriate metadata. Two repositories fulfilled these criteria: NeuroMorpho (www.neuromorpho.org; RRID:SCR_002145; neuronal morphology information²⁹, e.g. soma size, number of bifurcations; see frequently asked questions at www.neuromorpho.org for a full list) and Mouse Brain Architecture (www.brainarchitecture.org; RRID:SCR_004683; cell densities). Since the NeuroMorpho database is organised in several archives, each containing data from one laboratory, each such archive was treated as a separate source in our database, with the prefix “NMO” in the source name indicating that the data came from NeuroMorpho. A table of the evaluated repositories is included in Supplementary File 3. Lastly, we included information extracted from an Allen mouse brain in situ hybridisation experiment, available as a derived dataset via the EBRAINS Knowledge Graph³⁰.

Data and metadata standardisation

To give a unified view of data, we mapped them to key features in common schemes. Two particularly important such features in neuroscience are anatomical region and cell type of interest. Indeed, other databases have generally been structured around regions¹⁴ or cell types^16,31, or both¹⁵. In the following, we describe the workflows established in this project to map data to common terms for regions and cell types of interest, as well as how all data were standardised to common units of measurement.

Mapping data to semantically defined anatomical regions of interest

Reference atlases are commonly used in neuroscience in order to relate data to anatomical locations in the brain; however, several alternative reference atlases are available for the rat^32,33,34,35 and mouse brain^36,37, and these vary with respect to how they name and define regions. Data related to a specific region in one atlas are therefore not necessarily easily compared to data related to a similarly named region in another. Even data related to the same region in different versions of the same atlas may not be directly comparable, since some borders may have been significantly revised between atlas versions.

In our database, all data were related to the three-dimensional (3D) standard atlas templates used in EBRAINS – the Waxholm space atlas of the rat brain (WHS, version 1.01^34,38; RRID:SCR_017124) and the Allen Mouse Brain Common Coordinate Framework (CCF, version 3³⁷). Using the QuickNII software for registration of 2D section images to 3D atlases³⁹ (RRID:SCR_016854), we mapped plates from several of the most common atlases^32,33,40,41,42,43,44,45,46,47,48,49 to WHS or CCF (Fig. 3a). The location metadata, specifying the parameters used to spatially register the different atlases to the WHS or CCF, are available as datasets from the EBRAINS Knowledge Graph^50,51,52,53,54,55,56,57,58,59,60,61,62. We used the spatially co-registered atlas diagrams to inspect and define the spatial relationships between our regions of interest (basal ganglia regions) in the WHS or CCF and regions defined in other atlases. The types of relationship were categorised as identical, part of, includes, overlapping, or non-overlapping (Fig. 3c). The latter was used only in cases where regions could be expected to be related (e.g. by sharing the same name), but were found not to be. Descriptive comments about the relationships were added. In addition, to semi-quantitatively describe the degree of comparability of two regions, we applied a region comparability score, ranging from zero (non-overlapping structures) to 10 (completely identical structures). The criteria underlying this scoring system and categorisation of relationships are provided in Supplementary File 4. The accumulated information about the spatial relationships defined between atlas regions is shared through the EBRAINS Knowledge Graph as separate datasets^61,62.
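One way to picture a single entry in such a relationship table is as a small record with a fixed set of relationship categories and a bounded score. The sketch below is an assumption about how such a record could be represented in Python, not the schema of the Access database; the class and field names are hypothetical, and only the five categories and the 0 to 10 score range come from the text.

```python
from dataclasses import dataclass
from enum import Enum


class Relationship(Enum):
    """The five relationship categories named in the text."""
    IDENTICAL = "identical"
    PART_OF = "part of"
    INCLUDES = "includes"
    OVERLAPPING = "overlapping"
    NON_OVERLAPPING = "non-overlapping"


@dataclass(frozen=True)
class RegionRelation:
    """Hypothetical record linking a region in one atlas to a WHS or CCF region."""
    source_atlas: str
    source_region: str
    target_atlas: str
    target_region: str
    relationship: Relationship
    comparability: int  # 0 (non-overlapping) to 10 (completely identical)
    comment: str = ""

    def __post_init__(self):
        # Only the score range is enforced; the scoring criteria themselves
        # are defined in Supplementary File 4 and are not reproduced here.
        if not 0 <= self.comparability <= 10:
            raise ValueError("comparability score must be between 0 and 10")


if __name__ == "__main__":
    relation = RegionRelation(
        source_atlas="Example atlas",      # placeholder, not a real entry
        source_region="Example region",    # placeholder, not a real entry
        target_atlas="WHS",
        target_region="Example WHS region",
        relationship=Relationship.PART_OF,
        comparability=7,
        comment="Illustrative values only.",
    )
    print(relation.relationship.value, relation.comparability)
```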

The locations of data presented in papers were not always defined with use of terms from a specific reference atlas. We considered data to be related to a region in an atlas only in cases where it could be clearly inferred which region (or set of regions) in the cited atlas authors referred to. This generally involved use of a specific reference to an atlas and a region name existing in that particular atlas, with a few exceptions where authors referred to a region at a lower granularity than given in the atlas. For example, although the exact term ‘substantia nigra’ does not appear in most atlases (the region is usually subdivided, at least into a reticular and a compact part), it is reasonable to use this term to refer to the various substructures together. The crucial point is to define the inclusion or exclusion of subdivisions as they are named in the particular atlas. In cases where this was not clearly defined, it was reflected in our translation by storing the coverage and specificity as “unknown”. For data not defined in terms of a reference atlas, we considered the region to be defined in a ‘custom’ parcellation scheme. In these cases, knowledge about relations to our atlases could only be inferred from the documentation provided by the authors. To translate such custom terms, we therefore carefully considered the documentation and assigned an atlas term based on our knowledge about the basal ganglia regions and terms typically used to describe them. In general, more well-documented regions of interest allowed for more accurate translation with higher confidence.

For each mention of a region of interest, we included metadata describing how it was documented and stored this in the “Region records” table. We furthermore calculated a score to capture the degree to which each region of interest was documented (referred to as a “documentation score”). Different types of documentation were weighted differently, and a score between 1 and 10 was calculated. Information about the documentation factors and their weight in the documentation score can be found under the “Region records” table section in Supplementary File 1.

Mapping data to cell types of interest

All of the objects for which we collected quantitative information in this study belong to a cell: subcellular objects originate from a cell of interest, and reconstructed and counted cells have an identity. Cell type classification is not trivial^63,64, as there are many complementary approaches to the task (e.g. cytochemical, electrophysiological, morphological), and thus no standard ontologies of cell types exist. To map data to cell types, we captured information about the various phenotypes that a cell might have. This approach was inspired by ongoing work from the INCF special interest group on Neuroinformatics for cell types (https://www.incf.org/sig/neuroinformatics-cell-types). We included seven broad phenotype categories: brain region (e.g. striatum, substantia nigra), expression (e.g. parvalbumin, tyrosine hydroxylase), electrophysiology (e.g. fast spiking), morphology (e.g. spiny neuron, giant neuron), connectivity (e.g. direct pathway neuron), local connectivity (e.g. perisomatic neuron), and circuit function (e.g. inhibitory neuron). For every derived data set, information was stored about the phenotype recorded for the particular cell. One or more phenotypes might be used in a particular study to classify the neuron type(s), and based on the phenotype(s) identified, a putative cell type was assigned. Some data spanned several different cell types, for example when numbers of all objects of interest were counted regardless of type (e.g. counting all dendritic spines or cell bodies in a certain area). In these cases, data are relevant for all cell types, and have simply been linked to the type “Cell”, “Neuron”, or “Glia”, depending on the phenotypes identified.

Standardisation to common units of measurement

Prior to data entry, we converted all density units to square or cubic milli- or micrometres. Standard errors were converted to standard deviations by multiplying by the square root of the sample size. Information about calculations performed to standardise a measurement was entered in the database. For data given per square milli- or micrometres, we calculated the volumetric density by dividing numbers by section thickness⁶⁵. These were entered in the database in addition to original 2D counts. Numbers obtained by direct counts without any corrections were corrected using Abercrombie’s formula^4,66 prior to calculating volumetric density. These calculations are elaborated in Supplementary File 5. We did not standardise total number estimates before entering these into the database, but rather indicated whether counts were uni- or bilateral. When it was not clear whether estimates were uni- or bilateral, we contacted the corresponding author of the paper to clarify. If no clarification was obtained, this field was indicated as “Unknown”. In some cases, assumptions, interpretations and slight modifications were made to give data similar formats, and we followed specific rules to ensure consistency throughout the data entry process. Details about this can be found in Supplementary File 6.
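The conversions described above follow standard relations, which can be written out as a short Python sketch. The function names and the choice of micrometres for thickness are assumptions made here; the exact procedure applied to each estimate is the one documented in Supplementary File 5. The relations used are SD = SE × √n, the Abercrombie correction N = n × T / (T + H) (with section thickness T and mean object height H), and volumetric density = areal density / section thickness.

```python
import math


def se_to_sd(standard_error, n):
    """Convert a standard error to a standard deviation: SD = SE * sqrt(n)."""
    if n < 1:
        raise ValueError("sample size must be at least 1")
    return standard_error * math.sqrt(n)


def abercrombie(raw_count, section_thickness_um, object_height_um):
    """Abercrombie correction of a direct count: N = n * T / (T + H)."""
    return raw_count * section_thickness_um / (
        section_thickness_um + object_height_um
    )


def volumetric_density_per_mm3(count_per_mm2, section_thickness_um):
    """Convert an areal density (per mm^2) to a volumetric one (per mm^3)."""
    thickness_mm = section_thickness_um / 1000.0
    return count_per_mm2 / thickness_mm


if __name__ == "__main__":
    # Illustrative numbers only, not values from the database.
    print(se_to_sd(2.0, 25))
    corrected = abercrombie(100, section_thickness_um=40, object_height_um=10)
    print(corrected)
    print(volumetric_density_per_mm3(corrected, section_thickness_um=40))
```

Keeping each step as a separate function mirrors the text: the Abercrombie correction is applied to uncorrected direct counts first, and the thickness division afterwards.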

Sharing the database through the EBRAINS Knowledge Graph

We exported .csv files from all the tables in the database. In addition, we made and exported query tables containing selected metadata elements from multiple tables for quantitative estimates, distributions, and cell morphologies. We also created a version of the database specifically designed to input data, and an Excel sheet configured for converting data from any density unit to volumetric densities or bilateral counts to unilateral ones. All of these elements (.csv files, empty database version, and Excel conversion sheet) are shared under a single dataset through EBRAINS²⁵.

Data Records

The database created here, hereafter referred to as the “Murine basal ganglia database”, is shared via EBRAINS²⁵ (https://ebrains.eu). It contains information from 375 experiments reported in 245 sources; from these, we extracted 1228 quantitative estimates (501 total number estimates and 727 density estimates), 50 neuronal morphologies, and 18 distribution records of basal ganglia cellular parameters. The content of the Murine basal ganglia database is summarised in Fig. 4.

The shared dataset includes .csv files for all tables in the Murine basal ganglia database as well as the original .accdb file; these files contain the full set of metadata collected during the creation of the database. In addition, we share .csv and .xlsx files for data extracted from the database. These files contain all the numerical, distribution and morphology data available from the Murine basal ganglia database, with selected metadata that we considered relevant for most users. Furthermore, we have established a workflow to allow other researchers to expand upon the knowledge contained in the current version (detailed in usage notes below). To support the use of this workflow, we share an empty version of the Murine basal ganglia database (.accdb) with a spreadsheet (.xlsx), through which researchers can collect and / or contribute more information.

Technical Validation

In the following, we first consider how the search strings and selection criteria have affected the results of the PubMed search and content of the database. We then evaluate the validity of the graph data extraction procedure. Lastly, we assess and discuss the variability of a selection of the data contained in our database by summarising the information available about the number of tyrosine hydroxylase (TH) positive neurons in the substantia nigra, and the total number of neurons in the caudoputamen.

Selected papers and repositories

The most common reasons for excluding papers were that they did not contain data of interest (54% of papers excluded) or that data were from experimentally manipulated animals without inclusion of a normal control group (15% of papers excluded). Among the studies in which relevant quantitative data had been obtained, 40–45% were excluded in each iteration of the search (11% of all papers) because data were not possible to normalise, due to lack of metadata necessary for comparing the data across studies or re-using them in a different context. Examples include papers where numbers were expressed per section or as percentage of control, or in rare cases, without specification of the unit of measurement. A further 8% of all search results were excluded because data were presented in graphs only; in each search, this concerned 48–59% of the papers of interest with data that otherwise could have been normalised to a common unit of measurement. In a limited selection of papers presenting some data in text and other data in graphs, we converted graph data to numeric data (see Methods) to increase the amount of data extracted, but as this was time-consuming it was not feasible to perform on a larger collection of data. In the end, 6% of papers were included. The percentages described here are based on data from the second and third search; the proportions of papers excluded based on the various criteria were relatively similar for the first search, except that the included percentage (3%) was lower since only completely untreated adult animals were included.

Searching and screening papers manually is a time-consuming task, and in our literature search it led to the exclusion of more than 90% of papers. We observe that other literature mining projects have presented similar exclusion rates⁶⁷. This illustrates that designing search strings that are both sensitive and specific is a significant challenge.

Validation of data extracted from graphs

Papers from the first iteration of the search that presented the same numbers both in graphs and text were used to validate the graph extraction approach. For these cases, we extracted the numbers and error measurements using the graph plot digitiser (see Methods), and compared the resulting numbers with those presented in the text. This showed a negligible discrepancy between means extracted from text and graph (0.08–1% difference), and relatively low differences between extracted standard errors (5–12% difference).
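The comparison above amounts to a relative difference between each digitised value and the corresponding value in the text. The text does not spell out the formula, so the sketch below assumes the difference is expressed relative to the text value; `percent_difference` is a name introduced here, and the numbers in the usage lines are illustrative only.

```python
def percent_difference(text_value, graph_value):
    """Absolute difference of a digitised value relative to the text value, in %."""
    if text_value == 0:
        raise ValueError("text value must be non-zero")
    return abs(graph_value - text_value) / abs(text_value) * 100.0


if __name__ == "__main__":
    # Illustrative values, not taken from any paper in the database.
    print(percent_difference(200.0, 201.0))  # mean read from text vs. graph
    print(percent_difference(10.0, 11.0))    # standard error from text vs. graph
```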

Variability in a selection of quantitative estimates from the murine basal ganglia

We here present summary data from some of the parameters available in the Murine basal ganglia database. To assess whether variance could be considerably reduced by selecting data obtained by certain methods, we sequentially filtered the data according to methodological metadata (see sections below for details).

Tyrosine hydroxylase positive neurons in the substantia nigra

The principal neurons of the substantia nigra are dopaminergic neurons, which can be visualised by using antibodies against the enzyme tyrosine hydroxylase. TH neurons contribute to motor behaviour by their projections to the striatum, and are frequently investigated in murine models for mechanisms of Parkinson’s disease²⁶.

In our database, unilateral estimates of the total number of TH neurons in the substantia nigra, pars compacta of the adult (P56 and older) C57BL/6 mouse range from 1090 to 16145 (mean = 6065, SD = 3456, n = 30 estimates). The same range and very similar variation are seen when selecting only stereological studies (range = 1090 to 16145, mean = 6495, SD = 3503, n = 26 estimates). Further filtering of stereological estimates, by excluding those that are anatomically non-specific or only partly cover the pars compacta, does not reduce variation either (range = 3360 to 16145, mean = 7706, SD = 3680, n = 14 estimates). Only two of the 30 total number estimates for the C57BL/6 mouse substantia nigra, pars compacta are connected to an antibody with a unique RRID; filtering results based on the exact primary antibody used is therefore not possible. For the adult (P60 and older) rat (all strains), the range of unilateral values in the database is 3260 to 11969 (mean = 7733, SD = 3252, n = 8 estimates). Box plots summarising these estimates and similar ones from the whole substantia nigra are given in Fig. 5a.
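The sequential filtering described above can be sketched as repeated selection on metadata fields followed by a summary of the remaining values. The records, field names and values below are made up for illustration and do not come from the database; the sample standard deviation (`statistics.stdev`) is used as an assumption, since the text does not state which estimator was applied.

```python
from statistics import mean, stdev


def filter_records(records, **criteria):
    """Keep records whose metadata fields equal all the given criteria."""
    return [
        record for record in records
        if all(record.get(key) == value for key, value in criteria.items())
    ]


def summarise(values):
    """Return n, range, mean and sample SD (needs at least two values)."""
    return {
        "n": len(values),
        "min": min(values),
        "max": max(values),
        "mean": mean(values),
        "sd": stdev(values),
    }


# Made-up records with hypothetical field names.
RECORDS = [
    {"method": "stereology", "coverage": "complete", "value": 100.0},
    {"method": "stereology", "coverage": "partial", "value": 200.0},
    {"method": "stereology", "coverage": "complete", "value": 300.0},
    {"method": "direct count", "coverage": "complete", "value": 400.0},
]

if __name__ == "__main__":
    stereological = filter_records(RECORDS, method="stereology")
    specific = filter_records(stereological, coverage="complete")
    for subset in (RECORDS, stereological, specific):
        print(summarise([record["value"] for record in subset]))
```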

Neuron numbers and densities in the caudoputamen

The caudate-putamen complex (hereafter referred to as the caudoputamen) is the largest part of the basal ganglia, receiving axonal projections from the cerebral cortex, and extending projections to several other parts of the basal ganglia circuitry⁶⁸. There are two main types of principal neurons in the caudoputamen, identifiable by the different types of dopamine receptors they possess²⁶. Because well-validated and replicated antibodies against these receptors are lacking⁶⁹, studies of the caudoputamen frequently assess total neuron numbers using histochemistry or neuronal markers such as NeuN antibodies.

Unilateral estimates in the database representing the total number of neurons in the caudoputamen of adult mice (all strains) range from 856649 to 1711615 (mean = 1107325, SD = 296707, n = 10 estimates). Note that six of these estimates come from the same study. Estimates of neuron density range from 32166 to 151112 per cubic millimetre (mean = 90407, SD = 42133, n = 23 estimates). The range of density estimates is the same, and variability is not reduced, when selecting stereological estimates only (mean = 88705, SD = 44009, n = 18 studies). In rats, only very few estimates of total numbers for the caudoputamen are available in the database. The estimated neuron density in the adult rat caudoputamen varies from 19129 to 64050 neurons per cubic millimetre (mean = 35529, SD = 15029, n = 13 estimates). The neuron density estimates for caudoputamen are summarised in box plots in Fig. 5b.

Possible reasons for observed variability

Our assessment of the variability of quantitative neuroanatomical data from the substantia nigra shows that for TH expressing neurons in the pars compacta of C57BL/6 mice, the reported numbers range from approximately 1000 to over 16,000 cells unilaterally. High variability is also seen in the caudoputamen data. Interestingly, the reported neuron density (per cubic millimetre) is on average ~twice as high in the mouse as in the rat. Although the numbers reported within each species vary considerably, the ratio between the mean densities corresponds well with estimated scaling rules between rat and mouse brains⁷⁰. For the mouse caudoputamen, estimates of total neuron numbers in the database range from 856649 to 1711615 in one hemisphere. Few studies include the range of values collected in addition to summary statistics, but it is clear that the variability between studies is much higher than that within studies. For example, in a study comparing neuron numbers in the caudoputamen across different mouse strains⁷¹, the difference between the bilateral average of the groups with the highest and lowest number was 324926. It is thus highly unlikely that the range we observe across studies, of almost one million cells unilaterally, can be attributed solely to biological variance. Instead, the reasons for the large variation in numbers reported from within a region are likely to be manifold. Due to the widespread lack of methodological metadata in papers, the size of groups containing estimates obtained by clearly defined and similar methods was too small to support formal statistical analysis of differences in variability. Adding more data to the Murine basal ganglia database, combined with improved reporting practices, could allow such analyses in the future.

Nevertheless, we believe that the present data collection, shared through the EBRAINS Knowledge Graph, can be useful for finding and comparing published data. The ability to filter the data based on metadata elements might also be useful to select appropriate data, depending on the need of the user. Combined with our defined workflows for contributing more information, we believe these results will make it easier to select, organise, compare and share quantitative information from the literature or from new analyses in the future. We describe these uses of the database in detail below.
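The comparisons made in this section rest on simple arithmetic on values quoted in the text. A minimal Python sketch of that arithmetic follows. It assumes, purely for illustration, that the bilateral between-strain difference can be halved to approximate a unilateral difference; that step is not taken in the text itself.

```python
# Values quoted in the text (neurons per cubic millimetre, or neurons).
MOUSE_DENSITY_MEAN = 90407
RAT_DENSITY_MEAN = 35529
MOUSE_TOTAL_MIN = 856649        # unilateral, lowest estimate
MOUSE_TOTAL_MAX = 1711615       # unilateral, highest estimate
STRAIN_DIFF_BILATERAL = 324926  # between-strain difference in one study

# Ratio of mean caudoputamen neuron density, mouse over rat.
density_ratio = MOUSE_DENSITY_MEAN / RAT_DENSITY_MEAN

# Spread of unilateral total-number estimates across studies.
across_study_range = MOUSE_TOTAL_MAX - MOUSE_TOTAL_MIN

# Assumption for this sketch only: half the bilateral difference
# approximates the unilateral between-strain difference.
strain_diff_unilateral = STRAIN_DIFF_BILATERAL / 2

# How many times larger the across-study range is than that difference.
range_vs_strain = across_study_range / strain_diff_unilateral

print(f"mouse/rat density ratio: {density_ratio:.2f}")
print(f"across-study range: {across_study_range}")
print(f"range relative to strain difference: {range_vs_strain:.1f}x")
```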

Usage Notes

Our database is shared through the EBRAINS Knowledge Graph as part of a dataset entitled “Database of quantitative cellular and subcellular morphological properties from rat and mouse basal ganglia”²⁵. It comprises three main parts (see Data records for details): 1) the Murine basal ganglia database (Database_v1.accdb); 2) spreadsheets with all the quantitative estimates, morphologies and distributions contained in the Murine basal ganglia database (files in .xlsx and .csv format) with selected metadata; and 3) an empty version of the Murine basal ganglia database (Input_database.accdb) and a spreadsheet (Input_sheet.xlsx) facilitating collection of new data. We here briefly explain how researchers with different interests may utilise different parts of this dataset. These descriptions are intended as example use cases, and the reader is referred to other sources⁷²,⁷³ for guides on the use of Microsoft Access and Excel (RRID:SCR_016137). Because maintaining and updating information is a seldom-addressed challenge for any database, we go on to describe a workflow through which other researchers can organise and share more data, using shared database and spreadsheet templates.

Using the Murine basal ganglia database and the data extracted from it

Exploring the reported numbers of tyrosine hydroxylase neurons in substantia nigra

A researcher wants to look up the reported numbers of TH neurons in the substantia nigra. Having downloaded the dataset titled “Database of quantitative cellular and subcellular morphological properties from rat and mouse basal ganglia”²⁵, (s)he opens the README file to get a quick overview of the contents. There, (s)he sees that the Data Extracts-folder contains queries that include all the numbers available in the database. (S)he finds that such extracts are likely to answer his/her questions, and navigates to the relevant folder. (S)he opens the .xlsx file called “Cell counts” and selects the sheet called “Total number estimates”. The first three columns show the cell types that have been quantified, the species, and the regions of interest. The researcher filters the records to “Tyrosine hydroxylase expressing” cells, “Mus musculus” and “Pars compacta” (to simultaneously filter multiple columns in Microsoft Excel, select all the columns to be filtered, and under the Data tab click “Filter”). This yields 70 records, each one with accompanying metadata elements related to the animals, counting method, and region of interest. To explore the data further, e.g. by extracting descriptive measurements, (s)he copies the filtered records to a new sheet (in Microsoft Excel, go to Find & Select, click “Go to special” and select “Visible cells only”).
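The same filter-then-summarise steps could also be carried out on the .csv extracts outside Excel. The following is a minimal Python sketch using only the standard library. The column headers (“Cell type”, “Species”, “Region”, “Total number”) and all rows in SAMPLE are made-up placeholders for illustration; they are not the actual headers or values of the data extracts, which should be checked in the downloaded files.

```python
import csv
import io
import statistics

# Made-up placeholder rows; headers and numbers are hypothetical and are
# NOT taken from the Murine basal ganglia database.
SAMPLE = """Cell type,Species,Region,Total number
Tyrosine hydroxylase expressing,Mus musculus,Pars compacta,1000
Tyrosine hydroxylase expressing,Mus musculus,Pars compacta,3000
Tyrosine hydroxylase expressing,Rattus norvegicus,Pars compacta,5000
Parvalbumin expressing,Mus musculus,Caudoputamen,7000
"""


def filter_records(rows, criteria):
    """Keep rows whose columns match every (column, value) pair exactly."""
    return [
        row for row in rows
        if all(row.get(column) == value for column, value in criteria.items())
    ]


rows = list(csv.DictReader(io.StringIO(SAMPLE)))
selected = filter_records(rows, {
    "Cell type": "Tyrosine hydroxylase expressing",
    "Species": "Mus musculus",
    "Region": "Pars compacta",
})

values = [float(row["Total number"]) for row in selected]
summary = {
    "n": len(values),
    "min": min(values),
    "max": max(values),
    "mean": statistics.mean(values),
    "sd": statistics.stdev(values),  # sample standard deviation
}
print(summary)
```

To run this against a real extract, the `io.StringIO(SAMPLE)` argument would be replaced by an opened .csv file, and the criteria keys adjusted to the headers found there.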

Finding studies using a specific primary antibody

A researcher has used immunohistochemistry to visualize parvalbumin positive neurons, and quantified labelled cells using stereological analysis. To verify the results (s)he is now interested in finding quantitative data from studies where the same antibody has been used. (S)he downloads the dataset titled “Database of quantitative cellular and subcellular morphological properties from rat and mouse basal ganglia”²⁵ from EBRAINS, and upon looking at the “Cell counts” data extracts finds that they do not contain metadata specifying the antibody used. (S)he therefore navigates to the Database-folder and opens the .accdb file and the text file called “Tables_description”. In the text file, (s)he finds that antibodies are stored in the lookup table “Reporters” with connections to the table “Sources” via other tables. (S)he navigates to the Create tab and clicks the Query Wizard. After selecting the Simple Query Wizard, (s)he selects the “Reporter name” and “Reporter unique ID” fields from the “Reporters” table and the “Source name” and “Source ID” fields from the “Sources” table. (S)he clicks “Finish” and is presented with a query including a list of antibodies, their unique RRIDs, and the name and DOI of the source in which they were used. (S)he clicks the “Reporter unique ID” column header and scrolls to see if the antibody of interest (RRID:AB_10000344) is among the listed IDs. It is, and (s)he filters the list to these records. This yields a list of four studies where the antibody has been used. The researcher can now look for results from these studies in the Cell counts data extract by filtering it to the relevant Source names, or look up the original papers by use of the DOIs.
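The query built with the wizard above is, in essence, a join between the “Reporters” and “Sources” tables followed by a filter on the RRID. The following minimal sketch illustrates that logic with Python's built-in sqlite3 module and an in-memory database, not with Microsoft Access. The schema is an assumption made for illustration: the column names are simplified, the link table `Source_reporters` is hypothetical (the text only states that the tables are connected “via other tables”), and all rows are placeholders apart from the RRID quoted above.

```python
import sqlite3

conn = sqlite3.connect(":memory:")
conn.executescript(
    """
    CREATE TABLE Sources (source_id TEXT PRIMARY KEY, source_name TEXT);
    CREATE TABLE Reporters (
        reporter_id INTEGER PRIMARY KEY,
        reporter_name TEXT,
        reporter_unique_id TEXT
    );
    -- Hypothetical link table standing in for the intermediate tables.
    CREATE TABLE Source_reporters (source_id TEXT, reporter_id INTEGER);

    -- Placeholder rows, not records from the actual database.
    INSERT INTO Sources VALUES ('doi:placeholder-1', 'Placeholder source A');
    INSERT INTO Sources VALUES ('doi:placeholder-2', 'Placeholder source B');
    INSERT INTO Reporters VALUES (1, 'Placeholder antibody 1', 'RRID:AB_10000344');
    INSERT INTO Reporters VALUES (2, 'Placeholder antibody 2', 'RRID:placeholder');
    INSERT INTO Source_reporters VALUES ('doi:placeholder-1', 1);
    INSERT INTO Source_reporters VALUES ('doi:placeholder-2', 2);
    """
)

# Join reporters to sources through the link table, then filter by RRID.
QUERY = """
SELECT r.reporter_name, r.reporter_unique_id, s.source_name, s.source_id
FROM Reporters AS r
JOIN Source_reporters AS link ON link.reporter_id = r.reporter_id
JOIN Sources AS s ON s.source_id = link.source_id
WHERE r.reporter_unique_id = ?
ORDER BY s.source_name
"""

matches = conn.execute(QUERY, ("RRID:AB_10000344",)).fetchall()
for row in matches:
    print(row)
```

In Access, the relationships defined in the database let the wizard work out the intermediate joins; the “Tables_description” file is the place to look up the actual table and field names.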

Overviewing the methodological parameters of a study

Upon identifying the studies using an antibody of interest, the researcher described above wants to get a quick overview of the methods used in these studies. In the “Tables_description” file, (s)he reads that the tables “Specimens” and “Specimen_treatments” contain information related to the treatment of tissue reported in included papers. (S)he therefore creates a new query, including the “Source name” from the “Sources” table, the “Specimen form” from the “Specimens” table and the fields “Solution”, “Purpose”, “Time”, “Time unit” and “Temperature” from the “Specimen treatments” table. Since (s)he is primarily interested in the methods related to parvalbumin stained material, (s)he also includes the “Cell type putative” field from the table “Derived data records”. (S)he clicks “Finish”, and filters the resulting query to his/her studies of interest by clicking the “Source name” column header and selecting the relevant Source names. (S)he also clicks the “Cell type putative” column header and uses the text filter to select only records that contain “Parvalbumin”. This yields 13 records summarizing the treatments used for each specimen.

Using the workflow for harvesting, organizing and updating neuroanatomical data

In order to compare numbers reported across studies, it is first necessary to systematically extract relevant data and metadata and to standardise these to common units and concepts. We here present a workflow enabling users to harvest and organise their quantitative neuroanatomical data from the literature or public databases. This workflow includes a template version of the Murine basal ganglia database (with forms supporting input of largely standardised metadata) available through the dataset hosted in the EBRAINS Knowledge Graph²⁵, a novel procedure for translating terms for regions of interest to common terminology, and a preliminary approach for mapping data to cell-types of interest. We include steps through which other researchers can enter new information to the database and share this with the broader community. The workflow is summarised in Fig. 6.

Translation of terms across neuroanatomical nomenclatures

A key part of the overall workflow is the translation of semantic terms existing in different reference atlases to terms used in the standard reference atlases used by EBRAINS (Supplementary Fig. 1). These include the Waxholm Space atlas of the Rat Brain³⁴,³⁵,⁷⁴ and the Allen Mouse Brain Common Coordinate Framework³⁷. Anatomical metadata found in sources essentially enters one of three routes. In the first route, terms that are consistent with the nomenclature for one of the EBRAINS standard atlases are directly entered in the database. In the second route, terms that are consistent with another atlas nomenclature are translated to the closest matching region term in the relevant EBRAINS atlas. The basis for making such a translation is given by the spatial relationships between regions delineated in the different atlases used, and regions in the EBRAINS standard atlases (see Methods for details); these relationships are available as datasets through the EBRAINS Knowledge Graph⁶¹,⁶². The third route is for terms that are not consistent with any standard nomenclature. These are treated as “custom” terms, and translated to the closest EBRAINS atlas term by using the documentation available from the source. This was the most commonly used route in the current project, used for ~62% of reported regions. Note that 4% of the sources used an atlas for which relationships to EBRAINS atlases were not available, and these also entered the custom region translation route.
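The three routes described above amount to a decision cascade, which the following minimal Python sketch illustrates. It is not the procedure as implemented for the database, where translation involved manual interpretation. All region terms, the atlas label “Other atlas”, and the lookup tables are hypothetical placeholders; in practice the second-route mapping would be derived from the atlas relationship datasets and the third-route mapping from the documentation in each source.

```python
# Hypothetical placeholder vocabularies for illustration only.
EBRAINS_TERMS = {"Caudoputamen", "Substantia nigra, compact part"}

# Route 2 lookup: (atlas, term) -> closest EBRAINS atlas term.
ATLAS_TO_EBRAINS = {
    ("Other atlas", "Caudate putamen"): "Caudoputamen",
}


def translate_region(term, atlas=None, custom_lookup=None):
    """Return (EBRAINS term, route) for a region term found in a source.

    Route 1: the term already follows an EBRAINS standard atlas.
    Route 2: the term follows another atlas with known relationships.
    Route 3: custom term, translated from the source's documentation.
    """
    if term in EBRAINS_TERMS:
        return term, "direct"
    if atlas is not None and (atlas, term) in ATLAS_TO_EBRAINS:
        return ATLAS_TO_EBRAINS[(atlas, term)], "atlas translation"
    # Terms from atlases without known relationships also end up here.
    if custom_lookup and term in custom_lookup:
        return custom_lookup[term], "custom"
    raise LookupError(f"No translation available for {term!r}")


print(translate_region("Caudoputamen"))
print(translate_region("Caudate putamen", atlas="Other atlas"))
print(translate_region("dorsal striatum",
                       custom_lookup={"dorsal striatum": "Caudoputamen"}))
```

Returning the route alongside the translated term keeps a record of how each region name was obtained, which is one way to preserve the distinction between the three routes.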

Updating the Murine basal ganglia database

A database is only up-to-date as long as the data are maintained and expanded with new information. In addition to sharing the content aggregated and organised through this project, we therefore outline how researchers could contribute to the Murine basal ganglia database in the future. Researchers might want to add more data from the literature (green arrows, Fig. 6) or from their own experiments (purple arrows, Fig. 6). The first step for anyone wishing to add more data from the published literature is to identify potential new sources. This could be done through a literature search similar to that described in this paper (with date filters constraining the search to the period after the current search was performed, see Supplementary File 2 for a list of the search strings used here). Alternatively, advances in text mining might yield opportunities for more automatic search strategies⁷⁵. The next step in the workflow is interpretation: the source needs to be examined manually to identify the relevant data and metadata elements to be extracted. Once produced or identified through the literature, data should be extracted and metadata standardised. For this purpose, we share an Excel workbook with sheets where data can be entered in any format. Upon insertion of the number and unit, calculated fields standardise data to represent number per square or cubic micro- or millimetre. For cell counts, volumetric densities are also estimated from 2D counts given that section thickness is provided, according to the calculations described in Supplementary File 5. The Excel workbook is also tailored for input of basic metadata as required by EBRAINS (https://ebrains.eu/). It may be used in a relatively simple route to collecting data and contributing to the database (long green and purple lines in Fig. 6). Alternatively, it may be used as a means to organise and standardise data before entering it with extended metadata in the database. For this purpose, we share an empty version of the Murine basal ganglia database (an “input portal”) specifically designed to add new data with the full extent of metadata collected for the current project. The Excel sheet and Access database tailored for input are shared as .xlsx and .accdb files, respectively, and can be downloaded together with the Murine basal ganglia database through the dataset hosted at EBRAINS Knowledge Graph²⁵.

In our experience, the interpretation, extraction, standardisation and integration steps might require from half an hour to several hours per publication; generally, less time is required to integrate data that is provided in tabular format and using standard units of measurement (square or cubic micro- or millimetres), with a clearly described methodology. The files shared here to facilitate collection of new data may be populated and stored locally by the user, or shared with the community. The last step of the workflow outlined here (Fig. 6) is thus sharing the data. This could be done through any data sharing platform, e.g. Zenodo (www.zenodo.org) or Figshare (www.figshare.com). The EBRAINS curation service (curation-support@ebrains.eu) and Knowledge Graph offer the advantage of being tailored to neuroscience data, and would allow for new data collections to be linked to the version of the Murine basal ganglia database presented here²⁵.
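The unit standardisation performed by the calculated fields in the Excel workbook can be illustrated with a short Python sketch. The conversions between micrometre- and millimetre-based densities follow directly from 1 mm = 1000 µm. The 2D-to-3D step shown here is a naive assumption for illustration (count divided by counting area times section thickness, with no correction factors); the calculations actually used are those described in Supplementary File 5 and may differ. All function names are hypothetical.

```python
UM_PER_MM = 1000
UM2_PER_MM2 = UM_PER_MM ** 2  # 1e6 square micrometres per square millimetre
UM3_PER_MM3 = UM_PER_MM ** 3  # 1e9 cubic micrometres per cubic millimetre


def per_mm2_from_per_um2(density_per_um2):
    """Convert an areal density from per square µm to per square mm."""
    return density_per_um2 * UM2_PER_MM2


def per_mm3_from_per_um3(density_per_um3):
    """Convert a volumetric density from per cubic µm to per cubic mm."""
    return density_per_um3 * UM3_PER_MM3


def naive_volumetric_density(count, area_mm2, section_thickness_um):
    """Estimate cells per cubic mm from a 2D count.

    Naive assumption: every counted cell lies within a slab of volume
    area x section thickness; no correction for cells split between
    sections is applied.
    """
    thickness_mm = section_thickness_um / UM_PER_MM
    return count / (area_mm2 * thickness_mm)


# Example: 100 cells counted in 1 square mm of a 50 µm thick section.
print(naive_volumetric_density(100, 1.0, 50))
```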

Code availability

The QuickNII (RRID:SCR_016854) tool was used for spatial co-registration of atlases. Microsoft Access 2016 was used to create the database.

References

1. Egger, R., Dercksen, V., Udvary, D., Hege, H.-C. & Oberlaender, M. Generation of dense statistical connectomes from sparse morphological data. Front. Neuroanat. 8, 1–18 (2014).
2. Markram, H. et al. Reconstruction and simulation of neocortical microcircuitry. Cell 163, 456–492 (2015).
3. Bezaire, M. J. & Soltesz, I. Quantitative assessment of CA1 local circuits: knowledge base for interneuron-pyramidal cell connectivity. Hippocampus 23, 751–785 (2013).
4. Abercrombie, M. Estimation of nuclear population from microtome sections. Anat. Rec. 94, 239–247 (1946).
5. Schmitz, C. & Hof, P. Design-based stereology in neuroscience. Neuroscience 130, 813–831 (2005).
6. Brændgaard, H. & Gundersen, H. J. G. The impact of recent stereological advances on quantitative studies of the nervous system. J. Neurosci. Methods 18, 39–78 (1986).
7. Bjaalie, J., Diggle, P., Nikundiwe, A., Karagulle, T. & Brodal, P. Spatial segregation between populations of ponto-cerebellar neurons: Statistical analysis of multivariate spatial interactions. Anat. Rec. 231, 510–523 (1991).
8. Prodanov, D., Nagelkerke, N. & Marani, E. Spatial clustering analysis in neuroanatomy: Applications of different approaches to motor nerve fiber distribution. J. Neurosci. Methods 160, 93–108 (2007).
9. West, M. J., Østergaard, K., Andreassen, O. A. & Finsen, B. Estimation of the number of somatostatin neurons in the striatum: An in situ hybridization study using the optical fractionator method. J. Comp. Neurol. 370, 11–22 (1996).
10. Oorschot, D. Total number of neurons in the neostriatal, pallidal, subthalamic, and substantia nigral nuclei of the rat basal ganglia: A stereological study using the cavalieri and optical disector methods. J. Comp. Neurol. 599, 580–599 (1996).
11. Yu, Z. et al. Nitrated α-synuclein induces the loss of dopaminergic neurons in the substantia nigra of rats. PLoS One 5, (2010).
12. Singh, A. et al. Long term exposure to cypermethrin induces nigrostriatal dopaminergic neurodegeneration in adult rats: postnatal exposure enhances the susceptibility during adulthood. Neurobiol. Aging 33, 404–415 (2012).
13. Bornmann, L. & Mutz, R. Growth rates of modern science: A bibliometric analysis based on the number of publications and cited references. J. Assoc. Inf. Sci. Technol. 66, 2215–2222 (2015).
14. van Strien, N. M., Cappaert, N. L. & Witter, M. P. The anatomy of memory: an interactive overview of the parahippocampal-hippocampal network. Nat Rev Neurosci 10, 272–282 (2009).
15. Bota, M., Dong, H.-W. & Swanson, L. Brain Architecture Management System. Neuroinformatics 3, 015–048 (2005).
16. Wheeler, D. et al. Hippocampome.org: a knowledge base of neuron types in the rodent hippocampus. Elife 4, 1138–1142 (2015).
17. Ascoli, G., Donohue, D. & Halavi, M. NeuroMorpho.Org: A central resource for neuronal morphologies. J. Neurosci. 27, 9247–9251 (2007).
18. Erö, C., Gewaltig, M., Keller, D. & Markram, H. A cell atlas for the mouse brain. Front. Neuroinform. 12, 1–16 (2018).
19. Kim, Y. et al. Brain-wide maps reveal stereotyped cell-type-based cortical architecture and subcortical sexual dimorphism. Cell 171, 456–469.e22 (2017).
20. Murakami, T. et al. A three-dimensional single-cell-resolution whole-brain atlas using CUBIC-X expansion microscopy and tissue clearing. Nat. Neurosci. 21, 625 (2018).
21. Obeso, J. et al. The basal ganglia in Parkinson’s disease: Current concepts and unexplained observations. Ann. Neurol. 64, S30–S46 (2009).
22. Bunner, K. D. & Rebec, G. V. Corticostriatal dysfunction in Huntington’s disease: The basics. Front. Hum. Neurosci. 10, 317 (2016).
23. Vidyadhara, D. J., Yarreiphang, H., Raju, T. R. & Alladi, P. A. Admixing of MPTP-resistant and susceptible mice strains augments nigrostriatal neuronal correlates to resist MPTP-induced neurodegeneration. Mol. Neurobiol. 54, 6148–6162 (2017).
24. Baquet, Z., Williams, D., Brody, J. & Smeyne, R. A comparison of model-based (2D) and design-based (3D) stereological methods for estimating cell number in the substantia nigra pars compacta (SNpc) of the C57BL/6J mouse. Neuroscience 161, 1082–1090 (2009).
25. Bjerke, I., Puchades, M., Bjaalie, J. G. & Leergaard, T. Database of quantitative cellular and subcellular morphological properties from rat and mouse basal ganglia. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/DYXZ-76U (2019).
26. Gerfen, C. R. & Bolam, J. P. The neuroanatomical organization of the basal ganglia. Handb. Behav. Neurosci. 24, 3–32 (2016).
27. Olmos, J. & Heimer, L. The concepts of the ventral striatopallidal system and extended amygdala. Ann. N. Y. Acad. Sci. 877, 1–32 (1999).
28. Gupta, A. et al. Federated access to heterogeneous information resources in the neuroscience information framework (NIF). Neuroinformatics 6, 205–217 (2008).
29. Polavaram, S., Gillette, T., Parekh, R. & Ascoli, G. Statistical analysis and data mining of digital reconstructions of dendritic morphologies. Front. Neuroanat. 8, 1–16 (2014).
30. Yates, S. & Puchades, M. Extraction of parvalbumin positive cells from an Allen mouse brain in situ hybridisation experiment. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/6DYS-M3W (2019).
31. Martone, M. et al. The Cell Centered Database project: An update on building community resources for managing and sharing 3D imaging data. J. Struct. Biol. 161, 220–231 (2008).
32. Paxinos, G. & Watson, C. The rat brain in stereotaxic coordinates. (Elsevier Inc, 2013).
33. Swanson, L. Brain Maps III: Structure of the rat brain. (Elsevier, 2004).
34. Papp, E., Leergaard, T. B., Calabrese, E., Johnson, G. A. & Bjaalie, J. G. Waxholm Space atlas of the Sprague Dawley rat brain. Neuroimage 97, 374–386 (2014).
35. Kjonigsen, L., Lillehaug, S., Bjaalie, J., Witter, M. & Leergaard, T. Waxholm Space atlas of the rat brain hippocampal region: Three-dimensional delineations based on magnetic resonance and diffusion tensor imaging. Neuroimage 108, 441–449 (2015).
36. Paxinos, G. & Franklin, K. The mouse brain in stereotaxic coordinates. (Academic Press, 2012).
37. Oh, S. et al. A mesoscale connectome of the mouse brain. Nature 508, 207–214 (2014).
38. Papp, E. A., Leergaard, T. B., Calabrese, E., Allan Johnson, G. & Bjaalie, J. G. Addendum to “Waxholm Space atlas of the Sprague Dawley rat brain” [NeuroImage 97 (2014) 374-386]. Neuroimage 105, 561–562 (2015).
39. Puchades, M., Csucs, G., Ledergerber, D., Leergaard, T. & Bjaalie, J. Spatial registration of serial microscopic brain images to three-dimensional reference atlases with the QuickNII tool. PLoS One 14, (2019).
40. Franklin, K. & Paxinos, G. The mouse brain in stereotaxic coordinates. (Academic Press, 1996).
41. Franklin, K. & Paxinos, G. The mouse brain in stereotaxic coordinates. (Academic Press, 2007).
42. Paxinos, G. & Watson, C. The rat brain in stereotaxic coordinates. (Academic Press, 1998).
43. Paxinos, G. & Watson, C. The rat brain in stereotaxic coordinates. (Academic Press, 2007).
44. Paxinos, G. & Watson, C. The rat brain in stereotaxic coordinates. (Academic Press, 1986).
45. Paxinos, G. & Watson, C. The rat brain in stereotaxic coordinates. (Elsevier, 2005).
46. Swanson, L. Brain Maps: Structure of the rat brain. (Elsevier, 1992).
47. Swanson, L. Brain Maps II: Structure of the rat brain. (Elsevier, 1998).
48. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Paxinos & Watson’s “The Rat Brain in Stereotaxic Coordinates” (3rd edition) spatially registered to the Waxholm Space atlas of the rat brain. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/KNB2-GMN (2019).
49. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Paxinos & Watson’s “The Rat Brain in Stereotaxic Coordinates” (5th edition) spatially registered to the Waxholm Space atlas of the rat brain. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/KQ5K-S0D (2019).
50. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Swanson’s “Brain Maps: Structure of the Rat Brain” (3rd edition) spatially registered to the Waxholm Space atlas of the rat brain. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/ZFXB-23F (2019).
51. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Paxinos & Watson’s “The Rat Brain in Stereotaxic Coordinates” (4th edition) spatially registered to the Waxholm Space atlas of the rat brain. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/W3R1-R4A (2019).
52. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Paxinos & Watson’s “The Rat Brain in Stereotaxic Coordinates” (1st edition) spatially registered to the Waxholm Space atlas of the rat brain. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/YRKH-626 (2019).
53. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Swanson’s “Brain Maps: Structure of the Rat Brain” (4th edition) spatially registered to the Waxholm Space atlas of the rat brain. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/486N-966 (2019).
54. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Paxinos & Watson’s “The Rat Brain in Stereotaxic Coordinates” (7th edition) spatially registered to the Waxholm Space atlas of the rat brain. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/APWV-37H (2019).
55. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Swanson’s “Brain Maps: Structure of the Rat Brain” (2nd edition) spatially registered to the Waxholm Space atlas of the rat brain. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/EEQA-9RM (2019).
56. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Swanson’s “Brain Maps: Structure of the Rat Brain” (1st edition) spatially registered to the Waxholm Space atlas of the rat brain. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/ZB03-H5G (2019).
57. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Franklin & Paxinos’ “The Mouse Brain in Stereotaxic Coordinates” (3rd edition) spatially registered to the Allen mouse brain Common Coordinate Framework. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/1BT9-YYD (2019).
58. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Paxinos & Watson’s “The Rat Brain in Stereotaxic Coordinates” (6th edition) spatially registered to the Waxholm Space atlas of the rat brain. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/XQ8J-TNE (2019).
59. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Franklin & Paxinos’ “The Mouse Brain in Stereotaxic Coordinates” (4th edition) spatially registered to the Allen Mouse Common Coordinate Framework. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/WFCZ-FSN (2019).
60. Bjerke, I., Schlegel, U., Puchades, M., Bjaalie, J. & Leergaard, T. Paxinos & Franklin’s “The Mouse Brain in Stereotaxic Coordinates” (2nd edition) spatially registered to the Allen Mouse Common Coordinate Framework. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/BTKK-CRY (2019).
61. Bjerke, I., Puchades, M., Bjaalie, J. & Leergaard, T. Comparability of basal ganglia delineations across different mouse brain atlases. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/MWAS-3S6 (2019).
62. Bjerke, I., Puchades, M., Bjaalie, J. & Leergaard, T. Comparability of basal ganglia delineations across different rat brain atlases. Human Brain Project Neuroinformatics Platform https://doi.org/10.25493/D2M9-BSK (2019).
63. Hamilton, D. et al. Name-calling in the hippocampus (and beyond): coming to terms with neuron types and properties. Brain Informatics 4, 1–12 (2017).
64. Ascoli, G. et al. Petilla terminology: nomenclature of features of GABAergic interneurons of the cerebral cortex. Nat. Rev. Neurosci. 9, 557–568 (2008).
65. Keller, D., Erö, C. & Markram, H. Cell densities in the mouse brain: A systematic review. Front. Neuroanat. 12, 83 (2018).
66. Coggeshall, R. A consideration of neural counting methods. Trends Neurosci. 15, 9–13 (1992).
67. Sugar, J., Witter, M., van Strien, N. & Cappaert, N. The retrosplenial cortex: intrinsic connectivity and connections with the (para)hippocampal region in the rat. An interactive connectome. Front. Neuroinform. 5, 1–13 (2011).
68. Voorn, P., Vanderschuren, L., Groenewegen, H., Robbins, T. & Pennartz, C. Putting a spin on the dorsal-ventral divide of the striatum. Trends Neurosci. 27, 468–474 (2004).
69. Cullity, E., Madsen, H., Perry, C. & Kim, J. Postnatal developmental trajectory of dopamine receptor 1 and 2 expression in cortical and striatal brain regions. J. Comp. Neurol. 1–17 (2018).
70. Herculano-Houzel, S., Mota, B. & Lent, R. Cellular scaling rules for rodent brains. Proc. Natl. Acad. Sci. U. S. A. 103, 12138–12143 (2006).
71. Rosen, G. D. & Williams, R. W. Complex trait analysis of the mouse striatum: independent QTLs modulate volume and neuron number. BMC Neurosci. 2, 5 (2001).
72. Barrows, A., Young, M. & Stockman, J. Microsoft Access 2010 all-in-one for dummies. (Wiley Publishing, 2010).
73. Frye, C. Microsoft Excel 2019. (Microsoft Press, 2019).
74. Osen, K., Imad, P., Wennberg, A., Papp, E. & Leergaard, T. Waxholm Space atlas of the rat brain auditory system: Three-dimensional delineations based on structural and diffusion tensor magnetic resonance imaging. Neuroimage 199, 38–56 (2019).
75. French, L. et al. Text mining for neuroanatomy using WhiteText with an updated corpus and a new web application. Front. Neuroinform. 9, 13 (2015).

Acknowledgements

We thank Krister Andersson, Camilla Blixhavn, Heidi Kleven, Martin Øvsthus, Lyuba Zehl and Oliver Schmid for useful discussions, and Ulrike Schlegel for help with sharing datasets through the EBRAINS Knowledge Graph. This work was funded by the European Union’s Horizon 2020 Framework Programme for Research and Innovation Programme under the Specific Grant Agreement No. 785907 (Human Brain Project SGA2), Specific Grant Agreement No. 945539 (Human Brain Project SGA3), and The Research Council of Norway under Grant Agreement No. 269774 (INCF Norwegian Node). The funders had no role in study design, data collection and analysis, decision to publish, or preparation of the manuscript.

Author information

Authors and Affiliations

Department of Molecular Medicine, Institute of Basic Medical Sciences, University of Oslo, Oslo, Norway
Ingvild E. Bjerke, Maja A. Puchades, Jan G. Bjaalie & Trygve B. Leergaard

Contributions

I.E.B. contributed to conceiving of the study; collected, organised and analysed the data; and co-authored the manuscript. M.A.P. and J.G.B. contributed to conceiving and supervising the study. T.B.L. conceived and supervised the study, and co-authored the manuscript. All authors reviewed and approved of the manuscript.

Corresponding author

Correspondence to Trygve B. Leergaard.

Ethics declarations

Competing Interests

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary file 1

Supplementary file 2

Supplementary file 3

Supplementary file 4

Supplementary file 5

Supplementary file 6

Supplementary figure 1

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

The Creative Commons Public Domain Dedication waiver http://creativecommons.org/publicdomain/zero/1.0/ applies to the metadata files associated with this article.

About this article

Cite this article

Bjerke, I.E., Puchades, M.A., Bjaalie, J.G. et al. Database of literature derived cellular measurements from the murine basal ganglia. Sci Data 7, 211 (2020). https://doi.org/10.1038/s41597-020-0550-3

Received: 20 January 2020
Accepted: 04 June 2020
Published: 06 July 2020
DOI: https://doi.org/10.1038/s41597-020-0550-3
