Abstract
The differences in size and function between primate and rodent brains, and the association of disturbed excitatory/inhibitory balance with many neurodevelopmental disorders highlight the importance to study primate ganglionic eminences (GEs) development. Here we used single-cell RNA and ATAC sequencing to characterize the emergence of cell diversity in monkey and human GEs where most striatal and cortical interneurons are generated. We identified regional and temporal diversity among progenitor cells which give rise to a variety of interneurons. These cells are specified within the primate GEs by well conserved gene regulatory networks, similar to those identified in mice. However, we detected, in human, several novel regulatory pathways or factors involved in the specification and migration of interneurons. Importantly, comparison of progenitors between our human and published mouse GE datasets led to the discovery and confirmation of outer radial glial cells in GEs in human cortex. Our findings reveal both evolutionarily conservative and nonconservative regulatory networks in primate GEs, which may contribute to their larger brain sizes and more complex neural networks compared with mouse.
Similar content being viewed by others
Introduction
The complex and orderly neural network of human brain is mainly comprised of excitatory glutamate neurons and ɣ-aminobutyric acid-containing (GABAergic) inhibitory interneurons.1,2,3 Interneurons are highly diverse neurons with the characteristics of different morphologies, connectivity and biological functions.4,5,6,7,8 Disturbance of excitatory/inhibitory balance is associated with many neurodevelopmental and neuropsychiatric disorders, such as autism spectrum disorder and schizophrenia.9,10,11,12 During brain development, only a few interneurons originate and function at local regions, while almost all the rest of interneurons are derived from the ganglionic eminences (GEs), the transitory structures during the embryonic stage.13,14,15 The GEs, located between the thalamus and caudate nucleus, are the origins of interneurons. GEs can be divided into three subregions, including lateral, medial and caudal ganglionic eminences (LGE, MGE and CGE).14,15,16,17 Each subregion expresses specific genes and generates temporally and spatially distinct interneurons targeted to the whole brain. The MGE mainly generates the GABAergic cells migrating to the cerebral cortex, striatum and globus pallidus, which are characterized by the expression of NKX2-1 and LHX6.17,18,19,20,21 The LGE derived interneurons mainly comprise the precursors of the striatal medium spiny neurons (MSNs) and olfactory bulb interneurons, which can be identified by the expression of ISL1 and CHD7 respectively.22,23,24,25,26 The CGE mainly produces 5HT3aR+ cortical interneurons with the specific expression of NR2F1 and NR2F2.5,18,27,28 Specific expression of genes and regulatory networks during GE development contribute to the regional cell diversity and interneuron migration.29,30,31
Single cell RNA-seq (scRNA-seq) has been applied in the researches of developmental neuroscience.32,33,34 Compared with traditional bulk RNA-seq, which generates the mixed expression data from tissues, scRNA-seq can provide transcriptional profiles of single cells.35 Technology with such a high resolution can distinguish different cell types and facilitate our investigation of the complexed neural system.36 Several single cell studies in mice have revealed the diversity of molecular and transcriptional characteristics of GE interneurons during embryonic development.17,37,38,39 However, only a few studies have been reported for the generation of inhibitory neurons during GE development in primate, and the conservation and heterogeneities between rodents and primates are still limited due to the lack or quality of transcriptional analysis.40 In addition, the complicated regulatory networks underlying the sophisticated cell proliferation, spatial distinction, lineage determination and interneuron migration are still not clear.41,42
In this study, we used the scRNA-seq and single cell Assay for Targeting Accessible-Chromatin with high-throughput sequencing (scATAC-seq), a technology which enables the capturing of open regions on chromatins,43,44 to investigate the transcriptional trajectories profiles of GEs in the fetuses of human and Macaca fascicularis. We found that most of the gene regulatory networks were well conserved in the developing GEs between human and macaque, including those involved in cell proliferation, cell fate determination and migration. We identified those genes that are potentially important for cell-fate determination in different human GE regions. Moreover, we discovered novel regulatory pathways that may be involved in the fate determination of MGE and LGE, and cell migration in GEs. Importantly, the existence of outer radial glial cells was confirmed in human developing GEs.
Results
Cell diversity in the early developmental stages of primate GEs
We preformed droplet-based scRNA-seq of GEs from three Macaca fascicularis and two human fetuses in order to dissect and compare the developmental profile of primate GEs. Macaque samples were from gastral week (GW) 7, 10 and 12 when GEs are still in the early stage of development. Human samples were collected from GW 9 and 13 which are likely in the comparable developmental windows (Fig. 1a). After sequencing, mapping and quality control, we got 13,782 cells from human and 29,269 cells from macaque (Supplementary information, Table S1). Then the data from two species were integrated by Seurat3.0.45 Unsupervised method was used to cluster different cells (Fig. 1b), and we detected 21 clusters of cells (Supplementary information, Fig. S1a). Each clusters were annotated by well-known markers for the classification of those cells (Fig. 1b). We removed two clusters of excitatory neurons from neighboring cerebral cortex expressing NRN1 and SLA46,47 (Supplementary information, Fig. S1b, c). We also found the clusters of microglia and hemocytes discerned by their corresponding markers, CX3CR1 and SLC4A148,49 (Supplementary information, Fig. S1d), which were also removed. Among the remaining GE derived cells, we could distinguish cells from GE progenitors and post-mitotic cells by differential gene expression analysis (Supplementary information, Fig. S1e). Neural progenitors including radial glia cells (RGCs) and intermediate progenitors (IPCs) of GEs showed high expression levels of NES, VIM or ASCL1, DLL150 (Fig. 1c, d and Supplementary information, Fig. S1e, f). Proliferating cells in cell cycle could be marked by TOP2A51 (Fig. 1c, d). Postmitotic cells could be further defined by regional specific markers (Fig. 1c). MGE comprised cells specifically expressing NKX2-1 and LHX6, while MEIS2, ISLR2 are expressed in LGE and NR2F1, NR2F2 in CGE (Fig. 1d and Supplementary information, Fig. S1e, f). We then calculated the normalized cell ratio of human and macaque cells for each cell type, which was visualized in river plot, suggesting that no significant difference between human and macaque GE cell partitions (Fig. 1e and Supplementary information, Fig. S1g). Similarity analysis based on gene expression between different cell types from human and macaque also proved the accuracy in the integration of the data from the two species (Fig. 1f).
To present the trajectory of cell development in primate GEs, we performed RNA velocity analysis (scVelo),52,53 a method to infer trajectory by reconstructing the developmental sequence of transcriptional changes. As predicted, cells originated from GE progenitors proliferated and generated IPCs,54 and then differentiated into various lineages in MGE, LGE and CGE (Fig. 1g). In conclusion, our analysis confirmed the basic evolutional conservation between macaque and human, and revealed the cell diversity and developmental trajectory in human and macaque GEs.
Specific cell identities of primate GE progenitors
In mammalian developing brain, neurons are generated by the RGCs located in the ventricular zone (VZ) and IPCs in the subventricular zone (SVZ) in the cerebral cortex and GE.55,56 We subset the neural progenitor cells from the human and Macaca fascicularis GE, including RGCs and IPCs (Fig. 2a). RGCs were distinguished by the high expression levels of HES1, VIM and NES, while IPC was identified by the expression of ASCL1, HES6 and DLX250,57 (Fig. 2b). Among all the RGCs, we observed the regional identities of different GE progenitors. Expression of NKX2-1, PAX6 and NR2F2 represented the MGE, LGE and CGE progenitors respectively (Fig. 2c).
Studies of the cerebral cortex revealed a specific proliferative region outside the SVZ/VZ, named outer subventricular zone (oSVZ) in the primate neocortex.56 Outer radial glia cells (oRGs) located in the oSVZ are different from RGCs in the VZ, and they are considered to be the major contributor of brain volume expansion in primates, especially human.56 Notably, we identified the primate specific outer radial glia cells (oRGs) within the RGCs of GEs based on the concentrated and specific expression of FAM107A, HOPX and TNC58 (Fig. 2a, b). We performed immunostaining of HOPX and confirmed the expression of HOPX in both the VZ and SVZ of the GW13 human and GW12 macaca LGE, as well as those areas far away from the VZ, similar to those in the cortex (Fig. 2d). However, we did not observe any positive signals in mouse GE or cortex (Supplementary information, Fig. S2a). This indicates that oRGs exist not only in the cerebral cortex, but also in GEs during the primate brain development.
We then sought to find the differences of GE progenitors between primate and mouse. We integrated our primate data with a published mouse GE dataset (see Method and Material), and subset the progenitor clusters. Focusing on the HES1+ RGCs, we performed differential gene expression analysis between different species (Fig. 2e and Supplementary information, Fig. S2b). A group of genes highly expressed in human and macaque were identified (Fig. 2e and Supplementary information, Fig. S2c). Among those differentially expressed genes (DEGs), some of them were previously reported as specifically expressed genes in human RGCs, such as MOXD1 and TCIM58,59,60 (Fig. 2e and Supplementary information, Fig. S2c), which verified our discovery. Other proliferation related genes such as CLU61 and VEPH1,62 might contribute to species differences as well (Fig. 2e and Supplementary information, Fig. S2c).
The transitions from RGCs to IPCs and mitotic to postmitotic cells play important roles in cell development and differentiation during brain development. Analysis of IPCs revealed that RGS16, a regulator of G protein-coupled receptor signaling cascades,50,63,64 was specifically expressed in those cells that just got into postmitotic period in both primate and mouse (Fig. 2f and Supplementary information, Fig. S2d, e). Gene regulatory network performed by scATAC-seq showed that RGS16 was regulated by several proliferation related genes, such as PAX6, GLI3 and HES565,66,67 (Fig. 2f). Moreover, regional specific markers, such as NKX2-1 in MGE, NR2F2 in CGE and ISL1 in LGE started to express correlated to the reduced expression of RGS16 (Supplementary information, Fig. S2d). These patterns suggest that RGS16 may act as a transition factor between IPCs and cell fate-determined precursors.
The above results indicate that oRGs are present in primate in both developing GE and cerebral cortex, which may contribute to the species difference between primates and rodents.
Cell fate determination in the developing MGE and CGE
To depict the developmental lineages in MGE, LHX6-expressing groups were selected for further differential gene expression analysis and trajectory reconstruction (Fig. 3a and Supplementary information, Fig. S3a). We detected four distinct cell types with one of them presumed to be precursors based on their gene expression profiles. The three other branches specifically comprised SST-, LHX8- and CRABP1-expressing cells respectively (Fig. 3b, c).
MGE precursors were specified by the expression of transcription factors NKX2-1, TOX3, and PLS3 (Fig. 3b), which were gradually expressed during cell maturation. All of the presumed more mature cells in MGE shared the expression of NRXN1 and CHL1 (Supplementary information, Fig. S3b), whose functions are related to cell adhesion and axon projection according to biological process gene ontology (Fig. 3d).
Throughout the lineage of SST+ interneurons, ZEB2 and PLS3 were expressed along with genes including ERBB4 and CXCR4 (Fig. 3b and Supplementary information, Fig. S3b). Previous studies have reported that ERBB4 and CXCR4 regulate the tangential migration of interneurons derived from GE towards cerebral cortex in mice.68,69 These patterns suggest that the SST+ interneurons originated from MGE would migrate to the cerebral cortex.
LHX8+ cells are MGE-derived cholinergic projection neurons destined to the basal telencephalon. Most of LHX8+ cells were detected in the MGE of human GW9 and macaque GW7, but barely in the three older samples (Fig. 3f and Supplementary information, Fig. S3c). Similarly, much more LHX8+ cells were detected at E11 and 14 than that at E17 in mouse MGE (Supplementary information, Fig. S3c). This suggests that LHX8+ cholinergic projection neurons may be generated early in primate brain development, which is consistent with the findings in mouse.
We also found a small group of cells that specifically express CRABP1 and ANGPT2 alongside the LHX8+ cells (Fig. 3b–e). We analyzed those two groups of cells independently in order to find out more details about them. Feature plots revealed that ANGPT2 expression was completely separated from that of LHX8, while CRABP1 was expressed in a small percentage of LHX8+ cells (Fig. 3e). Further inspection of DEGs led to the finding of ST18 and ETV1 expression in the ANGPT2+ group (Supplementary information, Fig. S3d). ST18 and ETV1 were described as the markers of cortical parvalbumin (PV) interneurons precursor.17 We assumed that those cells would migrate towards cerebral cortex during brain development and represent the precursors of cortical PV interneurons.
The above results revealed that the cells differentiate early in MGE and the cell lineages diversify in a time-dependent manner, which was also presented in pseudotime analysis constructed by Monocle370 (Fig. 3g, h).
CGE mainly generated 5HT3aR+ cortical interneurons. We extracted the CGE cells to inspect their cellular diversity (Supplementary information, Fig. S3e). Almost all of the CGE cells highly expressed NR2F1 and NR2F2 (Supplementary information, Fig. S3f). At this development stage, we detected the expression of CALB2 and CCK (Supplementary information, Fig. S3f), which could represent a group of bipolar and multipolar interneurons in the adult cerebral cortex.5,18,71 However, other classic CGE-derived interneurons markers, such as RELN and VIP were barely expressed in our data (Supplementary information, Fig. S3f). These results suggested that the CCK+ and CALB2+ interneurons might be specified earlier than the RELN+ and VIP+ interneurons in the primate CGE.
Cell fate decision in the developing LGE
LGE was known as the origin of striatal MSNs and olfactory bulb (OB) interneurons.22,24 To reveal its developmental trajectories, we selected the LGE lineage from the primate combined data. After unsupervised clustering, seven different cell types were detected in the primate developing LGE (Fig. 4a and Supplementary information, Fig. S4a). We found that striatal MSN precursors could be divided by the expression of FOXP1 into dorsal LGE (dLGE) and ventral LGE (vLGE).72,73 In dLGE, the majority of cells were the TSHZ1+ striatonigral (D1) MSNs. In vLGE, there were two types of interneurons, with striatonigral (D1) MSNs expressing PDYN and striatopallidal (D2) MSNs expressing PENK74,75,76(Fig. 4b).
We then performed differential gene expression analysis between the OB lineage and three other striatal lineages. Volcano plot revealed that the OB lineage cells expressed at high levels of the stem cell related genes, such as TOX3, CHD7 and ID277,78,79 (Fig. 4c). Gene ontology analysis of those DEGs was in support of the evidence that OB interneuron precursors exist in the LGE (Fig. 4d). The cell distribution at different time showed that most of the OB lineage cells were detected from GE at later stages both in human and macaque (Supplementary information, Fig. S4b). These results suggest that OB interneuron precursors were generated later than striatal MSN precursors in LGE and would then migrate to OB for further proliferating.23
Interestingly, we also found a group of cells shared some MGE markers different from the striatal MSNs, such as LHX8, NKX2-1 and LHX6, and LGE markers including ZFHX3 and ISLR280 (Fig. 4e and Supplementary information, Fig. S4c). The absence of MEIS2 expression in those cells further indicated that they were different from LGE derived cells (Supplementary information, Fig. S4c). To figure out the identity and origin of those cells, we found their specific markers that were not expressed in other typical LGE or MGE clusters (Fig. 4e). NRP1 encodes Neuropilin-1, a receptor in Semaphorin signaling pathway.81,82 NRP1 showed concentrated expression in those cells, while SEMA3A were expressed mainly in those typical LGE MSNs co-expressing EBF1 (Fig. 4e). Therefore, we assumed that this group of cells were possibly derived from MGE, and migrated passing along developing striatum to the cortex under the regulation of SEMA3 signaling pathway.14 CellChat analysis,83 a tool that can calculate cell–cell signaling pathways between clusters by predicting major signaling inputs and outputs, showed that SEMA3 signaling pathway was highly activated in the LGE group. In this pathway, SEMA3A-expressing cells might release the signal and NRP1-expressing cells would response to SEMA3A (Fig. 4f). Another highly expressed gene in those cells, PEG10, drew our attention. PEG10 can produce long noncoding RNA (lncRNA) as well as PEG10 protein, both of which were reported to promoted cell migration in cancer cells.84,85 Previous study has shown that MIR7-3HG acted as a competing endogenous RNA to release PEG10 RNA from microRNA27-A-3p.86 MIR7-3HG was also expressed at high levels in those cells from human samples (Supplementary information, Fig. S4c), which might regulate the PEG10 expression. In addition, we downloaded the in situ sequencing data provided by Sun’s group80 and transformed signals into images. A group of NKX2-1+ and LHX8+ cells were located away from MGE at human GW12 (Supplementary information, Fig. S4d). These results indicate that a group of LHX8+ interneurons from MGE would migrate along LGE and be regulated by the SEMA3A-NRP1 signaling pathway, as well as the MIR7-3HG-PEG10 pathways during GE development (Fig. 4g). Combined with the pseudotime analysis, our researches on LGE revealed the regional genetic regulatory model (Fig. 4h).
Transcriptional regulatory network in the human GE
We also performed scATAC-seq of human GE at GW9 to investigate in more detail the transcription regulatory networks in human developing GEs. The single-cell assay from two technical replicates of scATAC-seq data was employed and we obtained a consensus set of 240,672 accessible peaks representing potential cis-regulatory elements (Supplementary information, Fig. S5a). We integrated the human scATAC-seq data with our GW9 scRNA-seq data by ArchR87 and found four major cell types matched between two sets of data (Fig. 5a and Supplementary information, Fig. S5b). We used the gene score, which is a prediction tool of gene expression based on the accessibility of regulatory elements, to visualize typical marker genes expression levels for GE progenitors, MGE, LGE and CGE on the UMAP plots (Fig. 5b). We built a marker gene list for the four major cell types based on the human scRNA-seq data (Supplementary information, Table. S2). ZNF503 and LHX8 were the representative makers of LGE and MGE lineages in the marker list (Supplementary information, Table S2 and Fig. S5c). We found two specific accessible genomic regions (open chromatin peaks) upstream of ZNF503 in LGE and one unique peak upstream of LHX8 in MGE (Fig. 5c). This unique peak of LHX8 contains a highly H3K4Me3 modification region based on UCSF brain methylation database in UCSC browser (http://genome.ucsc.edu), as well as a predicted binding site for PRX2. Both peaks related to ZNF503 contained predicted target motif of MEIS1, a marker of LGE80 (Supplementary information, Fig. S5d). These findings suggest the potential regulatory mechanisms of LHX8 expression in MGE and ZNF503 expression in LGE.
We then generated gene regulatory networks (GRNs) by linking distal cis-regulatory elements to individual genes using a method developed by Tu lab (see Method and Material) in order to identify candidate transcriptional factors (TFs) and regulators. Among all the regulatory pairs captured, we selected those upstream TFs and downstream genes that were both in the marker list visualized by Cytoscape88 (Fig. 5d). Each group of cells showed their specific as well as shared regulatory networks. For example, apart from well-known TFs such as NKX2-1 and LHX6, MGE also shared ARX and MEF2C with CGE, while NR2F2 and NFIB specifically regulated CGE development. LGE seemed to have a relative independent regulatory network mainly consisted of ISL1, FOXP1 and ZFHX4 (Fig. 5d). We got similar results by extracting the hub genes from three GE subregions based on protein–protein interaction (PPI) (Fig. 5e).
We next sought to search for unknown genes that might play important roles during GE development. MIR9-1HG (also named as C1orf61), the microRNA9-1 host gene, stood out from our results. It was involved in progenitors and MGE development (Fig. 5d). The high expression level of MIR9-1HG in progenitors and MGE clusters showed by feature plot confirmed our regulatory network results (Supplementary information, Fig. S5e). We looked through the potential upstream genes of MIR9-1HG in order to figure out its regulatory mechanism. Combined with the opening chromatin peaks around MIR9-1HG and potential binding motifs of TFs, three TFs, MAZ, TCF4 and ASCL1, were selected from 8 candidates (Fig. 5f). MAZ has been reported as a transcriptional activator binding protein,89 suggesting that MAZ would support the expression of MIR9-1HG. The binding motif of TCF4, a TF involved in the initiation of neural differentiation,90,91 was mapped to the open chromatin peaks around the transcription start site of MIR9-1HG (Fig. 5). Correlation analysis among gene expression levels in different cell types showed the significantly positive correlation between TCF4 and MIR9-1HG in all kinds of cells in our data (Fig. 5g and Supplementary information, Fig. S5e). This suggests that MIR9-1HG expression was positively regulated by TCF4. Next, we found ASCL1 binding motifs located at the open chromatin peaks downstream of MIR9-1HG, especially in the progenitor cluster (Fig. 5h). ASCL1 showed significant negative correlation with MIR9-1HG in progenitors but positive correlation in MGE and LGE (Fig. 5g and Supplementary information, Fig. S5e). We assumed that ASCL1 might inhibit MIR9-1HG expression in the progenitors but not in MGE. We further inspect the potential downstream genes of MIR9-1HG. Since MIR9-1HG is a lncRNA and microRNA9-1 can be spliced or generated from it, we searched for the target genes of hsa-mir-9-1 from three different databases, TargetScanVert, miRDB and TargetMiner.92,93,94 Four genes were found to be regulated by both hsa-mir-9-1-3p and 5p in all those databases. Two of them were previously described as LGE specific TFs, ZFHX4 and ID4. The expression levels of these two genes showed remarkably negative correlation with MIR9-1HG in MGE and LGE (Fig. 5g). In MGE, MIR9-1HG was expressed at high levels where ZFHX4 and ID4 were barely expressed, while the gene expression patterns were opposite in LGE (Supplementary information, Fig. S5e). Altogether, these results propose a new regulatory pathway in which MIR9-1HG is regulated by ASCL1 and TCF4. MIR9-1HG-encoding hsa-mir-9-1 would inhibit the expression of ZFHX4 and ID4. This may explain how the distinct fates of MGE and LGE are determined during early brain development in human.
Discussion
In this study, we investigated the single-cell transcriptional identity of primate embryonic GEs and the gene regulatory networks that are potentially involved in cell fate determination in different subregions, as well as in migrating routes and destinations through scRNA-seq and scATAC-seq. Recently, a study adopted scRNA-seq and in situ sequencing to explore the origin of diverse interneurons in human fetal brains was published just before the submission of this manuscript. The study provided details on the temporal and spatial identity of interneuron precursors based on the expression of different genes.80 In addition, their in situ sequencing data will provide a rich resource for other groups including ours to confirm the expression of many genes. Another study comparing human and mouse interneuron development was published during the review of our article. The study also adopted scRNA-seq analysis to show details of the cell identities during human GE development as we did,95 but had not revealed the gene regulatory networks or unique developmental mechanism in human GEs compared to that in rodents. Focusing on the neurogenesis stage, we studied the early cell differentiation and migration in human and Macaca fascicularis developing MGE, LGE and CGE. Integration between human and macaque datasets revealed the well conserved GE cell diversities, the existence of oRGs in primate GEs and a novel regulatory pathway in human LGE. By combining the analyses of scATAC-seq and scRNA-seq data, we discovered the TFs that are potentially important for cell fate determination in different human GE regions, altogether with their gene regulatory networks including a novel regulatory pathway that may lead to the distinct fates of MGE and LGE.
oRGs are considered to be the major driving force for brain volume expansion in primates, especially human.56 Our results confirmed the existence of oRGs in human developing GE. This would explain the expansion and increased cell density of human GEs, compared with mouse GEs.55,56,96 Other genes found by differential gene expression analysis between human and mouse progenitors in GE partially matched the previous studies in cerebral cortex.97 For example, FAM107A is expressed in the oRGs of both cerebral cortex and GEs.58 In addition, we found that cell proliferation associated genes TCIM and CLU are expressed at high levels in the human GE progenitors.58,59 The comparison among macaque, human and mouse is hard to display. When we perform lift-over between the macaque and human genome, some human genes could not be found in the orthologous gene list downloaded from Ensembl V104 due to the incomplete annotation of the Macaca fascicularis genome. Nevertheless, we inspected those DEGs in macaque GE progenitors and found the conserved expression of CLU in primate RGCs, while TCIM was not expressed in macaque. The regulatory roles of those genes in the progenitors need further investigation.
The cortical interneurons mainly comprise PV+ or SST+ GABAergic interneurons derived from MGE, and VIP+ or RELN+ 5HT3aR+ interneurons derived from CGE.5,18,71 In mice, the early differentiation and molecular markers of GE interneurons have been well investigated.17,37,39,71 Our study has confirmed the cell diversity during MGE development but not CGE. Most cells in CGE expressed NR2F2 and NR2F1 but barely VIP and RELN (Supplementary information, Fig. S1h and Fig. S3f), the typical marker for CGE derived mature interneurons.5,18 We assume that more specific cell fates and functions in CGE remain unclear at the stages studied here. In MGE, we identified a small group of cells specifically expressing CRABP1 and ANGPT2. Combined with the scRNA-seq analysis of PV+ interneurons and the developmental process of GE in mice,17,98,99 those cells were defined as the PV+ interneuron precursors derived from MGE.
The migration of GE interneurons begins at early embryonic stage and continues until the extinction of the germ layer in the brain. Several signal pathways have been shown to regulate the migration of interneurons. CXCL12/CXCR4 signaling controls the migration of interneurons during brain development.68,69 We detected the expression of ERBB4 and CXCR4 in the SST+ interneuron lineage, as well as CXCL12 in the LGE lineage. Our results indicate that the migration of SST+ interneurons is under the regulation of CXCL12/CXCR4 signaling pathway, mainly through the CXCL12 released from LGE. The other cell migration related pathway is the SEMA3-NRP signaling, with SEMA3A repulsing the migration of NRP1+ neurons.14,81,82,100,101 In LGE, we detected the expression of SEMA3A, NRP1 and NRP2. However, SEMA3A is expressed in the typical striatal interneurons, while NRP1 is expressed in a group of MGE-derived interneurons. This is consistent with previous reports that striatal interneurons will inhibit cortical interneurons derived from MGE from straying into striatum through SEMA3-NRP signaling pathway in mouse.14 The in situ sequencing data provided by Sun’s group80 also proved that a group of NKX2-1+ and LHX8+ cells were located near LGE at human GW12 (Supplementary information, Fig. S4d). In this case, our results indicate that some NKX2-1+ and LHX8+ interneurons in primate MGE would pass along LGE towards their specific destination under the regulation of SEMA3A-NRP signaling pathway. Another potential regulatory network we found in the migrating MGE cells was the MIR7-3HG-PEG10 pathway. PEG10 were expressed both in those LHX8+ interneurons and some striatal interneurons. In human cancer cells, both lncRNA and proteins encoded by PEG10 were verified to promote cell migration and invasion.84,85 Recently, a study demonstrated that PEG10 could package its own mRNA for secretion in both human and mouse.102 Altogether, these findings suggest that PEG10 promotes the interneuron migration and is involved in intercellular communication during human GE development. However, the annotations of lncRNA are not so precise in Macaca fascicularis genome compared with protein coding genes. That is why we could not identify the homologous gene of human MIR7-3HG in macaque. Our analysis of the developmental MGE and LGE revealed the regional genetic regulatory model and a new possible gene regulatory pathway influencing the surrounding cells.
During brain development, the differences in gene expression determine the diversity among brain regions. The regional characteristics are regulated by different and complicated gene sets, instead of one or two genes. The major regulatory networks of GEs have been identified previously. For instance, MGE development is considered to be induced by sonic Hedgehog (SHH) signaling pathway in the ventral forebrain. NKX2-1 expression is induced by SHH. NKX2-1 then stimulates the expression of LHX6, and SOX6 which is required for SST+ and PV+ interneurons development.13,103 scATAC-seq enabled us to find more potential regulatory networks during human GE development. We identified MIR9-1HG as a potential master regulatory gene in GE progenitors and MGE. ASCL1 and TCF4 regulate its expression, and the encoded mir9-1 might suppress the expression of ZFHX4 and ID4 in MGE. However, the very upstream genes of this regulatory pathway remain to be disclosed. Our study uncovered several crucial TFs in different regions of human GE. Moreover, we found a possible specific pathway that participates in the regulation of differential gene expression between MGE and LGE.
In summary, our study demonstrates the strong conservation of cell diversity and lineage characteristics in GEs between human and Macaca fascicularis. We found some DEGs between human and mouse GE progenitors and discovered the presence of oRGs in primate GE. Moreover, our study verified the conservation of gene regulatory networks in the developing primate GE and explored the new mechanisms involved in cell migration and fate determination of different GE regions.
Materials and methods
Human sample collection
The human clinical tissues of pregnancies at GW 9 and 13 were obtained upon therapeutic termination of pregnancy at Peking University Third Hospital and department of obstetrics and gynecology at Peking Union Medical College Hospital, Beijing, China. An informed consent document was signed by the patient before the collection of the human sample. The whole experiment was examined by Ethics Committee of Peking University Third Hospital (M2021461) and Peking Union Medical College Hospital (JS-2471).
Macaca fascicularis subjects
The macaque samples at GW7, 10, 12 were collected at Beijing Institute of Xieerxin Biology Resource under the supervision of veterinarian. All surgery programs were approved by the Institutional Animal Care and Use Committee of Xieerxin Biology Resource (XEX20212025).
Brain tissue dissection and cells dissociation
The human and macaque brain tissues were dissected in ice-cold normal saline under the dissection microscope. GEs from all five samples were collected. Half of the human GW 9 GE was stored in liquid nitrogen for scATAC-seq. Half of the human GW13 brain was stored in 4% paraformaldehyde (PFA) for immunostaining. Each tissue for scRNA-seq was dissociated in 500 μL dissociation agent (400 U/mL DNaseI on hibernate E buffer, 10 U/mL papain) at 37°C on a thermocycler for 15 min. Dissociation was terminated by 500 μL 10% FBS in Hibernate E buffer. Cells were centrifuged at 4°C for 5–10 min. After removing the supernatant, 1 mL HA buffer was added in and cells were dispersed into single-cell suspension by repetitive pipetting.
sc-RNA Seq
Using single cell 3′ GEM, Library & Gel Bead Kit V3.1 (10× Genomics, 1000075) and Chromium Single Cell B Chip Kit (10× Genomics, 1000074), the prepared cell suspension (300–600 living cells per μL determined by Count Star) was loaded onto the Chromium single cell controller (10× Genomics) to generate single-cell gel beads in the emulsion according to the manufacturer’s protocol. Then single cells were suspended in PBS containing 0.04% BSA. The target cell will be recovered to about 8000 cells by estimation. Captured cells were lysed to release their RNA, which were then barcoded through reverse transcription in individual GEMs. Then reverse transcription was performed on a S1000TM Touch Thermal Cycler (Bio Rad) at 53°C for 45 min, followed by 85°C for 5 min. The cDNA was kept at 4°C, then was amplified and the quality was assessed using an Agilent 4200 (performed by CapitalBio Technology, Beijing).
Single-cell gene expression quantification
Cell Ranger (v5.0.1) was used to convert raw sequencing data to FastQ format, perform quality control and read counting of gene with default parameters. Human samples were mapped to the GRCh38 (hg38) and Macaque samples were mapped to the (v5.0). Gene-cell data matrix was analyzed by Seurat (v4.0.4), and we used MiQC (v1.1.3)104 (posterior.cutoff = 0.95, model.slot = “flexmix_model”, model.type = “spline”) to remove low quality genes. Cells expressing more than 20% of mitochondrial genes were also excluded. We also removed the cells expressing less than 800 genes or more than 6000 genes. Cells that expressed hemoglobin genes were also filtered out. Then we performed doublet analysis using DoubletFinder (v.2.0.3)105 and removed the predicted doublet. Count data were normalized and scaled by the SCTransform function.
Identification of marker genes among clusters in scRNA-seq and cell type annotation
Principal Component 1–11 were used to cluster the cells by FindClusters function under the resolution of 0.9. The marker genes of each cluster were identified by using the FindAllMarkers function (thresh.use = 0.25, only.pos = TRUE, min.pct = 0.25, logfc.threshold = 0.5, test.use = “wilcox”). Genes with P < 0.05 were selected as markers. Clusters were visualized by UMAP and verified by several well-known marker genes. RGC and IPCs of GE showed high expression level of HES5 or ASCL1. MGE comprised cells specifically expressing NKX2-1 and LHX6, while MEIS2, ISL1 in LGE and CALB2, NR2F2 in CGE, microglia and pericytes could be discerned by their corresponding markers, namely CX3CR1 and COL3A1. And we used FindMarkers function (thresh.use = 0.5, only.pos = TRUE, min.pct = 0.25, logfc.threshold = 0.5, test.use = “wilcox”) to find specific genes in each individual cluster.
Integrate human and Macaca fascicularis scRNA-seq by seurat3.0
Seurat3.045 was used to integrate different assay based on species by the function FindIntegrationAnchors and IntegrateData with default parameters. Then we used RunPCA function (reduction = “pca”, npcs = 50) to get the Principal Component of the assay, RunUMAP function (reduction = “pca”, dims = 1:11) to visualize data, FindNeighbors function (dims = 1:11) and FindClusters function (resolution = 0.9) to get biological significant clusters. And we performed Glmnet (v4.1-3) to calculate the expression similarity between different species via penalized maximum likelihood (alpha = 0.99, nfolds = 10), we also compared the cluster annotation based on integrated assays with individual assays and visualized it in Sankey plot.
Integrate human scRNA-seq by seurat3.0
The method was just similar to previously mentioned and we integrated matrix based on different time points. As to further cluster and analysis the important clusters, the same pipeline was used. We used RunPCA, FindNeighbors, FindClusters, RunUMAP, FindAllMarkers and FindMarkers functions on four main cell types (GE progenitors, MGE, CGE, LGE). The PC number we used was 18 and the resolution we used was 0.65.
Integrate mouse scRNA-seq by seurat3.0
All the GEO datasets were obtained from the GEO website (GEO: GSE142768, GSE184879), which is publicly accessible at https://www.ncbi.nlm.nih.gov/geo. SCTransform function was used to normalize and scale the count matrix. Two datasets of E12.5 mouse MGE and LGE cells were integrated and calculated by the same Seurat pipeline. We performed FindIntegrationAnchors and IntegrateData function with default parameters to integrate two Seurat objects. Then we used 16 principal Components calculated by RunPCA function to construct the Shared Nearest Neighbor graph by FindNeighbors function. FindClusters function was performed to get the clusters (resolution = 0.5). Three MGE data of E11, E14 and E17 used the same Seurat pipeline (dims = 1:16, resolution = 0.5).
Cell-cycle analysis
We used Seurat package-supplied cell cycle related genes, 43 genes of S phase and 54 genes of G2/M phase, to calculate the stage of the cells by performing the CellCycleScoring function in Seurat.
Gene Ontology (GO) enrichment analysis
We used clusterProfiler (v4.1.4)106 to get enriched GO terms of marker genes. We picked genes (P < 0.05) found in previous study and used enrichGO function (ont = “BP”, pAjustMethod = “BH”, pvalueCutoff = 0.05, qvalueCutoff = 0.05).
Trajectory analysis
Splicing-specific count data were calculated by Velocyto (v.0.17.17)52 for downstream RNA velocity analysis with default parameters. The resulting loom files were then further analyzed using the scVelo (v0.2.3).53 We performed gene selection, normalization (min_shared_counts = 30, n_top_genes = 2000), moment estimation (n_pcs = 30, n_neighbors = 30), and estimation of RNA velocities. We used the stream embedding function to visualize RNA velocities on UMAP. Monocle 370 was also used to infer the pseudotime trajectories of the major clusters. Monocle “cds” was constructed by our data calculated by Seurat. Then we defined the cells showed as precursor as the “root_state”.
Cell–cell communication analysis
Cell–cell interactions between each cluster were identified and visualized by CellChat (v1.1.2).83 Seurat objects for human sample were merged, createCellChat function was applied to create a CellChat object from Seurat normalized data, then addMeta and setIdent function were used to add clusters labeled in previous Seurat object. We focused on the CellChat human database and performed “identifyOverExpressedGenes”, “identifyOverExpressedInteractions” functions to identify over-expressed ligands or receptors. Then we used “projectData” function to project gene expression data onto PPI network. “computeCommunProb” and “filterCommunication” functions (min. cells = 10) were used to compute communication probability and infer cellular communication network. “computeCommunProbPathway” and “aggregateNet” functions were performed to infer the cell–cell communication at a signaling pathway level between each cluster.
Nuclei isolation from frozen brain tissue
Nuclei from previous frozen GEs of human GW9 were isolated. Tissue was cut into small pieces and ground in 2 mL of ice-cold homogenization buffer [20 mM Tris, pH 8.0 (Thermo Fisher Scientific)], 500 mM sucrose (Sigma), 50 mM KCl (Thermo Fisher Scientific), 10 mM MgCl2 (Thermo Fisher Scientific), 0.1% NP-40 (Roche), 1× protease inhibitor cocktail (Roche), and 1% nuclease-free BSA, and 0.1 mM (DTT). To release the nuclei, tissues were homogenized by strokes. 30 μM cell strainer was used to filter nuclei into a 15 mL centrifuge tube. After centrifuging for 5 min at 4°C, nuclei were obtained and washed twice with 1 mL of ice-cold blocking buffer (1× PBS supplemented with 1% BSA). Another centrifuging was performed and the nuclei were collected in 50 μL of 1× PBS containing 1% BSA and counted by DAPI staining.
scATAC-Seq library preparation and sequencing
We used DNBelab C Series Single-Cell ATAC Library Prep Set (MGI, #1000021878107) to prepare single-cell ATAC-seq libraries. We then created the human GE library. The transposed single-nucleus suspensions were converted to barcoded scATAC-seq libraries. After procedures including droplet encapsulation, pre-amplification, emulsion breakage, capture beads collection, DNA amplification, and purification, indexed sequencing libraries were prepared according to the user guide. We measured the concentrations of sequencing library with Qubit ssDNA Assay Kit (Thermo Fisher Scientific). The library was sequenced using a paired-end 50 sequencing scheme by the BGISEQ-500 platform at China National GeneBank (CNGB).108
scATAC-seq process
Raw sequencing reads from BGISEQ-500 sequencer were filtered and demultiplexed using PISA, and aligned to the hg19 human genome. Fragment data were further processed using ArchR (V2.0.1).87 The cells with transcription start site (TSS) enrichment score less than five and fragment number less than 1000 were removed. We did doublet analysis by “addDoubletScores” and “filterDoublets” functions in AchrR. Then we performed “iterative LSI” and used major pc2-pc30 to cluster by Seurat’s “FindClusters” function with resolution 0.6. We used gene activity scores to identified different cell types for various marker genes. Clusters were verified by several well-known marker genes as previous described. 240672 peaks were found by performed Peak calling using MACS2 (v2.1.1), and we can link peak to gene by the addPeak2GeneLinks function. We can also use CIS-BP database to identify accessible cis-regulatory elements (CREs).
Identification of marker genes among clusters in scATAC-seq
We use “getMarkerFeatures” function (useMatrix = “GeneScoreMatrix”, bias = c (“TSSEnrichment”, “log10 (nFrags)”, testMethod = “wilcoxon”)) and FDR ≤ 0.01 & |Log2FC| ≥ 1 to find marker genes among clusters in scATAC-seq.
Build gene regulatory networks
To match single-cell transcriptomes and epigenomes, we combined scATAC-seq with human GW9 scRNA-seq. We performed “addGeneIntegrationMatrix” function to integrate gene expressing matrix of scRNA-seq on “geneScoreMatrix” of scATAC-seq. Then we predicted cis-linkages between CREs and downstream genes. Moreover, we inferred cis-regulatory interactions and defined a TF-gene GRN. This method made by Tu lab has not been published. We extracted important TFs based on the marker genes found in major clusters (GE progenitors, MGE, CGE, LGE). The hub gene was calculated based on PPI within the genes found in GRN. And networks were visualized in Cytoscape (v3.9.0).
Immunofluorescence
Brains were fixed in 4% PFA before treatment and dehydrated in 30% sucrose for cryosections. Tissue sections of 20 μm sectioned by Leica CM 1950 were used for further immunostaining. Sections were processed by blocking buffer (3% BSA + 10% FBS + 0.3% Triton X-100 in 1× PBS) for 1 h, and incubated by primary antibodies of HOPX (Sigma, HPA030180) at 4°C overnight. After washing by 1× PBS three times for 10 min, sections were incubated by secondary antibodies and DAPI for 1 h. Immunofluorescence pictures were taken using Zeiss confocal LSM 800.
Data availability
The single cell sequencing data were uploaded to China National Center for Bioinformation (https://www.cncb.ac.cn/) with human data in PRJCA007893 and Macaca fascicularis (crab-eating monkey) data in PRJCA007270. The scATAC-seq data reported in this study are also available in the CNGB Nucleotide Sequence Archive (CNSA: https://db.cngb.org/cnsa; accession number CNP0002651).
Change history
16 December 2022
A Correction to this paper has been published: https://doi.org/10.1038/s41422-022-00757-0
References
Assous, M. & Tepper, J. M. Excitatory extrinsic afferents to striatal interneurons and interactions with striatal microcircuitry. Eur. J. Neurosci. 49, 593–603 (2019).
Beierlein, M., Gibson, J. R. & Connors, B. W. A network of electrically coupled interneurons drives synchronized inhibition in neocortex. Nat. Neurosci. 3, 904–910 (2000).
Zhang, X. J. et al. Precise inhibitory microcircuit assembly of developmentally related neocortical interneurons in clusters. Nat. Commun. 8, 16091 (2017).
Kepecs, A. & Fishell, G. Interneuron cell types are fit to function. Nature 505, 318–326 (2014).
Tremblay, R., Lee, S. & Rudy, B. GABAergic interneurons in the neocortex: from cellular properties to circuits. Neuron 91, 260–292 (2016).
Zhang, T. et al. Neuron type classification in rat brain based on integrative convolutional and tree-based recurrent neural networks. Sci. Rep. 11, 7291 (2021).
Sultan, K. T. et al. Clonally related GABAergic interneurons do not randomly disperse but frequently form local clusters in the forebrain. Neuron 92, 31–44 (2016).
Yao, X. H. et al. Electrical coupling regulates layer 1 interneuron microcircuit formation in the neocortex. Nat. Commun. 7, 12229 (2016).
Dienel, S. J. & Lewis, D. A. Alterations in cortical interneurons and cognitive function in schizophrenia. Neurobiol. Dis. 131, 104208 (2019).
Xu, M. Y. & Wong, A. H. C. GABAergic inhibitory neurons as therapeutic targets for cognitive impairment in schizophrenia. Acta Pharmacol. Sin. 39, 733–753 (2018).
Nomura, T. Interneuron dysfunction and inhibitory deficits in autism and fragile X syndrome. Cells 10, 2610 (2021).
Rapanelli, M., Frick, L. R. & Pittenger, C. The role of interneurons in autism and tourette syndrome. Trends Neurosci. 40, 397–407 (2017).
Laclef, C. & Metin, C. Conserved rules in embryonic development of cortical interneurons. Semin. Cell Dev. Biol. 76, 86–100 (2018).
Hernandez-Miranda, L. R., Parnavelas, J. G. & Chiara, F. Molecules and mechanisms involved in the generation and migration of cortical interneurons. ASN Neuro. 2, e00031 (2010).
Hansen, D. V. et al. Non-epithelial stem cells and cortical interneuron production in the human ganglionic eminences. Nat. Neurosci. 16, 1576–1587 (2013).
Nery, S., Fishell, G. & Corbin, J. G. The caudal ganglionic eminence is a source of distinct cortical and subcortical cell populations. Nat. Neurosci. 5, 1279–1287 (2002).
Mi, D. et al. Early emergence of cortical interneuron diversity in the mouse embryo. Science 360, 81–85 (2018).
Wamsley, B. & Fishell, G. Genetic and activity-dependent mechanisms underlying interneuron diversity. Nat. Rev. Neurosci. 18, 299–309 (2017).
Anderson, S. A., Eisenstat, D. D., Shi, L. & Rubenstein, J. L. Interneuron migration from basal forebrain to neocortex: dependence on Dlx genes. Science 278, 474–476 (1997).
Sandberg, M. et al. Transcriptional networks controlled by NKX2-1 in the development of forebrain GABAergic neurons. Neuron 91, 1260–1275 (2016).
Du, T., Xu, Q., Ocbina, P. J. & Anderson, S. A. NKX2.1 specifies cortical interneuron fate by activating Lhx6. Development 135, 1559–1567 (2008).
Hamasaki, T., Goto, S., Nishikawa, S. & Ushio, Y. Neuronal cell migration for the developmental formation of the mammalian striatum. Brain Res. Brain Res. Rev. 41, 1–12 (2003).
Wichterle, H., Garcia-Verdugo, J. M., Herrera, D. G. & Alvarez-Buylla, A. Young neurons from medial ganglionic eminence disperse in adult and embryonic brain. Nat. Neurosci. 2, 461–466 (1999).
Stenman, J., Toresson, H. & Campbell, K. Identification of two distinct progenitor populations in the lateral ganglionic eminence: implications for striatal and olfactory bulb neurogenesis. J. Neurosci. 23, 167–174 (2003).
de Carlos, J. A., Lopez-Mascaraque, L. & Valverde, F. Dynamics of cell migration from the lateral ganglionic eminence in the rat. J. Neurosci. 16, 6146–6156 (1996).
Xu, Z. et al. SP8 and SP9 coordinately promote D2-type medium spiny neuron production by activating Six3 expression. Development 145, dev165456 (2018).
Touzot, A., Ruiz-Reig, N., Vitalis, T. & Studer, M. Molecular control of two novel migratory paths for CGE-derived interneurons in the developing mouse brain. Development 143, 1753–1765 (2016).
Miyoshi, G. et al. Prox1 regulates the subtype-specific development of caudal ganglionic eminence-derived GABAergic cortical interneurons. J. Neurosci. 35, 12869–12889 (2015).
Zimmer-Bensch, G. Diverse facets of cortical interneuron migration regulation - Implications of neuronal activity and epigenetics. Brain Res. 1700, 160–169 (2018).
Ghanem, N. et al. Distinct cis-regulatory elements from the Dlx1/Dlx2 locus mark different progenitor cell populations in the ganglionic eminences and different subtypes of adult cortical interneurons. J. Neurosci. 27, 5012–5022 (2007).
Meganathan, K. et al. Regulatory networks specifying cortical interneurons from human embryonic stem cells reveal roles for CHD2 in interneuron development. Proc. Natl. Acad. Sci. USA 114, E11180–E11189 (2017).
Johnson, M. B. & Walsh, C. A. Cerebral cortical neuron diversity and development at single-cell resolution. Curr. Opin. Neurobiol. 42, 9–16 (2017).
Morarach, K. et al. Diversification of molecularly defined myenteric neuron classes revealed by single-cell RNA sequencing. Nat. Neurosci. 24, 34–46 (2021).
Ozel, M. N. et al. Neuronal diversity and convergence in a visual system developmental atlas. Nature 589, 88–95 (2021).
Shapiro, E., Biezuner, T. & Linnarsson, S. Single-cell sequencing-based technologies will revolutionize whole-organism science. Nat. Rev. Genet. 14, 618–630 (2013).
Harbom, L. J., Michel, N. & McConnell, M. J. Single-cell analysis of diversity in human stem cell-derived neurons. Cell Tissue Res. 371, 171–179 (2018).
Mayer, C. et al. Developmental diversification of cortical inhibitory interneurons. Nature 555, 457–462 (2018).
Allaway, K. C. et al. Genetic and epigenetic coordination of cortical interneuron development. Nature 597, 693–697 (2021).
Chen, Y. J. et al. Single-cell RNA sequencing identifies distinct mouse medial ganglionic eminence cell types. Sci. Rep. 7, 45656 (2017).
Wang, C. et al. Human and monkey striatal interneurons are derived from the medial ganglionic eminence but not from the adult subventricular zone. J. Neurosci. 34, 10906–10923 (2014).
Kim, D. W. et al. Gene regulatory networks controlling differentiation, survival, and diversification of hypothalamic Lhx6-expressing GABAergic neurons. Commun. Biol. 4, 95 (2021).
Lee, P. R., Cohen, J. E., Iacobas, D. A., Iacobas, S. & Fields, R. D. Gene networks activated by specific patterns of action potentials in dorsal root ganglia neurons. Sci. Rep. 7, 43765 (2017).
Cusanovich, D. A. et al. Multiplex single cell profiling of chromatin accessibility by combinatorial cellular indexing. Science 348, 910–914 (2015).
Buenrostro, J. D. et al. Single-cell chromatin accessibility reveals principles of regulatory variation. Nature 523, 486–490 (2015).
Stuart, T. et al. Comprehensive integration of single-cell data. Cell 177, 1888–1902 (2019).
Utsunomiya, S. et al. Ezh1 regulates expression of Cpg15/Neuritin in mouse cortical neurons. Drug Discov. Ther. 15, 55–65 (2021).
Mayer, S. et al. Multimodal single-cell analysis reveals physiological maturation in the developing human neocortex. Neuron 102, 143–158 (2019).
Gyoneva, S. et al. Cx3cr1-deficient microglia exhibit a premature aging transcriptome. Life Sci. Alliance 2, e201900453 (2019).
Bertocchio, J. P. et al. Red blood cell AE1/Band 3 transports in dominant distal renal tubular acidosis patients. Kidney Int. Rep. 5, 348–357 (2020).
Lin, Y. et al. Behavior and lineage progression of neural progenitors in the mammalian cortex. Curr. Opin. Neurobiol. 66, 144–157 (2021).
Malhotra, S. et al. A tri-marker proliferation index predicts biochemical recurrence after surgery for prostate cancer. PLoS ONE 6, e20293 (2011).
La Manno, G. et al. RNA velocity of single cells. Nature 560, 494–498 (2018).
Bergen, V., Lange, M., Peidli, S., Wolf, F. A. & Theis, F. J. Generalizing RNA velocity to transient cell states through dynamical modeling. Nat. Biotechnol. 38, 1408–1414 (2020).
Shen, Z. et al. Distinct progenitor behavior underlying neocortical gliogenesis related to tumorigenesis. Cell Rep. 34, 108853 (2021).
Dehay, C., Kennedy, H. & Kosik, K. S. The outer subventricular zone and primate-specific cortical complexification. Neuron 85, 683–694 (2015).
Hansen, D. V., Lui, J. H., Parker, P. R. & Kriegstein, A. R. Neurogenic radial glia in the outer subventricular zone of human neocortex. Nature 464, 554–561 (2010).
Wilkinson, G., Dennis, D. & Schuurmans, C. Proneural genes in neocortical development. Neuroscience 253, 256–273 (2013).
Pollen, A. A. et al. Molecular identity of human outer radial glia during cortical development. Cell 163, 55–67 (2015).
Lui, J. H. et al. Radial glia require PDGFD-PDGFRbeta signalling in human but not mouse neocortex. Nature 515, 264–268 (2014).
Jung, Y. et al. TC1(C8orf4) regulates hematopoietic stem/progenitor cells and hematopoiesis. PLoS ONE 9, e100311 (2014).
Rizzi, F., Caccamo, A. E., Belloni, L. & Bettuzzi, S. Clusterin is a short half-life, poly-ubiquitinated protein, which controls the fate of prostate cancer cells. J. Cell. Physiol. 219, 314–323 (2010).
Brown, T. J., Kollara, A., Shathasivam, P. & Ringuette, M. J. Ventricular zone expressed PH domain containing 1 (VEPH1): an adaptor protein capable of modulating multiple signaling transduction pathways during normal and pathological development. Cell Commun. Signal. 17, 116 (2019).
Derrien, A. & Druey, K. M. RGS16 function is regulated by epidermal growth factor receptor-mediated tyrosine phosphorylation. J. Biol. Chem. 276, 48532–48538 (2001).
Soundararajan, M. et al. Structural diversity in the RGS domain and its interaction with heterotrimeric G protein alpha-subunits. Proc. Natl. Acad. Sci. USA 105, 6457–6462 (2008).
Duparc, R. H. et al. Pax6 controls the proliferation rate of neuroepithelial progenitors from the mouse optic vesicle. Dev. Biol. 301, 374–387 (2007).
Zhu, G. et al. HES5 promotes cell proliferation and invasion through activation of STAT3 and predicts poor survival in hepatocellular carcinoma. Exp. Mol. Pathol. 99, 474–484 (2015).
Letelier, J. et al. The Shh/Gli3 gene regulatory network precedes the origin of paired fins and reveals the deep homology between distal fins and digits. Proc. Natl. Acad. Sci. USA 118, e2100575118 (2021).
Yang, S. et al. Cxcl12/Cxcr4 signaling controls the migration and process orientation of A9-A10 dopaminergic neurons. Development 140, 4554–4564 (2013).
Lopez-Bendito, G. et al. Chemokine signaling controls intracortical migration and final distribution of GABAergic interneurons. J. Neurosci. 28, 1613–1624 (2008).
Trapnell, C. et al. The dynamics and regulators of cell fate decisions are revealed by pseudotemporal ordering of single cells. Nat. Biotechnol. 32, 381–386 (2014).
Lim, L., Mi, D., Llorca, A. & Marin, O. Development and functional diversification of cortical interneurons. Neuron 100, 294–313 (2018).
Takahashi, K., Liu, F. C., Hirokawa, K. & Takahashi, H. Expression of Foxp4 in the developing and adult rat forebrain. J. Neurosci. Res. 86, 3106–3116 (2008).
Ko, H. A., Chen, S. Y., Chen, H. Y., Hao, H. J. & Liu, F. C. Cell type-selective expression of the zinc finger-containing gene Nolz-1/Zfp503 in the developing mouse striatum. Neurosci. Lett. 548, 44–49 (2013).
Chen, R. et al. Decoding molecular and cellular heterogeneity of mouse nucleus accumbens. Nat. Neurosci. 12, 1757–1771 (2021).
Durieux, P. F., Schiffmann, S. N. & de Kerchove d’Exaerde, A. Targeting neuronal populations of the striatum. Front. Neuroanat. 5, 40 (2011).
Ena, S., de Kerchove d’Exaerde, A. & Schiffmann, S. N. Unraveling the differential functions and regulation of striatal neuron sub-populations in motor control, reward, and motivational processes. Front. Behav. Neurosci. 5, 47 (2011).
Sahu, S. K. et al. TOX3 regulates neural progenitor identity. Biochim. Biophys. Acta 1859, 833–840 (2016).
Chai, M. et al. Chromatin remodeler CHD7 regulates the stem cell identity of human neural progenitors. Genes Dev. 32, 165–180 (2018).
Ruzinova, M. B. & Benezra, R. Id proteins in development, cell cycle and cancer. Trends Cell Biol. 13, 410–418 (2003).
Yu, Y. et al. Interneuron origin and molecular diversity in the human fetal brain. Nat. Neurosci. 12, 1745–1756 (2021).
Kawasaki, T. et al. Requirement of neuropilin 1-mediated Sema3A signals in patterning of the sympathetic nervous system. Development 129, 671–680 (2002).
Barber et al. Semaphorin3A-neuropilin1 signalling is involved in the generation of cortical interneurons. Brain Struct. Funct. 222, 2217–2233 (2017).
Jin, S. et al. Inference and analysis of cell-cell communication using CellChat. Nat. Commun. 12, 1088 (2021).
Pandya, N. J. et al. Secreted retrovirus-like GAG-domain-containing protein PEG10 is regulated by UBE3A and is involved in Angelman syndrome pathophysiology. Cell Rep. Med. 2, 100360 (2021).
Li, X. et al. PEG10 promotes human breast cancer cell proliferation, migration and invasion. Int. J. Oncol. 48, 1933–1942 (2016).
Ding, F., Jiang, K., Sheng, Y., Li, C. & Zhu, H. LncRNA MIR7-3HG executes a positive role in retinoblastoma progression via modulating miR-27a-3p/PEG10 axis. Exp. Eye Res. 193, 107960 (2020).
Granja, J. M. et al. ArchR is a scalable software package for integrative single-cell chromatin accessibility analysis. Nat. Genet. 53, 403–411 (2021).
Shannon, P. et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 13, 2498–2504 (2003).
Ray, B. K., Murphy, R., Ray, P. & Ray, A. SAF-2, a splice variant of SAF-1, acts as a negative regulator of transcription. J. Biol. Chem. 277, 46822–46830 (2002).
Forrest, M. P., Waite, A. J., Martin-Rendon, E. & Blake, D. J. Knockdown of human TCF4 affects multiple signaling pathways involved in cell survival, epithelial to mesenchymal transition and neuronal differentiation. PLoS ONE 8, e73169 (2013).
Forrest, M. P. et al. The psychiatric risk gene transcription factor 4 (TCF4) regulates neurodevelopmental pathways associated with schizophrenia, autism, and intellectual disability. Schizophr. Bull. 44, 1100–1110 (2018).
Chen, Y. & Wang, X. miRDB: an online database for prediction of functional microRNA targets. Nucleic Acids Res. 48, D127–D131 (2020).
Kozomara, A., Birgaoanu, M. & Griffiths-Jones, S. miRBase: from microRNA sequences to function. Nucleic Acids Res. 47, D155–D162 (2019).
Agarwal, V., Bell, G. W., Nam, J. W. & Bartel, D. P. Predicting effective microRNA target sites in mammalian mRNAs. Elife 4, 05005 (2015).
Shi, Y. et al. Mouse and human share conserved transcriptional programs for interneuron development. Science 374, eabj6641 (2021).
Berg, J. et al. Human neocortical expansion involves glutamatergic neuron diversification. Nature 598, 151–158 (2021).
Florio, M. et al. Evolution and cell-type specificity of human-specific genes preferentially expressed in progenitors of fetal neocortex. Elife 7, 32332 (2018).
Munoz-Manchado, A. B. et al. Diversity of interneurons in the dorsal striatum revealed by single-cell RNA sequencing and PatchSeq. Cell Rep. 24, 2179–2190 (2018).
Enterria-Morales, D. et al. Molecular targets for endogenous glial cell line-derived neurotrophic factor modulation in striatal parvalbumin interneurons. Brain Commun. 2, fcaa105 (2020).
Mardakheh, F. K., Self, A. & Marshall, C. J. RHO binding to FAM65A regulates Golgi reorientation during cell migration. J. Cell Sci. 129, 4466–4479 (2016).
Marin, O., Yaron, A., Bagri, A., Tessier-Lavigne, M. & Rubenstein, J. L. Sorting of striatal and cortical interneurons regulated by semaphorin-neuropilin interactions. Science 293, 872–875 (2001).
Segel, M. et al. Mammalian retrovirus-like protein PEG10 packages its own mRNA and can be pseudotyped for mRNA delivery. Science 373, 882–889 (2021).
Liu, Z. et al. Sp9 regulates medial ganglionic eminence-derived cortical interneuron development. Cereb. Cortex 29, 2653–2667 (2019).
Hippen, A. A. et al. miQC: An adaptive probabilistic framework for quality control of single-cell RNA-sequencing data. PLoS Comput. Biol. 17, e1009290 (2021).
McGinnis, C. S., Murrow, L. M. & Gartner, Z. J. DoubletFinder: doublet detection in single-cell RNA sequencing data using artificial nearest neighbors. Cell Syst. 8, 329–337 (2019).
Wu, T. et al. clusterProfiler 4.0: A universal enrichment tool for interpreting omics data. Innovation 2, 100141 (2021).
Han, L. et al. Single-cell atlas of a non-human primate reveals new pathogenic mechanisms of COVID-19. bioRxiv, https://doi.org/10.1101/2020.04.10.022103 (2020).
Huang, J. et al. A reference human genome dataset of the BGISEQ-500 sequencer. Gigascience 6, 1–9 (2017).
Acknowledgements
We thank Dr. Da Mi and Dr. Yongchun Yu for their kind advices for this study. This study was supported by grants from major projects of the Ministry of Science and Technology of China (2021ZD0202300), the National Natural Science Foundation of China (31921002, 31730108, 32061143026, 91854118), and the Strategic Priority Research Program and Innovation Program of the Chinese Academy of Sciences (XDB32020100, YJKYYQ20200052, QYZDJ-SSW-SMC007).
Author information
Authors and Affiliations
Contributions
Z.X. conceived and designed the project. Z.Z., D.Z., and F.Y. analyzed the scRNA-seq and scATAC-seq data. M.X. and Z.Z. dissected the brain tissues and Z.Z. did IF staining. T.P. and C.L. performed the scATAC-seq. Y.L. and Q.T. developed the GRNs. P.Z., R.L., J.K., and L.Z. collected the human tissues. Z.X. and Z.Z. wrote the manuscript. F.Y. and S.Z. organized figures. Q.W., Q.T., F.G., and Y.W. supervised the experiment and edited the manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
The original online version of this article was revised: the supplementary files 2, 3 and 4 were incorrect and have been replaced by updated versions.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Zhao, Z., Zhang, D., Yang, F. et al. Evolutionarily conservative and non-conservative regulatory networks during primate interneuron development revealed by single-cell RNA and ATAC sequencing. Cell Res 32, 425–436 (2022). https://doi.org/10.1038/s41422-022-00635-9
Received:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41422-022-00635-9
This article is cited by
-
PCDHA9 as a candidate gene for amyotrophic lateral sclerosis
Nature Communications (2024)
-
Apically localized PANX1 impacts neuroepithelial expansion in human cerebral organoids
Cell Death Discovery (2024)