Thank you for visiting nature.com. You are using a browser version with limited support for CSS. To obtain the best experience, we recommend you use a more up to date browser (or turn off compatibility mode in Internet Explorer). In the meantime, to ensure continued support, we are displaying the site without styles and JavaScript.

# Altered gene expression profiles impair the nervous system development in individuals with 15q13.3 microdeletion

## Abstract

The 15q13.3 microdeletion has pleiotropic effects ranging from apparently healthy to severely affected individuals. The underlying basis of the variable phenotype remains elusive. We analyzed gene expression using blood from three individuals with 15q13.3 microdeletion and brain cortex tissue from ten mice Df[h15q13]/+. We assessed differentially expressed genes (DEGs), protein–protein interaction (PPI) functional modules, and gene expression in brain developmental stages. The deleted genes’ haploinsufficiency was not transcriptionally compensated, suggesting a dosage effect may contribute to the pathomechanism. DEGs shared between tested individuals and a corresponding mouse model show a significant overlap including genes involved in monogenic neurodevelopmental disorders. Yet, network-wide dysregulatory effects suggest the phenotype is not caused by a single critical gene. A significant proportion of blood DEGs, silenced in adult brain, have maximum expression during the prenatal brain development. Based on DEGs and their PPI partners we identified altered functional modules related to developmental processes, including nervous system development. We show that the 15q13.3 microdeletion has a ubiquitous impact on the transcriptome pattern, especially dysregulation of genes involved in brain development. The high phenotypic variability seen in 15q13.3 microdeletion could stem from an increased vulnerability during brain development, instead of a specific pathomechanism.

## Introduction

Individuals with 15q13.3 microdeletion (OMIM #612001) show clinical manifestations ranging from no obvious symptoms to severe intellectual disability, neuropsychiatric disorders, and epilepsy1 (Fig. 1A,B). The most common 15q13.3 deletion spans 2 Mb and includes eight RefSeq genes (CHRNA7, FAN1, TRPM1, KLF13, OTUD7A, MTMR10, ARHGAP11B, and MIR211, (Fig. 1A)2. Many functional and association studies inquired which gene(s) encompassed by the deleted region could be responsible for the phenotype. However, the results were as variable as the clinical manifestation and different groups proposed multiple candidates (CHRNA73,4, OTUD7A5,6, FAN17, ARHGAP11B8, TRPM19, KLF1310), to explain the symptoms (Fig. 1A). A mouse model of the 15q13.3 microdeletion syndrome (Df[h15q13]/+) shows manifestations similar to affected humans ranging from attention deficits to impaired behavior and disrupted prefrontal cortex processing11,12. Thus, the microdeletion effect is stable across species and clearly impairs nervous system function.

Recently, it was suggested that many disrupted biological pathways such as Wnt signaling or ribosome biogenesis may be involved in the molecular mechanism underlying the disease rather than singular dosage-affected genes13. While Zhang et al. used a multiomics approach to identify perturbed biological processes, the multiple employed analyses showed disagreement with respect to the pathomechanism. Also, no shared dysregulated genes were identified between human induced pluripotent stem cells (iPSCs) and mouse cortex13. This could be a result of the in vitro setup and neuronal differentiation protocols, which generally impact gene expression profiles14. We thus sought to inquire gene expression profiles in subjects with 15q13.3 microdeletion in a native/in vivo state.

One major challenge of transcriptomics in a clinical setting is tissue-specific gene expression15,16 and the fact that most of the times the only accessible tissue to probe is blood17. In our previous work we showed, however, that known genes for neurodevelopmental disorders are not necessarily expressed in the adult brain and that genes which are relevant during embryonic development of the central nervous system can be silenced at a later timepoint15. Thus, although not regarded as a representative tissue, blood transcriptomics has the potential to reveal aspects missed in other tissues or iPSCs and it is much easier to analyze in a clinical context or personalized medicine setting.

In the present study, we analyzed the changes in gene expression profiles in the blood of three individuals with heterozygous microdeletion 15q13.3 and intellectual disability associated with epilepsy (Fig. 1C). We identified a significant overlap of differentially expressed genes (DEGs) between humans and mouse (Df[h15q13]/+) cortex. The gene ontology (GO) category most significantly enriched with DEGs was “nervous system development–GO:0007399” and DEGs in blood, which are not expressed in the adult brain, revealed maximum expression levels in the prenatal stage of brain development. The disrupted gene expression profile could lead to an increased vulnerability in the early stages of nervous system development.

## Materials and methods

### Ethics approval

This study was approved and monitored by the ethics committee of the University of Leipzig, Germany (224/16-ek and 402/16-ek). Informed consent was obtained from all subjects and/or their legal guardians. All methods were performed in accordance with the relevant guidelines and regulations. No direct animal experiments have been carried out in this study.

### Chr15q13.2q13.3 microdeletion individuals and mouse model (Df[h15q13]/+)

Three individuals with diagnosed heterozygous 15q13.2q13.3 microdeletions were recruited from Epilepsy Center Kleinwachau, Radeberg, Germany and previously described18. For all individuals we performed Illumina TruSight One Panel and microarray analysis. Patient 1 is a 49 year old male with a deletion of 2.56 Mbp (15q13.2q13.3(30366247_32927476) × 1). The second patient is female, 63 years old and carries a deletion on chromosome 15 of 1.57 Mbp (15q13.2q13.3(30936285_32514341) × 1). Patient 3, a 27 year old male, has a deletion of 2.14 Mbp on chromosome 15 (15q13.2q13.3(30371774_32514341) × 1). He was additionally diagnosed with a maternally inherited splicing-variant in the remaining TRPM1 allele (Chr15:31340091; NM_002420.5; c.1557 + 1G > A) and a heterozygous de novo point mutation in MITF (Chr3:70001032; NM_000248.3; c.629A > G, p.(Asn210Ser)). Pathogenic variants in MITF gene cause albinism, which was also clinically diagnosed in this individual. His ophthalmological phenotype (severe myopia, astigmatism, and pendular nystagmus) were clinical symptoms of the autosomal recessive TRPM1 phenotype (one allele being deleted as part of the 15q13.3 microdeletion and the other allele carrying the maternally inherited splicing variant). In sum, he suffered from a complex combined phenotype with neurologic symptoms attributed to the 15q13.3 microdeletion. Blood RNA samples were taken from the three individuals (two males and one female, aged 27–63 years) and four control subjects (two males and two females, aged 20–52 years).

To identify molecular changes which are consistent across species we used the data generated by Gordon and colleagues (GSE129891)19 and Al-Absi and colleagues12. We analyzed transcriptomes from cerebral cortex tissue of mice with heterozygous deletions on mouse chromosome 7qC syntenic to human 15q13.311,19.

### RNA extraction and sequencing

RNA was extracted from PAXgene blood samples using PAXgene Blood RNA Kit (Qiagen). RNA sequencing (RNA-seq) libraries were prepared using TruSeq RNA Library Prep Kit v2 (Illumina, San Diego, CA) and sequenced on an Illumina NovaSeq platform with 151 bp paired-end reads.

### Differential gene expression (DEG) analysis

RNA-seq reads were mapped to the human genome assembly hg38 with STAR (version 2.6.1d)20. We computed the transcript levels with htseq-count (version 0.6.0)21. From GSE129891 we analyzed read counts of ten wild type and ten (Df[h15q13]/+) mouse cerebral cortex samples. Genes with a sum of less than 10 reads in all samples together were excluded from further analysis. Differential expression of genes was determined with the R package DESeq2 (version 1.30.1)22, which uses the Benjamini–Hochberg method to correct for multiple testing23. Genes were considered to be significantly differentially expressed if p-adj < 0.05. To check clustering of RNA-sequencing samples of subjects and controls, a principal component analysis (PCA) was performed with the R package pcaExplorer (version 2.6.0)24. RNA count data were variance stabilized transformed and all expressed genes were used for computing the principal components (Additional file 1).

### Expression of DEGs in different developmental stages

Expression data of DEGs were obtained from PTEE (version 1.1)15 for the adult brain cortex. Genes expressed at a low level in adult brain tissue may be expressed at a higher level in the developing brain and therefore, could still play a significant role in neurodevelopment. Hence, for DEGs expressed < 1.5 TPM in adult brain cortex (according to PTEE), expression levels in different developmental stages were obtained from the R package ABAEnrichment (version 1.20.0)25 for the whole brain. We used a Tukey’s HSD test to determine whether this group of DEGs (< 1.5 TPM in adult brain cortex) displayed a significantly different expression profile between the brain developmental stages.

DEGs which were expressed < 1.5 TPM in adult brain cortex but > 1.5 RPKM in prenatal stage of the whole brain and reached their maximum of expression in the prenatal stage were selected and further analyzed for GO enrichment.

### DEGs involved in NDDs

A list of genes, which are known to play a role in NDDs, was obtained from PTEE15. DEGs of 15q13.2q13.3 microdeletion patients were compared to the list of NDD genes, to determine DEGs that could contribute to the neurological symptoms observed in those patients. The significance for enrichment of DEGs with NDD genes was calculated using a binomial test in R26.

### GO enrichment

Gene ontology enrichment analysis was performed with the R package GOfuncR (version 1.14.0) for up- and down-regulated DEGs27. GO nodes with a family wise error rate (FWER) < 0.05 were considered significantly enriched. To check for unspecific GO enrichment analysis results, the four control subjects were split in two groups and differential gene expression and GO enrichment analyses were performed for those two control groups.

### Identification of activated/inactivated DEG-interacted functional modules

We investigated the activity of DEG-interacted functional modules to elucidate the roles of DEGs in 15q microdeletion. The DEG-interacted network was constructed by the DEGs and their interacting partners in the human protein interaction network (PIN), which was obtained from the InBio Map database28. A DEG-interacted functional module is a subnetwork of the DEG-interacted network formed by genes annotated by the same biological processes. The functional annotations of genes were obtained from GO29,30, and only the annotations supported by experiments were used in this study. Additionally, to ensure the DEGs’ participation and functional association among genes, all the functional modules were required to contain at least one DEG and one interaction. To determine if the member genes of the tested functional module were overrepresented at the top of the entire ranked gene list, we performed the gene set enrichment analysis (GSEA)31 for evaluating each module’s activity and inactivity separately. To assess the activity (inactivity), the entire gene list was ranked downward (upward) by the fold change of genes between the 15q microdeletion and controls. We then calculated the enrichment score (ES) for each functional module by walking down the ranked gene list. The ES of functional module f is defined as below:

$$ES_{f} = \max \left( {S_{i} } \right), i \in \left\{ {1, \ldots ,N} \right\}$$
$$S_{i} = \mathop \sum \limits_{j = 0}^{i} p \times \frac{1}{{N_{f} }} - \left( {1 - p} \right) \times \frac{1}{{\left( {N - N_{f} } \right)}},\;\;where\;\;p = \left\{ {\begin{array}{*{20}c} {0 } \\ {1, \;if\;gene\; i \in f} \\ \end{array} } \right.$$

where Si is the score of gene i, i is ordered by fold change, Nf is the number of genes in the tested functional module, N is the number of total ranked genes, and p is a binary parameter. To estimate the significance of ESf, we produced 1000 scores ESrand calculated from 1000 randomly permutated gene lists. Then, we denoted the standard score z, which was defined as below, as the activity or inactivity of functional module f.

$$z = \frac{{ES_{f} - \mu }}{\sigma }$$

where $$\mu$$ and $$\sigma$$ are respectively the mean and standard deviation of 1000 ESrand. Finally, the functional modules possessing z of activity greater than two and z of inactivity less than zero were defined as activated; and the functional modules with z of inactivity greater than two and z of activity less than zero were defined as inactivated. In other words, since GSEA ranks genes according to their fold change, the activated functional modules are significantly enriched with up-regulated genes, while the inactivated ones are enriched with down-regulated genes. The discovered activated or inactivated functional modules were further summarized/clustered by the REVIGO32 algorithm with similarity ≥ 0.9 that was calculated from Resnik33 algorithm and visualized using the treemap package34.

To predict key transcription factors and cofactors that drive transcriptomic differences between microdeletion individuals and controls we used Mining Algorithm for GenetIc Controllers (MAGIC), which leverages ENCODE ChIP-seq data to look for statistical enrichment of transcription factors and cofactors in genes and flanking regions35.

## Results

### Transcriptional changes in 15q13.3 microdeletion individuals and Df[h15q13]/+ mice

To study the effects of the 15q13.3 microdeletion on transcriptional regulation, we performed RNA-seq from three individuals carrying a heterozygous 15q13.3 microdeletion (Fig. 1C) and four control subjects. Further, to identify robust changes across species and tissues, we analyzed cerebral cortex tissue from ten mice (Df[h15q13]/+) reported by Gordon et al.19 and Al-Absi et al.12. While in the 15q13.3 microdeletion subjects we identified 2334 genes (adjusted p-value < 0.05) with altered expression levels compared to controls (Additional file 2), only genes within the deleted region withstood multiple testing correction in the mouse models, with additional two genes in the data of Al-Absi and colleagues (Additional file 2). This could be related to a difference in synteny between the mouse and human chromosomal regions, to the high interindividual variability of our subjects, or to the generally mild impact on gene expression with genes not reaching the dysregulation threshold necessary to withstand conservative multiple testing correction. The immediate result of applying multiple testing correction is that the probability that a true effect may be rejected will increase36. To control for false positives, but also to avoid erroneously rejecting real effects we decided to focus on DEGs shared between human and mouse. We, thus, considered genes with uncorrected p-value < 0.05 in the mouse and identified 68 shared genes between human and the Gordon dataset and 322 shared with the dataset from Al-Absi (Additional file 2). In total there were 23 genes that were shared between all datasets, of which 4 were the genes included in the deletion (Fig. 2). To test whether the number of overlapping genes is higher than expected by chance we performed 100,000 random samplings considering a total of 20,000 genes. This yielded a p-value < 0.001 suggesting the overlap of 23 genes (Fig. 2), as well as the overlap of 19 genes not included in the deletion (p-value < 0.001) are significant. By contrast, when we considered DEGs among controls, only one gene were shared with the (Df[h15q13]/+) mouse models, which is an amount expected to occur by chance (p-value = 0.3 from 100,000 simulations).

To check whether the gene dosage affects gene expression, we identified genes located in the deleted site, which are expressed in blood and brain cortex (Additional file 3). Four genes have an expression higher than 1.5 TPMs in brain cortex15 (FAN1, MTMR10, KLF13, OTUD7A) of which MTMR10 and KLF13 are also highly expressed in blood (Additional file 3). These genes were significantly downregulated in both human and mouse samples (Additional file 2). Moreover, although FAN1 and OTUD7A display low expression levels in blood (Additional file 3), they were also significantly differentially expressed in our blood transcriptome analysis (Additional file 2).

We next focused on shared DEGs between 15q13.3 microdeletion individuals and the mouse models. Variants in four of these genes (KAT6A, CHD7, KMT2C, AP1S1) are known causes for monogenic NDD37. Considering differentially expressed genes shared with at least one of the mouse datasets there are ten genes which have a gene ontology (GO) annotation related to brain development (KDM7A, LRP6, PAFAH1B3, CDK5RAP3, B4GALT2, FPGS, FZD1, TGFBR2, IRS2, ATXN1) and also eight genes related to gene expression (EPAS1, HMG20A, EDA, TGFBR2, ATP7A, AFF1, ZFP36L1, LRRK2). Interestingly, we could also identify components of the major histocompatibility complex, class II to be dysregulated in both blood and brain (Additional file 2), which may reflect a disturbed inflammatory or immune process.

### Molecular pathways affected by transcriptome alterations in 15q13.3 microdeletion

To identify molecular pathways that may be affected by the gene expression profiles we performed GO enrichment analysis followed by protein–protein-interaction (PPI) networks, as previously described38,39. Using the mouse data, we identified general GO categories like cellular components or developmental processes, including neurodevelopment, to be enriched with DEGs (Additional file 4). For human subjects, there were two less general GO terms which were most significantly enriched with DEGs: “nervous system development” (GO:0007399, p-value after family wise error rate (FWER) multiple correction = 0.036) with 32 associated genes and “DNA binding” (GO:0003677, p-value FWER multiple correction = 0.046) with 183 associated genes (Table 1, Additional file 4).

FWER: family-wise error rate corrected p-value; #genes: number of DEGs involved in the function. For genes included in the nodes refer to Additional file 4.

To better delineate molecular pathways involved in the copy-number variant (CNV) pathomechanism, we further focused on identifying the functional modules formed by DEGs from human subjects and their PPI partners. This revealed that most nodes clustered in cellular processes like metabolic pathways, signaling, or cellular components (Fig. 3, Additional file 4). Since regulation of gene expression appeared to be perturbed, we tested if the gene expression profile matches dysregulation of one or more transcription factors based on ENCODE Chip-seq data35. This analysis revealed no significant enrichment for genes associated with a known transcriptional regulator, suggesting that a single gene cannot explain the observed expression profile. Interestingly, inactivated functional modules clusters are mainly involved in immune response and regulation of gene expression (Fig. 3B, Additional file 4). Oligodendrocyte differentiation and development appear to be affected, which together with the “positive regulation of neuron death” (Fig. 3A,B , Additional file 4) could explain the impaired nervous system development. To check whether those molecular pathways were specifically identified in the microdeletion individuals, we analyzed differential gene expression between two control groups. GO terms significantly enriched with DEGs of the control groups were mostly related to immune response, and to a much lesser extent to gene expression regulation (Additional file 4). Thus, the identification of those molecular pathways in the individuals bearing 15q13.3 microdeletion may not be related to the deleted region, but rather to the analyzed tissue. In contrast, we did not identify any GO terms related to nervous system development in the controls. This supports our hypothesis that the effect on pathways related to nervous system development in the affected individuals is a result of the microdeletion.

We showed that affected genes were enriched in pathways related to nervous system development (Table 1) and that PPIs influence apoptosis and neuron death (Fig. 3B). This prompted us to inquire all DEGs that have known Mendelian associations with monogenic NDD. We identified 252 of the DEGs to be related to monogenic intellectual disability (Fig. 3C, Additional file 2). The number of genes is significantly higher than expected by chance (p-value binomial test = 0.003), which could suggest an underlying polygenic effect that leads to an increased risk for a neurodevelopmental disorder.

### Dysregulated genes in 15q13.3 microdeletion individuals expressed in the developing brain

Further, we asked whether blood DEGs, which are not expressed in the adult brain cortex, may have been expressed in the developing brain. We identified 358 DEGs, which are expressed in blood but not in the adult brain. We used the ABAEnrichment package in R25 to check the expression levels of these genes during the different stages of brain development (Additional file 5). For 245 of the 358 genes, we could retrieve expression levels from the Allen Brain Atlas. Our analysis revealed that for DEGs, which are silenced in the adult brain, there is a significant enrichment for the ones with a maximum expression level in the developing brain (p-value = 0.04, Fig. 4). A GO enrichment analysis of the 53 genes showed that several of these genes are involved in chromosome organization during cell division, but also identified e.g. the DRAXIN gene to be dysregulated, which is involved in the development of spinal cord (Additional file 5).

## Discussion

The 15q13.3 microdeletion is associated with pleiotropic effects and has been described in a wide spectrum of clinical contexts ranging from apparently healthy individuals to some severely affected with ID, epilepsy or even schizophrenia (Fig. 1B)2. It is difficult to dissect the mechanisms contributing to the nervous system developmental disturbance mostly because of the limitations of in vitro approaches aiming to reproduce human brain development10. Thus, the etiology of the 15q13.3 microdeletion’s range of hypervariable symptoms remains elusive.

To understand how dysregulation of gene expression contributes to 15q13.3 microdeletion pathomechanisms, we aimed to circumvent artefacts introduced by conventional in vitro approaches. Thus, we analyzed transcriptional profiles from three individuals with 15q13.3 microdeletion in an in vivo state in blood, which is an easily accessible tissue. To identify genes, which are robustly dysregulated we additionally analyzed brain cortex tissue from a 15q13.3 microdeletion mouse model.

We initially checked for dosage effects of the genes included in the microdeletion (Fig. 1A). This revealed that there are four genes with high expression in brain cortex (Additional file 3)15: FAN1, MTMR10, KLF13, OTUD7A, all of which showed significant down-regulation in blood (Additional file 2) of our subjects, as well as mouse brain cortex, confirming that a gene dosage effect of the microdeletion contributes to the transcriptional dysregulation. Other genes included in the typical deletion region: TRPM1, CHRNA7, MIR211, ARHGAP11B, display low expression levels in brain cortex and blood (Additional file 3) and were not significantly differentially expressed in the 15q13.3 microdeletion individuals. While MIR211 is a microRNA, which was not sequenced probably as a result of the library preparation protocol and ARHGAP11B is a human specific gene37, Trpm1 and Chrna7 were down-regulated in mouse brain cortex (Additional file 2), further supporting the importance of the haploinsufficiency.

We next focused on dysregulated genes across the two different tissues in the human subjects and the mouse model. One of the shared down-regulated genes is CHD7, which is frequently associated with CHARGE syndrome and has been shown to be highly relevant for neuronal differentiation and brain development40. CDH7, but also other shared dysregulated genes like KMT2C and KAT6A are involved in chromatin remodeling and hence in regulation of gene expression. This is in accordance with the findings of Zhang et al., who described a global epigenomic reprogramming of iPSCs from 15q13.3 microdeletion individuals13. However, in their approach they were not able to identify driving factors, potentially secondary to the bias induced by in vitro cultivation. This could also explain why they did not identify any significant shared dysregulations with the mouse model (Additional file 6) and their multiomics approach yielded a rather low correlation level among the multiple analyses. Interestingly, the overlap of shared differentially expressed genes in our blood RNA-seq analysis and the iPSCs was not significant, but there was a significant overlap to the iNs (p-value < 0.001, Additional file 6). This may suggest that the reprogramming of the iPSCs can influence the gene expression profile to a large extent, while after culturing the expression levels could be more similar to the status quo.

To identify affected molecular pathways we performed a GO analysis of DEGs. This showed a significant enrichment of DEGs that are involved in DNA binding (Table 1). Moreover, an analysis of functional modules formed by DEGs and their PPI partners confirmed that gene expression regulation is affected (Fig. 3B, Additional file 4). Yet, based on ENCODE data35, we did not identify any transcription factor that could explain the observed transcriptional profile. This is in accordance with the observation of Zhang et al. that the disease-relevant impact of the 15q13.3 microdeletion is probably caused by the combinatorial effects of several genes, rather than a single “master” gene. Our analysis showed network-wide dysregulatory effects and explains why knockout models of singular genes encompassed in the deletion could not fully recapitulate the phenotype3,4,5,6,7,8,9,10.

The GO analysis also revealed that DEGs show a significant enrichment in the nervous system development category (Table 1). Indeed, we could show that the set of dysregulated genes contained a significant proportion (p-value = 0.003) of genes which have been related to monogenic NDD (Fig. 3C, Additional file 2). The PPI functional module analysis identified more specific developmental processes of the nervous system like oligodendrocyte differentiation, axon and neuron projection development, as well as “positive regulation of neuron death” to be affected (Fig. 3A,B, Additional file 4). This aligns with experimental data from Df[h15q13]/ + mice, which show that both loss of OTUD7A and CHRNA7 contribute to dendrite outgrowth defects3,6, and also with the recently described involvement of Klf13 in the development of cortical interneurons10.

To determine whether other DEGs silenced in the adult human brain might have played a role in the nervous system development, we used the Allen Brain Atlas data, which provides brain gene expression data during different developmental stages25.This showed that a significant number of genes found to be differentially expressed in blood, but silenced in the adult brain had maximum expression levels in the prenatal stage (Fig. 4).

Our data suggests that network-wide dysregulatory effects contribute to 15q13.3 microdeletion pathomechanisms. There are several lines of evidence that indicate a disturbed nervous system development, suggesting the severity of symptoms of individuals with 15q13.3 microdeletion is probably determined in the early embryonic stages. The identification of dysregulated genes clustering in inflammatory and immune pathways may be related to the analyzed tissue, namely blood. However, since we identified components of the major histocompatibility complex, class II to be dysregulated in the mouse brain, we cannot rule out that immune insults could contribute to the increased vulnerability of 15q13.3 microdeletion-bearing offspring.

A major limitation of our study is the small cohort, in which individual-characteristic gene expression levels, which were not caused by the microdeletion, can have a big impact on the analysis. Additionally, while the human controls were gender-, ancestry-, and age-matched to the tested individuals, the lack of familial controls could introduce additional noise in the differentially expressed genes. We attempted to circumvent these limitations by comparing our data to the mouse model. However, a larger cohort and potentially the analysis of an additional tissue like skin, could further refine the analysis and unveil which gene network is mostly responsible for the pathomechanism. This is crucial for directing future efforts to minimize the severity of the phenotype.

## Data availability

RNA sequencing reads and expression profiles have been submitted to the Gene Expression Omnibus (http://www.ncbi.nlm.nih.gov/geo/) under accession number GSE197903. The code used for analyzing data has been deposited under https://github.com/akhilvelluva/15q13.3.

## References

1. Van Bon, B. W., Mefford, H. C. & de Vries, B. B. 15q133 Microdeletion (GeneReviews®, Seattle, 2015).

2. Lowther, C. et al. Delineating the 15q13.3 microdeletion phenotype: A case series and comprehensive review of the literature. Genet. Med. 17, 149–157 (2015).

3. Gillentine, M. A. et al. Functional consequences of CHRNA7 copy-number alterations in induced pluripotent stem cells and neural progenitor cells. Am. J. Hum. Genet. 101, 874–887 (2017).

4. Hoppman-Chaney, N., Wain, K., Seger, P. R., Superneau, D. W. & Hodge, J. C. Identification of single gene deletions at 15q13.3: Further evidence that CHRNA7 causes the 15q13.3 microdeletion syndrome phenotype. Clin. Genet. 83, 345–351 (2013).

5. Yin, J. et al. Otud7a knockout mice recapitulate many neurological features of 15q13.3 microdeletion syndrome. Am. J. Hum. Genet. 102, 296–308 (2018).

6. Uddin, M. et al. OTUD7A regulates neurodevelopmental phenotypes in the 15q13.3 microdeletion syndrome. Am. J. Hum. Genet. 102, 278–295 (2018).

7. Ionita-Laza, I. et al. Scan statistic-based analysis of exome sequencing data identifies FAN1 at 15q13.3 as a susceptibility gene for schizophrenia and autism. Proc. Natl. Acad. Sci. 111, 343–348 (2014).

8. Florio, M. et al. Human-specific gene ARHGAP11B promotes basal progenitor amplification and neocortex expansion. Science 347, 1465–1470 (2015).

9. Hori, T. et al. Mice with mutations in Trpm1, a gene in the locus of 15q13.3 microdeletion syndrome, display pronounced hyperactivity and decreased anxiety-like behavior. Mol. Brain 14, 1–16 (2021).

10. Malwade, S. et al. Identification of vulnerable interneuron subtypes in 15q13.3 microdeletion syndrome using single-cell transcriptomics. Biol. Psychiatry 91, 727–739 (2021).

11. Nilsson, S. R. O. et al. A mouse model of the 15q13.3 microdeletion syndrome shows prefrontal neurophysiological dysfunctions and attentional impairment. Psychopharmacology 233, 2151–2163 (2016).

12. Al-Absi, A.-R., Qvist, P., Glerup, S., Sanchez, C. & Nyengaard, J. R. Df(h15q13)/+ mouse model reveals loss of astrocytes and synaptic-related changes of the excitatory and inhibitory circuits in the medial prefrontal cortex. Cereb. Cortex 31, 1609–1621. https://doi.org/10.1093/cercor/bhaa313 (2021).

13. Zhang, S. et al. Network effects of the 15q13.3 microdeletion on the transcriptome and epigenome in human-induced neurons. Biol. Psychiatry 89, 497–509 (2021).

14. Solomon, E. et al. Global transcriptome profile of the developmental principles of in vitro iPSC-to-motor neuron differentiation. BMC Mol. Cell Biol. 22, 1–21 (2021).

15. Velluva, A. et al. Phenotype-tissue expression and exploration (PTEE) resource facilitates the choice of tissue for RNA-seq-based clinical genetics studies. BMC Genom. 22, 802 (2021).

16. Frésard, L. et al. Identification of rare-disease genes using blood transcriptome sequencing and large control cohorts. Nat. Med. 25, 911–919 (2019).

17. Curry, P. D. K., Broda, K. L. & Carroll, C. J. The role of RNA-sequencing as a new genetic diagnosis tool. Curr. Genet. Med. Rep. 9, 13–21 (2021).

18. Zacher, P. et al. The genetic landscape of intellectual disability and epilepsy in adults and the elderly: A systematic genetic work-up of 150 individuals. Genet. Med. 23, 1492–1497 (2021).

19. Gordon, A. et al. Transcriptomic networks implicate neuronal energetic abnormalities in three mouse models harboring autism and schizophrenia-associated mutations. Mol. Psychiatry 26, 1520–1534 (2021).

20. Dobin, A. et al. STAR: Ultrafast universal RNA-seq aligner. Bioinformatics 29, 15–21 (2013).

21. Anders, S., Pyl, P. T. & Huber, W. HTSeq-a python framework to work with high-throughput sequencing data. Bioinformatics 31, 166–169 (2015).

22. Love, M. I., Huber, W. & Anders, S. Moderated estimation of fold change and dispersion for RNA-seq data with DESeq2. Genome Biol. 15, 1–21 (2014).

23. Benjamini, Y., Drai, D., Elmer, G., Kafkafi, N. & Golani, I. Controlling the false discovery rate in behavior genetics research. Behav. Brain Res. 125, 279–284 (2001).

24. Marini, F. & Binder, H. PcaExplorer: An R/Bioconductor package for interacting with RNA-seq principal components. BMC Bioinform. 20, 1–8 (2019).

25. Grote, S., Prüfer, K., Kelso, J. & Dannemann, M. ABAEnrichment: An R package to test for gene set expression enrichment in the adult and developing human brain. Bioinformatics 32, 3201–3203 (2016).

26. Team RC. R: A Language and Environment for Statistical Computing. Vienna, Austria (2013).

27. Grote S. GOfuncR: Gene ontology enrichment using FUNC. R package version 151 (2018).

28. Li, T. et al. A scored human protein–protein interaction network to catalyze genomic interpretation. Nat. Methods 14, 61–64 (2016).

29. Ashburner, M. et al. Gene ontology: Tool for the unification of biology. Nat. Genet. 25, 25–29 (2000).

30. Carbon, S. et al. The gene ontology resource: 20 years and still GOing strong. Nucleic Acids Res. 47, D330–D338 (2019).

31. Subramanian, A. et al. Gene set enrichment analysis: A knowledge-based approach for interpreting genome-wide expression profiles. Proc. Natl. Acad. Sci. 102, 15545–15550 (2005).

32. Supek, F., Bošnjak, M., Škunca, N. & Šmuc, T. Revigo summarizes and visualizes long lists of gene ontology terms. PLoS ONE 6, e21800 (2011).

33. Resnik, P. Semantic similarity in a taxonomy: An information-based measure and its application to problems of ambiguity in natural language. J. Artif. Intell. Res. 11, 95–130 (1999).

34. Tennekes M. Package “treemap” Type Package Title Treemap Visualization (2021).

35. Roopra, A. MAgIC: A tool for predicting transcription factors and cofactors driving gene sets using ENCODE data. PLoS Comput. Biol. 16, e1007800 (2020).

36. Groenwold, R. H. H., Goeman, J. J. & Le Cessie, S. Multiple testing: When is many too much?. Eur. J. Endocrinol. 184, E11–E14 (2021).

37. Xing, L. et al. Expression of human-specific ARHGAP11B in mice leads to neocortex expansion and increased memory flexibility. EMBO J. 40, e107093 (2021).

38. Jäger, E. et al. Dendritic cells regulate GPR34 through mitogenic signals and undergo apoptosis in its absence. J. Immunol. 196, 2504–2513 (2016).

39. Le Duc, D. et al. Reduced lipolysis in lipoma phenocopies lipid accumulation in obesity. Int. J. Obes. 45, 565–576 (2021).

40. Feng, W. et al. Chd7 is indispensable for mammalian brain development through activation of a neuronal differentiation programme. Nat. Commun. 8, 1–14 (2017).

41. Karczewski, K. J. et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature 581, 434–443 (2020).

## Acknowledgements

We are grateful that our patients and their families agreed to participate in this study. We thank Sandra Schinkel, Kathleen Lehmann, and Sophie Behrendt for their great technical assistance and Rigo Schulz for server support. We are grateful to Torsten Schöneberg for his input and help to draw Fig. 1.

## Funding

Open Access funding enabled and organized by Projekt DEAL. This study is funded by the Else Kröner-Fresenius-Stiftung 2020_EKEA.42 to D.L.D. and the German Research Foundation SFB 1052 project B10 to D.L.D. and A.G.. D.L.D. and A.M. are funded through the “Clinician Scientist Programm, Medizinische Fakultät der Universität Leipzig”.

## Author information

Authors

### Contributions

M.K. performed gene expression analyses, contributed to the design of the study and writing of the manuscript. A.V. and L.B. supported bioinformatic analyses and performed G.O. enrichment analyses. M.R. performed wet lab work and contributed to the design of the study. C.C.L. performed P.P.I. analyses and contributed to manuscript writing. PZ recruited patients, performed phenotyping, and coordinated the genetic diagnosis. T.B., A.T., K.P., and J.H. performed genetic diagnosis and contributed to the writing of the manuscript. A.K., A.M., N.S., T.L., K.P., J.R.L., and A.G. contributed to analysis design, data interpretation, and writing of the manuscript. R.A.J. and D.L.D. designed the study, coordinated the contact to patients, contributed to genetic diagnosis, gene expression analysis, and writing of the manuscript.

### Corresponding authors

Correspondence to Rami Abou Jamra or Diana Le Duc.

## Ethics declarations

### Competing interests

The authors declare no competing interests.

### Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

## Rights and permissions

Reprints and Permissions

Körner, M.B., Velluva, A., Bundalian, L. et al. Altered gene expression profiles impair the nervous system development in individuals with 15q13.3 microdeletion. Sci Rep 12, 13507 (2022). https://doi.org/10.1038/s41598-022-17604-2

• Accepted:

• Published:

• DOI: https://doi.org/10.1038/s41598-022-17604-2