Exploring the biological role of postzygotic and germinal de novo mutations in ASD

Alonso-Gonzalez, A.; Calaza, M.; Amigo, J.; González-Peñas, J.; Martínez-Regueiro, R.; Fernández-Prieto, M.; Parellada, M.; Arango, C.; Rodriguez-Fontenla, Cristina; Carracedo, A.

doi:10.1038/s41598-020-79412-w

Download PDF

Article
Open access
Published: 11 January 2021

Exploring the biological role of postzygotic and germinal de novo mutations in ASD

A. Alonso-Gonzalez^1,2,
M. Calaza^1,2,
J. Amigo³,
J. González-Peñas⁴,
R. Martínez-Regueiro^1,2,
M. Fernández-Prieto^1,2,
M. Parellada⁴,
C. Arango⁴,
Cristina Rodriguez-Fontenla²^na1 &
…
A. Carracedo^1,2,3^na1

Scientific Reports volume 11, Article number: 319 (2021) Cite this article

1729 Accesses
4 Citations
9 Altmetric
Metrics details

Subjects

Abstract

De novo mutations (DNMs), including germinal and postzygotic mutations (PZMs), are a strong source of causality for Autism Spectrum Disorder (ASD). However, the biological processes involved behind them remain unexplored. Our aim was to detect DNMs (germinal and PZMs) in a Spanish ASD cohort (360 trios) and to explore their role across different biological hierarchies (gene, biological pathway, cell and brain areas) using bioinformatic approaches. For the majority of the analysis, a combined ASD cohort (N = 2171 trios) was created using previously published data by the Autism Sequencing Consortium (ASC). New plausible candidate genes for ASD such as FMR1 and NFIA were found. In addition, genes harboring PZMs were significantly enriched for miR-137 targets in comparison with germinal DNMs that were enriched in GO terms related to synaptic transmission. The expression pattern of genes with PZMs was restricted to early mid-fetal cortex. In contrast, the analysis of genes with germinal DNMs revealed a spatio-temporal window from early to mid-fetal development stages, with expression in the amygdala, cerebellum, cortex and striatum. These results provide evidence of the pathogenic role of PZMs and suggest the existence of distinct mechanisms between PZMs and germinal DNMs that are influencing ASD risk.

Rare coding variation provides insight into the genetic architecture and phenotypic context of autism

Article 18 August 2022

Nonrandom occurrence of multiple de novo coding variants in a proband indicates the existence of an oligogenic model in autism

Article 23 July 2019

Genomic analysis of 116 autism families strengthens known risk genes and highlights promising candidates

Article Open access 22 March 2024

Background

Autism Spectrum Disorder (ASD) is a neurodevelopmental disorder (NDD) characterized by deficits in communication and social interaction together with restricted interests and repetitive behaviors¹. ASD prevalence among children in the United States stands at around 1.5% and has rapidly risen in recent years. In addition to the core symptoms of ASD, other conditions such as epilepsy or intellectual disability are often present. Comorbidity is a characteristic of ASD that can appear at any time during child’s development. Since many ASD cases with comorbidity have a clear genetic background and early detection is key for intervention, the genetic diagnosis in this type of cases is a challenge².

Twin and family studies have estimated ASD heritability to be about 80% and subsequent genetic studies have demonstrated that the largest part of this heritability (50%) is explained by common variation^3,4. However, de novo rare genetic variation (minor allele frequency < 0.1%), including small insertions and deletions (indels), copy number variants and single nucleotide variants confers higher individual risk^5,6,7. Germinal de novo mutations (DNMs) occur within germ cells and they are transmitted to the offspring when the zygote is formed after fertilization. Thus, every single cell line of the resulting embryo will carry an identical genetic load. Another type of DNMs, postzygotic mutations (PZMs), arise during zygote mitosis, leading to a mosaic of genetically different cell lines⁸. The frequency of mutagenesis and the generation of PZMs is increased prior to gastrulation and neurogenesis⁹.

PZMs involved in ASD pathogenesis are usually detectable through deep sequencing of brain tissues. However, this technique often entails a huge challenge due to the inability to obtain ASD brain samples¹⁰. In contrast, next generation sequencing technologies can be used to detect mosaic mutations in peripheral blood of affected individuals by increasing the depth of coverage¹¹. Thus, it is possible to obtain enough sequencing reads containing the reference and the alternate allele to accurately calculate the alternate allele frequency (AAF)¹². In PZMs, the AAF value shifts from the expected 50/50 ratio for heterozygous germinal mutations. High coverage whole exome sequencing (WES) (depth > 200×) provides enough sensitivity to detect PZMs presenting AAF values as lower as 15%^13,14. It is worthy to note that most WES studies have missed PZMs due to the commonly employed pipelines. The development of new variant calling pipelines is therefore needed and some efforts have been done in this regard^15,16,17,18.

It has been estimated that 7.5% of DNMs are PZMs that contribute about 4% to the overall architecture of ASD. PZMs have been identified in high-confidence ASD risk genes. Other novel ASD candidate genes such as KLF16 and MSANTD2, were discovered after studying the contribution of PZMs to ASD risk in large collections of ASD probands¹⁸. This points to the fact that some genes carry a larger number of mutations in a mosaic state than other genes. In addition, a detailed analysis of non-synonymous PZMs has revealed that these variants are mainly found in brain-expressed genes and in Loss-of-function (LoF)-constrained exons. The spatio-temporal analysis across different developmental stages also points to brain areas, like the amygdala, that have not been previously highlighted by other WES studies in which PZMs were not considered^16,18.

The relevance of PZMs in the pathogenesis of ASD and the biological processes in which genes carrying PZMs are involved, remain largely unexplored. Moreover, the contribution of PZMs to the phenotypic presentation is another subject that should be studied in more detail using large-scale studies. Hence, it is suspected that ASD probands carrying mosaic mutations might be less affected than probands carrying germinal mutations as it happens in other NDDs such as Proteus syndrome or several brain malformations^19,20. Therefore, the main aim of this study was to accurately detect DNMs (germinal and PZMs) in a cohort of Spanish trios with ASD (360). The novel DNMs detected in the Spanish cohort were combined with a list of DNMs previously published by the Autism Sequencing Consortium (ASC)¹⁸ in a cohort of 5947 families (4032 ASD trios and 1918 quads) in order to study if different ASD risk genes tend to accumulate one or another type of mutations using different bioinformatic approaches. In addition, the different biological implications of germinal and PZMs in ASD were explored through enrichment analysis approaches, which have not been applied before to this class of mutations across different hierarchical levels (gene, GO terms, neuronal cell types and brain areas) (Additional file 4. Fig. S1).

Methods

Subjects

DNA was extracted from peripheral blood of the Spanish ASD samples (360 trios; unaffected parents and affected proband) using the GentraPuregene blood kit (Qiagen Inc., Valencia, CA, USA). Subjects from Santiago (N = 136) were recruited from Complexo Hospitalario Universitario de Santiago de Compostela and Galician ASD organizations. Subjects from Madrid (N = 224) were recruited as part of AMITEA program at the Child and Adolescent Department of Psychiatry, Hospital General Universitario Gregorio Marañón. Only individuals 3 years old or older were included. All participants had a clinical diagnosis of ASD made by trained pediatric neurologists or psychiatrists based on the Diagnostic and Statistical Manual of Mental Disorders, Fourth Edition Text Revision and Fifth Edition (DSM-IV-TR and DSM-5) criteria. The Autism Diagnostic Observation Schedule (ADOS) and the Autism Diagnostic Interview-Revised (ADI-R) were also administered when necessary. Informed consent signed by each participating subject or legal guardian and approval from the corresponding Research Ethics Committee were obtained before the start of the study. All participants, parents or legal representatives provided written informed consent at enrollment and the study was conducted according to the declaration of Helsinki.

Sample quality control and DNMs detection

Data processing and annotation

WES of DNA extracted from the Spanish 360 trios was performed by the ASC (https://genome.emory.edu/ASC/)²¹. One multi-sample VCF with the raw results was retrieved from the ASC. Individual files containing coding variants per individual were obtained using bcftools and were annotated using SnpEff (Genomic variant annotations and functional effect prediction toolbox) version 4.3 T (http://snpeff.sourceforge.net/).

Sample specific quality control

To check family relationships in the Spanish cohort (360 trios), information of Mendelian error counts was obtained using the "–mendel” option available in VCFtools (http://vcftools.sourceforge.net/). Samples whose Mendelian errors significantly deviated from the expectation were not considered for subsequent analysis.

To identify discrepancy between nominal designed and genetically determined sex the “–sexcheck” option in PLINK was used to infer correct sex from genotypes on chromosome X and Y.

Finally, to identify outlier samples in the Spanish cohort (360 trios), the “pseq i-stats” command in PLINK was used. Samples in which any of the following parameters: count of alternate, minor, heterozygous genotypes, number of called variants or genotyping rate deviated more than 4 SD from the mean were eliminated. Therefore, the whole trio was dropped if any member was considered an outlier.

The samples of the Spanish cohort (360 trios) that passed all the quality controls mentioned above were the same as those included in Satterstrom et al.²¹.

DNMs detection

To detect DNMs in the Spanish cohort (360 trios), defined as those mutations that are strictly present in probands and not in parents, the filtering options published by Lim et al. were employed¹⁸. In this study, variants classified as PZMs were resequenced by three different sequencing technologies reaching a high validation rate (87–97%).

Briefly, we define DNMs as those variants whose genotypes were 1/0 or 1/1 in probands and 0/0 in parents. Then, variants with GQ ≥ 20 and alternate read depth ≥ 7 were considered. Variants that present two or more alleles in the ExAC database (http://exac.broadinstitute.org/) were filtered out. Inframe indels were also filtered and only biallelic DNMs were considered. In addition, we filtered out variants that were less than 20 base pairs apart from each other to reduce false positives, and variants whose RVIS (Residual Variation Intolerance Score) retrieved from ExAC was higher than 75% were also filtered out. RVIS is designed to rank genes in terms of whether they have more or less common functional genetic variation relative to the genome-wide expectation given the amount of apparently neutral variation the gene has. Intolerant genes are more likely to be better candidates in NDDs. Thus, RVIS values represented as percentiles reflect the relative rank of the genes, with those genes above 75th percentile being the most tolerant and therefore less likely to harbor mutations with a role in ASD.

SnpEff was employed to classify exonic variants according to the definition of their predictive impact: high, moderate and low impact on the canonical transcript. Low impact variants included silent mutations, moderate impact variants included missense mutations and high impact included splicing and nonsense mutations. Only base substitutions were considered so frameshift variants were filtered out.

Two different in silico prediction tools (CADD and SIFT) were used to classify missense mutations. Probably damaging mutations were those predicted as damaging by SIFT and variants with CADD score > 20 (Additional file 1; Table S1).

Finally, DNMs were classified as germinal or PZMs based on the AAF (number of alternate reads/(total number of reference + alternate reads)). DNMs with an AAF ≥ 0.40 were classified as germinal and DNMs with an AAF < 0.40 were classified as PZMs¹⁸.

90 samples from the Spanish cohort (360 trios) were already analyzed by Lim et al.¹⁸ and they were used as positive controls to check if the detection of PZMs in the Spanish cohort was accurately made. Therefore, it was proved that most of DNMs were accurately detected and classified as germinal or PZMs (Additional file 1; Tables S1, S2).

For the majority of the analysis, we used a dataset called “combined cohort” (N = 2171) that includes the non-synonymous DNMs detected in the 360 Spanish trios plus the non-synonymous DNMs identified in individuals with ASD sequenced by the ASC and published previously. Duplicated variants in both cohorts were eliminated (Supplementary Table 3 of Lim et al.)¹⁸ (Additional file 1; Table S1 and S3). For some analysis, we also defined a control cohort of healthy siblings published by the ASC (same sequencing depth and variant calling procedures than the probands of the Spanish cohort) (N = 288)^1,18 (Additional file 1; Table S4).

Transmission and de novo association test (TADA-denovo)

TADA-Denovo (http://www.compgen.pitt.edu/TADA/TADA_guide.html#tada-analysis-of-de-novo-data-tada-denovo) was run to discover and to prioritize ASD risk genes for both DNMs (germinal and PZMs) in the Spanish cohort (N = 360) (Additional file 1; Table S1) and in the combined dataset (N = 2171) (Additional file 1; Table S3). TADA takes into account the mutational burden of the genes as well as the multiple mutational classes²². TADA was independently run in two different gene-sets for the Spanish cohort (genes harboring PZMs (PZMs genes) in the Spanish cohort = 105; genes harboring germinal DNMs (Germinal genes) in the Spanish cohort = 181) and for the combined cohort (PZMs genes in the combined cohort = 362; germinal genes in the combined cohort = 1210) (Additional file 2; Tables S5, S6, S7 and S8). The control cohort (N = 288) was employed to set up and to estimate the parameters needed by TADA¹⁸ (Additional file 1; Table S4).

Two classes of DNMs were included in the analysis: LoF and probably damaging missense mutations. To set up mutational rates for each mutational category, we used the per gene mutation rates table data computed by Samocha et al.²³ and then, the following formula was applied to calibrate them: LoF; (nonsense + splice) x (syn_obs/syn_exp) and probably damaging missense; missense × (N_{Prob.damaging} /N_allmissense) x (syn_obs/syn_exp). Syn_obs is the observed number of synonymous mutations in the control cohort of unaffected siblings (N = 119)¹⁸, and syn_exp is the expected number of synonymous DNMs in the same cohort calculated from the sum of per-gene synonymous DNMs rates (2*n*µ) (N = 79.04). N_{prob.damaging} is the number of probably damaging missense mutations in the control cohort (N = 212) and N _allmissense were the total of missense mutations in the control cohort (N = 296). To estimate the relative risk (γ) for each mutational category, we calculated the burden (ƛ) of mutations of each type in cases (Spanish cohort) over controls (LoF = 2.21; probably damaging missense = 1.36). Then we applied the following formula to calculate relative risk: ɣ = 1 + (ƛ − 1)/π, where π, the fraction of risk genes, was set as 0.05 (the default parameter). Finally, by running TADA-Denovo with the parameters described above, uncorrected p values for each gene were calculated obtaining null distributions (N repetitions = 10,000). TADA-Denovo computes BF (Bayesian Factor) to each gene. To determine an appropriate threshold that allows declaring a “significant gene”, TADA uses the Bayesian FDR approach to control for the rate of false discoveries. q-values for each gene were calculated using the Bayesian FDR approach provided by TADA. Manhattan plots which show the results of TADA p values (− log10) for the combined cohort (PZMs and germinal mutations) were done with R package qqman²⁴.

Genes with FDR < 0.1 (germinal genes) and FDR < 0.3 (PZMs genes) were classified according to SFARI criteria (https://gene.sfari.org/database/gene-scoring/). Moreover, OMIM database (Online Mendelian Inheritance in Man) (https://www.omim.org/) was consulted to search for Mendelian diseases related to these genes.

Gene-set enrichment analysis of PZMs and germinal mutations

Gene-set enrichment analyses of those genes carrying missense and nonsense DNMs (germinal and PZMs) was done by DNENRICH²⁵. DNENRICH estimates the enrichment of DNMs within pre-defined groups of genes accounting for gene size, tri-nucleotide context and functional effect of the mutations. DNMs included in this analysis were germinal and PZMs identified in the Spanish cohort (germinal DNMs = 236; PZMs = 164) (Additional file 3; Tables S9 and S10). For the analysis of the combined cohort, a subset of PZMs was created in order to ensure that the analyzed PZMs likely contribute to the phenotype (PZMs = 676) (Additional file 1; Table S11). For that purpose, individuals with germinal mutations in ASD risk genes (SFARI scores 1 and 2) were eliminated from the PZMs dataset. Thus, germinal DNMs and the subset of PZMs from the combined cohort were used in the analysis (germinal DNMs = 2270; PZMs = 676) (Additional file 3; Table S12 and S13). The analysis was also run independently in unaffected siblings using data previously published by the ASC (germinal DNMs = 780; PZMs = 239) (Additional file 1; Table S14).

The gene name alias and the gene size matrix provided by DNENRICH were used as input files used in this analysis together with the following gene-sets: (1) FMRP target genes identified by Darnell et al.²⁶ and downloaded from Genebook (http://zzz.bwh.harvard.edu/genebook/) (N = 788); (2) Genes included in the GO:0006325 chromatin organization (N = 723) (http://www.geneontology.org/); (3) Synaptic genes (N = 903)²⁷; (4) Human orthologs of genes essential in mice (N = 2472)²⁸; (5) CHD8 target genes in human mid fetal brain (N = 2725)²⁹; (6) List of SFARI genes (N = 990) (https://gene.sfari.org/autdb/HG_Home); (7) LoF intolerant genes (pLi > 0.9) (N = 3230)^29,30; 8) RBFOX target genes (N = 587)³¹; (9) miR-137 target genes (N = 428)³²; (10) CELF-4 target genes (N = 954)³³; (11) Allele biased genes in differentiating neurons (N = 802)³⁴; (12) Known intellectual disability genes (N = 1547)³⁵; (13) Intergenic and Intronic Brain Expressed Enhancers (BEE) (N = 673)³⁶; (14) Genomic intervals surrounding known telencephalon genes scanned for enhancers (N = 79)³⁷; and (15) miR-138 target genes (N = 255)³⁸. Empirical p values were obtained from one million permutations for each gene-set.

Gene ontology enrichment analysis

An exploratory GO enrichment analysis was carried out using the Enrichr tool. This analysis allows studying if genes harboring germinal DNMs or PZMs are involved in different biological processes. To this aim, the combined dataset of germinal missense and nonsense DNMs and the subset of PZMs were employed (germinal genes = 1972; PZMs genes = 624) (Additional file 3; Table S15). In addition, the REViGO tool (http://revigo.irb.hr/) was employed to visualize GO terms in semantic similarity-based scatterplots using SimRel as a semantic similarity measure. Thus, the top 30 enriched GO terms in each group (PZM vs germinal) were visualized using a modification of the R script provided by the REViGO online tool.

Network visualization of the top 50 enriched terms in each group of genes was performed with Enrichment Map, a Cytoscape (v.3.6.1) plugin for functional enrichment visualization³⁹. Each node represents a gene-set (GO term) and the size of the node is proportional to the number of genes participating in the GO term (overlap coefficient). Nodes were considered as connected when the overlap coefficient was greater than 0.7 and edge-width represents the overlap between gene-sets. The border-width of each node represents the corresponding p value for each GO term.

Expression cell-type enrichment analysis and expression analysis across brain regions and developmental periods

Expression Weighted Cell-type Enrichment (EWCE) method (https://github.com/NathanSkene/EWCE) was used to explore whether genes harboring germinal DNMs (N = 1972) and genes harboring PZMs (N = 624) (Additional file 3; Table S15) were differentially expressed across several neuronal cell types. EWCE involves testing whether the given genes in a target list have higher levels of expression in a given cell type compared to what is expected by chance. Brain single-cell transcriptomic data from Karolinska Institute (ctd_allKI) was used for the EWCE analysis. Brain regions included in the KI mouse super dataset are the neocortex, hippocampus, hypothalamus, striatum, and midbrain, as well as samples enriched for oligodendrocytes, dopaminergic neurons and cortical parvalbumin interneurons (total cells = 9970). Background gene-set comprises all human-mice orthologous. Probability distribution for our gene lists was calculated by randomly sampling 100,000 genes from the background set controlling for transcript length and GC content. Bootstrapping function was then applied on level 1 annotation.

pSI (specificity index statistic), an R package, was employed to study the expression of genes harboring PZMs and germinal mutations across different brain regions and neurodevelopmental periods^40,41. Lists of specifically expressed human genes (human.rda) obtained from BrainSpan data (gene-sets for 6 brain regions and gene-sets for 10 developmental periods) were employed. The Fisher iteration test included in the pSI package was used to analyze if the listed genes harboring PZMs and germinal mutations were significantly overrepresented. Brain areas significantly enriched with PZMs or germinal genes in specific developmental periods (p adjusted value < 0.05) were represented as a matrix. Biorender (https://biorender.com) was used to draw the brain images (Fig. 6).

Ethics approval and consent to participate

The corresponding Research Ethics Committee of Galicia approved our study: (Comité Ético de Investigación Galicia (the only IEC authorized in this autonomous region); Number: 2012/098; Approval: 28-June-2012; Title: Contribución a la búsqueda de las causas genéticas de los trastornos del espectro autista. All participants, parents or legal representatives provided written informed consent at enrollment and the study was conducted according to the declaration of Helsinki. The ASC data employed in this study were already published (https://doi.org/10.1038/nn.4598). The corresponding ethics committee has approved these genetic data.

Results

Transmission and de novo association test (TADA-denovo)

Transmission and De novo Association test (TADA) assesses if a gene is affecting ASD risk based on several parameters: the gene mutation rate, the recurrence of DNMs in the gene and the severity of the mutations²². Thus, TADA-Denovo analysis was independently run in both datasets (germinal and PZMs genes). The main aim of TADA-Denovo is to identify those genes that could be differentially involved in ASD etiology depending on the type of DNMs harbored by them (germinal or PZMs). We focused the analysis on damaging mutations (LoF and likely pathogenic missense variants) to increase the likelihood of finding “strong” candidate genes. First, the set of genes from the Spanish cohort (360 trios) (germinal genes = 181; PZMs genes = 105) was analyzed. The analysis of the germinal gene list identified 12 genes with an FDR < 0.3 (Table 1 and Additional file 2; Table S16) including 3 genes (SCN2A, ARID1B and CHD8) with an FDR < 0.1. The analysis of the PZMs gene list identified 13 genes with an FDR < 0.3 (Table 1 and Additional file 2; Table S17) of which 4 genes (KMT2C, FRG1, GRIN2B and MAP2K3) had an FDR < 0.1.

Table 1 ASD risk genes carrying germinal DNMs and PZMs in the Spanish cohort.

Full size table

In the combined cohort (genes from the Spanish cohort plus genes from the Lim et al. publication¹⁸ (germinal genes = 1210; PZMs genes = 362) TADA identified 34 genes with an FDR < 0.1 (Table 2 and Fig. 1a) and 103 genes with an FDR < 0.3 (Additional file 2; Table S18). Three of the genes (SCN2A, ARID1B, CHD8) with germinal DNMs were prioritized (FDR < 0.1) both in the combined cohort and in the Spanish cohort when TADA was employed. Analysis of PZMs genes in the combined cohort identified three genes (FRG1, KMT2C and NFIA) with an FDR < 0.1, and 14 genes with an FDR < 0.3 (Table 2 and Fig. 1b; Additional file 2; Table S19). Only two genes, KMT2C and FRG1, have remained significant after FDR correction (< 0.1) in both the Spanish and the combined cohort.

Table 2 ASD risk genes carrying germinal DNMs and PZMs in the combined cohort.

Full size table

A total of 17 genes (50%) from the set of germinal genes in the combined cohort (34 genes, FDR < 0.1) were identified as “high confidence” or “strong” ASD candidates following SFARI Gene scoring criteria (scores 1, 2, 1 s and 2 s). In addition, 11 of these genes (64, 70%) have shown an FDR < 0.1 in previous TADA analysis⁶. Moreover, 10 of the remaining genes identified by TADA (FDR < 0.1) are included in SFARI gene lists (scores 3, 4 and 5) and 5 of the genes identified by TADA were reported in relation with another disease (not ASD) by OMIM database (Additional file 4; Table S20).

PZM analysis has shown association of KMT2C (SFARI score s2) as well as other 3 genes FDR < 0.1). It is worth to note that NFIA has been previously reported as a plausible candidate gene in ASD (SFARI score 4) but this is the first time that FRG1 is reported in ASD. SMARCA4, PRKDC, KLF16, GRIN2B and HNRNPU (SFARI score 3 and 4) were among those plausible ASD candidate genes previously identified with an FDR value between 0.1 and 0.3. GRIN2B was previously reported by SFARI as a strong ASD risk gene (score 1) (Additional file 4; Table S21).

Gene-set enrichment analysis of PZMs and germinal mutations

DNENRICH was run to estimate a statistical significance of enrichment for germinal and PZMs within previously ASD and NDDs associated gene-sets. Synonymous mutations were excluded from the analysis because they are unlikely to contribute to ASD phenotype and only nonsense and missense mutations were considered. First, gene-set enrichment analysis was performed using the list of genes and DNMs (germinal and PZMs) from the Spanish cohort (Additional file 3; Tables S9 and S10) against several background gene lists (see “Methods” section). Our results indicate that germinal genes shown enrichment in several gene-sets (germinal genes = 228, germinal DNMs = 236): FMRP target genes (p value = 0.003), known intellectual disability genes (p value = 0.0073), LoF intolerant genes (p value = 0.002), SFARI genes (p value = 1 × 10^–7) and genes involved in chromatin organization (p value = 0.00018) (Table 3). However, only the LoF intolerant gene-set has shown association with the list of PZMs genes (PZMs genes = 155, PZMs = 164) (Table 3).

Table 3 Results of the gene-set enrichment analysis for the list of genes harboring germinal DNMs and PZMs (Spanish cohort).

Full size table

DNENRICH analysis in the combined cohort demonstrated enrichment for several gene-sets for both germinal and PZMs genes: chromatin organization, SFARI genes, LoF intolerant genes, CHD8 target genes and essential genes. In addition, the germinal gene list (germinal genes = 1972, germinal DNMs = 2270) showed enrichment for FMRP target genes (p value = 1 × 10^–6), known intellectual disability genes (p value = 1 × 10^–6) and synaptic genes (p value = 4 × 10^–6) (Table 4, Fig. 2). PZMs genes (PZMs genes = 624, PZMs = 676), have only shown association in the case of the miR-137 target gene-set (p value = 0.0019) (Table 4, Fig. 2). The same analysis was performed using the list of germinal genes from unaffected siblings (germinal genes = 744, germinal DNMs = 780; PZMs genes = 237, PZMs = 239) (Additional file 1; Table S14). A significant enrichment was identified only with the FMRP targets gene-set (data not shown).

Table 4 Results of the gene-set enrichment analysis for the list of genes harboring germinal DNMs and PZMs (combined cohort).

Full size table

Gene ontology enrichment analysis

GO enrichment analysis revealed remarkable differences between germinal and PZMs gene lists from the combined cohort (Additional file 3; Table S15). The germinal set showed a significant enrichment in different GO terms related to synaptic function and transcription regulation. In particular, it is worth to note the association of GO terms related to ion transport: GO:0006814, Benjamini-Hochberg-corrected p[Pbh] = 0.005; GO0035725, Pbh = 0.005 and GO0006816, Pbh = 0.006 (Additional file 4; Table S22, Fig. 3a). The GO terms enriched in the subset of PZMs are related to regulation of gene expression, biosynthesis, differentiation or migration: GO0010629, p[Pbh] = 0.074; GO:2000113, p[Pbh] = 0.092; GO:0045652, p[Pbh] = 0.092; GO:0030336, p[Pbh] = 0.0995. (Additional file 4; Table S23, Fig. 3b).

GO enrichment analysis was depicted by semantically clustering the top 50 enriched terms for germinal and PZMs gene lists. The germinal gene list resulted in three differentiated clusters: neuron development and differentiation, synaptic functions, and chromatin modifications. The existence of a fourth cluster, which includes terms related to embryonic development, was also highlighted (Fig. 4a). In the case of PZMs genes, all the clusters were partially related to each other. However, we identified another cluster that includes terms related to the regulation of core processes (e.g., protein phosphorylation, regulation of growth, negative regulation of cellular biosynthetic processes, positive regulation of transcription DNA template). It is also important to highlight the GO terms related to neuron and embryonic development (Fig. 4b).

Expression cell-type enrichment analysis and expression analysis across brain regions and developmental periods.

First, we examined whether germinal genes or PZMs genes from the combined cohort, were differentially expressed in the transcriptome dataset corresponding to level 1 cell types. As expected, germinal genes were significantly enriched in several cell types (Additional file 3; Table S24; Fig. 5). The most enriched cell types were those related to neurotransmission (dopaminergic neuroblast; p value < 0.0001; embryonic dopaminergic neurons, p value < 0.0001; embryonic GABAergic neurons, p value < 0.0001; serotonergic neurons, p value < 0.0001). The PZMs gene list showed enrichment for three different cell types: pyramidal CA1 neurons; p value = 0.0066, pyramidal somatosensory (SS); p value = 0.0152, embryonic midbrain nucleus neurons; p value = 0.0185 (Additional file 3; Table S25, Fig. 5).

To gain insight into the spatiotemporal distribution, we analyzed the expression of germinal and PZMs genes (Additional file 3; Table S15) across several brain regions and different neurodevelopmental periods obtained from BrainSpan. Germinal genes were significantly expressed in the cortex, striatum, cerebellum and amygdala in prenatal stages (early, early mid and late) (Additional file 3; Table S26, Fig. 6a,c). PZMs genes were significantly expressed in the cortex during the early mid-fetal period. Although we did not find found a significant enrichment in other brain areas or neurodevelopmental periods for PZMs, p values close to the significance threshold were found in cortex (early, late mid-fetal) and amygdala (late mid-fetal) (Additional file 3; Table S27, Fig. 6b,d).

Discussion

ASD is a complex NDD, characterized by its clinical and genetic heterogeneity. It is estimated that over 1000 genes are involved in causing ASD but it is difficult to track them down and to functionally characterize them. Thus, it is widely believed that the vast majority of ASD genetic factors remain largely unknown⁶. In addition, DNMs are a strong source of genetic causality in ASD even though they have been usually identified in a germinal state. This means that most of the ASD genetic studies do not consider DNMs that appear post-zygotically. Recent studies have suggested that some ASD candidate genes could carry more mosaic mutations than others^15,16,17,18. However, the phenotypic effect of PZMs and its impact in clinical diagnosis is not completely understood. A recent study showed that brain malformations can occur when around 10% of peripheral blood cells harbor the mosaic mutation⁴². In addition, mosaic mutations could act as phenotype modifiers, resulting in patients who have less severe symptoms⁴³.

This study has explored the possibility of finding new genetic risk factors in ASD when PZMs are also considered as well as it was focused on the deepening of their biological role. Three genes were associated in the PZM analysis done by TADA (q-value < 0.1; FRG1, KMT2C and NFIA) in the combined cohort. FRG1 has not been previously associated with NDDs while both, KMT2C and NFIA, have been previously reported as possible implicated in ASD and intellectual disability^44,45. However, none of these genes were associated to ASD in the germinal analysis, which seems to point that they tend to differentially harbor PZMs.

Facioscapulohumeral muscular dystrophy (FSHD) region gene 1 (FRG1) (also other aliases) encodes a cytoplasmic protein involved in muscular and vascular development. FRG1 mapped 100 kb centromeric of the repeated units on chromosome 4q35 that contribute to the development of Facioscapulohumeral Muscular Dystrophy (FSHD, OMIM #158900)⁴⁶. Although the molecular pathogenesis of FSHD remains unveiled, it is a muscular disease that shows comorbidity with several neurological symptoms such as epilepsy or intellectual disability⁴⁷. FRG1 can act as a splicing regulator and regulates the expression of Rbfox1, an RNA-binding protein that it is involved in neuronal migration and synapse formation. Mutations in RBFOX1 have been associated with ASD^48,49. Our study reveals the association of FRG1 in ASD when PZMs are considered. FRG1 association was not reported in the previous Lim et al. publication because it can be explained by novel nonsense PZM found in the Spanish cohort (Additional File 1, Table S1). However, this association should be interpreted with caution because FRG1 is not constraint for truncating (pLI = 0) or for missense variants (Z = 0.18).

KMT2C (lysine (K)-specific methyltransferase 2C) encodes a methyltransferase that regulates gene transcription. De novo LoF mutations in KMT2C have been detected in individuals with Kleefstra syndrome 2 (OMIM #617768). Kleefstra syndrome is caused by haploinsufficiency of the euchromatin histone methyltransferase 1 (EHMT1) and characterized by ASD and other comorbid symptomatology as intellectual disability and a delayed psychomotor development. These symptoms could be explained by a molecular interplay between KMT2C and EHMT1 both involved in the regulation of synaptic plasticity in the adult brain⁴⁴. WES focused on the detection of PZMs using new bioinformatic pipelines have found mosaic mutations within KMT2C¹⁵. These previous findings and the association of KMT2C in our study seem to point that some genes previously identified as high-confidence ASD risk genes can also harbor PZMs.

The third gene associated in the PZM analysis is NFIA (Nuclear Factor I A). It encodes a member of the NF1 (nuclear factor 1) family of transcription factors determining the regulation of gliogenesis and other neuronal processes⁵⁰. Although two DNMs in NFIA have been previously reported in patients with intellectual disability, there is no reliable evidence to consider this gene as a high-confidence ASD risk gene⁵¹. However, it is worth noting that an NFIA-related disorder caused by inherited or DNMs has been reported. This syndrome is characterized by brain malformations and it is usually accompanied by renal urinary tract defects (OMIM #613735)⁵². The clinical presentation of the syndrome is highly variable and rarely all the described features are present in one individual⁵³. To our knowledge, this is the first time that NFIA is pointed as a plausible candidate gene in ASD. This association would have been dismissed without the analysis of PZMs.

The fact that different genes across the genome carry different types of DNMs (germinal and/or PZM), could be due to the fact that some mutations can be lethal in a germinal state but not in a mosaic state. This is the case, for example, of Rett syndrome, in which MECP2 mutations are lethal in males and dominant in females, but in a few cases, mosaic mutations have been reported to be compatible with male viability⁵⁴. However, our results, interpreted in conjunction with previous genetic and phenotypic studies, seem to point to other reasonable hypothesis: clinical manifestations caused by PZMs are usually less severe than those due to germinal mutations. For instance, Kleefstra syndrome is caused by germinal LoF DNMs in KMT2C but, neverthless, PZMs cause a milder phenotype presentation⁵⁵. Thus, patients with less severe phenotypes and core ASD features are overrepresented in our cohort, allowing the detection of KMT2C association in the PZM analysis done with TADA, while patients carrying germinal DNMs in the same gene might have been excluded for presenting a syndromic form. For the same reason, NFIA could appear associated with ASD only when the mutations are mosaic.

From the biological point of view, the gene-set enrichment analysis done with both lists of genes harboring PZMs and germinal mutations both in the Spanish and in the combined cohort showed a significant enrichment for several gene-sets previously involved in ASD pathogenesis: FMRP targets, genes involved in chromatin organization and high-confidence genes from SFARI⁶. Curiously, only the gene-set enrichment analysis for genes harboring PZMs (combined cohort) showed a significant enrichment for miR-137 targets. The fact that this association was found in the combined cohort and not in the Spanish cohort could be mainly due to the inclusion of a greater number of PZMs and a subsequent increase of the statistical power. miR-137 is a non-coding RNA with a critical role during brain development⁵⁶. The expression of miR-137 is crucial to maintain the processes of neuronal differentiation and proliferation so it is involved in neuronal maturation, dendrite development and synaptogenesis^57,58,59. In addition, common genetic variants within the gene that encodes miR-137 have been associated with ASD and schizophrenia^60,61. In fact, the complete loss of miR-137 in mice is lethal, but a partial loss results in a phenotype that reproduces some of the core symptoms of ASD, such as repetitive behavior and impaired social functions⁶².

The GO enrichment analysis also points out to different biological functions for those genes harboring germinal DNMs and PZMs^18,63 _.Thus, GO terms mainly related to ion transport and modulation of the synaptic function were highlighted in germinal genes. However, PZMs genes were enriched in GO terms mainly related to the negative regulation of gene expression. The involvement of PZMs in gene regulation processes and not in fundamental biological processes such as synaptic transmission also supports the hypothesis that PZMs can cause milder clinical manifestations.

In order to provide a novel insight of the biological role of germinal DNMs versus PZMs in other biological hierarchies, two types of analysis have been carried out: a cell-type enrichment analysis and an spatio-temporal expression analysis across neurodevelopmental periods and brain areas. Germinal genes have shown a significant enrichment in both excitatory and inhibitory neurons while PZMs genes were mainly expressed in pyramidal neurons. In addition, it is worth to note that genes with germinal DNMs were expressed in both fetal and adult neurons across several brain areas (cortex, striatum, cerebellum and amygdala). However, the expression of genes carrying PZMs was restricted to the mid-fetal cortex. Interestingly, the study performed by Lim et al. highlighted the amygdala as the main brain area in which PZMs were enriched for expression. It should be considered that Lims study only includes PZMs within critical exons, in comparison with the current study that includes all genes carrying non-synonymous PZMs¹⁸ .

An overall view of our results suggests the existence of distinct biological mechanisms caused by PZMs and germinal genes that could influence ASD susceptibility in different manners. Thus, impaired neuronal communication linked to mutations in germinal genes support the theory that excitatory/inhibitory imbalance contributes to ASD^21,64. The fact that several brain areas are similarly affected during development might explain the clinical heterogeneity in ASD and the high comorbidity with other disorders such as epilepsy or intellectual disability in the presence of germinal variants. By contrast, although a replication in larger cohorts is needed, it has been suggested that individuals harboring PZMs might be less affected in terms of cognitive abilities. In these patients, some biological processes such as neurogenesis, neuronal migration and differentiation that occur within early developmental stages might be disrupted. However, this would only take place in some brain cells while the remaining cells will maintain a normal functioning. In agreement with this hypothesis, it was found that brains of children with ASD have shown patches of abnormal laminar organization that could be the result of an altered migration of a small fraction of cells⁶⁵.

Our results point out to a critical neurodevelopmental period during mid-fetal development in which the disruption of neurodevelopmental processes will take place. We demonstrated that ASD associated genes were expressed more often between mid-to-late fetal periods. In addition, although some genes harbor mutations in a mosaic state it is possible that the disruption of certain genes expressed during this crucial period would be enough to the later manifestation of ASD core symptoms.

Conclusions

In conclusion, this study provides an additional insight into the role of PZMs in ASD etiology. It supports previous evidence of a pathogenic role for PZMs and their contribution to ASD risk. Moreover, our results suggest that PZMs could be involved in different biological processes and neurodevelopmental stages that germinal mutations. It has been revealed that ASD risk genes can differentially harbor germinal and PZMs, which highlights the clinical importance of the detection of both types of mutations. In addition, the integration of the genetic information (germinal and PZM) together with the phenotypical characterization could be also helpful in clinical practice.

Data availability

All data generated during this study are included in this published article and its supplementary information files.

Abbreviations

AAF:: Alternate allele frequency
ADI-R:: Autism diagnostic interview-revised
ADOS:: Autism diagnostic observation schedule
ASD:: Autism spectrum disorder
BEE:: Brain expressed enhancers
BF:: Bayesian factor
DNMs:: De novo mutations
EWCE:: Expression weighted cell type enrichment
ExAC:: Exome aggregation consortium
FDR:: False discovery rate
FSHD:: Facioscapulohumeral muscular dystrophy
GO:: Gene ontology
GQ:: Genotype quality
LoF:: Loss of function
NDD:: Neurodevelopmental disorder
OMIM:: Online mendelian inheritance in man
pSI:: Specifity index statistic
PZMs:: Postzygotic mutations
TADA:: Transmission and de novo association test
VCF:: Variant call format
WES:: Whole exome sequencing

References

American Psychiatric Association. Diagnostic and Statistical Manual of Mental Disorders (American Psychiatric Publishing, Arlington, 2013).
Book Google Scholar
Lai, M.-C., Lombardo, M. V. & Baron-Cohen, S. Autism. Lancet Lond. Engl. 383, 896–910 (2014).
Article Google Scholar
Sandin, S. et al. The heritability of autism spectrum disorder. JAMA 318, 1182–1184 (2017).
Article PubMed PubMed Central Google Scholar
Gaugler, T. et al. Most genetic risk for autism resides with common variation. Nat. Genet. 46, 881–885 (2014).
Article CAS PubMed PubMed Central Google Scholar
Sanders, S. J. et al. Insights into autism spectrum disorder genomic architecture and biology from 71 risk loci. Neuron 87, 1215–1233 (2015).
Article CAS PubMed PubMed Central Google Scholar
De Rubeis, S. et al. Synaptic, transcriptional and chromatin genes disrupted in autism. Nature 515, 209–215 (2014).
Article PubMed PubMed Central CAS Google Scholar
Pinto, D. et al. Convergence of genes and cellular pathways dysregulated in autism spectrum disorders. Am. J. Hum. Genet. 94, 677–694 (2014).
Article CAS PubMed PubMed Central Google Scholar
Biesecker, L. G. & Spinner, N. B. A genomic view of mosaicism and human disease. Nat. Rev. Genet. 14, 307–320 (2013).
Article CAS PubMed Google Scholar
Bae, T. et al. Different mutational rates and mechanisms in human cells at pregastrulation and neurogenesis. Science 359, 550–555 (2018).
Article CAS PubMed ADS Google Scholar
D’Gama, A. M. et al. Targeted DNA sequencing from autism spectrum disorder brains implicates multiple genetic mechanisms. Neuron 88, 910–917 (2015).
Article MathSciNet PubMed PubMed Central CAS Google Scholar
Poduri, A., Evrony, G. D., Cai, X. & Walsh, C. A. Somatic mutation, genomic variation, and neurological disease. Science 341, 1237758 (2013).
Article PubMed PubMed Central CAS Google Scholar
D’Gama, A. M. & Walsh, C. A. Somatic mosaicism and neurodevelopmental disease. Nat. Neurosci. 21, 1504–1514 (2018).
Article PubMed CAS Google Scholar
Genovese, G. et al. Clonal hematopoiesis and blood-cancer risk inferred from blood DNA sequence. N. Engl. J. Med. 371, 2477–2487 (2014).
Article PubMed PubMed Central CAS Google Scholar
Pagnamenta, A. T. et al. Exome sequencing can detect pathogenic mosaic mutations present at low allele frequencies. J. Hum. Genet. 57, 70–72 (2012).
Article CAS PubMed Google Scholar
Freed, D. & Pevsner, J. The contribution of mosaic variants to autism spectrum disorder. PLoS Genet. 12, e1006245 (2016).
Article PubMed PubMed Central CAS Google Scholar
Dou, Y. et al. Postzygotic single-nucleotide mosaicisms contribute to the etiology of autism spectrum disorder and autistic traits and the origin of mutations. Hum. Mutat. 38, 1002–1013 (2017).
Article CAS PubMed PubMed Central Google Scholar
Krupp, D. R. et al. Exonic mosaic mutations contribute risk for autism spectrum disorder. Am. J. Hum. Genet. 101, 369–390 (2017).
Article CAS PubMed PubMed Central Google Scholar
Lim, E. T. et al. Rates, distribution and implications of postzygotic mosaic mutations in autism spectrum disorder. Nat. Neurosci. 20, 1217–1224 (2017).
Article CAS PubMed PubMed Central Google Scholar
Lindhurst, M. J. et al. A mosaic activating mutation in AKT1 associated with the Proteus syndrome. N. Engl. J. Med. 365, 611–619 (2011).
Article CAS PubMed PubMed Central Google Scholar
Lee, J. H. et al. De novo somatic mutations in components of the PI3K-AKT3-mTOR pathway cause hemimegalencephaly. Nat. Genet. 44, 941–945 (2012).
Article CAS PubMed PubMed Central Google Scholar
Satterstrom, F. K. et al. Large-scale exome sequencing study implicates both developmental and functional changes in the neurobiology of autism. Cell 180, 568–584 (2020).
CAS PubMed PubMed Central Google Scholar
He, X. et al. Integrated model of de novo and inherited genetic variants yields greater power to identify risk genes. PLoS Genet. 9, e1003671 (2013).
Article CAS PubMed PubMed Central Google Scholar
Samocha, K. E. et al. A framework for the interpretation of de novo mutation in human disease. Nat. Genet. 46, 944–950 (2014).
Article CAS PubMed PubMed Central Google Scholar
Turner, S. qqman: an R package for visualizing GWAS results using Q–Q and manhattan plots. J. Open Source Softw. 3, 731 (2018).
Article ADS Google Scholar
Fromer, M. et al. De novo mutations in schizophrenia implicate synaptic networks. Nature 506, 179–184 (2014).
Article CAS PubMed PubMed Central ADS Google Scholar
Darnell, J. C. et al. FMRP stalls ribosomal translocation on mRNAs linked to synaptic function and autism. Cell 146, 247–261 (2011).
Article CAS PubMed PubMed Central Google Scholar
Kirov, G. et al. De novo CNV analysis implicates specific abnormalities of postsynaptic signalling complexes in the pathogenesis of schizophrenia. Mol. Psychiatry 17, 142–153 (2012).
Article CAS PubMed Google Scholar
Georgi, B., Voight, B. F. & Bućan, M. From mouse to human: evolutionary genomics analysis of human orthologs of essential genes. PLoS Genet. 9, e1003484 (2013).
Article CAS PubMed PubMed Central Google Scholar
Cotney, J. et al. The autism-associated chromatin modifier CHD8 regulates other autism risk genes during human neurodevelopment. Nat. Commun. 6, 6404 (2015).
Article CAS PubMed ADS Google Scholar
Lek, M. et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 285–291 (2016).
Article CAS PubMed PubMed Central Google Scholar
Weyn-Vanhentenryck, S. M. et al. HITS-CLIP and integrative modeling define the Rbfox splicing-regulatory network linked to brain development and autism. Cell Rep. 6, 1139–1152 (2014).
Article CAS PubMed PubMed Central Google Scholar
Collins, A. L. et al. Transcriptional targets of the schizophrenia risk gene MIR137. Transl. Psychiatry 4, e404 (2014).
Article CAS PubMed PubMed Central Google Scholar
Wagnon, J. L. et al. CELF4 regulates translation and local abundance of a vast set of mRNAs, including genes associated with regulation of synaptic function. PLoS Genet. 8, e1003067 (2012).
Article CAS PubMed PubMed Central Google Scholar
Lin, M. et al. Allele-biased expression in differentiating human neurons: implications for neuropsychiatric disorders. PLoS ONE 7, e44017 (2012).
Article CAS PubMed PubMed Central ADS Google Scholar
Lelieveld, S. H. et al. Meta-analysis of 2,104 trios provides support for 10 new genes for intellectual disability. Nat. Neurosci. 19, 1194–1196 (2016).
Article CAS PubMed Google Scholar
Bartonicek, N. et al. Intergenic disease-associated regions are abundant in novel transcripts. Genome Biol. 18, 241 (2017).
Article CAS PubMed PubMed Central Google Scholar
Visel, A. et al. A high-resolution enhancer atlas of the developing telencephalon. Cell 152, 895–908 (2013).
Article CAS PubMed PubMed Central Google Scholar
Ching, A.-S. & Ahmad-Annuar, A. A perspective on the role of microRNA-128 regulation in mental and behavioral disorders. Front. Cell. Neurosci. 9, 465 (2015).
Article PubMed PubMed Central CAS Google Scholar
Merico, D., Isserlin, R., Stueker, O., Emili, A. & Bader, G. D. Enrichment map: a network-based method for gene-set enrichment visualization and interpretation. PLoS ONE 5, e13984 (2010).
Article PubMed PubMed Central ADS CAS Google Scholar
Dougherty, J. D., Schmidt, E. F., Nakajima, M. & Heintz, N. Analytical approaches to RNA profiling data for the identification of genes enriched in specific cells. Nucleic Acids Res. 38, 4218–4230 (2010).
Article CAS PubMed PubMed Central Google Scholar
Xu, X., Wells, A. B., O’Brien, D. R., Nehorai, A. & Dougherty, J. D. Cell type-specific expression analysis to identify putative cellular mechanisms for neurogenetic disorders. J. Neurosci. Off. J. Soc. Neurosci. 34, 1420–1431 (2014).
Article CAS Google Scholar
Jamuar, S. S. & Walsh, C. A. Somatic mutations in cerebral cortical malformations. N. Engl. J. Med. 371, 2038 (2014).
Article PubMed CAS Google Scholar
de Lange, I. M. et al. Mosaicism of de novo pathogenic SCN1A variants in epilepsy is a frequent phenomenon that correlates with variable phenotypes. Epilepsia 59, 690–703 (2018).
Article PubMed CAS Google Scholar
Koemans, T. S. et al. Functional convergence of histone methyltransferases EHMT1 and KMT2C involved in intellectual disability and autism spectrum disorder. PLoS Genet. 13, e1006864 (2017).
Article PubMed PubMed Central CAS Google Scholar
Schanze, I. et al. NFIB haploinsufficiency is associated with intellectual disability and macrocephaly. Am. J. Hum. Genet. 103, 752–768 (2018).
Article CAS PubMed PubMed Central Google Scholar
Hanel, M. L. et al. Facioscapulohumeral muscular dystrophy (FSHD) region gene 1 (FRG1) is a dynamic nuclear and sarcomeric protein. Differ. Res. Biol. Divers. 81, 107–118 (2011).
Article CAS Google Scholar
Saito, Y. et al. Facioscapulohumeral muscular dystrophy with severe mental retardation and epilepsy. Brain Dev. 29, 231–233 (2007).
Article PubMed Google Scholar
Pistoni, M. et al. Rbfox1 downregulation and altered calpain 3 splicing by FRG1 in a mouse model of Facioscapulohumeral muscular dystrophy (FSHD). PLoS Genet. 9, e1003186 (2013).
Article CAS PubMed PubMed Central Google Scholar
Hamada, N. et al. Essential role of the nuclear isoform of RBFOX1, a candidate gene for autism spectrum disorders, in the brain development. Sci. Rep. 6, 30805 (2016).
Article CAS PubMed PubMed Central ADS Google Scholar
Glasgow, S. M. et al. Glia-specific enhancers and chromatin structure regulate NFIA expression and glioma tumorigenesis. Nat. Neurosci. 20, 1520–1528 (2017).
Article CAS PubMed PubMed Central Google Scholar
Iossifov, I. et al. De novo gene disruptions in children on the autistic spectrum. Neuron 74, 285–299 (2012).
Article CAS PubMed PubMed Central Google Scholar
Lu, W. et al. NFIA haploinsufficiency is associated with a CNS malformation syndrome and urinary tract defects. PLoS Genet. 3, e80 (2007).
Article PubMed PubMed Central CAS Google Scholar
Revah-Politi, A. et al. Loss-of-function variants in NFIA provide further support that NFIA is a critical gene in 1p32-p31 deletion syndrome: a four patient series. Am. J. Med. Genet. A 173, 3158–3164 (2017).
Article CAS PubMed Google Scholar
Pieras, J. I. et al. Somatic mosaicism for Y120X mutation in the MECP2 gene causes atypical Rett syndrome in a male. Brain Dev. 33, 608–611 (2011).
Article PubMed ADS Google Scholar
Kramer, J. M. et al. Epigenetic regulation of learning and memory by drosophila EHMT/G9a. PLoS Biol. 9, e1000569 (2011).
Article CAS PubMed PubMed Central Google Scholar
Mahmoudi, E. & Cairns, M. J. MiR-137: an important player in neural development and neoplastic transformation. Mol. Psychiatry 22, 44–55 (2017).
Article CAS PubMed Google Scholar
Szulwach, K. E. et al. Cross talk between microRNA and epigenetic regulation in adult neurogenesis. J. Cell Biol. 189, 127–141 (2010).
Article CAS PubMed PubMed Central Google Scholar
Smrt, R. D. et al. MicroRNA miR-137 regulates neuronal maturation by targeting ubiquitin ligase mind bomb-1. Stem Cells Dayt. Ohio 28, 1060–1070 (2010).
Article CAS Google Scholar
He, E. et al. MIR137 schizophrenia-associated locus controls synaptic function by regulating synaptogenesis, synapse maturation and synaptic transmission. Hum. Mol. Genet. 27, 1879–1891 (2018).
Article CAS PubMed PubMed Central Google Scholar
Cross-Disorder Group of the Psychiatric Genomics Consortium. Identification of risk loci with shared effects on five major psychiatric disorders: a genome-wide analysis. Lancet Lond. Engl. 381, 1371–1379 (2013).
Article CAS Google Scholar
Ripke, S. et al. Genome-wide association analysis identifies 13 new risk loci for schizophrenia. Nat. Genet. 45, 1150–1159 (2013).
Article CAS PubMed PubMed Central Google Scholar
Cheng, Y. et al. Partial loss of psychiatric risk gene Mir137 in mice causes repetitive behavior and impairs sociability and learning via increased Pde10a. Nat. Neurosci. 21, 1689–1703 (2018).
Article CAS PubMed PubMed Central Google Scholar
Gompers, A. L. et al. Germline Chd8 haploinsufficiency alters brain development in mouse. Nat. Neurosci. 20, 1062–1073 (2017).
Article CAS PubMed PubMed Central Google Scholar
Nelson, S. B. & Valakh, V. Excitatory/inhibitory balance and circuit homeostasis in autism spectrum disorders. Neuron 87, 684–698 (2015).
Article CAS PubMed PubMed Central Google Scholar
Stoner, R. et al. Patches of disorganization in the neocortex of children with autism. N. Engl. J. Med. 370, 1209–1219 (2014).
Article CAS PubMed PubMed Central Google Scholar

Download references

Acknowledgements

We would like to warmly thank the ASC (Autism Sequencing Consortium) (https://genome.emory.edu/ASC/) that has sequenced the 360 Spanish trios.

Funding

AA-G was supported by Fundación María José Jove. CR-F was supported by a contract from the FEDER. Instituto de Salud Carlos III/PI1900809/Cofinanciado FEDER supported this study.

Author information

These authors contributed equally: María Cristina Rodriguez-Fontenla and A. Carracedo.

Authors and Affiliations

Grupo de Medicina Xenómica, Fundación Instituto de Investigación Sanitaria de Santiago de Compostela (FIDIS), Universidade de Santiago de Compostela, Santiago de Compostela, Spain
A. Alonso-Gonzalez, M. Calaza, R. Martínez-Regueiro, M. Fernández-Prieto & A. Carracedo
Genomics and Bioinformatics Group, Center for Research in Molecular Medicine and Chronic Diseases (CiMUS), Universidade de Santiago de Compostela, Av Barcelona 31, 15706, Santiago de Compostela, Spain
A. Alonso-Gonzalez, M. Calaza, R. Martínez-Regueiro, M. Fernández-Prieto, Cristina Rodriguez-Fontenla & A. Carracedo
Fundación Pública Galega de Medicina Xenómica (FPGMX), Centro de Investigación Biomédica en Red, Enfermedades Raras (CIBERER), Universidad de Santiago de Compostela, Santiago de Compostela, Spain
J. Amigo & A. Carracedo
Centro De Investigación Biomédica en Red de Salud Mental (CIBERSAM), Hospital General Universitario Gregorio Marañón, Instituto de Investigación Sanitaria Gregorio Marañón, IiSGM, School of Medicine, Universidad Complutense, Madrid, Spain
J. González-Peñas, M. Parellada & C. Arango

Authors

A. Alonso-Gonzalez
View author publications
You can also search for this author in PubMed Google Scholar
M. Calaza
View author publications
You can also search for this author in PubMed Google Scholar
J. Amigo
View author publications
You can also search for this author in PubMed Google Scholar
J. González-Peñas
View author publications
You can also search for this author in PubMed Google Scholar
R. Martínez-Regueiro
View author publications
You can also search for this author in PubMed Google Scholar
M. Fernández-Prieto
View author publications
You can also search for this author in PubMed Google Scholar
M. Parellada
View author publications
You can also search for this author in PubMed Google Scholar
C. Arango
View author publications
You can also search for this author in PubMed Google Scholar
Cristina Rodriguez-Fontenla
View author publications
You can also search for this author in PubMed Google Scholar
A. Carracedo
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

J.P., R.M.-R., M.F.-P., C.A. and M.P. participated in the recruitment of samples. A.A.-G., C.R.-F., M.-C.C. and J.A. have performed the analyses. A.A.-G. wrote the paper. M.P., C.A., A.C. and C.R.-F. critically revised the work and approved the final content. A.A.-G., C.R.-F., M.-C.C. and A.C. participated in the design and coordination of this study.

Corresponding author

Correspondence to Cristina Rodriguez-Fontenla.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Alonso-Gonzalez, A., Calaza, M., Amigo, J. et al. Exploring the biological role of postzygotic and germinal de novo mutations in ASD. Sci Rep 11, 319 (2021). https://doi.org/10.1038/s41598-020-79412-w

Download citation

Received: 26 March 2020
Accepted: 30 November 2020
Published: 11 January 2021
DOI: https://doi.org/10.1038/s41598-020-79412-w

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Rare coding variation provides insight into the genetic architecture and phenotypic context of autism

Nonrandom occurrence of multiple de novo coding variants in a proband indicates the existence of an oligogenic model in autism

Genomic analysis of 116 autism families strengthens known risk genes and highlights promising candidates

Background

Methods

Subjects

Sample quality control and DNMs detection

Data processing and annotation

Sample specific quality control

DNMs detection

Transmission and de novo association test (TADA-denovo)

Gene-set enrichment analysis of PZMs and germinal mutations

Gene ontology enrichment analysis

Expression cell-type enrichment analysis and expression analysis across brain regions and developmental periods

Ethics approval and consent to participate

Results

Transmission and de novo association test (TADA-denovo)

Gene-set enrichment analysis of PZMs and germinal mutations

Gene ontology enrichment analysis

Expression cell-type enrichment analysis and expression analysis across brain regions and developmental periods.

Discussion

Conclusions

Data availability

Abbreviations

References

Acknowledgements

Funding

Author information

Authors and Affiliations

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Publisher's note

Supplementary information

Supplementary Information 1.

Supplementary Information 2.

Supplementary Information 3.

Supplementary Information 4.

Rights and permissions

About this article

Cite this article

Share this article

Comments

Search

Quick links