Abstract
Bipolar disorder (BD) is a global medical issue, afflicting around 1% of the population with manic and depressive episodes. Despite various genetic studies, the genetic architecture and pathogenesis of BD have not been fully resolved. Besides germline variants, postzygotic mosaic variants are proposed as new candidate mechanisms contributing to BD. Here, we performed extensive deep exome sequencing (DES, ~300×) and validation experiments to investigate the roles of mosaic variants in BD with 235 BD cases (194 probands of trios and 41 single cases) and 39 controls. We found an enrichment of developmental disorder (DD) genes in the genes hit by deleterious mosaic variants in BD (P = 0.000552), including a ClinVar-registered pathogenic variant in ARID2. An enrichment of deleterious mosaic variants was also observed for autism spectrum disorder (ASD) genes (P = 0.000428). The proteins coded by the DD/ASD genes with non-synonymous mosaic variants in BD form more protein-protein interaction than expected, suggesting molecular mechanisms shared with DD/ASD but restricted to a subset of cells in BD. We also found significant enrichment of mitochondrial heteroplasmic variants, another class of mosaic variants, in mitochondrial tRNA genes in BD (P = 0.0102). Among them, recurrent m.3243 A > G variants known as causal for mitochondrial diseases were found in two unrelated BD probands with allele fractions of 5–12%, lower than in mitochondrial diseases. Despite the limitation of using peripheral tissues, our DES investigation supports the possible contribution of deleterious mosaic variants in the nuclear genome responsible for severer phenotypes, such as DD/ASD, to the risk of BD and further demonstrates that the same paradigm can be applied to the mitochondrial genome. These results, as well as the enrichment of heteroplasmic mitochondrial tRNA variants in BD, add a new piece to the understanding of the genetic architecture of BD and provide general insights into the pathological roles of mosaic variants in human diseases.
Similar content being viewed by others
Introduction
Bipolar disorder (BD) afflicts around 1% of the population with depressive and manic episodes. The sufferings of the patients and the societal cost [1] necessitate the development of effective therapeutic management. Although current medications are indispensable for patients, they are not perfect solutions due to potential adverse effects and treatment resistance in many cases. We need to understand the biological mechanisms of BD to develop better treatment for future psychiatry [2].
Encouraged by the high heritability of BD [3], various genomic approaches have started to elucidate the biological mechanisms of BD. Large-scale genome-wide association studies revealed many associated loci [4,5,6]. Whole-exome sequencing (WES) with multiplex families suggested potentially disease-associated genes [7, 8]. Trio-based WES studies found the relevance of the high probability of being loss-of-function intolerant (pLI) genes [9,10,11]. A large-scale WES study reported one promising candidate gene, AKAP11 [12]. Short copy number variations in ASTN2, DLG2, and PCDH15 were associated with BD [13]. While these studies have contributed to our understanding of BD, the associated loci and genes are not enough to fully explain the biological mechanisms of BD.
Besides germline variants, postzygotic mosaic variants arising during development are new candidate mechanisms explaining a remaining part of the genetic architecture of psychiatric disorders [14, 15]. Mosaic variants have been indicated as components of the genetic architecture of autism spectrum disorder (ASD) [16,17,18,19,20,21] and schizophrenia [22,23,24]. Inspired by our preliminary data of mosaic variants in BD, we proposed a hypothesis that deleterious mosaic de novo variants (mDNVs) in the genes associated with developmental disorder (DD) by germline de novo variants (gDNVs) would contribute to BD [9]. However, our previous analysis using conventional WES data (~50× coverage) had an inherent limitation in the sensitivity of mosaic variants. The mosaic variants remain to be extensively investigated to decipher the enigmatic genetic architecture of BD.
Here, we tested our proposed hypothesis by trio-based deep exome sequencing (DES, around 300×) with an expanded cohort of BD (N = 235: 194 trio-based probands and 41 non-trio cases) and 39 controls. We used peripheral tissues for DES to detect mosaic variants of early developmental origin that should be basically shared between the brain and peripheral tissues. Bae et al. reported that over 90% of the mosaic variants with variant allele fractions (VAFs) above 2% in at least one brain region are detectable in another tissue [25], which means peripheral tissues are useful surrogates when targeting potentially relevant mosaic variants in the brain. Indeed, several studies have reported the contribution of mosaic variants to ASD using peripheral tissues [18,19,20,21]. We also sought another class of mosaic variants, mitochondrial heteroplasmic variants, to extend our initial hypothesis of mosaic variants (study design in Fig. 1). We found the enrichment of deleterious mDNVs in DD and ASD genes, supporting the association of mosaic variants in DD/ASD genes and BD. We also found the enrichment of heteroplasmic variants in mitochondrial tRNA genes in BD, including recurrent m.3243 A > G variants, one of the most major causal variants of mitochondrial diseases (a subtype of DD), indicating mitochondrial tRNA variants as a new candidate background of BD.
Materials and methods
See Supplementary Methods for details.
Study participants
We recruited 194 BD probands with their parents (trios), 41 BD single cases, and age-matched 39 healthy controls without psychiatric disorders, including 18 unaffected siblings of BD probands. The BD participants were clinically diagnosed with BD or schizoaffective disorder (SCZAD) by trained psychiatrists, verified by the Diagnostic and Statistical Manual of Mental Disorders as previously [10]. The saliva or blood was collected after obtaining written informed consent. Among the 235 BD participants, 131 overlapped with our previous study [9], and 104 were newly recruited. This study was designed according to the Helsinki declaration and approved by the Research Ethics Committee, Faculty of Medicine, Juntendo University, RIKEN Wako Research Ethics First Committee, the Ethics Committee on Genetics of Niigata University, and the Research Ethical Committee of Tohoku Medical Megabank Organization, Tohoku University.
Deep exome sequencing
The DNA from the BD (N = 235) and control (N = 39) participants newly underwent DES. The exonic and mitochondrial DNA was enriched by SureSelect Human All Exon v5/6 + mitochondrial probes (Agilent Technologies, Santa Clara, CA, USA) and sequenced by NovaSeq6000 or NextSeq2000 (Illumina, San Diego, CA, USA) at a theoretical depth of 500×. The sample data are summarized in Table S1. A BD sample with low depth (<100×) was omitted (N = 1).
Read alignment and variant calling
We performed an alignment and quality control based on the GATK Best Practice [26,27,28] using GATK-4.0/4.1 with two reference genomes (hg38 and hs37d5) (Table S2). We called the candidate exonic and mitochondrial mosaic variants in trio-based BD probands by Mutect2 [29] with parental sequencing data to filter out transmitted variants and sequencing artifacts. The non-trio-based BD cases (N = 41) and controls (N = 39) underwent Mutect2 calling without parental data. We used our in-house Panel of Normals (PoN) (Supplementary Data), the PoN by Broad Institute, and gnomAD-r2.1.1 non-PASS alleles [30] to exclude sequencing artifacts. We selected the final exonic mDNVs and the mitochondrial heteroplasmic variants in the probands (pHets) by hard filtering (Table S2). The 41 non-trio-based BD cases and 39 controls were limited to exploratory analysis. For trio-based BD probands, the outliers in per-individual mDNV counts that deviated from the expected Poisson distributions were excluded from the subsequent analysis (N = 3), resulting in 190 trios for the subsequent analysis.
The gDNVs in the same trio-based BD probands (N = 190) were partly compiled from our previous study [9] and additionally called for new trios, using GATK-HaplotypeCaller, triodenovo-0.06 [31], and DNMFilter-0.1.1 [32] (Table S2). The mitochondrial homoplasmic variants in the BD probands (pHoms) were called by GATK-HaplotypeCaller. Since the analytical engine of HaplotypeCaller is shared with Mutect2, the calling bias should be consistent between the mDNVs and gDNVs or between pHets and pHom. The variants detected by the hs37d5 pipeline underwent liftover to the hg38 coordinate for variant annotations.
Variant annotation
The exonic variants were annotated with gnomAD-r2.1.1 [30], ToMMo-8.3KJPN (Japanese population dataset) [33], SnpEff-4.3 [34], MPC [35], and dbNSFP-4.0a [36] (seven algorithms for missense variants: SIFT, PolyPhen-2 HumVar/HumDiv, LRT, MutationTaster, MutationAssessor, and PROVEAN [37,38,39,40,41,42]). We defined deleterious variants as loss-of-function (LoF: nonsense, frameshift, and canonical splice) or damaging missense (MPC ≥ 2 or predicted as damaging by all the seven algorithms above). The mitochondrial variants were annotated with gnomAD-v3.1 [43], ToMMo-8.3KJPN [33], HelixMTdb [44], SnpEff-4.3 [34], MitoTIP [45], and PON-mt-tRNA [46].
Validation of mosaic variants
The candidate mosaic variants underwent target amplicon sequencing (TAS) validation by MiSeq/iSeq100 (Illumina) (median 26 719×). For pHets, the candidates underwent TAS for all the trio members to exclude the possibility of misalignment due to nuclear mitochondrial DNA (NUMT) [47]. TAS libraries were prepared and analyzed as previously [9, 48, 49]. The variants failing in TAS validation were excluded from the subsequent analysis.
The enrichment analysis of mDNVs
We assessed the enrichment of DD and ASD genes in the genes hit by mDNVs and gDNVs in BD, ASD probands, and unaffected ASD siblings by DNENRICH [50] with one million permutations. The gDNVs/mDNVs in ASD probands and unaffected ASD siblings were compiled from previous ASD studies [18, 51] as positive and negative controls, respectively. The DD genes (285 genes) were defined as genes associated with DD through gDNVs in Kaplanis et al. [52]. The ASD genes (924 genes) were defined as SFARI scores 1–3. We limited the comparison to single nucleotide variants (SNVs) because insertion/deletions (INDELs) are sensitive to the pipeline difference.
We performed gene ontology (GO) analysis by DNENRICH [50] with one million permutations using the same gene-set and procedure as previously [9]. We used STRING [53] for protein-protein interaction (PPI) analysis (https://string-db.org/) and Cytoscape-v3.7.2 [54] for visualization.
We analyzed DES data from the age-matched 39 controls of our recruitment, including 18 unaffected siblings of BD probands, to rule out apparent technical artifacts and assess the possibility of mosaic variants due to clonal hematopoietic expansion frequently observed in individuals without psychiatric disorders.
Heteroplasmic variant analysis
We classified the mitochondrial genic variants into four classes: synonymous, non-synonymous, tRNA, and rRNA. The intergenic regions were excluded because of much higher mutation rates than the genic regions [55]. We limited the analysis to SNVs, excluding low-mappability regions (Supplementary Data). First, we compared the proportions of target variants to total genic variants in the pHets and other variant sets: pHoms of the same BD probands (N = 190) and all the possible mitochondrial genic variants. We also checked the characteristics of pHets in two population datasets as references: ToMMo trio generation II (ToMMo GenII, N = 518 trios) [33, 56, 57] and Wei et al. [55] (N = 1526 duos). We called the pHets in ToMMo dataset using the matching procedure for our BD. We compiled the pHets in Wei et al. [55], which extensively studied mitochondrial heteroplasmic variants in mother-proband duos, selecting de novo heteroplasmic variants in their original publication. The schizophrenia trio data (N = 615 trios) were derived from Fromer et al. [50] (dbGaP phs000687.v1.p1) with authorization and processed by the matching procedure for the BD trios. The statistical assessments for rare proportions were performed by Fisher’s exact test (FET).
Results
The enrichment of the DD/ASD genes in mDNVs in BD
We obtained DES data at ~300× non-duplicated depth for exonic regions (median depth = 296× [hg38] and 332× [hs37d5] for BD cases and 300× [hg38] and 337× [hs37d5] for controls, Table S3). The depth in hg38 was lower than in hs37d5 due to the alignment to alternative contigs. The estimated contamination rates were low (<0.2%) enough for detecting mosaic variants with low allelic fractions. We detected 162 high-confidence exonic mDNVs from 190 trio-based BD probands (Table S4), including 27 mDNVs detected previously [9]. In the 131 probands commonly analyzed in our current and previous studies [9], we identified 119 mDNVs, whereas only 27 of them were detected in our previous conventional WES study [9]. This 4.4-fold increase clearly indicates the advantage of DES. The sequencing data of the parents as technical controls facilitated the effective removal of sequencing artifacts, achieving a validation rate of 93.8% for exonic mDNVs (SNV 94.3% [116/123]; INDEL 80% [4/5]). The detection sensitivity was satisfactory (162/190 = 0.85 exonic mDNV per sample), consistent with the theoretical estimate of one exonic mDNV per sample with an even sequencing depth of 300× [14]. The similarity of the mutational patterns of mDNVs in BD and gDNVs in unaffected ASD siblings [51] enabled us to apply the theoretical mutation rates in DNENRICH, originally designed for gDNVs, to mDNVs (cosine similarity = 0.901, Fig. S1).
First, we tested our primary hypothesis of the possible enrichment of DD genes in the genes hit by deleterious mDNVs with theoretical expectations using DNENRICH, which considers gene length and trinucleotide mutational contexts. While the genes hit by synonymous mDNVs in BD were not enriched in DD genes (P = 1.00), the genes hit by deleterious mDNVs in BD were enriched in DD genes (P = 0.000552) (Fig. 2A). We also observed the enrichment of ASD genes in the genes hit by deleterious mDNVs in BD (P = 0.000428) but not in the genes hit by synonymous mDNVs in BD (P = 0.295) (Fig. 2A). While the data came from analysis with peripheral tissues, these results extend the categories of genes in the primary hypothesis from DD genes to DD/ASD genes. In contrast, these enrichments were not observed for the deleterious gDNVs in the same BD probands (Table S5, Fig. 2B) and the mDNVs/gDNVs in unaffected ASD siblings compiled from previous ASD studies [18, 51] (Fig. 2C, D). No enrichment of DD/ASD genes in the genes hit by mDNVs in the unaffected ASD siblings as negative control individuals indicates that the DD/ASD genes are not naturally vulnerable to mosaic mutations more than theoretical expectations. Reassuringly, the same analytical procedure confirmed the enrichments of DD/ASD genes in the gene hit by deleterious mDNVs/gDNVs in ASD probands as positive controls [18, 51] (Fig. S2). These contrasts still held when limiting the mDNVs to LoF variants (Fig. S3). The enrichment of DD/ASD genes in the genes hit by deleterious mDNVs in BD was significant even after stringent Bonferroni correction (corrected P = 0.0199 and 0.0154 for DD and ASD genes, respectively, n = 36 tests). Based on the fold change of deleterious mDNVs in DD genes in BD to the theoretical expectation (5.73-fold, 6/190 samples), the statistical power from the sample size of 190 was 0.89 at the α-level of 0.05, supporting our observation.
DD genes of interest hit by deleterious mDNVs
Subsequently, we took a closer look at the individual deleterious (LoF and damaging missense) mDNVs in DD/ASD genes (Table 1). Of note, one BD proband had an LoF mDNV in ARID2 (p.Arg1769*), a recurrent variant observed in multiple patients with Coffin-Siris syndrome as germline variants (ClinVar Accession: VCV000451913.4). Coffin-Siris syndrome is a severe DD characterized by intellectual disability, delayed development of speech and motor skills, facial features, and hypoplasia of fingers or toes. This fact supported the potential pathogenicity of the LoF mDNV in ARID2 for BD. The pathogenic relevance of an LoF mDNV in KMT2C, another robust causal mutation for DD if present as gDNVs, was discussed previously [9].
Among damaging missense mDNVs, the missense mDNV in SLC35A2 was of particular interest because several mDNVs in SLC35A2 are causal for malformation of cortical development with oligodendroglial hyperplasia in epilepsy (MOGHE), an epileptic syndrome with neuropathological abnormality in the brain [58], while gDNVs in SLC35A2 are causal for a developmental disorder, congenital disorders of glycosylation (CDG) [59] (Fig. 3A). The mDNV in SLC35A2 in BD would also have a pathological effect in specific brain regions. The damaging missense mDNV in HIVEP2, a gene coding a brain-expressed transcription factor, hit a conserved amino acid in the DNA binding domain (Fig. 3B), the essential region of the DNA-binding transcription factor. The damaging missense mDNV in SRCAP, which hit the DNA-binding domain of SRCAP protein consisting of a chromatin-remodeling complex, was previously analyzed in detail [9]. Overall, the damaging missense mDNVs in the DD/ASD genes were positioned at the genomic regions coding well-conserved amino acids (Fig. S4), supporting their potential pathogenicity. As expected, none of the 12 deleterious mDNVs in DD/ASD genes was observed as germline variants in the general population (gnomAD and ToMMo) (Table 1).
Since many DD genes overlap oncogenic genes [52], further caution should be appreciated for possible clonal expansion of blood cells, including clonal hematopoiesis with indeterminate potential (CHIP). Among the genes hit by deleterious mDNVs, ARID2, KMT2C, and GNAS are included in the 713 genes associated with cancer in TCGA (https://portal.gdc.cancer.gov/, 2022-05-24) or 87 genes associated with CHIP compiled previously [9]. We formerly confirmed that the mDNV in KMT2C was of early developmental origin [9]. We also determined in this study that the LoF mDNV in ARID2 was of early developmental origin, validating the same variant in the nail DNA (an ectodermal tissue) with a VAF of 4.1% (VAF in saliva = 5.4%). This variant detected in an ectodermal tissue is expected to be shared among various tissues, including the brain. The origin of the variant in GNAS remained unknown due to the difficulty of re-contacting. We could not exclude the possibility of CHIP for other mDNVs, but CHIP should not be frequent in 190 BD probands with a median age of 37.0 because CHIP is not frequent before age 40 [60,61,62]. At least the age-matched 39 controls, including 18 unaffected siblings of BD probands, did not have deleterious mosaic variants in DD genes with similar depth of DES, indicating that deleterious mosaic variants in DD genes were not apparently frequent in healthy controls.
Mechanistic insight for BD
To obtain mechanistic insight into how mDNVs contribute to the risk for BD, we performed a series of bioinformatics analyses. For this, we used the genes hit by any non-synonymous mDNVs (n = 113) rather than those hit by deleterious mDNVs (n = 33) to increase statistical power. This expansion was justified because we observed nominal enrichments of DD and ASD genes in the genes hit by non-synonymous mDNVs (P = 0.0189 and 0.00519, respectively).
First, we performed gene ontology (GO) enrichment analysis for the genes hit by non-synonymous mDNVs using DNENRICH, which adjusts for gene length and trinucleotide mutational contexts (Table S6). The top-ranked GO was positive regulation of neuron projection development (P = 0.000237), which includes three DD/ASD genes (ARHGAP35, PLXNB1, and UNC13A). This neurodevelopmental GO was consistent with the nominal enrichment of DD/ASD genes in the genes hit by non-synonymous mDNVs. Still, none survived after the adjustment to FDR (FDR < 0.1), probably due to a limited number of the input genes.
We then checked the protein-protein interaction (PPI) networks of proteins coded by the genes with non-synonymous mDNVs in BD. These proteins formed PPI networks more than expected (P = 0.0233, background = whole genome). The proteins coded by the DD/ASD genes with non-synonymous mDNVs in BD also formed PPI networks more than expected (P = 0.0419, background = 1053 DD/ASD genes), consistent with the nominal enrichment of DD/ASD genes in the genes hit by non-synonymous mDNVs. By visualizing the PPI of genes affected by non-synonymous mDNVs, we found four networks. Of these, the largest network consisting of 22 genes particularly involved many DD/ASD genes, where 11 of them (ARID2, KMT2C, SRCAP, and others) are known to be associated with DD/ASD (Fig. 3C).
Extensive detection of mosaic variants in non-trio BD cases
In addition to 194 trio-based BD probands, we sought deleterious mosaic variants in the DD/ASD genes from 41 non-trio-based BD cases as an exploratory search (not included in the statistical analysis above). Three cases had deleterious mosaic variants in the DD genes: CHAMP1, NACC1, and BCL11B (Table 1). Another patient had an LoF mosaic variant in an ASD gene: MED13 (SFARI score 1, pLI 1.00). Yet MED13 is not in the DD gene list in Kaplanis et al. [52], but was reported to be associated with developmental delay and intellectual disability by autosomal dominant mode, frequently through gDNVs [63]. These data provide additional examples of BD cases with deleterious mosaic variants in the DD/ASD genes, resulting in 9 and 14 individuals with deleterious mosaic variants in DD and ASD genes, respectively, among 231 BD participants.
We observed no deleterious mosaic variants in DD genes but one deleterious mosaic variant in an ASD gene (p.Trp1981* in LRP1) in 39 controls of our recruitment. The contrast of deleterious mosaic variant rates of 9/231 (0.039) in BD vs. 0/39 (0.00) in controls for DD genes and 14/231 (0.061) in BD vs. 1/39 (0.026) in controls for ASD genes did not contradict our hypothesis. However, the statistical power was insufficient to assess these enrichments due to a small sample size of controls. For reference, these contrasts indicated P values of 0.366 and 0.704 for DD and ASD genes, respectively, by two-sided FET. Further confirmation needs a larger sample size of controls.
Mitochondrial heteroplasmic variants in BD
The enrichment of DD/ASD genes in the genes hit by exonic deleterious mDNVs encouraged us to search for similar phenomena in the mitochondrial genome. At a median depth of 711×, we detected 33 high-confident mitochondrial heteroplasmic variants in 190 trio-based BD probands (pHets, Table S7) with VAFs of 1–36% (median 4.1%). The trio-based TAS validation confirmed low heteroplasmic variants (VAF < 10%) by excluding misalignment due to transmitted NUMT (Table S8) (validation rate = 90.7%, all were SNVs). To investigate the characteristics of pHets, we compared the high-confident pHets to homoplasmic variants in the same probands (pHoms, n = 382, Table S9), calculating the proportion of four classes of mitochondrial genic variants: synonymous, non-synonymous, tRNA, and rRNA. The proportion of tRNA variants in pHets was higher than in pHoms (P = 0.00374, two-sided FET, 3.86-fold enrichment), while the proportions of the other three classes in pHets were not different from those in pHoms (Fig. 4A).
The tRNA variants were more observed as heteroplasmy than homoplasmy in the gnomAD-v3.1 dataset as the general population (Fig. S5A), probably due to selection bias to tRNA variants. The question is whether the enrichment of tRNA variants in BD pHets was derived only from (i) the general feature of heteroplasmy compared to homoplasmy or also derived from (ii) a characteristic of heteroplasmy in BD. The proportion of tRNA variants in the BD pHets was higher than in the gnomAD heteroplasmy (P = 0.0226, two-sided FET, 2.42-fold enrichment). However, the gnomAD heteroplasmy included common heteroplasmy, which could have fewer tRNA variants than rare heteroplasmy like those in BD. Thus, we checked the possibility of (ii) by comparing the BD pHets to the newly arising rare pHets in two population datasets: ToMMo GenII [57] and Wei et al. [55]. Since the sequencing methods and analytical pipelines were different, we compared the proportion of tRNA pHets to the total pHets in each set with theoretical proportion (i.e., all the possible mitochondrial genic SNVs). While the proportion of tRNA pHets in the two population datasets is not higher than in all the possible genic SNVs (P = 1.00 and 0.382 for ToMMo GenII and Wei et al. data, respectively, two-sided FET), the proportion of tRNA pHets in BD was higher than in all the possible genic SNVs (Fig. 4B, P = 0.0102, two-sided FET, 2.82-fold enrichment). The enrichment of tRNA pHets in BD remained significant when limiting the tRNA pHets to the deleterious ones (Fig. 4C, P = 0.00994, two-sided FET, 4.72-fold enrichment). These enrichments were not likely to be derived from the bias of SNV patterns in tRNA regions because the SNV patterns in tRNA regions were similar to those in whole mitochondrial genic regions (cosine similarity = 0.894, Fig. S5B). Thus, the enrichment of tRNA variants in BD pHets could be derived not only from the general feature of heteroplasmic variants but also from a characteristic of BD. Besides high-confident variants, we detected two more low-confident heteroplasmic tRNA variants in low-mappability regions in 190 trio-based BD probands, including the ClinVar-Pathogenic m.5703 G > A variant (not included in the statistical analysis above, Table S7).
While we could not statistically assess the enrichment of LoF variants due to the limited number, one participant had an LoF pHet (p.Gly260* in MT-CO3). Mitochondrial LoF variants are almost none as homoplasmy in large-scale databases [43, 44]), indicating phenotypic disadvantages resulting from mitochondrial LoF variants. Another interesting variant was an initiator pHet (c.3 G > A) in the AUG start codon of MT-CO2. The c3.G > A in the AUG start codon is not observed as homoplasmy in gnomAD and HelixMTdb [43, 44]. In theory, the LoF and AUG initiator-codon pHets could have a pathological effect on BD.
Extensive detection of heteroplasmic variants in BD
As an extensive exploratory analysis, we sought heteroplasmic tRNA variants, including those in low-mappability regions, from 231 BD and 39 controls, irrespective of transmission status. We found 15 individuals with heteroplasmic tRNA variants in 231 BD participants, including nine tRNA variants as trio-based pHets (Table S7). In contrast, we found no heteroplasmic tRNA variants in 39 controls by the same pipeline (15/231 [0.065] vs. 0/39 [0.00]). Despite the insufficient statistical power due to limited controls (P = 0.139 by two-sided FET) and the low-mappability of several variants, this observation did not contradict the possible enrichment of heteroplasmic tRNA variants in BD. Among them, seven deleterious tRNA variants were notable for their rarity in the general population (Table 2). Combining the above results, the heteroplasmic variants of interest for potential pathogenicity (deleterious tRNA, LoF, initiator codon, or ClinVar-registered) are illustrated and listed in Fig. 4D and Table 2.
Recurrent m.3243 A > G variants in BD
In this study, the m.3243 A > G variant (deleterious tRNA variant) was the only mosaic variant recurrently found in two unrelated BD probands. The m.3243 A > G variant is causative for severe mitochondrial disease, MELAS (mitochondrial myopathy, encephalopathy, lactic acidosis, and stroke-like episodes) [64]. The VAFs of m.3243 A > G in BD probands (5–12%) were lower than the average VAFs in MELAS pedigrees (around 20% in the leucocytes [65]), suggesting a possible correlation between VAFs and disease severities. The m.3243 A > G variant of the ID524 patient was assumed to be de novo, while the m.3243 A > G variant of ID404 was inferred as maternally inherited with low-level heteroplasmy because the mother of ID404, diagnosed with recurrent depression, had the m.3243 A > G variant with a VAF of 2.0%. The two pedigrees seemed genetically unrelated, because the PHI score between the two BD probands was −0.0147 by vcftools-v0.1.17 relatedness2 [66, 67] calculation of exonic common SNPs. The pedigree of ID404 had two other psychiatric patients with schizophrenia in the mother’s pedigree compatible with maternal inheritance. While the DNA samples from the two schizophrenia patients and systematic pedigree information were unavailable, this information motivated us to investigate pHets in schizophrenia. Selecting 143 schizophrenia trios with the probands’ average mitochondrial depth ≥50 as eligible from Fromer et al. data [50], we detected 4 pHets in the 143 trios, including one m.3243 A > G variant with a VAF of 13.3% in one schizophrenia proband (frequency = 6.99 × 10−3).
Then, we queried if the m.3243 A > G variant is frequent in large-scale population datasets (Table S10). First, we investigated Japanese population samples, ToMMo-8.3KJPN (median mitochondrial depth of 786×) [33, 56]. The m.3243 A > G variants with VAFs ≥ 1% were detected in 10 of 7720 individuals (frequency = 1.30 × 10−3). The proportion of individuals with m.3243 A > G variants in BD (2/231 = 8.66 × 10−3) was higher than in ToMMo-8.3KJPN with similar depths of around 750× (P = 0.0464, two-sided FET). Next, we checked the gnomAD v3.1 database and found 57 m.3243 A > G variants (VAFs ≥ 1%) in 56 383 individuals, including ambiguous calls of VAFs < 10% [43]. Since the median mitochondrial depth of 2700× in gnomAD-v3.1 indicates sufficient sensitivity to VAF of 1%, we can safely interpret the upper estimate of the frequency of m.3243 A > G variants with VAFs ≥ 1% as 1.01 × 10−3 in gnomAD-v3.1. Even using this upper estimate, the proportion of individuals with m.3243 A > G variants in BD was higher than in gnomAD-v3.1 (P = 0.0247, two-sided FET). We also checked m.3243 A > G frequency in HelixMTdb [44], an investigation using saliva DNA like our BD study, and found the frequency of m.3243 A > G variant as 2.60 × 10−4 at an average mitochondrial depth of 182×. We queried the downsampled (to 182×) data of BD to match the source tissue (saliva) and sequencing depth between BD and HelixMTdb, confirming that the two pHets of m.3243 A > G in BD were still callable at 182× depth. The proportion of individuals with m.3243 A > G variants in BD was also higher than in HelixMTdb (P = 0.00186, two-sided FET) in the comparison matching the source tissue and sequencing depth. Thus, the frequency of m.3243 A > G variants in BD should be higher than in the three population datasets.
Discussion
Here, we report the characteristics of exonic and mitochondrial mosaic variants in BD comprehensively detected by DES using peripheral tissues. The genes hit by deleterious mosaic variants in BD were enriched in DD/ASD genes (Fig. 2). These exonic deleterious mosaic variants were not observed in the general population (Table 1). The PPI network analysis suggested that BD has shared pathological mechanisms with DD/ASD through mosaic variants (Fig. 3). The mitochondrial heteroplasmic variants in BD probands were enriched in tRNA regions (Fig. 4), including recurrent m.3243 A > G variants, and the deleterious tRNA variants in BD were rare in the general population (Table 2).
While using peripheral tissues, the enrichment of DD genes in the genes hit by deleterious mDNVs in BD (Fig. 2) supports our primary hypothesis that deleterious mosaic variants in DD genes are associated with BD. Post hoc power calculation indicates that our sample size is enough to address this issue, and deleterious mosaic variants in DD genes were not observed in DES from 39 control samples. Our data extend Sanders et al.’s perspective on the relevance of rare genetic diseases to neuropsychiatric symptoms [68]: the deleterious mosaic variants in DD genes would result in less severe phenotypes (e.g., BD) as a biological spectrum from DD (Fig. S6). The mosaic LoF variant in ARID2 in BD, which causes Coffin-Siris syndrome if it exists as a germline variant, is a notable example of this paradigm. As a more general pathological insight, mosaic variants in DD/ASD genes could be one of the molecular explanations behind the shared molecular pathologies among various psychiatric/neurodevelopmental disorders [69]. Indeed, the PPI of the proteins coded by the DD/ASD genes with non-synonymous mDNVs in BD, especially the tight interactions of ARID2, KMT2C, and SRCAP as chromatin remodeling proteins, should be an exemplar shared molecular mechanism between BD and DD/ASD.
We also found the potential relevance of mitochondrial heteroplasmic (mosaic) variants to BD. The enrichment of tRNA heteroplasmic variants, especially those with predicted deleterious effects, suggests that tRNA variants could explain a part of BD. The enrichment of heteroplasmic tRNA variants in BD was calculated by the proportions of tRNA to total variants in our study. Assuming the mutation rate of mtDNA as a molecular property in BD is no less than in population controls, the tRNA variants would be more observed in BD than in population controls. Wang et al. reported that tRNA heteroplasmic variants are more observed in ASD than in unaffected ASD siblings, while the overall heteroplasmic variant rates were not different [70]. This contrast could also be applicable to BD. Indeed, our preliminary direct comparison suggests enrichment of heteroplasmic tRNA variants in BD qualified with a limited sample size of controls (BD vs. control = 15/231 [0.065] vs. 0/39 [0.00]). Mitochondrial dysfunction has been hypothesized to explain BD [71], particularly BD with maternal inheritance [72]. Since mitochondria have critical roles in brain function as energy sources of ATP and calcium ion regulators, mitochondrial dysfunction should result in alterations in brain function, including psychiatric symptoms [73]. Indeed, patients with mitochondrial diseases tend to have psychiatric symptoms, including BD [74,75,76]. Especially, mitochondrial disease patients with m.3243 A > G variants are more susceptible to BD type I than the general population (odds ratio = 12.8) [76]. Brain-specific Polg mutant mouse, in which various mitochondrial DNA variants are accumulated in their brains due to mitochondria-specific DNA polymerase gamma impairment, shows recurrent mood disorder-like phenotypes, including antidepressant-induced manic-like behavior [77]. Here, we found the enrichment of a specific class of mitochondrial DNA variants as a potential background for BD. The question is whether the heteroplasmy with lower fractions than mitochondrial diseases has a pathogenic effect. Some thresholds of VAFs should exist at the cellular level, but the thresholds for psychiatric disorders are not sure, which could be addressed in future molecular studies. Technically, we could detect low heteroplasmic variants with trio-based validation by removing false positives due to misalignment by NUMT, which were otherwise difficult to confirm without trios. This method could be used in future mitochondrial analysis to detect low heteroplasmic variants precisely.
Of note, we found recurrent m.3243 A > G variants from two unrelated BD probands and two patients with other psychiatric disorders, schizophrenia, and recurrent depression. The higher frequency of the m.3243 A > G variant in BD than in the general population suggests a possible association of the m.3243 A > G variant with BD. Indeed, mitochondrial disease patients with m.3243 A > G variants are more likely to have BD than the general population [76]. Our observation of the m.3243 A > G variants in BD and schizophrenia is consistent with Munakata et al. [78], in which the m.3243 A > G variants were found in two and one brains of 15 BD and 13 schizophrenia patients, respectively (VAFs of 1~2%), but none in the brains of 14 healthy controls. Low heteroplasmic m.3243 A > G variants might be observed frequently in the brains as a trans-diagnostic background of major psychiatric disorders. The pathogenicity of the m.3243 A > G variant is well established for mitochondrial disease, MELAS [64], which is much more severe than BD and schizophrenia. Lower VAFs of m.3243 A > G in BD and schizophrenia than MELAS indicates a possible phenotypic spectrum by VAFs: low heteroplasmic variants, which would result in mitochondrial diseases with high VAFs, should result in milder phenotypes, including psychiatric symptoms (Fig. S6). Indeed, high VAFs of m.3243 A > G variants are related to severe neurological symptoms [79]. One study using neurons differentiated from induced pluripotent stem cells from patients with the m.3243 A > G variant reported that the neuronal dysfunctions from m.3243 A > G follow the VAFs (e.g., more than 30% of mutant alleles lead to neuronal burst irregularity in vitro) [80]. We can speculate that heteroplasmy levels exceed the threshold only in a subgroup of neurons in our BD probands, resulting in psychiatric symptoms related to BD. If this holds, a new therapeutic strategy of taurine for MELAS [81] is a potential therapeutic repurposed for psychiatric patients with m.3243 A > G. Other treatment strategies targeting mitochondrial dysfunctions, especially those derived from m.3243 A > G [82, 83], are also potential therapeutics for psychiatric patients. In parallel with drug development, genomic stratification could be a promising path to precision medicine in psychiatry.
Despite the findings above, we are aware of the following limitations. First, the enrichment of DD/ASD genes in the genes hit by deleterious mDNVs in BD was based on a theoretical estimate. While we confirmed that this enrichment was not observed in gDNVs in BD and mDNVs/gDNVs in unaffected siblings in ASD studies, future direct case-control comparisons will provide further conclusive results. Second, this study detected the mDNVs in BD in peripheral tissues. We validated two critical mDNVs in ARID2 and KMT2C as of early developmental origin, but a possible clonal expansion of blood cells cannot be completely excluded for other mDNVs. Direct investigation of the human brain is desirable for psychiatric research. Besides, our recruitment of BD participants would cause ascertainment bias among BD patients. They were voluntarily interested in scientific research as a family and could have different intellectual capacities from general BD patients. These limitations are also applicable to our analysis of mitochondrial DNA variants. The heteroplasmy in BD also needs case-control comparison and direct investigation of the brain samples. Brain samples are more required for mitochondrial analysis because mitochondrial heteroplasmy is highly variable across tissues. We used various datasets with different source tissues and depths to assess the enrichment of m.3243 A > G heteroplasmy in BD, but the exact frequency of m.3243 A > G heteroplasmy should be investigated in a research design matching the source tissues and sequencing depths.
The use of peripheral tissues is a major limitation of our study. We did not target brain-specific mosaic variants, which could contribute to BD. The mosaic variants in the brain were enriched in the genes involved in neural function with a bias to exonic regions [84, 85]. These variants should be targeted in future investigations. However, we emphasize that peripheral tissues have potential clinical significance. While it is true that postmortem brain samples are the best material for studying mosaic variants directly involved in the pathogenesis of BD, the brains of living patients are usually inaccessible. In contrast, peripheral tissues could be used for clinical genomic diagnosis and risk prediction. Besides, the high accessibility of peripheral tissues enables large-scale case-control comparisons and studies of rare diseases. The possibility of mosaic variants due to clonal hematopoietic expansion can be excluded by assaying other tissues (e.g., nails) as performed in our study. We extensively discuss this issue in Supplementary Note.
Another limitation is the functional prediction of exonic variants. We used canonical transcripts for annotation with the “SnpEff -canon” argument (Supplementary Methods) as a fair analysis as much as possible. However, the complex alternative splicing patterns in the brain could be relevant to the mechanisms of psychiatric disorders [86, 87]. The correct functional annotations for brain-expressed genes need further sophistication that should be addressed in future studies.
In conclusion, we find the enrichment of DD/ASD genes in the genes hit by deleterious mosaic variants in BD with certain statistical evidence, supporting the primary hypothesis of this study. We also find the enrichment of mitochondrial tRNA heteroplasmic variants in BD, including recurrent m.3243 A > G. The m.3243 A > G variants are more observed in BD than in the general population and detected with lower allele fractions than mitochondrial diseases, indicating a possible phenotypic spectrum following variant allele fractions. Since our study has the limitation of peripheral tissue and insufficient control samples, future well-powered case-control investigations into brain samples are required. Despite these limitations, our data shed new light on the genetic architecture of BD and pave the way for precision medicine in clinical psychiatry by genomic stratification.
Data availability
The conventional WES data with consent for database registration are available through the NBDC Human Database, Japan (JGAS000273/JGAD000379). The deep exome sequence data can be accessed via formal collaboration.
References
GBD 2016 Disease Injury Incidence Prevalence Collaborators. Global, regional, and national incidence, prevalence, and years lived with disability for 328 diseases and injuries for 195 countries, 1990-2016: a systematic analysis for the Global Burden of Disease Study 2016. Lancet. 2017;390:1211–59.
Kato T. Current understanding of bipolar disorder: toward integration of biological basis and treatment strategies. Psychiatry Clin Neurosci. 2019;73:526–40.
McGuffin P, Rijsdijk F, Andrew M, Sham P, Katz R, Cardno A. The heritability of bipolar affective disorder and the genetic relationship to unipolar depression. Arch Gen Psychiatry. 2003;60:497–502.
Ikeda M, Takahashi A, Kamatani Y, Okahisa Y, Kunugi H, Mori N, et al. A genome-wide association study identifies two novel susceptibility loci and trans population polygenicity associated with bipolar disorder. Mol Psychiatry. 2018;23:639–47.
Stahl EA, Breen G, Forstner AJ, McQuillin A, Ripke S, Trubetskoy V, et al. Genome-wide association study identifies 30 loci associated with bipolar disorder. Nat Genet. 2019;51:793–803.
Mullins N, Forstner AJ, O’Connell KS, Coombes B, Coleman JRI, Qiao Z, et al. Genome-wide association study of more than 40,000 bipolar disorder cases provides new insights into the underlying biology. Nat Genet. 2021;53:817–29.
Goes FS, Pirooznia M, Parla JS, Kramer M, Ghiban E, Mavruk S, et al. Exome Sequencing of Familial Bipolar Disorder. JAMA Psychiatry. 2016;73:590–7.
Rao AR, Yourshaw M, Christensen B, Nelson SF, Kerner B. Rare deleterious mutations are associated with disease in bipolar disorder families. Mol Psychiatry. 2017;22:1009–14.
Nishioka M, Kazuno AA, Nakamura T, Sakai N, Hayama T, Fujii K, et al. Systematic analysis of exonic germline and postzygotic de novo mutations in bipolar disorder. Nat Commun. 2021;12:3750.
Kataoka M, Matoba N, Sawada T, Kazuno AA, Ishiwata M, Fujii K, et al. Exome sequencing for bipolar disorder points to roles of de novo loss-of-function and protein-altering mutations. Mol Psychiatry. 2016;21:885–93.
Goes FS, Pirooznia M, Tehan M, Zandi PP, McGrath J, Wolyniec P, et al. De novo variation in bipolar disorder. Mol Psychiatry. 2019: advanced online publication https://doi.org/10.1038/s41380-41019-40611-41381.
Palmer DS, Howrigan DP, Chapman SB, Adolfsson R, Bass N, Blackwood D, et al. Exome sequencing in bipolar disorder identifies AKAP11 as a risk gene shared with schizophrenia. Nat Genet. 2022;54:541–7.
Kushima I, Nakatochi M, Aleksic B, Okada T, Kimura H, Kato H, et al. Cross-disorder analysis of genic and regulatory copy number variations in bipolar disorder, schizophrenia, and autism spectrum disorder. Biol Psychiatry. 2022;92:362–74.
Nishioka M, Bundo M, Iwamoto K, Kato T. Somatic mutations in the human brain: implications for psychiatric research. Mol Psychiatry. 2019;24:839–56.
Bizzotto S, Walsh CA. Genetic mosaicism in the human brain: from lineage tracing to neuropsychiatric disorders. Nat Rev Neurosci. 2022;23:275–86.
Rodin RE, Dou Y, Kwon M, Sherman MA, D’Gama AM, Doan RN, et al. The landscape of somatic mutation in cerebral cortex of autistic and neurotypical individuals revealed by ultra-deep whole-genome sequencing. Nat Neurosci. 2021;24:176–85.
Sherman MA, Rodin RE, Genovese G, Dias C, Barton AR, Mukamel RE, et al. Large mosaic copy number variations confer autism risk. Nat Neurosci. 2021;24:197–203.
Krupp DR, Barnard RA, Duffourd Y, Evans SA, Mulqueen RM, Bernier R, et al. Exonic mosaic mutations contribute risk for autism spectrum disorder. Am J Hum Genet. 2017;101:369–90.
Lim ET, Uddin M, De Rubeis S, Chan Y, Kamumbu AS, Zhang X, et al. Rates, distribution and implications of postzygotic mosaic mutations in autism spectrum disorder. Nat Neurosci. 2017;20:1217–24.
Dou Y, Yang X, Li Z, Wang S, Zhang Z, Ye AY, et al. Postzygotic single-nucleotide mosaicisms contribute to the etiology of autism spectrum disorder and autistic traits and the origin of mutations. Hum Mutat. 2017;38:1002–13.
Freed D, Pevsner J. The contribution of mosaic variants to autism spectrum disorder. PLoS Genet. 2016;12:e1006245.
Bundo M, Toyoshima M, Okada Y, Akamatsu W, Ueda J, Nemoto-Miyauchi T, et al. Increased l1 retrotransposition in the neuronal genome in schizophrenia. Neuron. 2014;81:306–13.
Fullard JF, Charney AW, Voloudakis G, Uzilov AV, Haroutunian V, Roussos P. Assessment of somatic single-nucleotide variation in brain tissue of cases with schizophrenia. Transl Psychiatry. 2019;9:21.
Kim MH, Kim IB, Lee J, Cha DH, Park SM, Kim JH, et al. Low-level brain somatic mutations are implicated in schizophrenia. Biol Psychiatry. 2021;90:35–46.
Bae T, Tomasini L, Mariani J, Zhou B, Roychowdhury T, Franjic D, et al. Different mutational rates and mechanisms in human cells at pregastrulation and neurogenesis. Science. 2018;359:550–5.
Li H, Durbin R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics. 2009;25:1754–60.
McKenna A, Hanna M, Banks E, Sivachenko A, Cibulskis K, Kernytsky A, et al. The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. Genome Res. 2010;20:1297–303.
DePristo MA, Banks E, Poplin R, Garimella KV, Maguire JR, Hartl C, et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet. 2011;43:491–8.
Cibulskis K, Lawrence MS, Carter SL, Sivachenko A, Jaffe D, Sougnez C, et al. Sensitive detection of somatic point mutations in impure and heterogeneous cancer samples. Nat Biotechnol. 2013;31:213–9.
Karczewski KJ, Francioli LC, Tiao G, Cummings BB, Alfoldi J, Wang Q, et al. The mutational constraint spectrum quantified from variation in 141,456 humans. Nature. 2020;581:434–43.
Wei Q, Zhan X, Zhong X, Liu Y, Han Y, Chen W, et al. A Bayesian framework for de novo mutation calling in parents-offspring trios. Bioinformatics. 2015;31:1375–81.
Liu Y, Li B, Tan R, Zhu X, Wang Y. A gradient-boosting approach for filtering de novo mutations in parent-offspring trios. Bioinformatics. 2014;30:1830–6.
Nagasaki M, Yasuda J, Katsuoka F, Nariai N, Kojima K, Kawai Y, et al. Rare variant discovery by deep whole-genome sequencing of 1,070 Japanese individuals. Nat Commun. 2015;6:8018.
Cingolani P, Platts A, Wang le L, Coon M, Nguyen T, Wang L, et al. A program for annotating and predicting the effects of single nucleotide polymorphisms, SnpEff: SNPs in the genome of Drosophila melanogaster strain w1118; iso-2; iso-3. Fly (Austin). 2012;6:80–92.
Samocha KE, Kosmicki JA, Karczewski KJ, O’Donnell-Luria AH, Pierce-Hoffman E, MacArthur DG, et al. Regional missense constraint improves variant deleteriousness prediction. bioRxiv. 2017. https://doi.org/10.1101/148353.
Liu X, Wu C, Li C, Boerwinkle E. dbNSFP v3.0: A One-Stop Database of Functional Predictions and Annotations for Human Nonsynonymous and Splice-Site SNVs. Hum Mutat. 2016;37:235–41.
Sim NL, Kumar P, Hu J, Henikoff S, Schneider G, Ng PC. SIFT web server: predicting effects of amino acid substitutions on proteins. Nucleic Acids Res. 2012;40:W452–457.
Adzhubei IA, Schmidt S, Peshkin L, Ramensky VE, Gerasimova A, Bork P, et al. A method and server for predicting damaging missense mutations. Nat Methods. 2010;7:248–9.
Chun S, Fay JC. Identification of deleterious mutations within three human genomes. Genome Res. 2009;19:1553–61.
Schwarz JM, Cooper DN, Schuelke M, Seelow D. MutationTaster2: mutation prediction for the deep-sequencing age. Nat Methods. 2014;11:361–2.
Reva B, Antipin Y, Sander C. Predicting the functional impact of protein mutations: application to cancer genomics. Nucleic Acids Res. 2011;39:e118.
Choi Y, Chan AP. PROVEAN web server: a tool to predict the functional effect of amino acid substitutions and indels. Bioinformatics. 2015;31:2745–7.
Laricchia KM, Lake NJ, Watts NA, Shand M, Haessly A, Gauthier L, et al. Mitochondrial DNA variation across 56,434 individuals in gnomAD. Genome Res. 2022;32:569–82.
Bolze A, Mendez F, White S, Tanudjaja F, Isaksson M, Jiang R, et al. A catalog of homoplasmic and heteroplasmic mitochondrial DNA variants in humans. bioRxiv. 2020. https://doi.org/10.1101/798264.
Sonney S, Leipzig J, Lott MT, Zhang S, Procaccio V, Wallace DC, et al. Predicting the pathogenicity of novel variants in mitochondrial tRNA with MitoTIP. PLoS Comput Biol. 2017;13:e1005867.
Niroula A, Vihinen M. PON-mt-tRNA: a multifactorial probability-based method for classification of mitochondrial tRNA variations. Nucleic Acids Res. 2016;44:2020–7.
Wei W, Pagnamenta AT, Gleadall N, Sanchis-Juan A, Stephens J, Broxholme J, et al. Nuclear-mitochondrial DNA segments resemble paternally inherited mitochondrial DNA in humans. Nat Commun. 2020;11:1740.
Nishioka M, Bundo M, Ueda J, Katsuoka F, Sato Y, Kuroki Y, et al. Identification of somatic mutations in postmortem human brains by whole genome sequencing and their implications for psychiatric disorders. Psychiatry Clin Neurosci. 2018;72:280–94.
Nishioka M, Bundo M, Ueda J, Yoshikawa A, Nishimura F, Sasaki T, et al. Identification of somatic mutations in monozygotic twins discordant for psychiatric disorders. NPJ Schizophr. 2018;4:7.
Fromer M, Pocklington AJ, Kavanagh DH, Williams HJ, Dwyer S, Gormley P, et al. De novo mutations in schizophrenia implicate synaptic networks. Nature. 2014;506:179–84.
Satterstrom FK, Kosmicki JA, Wang J, Breen MS, De Rubeis S, An JY, et al. Large-scale exome sequencing study implicates both developmental and functional changes in the neurobiology of autism. Cell. 2020;180:568–84 e523.
Kaplanis J, Samocha KE, Wiel L, Zhang Z, Arvai KJ, Eberhardt RY, et al. Evidence for 28 genetic disorders discovered by combining healthcare and research data. Nature. 2020;586:757–62.
Szklarczyk D, Gable AL, Nastou KC, Lyon D, Kirsch R, Pyysalo S, et al. The STRING database in 2021: customizable protein-protein networks, and functional characterization of user-uploaded gene/measurement sets. Nucleic Acids Res. 2021;49:D605–D612.
Shannon P, Markiel A, Ozier O, Baliga NS, Wang JT, Ramage D, et al. Cytoscape: a software environment for integrated models of biomolecular interaction networks. Genome Res. 2003;13:2498–504.
Wei W, Tuna S, Keogh MJ, Smith KR, Aitman TJ, Beales PL, et al. Germline selection shapes human mitochondrial DNA diversity. Science. 2019;364:eaau6520.
Takayama J, Tadaka S, Yano K, Katsuoka F, Gocho C, Funayama T, et al. Construction and integration of three de novo Japanese human genome assemblies toward a population-specific reference. Nat Commun. 2021;12:226.
Kuriyama S, Metoki H, Kikuya M, Obara T, Ishikuro M, Yamanaka C, et al. Cohort Profile: Tohoku Medical Megabank Project Birth and Three-Generation Cohort Study (TMM BirThree Cohort Study): rationale, progress and perspective. Int J Epidemiol. 2020;49:18–19m.
Bonduelle T, Hartlieb T, Baldassari S, Sim NS, Kim SH, Kang HC, et al. Frequent SLC35A2 brain mosaicism in mild malformation of cortical development with oligodendroglial hyperplasia in epilepsy (MOGHE). Acta Neuropathol Commun. 2021;9:3.
Ng BG, Sosicka P, Agadi S, Almannai M, Bacino CA, Barone R, et al. SLC35A2-CDG: Functional characterization, expanded molecular, clinical, and biochemical phenotypes of 30 unreported Individuals. Hum Mutat. 2019;40:908–25.
Genovese G, Kahler AK, Handsaker RE, Lindberg J, Rose SA, Bakhoum SF, et al. Clonal hematopoiesis and blood-cancer risk inferred from blood DNA sequence. N. Engl J Med. 2014;371:2477–87.
Jaiswal S, Fontanillas P, Flannick J, Manning A, Grauman PV, Mar BG, et al. Age-related clonal hematopoiesis associated with adverse outcomes. N. Engl J Med. 2014;371:2488–98.
Watson CJ, Papula AL, Poon GYP, Wong WH, Young AL, Druley TE, et al. The evolutionary dynamics and fitness landscape of clonal hematopoiesis. Science. 2020;367:1449–54.
Snijders Blok L, Hiatt SM, Bowling KM, Prokop JW, Engel KL, Cochran JN, et al. De novo mutations in MED13, a component of the Mediator complex, are associated with a novel neurodevelopmental disorder. Hum Genet. 2018;137:375–88.
Goto Y, Nonaka I, Horai S. A mutation in the tRNA(Leu)(UUR) gene associated with the MELAS subgroup of mitochondrial encephalomyopathies. Nature. 1990;348:651–3.
de Laat P, Koene S, van den Heuvel LP, Rodenburg RJ, Janssen MC, Smeitink JA. Clinical features and heteroplasmy in blood, urine and saliva in 34 Dutch families carrying the m.3243A > G mutation. J Inherit Metab Dis. 2012;35:1059–69.
Danecek P, Auton A, Abecasis G, Albers CA, Banks E, DePristo MA, et al. The variant call format and VCFtools. Bioinformatics. 2011;27:2156–8.
Manichaikul A, Mychaleckyj JC, Rich SS, Daly K, Sale M, Chen WM. Robust relationship inference in genome-wide association studies. Bioinformatics. 2010;26:2867–73.
Sanders SJ, Sahin M, Hostyk J, Thurm A, Jacquemont S, Avillach P, et al. A framework for the investigation of rare genetic disorders in neuropsychiatry. Nat Med. 2019;25:1477–87.
Gandal MJ, Haney JR, Parikshak NN, Leppa V, Ramaswami G, Hartl C, et al. Shared molecular neuropathology across major psychiatric disorders parallels polygenic overlap. Science. 2018;359:693–7.
Wang Y, Guo X, Hong X, Wang G, Pearson C, Zuckerman B, et al. Association of mitochondrial DNA content, heteroplasmies and inter-generational transmission with autism. Nat Commun. 2022;13:3790.
Kato T, Kato N. Mitochondrial dysfunction in bipolar disorder. Bipolar Disord. 2000;2:180–90.
McMahon FJ, Stine OC, Meyers DA, Simpson SG, DePaulo JR. Patterns of maternal transmission in bipolar affective disorder. Am J Hum Genet. 1995;56:1277–86.
Manji H, Kato T, Di Prospero NA, Ness S, Beal MF, Krams M, et al. Impaired mitochondrial function in psychiatric disorders. Nat Rev Neurosci. 2012;13:293–307.
Anglin RE, Garside SL, Tarnopolsky MA, Mazurek MF, Rosebush PI. The psychiatric manifestations of mitochondrial disorders: a case and review of the literature. J Clin Psychiatry. 2012;73:506–12.
Inczedy-Farkas G, Remenyi V, Gal A, Varga Z, Balla P, Udvardy-Meszaros A, et al. Psychiatric symptoms of patients with primary mitochondrial DNA disorders. Behav Brain Funct. 2012;8:9.
Colasanti A, Bugiardini E, Amawi S, Poole OV, Skorupinska I, Skorupinska M, et al. Primary mitochondrial diseases increase susceptibility to bipolar affective disorder. J Neurol Neurosurg Psychiatry. 2020;91:892–4.
Kasahara T, Takata A, Kato TM, Kubota-Sakashita M, Sawada T, Kakita A, et al. Depression-like episodes in mice harboring mtDNA deletions in paraventricular thalamus. Mol Psychiatry. 2016;21:39–48.
Munakata K, Iwamoto K, Bundo M, Kato T, Mitochondrial DNA. 3243A>G mutation and increased expression of LARS2 gene in the brains of patients with bipolar disorder and schizophrenia. Biol Psychiatry. 2005;57:525–32.
Pickett SJ, Grady JP, Ng YS, Gorman GS, Schaefer AM, Wilson IJ, et al. Phenotypic heterogeneity in m.3243A>G mitochondrial disease: The role of nuclear factors. Ann Clin Transl Neurol. 2018;5:333–45.
Klein Gunnewiek TM, Van Hugte EJH, Frega M, Guardia GS, Foreman K, Panneman D, et al. m.3243A > G-Induced mitochondrial dysfunction impairs human neuronal development and reduces neuronal network activity and synchronicity. Cell Rep. 2020;31:107538.
Ohsawa Y, Hagiwara H, Nishimatsu SI, Hirakawa A, Kamimura N, Ohtsubo H, et al. Taurine supplementation for prevention of stroke-like episodes in MELAS: a multicentre, open-label, 52-week phase III trial. J Neurol Neurosurg Psychiatry. 2019;90:529–36.
Ma H, Folmes CD, Wu J, Morey R, Mora-Castilla S, Ocampo A, et al. Metabolic rescue in pluripotent cells from patients with mtDNA disease. Nature. 2015;524:234–8.
Kobayashi H, Hatakeyama H, Nishimura H, Yokota M, Suzuki S, Tomabechi Y, et al. Chemical reversal of abnormalities in cells carrying mitochondrial DNA mutations. Nat Chem Biol. 2021;17:335–43.
Lodato MA, Woodworth MB, Lee S, Evrony GD, Mehta BK, Karger A, et al. Somatic mutation in single human neurons tracks developmental and transcriptional history. Science. 2015;350:94–98.
Lodato MA, Rodin RE, Bohrson CL, Coulter ME, Barton AR, Kwon M, et al. Aging and neurodegeneration are associated with increased mutations in single human neurons. Science. 2018;359:555–9.
Vuong CK, Black DL, Zheng S. The neurogenetics of alternative splicing. Nat Rev Neurosci. 2016;17:265–81.
Zhang CY, Xiao X, Zhang Z, Hu Z, Li M. An alternative splicing hypothesis for neuropathology of schizophrenia: evidence from studies on historical candidate genes and multi-omics data. Mol Psychiatry. 2022;27:95–112.
Acknowledgements
We thank all the participants of this study. We thank Yuichiro Watanabe, Muneko Kataoka, and Nana Matoba for data collection; Keisuke Fukumoto, Sumie Goto, and Atsuko Komori for experimental assistance. We thank the Support Unit for Bio-Material Analysis at RIKEN CBS for experimental assistance, the Laboratory for Molecular Dynamics of Mental Disorders members at RIKEN for valuable discussion, and the RIKEN HOKUSAI team for managing the high-performance computing system. This work is supported by the following funding agencies: AMED under Grant Number JP22tm0424224 (TK and GT), JP19dm0207074 (TK), JP21wm0425006 (TK), JP20dm0107133 (AT), JP22km0405214 (AT), 18dm0307028 (MN and AT), JP17km0105001 (JT, SK, and GT), JP21tm0124005 (JT, SK, and GT), JP21zf0127001 (GT); JSPS KAKENHI under Grant Number JP18H05435 (TK), JP22H00468 (TK), JP20H03605 (TK), JP20H05777 (AT), JP21H02855 (AT), JP21K15722 (MN); Subsidies for Current Expenditures to Private Institutions of Higher Education from the Promotion and Mutual Aid Corporation for Private Schools of Japan; Mochida Memorial Foundation for Medical and Pharmaceutical Research (MN).
The Bulgarian Trio Sequencing study is an accumulation of exome sequencing performed and/or funded by the Broad Institute, Cardiff University, Icahn School of Medicine at Mount Sinai, and the Wellcome Trust Sanger Institute. Work at the Broad Institute was funded by Fidelity Foundations, the Sylvan Herman Foundation and philanthropic gifts from Kent and Liz Dauten, Ted and Vada Stanley, and an anonymous donor to the Stanley Center for Psychiatric Research. Work at Cardiff was supported by Medical Research Council (MRC) Centre (G0800509) and Program Grants (G0801418), the European Community’s Seventh Framework Programme (HEALTH-F2-2010-241909 (Project EU-GEI)). Work at the Icahn School of Medicine at Mount Sinai was supported by the Friedman Brain Institute, the Institute for Genomics and Multiscale Biology, and the National Institutes of Health grants R01HG005827 (SMP) and R01MH071681 (PS). Work at the Wellcome Trust Sanger Institute was supported by The Wellcome Trust (WT089062 and WT098051). The recruitment of the trios in Bulgaria was funded by the Janssen Research Foundation.
Author information
Authors and Affiliations
Contributions
MN, AT, and TK conceived the study design. NS, TS, and TK recruited the participants and performed the clinical assessment. TH and KF helped with the enrollment of participants. MN and JU were in charge of BD data management. JT, SK, and GT were in charge of ToMMo data management. MN and JT performed bioinformatics analyses. AK and MI performed molecular experiments. MN wrote the original draft. AT, TK, and JT mainly reviewed the original draft. TK, AT, and MN jointly supervised the project. All the authors checked the final manuscript.
Corresponding authors
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Nishioka, M., Takayama, J., Sakai, N. et al. Deep exome sequencing identifies enrichment of deleterious mosaic variants in neurodevelopmental disorder genes and mitochondrial tRNA regions in bipolar disorder. Mol Psychiatry 28, 4294–4306 (2023). https://doi.org/10.1038/s41380-023-02096-x
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1038/s41380-023-02096-x
This article is cited by
-
Increased somatic mosaicism in autosomal and X chromosomes for suicide death
Molecular Psychiatry (2024)