Familial Adult Myoclonic Epilepsy (FAME) is characterised by cortical myoclonic tremor usually from the second decade of life and overt myoclonic or generalised tonic-clonic seizures. Four independent loci have been implicated in FAME on chromosomes (chr) 2, 3, 5 and 8. Using whole genome sequencing and repeat primed PCR, we provide evidence that chr2-linked FAME (FAME2) is caused by an expansion of an ATTTC pentamer within the first intron of STARD7. The ATTTC expansions segregate in 158/158 individuals typically affected by FAME from 22 pedigrees including 16 previously reported families recruited worldwide. RNA sequencing from patient derived fibroblasts shows no accumulation of the AUUUU or AUUUC repeat sequences and STARD7 gene expression is not affected. These data, in combination with other genes bearing similar mutations that have been implicated in FAME, suggest ATTTC expansions may cause this disorder, irrespective of the genomic locus involved.
FAME (also referred to as Familial Cortical Myoclonic Tremor and Epilepsy or Benign Adult onset Familial Myoclonic Epilepsy [OMIM phenotypic series: PS601068]) is characterised by cortical myoclonic tremor and overt myoclonic and later generalised tonic-clonic seizures (GTCS)1. Onset of symptoms occurs in the second to third decade with variable expressivity within and between families; anticipation has been noted in some families1. The frequency of GTCS varies from 15 to 100% in 22 different families reported here (Table 1)2. Seizures are typically controlled with anti-epileptic drugs for generalised epilepsies, although rarely individuals have drug resistant epilepsy. FAME has been mapped to four distinct chromosomal loci. Most families link to chromosomes 8q243 or 2p11.2-q11.24, with an additional two families mapping to chromosome 5p15.31-p155 and one to chromosome 3q26.32-q286. There is one report of autosomal recessive FAME caused by mutation in CNTN2 where the phenotype was disputed7,8. Candidate genes and variants that fall within these common linkage intervals have been suggested for chr2 (ADRA2B) and chr5 (CTNND2); however, none of these genes have been shown to be allelic in all FAME families with linkage to the same interval1. We previously showed using identity-by-descent mapping that there are at least four distinct founder loci linked to FAME2 (OMIM:607876) on chr29.
The genetic cause of FAME has long remained elusive. The cause of FAME1, which is linked to chr8 (OMIM:601068), has recently been shown to be a complex repeat expansion of pentameric TTTTA and inserted TTTCA repeats into the fourth intron of the SAMD12 gene10,11. In the same study, TNRC6A (chr16) and RAPGEF2 (chr4) were implicated as FAME genes within single families, respectively, found via direct detection of the same repeated TTTTA and TTTCA sequences11.
Here, we use bioinformatic analysis of short-read whole-genome sequencing to identify ATTTT and ATTTC repeat expansions in the FAME2 linkage interval. We screen for an intronic ATTTC expansion in the first intron of STARD7 by repeat-primed PCR and show it segregates with FAME2 in 158 affected individuals from 22 families. We use long-read sequencing to suggest the ATTTT and ATTTC expansions may be somatically unstable. We analyse clinical data and show evidence of anticipation over multiple generations of a large FAME2 family. Finally, we demonstrate that the presence of the ATTTC repeat has no effect on protein or mRNA expression levels of STARD7 in available patient cell lines. These data suggest the repeat sequence alone is pathogenic, independent of an effect on the coding sequence of the encompassing gene.
Discovery of a repeat expansion in STARD7
We analysed Illumina HiSeq X-10 whole-genome sequencing data initially from two individuals from a large Australian-New Zealand FAME family, one from an Italian family and three from a French-Spanish family (Table 1 and Supplementary Table 1; Families 1, 3 and 19, respectively)2,12,13 with two repeat expansion detection methods, ExpansionHunter and exSTRa14,15, to look for similar combined ATTTT and ATTTC repeat expansions on both the forward and reverse chromosome strands within the FAME2 interval. This revealed an expansion of an ATTTT repeat and insertion of an ATTTC repeat in the context of the reverse strand of chr2 within the first intron of STARD7 (StAR-related lipid transfer domain-containing 7) in all FAME samples tested (Fig. 1a, Supplementary Fig. 1). The endogenous ATTTT repeat in intron 1 of STARD7 was also found to be variable in length in the normal population but not expanded to the same extent as repeats found in individuals with FAME. The ATTTC repeat was not present in any whole-genome sequencing data from 69 control samples (Supplementary Fig. 1), nor is it reported in the Simple Repeats track in the UCSC genome browser (build hg38)16.
Segregation of STARD7 ATTTC expansions by repeat-primed PCR
We developed a repeat-primed PCR (RP-PCR) assay to rapidly identify the expansion in 137/137 affected individuals from 16 independently reported FAME2 families worldwide (Fig. 1c, d, Table 1, Supplementary Table 1, Supplementary Data 1, Supplementary Fig. 2; Families 1, 3–6, 8–10, 12–16, 19, 20 and 22). Of the 24 individuals tested in these families that did not have a FAME diagnosis, two were positive for the ATTTC expansion; both were from younger generations and likely presymptomatic. We tested an additional 72 individuals (52 unrelated and 20 cases from six families with multiple affected individuals) with clinical similarity to FAME. Of these, 20/20 familial and 1/52 singleton cases were positive for an ATTTC expansion in STARD7 (Table 1, Supplementary Fig. 2; Families 2, 7, 11, 17, 18 and 21 [singleton case]). The 52 unrelated subjects comprised 13 subjects with generalised epilepsy and tremor and 39 with myoclonic epilepsy with onset over the age of 19 years; 8/52 cases had a family history of epilepsy. Finally, within the families we tested, there were 13 individuals where the diagnosis of FAME was uncertain, usually due to a history of tremor with no other diagnostic features. Of these, 8/13 carried the ATTTC expansion. Two of the individuals with uncertain diagnosis that tested negative, were a mother and daughter pair from Family 1 (Supplementary Fig. 2a [red box] III-13 and IV-65) and subsequent analyses with microsatellite markers showed that these individuals did not have the same haplotype as affected carriers of the ATTTC expansion (Supplementary Fig. 3). The ATTTC repeat expansion did not amplify in any of 28 control DNA samples extracted from unaffected individuals unrelated to FAME.
In all 158 individuals that tested positive for the ATTTC expansion, we observed that priming from ATTTT repeats was only successful from the telomeric end of the endogenous repeat and priming from ATTTC repeats was only possible from the centromeric end of the endogenous repeat. This suggested the structure of the pathogenic repeat in the context of the forward strand of chr2 was (AAATG)n[N](AAAAT)n, where (n) represents the unknown number of each repeat sequence.
Long-read sequencing reveals the repeat structure
The total numbers of repeats could not be determined by the RP-PCR assay, therefore we investigated some of these with long-read sequencing (Fig. 2). In one individual from the Australian-New Zealand family (Family 1: IV-98) a single molecule real-time (SMRT) read and a single Oxford Nanopore read were found that spanned the repeat. The SMRT read generated to 99% base accuracy by circular consensus calling was comprised of four subreads and contained 274 AAATG and 387 AAAAT repeats, without interruption from other sequences. The Oxford Nanopore read contained 345 AAATG and 390 AAAAT repeats with some interruptions, suggesting somatic variation of repeat sizes may occur within the one individual. In a second individual (Family 5; III-37), a single Oxford Nanopore read spanned the expanded repeats with 588 AAATG and 340 AAAAT repeats; 4645 bp in total length. The natural variability in the length of the endogenous ATTTT repeat sequence meant that is was not feasible to use that sequence for mutation screening; however, the ATTTC repeat primer was diagnostic for FAME with a sensitivity of 100% in all families with linkage or suggestive linkage to chr2. This included two families with the previously identified ADRA2B; c.675_686delTGGTGGGGCTTTinsGTTTGGCAG; p.H225_L229delinsQ225_F_G_R228 variant strongly suggesting that allele is not causative (Table 1; Family 4 & 15)17.
Evidence of anticipation in a large FAME2 family
In view of the discovery that FAME2 and FAME1 are caused by similar dynamic mutations of ATTTC repeats, and the demonstration of clinical anticipation in FAME111, we searched for evidence of anticipation in our pedigrees. We examined the median onset age of any relevant symptom, where available, for each generation in the Australian/New Zealand family (Family 1). We found evidence of anticipation; generation III had a median onset of 30 years (range 14–60 y, n = 6), in generation IV median onset was 17 years (8–50 y, n = 30) and the median onset in generation V was 12 years (4–19 y, n = 16). The remaining families were either too small or onset data were unavailable for anticipation to be robustly assessed.
STARD7 transcript and protein abundance are not altered
Reverse transcriptase, quantitative PCR using primer pairs spanning the repeat containing intron between exons one and two and a second pair spanning between exons three and four showed no significant differences in STARD7 transcript expression in patient-derived fibroblast cell lines (Fig. 3a). Protein abundance was also unaltered, confirmed by western blotting using an antibody to STARD7 protein that was previously validated using STARD7-knockout cell lines (Fig. 3b)18. RNA-Seq data from six patient-derived fibroblasts (four from Family 1 and two from Family 5) showed there was no significant difference in gene expression of STARD7 between affected and unaffected individuals along the entire length of the gene (Supplementary Fig. 4; p = 0.838; False Discovery Rate = 1). Reads containing ATTTC repeats were not present in the RNA-Seq data despite robust expression of STARD7. This is consistent with the observations from lymphoblastoid cell lines (LCLs) derived from individuals with FAME1, where no reads with repeats were found11.
The pathogenic ATTTC insertion and expansion was always accompanied by the endogenous ATTTT pentanucleotide repeat in all cases of FAME2 that we describe here, replicating the findings in the cases of FAME with expansions in SAMD12, TNRC6A, RAPGEF210,11,19 and the report of a similar expansion in MARCH6 causing chr5-linked FAME20. The same observation also holds for spinocerebellar ataxia 37 (SCA37, OMIM: 615945), which is caused by the same repeat expansion in the first intron of DAB121. For SCA37, it has been hypothesised that the thymidine to cytosine transition occurs after expansion of the endogenous ATTTT repeat to ~200 copies followed by further expansion of the mutant ATTTC sequence22. The ATTTT/ATTTC strand of the repeat is aligned with the direction of gene expression in all genes reported thus far, regardless of their chromosomal orientation. The mechanism of disease pathogenesis has been suggested to be RNA toxicity21. In zebrafish embryos, direct injection of RNA containing 58 copies of the AUUUC repeat was lethal or caused developmental defects in 81%, while the effect of injecting RNA containing 139 AUUUU repeats was not significantly different from controls21. Accumulation of AUUUC repeat containing RNA was observed in the brain of some individuals with FAME1, but we did not have access to similar biopsy tissue from individuals with FAME211. While we found no significant change in expression of STARD7 in patient-derived cell lines, it is possible that expression of this gene is regulated differently in the non-proliferating cells of the brain. Profiling expression of all known genes implicated with pathogenic ATTTC dynamic mutations using gene expression data from the GTEX portal https://www.gtexportal.org23 shows that DAB1 has high expression specifically in cerebellum while the five genes implicated in FAME thus far are more broadly expressed throughout the brain (Fig. 4). This difference in expression may partly explain the absence of epilepsy in individuals with SCA37.
STARD7 is a member of the START (StAR-related lipid transfer) domain-containing family of lipid transfer proteins with functions including intra-mitochondrial lipid transfer of phosphatidylcholine24. Previously, increased levels of choline have been detected by proton magnetic resonance spectroscopy (1H-MRS) in the cerebellum of 11 individuals from three Italian families all shown here to have the ATTTC dynamic mutation25 (Table 1). This observation may be peculiar to FAME2 families since the SAMD12, RAPGEF2, TNRC6A and MARCH6 genes do not have overlapping molecular functions.
In conclusion, we have identified the molecular basis of FAME2 is an inserted expanded ATTTC repeat in the first intron of the STARD7 gene, in 22 pedigrees with 266 affected individuals. The insertion segregates with disease status in 100% of individuals tested from families with linkage or suggestive linkage to chromosome 2 providing substantial genetic evidence that this mutation is causal in this syndrome. The FAME2 locus is the most frequently observed linked region for Caucasian individuals affected by this disorder whereas chromosome 8 thus far is limited to Asian individuals, therefore molecular genetic testing should take this into consideration if choosing to screen by RP-PCR. Identification of the gene and causative mutation for FAME2 opens the opportunity to explore the origins of the ATTTT/ATTTC expansion through a detailed comparison of the haplotypes and repeat structures of these individuals as has been done for SCA3722. There may be many additional undiagnosed individuals with a spectrum of FAME-related symptoms whose genetic causes may be due to ATTTC insertion and expansion at one of the FAME loci. This is especially likely in families that have multiple individuals with tremor and a low frequency of GTCS. As no preventive or curative treatments are currently available for FAME, these findings may have important therapeutic implications, including RNA-targeting treatments, such as antisense oligonucleotides or RNA-targeting Cas9 (RCas9)26.
This study was approved by the Human Research Ethics Committees of the University of Melbourne and the University of Adelaide. Written, informed consent was obtained from all participants in the study.
Adelaide: Human genomic DNA extracting from peripheral blood lymphocytes was prepared from two individuals in Family 1 (IV-3 and V-118) for sequencing using the TruSeq Nano DNA Library Preparation Kit (Illumina). Mapping of 150 bp, paired-end sequence reads to the UCSC hg19 build of the genome and calling of single nucleotide variants from whole-genome sequencing (WGS) data generated using an Illumina HiSeqX10 platform (Kinghorn Centre for Clinical Genomics, Sydney, Australia), was performed as previously described with the minor modification of using the Genome Analysis Toolkit (GATK) version 3.8 software27,28. Filtering of both coding and non-coding variants within the chr2 linkage interval shared between both individuals under a dominant model and absent from the gnomAD variant database29 at a frequency >0.001 was performed using the bcftools isec command from htslib v1.9. Single nucleotide variants and indels were annotated with ANNOVAR30. Reads containing the expanded repeat were visualised using the Integrative Genomics Viewer (IGV) v2.4.5 with soft-clipped reads unmasked31.
Rome: WGS library was prepared from the genomic DNA of the individual (PM195; Family 4) by using TruSeq DNA PCR-Free KIT (Illumina, San Diego, CA, USA) and sequenced 150 bp paired-end reads on an Illumina HiSeq producing 470,174,247 fragments, corresponding to about 39X coverage after mapping and removal of duplicated reads. Reads were quality filtered and aligned to the reference human genome sequence (GRCh38/hg38) with BWA-MEM v.0.7.1532. Resulting BAM files underwent local realignment around insertion-deletion sites, duplicate marking and recalibration steps with GATK v3.828. Variant calling was performed with HaplotypeCaller v3.8 with standard parameters, and output VCF files were recalibrated with VariantRecalibrator from GATK v3.8. Genomic variant annotation was carried out with VarSeq v1.4.7 (Golden Helix, Inc., Bozeman, MT, www.goldenhelix.com) and only variants with a minimum read depth of 5X were included in the downstream analysis. Thereafter, only variants in the pericentromeric region of interest of chr2 (chr2: 91,800,000–106,700,000) were considered.
Prioritisation of variants of potential interest was carried out through three distinct analyses. For the first analysis, all variants reported to be pathogenic or potentially pathogenic in the clinical databases of ClinVar, HGMD Professional v2017.2 and/or Centogene CentoMD v4.1 were retained. For the second analysis, we focused on variants in exonic regions without a reported clinical annotation. We excluded variants with a population frequency above 1% in the databases of 1000 Genomes Project, National Heart, Lung and Blood Institute (NHLBI, https://www.nhlbi.nih.gov/) Exome Sequencing Project (ESP, http://evs.gs.washington.edu/), ExAC (Exome Aggregation Consortium, http://exac.broadinstitute.org/) and gnomAD (The Genome Aggregation Database, https://gnomad.broadinstitute.org/), along with variants recorded in the Personal Genomics internal database. We retained all the non-synonymous variants predicted to alter the protein structure or function by at least three of the following in silico prediction tools: Mutation Taster, SIFT, Polyphen-2, MutationAssessor and FATHMM. For the third analysis, we prioritised the variants outside exonic regions by considering rare variants (frequency below 1% in frequency population databases, including the Personal Genomics internal database) and with a predicted significant effect on the protein structure or function by at least three of the in silico prediction tools. Variants were then prioritised by considering their presence in regulatory regions as reported in the ENCODE database (https://www.encodeproject.org/). The manual inspection of the BAM files, by using Integrative Genomics Viewer (IGV), allowed us to evaluate the coverage of the variants and the quality of the aligned reads.
The identification of putative genomic expansions, structural variants or copy number variations was carried out by using Lumpy v0.2.1333 and Manta v1.2.234 software. The ExpansionHunter tool v2.5.314 was adopted to estimate the size of potential repetitions of short unit sequences.
DNA was extracted for all long-read sequencing protocols using the QIAsymphony system from skin fibroblasts (passage 6) cultured in Dulbecco’s modified Eagle’s Medium (DMEM; Life Technologies) with 10% fetal calf serum. Pacific Biosciences (PacBio) single molecule real-time (SMRT) sequencing data were obtained in two batches: In the first batch, two Australian FAME2 carriers (Family 1: IV-44 and IV-98) were sequenced with two flow cells per sample. Resulting bam files were converted to fastq using the SMRT Link software v5.1.0 bam2fastq program. Resulting fastq files were either mapped directly to the human genome hg38 build using NGM-LR35 with structural variants called by Sniffles35 or used as input for de novo assembly with Canu v1.7. In the second batch, a single sample (Family 1: IV-98) was sequenced. DNA fragment sizes were determined with the Femto Pulse capillary electrophoresis system (Agilent Technologies, Santa Clara, CA). DNA fragments of size greater than 6 kb were selected with BluePippin (Sage Science, Beverly, MA) pulsed field gel electrophoresis system. Sequencing was carried out for 20 h per SMRT cell on the Sequel system with Binding Kit 3.0 (PacBio, 101–500–400) and Sequencing Kit 3.0 (PacBio 101–427–800). Circular consensus calling was performed using CCS 3.2.1 software. Reads were mapped to the GRCh38 build of the human genome using pbmm2 with “-c 0 -L 0.01” for CCS reads and “-c 0 -L 0.1” for subreads.
Oxford nanopore data were obtained for DNA samples extracted from fibroblasts from two individuals from Family 1, as described above, and two from Family 5 (II-37 and IV-29 Fig. S2e). For each of the four participant samples, 3 µg of DNA was prepared for Oxford Nanopore 1D genomic sequencing by ligation using the SQK-LSK108 kit and was run on a FLO-MIN106 flow cell for 48 h. Basecalling was performed on MinKNOW 18.01.6 with MinKNOW Core 1.11.5 and Albacore v2.1. Data were either mapped with NGM-LR or assembled with Canu v1.7 as described below, using suggested settings for nanopore sequencing reads.
De novo whole-genome assembly of one individual with input of both PacBio and nanopore sequencing from one individual from Family 1 was carried out using the Canu v1.7 assembler with default starting parameters for a genome size of 3.6 Gbp. Recalibrated reads from Canu were mapped to the hg38 build of the human genome using NGM-LR as described above.
Repeat expansion analysis
WGS was performed for two affected individuals from Family 1 on the Illumina HiSeq X10 platform, one individual from Family 3 as described above, and three affected individuals from Family 19 on the Illumina HiSeq platform. A cohort of 69 individuals without FAME were used for comparison, with 150 bp paired-end sequencing performed on the Illumina HiSeq X platform (Kinghorn Centre for Clinical Genomics, Sydney, Australia). Library preparation for 53 of the samples used the Illumina TruSeq Nano DNA HT Library Preparation Kit; the other 16 samples used KAPA Hyper Prep Kit PCR-free library preparation.
Reads were aligned to the hg19 reference genome with BWA-MEM v0.7.17-r118832, then duplicate marking, local realignment and recalibration were performed with GATK v4.0.3.028. Repeat expansion analysis targeting two FAME2 loci, the ATTTT repeat and predicted ATTTC insertion in STARD7, was performed using ExpansionHunter v2.5.514 and exSTRa v0.88.3 with Bio-STR-exSTRa v1.0.115. Custom files defining the FAME2-AAAAT and FAME2-AAATG repeat loci were created for ExpansionHunter (below) and exSTRa (Supplementary Table 2).
Supplementary Figure 1 shows the repeat sizes predicted by ExpansionHunter and empirical cumulative distribution function of repeated bases from exSTRa for the two FAME2 loci. Significance testing was performed using the exSTRa tsum_test function with 100,000 permutations in case-control mode comparing each affected individual with FAME to the 69 unaffected individuals without FAME. All FAME2 carriers were significant outliers for the FAME2-AAATG locus (p < 0.0001 for all individuals) while only four samples were significant outliers (p < 0.05) for the FAME2-AAAAT locus.
Total RNA was extracted from patient-derived primary skin fibroblasts of four Australian/New Zealand FAME, two Italian FAME and four age-matched controls using QIAGEN RNeasy kits, as per the manufacturer’s protocol. Library preparation and RNA-Seq were performed as a service by the UCLA Neuroscience Genomics Core Facility. The TruSeq v2 kit (Illumina) was used to generate un-stranded libraries with 150-bp mean fragment sizes and 50-bp paired-end sequencing performed using the HiSeq2500 (Illumina). Sequence data were mapped to the GRCh38 build of the human genome using HISAT2 v2.1.036. Read counts were generated with StringTie v1.3.336. Differential expression between FAME and control samples was determined using the exact test from the edgeR v3.26.5 package in R v3.6.037. Differentially expressed genes were filtered to false discovery rate (FDR) < 0.05 and log base 2-fold change (LFC) > = 1 or < = −1.
RNA was extracted from four patient-derived primary skin fibroblast cell lines from Family 1 and four control fibroblast cell lines from adult donors not affected by FAME as described above under RNA-Seq. cDNA were generated from 1 μg of total RNA using the iScript reverse transcription kit (Bio-Rad, Gladesville, NSW, Australia; cat# 1708891), according to the manufacturer’s protocol.
Quantification of differentially expressed transcripts was performed with the relative standard curve method using SYBR green fluorescence intensity for detection. Products were amplified in 1 × iTaq Universal SYBR Green supermix (Bio-Rad; cat# 1725121) with primers at 1μM final concentration. Each sample and standard was amplified with three technical replicates on an Applied Biosystems StepOnePlus. Expression values were determined relative to a dilution curve of a cDNA standard made from pooled control fibroblast cDNA. Specificity of products was determined by melt curve analysis at the conclusion of each run. Expression values of each gene were normalised to HPRT1 expression values from the same sample.
Fibroblasts were cultured as described in Supplementary methods then lysed with lysis buffer (150 mM NaCl, 1% Triton X-100, 1 mM EDTA, 0.25% Sodium deoxycholate, 50 mM Tris. Added protease inhibitor, 50 mM NaF and 0.1 mM Na3VO4). Extracts were separated by 4–12% polyacrylamide gel and transferred to nitrocellulose membrane by electroblotting. STARD7 was detected with rabbit polyclonal anti-human/mouse/rat STARD7 (Proteintech cat# 15689–1-AP) at 1:500 dilution followed by anti-rabbit IgG conjugated to horseradish peroxidase (HRP) at 1:2000 (Dako cat# P0448). Enhanced chemiluminescent detection (Bio-Rad cat# 1705061) was visualised with the chemidoc detection system (Bio-Rad). Full blots are available in the Source Data file.
PCR amplification and sequencing of repeats (Rome)
Pentanucleotide repeats were analysed in duplicate by long-range PCR with Expand Long Template PCR System (Roche) according to the manufacturer’s recommendation. Some 200 ng genomic DNA were amplified with primers STARD7F and STARD7R (300 nM), dNTP (350 µM) buffer 1 (1×) Enzyme 0.5 U (×50 µl reaction). After 2 min of initial denaturation at 94 °C, DNA samples underwent 10 cycles of amplification (denaturation 94 °C for 10 s, annealing 56 °C for 30 s, elongation 68 °C 3 min) followed by an additional 20 cycles (94 °C for 15 s, annealing 56 °C for 30 s, elongation 68 °C 45 s + 20 s each cycle elongation for each successive cycle). PCR products were separated by electrophoresis on 1% agarose gel. DNA was extracted from the agarose gel slice and the number of repeat units was determined by Sanger sequencing (Eurofins Genomics Sequencing Service).
Primers for both Adelaide and Rome are shown in Supplementary Table 3.
Adelaide: Reaction mixes included 100 ng genomic DNA, 0.5 µM FAM-labelled locus specific (RP-PCR-FAME2-P1 or P2) and RP-PCR-P3 primers, and 0.05 µM repeat specific primer (one of RP-PCR-FAME2–4.5 to 4.8) with Expand Long Template polymerase (Roche, cat# 25524324) or Taq polymerase (Roche, cat# 18697220). The initial RP-PCR step was at 95 °C for 5 min followed by 10 cycles (95 °C for 30 s, 48 °C + 1.0 °C each cycle for 45 s and 65 °C + 1.0 °C each cycle for 5 min) continuing to 30 cycles (95 °C for 30 s, 58 °C for 1 min and 72 °C for 5 min) and ending with 72 °C for 7 min. Fragment analysis was performed on the RP-PCR products with an ABI3730 DNA analyser.
Rome: The pentanucleotide repeat sequence in STARD7 gene was amplified by ATTTT and ATTTC RP-PCR with the following primers: STARD7R* 5′FAM-labelled (locus specific primer), RP-PCR-STARD7-P3 (generic primer) and RP-PCR-STARD7-P4 primers specific for the short pentanucleotide repeat (ATTTT) and for the possible expanded (ATTTC) repeat or possible (ATTTC) repeat interruption. PCR was performed with 100 ng DNA, 1.5 mM MgCl2, 200 µM dNTP, 0.4 µM locus specific primer, 0.4 µM generic primer, 0.2 µM repeat primer, 2.5 U Polymed Taq in 25 µl volume. The initial PCR step was at 94 °C for 15 min followed by 35 cycles (94 °C for 45 s, 60 °C for 30 s and 72 °C for 2 min) and 72 °C elongation for 30 min. Capillary electrophoresis was performed on ABI310 GEN ANALYZER (Applied Biosystems).
Further information on research design is available in the Nature Research Reporting Summary linked to this article.
Source data for Figs. 1a, 3a, 3b, Supplementary Figs. 1a, b and 4b are provided in the Source Data files of this manuscript. RNA-Seq data are available from the NCBI BioProject PRJNA563467. Whole-genome sequencing data are available from the corresponding author on request, subject to human research ethics approval and patient consent.
van den Ende, T., Sharifi, S., van der Salm, S. M. A. & van Rootselaar, A.-F. Familial cortical myoclonic tremor and epilepsy, an enigmatic disorder: from phenotypes to pathophysiology and genetics. a systematic review. Tremor Hyperkinetic Mov. N. Y. N. 8, 503 (2018).
Crompton, D. E. et al. Familial adult myoclonic epilepsy: recognition of mild phenotypes and refinement of the 2q locus. Arch. Neurol. 69, 474–481 (2012).
Mikami, M. et al. Localization of a gene for benign adult familial myoclonic epilepsy to chromosome 8q23.3-q24.1. Am. J. Hum. Genet. 65, 745–751 (1999).
Guerrini, R. et al. Autosomal dominant cortical myoclonus and epilepsy (ADCME) with complex partial and generalized seizures: A newly recognized epilepsy syndrome with linkage to chromosome 2p11.1-q12.2. Brain J. Neurol. 124, 2459–2475 (2001).
Depienne, C. et al. Familial cortical myoclonic tremor with epilepsy: the third locus (FCMTE3) maps to 5p. Neurology 74, 2000–2003 (2010).
Yeetong, P. et al. A newly identified locus for benign adult familial myoclonic epilepsy on chromosome 3q26.32-3q28. Eur. J. Hum. Genet. EJHG 21, 225–228 (2013).
Stogmann, E. et al. Autosomal recessive cortical myoclonic tremor and epilepsy: association with a mutation in the potassium channel associated gene CNTN2. Brain J. Neurol. 136, 1155–1160 (2013).
Striano, P., Zara, F., Striano, S. & Minetti, C. Autosomal recessive epilepsy associated with contactin 2 mutation is different from familial cortical tremor, myoclonus and epilepsy. Brain 136, e253–e253 (2013).
Henden, L. et al. Identity by descent fine mapping of familial adult myoclonus epilepsy (FAME) to 2p11.2-2q11.2. Hum. Genet. 135, 1117–1125 (2016).
Cen, Z. et al. Intronic pentanucleotide TTTCA repeat insertion in the SAMD12 gene causes familial cortical myoclonic tremor with epilepsy type 1. Brain J. Neurol. 141, 2280–2288 (2018).
Ishiura, H. et al. Expansions of intronic TTTCA and TTTTA repeats in benign adult familial myoclonic epilepsy. Nat. Genet. 50, 581–590 (2018).
Suppa, A. et al. Clinical, neuropsychological, neurophysiologic, and genetic features of a new Italian pedigree with familial cortical myoclonic tremor with epilepsy. Epilepsia 50, 1284–1288 (2009).
Saint-Martin, C. et al. Refinement of the 2p11.1-q12.2 locus responsible for cortical tremor associated with epilepsy and exclusion of candidate genes. Neurogenetics 9, 69–71 (2008).
Dolzhenko, E. et al. Detection of long repeat expansions from PCR-free whole-genome sequence data. Genome Res. 27, 1895–1903 (2017).
Tankard, R. M. et al. Detecting expansions of tandem repeats in cohorts sequenced with short-read sequencing Data. Am. J. Hum. Genet. 103, 858–873 (2018).
Benson, G. Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 27, 573–580 (1999).
De Fusco, M. et al. The α2B -adrenergic receptor is mutant in cortical myoclonus and epilepsy. Ann. Neurol. 75, 77–87 (2013).
Saita, S. et al. PARL partitions the lipid transfer protein STARD7 between the cytosol and mitochondria. EMBO J. 37, e97909 (2018).
Zeng, S. et al. Long-read sequencing identified intronic repeat expansions in SAMD12 from Chinese pedigrees affected with familial cortical myoclonic tremor with epilepsy. J. Med. Genet. 56, 265–270 (2018).
Florian, R. T. et al. Unstable TTTTA/TTTCA expansions in MARCH6 are associated with Familial Adult Myoclonic Epilepsy type 3. Nat. Commun. https://doi.org/10.1038/s41467-019-12763-9 (2019).
Seixas, A. I. et al. A Pentanucleotide ATTTC repeat insertion in the non-coding region of DAB1, mapping to SCA37, causes spinocerebellar ataxia. Am. J. Hum. Genet. 101, 87–103 (2017).
Loureiro, J. R. et al. Mutational mechanism for DAB1 (ATTTC)n insertion in SCA37: ATTTT repeat lengthening and nucleotide substitution. Hum. Mutat. 40, 404–412 (2018).
GTEx Consortium et al. Genetic effects on gene expression across human tissues. Nature 550, 204–213 (2017).
Flores-Martin, J., Rena, V., Angeletti, S., Panzetta-Dutari, G. M. & Genti-Raimondi, S. The lipid transfer protein StarD7: structure, function, and regulation. Int. J. Mol. Sci. 14, 6170–6186 (2013).
Striano, P. et al. (1)H-MR spectroscopy indicates prominent cerebellar dysfunction in benign adult familial myoclonic epilepsy. Epilepsia 50, 1491–1497 (2009).
Batra, R. et al. Elimination of toxic microsatellite repeat expansion RNA by RNA-targeting Cas9. Cell 170, 899–912.e10 (2017).
Corbett, M. A. et al. Dominant KCNA2 mutation causes episodic ataxia and pharmacoresponsive epilepsy. Neurology 87, 1975–1984 (2016).
Van der Auwera, G. A. et al. From FastQ data to high confidence variant calls: the Genome Analysis Toolkit best practices pipeline. Curr. Protoc. Bioinforma. 11, 11.10.1–11.10.33 (2013).
Lek, M. et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature 536, 285–291 (2016).
Wang, K., Li, M. & Hakonarson, H. ANNOVAR: functional annotation of genetic variants from high-throughput sequencing data. Nucleic Acids Res. 38, e164 (2010).
Thorvaldsdóttir, H., Robinson, J. T. & Mesirov, J. P. Integrative Genomics Viewer (IGV): high-performance genomics data visualization and exploration. Brief. Bioinform. 14, 178–192 (2013).
Li, H. & Durbin, R. Fast and accurate short read alignment with Burrows-Wheeler transform. Bioinformatics 25, 1754–1760 (2009).
Layer, R. M., Chiang, C., Quinlan, A. R. & Hall, I. M. LUMPY: a probabilistic framework for structural variant discovery. Genome Biol. 15, R84 (2014).
Chen, X. et al. Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications. Bioinformatics 32, 1220–1222 (2016).
Sedlazeck, F. J. et al. Accurate detection of complex structural variations using single-molecule sequencing. Nat. Methods 15, 461–468 (2018).
Pertea, M., Kim, D., Pertea, G. M., Leek, J. T. & Salzberg, S. L. Transcript-level expression analysis of RNA-seq experiments with HISAT, StringTie and Ballgown. Nat. Protoc. 11, 1650–1667 (2016).
Robinson, M. D., McCarthy, D. J. & Smyth, G. K. edgeR: a Bioconductor package for differential expression analysis of digital gene expression data. Bioinformatics 26, 139–140 (2010).
Licchetta, L. et al. A novel pedigree with familial cortical myoclonic tremor and epilepsy (FCMTE): clinical characterization, refinement of the FCMTE2 locus, and confirmation of a founder haplotype. Epilepsia 54, 1298–1306 (2013).
Gardella, E. et al. Autosomal dominant early-onset cortical myoclonus, photic-induced myoclonus, and epilepsy in a large pedigree. Epilepsia 47, 1643–1649 (2006).
Madia, F. et al. Benign adult familial myoclonic epilepsy (BAFME): evidence of an extended founder haplotype on chromosome 2p11.1-q12.2 in five Italian families. Neurogenetics 9, 139–142 (2008).
de Falco, F. A. et al. Benign adult familial myoclonic epilepsy: genetic heterogeneity and allelism with ADCME. Neurology 60, 1381–1385 (2003).
Coppola, A. et al. Psychiatric comorbidities in patients from seven families with autosomal dominant cortical tremor, myoclonus, and epilepsy. Epilepsy Behav. 56, 38–43 (2016).
Striano, P., Madia, F., Minetti, C., Striano, S. & Zara, F. Electroclinical and genetic findings in a family with cortical tremor, myoclonus, and epilepsy. Epilepsia 46, 1993–1995 (2005).
Striano, P. et al. A New Benign Adult Familial Myoclonic Epilepsy (BAFME) Pedigree Suggesting Linkage to Chromosome 2p11.1-q12.2. Epilepsia 45, 190–192 (2004).
Coppola, A. et al. Natural history and long-term evolution in families with autosomal dominant cortical tremor, myoclonus, and epilepsy. Epilepsia 52, 1245–1250 (2011).
van Coller, R., van Rootselaar, A.-F., Schutte, C. & van der Meyden, C. H. Familial cortical myoclonic tremor and epilepsy: Description of a new South African pedigree with 30 year follow up. Parkinsonism Relat. Disord. 38, 35–40 (2017).
Labauge, P. et al. Absence of linkage to 8q24 in a European family with familial adult myoclonic epilepsy (FAME). Neurology 58, 941–944 (2002).
We wish to thank the many families involved in this study. We thank Dr. Tessa Mattiske, Dr. Mark Holloway and (Andy) Hung Nguyen for technical assistance. Dr. Joel Geoghegan and Dr. Andreas Schreiber for assistance with PacBio sequencing. We wish to acknowledge the following sources of funding: NHMRC (Jozef Gecz, Ingrid Scheffer Sam Berkovic), Women’s and Children’s Hospital Research Foundation (Mark Corbett, Jozef Gecz), Muir Maxwell Trust and Epilepsy Society (Simona Balestrini, Sanjay M. Sisodiya), The European Fund for Regional Development from the European Union (grant 01492947) and the province of Friesland, Dystonia Medical Research Foundation, Stichting Wetenschapsfonds Dystonie Vereniging, Fonds Psychische Gezondheid, Phelps Stichting, from Ipsen & Allergan Farmaceutics, Merz, and Actelion (Marina A.J. Tijssen). The Italian Ministry of Health (grant GR2013–02356227) and Istituto Superiore di Sanità, Italy, (grant PGR00229-PGR00919 and Farmindustria) Undiagnosed Disease Network Italy, (Francesco Brancati). The Fondation maladies rares, University Hospital Essen (Christel Depienne). This work was partly done at NIHR University College London Hospitals Biomedical Research Centre, which receives a proportion of funding from the UK Department of Health’s NIHR Biomedical Research Centres funding scheme.
A.W. and S. Chakraborty. are employees and shareholders of Pacific Biosciences. There are no other competing interests to declare.
Peer review information Nature Communications thanks Peter Todd, Rhys Thomas and the other, anonymous, reviewer(s) for their contribution to the peer review of this work. Peer reviewer reports are available.
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Corbett, M.A., Kroes, T., Veneziano, L. et al. Intronic ATTTC repeat expansions in STARD7 in familial adult myoclonic epilepsy linked to chromosome 2. Nat Commun 10, 4920 (2019) doi:10.1038/s41467-019-12671-y