Introduction

The association of gonadal dysgenesis and deafness was firstly reported in 1951 and has later been referred to as Perrault syndrome (MIM #233400).1 It is a rare autosomal recessive disorder characterized by ovarian dysgenesis with normal karyotype (46, XX) and primary ovarian insufficiency in females, and by bilateral sensorineural hearing loss (HL) in both genders. The severity of HL may range from moderate with early-childhood onset to profound with prelingual onset. In addition, variable neurological symptoms may be present in some families, such as intellectual disability, developmental delay, muscle weakness, cerebellar ataxia and peripheral sensorimotor neuropathy. In general, sensorineural HL is the first clinical manifestation, although a formal diagnosis of Perrault syndrome will not be made until a failure of puberty (primary amenorrhea) or a secondary amenorrhea is noted in affected young women.2

The genetic and molecular bases of Perrault syndrome have long remained unknown; the advent of next-generation sequencing has been fundamental to allow comprehensive genetic analysis and gene discovery for such rare inherited condition, irrespective of pre-existing linkage data in a patient's family. Indeed, in the last few years, causative mutations were identified in five genes: HSD17B4 (17-beta hydroxysteroid dehydrogenase 4; MIM *601860), HARS2 (mitochondrial histidyl-tRNA synthetase; MIM *600783), LARS2 (mitochondrial leucyl-tRNA synthetase; MIM *604544), CLPP (mitochondrial ATP-dependent chambered protease; MIM *601119) and C10ORF2 (mitochondrial DNA helicase Twinkle; MIM *606075).3, 4, 5, 6, 7 Notably, four of them are nuclear genes coding for mitochondrial proteins, pointing to an important role for these organelles in the maintenance of normal hearing and ovarian function. However, the so-far-identified Perrault syndrome genes collectively explain only a fraction of cases (10 families worldwide, of the 17 so-far genetically analyzed by next-generation sequencing), suggesting that additional disease-causing genes still remain to be discovered.2, 6, 7, 8

Here, we sought to determine the molecular basis of Perrault syndrome in one non-consanguineous Italian family by whole-exome sequencing.

Materials and methods

Clinical evaluation

One Italian Perrault syndrome family, with two affected (one male, II1; one female, II3) and two healthy siblings, was recruited. The study was approved by the Ethical Committee of the Fondazione IRCCS Cà Granda Ospedale Maggiore Policlinico of Milan and performed according to the Declaration of Helsinki. DNA samples were obtained after having acquired a written informed consent from all participants and anonymized according to the Italian legislation on sensible data recording.

The female proband (II3) presented with premature ovarian failure at 31 years and profound congenital HL (Supplementary Figure S1). She had menarche at the age of 13 years and regular menses until the age of 28 years, when she experienced phases of polymenorrhea and oligomenorrhea before being hospitalized for secondary amenorrhea. In that occasion, she presented with high levels of follicle-stimulating and luteinizing hormones (118 and 45.4 IU l−1, respectively), and concomitant decrease of estradiol levels (<20 pg ml−1). Abdominal ultrasounds evidenced a bicornuate uterus, hypoplastic left ovary and probable dysgenesis of the right ovary, which could not be visualized. The proband’s oldest brother (II1) was diagnosed with congenital bilateral profound sensorineural HL (Supplementary Figure S1). Both siblings had normal speech development and received bilateral cochlear implants. Both have normal karyotype (as assessed by cytogenetic analysis), neurological and cognitive function. A maternal cousin, not available to the study, was referred to have congenital hypoacusia, mental retardation and café-au-lait spots.

Genetic analyses

Whole-exome sequencing was performed on five family members at the service facility of the Yale Center for Genome Analysis. In brief, 75-bp paired-end libraries were constructed starting from 1 μg of blood-derived genomic DNA, and captured on biotinylated DNA baits using the SeqCap EZ Human Exome Library v.2.0 (Roche NimbleGen, Basel, Switzerland). After streptavidin-based purification, amplification and barcoding, samples were run on a single lane of an HiSeq 2000 (Illumina, San Diego, CA, USA). Quality control, alignment to the human reference genome (hg19, GRCh37 build) and variant calling were performed in October 2013 using the Whole-Exome sequencing Pipeline web tool.9

For the prioritization of potentially pathogenic variants, artifacts were filtered out by comparison with exomes sequenced at the same service facility for other conditions. Nonsense, frameshift and canonical splice site variants (that is, variants within 2 base pairs from splice sites) were considered as more likely to be deleterious. Common single-nucleotide variants and insertion–deletions were excluded by filtering against 1000 Genome Project, National Heart, Lung, and Blood Institute Exome Sequencing Project (NHLBI-ESP; 6500 exomes) and dbSNP135 (except variants flagged as clinically relevant) databases. The threshold for the maximum minor allele frequency was set to 1%.10 Consistently with a recessive inheritance model, only homozygous or compound heterozygous variants shared among affected individuals were considered. The potential pathogenicity of single-nucleotide variants was assessed using SIFT,11 MutationTaster2,12 PolyPhen-2,13 MutationAssessor,14 FATHMM,15 Likelihood Ratio Test16 and Condel.17

Sanger sequencing

LARS2 exons 10 and 17 were PCR amplified using sets of primers designed on the basis of the known genomic sequence of the gene (GenBank accession number NG_033907.1). PCRs were performed on 10–20 ng of genomic DNA, following standard procedures. Primer sequences, as well as the specific PCR conditions for each primer couple, are available on request.

Direct sequencing of amplified fragments was performed on both strands as described.18

Pathogenic variants were submitted to the NCBI ClinVar database (accession numbers SCV000238519-21).

In silico protein analysis and homology modeling

In silico analyses of remote homology were performed on the editing and leucine-specific domains, using as queries human LARS2 and E. coli LeuRS amino-acid sequences (UniProt accession numbers Q15031 and P07813), as described.6 In brief, each sequence was aligned by the HHblits algorithm19 to all possible similar sequences from bacteria, fungi and protists, producing a consensus sequence that highlights, for each residue, the underlying homology among species.

Homology modeling was performed by I-TASSER,20 using as templates the E. coli LeuRS in either the aminoacylation or the editing conformation (PDB entries 4AQ7 and 4ARI, respectively) and the FoldX algorithm for structure optimization and in silico mutagenesis.

Results

Exome sequencing identifies two novel potentially pathogenic missense variants in LARS2

Whole-exome sequencing, performed on five family members (I1, I2, II1, II2 and II3, Figure 1a), yielded on average 5.2 Gb high-quality sequence data per exome, with about 98.9% coverage of the target and a 70 × mean coverage (range 51 × –81 ×), with more than 93% of the target covered >10 × (Supplementary Table S1).

Figure 1
figure 1

Family segregation and in silico analyses of the newly identified mutations responsible for Perrault syndrome. (a) LARS2 mutations segregating with Perrault syndrome in the analyzed family. (left) Pedigree of the family under study. The gender of the proband’s unaffected siblings is not reported to anonymize the family tree. Genotypes are indicated below each symbol (W, wild-type alleles, M, mutant alleles). The arrow indicates the proband (II3). Half-filled and filled symbols denote the sole presence of hearing loss and the concomitant presence of hearing loss and premature ovarian failure, respectively. On the right, electropherograms of the regions surrounding the identified mutations in the LARS2 gene are shown. The positions of the C>T and G>A transitions (numbered according to NM_015340.3 cDNA) are indicated by an arrow. Y=C or T; R=G or A. (b) Schematic representation of the LARS2 protein showing the position of the newly identified mutations, as well as the previously reported ones. The editing and the leucine-specific (LS) domains of the protein are enlarged to better visualize the location of mutant residues. The positions of the HIGH and KMSKS motifs are also reported. (c) Consensus sequences of LARS2 LS and editing (partial) domains from alignments to humans (upper) and E. coli (lower). Sequence analysis was based on remote homology of the amino-acid regions surrounding the identified mutations using the HHblits algorithm. Consensus sequences are displayed with WebLogo. The position of the residues involved in the mutations are indicated by an asterisk. A full color version of this figure is available at the Journal of Human Genetics journal online.

Exome analysis (Supplementary Table S2) revealed that both affected siblings were compound heterozygous for two novel missense variants in LARS2: a NM_015340.3:c.899C>T transition (p.Thr300Met), which was inherited from their mother, and a NM_015340.3:c.1912G>A transition (p.Glu638Lys), which was inherited from their father (Figure 1a). The p.Thr300Met variant was predicted to be damaging by all programs except FATHMM, whereas the p.Glu638Lys variant was predicted to be deleterious by five (SIFT, Polyphen HumVar, Polyphen HumDiv, MutationTaster and Likelihood Ratio Test) out of eight programs (Supplementary Table S3). The presence of candidate variants and cosegregation with the phenotype in all family members was confirmed by Sanger sequencing (Figure 1a). In addition, the identified missense variants are absent both in an in-house database of about 3500 ethnically matched control exomes and in the Exome Aggregation Consortium (ExAC) browser, which includes data from >60 000 individuals (http://exac.broadinstitute.org/, last accessed August 2015), suggesting that they likely represent private mutations.

Protein sequence analysis and molecular modeling support the pathogenicity of identified variants

The LARS2 gene localizes on chromosome 3 and codes for a 903-amino-acid-long mitochondrial leucyl-tRNA synthetase, which belongs to the class 1 of aminoacyl-tRNA synthetases. Aminoacyl-tRNA synthetases are essential enzymes that charge the tRNA with its cognate amino acid, thus providing the substrates for protein synthesis. The LARS2 protein (LeuRS) is composed of five principal domains, known as catalytic, editing, leucine-specific domains, anticodon recognition and C-terminal domains (Figure 1b), and two catalytically important motifs, namely the HIGH sequence—at the amino terminus— and the KMSKS loop, responsible for the binding of the 3′ end of the tRNA during aminoacylation. The enzyme, similar to several other aminoacyl-tRNA synthetases, can exist in two different conformations: the aminoacylation and the editing ones. Whereas in the former the KMSKS loop moves as an integral part of the leucine-specific domain and closely interacts with the 3′ end of the tRNA, thus promoting catalysis, in the latter it remains in an open, relaxed conformation with no contacts to the 3′ end of the acceptor stem.21

The site of one of the mutant residues, p.Glu638, is highly conserved across eukaryotes, from yeast to humans. In particular, it is located in the catalytic domain just upstream of the KMSKS loop (Figure 1b). In contrast, the other mutant residue, p.Thr300, is located within the editing domain and is conserved in mammals, but not in more distantly related species (Supplementary Figure S2). However, owing to the ancient evolutionary origin of aminoacyl-tRNA synthetases, the interpretation of missense mutations in these genes is particularly challenging, as single-protein domains can maintain a conserved three-dimensional structure without a strong conservation of the amino-acid sequence. This may also explain why some of the commonly used programs to predict the deleteriousness of a given amino-acid substitution fail to recognize candidate LARS2 mutations as damaging (Supplementary Table S3). To circumvent this problem, we applied a sequence-alignment method that detects remote homology and builds consensus profiles of highly diverse sequences.19 This analysis revealed that p.Thr300 and p.Glu638 correspond to the consensus amino acid for both the human-based and E. coli-based alignments, suggesting the functional and possibly structural relevance of these residues (Figure 1c). In addition, neither methionine nor lysine is seen in any species at positions p.Thr300 and p.Glu638, respectively.

To predict the consequences of the newly identified variants on the LARS2 protein structure, molecular models of the human protein (Figure 2a) were built by homology modeling, using as templates the E. coli LeuRS in either the aminoacylation or the editing conformation.

Figure 2
figure 2

Structural analysis of LARS2 amino acids affected by novel Perrault syndrome mutations. (a) Surface representation of human mitochondrial LeuRS in the aminoacylation conformation (model based on E. coli 4AQ7 pdb structure), showing the different domains colored as follows: gray, N-terminal; yellow, catalytic; cyan, editing; red, anticodon-binding; orange, C-terminal. The tRNALeu (black ribbon) and the leucine-adenylate analog (black sticks) are docked in the catalytic domain. The mutated residues are highlighted in magenta, with their atoms drawn as van der Waals spheres. For clarity purposes, the tRNALeu in the editing conformation is also included and shown as green ribbons (from 4ARI pdb structure). (b) Surface representation of the amino-acid-binding pocket of the editing site in the editing conformation (human model based on E. coli LeuRS 4ARI pdb structure), with tRNALeu docked into it (green stick). Residue p.Thr300 (magenta) is part of the pocket, coordinating the binding of the substrate. (c) Ribbon representation of part of the catalytic domain surrounding residue p.Glu638, in the aminoacylation conformation (human model based on E. coli 4AQ7 pdb structure). The residues of the KMSKS motif are colored pink. Residue p.Glu638 is salt bridging p.Lys644 at the edge of the KMSKS loop, stabilizing its closed state. H-bond between p.Glu638 and p.Thr603 is also highlighted. A full color version of this figure is available at the Journal of Human Genetics journal online.

Residue p.Thr300 is part of a conserved threonine-rich region that shapes the pocket of the amino-acid binding in the LeuRS editing site (Figure 2b). Mutation of this threonine to a methionine could either interfere with the cleavage of the incorrectly paired aminoacyl-tRNA molecule or perturb the discrimination of the cognate amino acid, possibly leading to leucine hydrolization from Leu-tRNALeu.

Residue p.Glu638 is adjacent to the conserved and catalytically important 639-KMSKS-643 loop. This loop, which has a critical role in the activation and aminoacylation reactions, can adopt three conformational states: open, semi-open and closed, depending on the functional configuration of the enzyme. In the aminoacylation configuration, the KMSKS loop is in a fully closed state and forms base-specific interactions with the adenine base at the 3′ of the tRNA, whereas in the editing configuration it is in a fully open state, unable to form specific interactions to the adenine. The movement from the editing to the aminoacylation configuration brings the loop closer to the active site, allowing it to make critical interactions with the 3′ strand, which correctly position the tRNA extremity for the transfer reaction. In particular, p.Glu368 salt bridges with p.Lys644 at the other edge of the loop, possibly helping the full closure of the loop and stabilizing the closed conformation (Figure 2c). The p.Glu638Lys mutation will disallow the interaction with residue p.Lys644, and the charge repulsion will probably interfere with the conformational change from the open to the closed state.

Discussion

Mutations in LARS2 were only recently identified as responsible for Perrault syndrome in two families, one of Palestinian and the other of Slovenian ancestry,6 in which a total of three genetic defects were reported (Table 1). Here, we describe two novel disease-causing variants, thus providing the first independent replication of the involvement of LARS2 in Perrault syndrome pathogenesis, and significantly expanding the current spectrum of pathogenic mutations for this gene (Table 1). Interestingly, the newly reported mutations occur in the editing and near the leucine-specific domain, consistent with two of the three previously reported mutations (Figure 1b). In particular, p.Thr300Met affects the pocket of the amino-acid binding in the LeuRS editing site; we thus hypothesize that this residue could be important for the proofreading mechanism of the enzyme. On the basis of experiments monitoring the deacylation of Ile-tRNALeu, it has been proposed that the editing site of human LeuRS is not operational.22 However, more recent in-depth kinetic analyses established that the physiological target for E.coli LeuRS translational quality control is norvaline, a side product of the leucine biosynthetic pathway that accumulates under limited-oxygen growth conditions, suggesting that the LeuRS editing has been evolutionarily optimized to eliminate non-proteinogenic norvaline rather than isoleucine.23 Hence, the prime biological importance of the editing domain of LeuRS might consist in excluding from protein synthesis a range of non-proteinogenic amino acids (including norvaline), which are ubiquitous in many cellular metabolic pathways and can accumulate in stress conditions or can be taken up in the diet.24

Table 1 Molecular and clinical features of LARS2-dependent Perrault syndrome probands

Mutations in almost every nuclear-encoded mitochondrial aminoacyl-tRNA synthetase (18 out of 19) are associated with a wide variety of syndromes and diseases, all caused by defective aminoacylation activity of the corresponding enzyme.25, 26 Usually, defects in each synthetase result either in a tissue-specific disease, most commonly affecting the central nervous system, or in a syndromic condition. Apart from HARS2 and LARS2, only mutations in an additional mitochondrial tRNA synthetase (AARS2) seem to affect gonadal function,27 whereas deafness is reported in patients with mutations in SARS2, FARS2, IARS2, NARS2 and the dual-localized (mitochondrial and cytoplasmic) KARS.25, 28, 29, 30 However, at present, the number of patients with mutations in different mitochondrial aminoacyl-tRNA synthetases is too small to provide conclusive evidence of robust genotype–phenotype correlations.

Concerning genotype–phenotype correlations in LARS2-dependent Perrault syndrome, the here-described Italian proband shows a more severe audiological phenotype (congenital profound HL affecting all frequencies), but a milder ovarian phenotype (normal menarche at age 13 and primary ovarian insufficiency at 28 y.o.) compared with previously reported cases (Table 1). Moreover, to our knowledge, the presence of bicornuate uterus has never been associated with the Perrault syndrome phenotype before. These observations suggest the existence of a certain degree of clinical variability, which may be related to the specific genetic alteration and/or cell type-specific mitochondrial functions, metabolite requirements, as well as differential expression in basal mRNA levels of tRNA synthetases.