Introduction

Hearing impairment, or deafness (in its most severe form), is one of the most common human sensory disorders. In the USA, ~1 in 1000 children is born with a congenital or prelingual profound hearing loss, and another 1 in 1000 children develops deafness before maturity.1, 2 According to its clinical characteristics, hearing loss can be categorized as syndromic hearing loss or non-syndromic hearing loss (NSHL).3 NSHL is a common sensory deafness and can be divided into the following two types: prelingual and postlingual deafness. In the prelingual NSHL category, ~75–80% are autosomal recessive, 20–25% are dominant and 1–1.5% are X-linked. Furthermore, inheritance patterns are autosomal dominant in nearly all of the hereditary postlingual NSHL families.3, 4 Currently, 57 loci (DFNA1-57) for autosomal dominant, non-syndromic, sensorineural hearing loss have been mapped to chromosomal regions; however, only 32 causative genes coding for a wide variety of proteins have been cloned since 1997 (http://hereditaryhearingloss.org/). This is because the identification of the predisposing gene for many hearing loss cases/families is fundamentally restricted by a small pedigree size, locus heterogeneity, limited power of traditional positional cloning strategies and so on.5

We have focused on the hearing impaired Chinese pedigree since 2002, and we have mapped a deafness locus named DFNA42 (which has been named DFNA52, OMIM entry 607683) via a genome-wide scan, in addition to previous screenings of POU4F3 by traditional methods. Unfortunately, we did not identify any disease-causing mutations at that time.3

POU4F3 is a transcription factor, which contains two functional domains, a POU-specific (POUS) domain and a POU-homeo domain (POUHD), and POU4F3 is necessary for the maturation and survival of hair cells in the inner ear.6 There have been several reports about autosomal dominant mutations in the POU4F3 gene that are associated with NSHL.7, 8, 9, 10 The POU4F3 gene encodes for a protein, which exclusively distributes in the nucleus; however, the mutant protein is localized both to the nucleus and the cytoplasm. Furthermore, the mutant protein’s half-life is longer than that of the wild type.11, 12, 13

Material and methods

Subject recruitment and clinical examination

A family with five generations of hearing impairment, which came from the Hunan Province of China, was identified by the Department of Otorhinolaryngology at the Xiangya Hospital of Central South University, Changsha, China. In this family, there were 17 affected and 21 unaffected individuals (Figure 1a). The onset age of hearing loss in this family ranged from 16 to 30 years. The characteristics of the hearing loss in this pedigree were consistent with bilateral, postlingual, progressive, sensorineural and non-syndromic hearing impairment. Clinical and audio logical examinations were performed on all of the family members, and the clinical characteristics of this family have been described previously.3 Written informed consent was obtained from all of the study participants. The research study was approved by the Human Ethics Committee of the State Key Laboratory of Medical Genetics, Central South University.

Figure 1
figure 1

Identification of POU4F3 mutation in a hearing loss Chinese family. (a) The autosomal dominant, non-syndromic hearing impairment pedigree. Roman numerals refer to generations, and individuals within a generation are numbered from left to right, as is the standard convention. Proband was noted with an arrow. Filled symbol refers to the patient. Open symbols with a question mark refer to the individuals whose data are unknown. Open symbols refer to unaffected individuals. (b) The DNA sequence of the mutation p.L201fs in POU4F3 in the Chinese hearing loss pedigree. The first line represents the control sequence, and the second line represents the patient sequence. The mutation information is indicated by red lettering. A full color version of this figure is available at the Journal of Human Genetics journal online.

Whole-exome sequencing and bioinformatics analysis

Genomic DNA was extracted from peripheral blood lymphocytes obtained from all of the recruited family members using a standard phenol–chloroform method. For each individual selected for whole-exome sequencing, 1 μg of genomic DNA was fragmented, selected and amplified with a PCR. We used the Agilent SureSelect Human All Exon V4+UTRs enrichment platform (Agilent Technologies, Santa Clara, CA, USA) to collect the protein-coding regions of the human genomic DNA, according to the manufacturer’s instructions. An exon-enriched library was sequenced on an Illumina Hiseq 2000 platform following the manufacturer’s instructions (Illumina, San Diego, CA, USA).

Generated reads were aligned to the human reference genome (University of California Santa Cruz (UCSC, http://genome.ucsc.edu) hg19 and National Center for Biotechnology Information (NCBI) b37.1). Sequencing quality was evaluated based on the Illumina GERALD report and the fast Quality Control. Reads that passed were then aligned to the human reference genome (UCSC hg19) using the Burrows-Wheeler Aligner (University of California, Santa Cruz, CA, USA).14

We used Samtools and/or the Genome Analysis Toolkit (The Broad Institute of Harvard and MIT, Cambridge, MA, USA) to identify single-nucleotide polymorphisms (SNPs) and insertions and deletions.15 The SNPs with a read depth ⩾8 × and a quality ⩾30 × were reserved for subsequent analysis. If SNPs and insertions and deletions affected coding sequence, SeattleSeq annotation was annotated to predict the effect of the variants on protein function. Considering the basis of the SNP database (dbSNP), 1000 genome annotations and the supplied PolyPhen prediction, every non-synonymous variant not assigned a 'benign' prediction was considered to be damaging.

Sanger sequencing of the implicated gene

Taking advantage of the PCR amplification and Sanger sequencing, we validated the presence of variants in the candidate genes identified via exome sequencing. Primers were designed using the online tool Primer3. PCR amplification was performed using a GeneAmp PCR System 2720 (Applied Biosystems, Foster City, CA, USA) in a 10 μl mixture containing 5 μl of Premix Ex Taq (Takara Bio, Dalian, China), 30 ng of primers and 30 ng of genomic DNA. PCR amplification was performed as follows: an initial denaturating step at 95 °C for 5 min, followed by 32 cycles of denaturating at 95 °C for 30 s, annealing at 55–60 °C for 30 s, extension at 72 °C for 30 s and then finally extension at 72 °C for 10 min. PCR products were verified by 6% polyacrylamide gel electrophoresis and silver staining.

Results

Whole-exome sequencing identified three candidate genes

We performed whole-exome sequencing on two affected individuals (IV: 13 and V: 6) and one unaffected individual (V: 3) from the hearing impaired family (Figure 1a). An average of 9.9 gigabases (Gb) was generated per individual, and ~99% passed the quality assessment and were aligned to the human reference sequence. Approximately 2.4 Gb were mapped to the targeted exome (NCBI consensus coding sequence) with a mean depth of ~60-fold. By comparing our non-synonymous single-nucleotide variants and insertions and deletions variants against dbSNP137, we first excluded high-frequency (minor allele frequency >0.01) polymorphisms present in the 1000 Genomes Project and NHLBI GO Exome Sequencing Project (ESP-6500) and then excluded variants in our 500 in-house control samples using exomeSeq data. Second, we extracted the segregated variants using whole-exome sequencing and removed the shared SNPs in the three individuals. We were able to subsequently select three candidate variants in three genes (PCDHGB3, SH3RF2 and POU4F3) that were carried by both of the patients for further analysis (Table 1). One is a frameshift mutation (c.602delT, p.L201fs) in the gene POU4F3 (NM_002700.2), another is a missense mutation (c.655A>G, p.I219V) in the gene SH3RF2 (NM_152550.3) and the last one is a missense mutation (c.1739G>A, p.R580H) in the gene PCDHGB3 (NM_018924.4).

Table 1 Direct identification of the causal genes by exome sequencing

Co-segregating of the three candidate genes

We used Sanger sequencing to validate the variations of PCDHGB3, SH3RF2, and POU4F3 and analyze the co-segregating status of the variant in these family members. After screening the available DNA of the pedigree, we found that all of the patients carried heterozygous p.L201fs mutations in POU4F3 (Figure 1b). Furthermore, the mutation was not detected in other unaffected members of this family and 500 unrelated controls from the same ethnic background. Unfortunately, neither SH3RF2 nor PCDHGB3 was co-segregated within this Chinese family. These results suggested that this novel mutation in POU4F3 is not a rare polymorphism, but rather is a causative mutation for autosomal dominant, non-syndromic hearing impairment in this Chinese family.

Results of clustalx1.83 software

Taking advantage of the clustalx1.83 software (Conway Institute UCD, Dublin, Germany), we analyzed variant c.602delT and found that it was located at the beginning of the POUS domain of POU4F3 (Figure 2b). We found that a termination codon occurred early because of the mutation, resulting in a truncated protein. Compared with previous studies, the mutation found in this study affects both the POUS and POUHD domains.

Figure 2
figure 2

POU4F3 mutations. (a) Schematic of the POU4F3 gene with the position of the mutations shown based on the human genomic sequence. (b) POU4F3 protein with the positions of alterations. POU4F3 is a multi-domain protein carrying a POU-specific (POUS) domain and a POU-homeo domain (POUHD). The p.L201fs, p.G221fs and p.L223P alterations were clustered in the POUS domain. The p.L289F, p.I295Tfs and p.R326K alterations were clustered in the POUHD domain. A full color version of this figure is available at the Journal of Human Genetics journal online.

Discussion

We demonstrated that whole-exome sequencing of just a few samples from one family can be applied to identify the causative gene. Furthermore, identification of disease-causing genes may be achieved using the combined strategy of exome sequencing and traditional analysis with even fewer samples required. This approach, together with continuing improvement of sequencing technology, will make medical genomics of hearing impairment a reality.

To date, approximately five mutations in POU4F3 (c.977G>A, c.880del8, c.865C>T, c.668T>C and c.662del14) have been found (Figure 2a). We have made a brief genotype–phenotype analysis according to limited clinical characteristics that included four reported families. We classified them into two groups in terms of the location of their variants. Accordingly, the Chinese family (c.602delT) and the Korean family (c.662del14) belonged to the first group where the mutations were located in the first domain of POU4F3, whereas the Dutch family (c.865C>T) and family H (c.880del8) belonged to the second group where the mutations were located in the second domain of POU4F3.16 Interestingly, all of the four families were characterized with a large variability in terms of onset age and progressive autosomal dominant non-syndromic hearing loss.11, 12, 17 However, the pedigrees in the second group displayed abnormal function of the vestibular apparatus, and only half of the patients had tinnitus, whereas families in the first group showed no malformations of the inner ear by computed tomography examination or abnormal function by vestibular testing. However, all of the patients in the Chinese family had tinnitus.3 Moreover, examination of the history and clinical records of the affected members in our family suggested that high-frequency loss in this pedigree begins in the second or third decade and progresses to profound deafness involving all frequencies. This feature of the other three families was unknown. We know that the variants of the first group are located in the first domain, which would impair both of the domains in POU4F3, whereas the mutations in the second group only impaired the second domain. Therefore, we inferred that the first domain could be related to tinnitus, and the second domain may be sufficient for the occurrence of hearing loss.

Therefore, further functional studies of variants of this specific gene are needed to illustrate the pathogenic mechanism(s) that underlie hearing impairment.