Waardenburg syndrome (WS) is a rare autosomal dominant disorder with an estimated prevalence of 1:42,000.1 It is characterized by congenital sensorineural hearing loss and depigmentation of the eyes, hair and skin, deafness, with or without dystopia canthorum (a lateral displacement of the inner canthi of the eyes),2,3,4 grouped into four subtypes (WS1–4). First reported in Europe,5 this disease has also been commonly reported in other ethnic groups,6,7 but it is rarely been reported in Chinese population.8

WS rarity and genetic heterogeneity limited the ability to make accurate clinical diagnosis in individual patients. Presently, seven genes are related to WS: PAX3 (encoding the paired box 3 transcription factor), MITF (microphthalmia-associated transcription factor), EDNRB (endothelin receptor type B), EDN3 (endothelin 3), SNAI2 (snail homolog 2), SOX10, and recently reported KITLG (encoding KIT ligand).9 As a result, the accurate diagnosis of WS is difficult to achieve by traditional genetic testing methods, for example, Sanger sequencing. In this paper, a Chinese family affected by WS1 is reported, whole-exome sequencing (WES) was performed and a nonsense mutation in PAX3 gene in Chinese population was discovered.

In the present study, three affected patients (I-2, II-2, III-1) and one unaffected member (I-1) from an isolated village in Inner Mongolia in China were involved in our study (Figure 1a). The proband (I-2) was found when she sought treatment due to pneumonia. Dystopia canthorum was present in all patients, but the symptom of III-1 was less severe. Pigmentary disturbance of iris was present in two patients (I-2, II-2) (Figure 1b). Hair hypopigmentation was present in two patients (I-2, II-2), while only one patient (II-2) had sensorineural hearing loss. The clinical features of the proband suggest WS; however, other disorders associated with hearing loss cannot be excluded.

Figure 1
figure 1

(a) Family pedigree of WS1. Square indicates male, and circles indicate females. Different clinical features are displayed by different position of black squares. The proband is I-2. (b) The picture of eyes of the proband, showing the color of left eye is brilliant blue.

Whole-exome sequencing was performed in order to identify the causal genes. DNA was isolated from peripheral blood by standard procedures (Tiangen, China. Cat. No. DP348). Whole-exome enrichment was performed using SureSelect XT Target Enrichment System (51 Mb) according to the manufacturer’s protocols (Agilent, Santa Clara, CA, USA). Captured libraries were loaded onto the HiSeq 2500 platform (Illumina, San Diego, CA, USA). Workflow recommended by the Genome Analysis Tool kit (GATK) was applied to analyze the variants. Reads were aligned to the human reference genome (UCSC GRCh37/hg19) by using the Burrows-Wheeler Aligner (BWA v0.7.12).10 Presumed PCR duplicates were removed using Picard’s MarkDuplicates. Local alignment optimization and base quality recalibration was performed with the GATK. Indels and single-nucleotide variants were identified by means of the GATK HaplotypeCaller and Unified Genotyper algorithms. Variants were functionally annotated and filtered using our cloud-based rare disease NGS analysis platform (https://www.gene.ac/) with build in public databases (dbSNP, OMIM, ESP, Clinvar, 1,000 Genomes,11 and ExAC)12,13 and HGMD Professional database. Exonic sequence alterations and intronic variants at exon-intron boundaries, with unknown frequency or minor allele frequency (MAF) <1% and not present in the homozygous state in those databases were retained.

A mean depth of target region of 121.81× was achieved, and 86% of targeted bases were read >30 times by whole-exome sequencing. A mean of only 7.11% bases in the targeted coding regions had <10× coverage, and a mean of only 5.08% bases had <4× coverage. A total of 584,454 SNPs were identified (see Supplementary Dataset). We identified mutation c.598C>T in exon5 of PAX3 gene was likely to be pathogenic, which is a nonsense mutation causing the glutamine to stop codon substitution at codon 200 (Figure 2). This variant of Irish origin was firstly reported in 1995 by Baldwin et al.,2,14 and documented in LOVD database (http://grenada.lumc.nl/LOVD2/WS/). It was predicted to be disease-causing by in silico analysis using Mutation Taster (http://www.mutationtaster.org/), with P-value of 0.99969, suggesting a high possibility of protein functional alteration, which has not been described in previous report. This variant co-segregates with all affected individuals in the family and was not observed in unaffected family members. Gln200 is localized in a phylogenetically conserved HD domain and is evolutionarily conserved from mouse to humans (Figures 2b and c). This mutation was further validated by Sanger sequencing (Figure 2a). The amplicon was directly sequenced using the ABI BigDye Terminator Sequencing Kit (Applied Biosystems, Shanghai, China) and an automated capillary sequencer (ABI 3730×l; Applied Biosystems). Sequence electropherograms were analyzed using the Chromas software.

Figure 2
figure 2

(a) Sequence chromatograms of the family: a heterozygous change, c.598C>T, was identified in patients I-2, II-2 and III-1) (b) Schematic diagram of the PAX3 gene. The boxes indicate the structure of the PAX3 protein. The arrow of patient indicates the position of mutation in PAX3 protein. (c) Cross-species multiple alignment of PAX3 protein sequences, showing the evolutionally conserved sequence around position Q200.

Recently, WES has been demonstrated as a powerful approach to elucidate the genetic basis of rare Mendelian disorders with unknown etiology.15,16,17,18 In addition, WES is increasingly being utilized as a diagnostic tool for specific genetic diseases with significant genetic and phenotypic heterogeneity.15 The progression of genomic technologies over the past few years has led to a disruptive shift in our approach to the identification of new Mendelian disorders for numerous previously unresolved rare disorders.18,19 WES is appropriate because of lower turnaround time, significantly reduced cost, and more focused and comprehensive coverage of target genes, and easier data analysis compared with whole genome sequencing.

In conclusion, we identified a nonsense pathogenic variant (c.598C>T; p.Gln200*) in the PAX3 gene that has not been reported in Chinese population before. This report will contribute to a better WS mutation spectrum identified so far. We also demonstrated that WES could be a powerful approach to identify pathogenic variants in patients with WS.