Introduction

Isolated congenital anosmia (OMIM 107200) is a rare condition where patients have no recollection of ever being able to smell1. Unlike syndromic forms of anosmia, ICA patients have no additional symptoms, and no other underlying disease-causing condition can be identified1. ICA usually occurs sporadically, although some familial cases have previously been reported1,2,3,4,5,6. While several causal genes have been identified for syndromic cases of anosmia (such as Kallmann syndrome and Bardet-Biedl syndrome)7,8,9, the underlying genetic architecture of ICA is barely known. Since Glaser et al. first reported ICA as a potentially hereditary trait nearly 100 years ago10, there have been few candidate genes identified as the potential genetic cause for ICA3, 6, 11,12,13. Moreover, most reported anosmia cases are inherited through autosomal dominant patterns2, 14, however, in some instances, it is an X-linked trait3, 6, 10. For instance, In 2015, Karstensen et al. for the first time reported a hemizygous nonsense mutation in CNGA2 in an ICA family (two brothers) with an X-linked pattern of inheritance3, but the lack of additional family members precluded definitive assignment of this gene as associated with this disease. CNGA2 encodes the alpha subunit of a cyclic nucleotide-gated olfactory channel. Interestingly, deletion of any of the three main olfactory transduction components in mice, namely Gnal 15, Adcy3 16 and Cnga2 17, causes significant reduction of physiological responses to odorants consistent with ICA phenotype. Also, Alkelai et al.6 reported a family with congenital general anosmia segregating an X-linked missense mutation in the TENM1 gene6. Together, these studies indicate that ICA is very heterogeneous and that rare pathogenic variants in multiple genes may contribute to the phenotype.

Here, we report exome-sequencing results of a large consanguineous Iranian family segregating ICA in an X-linked pattern of inheritance. We identified a novel stop-gain mutation in CNGA2 gene that in concert with Cnga2 knockout mice and results of the Karstensen et al. study3 further supports the role of CNGA2 in the pathogenesis of ICA in humans.

Results

Exome sequencing results

Whole-exome sequencing to a mean coverage of above 70X was performed (Supplemental Fig.Ā 1). Using Varseq software, we identified 88,042 variants shared in the all sequenced family members (10 individuals; TableĀ 1) that have a genotype quality score above 20. Since the disease model of ICA based on the pedigree was consistent with autosomal recessive or X-linked, we filtered for homozygous or hemizygous variants that are found in all the five affected individuals, but in none of the healthy controls. 16 variants passed this filtering step of which only CNGA2.577Cā€‰>ā€‰T has minor allele frequency (MAF) less than 0.01 or absent in the public database (dbSNP Common 144 (Database of Single Nucleotide Polymorphism, NCBI), 1000 Genome project phase 3 (www.1000genomes.org), Exome Aggregation Consortium version 0.3 (EXaC), Exome Variant Server; NHLBI GO Exome Sequencing Project (ESP), Seattle, WA (http://evs.gs.washington.edu/EVS/), and the Iranian genome project (http://irangenes.com/data-2/). This loss of function mutation occurs in exon 6 of CNGA2 gene, is rare (unreported in public databases) and highly conserved (Fig.Ā 1C) by ConSurf 18.

Table 1 ICA Family members ID and variants filtering steps.
Figure 1
figure 1

(A) Pedigree structure of ICA family. The proband is noted by an arrow. Family members for which whole exome sequencing has been performed are noted by a star above the individualā€™s symbol. The genotype of identified variant for family members for which DNA sample was available is mentioned below the individualā€™s symbol. (B) Sanger sequencing traces (CC/TGA) showing the c.950 Cā€‰>ā€‰T (p.Arg193*) mutation in exon 6 of the CNGA2 gene. The segregation of this mutation has been confirmed in 12 available DNA samples (Five affected and seven unaffected individuals) from this family. (C) The amino-acid sequence CNGA2 (pArg193*) colored according to the conservation scores. Conservation scores were calculated by ConSurf tool18. ConSurf estimates the evolutionary conservation of amino acid residues in a peptide based on the phylogenetic relations between homologous sequences as well as amino acidā€™s structural and functional importance.

Segregation analyses

We confirmed this variant by Sanger sequencing (Fig.Ā 1B) and showed that the variant co-segregates with the disease as all the five affected individuals have a hemizygous mutation (one mutation on the X chromosome in males), while unaffected family members are heterozygous or homozygous for the reference allele.

Discussion

Anosmia occurs mostly in the middle aged and elderly; affecting up to 5% of individuals over the age of 45 years old19, 20. Rarely, however, it can occur as a congenital condition (prevalence 1 in 10,000)21 (Isolated Congenital Anosmia) (Online Mendelian Inheritance in Man [OMIM] % 107200). Few candidate genes have been identified as potential causal genes for ICA in humans3, 6, 11,12,13, 22. For instance, Karstensen et al.3 reported a mutation in CNGA2 in two brothers with an X-linked pattern of ICA3. However, the lack of additional family members precluded definitive assignment of this gene as associated with this disease. Also, Alkelai et al.6 reported a family with congenital general anosmia segregating an X-linked missense mutation in the TENM1 gene6. In the present study, we employed whole-exome sequencing to identify the underlying genetic variants associated with ICA in a large Iranian family (five affected and five un-affected family members). The identified loss of function variant has not been reported in the public databases and falls within exon 6 of CNGA2 gene. We confirmed the segregation of the variant with Sanger sequencing and showed all five affected individuals are hemizygous for this variant, whereas all healthy individuals are either heterozygous or homozygous for the reference allele. Furthermore, one may speculate that the healthy female individuals, who carry this mutation, show to some extent hyposmia. However, the results of smell test on the three apparently healthy females, who carry CNGA2 mutation as well, show that they fall within normosmia category (score of 19 to 24 of smell test) (TableĀ 1). Moreover, due to the lack of brain MRI data, we cannot verify whether there is a defect in olfactory structures (i.e. olfactory bulb and olfactory sulci) in the affected individuals. There have been also contradicting reports on the ICA characterization in humans. Generally ICA is characterized by the absence or severe hypoplasia of olfactory bulbs23,24,25,26, however, brain MRI studies of ICA patients by Karstensen et al.3 and Ghadami et al. 2004 show that there is an apparently normal or slightly reduced olfactory structure in ICA. These studies suggest the existence of an ICA sub-phenotype characterized by an apparently normal olfactory structure3, 5.

Olfactory receptors are the largest group of orphan G protein-coupled receptors27. The CNG olfactory channels are made of alpha subunit and two modulatory subunits, encoded by CNGA2, CNGB1b and CNGA4, respectively28, 29. The alpha subunit of CNG channel is critical for olfactory sensory neurons to generate an odor-induced action potential. Knockout mice models lacking these essential CNG channel subunits strongly support the critical role of this canonical pathway in olfactory signaling15,16,17. Cnga2 knockout mice are congenitally anosmic and have severely impaired olfactory function17. Moreover, CNGA2 is highly expressed in the olfactory sensory neurons of zebrafish, mouse and humans30,31,32. Our study together with Karstensen et al.3 and Cnga2 knock-out mice model strongly support the role of CNGA2 gene in pathogenicity of familial cases of ICA. Together, these results indicate that mutations in key olfactory signaling pathway genes are responsible for human disease.

Material and Methods

Family description

All participants, or their legal guardian, provided written and informed consent. The institutional review boards of Department of Medical Genetics, Tarbiat Modares University, and Stanford University reviewed and approved the project. All methods in this study were performed in accordance with the approved guidelines and regulations. The institutional review board of Stanford University approved the experimental protocols. The family pedigree is shown in Fig.Ā 1A. All the affected individuals underwent clinical and physical examination at the Department of Medical Genetics, DeNA Laboratory, Tehran, Iran.

For assessment of olfactory function in affected individuals, we used Iran Smell Identification Test (Iran-SIT)33 which is a commercially Iranian adopted version of the University of Pennsylvania Smell Identification Test (UPSIT)34. In this test, the familiar smells in Iranian environment and culture is used to test the function of an individualā€™s olfactory function in a scale of 1 to 24. A score of 1ā€“9 indicates anosmic; 9ā€“13 severe microsmia; 13ā€“17 mild microsmia, and 19ā€“24 normosmia state (TableĀ 1). The smell test has been performed on all the available affected individuals. For the apparently healthy individuals, smell test has been done only on the three available individuals (II-10, III-3 and III-7 of TableĀ 1 and Fig.Ā 1A).

Exome sequencing and variants calling

Library preparation was performed using KAPA HyperPlus kit followed by exome capture using the IDT xGenĀ® Exome Research Panel v1.0. Briefly, 0.4ā€‰Ī¼g of gDNA was fragmented to a peak size of 150ā€“200ā€‰bp using the KAPA Frag enzyme. The fragmented genomic DNA were end-repaired, A-tailed, ligated to indexing-specific paired-end adaptors and PCR amplified according to manufacturerā€™s instructions. The PCR reactions were cleaned using the Agencourt AMPure XP beads. To capture exonic regions, 500 ng of pooled libraries (8 libraries per pool) was hybridized to biotinylated oligonucleotides for 4ā€‰hours at 65ā€‰Ā°C. The captured libraries were pulled down using Dynabeads MyOne Streptavidin M-270 (Invitrogen). A post-capture PCR was then performed to amplify the captured libraries and to add the barcode sequences for multiplex sequencing for 12 cycles. Afterwards, amplified libraries were purified with AmpPure XP Beads. Qubit fluorometer and Bioanalyzer high sensitivity chips were used to determine the final concentration of each captured library. The pooled libraries were paired-end sequenced on two lane of Illumina HiSeq4000 at the Stanford Center for Genomics and Personalized Medicine according to standard protocols.

Bioinformatics analyses

DNA libraries were processed and analyzed using the Roche Sequencing Solutions (Bina Technologies version 2.7.9) whole-exome analysis workflow with default settings. Briefly, libraries were mapped with BWA mem 0.7.5 software to human genome (hg19 version), and then realigned around indels with GATK IndelRealigner. Next, base recalibration was performed with GATK BaseRecalibrator taking into account the read group, quality scores, and cycle and context covariates. Variants were called with GATK HaplotypeCaller with the parametersā€“variant_index_ type LINEARā€“variant_index_parameter 128000. VQSR was used to recalibrate the variants, first with GATK VariantRecalibrator and then ApplyRecalibration. Variant filtering and annotation was done using Golden Helix VarSeq Version 1.1 software (http://goldenhelix.com/products/VarSeq/). After importing the variant call files (gVCF files) of each member of the family, the variants were organized by pedigree. Using the 1000 genomes variant frequencies (phase 1), the Exome Aggregation Consortium (ExAC) variant frequency database version 0.3 (Cambridge, MA), and the NHLBI Exome Sequencing Project (https://esp.gs.washington.edu/drupal/) V2 Exome Variant Frequencies, rare variants (minor allele frequency <1%) were filtered. Variants were then classified according to whether they were deemed to be coding.

Sanger sequencing

To validate the variant and to verify its segregation in the family, we used 5ā€² TCTACATTGCGGACCTCTTC3ā€² as forward primer and 5ā€² TCTAAGAGAACACCCCGAGA3ā€² as reverse primer for PCR amplification of the variant sequence. PCR amplification was performed using following reagents: REDTaq ReadyMix PCR Reaction Mix (Sigma-Aldrich, St. Louis) 25ā€‰ul, 1ā€‰ul forward primer (10ā€‰uM), 1ā€‰ul reverse primer (10ā€‰uM), 1ā€‰ul DNA (50ā€‰ng/ul), and 22ā€‰ul of water per PCR reaction. An initial denaturation step at 94Ā° for 3ā€‰min was followed by 35 cycles of 94Ā° for 30ā€‰sec, 57Ā° for 30ā€‰sec, 72Ā° for 30ā€‰sec, and the process completed by a final extension at 72Ā° for 7ā€‰min. The PCR amplification resulted in a single DNA band on a standard 1% Agarose Gel, and was purified by Agencourt AMPure XP beads (Beckman Coulter, Inc) before submitting for Sanger sequencing. The reverse primer 5ā€² TCTAAGAGAACACCCCGAGA 3ā€² was used as sequencing primer. Sanger sequencing was carried out by the Stanford PAN facility using ABI 3130xl Genetic Analyzer.