Exome sequencing reveals a high genetic heterogeneity on familial Hirschsprung disease

Luzón-Toro, Berta; Gui, Hongsheng; Ruiz-Ferrer, Macarena; Sze-Man Tang, Clara; Fernández, Raquel M.; Sham, Pak-Chung; Torroglosa, Ana; Kwong-Hang Tam, Paul; Espino-Paisán, Laura; Cherny, Stacey S.; Bleda, Marta; Enguix-Riego, María del Valle; Dopazo, Joaquín; Antiñolo, Guillermo; García-Barceló, María-Mercé; Borrego, Salud

doi:10.1038/srep16473

Download PDF

Article
Open access
Published: 12 November 2015

Exome sequencing reveals a high genetic heterogeneity on familial Hirschsprung disease

Berta Luzón-Toro^1,2^na1,
Hongsheng Gui^3,4^na1,
Macarena Ruiz-Ferrer^1,2^na1,
Clara Sze-Man Tang^4,5^na1,
Raquel M. Fernández^1,2^na1,
Pak-Chung Sham^3,4,6,7^na1,
Ana Torroglosa^1,2^na1,
Paul Kwong-Hang Tam^5,7^na1,
Laura Espino-Paisán¹^na1,
Stacey S. Cherny^3,4,6^na1,
Marta Bleda^2,8^na1^nAff10,
María del Valle Enguix-Riego^1,2^na1,
Joaquín Dopazo^2,8,9^na1,
Guillermo Antiñolo^1,2^na1,
María-Mercé García-Barceló^5,7^na1 &
…
Salud Borrego^1,2^na1

Scientific Reports volume 5, Article number: 16473 (2015) Cite this article

2999 Accesses
25 Citations
5 Altmetric
Metrics details

Subjects

Enteric neuropathies

Abstract

Hirschsprung disease (HSCR; OMIM 142623) is a developmental disorder characterized by aganglionosis along variable lengths of the distal gastrointestinal tract, which results in intestinal obstruction. Interactions among known HSCR genes and/or unknown disease susceptibility loci lead to variable severity of phenotype. Neither linkage nor genome-wide association studies have efficiently contributed to completely dissect the genetic pathways underlying this complex genetic disorder. We have performed whole exome sequencing of 16 HSCR patients from 8 unrelated families with SOLID platform. Variants shared by affected relatives were validated by Sanger sequencing. We searched for genes recurrently mutated across families. Only variations in the FAT3 gene were significantly enriched in five families. Within-family analysis identified compound heterozygotes for AHNAK and several genes (N = 23) with heterozygous variants that co-segregated with the phenotype. Network and pathway analyses facilitated the discovery of polygenic inheritance involving FAT3, HSCR known genes and their gene partners. Altogether, our approach has facilitated the detection of more than one damaging variant in biologically plausible genes that could jointly contribute to the phenotype. Our data may contribute to the understanding of the complex interactions that occur during enteric nervous system development and the etiopathology of familial HSCR.

Whole genome sequencing reveals epistasis effects within RET for Hirschsprung disease

Article Open access 28 November 2022

Novel findings from family-based exome sequencing for children with biliary atresia

Article Open access 08 November 2021

Large-scale sequencing identifies multiple genes and rare variants associated with Crohn’s disease susceptibility

Article 29 August 2022

Introduction

Hirschsprung disease (HSCR, OMIM 142623) is a developmental disorder occurring in approximately 1 of 5,000 live births¹. HSCR mostly presents sporadically, with only 5–20% of cases being familial and manifests with low, sex dependent penetrance and variable expression of phenotype. It is characterized by the absence of ganglion cells along variable lengths of the distal gastrointestinal tract, which results in tonic contraction of the aganglionic colon segment and functional intestinal obstruction. Such aganglionosis is associated with a delay in the entry of neural crest-derived cells into the foregut, as well as a delayed progression of enteric neural crest cells along the gut^2,3.

The genetic aetiology of the disease is likely to be heterogeneous and involving common and rare variants acting alone or in combination⁴. For only a small fraction of HSCR patients, the phenotype is caused by unique damaging mutations in coding sequences of genes encoding protein components of signalling pathways involved in the development of the enteric nervous system (ENS), with the RET proto-oncogene being the most important. RET coding rare mutations account for up to 50% of the familial cases and 10–20% of the sporadic cases. About 5% of the cases are due to mutations in genes other than RET, namely GDNF, NRTN, PSPN, EDNRB, EDN3, ECE1, NTF3, NTRK3, SOX10, PHOX2B, L1CAM, ZFHX1B, KIAA1279, TCF4, PROK1, PROKR1, PROKR2, GFRA1, NRG1, NRG3, Class 3 SEMAPHORINs (3A, 3C and 3D) and DNMT3B, which are known to play an important role in the development of the ENS^2,5,6,7. Furthermore, a specific RET haplotype was identified clearly associated with the sporadic forms of HSCR, characterised by the presence of a common RET variant (rs2435357; 10:g.43086608T > C) located in a gut-specific RET enhancer element in intron 1, which it has been demonstrated to disrupt binding of SOX10^8,9,10,11,12. In addition, the interaction between RET rs2435357 and NRG1 rs7835688 (8:g.32531041C > G) variants was detected in Chinese HSCR patients¹³. Recently, a study has established those RET variants, rs2435357 and rs2506030 (10:g.42952399A > G) and SEMA3A variant, rs11766001 (7:g.84515886A > C), as common susceptibility alleles in HSCR subjects¹⁴.

The complexity of the disorder is also evident in those families where the segregation of the HSCR phenotype does not follow any recognizable pattern of inheritance, shows intrafamilial variability or affected individuals do not carry mutations in any of the known HSCR genes. Such patterns might be explained by the joint effect of more than one variant either in coding or non-coding sequences¹⁵. The emergence of next-generation sequencing (NGS) could help us to decode the complex genetic background of HSCR. However, to unmask the underlying variants is indeed a challenging effort where making sense of NGS data requires cutting edge bioinformatic approaches. One possible solution is the combination of whole exome sequencing (WES) together with interaction network and pathway analyses, which allow prioritizing candidate genes with functional relationship¹⁶ and constitute an integrated and comprehensive approach in exploring a disease in the post-genomic era. Therefore, these analyses play an increasingly important role in the diagnosis of complex and polygenic disorders¹⁷.

Here we present the first exome sequencing study of familial HSCR from a Caucasian population, including 8 Spanish families with 2 HSCR patients per family. Family studies provide an opportunity to explore and interpret the as-yet-unidentified genetic variation underlying many complex genetic diseases, such as familial HSCR.

Results

Forty-eight individuals, including 8 families with 2 patients each (Table 1 and Fig. 1), were successfully whole exome sequenced using the SOLID 5500xl platform. The average read depth achieved for target regions was 34.9X. Although it only achieved on-average 76.9% of targeted regions with read depth >10 fold, this metric increased to 85.7% when considering regions covered by 4 or more sequence reads (Supplementary S1 Table). Quality metrics for clean variants in exonic regions were within the normal range (dbSNP137 coverage >95%, Ti/TV > 3; S1 Table). Pairwise identity by descent (IBD) verified the pedigree structure (Supplementary S1 Table).

Table 1 Characteristics of the patients and pre-screening results.

Full size table

These 8 families had previously been screened for mutations in known HSCR genes by Sanger sequencing. All five rare damaging mutations identified were confirmed by WES (Table 1). After variant filtering, an average of 160 (standard deviation = 31) rare damaging variants was detected in each HSCR patient (Table 2). Variants were prioritized according if they were shared within families and/or linked to any of the 19 HSCR genes through protein-protein interaction (PPI) network or pathway analyses. For each family, the number of variants present in both affected individuals would depend on their kinship relationship (Table 2 and Fig. 1). For example, within family 7 there were only two missense single nucleotide variants (SNVs) shared because the affected family members were actually fourth-degree relatives. In contrast, affected individuals of family 8 shared 90 variants (11 as loss of function mutations, LOF) since they were siblings. A total of 246 variants potentially involved in the disease were selected for validation by Sanger sequencing. This validation verified the 89.8% of our selected rare damaging variants.

Table 2 Summary of rare damaging variants in each family.

Full size table

In an attempt to uncover possible causal variants we checked for cosegregation of variants with the phenotype according to the pedigree structure. Complete segregation of variants with the phenotype was only observed for families 1, 5 and 8. A few heterozygous variants fully cosegregated with the phenotype (possibly explaining the affected individuals in each family) in families 1 and 5 (Tables 2 and Supplementary S2 Table). Interestingly, two LOF mutations (in DPYD and QTRTD1) fully cosegregated with HSCR in family 1, but none LOF found for family 5. We prioritized DPYD in family 1 and CNTN5 in family 5 (Fig. 1A) based on their described functional involvement in neuronal development. Affected individuals of family 8 were compound heterozygous for AHNAK, each different variant having been inherited from each unaffected parent (Fig. 1A). However, these genes did not carry any rare damaging variant in the remaining families.

To assess whether there was a susceptibility gene contributing to HSCR common to all these families, we performed linkage and rare variant association tests for genes that were mutated in at least two families. A total of nine genes were included (Table 3). This rare variant association test showed FAT3 as the only significantly gene associated with the HSCR phenotype (p < 0.05 after Bonferroni correction). Subsequent comparison of FAT3 enrichment between HSCR families and MGP controls also revealed significant difference (SKAT p-value <s3.65e-06), but not between MGP controls and 1304 public controls (VAAST p-value = 0.07).

Table 3 Genes recurrently mutated.

Full size table

A LOD score of 1.25 also suggested a moderate genetic linkage across families. Indeed, different FAT3 variants were found in 5 families (Table 4). The variants were not necessarily shared by the affected members within a family. Only affected members of families 2, 3 and 6 shared the same FAT3 variant (Genotypes of individuals with variants within FAT3 can be found in Supplementary S2 Table).

Table 4 Variants in FAT3 gene found in the study.

Full size table

Data from those families carrying variants in FAT3 (2, 3, 4, 6 and 7) were re-analyzed through PPI network and biological pathways to pinpoint additional genes that could account for the incomplete segregation of FAT3 or other ENS genes with HSCR (Fig. 2). Variants in the two related genes, PLAU and FBN1, co-existed in patients of family 2, likewise for variants in CREBBP and TSC2 in patients of family 6 (Fig. 1B). These co-existing variants had been inherited from each unaffected parent. Genes linked to the ENS were identified in family 3 (NTF3, IRAK3 and KDR) and in family 4 (GFRA1, ZHX2 and TPCN1) jointly cosegregating with the phenotype (Fig. 1C). Interestingly, although each patient of family 7 had two related genes mutated, these differed between patients (III1:FAT3/SEMA3D; IV1: FAT3/PTCH1) Moreover, the FAT3 variants were also at different sites. Thus, a clear pattern of polygenic inheritance was observed in family 7, for which FAT3 (two different variants), SEMA3D and PTCH1 genes seem to be necessary to explain the two affected patients (Fig. 1D). Detail of joint cosegregation pattern for each family was included in Supplementary S2 Table. No single pathway seemed to be disrupted across families.

Discussion

As disease-associated common variants fail to provide reasonable genetic mechanism for most common disorders, rarer or structural variants or low frequency variants with intermediate effect have been proposed to explain the missing heritability^18,19. Rare damaging variants usually have higher penetrance and confer larger effects on an individual’s risk of developing disease than common variants do. However, as each one of them is negatively selected and eventually disappear, such variants need to be generated (de novo) and accumulated to keep an overall consistent effect at population level²⁰. Study designs involving >=2 cases in one or more families greatly reduce the difficulty of finding disease-causing variants among the huge amount of genomic data generated by WES analyses²¹. This strategy has been proved efficient and effective for uncovering causal genes for a few Mendelian disorders. Regarding HSCR, Yang et al. firstly adopted exome sequencing on a Chinese family and reported CDS mutations in NRG3, a gene in which copy number variations had previously been associated with the disorder^22,23.

Previous studies have suggested how complex interactions among known HSCR genes or still unknown susceptibility loci lead to incomplete penetrance and variable severity of disease²⁴. Although no shared RET rare damaging mutations were found in any of these 8 families, we cannot rule out the contribution of this gene given the demonstrated contribution of the common RET intron 1 risk allele (rs2435357T) to HSCR¹⁰. Homozygous genotype (T/T) was present in 37.5% of our patients. Indeed, linkage analyses have revealed significant allele sharing in 10q11 region yet no coding sequence RET mutations have been identified²⁴. With this information, we speculate that the presence of the common RET risk allele may have increased the susceptibility to develop HSCR in some of our families¹⁰.

We have performed the first WES on Caucasian familial HSCR devoid of obvious mutations in the main HSCR genes. This has enabled us to systematically narrow down gene lists from all rare damaging coding sequence variants identified. We highlighted the presence of FAT3 gene, because it was the only one with overrepresented rare damaging variants (6 variants from 5 out of 8 families). All six variants detected in FAT3 belong to cadherin domains of the protein. Cadherins are a family of molecules that mediate Ca²⁺− dependent cell-cell adhesion and thus modulate a wide variety of processes such as cell polarisation and migration²⁵. They also play a role in the interactions between neurites derived from specific subsets of neurons during development^26,27. Among all the gene ontology classifications where FAT3 is implicated, we emphasized its role in the cell morphogenesis in neuron differentiation²⁸. In addition, FAT3 gene is expressed in embryonic stem cells²⁹. Mutations within this gene may lead to abnormal development of neural crest derived stem cells³⁰. All these features, together with its occurrence in most of our families, suggest the implication of this gene in the aetiology of HSCR.

A large amount of literature has been published to support the interaction of pairs of genes for HSCR^6,13,16 and this inspired us to interpret non-monogenic HSCR families by a combination of genes. We have specially focused on those genes with a role in neural development and on those that could be related with the disease through a specific process. The presence of genes previously associated to HSCR in our candidate gene list reinforces the validity of our method to detect potential genes related with the disease. The gene-pairs that came up in our analysis are known to interact because they belong to the same interacting network or pathway for enteric nervous system development. In family 7 we detected variations in SEMA3D and in another interesting gene, PTCH1, in which common variations have been already associated with HSCR in a pathway-based epitasis analysis³¹. Regarding family 4, GFRA1 was altered together with ZHX2 and TPCN1. The ZHX2 gene, a novel regulator of neural progenitor cell maintenance, is specifically expressed in neural progenitor cells during cortical neurogenesis³². In silico analysis indicates that TPCN1 gene is functionally related with PTCH1³³. Thus, we speculate that variations in this gene could be associated with the alterations occurring during embryonic development that lead to HSCR phenotype. In family 3, we found rare damaging variants in KDR and IRAK3 genes, in addition to variants in NTF3 and SEMA3D. The KDR gene encodes the main receptor of VEGF in blood vessels, which is a growth factor for endothelial cells implicated in processes such as cellular proliferation, differentiation, survival and migration³⁴. Recently, it has been associated with neuronal survival³⁵ and this function could potentially link this gene with the etiopathogenesis of HSCR. The IRAK3 gene has been associated with necrotizing enterocolitis, which has been shown to exist in specific subgroups of HSCR patients and it determines severe long-term sequel as it is the most invalidating and life-threatening complication of the disease³⁶. Another interesting gene was CREBBP, found in family 6, which is essential for differentiation of neurons and glial cells during the embryonic cortex³⁷ and tube neural development³⁸. We highlight the presence of the variant in this gene together with the variant in TSC2 gene, which has been related with the disruption of differentiation and maturation of neural precursors because its protein product regulates both cellular proliferation³⁹ and migration⁴⁰, crucial processes in HSCR. Thus, CREBBP and TSC2 genes seem to have a relevant role in family 6 and also they could play an interesting role in the disorder. Therefore, the contribution of different combinations of variants in known HSCR-associated genes and other ENS candidate genes, together with variants in FAT3 may lead to the expression of HSCR phenotype in each patient of these families. Nevertheless, as none of these gene combinations were recurrently found, nor they were tested in vivo, future large-scale studies or functional experiments are needed to verify their causality.

Regarding the families that did not carry rare damaging variants in FAT3, we highlight family 8 which presented two variants in AHNAK gene in heterozygosis that cosegregated with the disease. This gene was originally reported in neural crest-derived tumour cells⁴¹ and lately reported to be expressed in migrating neural crest cells⁴². Both AHNAK variants follow a recessive model of inheritance that may contribute to explain the phenotype of this family and both patients also carry a variant in ENDRB. We suggest that such ENDRB variant in this context could modulate the penetrance of variants in AHNAK or modify the expression of the disease in affected individuals. Finally, DYPD and CNTN5 genes fitting with dominant model of inheritance in families 1 and 5, respectively, were prioritized due to its role in neurodevelopment. DPYD gene has been linked to neural development through a microarray analysis where it is expressed in embryonic progenitor cells⁴³. CNTN5 has been implicated in cell surface interactions during nervous system development and neurite outgrowth which could also contribute to the etiology of disease⁴⁴.

There are some limitations in our study. False negative (FN) signal due to technology weakness is one concern for our exome sequencing. First of all, exome that includes only coding sequences is still in expansion; hence the concentration on earlier Consensus CDS is prone to FN for new functional sequences. Secondly, a small fraction of the exome (~5–10%) is poorly covered or altogether missed, largely owing to factors that are not specific to exome capture⁴⁵. Thirdly, non-coding regions are only partially covered and completely ignored in our downstream analysis. By the reduction of sequencing cost and development of advanced analytical tool, whole genome sequencing provides a better strategy to avoid above FN issues. The results provided in our study support a notable degree of inter- and intrafamilial genetic heterogeneity. Specifically, variant sharing within a family was a necessary criterion for gene prioritization; however, this may not always be the underlying model given possible heterogeneity within each family. Different genes need to be found out to explain the differences among the patients in the same family. Incomplete penetrance and interfamilial variation are usually detected for mutations in HSCR genes.

In summary, our results have led to the identification of several new genes that likely play a role in HSCR or ENS development, although the complexity of familial HSCR cases revealed by this study highlights the current difficulties in genetic counselling.

Methods

HSCR families and controls

HSCR families were selected from a larger clinical database consisting of 26 families, all derived from our Department of Genetics, Reproduction and Fetal Medicine in which their probands were previously screened by Sanger sequencing for 19 HSCR candidate genes (Supplementary S3 Table). Rare coding variants found in any of these genes were checked in all family members for cosegregation pattern. A total of 8 Spanish HSCR families (Table 1) comprising 16 affected (n = 16; 12 males and 4 females) and 32 unaffected individuals could not be fully explained by previously gene screening and they were included in this exome sequencing project. The study was carried out in accordance with the tenets of the Declaration of Helsinki and the Institutional Review Board of our institution. All experimental protocols were approved by Hospital Universitario Virgen del Rocío. Prior to their participation, written informed consent was obtained from all subjects.

In addition, data from 252 Spanish phenotyped healthy individuals that had been whole exome sequenced in the context of the Medical Genome Project, were included for comparisons⁴⁶.

We have also compared the difference of FAT3 burden between such public controls and 527 Spanish controls, using the same software under the same parameters (unpublished data).

DNA library preparation and sequencing

Peripheral blood was collected and genomic DNA was isolated from current available cases and family healthy individuals, using MagNA Pure LC system (Roche, Indianapolis, IN) according to the manufacturer’s instructions. DNA samples were stored at −80 °C until used. Exome sequencing by ABI Solid 5500xl (single-end, 75 bp length, Nimblegen 2.0 capture array) was performed on the 48 individual exomes, as well as on the control group. Briefly, library preparation and exome capture were performed according to a protocol based on the Baylor College of Medicine protocol version 2.1 with several modifications. Briefly, 5 μg of input genomic DNA was sheared, end repaired and ligated with specific adaptors. A fragment size distribution ranging from 160 bp to 180 bp after shearing and 200–250 bp after adaptor ligation was verified by Bioanalyzer (Agilent Technologies, Santa Clara, CA). The library was amplified by precapture linker-mediated polymerase chain reaction (LM-PCR) using Fast-Start High Fidelity PCR System (Roche, Indianapolis, IN). After purification, 2 μg of LM-PCR product was hybridized to V3 NimbleGen SeqCap EZ Exome libraries. After washing, amplification was performed by postcapture LM-PCR using FastStart High Fidelity PCR System (Roche). Capture enrichment was measured by qPCR according to NimbleGen protocol. The successfully captured DNA was measured by Quant-iT^TM PicoGreen_dsDNA reagent (Invitrogen, Carlsbad, CA) and subjected to standard sample preparation procedures for sequencing with SOLiD 5500xl platform as recommended by the manufacturer. Shortly, emulsion PCR was performed on E80 scale (about 1 billion template beads) using a concentration of 0.616 PM of enriched captured DNA. After breaking and enrichment, around 150 to 300 million templated beads were sequenced per lane on six-lane SOLiD 5500xl slides.

Variant calling and evaluation

Raw read sequences were aligned by Bfast⁴⁷. The aligned reads in BAM format were then pre-processed for calling with local Indel realignment, PCR duplicates removal and base quality recalibration⁴⁸. Single nucleotide variants (SNVs) and short insertions or deletions (Indels) were called by GATK2.0⁴⁸. Variant quality score recalibration (VQSR) and hard filtration were adopted on raw SNVs and Indels respectively. In detail, variants with low VQSR lod score (below zero), SNVs labelled as “QD < 2.0” or “MQ < 40.0” or “FS > 60.0” or “HaplotypeScore > 13.0” or “MQRankSum < −12.5” or “ReadPosRankSum < −8.0” and Indels labelled as “QD < 2.0” or “ReadPosRankSum < −20.0” or “InbreedingCoeff < −0.8” or “FS > 200.0” were excluded as low-quality variants. Individual genotypes were evaluated by genotyping quality; heterozygotes were kept only if they were supported by > 4 reads and the ratio for alternative allele was above 0.25. Comparatively, reference or alternative homozygote was accepted if it was supported by >4 reads and ratio for reference or alternative allele was above 0.95. After above quality control, clean variant sets from each individual were evaluated by GATK VarEval module for total number of variants, dbSNP137 coverage and Transition/Transversion (Ti/Tv) ratio. Pairwise kinship estimation was used to evaluate cross-individual relationship; this was done by PLINK using high-quality common (minor allele frequency (MAF) > 1%) and independent (linkage disequilibrium R-square < 0.2) SNPs⁴⁹.

Rare damaging variant filtering

All variants falling into exonic regions (hg19 Refgene) were included and were then filtered against public databases (dbSNP137, 1000 Human Genome Project, NHLBI Exome Sequencing Project) and 252 healthy Spanish individuals using Annovar and Galaxy⁵⁰. Only those variants with (MAF) <0.01 in each database were retained and treated as rare variants. KGGSeq was used to further exclude synonymous or non-deleterious missense variants to keep only rare damaging variants⁵¹. In detail, a logic model that integrates prediction scores of different programs (Polyphen2, Sift, MutationTaster, PhyloP and Likelihood ratio) was used to differentiate damaging and non-damaging variants⁵². Finally, the same variants belonging to >=2 families (out of 8 in total) were excluded as they are more likely technical artefacts given the attribute of rare damaging variants we defined.

Analysis of rare damaging variants

Figure 2 gave the overall flowchart for pinpointing candidate genes that could be related to HSCR phenotype. Assuming that a major susceptibility gene is underlying multiple families, we searched for gene recurrence with different rare damaging variants, each of which was shared by two affected relatives in the same family. To provide statistical implication, the distribution of rare damaging variants in those recurrent genes were compared between HSCR families and 1304 background genomes (1057 1000 genome project genomes, 54 Complete Genomics genomes, 184 genomes from Danish exomes and 9 genomes from 10 Gen data) using pVAAST, a disease-gene identification tool incorporating both genetic linkage and rare variant association information⁵³; in addition, gene-level burden were also compared between HSCR families and 527 MGP Spanish healthy controls using raremetal^54,55. In parallel, variants shared within each family were prioritized by pedigree cosegregation. Heterozygotes with full penetrance, de novo mutations, homozygous or compound heterozygotes were considered based on a possible mode of inheritance (autosomal dominant, autosomal recessive and X-linked) for each pedigree. Biological knowledge (canonical pathways, coexpression networks and protein-protein interactions) were used to fish out remaining variants located in genes linking up with recurrent genes and genes previously implicated HSCR candidate genes; these were done by KGGSeq⁵¹, DAPPLE⁵⁶, GeneMania⁵⁷ and NetGestalt⁵⁸. Joint effects for the genes from the same pathway/network were examined by tracing their combined cosegregation in those families not explainable by recurrent genes.

Finally, we checked the expression in ENS of our new candidate genes using Expression Atlas⁵⁹ and The Human Protein Atlas⁶⁰ databases.

Validation by Sanger sequencing

Two batches of rare damaging variants were sent for wet-lab Sanger validation: the first batch covering most variants shared within each family, the second batch for interesting genes/variants linking to recurrent genes or ENS candidate genes. Each potential disease-causing variant was confirmed by Sanger sequencing and cosegregation analyses were performed in the rest of the family members with available DNA samples. Primers for validation were designed using Primer3 (http://bioinfo.ut.ee/primer30.4.0/). PCR products were purified with Illustra^TM ExoProStar^TM 1-Step (GE Healthcare Life Science). Sequences were prepared with BigDye Terminator v3.1 Cycle Sequencing (Life Technologies) and finally sequenced in an automated ABI 3730 (Life Technologies).

Gene accession numbers

RET (NM_020975), PIGV (NM_001202554), DPYD (NM_000110), QTRTD1 (NM_024638), FAT3 (NM_001008781), TSC2 (NM_000548), THBS4 (NM_003248), PLAU (NM_001145031), FBN1 (NM_000138), SEMA3D (NM_152754), NTF3 (NM_001102654), IRAK3 (NM_007199), KDR (NM_002253), GFRA1 (NM_001145453), ZHX2 (NM_014943), TPCN1 (NM_001143819), AADACL4 (NM_001013630), RPE65 (NM_000329), HRNR (NM_001009931), DISP1 (NM_032890), LYPD6 (NM_194317), TTN (NM_00126755), POLQ (NM_199420), RHOBTB3 (NM_014899), COL10A1 (NM_000493), RNF148 (NM_198085), SHOC2 (NM_007373), AALAD2 (NM_005467), CNTN5 (NM_014361), B4GALNT3 (NM_173593), PLEKHO2 (NM_025201), PKD1L2 (NM_052892), FKBP10 (NM_021939), MAN2B1 (NM_000528), PLVAP (NM_031310), TMPRSS15 (NM_002772), COL6A6 (NM_001102608), TNXB (NM_019105), CREBBP (NM_004380), TSC2 (NM_000548), PTCH1 (NM_000264), EDNRB (NM_001201397), AHNAK (NM_001620).

Data submission

Data are submitted to ClinVar database. Available at URL: http://www.ncbi.nlm.nih.gov/clinvar/?LinkName=orgtrack_clinvar&from_uid=505435.

Additional Information

How to cite this article: Luzón-Toro, B. et al. Exome sequencing reveals a high genetic heterogeneity on familial Hirschsprung disease. Sci. Rep. 5, 16473; doi: 10.1038/srep16473 (2015).

References

Badner, J. A., Sieber, W. K., Garver, K. L. & Chakravarti, A. A genetic study of Hirschsprung disease. Am J Hum Genet 46, 568–580 (1990).
CAS PubMed PubMed Central Google Scholar
Amiel, J. et al. Hirschsprung disease, associated syndromes and genetics: a review. J Med Genet 45, 1–14, 10.1136/jmg.2007.053959 (2008).
Article CAS PubMed Google Scholar
Chakravarti, A, L. S. In The Metabolic and Molecular Bases of Inherited Disease, 8th ed (ed Beaudet, A. R., Scriver, C. R., Sly, W. & Valle, D. ) Ch. 251, (McGraw-Hill, 2001).
Alves, M. M. et al. Contribution of rare and common variants determine complex diseases-Hirschsprung disease as a model. Dev Biol 382, 320–329 (2013).
Article CAS Google Scholar
Borrego, S., Ruiz-Ferrer, M., Fernandez, R. M. & Antinolo, G. Hirschsprung’s disease as a model of complex genetic etiology. Histol Histopathol 28, 1117–1136 (2013).
CAS PubMed Google Scholar
Jiang, Q. et al. Functional Loss of Semaphorin 3C and/or Semaphorin 3D and Their Epistatic Interaction with Ret Are Critical to Hirschsprung Disease Liability. Am J Hum Genet 96, 581–596 (2015).
Article CAS Google Scholar
Torroglosa, A. et al. Involvement of DNMT3B in the pathogenesis of Hirschsprung disease and its possible role as a regulator of neurogenesis in the human enteric nervous system. Genet Med 16, 703–710 (2014).
Article CAS Google Scholar
Borrego, S. et al. Specific polymorphisms in the RET proto-oncogene are over-represented in patients with Hirschsprung disease and may represent loci modifying phenotypic expression. J Med Genet 36, 771–774 (1999).
Article CAS Google Scholar
Borrego, S. et al. A founding locus within the RET proto-oncogene may account for a large proportion of apparently sporadic Hirschsprung disease and a subset of cases of sporadic medullary thyroid carcinoma. Am J Hum Genet 72, 88–100 (2003).
Article CAS Google Scholar
Emison, E. S. et al. A common sex-dependent mutation in a RET enhancer underlies Hirschsprung disease risk. Nature 434, 857–863 (2005).
Article CAS ADS Google Scholar
Emison, E. S. et al. Differential contributions of rare and common, coding and noncoding Ret mutations to multifactorial Hirschsprung disease liability. Am J Hum Genet 87, 60–74 (2010).
Article CAS Google Scholar
Miao, X. et al. Reduced RET expression in gut tissue of individuals carrying risk alleles of Hirschsprung’s disease. Hum Mol Genet 19, 1461–1467 (2010).
Article CAS Google Scholar
Gui, H. et al. RET and NRG1 interplay in Hirschsprung disease. Hum Genet 132, 591–600 (2013).
Article CAS Google Scholar
Kapoor, A. et al. Population variation in total genetic risk of Hirschsprung disease from common RET, SEMA3 and NRG1 susceptibility polymorphisms. Human Mol Genet 24, 2997–3003 (2015).
Article CAS Google Scholar
Sanchez-Mejias, A., Fernandez, R. M., Lopez-Alonso, M., Antinolo, G. & Borrego, S. Contribution of RET, NTRK3 and EDN3 to the expression of Hirschsprung disease in a multiplex family. J Med Genet 46, 862–864 (2009).
Article CAS Google Scholar
Fernandez, R. M. et al. Pathways systematically associated to Hirschsprung’s disease. Orphanet J Rare Dis 8, 187 (2013).
Article Google Scholar
Minguez, P., Gotz, S., Montaner, D., Al-Shahrour, F. & Dopazo, J. SNOW, a web-based tool for the statistical analysis of protein-protein interaction networks. Nucleic Acids Res 37, W109–114 (2009).
Article CAS Google Scholar
Hindorff, L. A. et al. Potential etiologic and functional implications of genome-wide association loci for human diseases and traits. Proc Natl Acad Sci USA 106, 9362–9367 (2009).
Article CAS ADS Google Scholar
Manolio, T. A. et al. Finding the missing heritability of complex diseases. Nature 461, 747–753 (2009).
Article CAS ADS Google Scholar
Pritchard, J. K. & Cox, N. J. The allelic architecture of human disease genes: common disease-common variant … or not? Hum Mol Genet 11, 2417–2423 (2002).
Article CAS Google Scholar
Kiezun, A. et al. Exome sequencing and the genetic basis of complex traits. Nat Genet 44, 623–630 (2012).
Article CAS Google Scholar
Yang, J. et al. Exome sequencing identified NRG3 as a novel susceptible gene of Hirschsprung’s disease in a Chinese population. Mol Neurobiol 47, 957–966 (2013).
Article CAS Google Scholar
Tang, C. S. et al. Genome-wide copy number analysis uncovers a new HSCR gene: NRG3. PLoS Genet 8, e1002687 (2012).
Article CAS Google Scholar
Bolk, S. et al. A human model for multigenic inheritance: phenotypic expression in Hirschsprung disease requires both the RET gene and a new 9q31 locus. Proc Natl Acad Sci USA 97, 268–273 (2000).
Article CAS ADS Google Scholar
Deans, M. R. et al. Control of neuronal morphology by the atypical cadherin Fat3. Neuron 71, 820–832 (2011).
Article CAS Google Scholar
Nagae, S., Tanoue, T. & Takeichi, M. Temporal and spatial expression profiles of the Fat3 protein, a giant cadherin molecule, during mouse development. Dev Dyn 236, 534–543 (2007).
Article CAS Google Scholar
Redies, C., Treubert-Zimmermann, U. & Luo, J. Cadherins as regulators for the emergence of neural nets from embryonic divisions. J Physiol Paris 97, 5–15 (2003).
Article CAS Google Scholar
Eppig, J. T. et al. The Mouse Genome Database (MGD): facilitating mouse as a model for human biology and disease. Nucleic Acids Res 43, D726–736 (2015).
Article CAS Google Scholar
Katoh, Y. & Katoh, M. Comparative integromics on FAT1, FAT2, FAT3 and FAT4. Int J Mol Med 18, 523–528 (2006).
CAS PubMed Google Scholar
Mosher, J. T. et al. Intrinsic differences among spatially distinct neural crest stem cells in terms of migratory properties, fate determination and ability to colonize the enteric nervous system. Dev Biol 303, 1–15 (2007).
Article CAS Google Scholar
Wang, Y. et al. Common genetic variations in Patched1 (PTCH1) gene and risk of hirschsprung disease in the Han Chinese population. PloS one 8, e75407 (2013).
Article CAS ADS Google Scholar
Wu, C. et al. ZHX2 Interacts with Ephrin-B and regulates neural progenitor maintenance in the developing cerebral cortex. J Neurosci 29, 7404–7412 (2009).
Article CAS Google Scholar
Rebhan, M., Chalifa-Caspi, V., Prilusky, J. & Lancet, D. GeneCards: a novel functional genomics compendium with automated data mining and query reformulation support. Bioinformatics 14, 656–664 (1998).
Article CAS Google Scholar
Lazarovici, P., Marcinkiewicz, C. & Lelkes, P. I. Cross talk between the cardiovascular and nervous systems: neurotrophic effects of vascular endothelial growth factor (VEGF) and angiogenic effects of nerve growth factor (NGF)-implications in drug development. Curr Pharm Des 12, 2609–2622 (2006).
Article CAS Google Scholar
Tillo, M. et al. VEGF189 binds NRP1 and is sufficient for VEGF/NRP1-dependent neuronal patterning in the developing brain. Development 142, 314–319 (2015).
Article CAS Google Scholar
De Filippo, C. et al. Genomics approach to the analysis of bacterial communities dynamics in Hirschsprung’s disease-associated enterocolitis: a pilot study. Pediatr Surg Int 26, 465–471 (2010).
Article CAS Google Scholar
Wang, J. et al. CBP histone acetyltransferase activity regulates embryonic neural differentiation in the normal and Rubinstein-Taybi syndrome brain. Dev Cell 18, 114–125 (2010).
Article CAS Google Scholar
Bhattacherjee, V. et al. CBP/p300 and associated transcriptional co-activators exhibit distinct expression patterns during murine craniofacial and neural tube development. Int J Dev Biol 53, 1097–1104 (2009).
Article CAS Google Scholar
Crino, P. B., Trojanowski, J. Q., Dichter, M. A. & Eberwine, J. Embryonic neuronal markers in tuberous sclerosis: single-cell molecular pathology. Proc Natl Acad Sci USA 93 (1996).
Goncharova, E. A., James, M. L., Kudryashova, T. V., Goncharov, D. A. & Krymskaya, V. P. Tumor suppressors TSC1 and TSC2 differentially modulate actin cytoskeleton and motility of mouse embryonic fibroblasts. PloS One 9, e111476 (2014).
Article ADS Google Scholar
Shtivelman, E., Cohen, F. E. & Bishop, J. M. A human gene (AHNAK) encoding an unusually large protein with a 1.2-microns polyionic rod structure. Proc Natl Acad Sci USA 89, 5472–5476 (1992).
Article CAS ADS Google Scholar
Downs, K. M., McHugh, J., Copp, A. J. & Shtivelman, E. Multiple developmental roles of Ahnak are suggested by localization to sites of placentation and neural plate fusion in the mouse conceptus. Mech Dev 119 Suppl 1, S31–38 (2002).
Article Google Scholar
Edgar, R. et al. LifeMap Discovery: the embryonic development, stem cells and regenerative medicine research portal. PloS One 8, e66629 (2013).
Article CAS ADS Google Scholar
Zuko, A. et al. Contactins in the neurobiology of autism. Eur J Pharmacol 719, 63–74 (2013).
Article CAS Google Scholar
Bamshad, M. J. et al. Exome sequencing as a tool for Mendelian disease gene discovery. Nat Rev Genet 12, 745–755 (2011).
Article CAS Google Scholar
Garcia-Alonso, L. et al. The role of the interactome in the maintenance of deleterious variability in human populations. Mol Syst Biol 10, 752 (2014).
Article Google Scholar
Homer, N., Merriman, B. & Nelson, S. F. BFAST: an alignment tool for large scale genome resequencing. PloS One 4, e7767 (2009).
Article ADS Google Scholar
DePristo, M. A. et al. A framework for variation discovery and genotyping using next-generation DNA sequencing data. Nat Genet 43, 491–498 (2011).
Article CAS Google Scholar
Kang, H. M. et al. Variance component model to account for sample structure in genome-wide association studies. Nat Genet 42, 348–354 (2010).
Article CAS Google Scholar
Goecks, J., Nekrutenko, A., Taylor, J. & Galaxy, T. Galaxy: a comprehensive approach for supporting accessible, reproducible and transparent computational research in the life sciences. Genome Biol 11, R86 (2010).
Article Google Scholar
Li, M. X., Gui, H. S., Kwan, J. S., Bao, S. Y. & Sham, P. C. A comprehensive framework for prioritizing variants in exome sequencing studies of Mendelian diseases. Nucleic Acids Res 40, e53 (2012).
Article CAS Google Scholar
Li, M. X. et al. Predicting mendelian disease-causing non-synonymous single nucleotide variants in exome sequencing studies. PLoS Genet 9, e1003143 (2013).
Article CAS Google Scholar
Hu, H. et al. A unified test of linkage analysis and rare-variant association for analysis of pedigree sequence data. Nat Biotechnol 32, 663–669 (2014).
Article CAS Google Scholar
Liu, D. J. et al. Meta-analysis of gene-level tests for rare variant association. Nat Genet 46, 200–204 (2014).
Article CAS Google Scholar
Wu, M. C. et al. Rare-variant association testing for sequencing data with the sequence kernel association test. Am J Hum Genet 89, 82–93 (2011).
Article CAS Google Scholar
Rossin, E. J. et al. Proteins encoded in genomic regions associated with immune-mediated disease physically interact and suggest underlying biology. PLoS Genet 7, e1001273 (2011).
Article CAS Google Scholar
Warde-Farley, D. et al. The GeneMANIA prediction server: biological network integration for gene prioritization and predicting gene function. Nucleic Acids Res 38, W214–220 (2010).
Article CAS Google Scholar
Shi, Z., Wang, J. & Zhang, B. NetGestalt: integrating multidimensional omics data over biological networks. Nat Methods 10, 597–598 (2013).
Article CAS Google Scholar
Petryszak, R. et al. Expression Atlas update–a database of gene and transcript expression from microarray- and sequencing-based functional genomics experiments. Nucleic Acids Res 42, D926–932 (2014).
Article CAS Google Scholar
Ponten, F., Schwenk, J. M., Asplund, A. & Edqvist, P. H. The Human Protein Atlas as a proteomic resource for biomarker discovery. J Intern Med 270, 428–446 (2011).
Article CAS Google Scholar

Download references

Acknowledgements

We would like to thank the patients and families that have participated in this study.

Author information

Marta Bleda
Present address: Department of Medicine, University of Cambridge, School of Clinical Medicine, Addenbrooke’s Hospital, Cambridge, United Kingdom
Luzón-Toro Berta and Gui Hongsheng contributed equally to this work.

Authors and Affiliations

Department of Genetics, Reproduction and Fetal Medicine, Institute of Biomedicine of Seville (IBIS), University Hospital Virgen del Rocío/CSIC/University of Seville, Seville, Spain
Berta Luzón-Toro, Macarena Ruiz-Ferrer, Raquel M. Fernández, Ana Torroglosa, Laura Espino-Paisán, María del Valle Enguix-Riego, Guillermo Antiñolo & Salud Borrego
Centre for Biomedical Network Research on Rare Diseases (CIBERER), Spain
Berta Luzón-Toro, Macarena Ruiz-Ferrer, Raquel M. Fernández, Ana Torroglosa, Marta Bleda, María del Valle Enguix-Riego, Joaquín Dopazo, Guillermo Antiñolo & Salud Borrego
Centre for Genomic Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
Hongsheng Gui, Pak-Chung Sham & Stacey S. Cherny
Department of Psychiatry, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
Hongsheng Gui, Clara Sze-Man Tang, Pak-Chung Sham & Stacey S. Cherny
Department of Surgery, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
Clara Sze-Man Tang, Paul Kwong-Hang Tam & María-Mercé García-Barceló
State Key Laboratory of Brain and Cognitive Sciences, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
Pak-Chung Sham & Stacey S. Cherny
Centre for Reproduction, Development and Growth, Li Ka Shing Faculty of Medicine, The University of Hong Kong, Hong Kong, China
Pak-Chung Sham, Paul Kwong-Hang Tam & María-Mercé García-Barceló
Computational Genomics Department, Centro de Investigación Príncipe Felipe (CIPF), Valencia, Spain
Marta Bleda & Joaquín Dopazo
Functional Genomics Node, (INB) at CIPF, Valencia, Spain
Joaquín Dopazo

Authors

Berta Luzón-Toro
View author publications
You can also search for this author in PubMed Google Scholar
Hongsheng Gui
View author publications
You can also search for this author in PubMed Google Scholar
Macarena Ruiz-Ferrer
View author publications
You can also search for this author in PubMed Google Scholar
Clara Sze-Man Tang
View author publications
You can also search for this author in PubMed Google Scholar
Raquel M. Fernández
View author publications
You can also search for this author in PubMed Google Scholar
Pak-Chung Sham
View author publications
You can also search for this author in PubMed Google Scholar
Ana Torroglosa
View author publications
You can also search for this author in PubMed Google Scholar
Paul Kwong-Hang Tam
View author publications
You can also search for this author in PubMed Google Scholar
Laura Espino-Paisán
View author publications
You can also search for this author in PubMed Google Scholar
Stacey S. Cherny
View author publications
You can also search for this author in PubMed Google Scholar
Marta Bleda
View author publications
You can also search for this author in PubMed Google Scholar
María del Valle Enguix-Riego
View author publications
You can also search for this author in PubMed Google Scholar
Joaquín Dopazo
View author publications
You can also search for this author in PubMed Google Scholar
Guillermo Antiñolo
View author publications
You can also search for this author in PubMed Google Scholar
María-Mercé García-Barceló
View author publications
You can also search for this author in PubMed Google Scholar
Salud Borrego
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Drafted the paper: B.L.-T., H.G., M.R.-F., M.-M.G.-B. and S.B.; collaborated with valuable contributions to the paper: R.M.F., G.A. and P.T.; performed the genetic studies: B.L.-T., M.R.-F., A.T., L.E.-P. and M.V.E.-R.; analyzed the NGS data: H.G. and C.S.-M.T.; provided controls WES data and analyses: M.B., J.D. and G.A.; guided data analysis: S.S.C. and P.-C.S.; guided result interpretation and manuscript writing: M.-M.G.-B. and S.B.; conceived and designed the experiments: M.R.-F., R.M.F., G.A. and S.B.; recruited and ascertained the families: S.B.

Ethics declarations

Competing interests

The authors declare no competing financial interests.

Electronic supplementary material

Supplementary Tables

Rights and permissions

This work is licensed under a Creative Commons Attribution 4.0 International License. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in the credit line; if the material is not included under the Creative Commons license, users will need to obtain permission from the license holder to reproduce the material. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/

Reprints and permissions

About this article

Cite this article

Luzón-Toro, B., Gui, H., Ruiz-Ferrer, M. et al. Exome sequencing reveals a high genetic heterogeneity on familial Hirschsprung disease. Sci Rep 5, 16473 (2015). https://doi.org/10.1038/srep16473

Download citation

Received: 24 July 2015
Accepted: 14 October 2015
Published: 12 November 2015
DOI: https://doi.org/10.1038/srep16473

This article is cited by

Comprehensive characterization of the genetic landscape of familial Hirschsprung’s disease
- Jun Xiao
- Lu-Wen Hao
- Jie-Xiong Feng
World Journal of Pediatrics (2023)
Whole genome sequencing reveals epistasis effects within RET for Hirschsprung disease
- Yanbing Wang
- Timothy Shin Heng Mak
- Pak Chung Sham
Scientific Reports (2022)
Identification of FAT3 as a new candidate gene for adolescent idiopathic scoliosis
- Dina Nada
- Cédric Julien
- Alain Moreau
Scientific Reports (2022)
Circ-ITCH overexpression promoted cell proliferation and migration in Hirschsprung disease through miR-146b-5p/RET axis
- Ren-Peng Xia
- Fan Zhao
- Chong-Gao Zhou
Pediatric Research (2022)
Disorders of the enteric nervous system — a holistic view
- Beate Niesler
- Stefanie Kuerten
- Karl-Herbert Schäfer
Nature Reviews Gastroenterology & Hepatology (2021)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Introduction

Results

Discussion

Methods

HSCR families and controls

DNA library preparation and sequencing

Variant calling and evaluation

Rare damaging variant filtering

Analysis of rare damaging variants

Validation by Sanger sequencing

Gene accession numbers

Data submission

Additional Information

References

Acknowledgements

Author information

Authors and Affiliations

Contributions

Ethics declarations

Competing interests

Electronic supplementary material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Comments

Search

Quick links