Genetic information from discordant sibling pairs points to ESRP2 as a candidate trans-acting regulator of the CF modifier gene SCNN1B

Becker, Tim; Pich, Andreas; Tamm, Stephanie; Hedtfeld, Silke; Ibrahim, Mohammed; Altmüller, Janine; Dalibor, Nina; Toliat, Mohammad Reza; Janciauskiene, Sabina; Tümmler, Burkhard; Stanke, Frauke

doi:10.1038/s41598-020-79804-y


Article
Open access
Published: 31 December 2020


Tim Becker^1,2,
Andreas Pich³,
Stephanie Tamm⁴,
Silke Hedtfeld⁴,
Mohammed Ibrahim³,
Janine Altmüller⁵,
Nina Dalibor⁵,
Mohammad Reza Toliat⁵ (ORCID: orcid.org/0000-0002-9248-3200),
Sabina Janciauskiene^6,7,
Burkhard Tümmler^4,6 &
…
Frauke Stanke^4,6

Scientific Reports volume 10, Article number: 22447 (2020)


Abstract

SCNN1B encodes the beta subunit of the epithelial sodium channel ENaC. Previously, we reported an association between SNP markers of the SCNN1B gene and disease severity in cystic fibrosis-affected sibling pairs. We hypothesized that factors interacting with the SCNN1B genomic sequence are responsible for intrapair discordance. Concordant and discordant pairs differed at six SCNN1B markers (Praw = 0.0075, Pcorr = 0.0397 corrected for multiple testing). To identify the factors binding to these six SCNN1B SNPs, we performed an electrophoretic mobility shift assay and captured the DNA–protein complexes. Based on protein mass spectrometry data, the epithelial splicing regulatory protein ESRP2 was identified when using SCNN1B-derived probes, and the ESRP2-SCNN1B interaction was independently confirmed by coimmunoprecipitation assays. We observed an alternative SCNN1B transcript and demonstrated in 16HBE14o− cells that levels of this transcript are decreased upon ESRP2 silencing by siRNA. Furthermore, we confirmed that mildly and severely affected siblings have different ESRP2 genetic backgrounds and that ESRP2 markers are linked to the response of CF patients’ nasal epithelium to amiloride, indicating ENaC involvement (Pbest = 0.0131, Pcorr = 0.068 corrected for multiple testing). Our findings demonstrate that sibling pairs clinically discordant for CF can be used to identify meaningful DNA regulatory elements and interacting factors.


Introduction

Genetic variation in humans contributes significantly to phenotypic variation. The question of which single nucleotide polymorphism (SNP) determines disease outcome and/or severity has been addressed in more than 2000 genome-wide association studies (GWAS)¹. However, the results have revealed that more than 90% of the polymorphisms identified by GWAS do not directly alter the gene’s coding sequence. This has led to the conclusion that clinically relevant variation of the human genome mediates gene regulation, i.e., the transcript expression level or the composition of transcript isoforms². Recent genome and epigenome studies have substantiated this hypothesis³. Enrichment of elements known for covalent modifications of DNA bases or their associated nucleosomes⁴ among disease- and trait-associated genetic variants determined by GWAS has also been noted.

The gene affected in cystic fibrosis (CF), cystic fibrosis transmembrane conductance regulator (CFTR), encodes a chloride- and bicarbonate channel of epithelia^5,6,7 that localizes with the epithelial sodium channel ENaC at the apical membrane of many⁸, albeit not all⁹, epithelial cells. Both CFTR and ENaC act synergistically to regulate salt and fluid transport across the epithelium^8,10, and the SCNN1B gene, encoding the beta subunit of ENaC, is a highly plausible modifier gene of CF.

Previously, we focused on the three genes encoding the subunits of ENaC as candidate genes in the European CF twin and sibling study¹¹. In an association study on extreme phenotypes, anthropometry and lung function data were used to select patients whose clinical data fell below the 25th centile (severely affected) or above the 75th centile (mildly affected) for both clinical parameters¹². Three groups of affected patient pairs were defined as follows by a ranking algorithm used to describe the severity of CF: concordant mildly affected sibling pairs, concordant severely affected sibling pairs and discordant sibling pairs. Discordant sibling pairs were composed of one mildly and one severely affected sibling¹².
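The grouping described above can be sketched in a few lines of Python. This is only an illustration under simplifying assumptions: the function names, the strict inequalities at the 25th and 75th centiles, and the example centile values are hypothetical, and the study's actual ranking algorithm¹² is not reproduced here.

```python
from typing import Optional, Tuple

LOW_CENTILE = 25.0   # below this for both parameters: severely affected
HIGH_CENTILE = 75.0  # above this for both parameters: mildly affected


def classify_patient(anthropometry_centile: float, lung_centile: float) -> Optional[str]:
    """Return 'severe', 'mild', or None when the patient is not an extreme phenotype."""
    if anthropometry_centile < LOW_CENTILE and lung_centile < LOW_CENTILE:
        return "severe"
    if anthropometry_centile > HIGH_CENTILE and lung_centile > HIGH_CENTILE:
        return "mild"
    return None


def classify_pair(sib_a: Tuple[float, float], sib_b: Tuple[float, float]) -> Optional[str]:
    """Group a sibling pair; each sibling is (anthropometry centile, lung function centile)."""
    a = classify_patient(*sib_a)
    b = classify_patient(*sib_b)
    if a is None or b is None:
        return None  # pair not used in the extreme-phenotype comparison
    if a == b:
        return f"concordant {a}"
    return "discordant"


# Hypothetical centile values, for illustration only.
print(classify_pair((10.0, 12.0), (90.0, 80.0)))
print(classify_pair((10.0, 12.0), (5.0, 20.0)))
```

A pair is only grouped when both siblings are extreme for both clinical parameters, which mirrors the "for both clinical parameters" condition in the text.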

When discordant sibling pairs were compared to concordant sibling pairs, one SCNN1B haplotype defined by SNPs rs238547–rs152730–rs250563 occurred more frequently among discordant than among concordant siblings¹¹. We concluded that this signal cannot be fully explained by a variant observed within SCNN1B because discordant siblings have a dissimilar phenotype by definition¹², and yet these siblings share an SCNN1B intragenic haplotype¹¹. Our working hypothesis relied on the idea that the association signal in SCNN1B delineates a functional regulatory element, whereby a DNA-binding protein encoded in trans interacts with this regulatory element, stably binding to the haplotype of the regulatory element that is predominant among discordant siblings. Hence, the genetic variation of the interaction partner can determine the phenotype causing intrapair discordance in affected sibling pairs.

In association studies involving affected patient pairs, interaction between a regulatory element and a DNA-binding protein may result in a paradoxical situation. The regulatory element is recognized through an INTERpair comparison by an association with the phenotype “discordance of sibs”. However, genetic information at the regulatory element is shared by both siblings within a pair, which provides an opportunity to identify the DNA-binding protein encoded in trans by an INTRApair comparison. If the phenotype is caused by interaction of the DNA-binding protein with the regulatory element, mildly and severely affected siblings of discordant pairs must carry different genetic information at the locus encoding the DNA-binding protein.

Results

Six SNPs within SCNN1B differ between concordant and discordant CF patient pairs

We previously reported that intrapair discordance for CF disease severity is associated with three intragenic markers spanning SCNN1B from codon 3 to codon 293. To describe the genomic fragment for which concordant and discordant pairs carry different genetic information, we analyzed 7 previously typed markers¹¹ and 49 SNPs genotyped for fine-mapping in the 16p12 region, encompassing the entire SCNN1G/SCNN1B-locus (Fig. 1). Next, we employed a haplotype-based fine-mapping strategy previously used to identify causative variants within this cohort¹³. To determine whether concordant and discordant pairs carried the same or different genetic information, we employed the software package FAMHAP to construct two-marker haplotypes composed of two informative markers. By using this approach, we found a significant difference in two-marker-haplotype distributions for two adjacent genomic fragments defined by markers rs152730–rs152745 and rs152745–rs152740 (Praw = 0.0075 and Praw = 0.00869, respectively; corrected for multiple testing of all informative markers at the SCNN1G/SCNN1B-locus Pcorr = 0.0397, Fig. 1). We concluded that the variant(s) that determine intrapair discordance are located on the genomic fragment between rs152730 and rs152740. Based on the allele frequency distribution among concordant and discordant pairs, we selected representatives for the contrasting haplotypes for Sanger resequencing of the mapped genomic fragment (Table 1). We chose three homozygotes for the haplotype TTAGA, two homozygotes for the haplotype GGAAT and one homozygote for the haplotype GTCAT for sequencing of the rs152730–rs152740 genomic fragment on contrasting alleles at markers rs152730–rs8044970–rs63982–rs152745–rs152740. We used long-range PCR to amplify an 8269 bp and an 8856 bp product encompassing the sequence of interest (Table 2). Sanger sequencing was performed using internal primers positioned every 500 bp on the forward and reverse strands. 
Using the software CodonCode Aligner, 476 primary reads with a median length of 737 bp were aligned to the reference sequence, assuring coverage of at least 4 reads per haplotype at each genomic position. Based on this alignment, we identified 6 SNP positions for carriers of the contrasting haplotypes for which concordant and discordant pairs had different genetic information. At the six SNPs rs152730–rs152731–rs152745–rs152744–rs152741–rs152740, alleles associated with intrapair concordance carried the haplotype GCAGTT; in contrast, alleles associated with intrapair discordance carried the haplotype TTGACA. None of these six SNPs reside within the coding sequence of SCNN1B. However, according to in silico analyses, they possibly alter the secondary structure of the pre-mRNA (SupplTab. 1, SupplTab. 2, SupplFig. 1).
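As a small illustration, the two contrasting six-marker haplotypes reported above can be laid out per SNP. The SNP identifiers and haplotype strings are taken from the text; the variable and function names are hypothetical.

```python
SNPS = ["rs152730", "rs152731", "rs152745", "rs152744", "rs152741", "rs152740"]

CONCORDANCE_HAPLOTYPE = "GCAGTT"  # alleles associated with intrapair concordance
DISCORDANCE_HAPLOTYPE = "TTGACA"  # alleles associated with intrapair discordance


def alleles_by_snp(haplotype: str) -> dict:
    """Map each SNP identifier to its allele on the given haplotype."""
    if len(haplotype) != len(SNPS):
        raise ValueError("haplotype length must match the number of SNPs")
    return dict(zip(SNPS, haplotype))


concordant = alleles_by_snp(CONCORDANCE_HAPLOTYPE)
discordant = alleles_by_snp(DISCORDANCE_HAPLOTYPE)

# List the SNPs at which the two haplotypes carry different alleles.
differing = [snp for snp in SNPS if concordant[snp] != discordant[snp]]

for snp in SNPS:
    print(f"{snp}: concordance {concordant[snp]} / discordance {discordant[snp]}")
```

Laid out this way, the two haplotypes differ at all six positions, and rs152731 carries C on the concordance-associated haplotype and T on the discordance-associated haplotype.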

Table 1 Haplotype and diplotype distribution observed among concordant and discordant cystic fibrosis F508del homozygous sibling pairs for markers rs152730, rs8044970, rs63982, rs152745, rs152740.


Table 2 Variants observed on contrasting haplotypes identified after Sanger re-sequencing of the 8000 bp genomic fragment defined by rs152730–rs152740 associated with intrapair discordance.


An uncommon alternative SCNN1B transcript generated by intron retention in epithelial cell lines

Because none of the six identified SNPs affect the amino acid sequence of the SCNN1B protein, we next aimed to determine whether SCNN1B undergoes alternative splicing. Alternative transcripts were inferred from mapped expressed sequence tags (ESTs, SupplFig. 2). As a source for polyA+ RNA, we used T84 cells, which are derived from colon carcinoma, 16HBE14o- cells, which are virus-transformed non-CF respiratory epithelial cells, and CFBE41o- and CFTE29o- cells, both of which are immortalized respiratory epithelial cells derived from F508del-CFTR homozygous CF patients. Primers for combinatorial reverse-transcription PCR were designed to reflect ESTs reported for SCNN1B in the area of interest defined by SNPs rs152730–rs152740 (SupplFig. 2A). By using primers located in exons 3 and 4 or exons 3 and 5, we detected wild-type SCNN1B in all four epithelial cell lines (Fig. 1C). Additionally, we amplified a 280 bp product using one primer located within exon 3 and one primer located 100 bp 3′ of the splice site at the end of exon 3 (Fig. 1C and SupplFig. 2). We specifically investigated this intronic sequence because it has been reported to be retained in EST clones BM694355 and BU730506, which were generated from a cDNA library prepared from retinal pigment epithelium of a healthy adult male. Primers designed to detect ESTs AW844136, CV337204 and BX485038 did not amplify a product (SupplFig. 2). The 280 bp product, derived from the alternative SCNN1B transcript generated by exon read-through, was reliably amplified from T84-derived cDNA even when the RNA was pretreated with DNase (SupplFig. 2). Moreover, Sanger sequencing of the alternative product confirmed its identity at the exon 3/intron 3 border. Hence, the alternative mRNA was generated by an exon read-through event and matched the genomic sequence base for base (SupplFig. 2E).
Furthermore, primers placed upstream in exon 1 encoding the 5′ UTR of SCNN1B in combination with a primer placed on the retained intron sequence yielded a product from cDNA (Fig. 1C). Conversely, no signal was observed when using a primer located in the downstream exon 5 in combination with the retained intron sequence (data not shown). To summarize, cancer-derived intestinal epithelial cells as well as virus-immortalized respiratory epithelial cells expressed an alternative SCNN1B transcript in which intron 3 was partially retained. If translated, this alternative transcript would preserve the reading frame at the end of exon 3 and would be translated into a protein that terminates prematurely after 26 additional amino acids derived from the retained intron sequence, producing a severely truncated SCNN1B protein of 221 amino acids.
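The lengths quoted for the predicted truncated protein can be checked with simple arithmetic. The sketch below assumes, purely for illustration, that the first intron-derived codon starts at the first intronic nucleotide (that is, exon 3 ends on a codon boundary); the text only states that the reading frame is preserved, so the nucleotide counts are an assumption.

```python
TRUNCATED_PROTEIN_AA = 221  # total length of the predicted truncated protein (from the text)
INTRON_DERIVED_AA = 26      # additional residues encoded by the retained intron (from the text)
NT_PER_CODON = 3

# Residues contributed by the regular exons upstream of the retained intron.
exon_encoded_aa = TRUNCATED_PROTEIN_AA - INTRON_DERIVED_AA

# Assumption: translation enters the intron on a codon boundary.
intron_nt_translated = INTRON_DERIVED_AA * NT_PER_CODON
intron_nt_including_stop = intron_nt_translated + NT_PER_CODON

print(exon_encoded_aa, intron_nt_translated, intron_nt_including_stop)
```

Under that assumption, 195 residues would come from the exons, and the premature stop codon would end 81 nucleotides into the retained intron sequence.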

The SCNN1B haplotype found in discordant pairs is enriched for predicted transcription factor binding sites

We next assumed that an allelic association with a discordant manifestation of CF severity is mediated by factors that recognize the allele enriched among discordant pairs. To test our hypothesis, we assessed whether the six-marker haplotype associated with intrapair discordance for CF severity, i.e., TTGACA at the six SNPs rs152730–rs152731–rs152745–rs152744–rs152741–rs152740, attracts different DNA-binding proteins compared to the GCAGTT haplotype that is observed among concordant sibling pairs. To identify potential transcription factor binding sites, we used the tool “Match” (available at http://www.gene-regulation.com/)¹⁴, which is based on a library of mononucleotide weight matrices from TRANSFAC6.0. The settings were restricted to vertebrate transcription factors and limited to minimize false negatives (estimated error rate of 10% for training data set). As an input sequence, we used both alleles at each of the six divergent SNPs and ±20 bp flanking sequences. Next, we compared the list of putative transcription factor binding sites between the input sequences derived from haplotypes associated with concordance and discordance and noted those predicted to interact with only one of the two contrasting alleles at each SNP. Surprisingly, only 6 binding sites were predicted for the haplotype observed among concordant sibling pairs; 21 predicted interactions were exclusively related to the six-marker haplotype associated with intrapair discordance (Fig. 2, SupplTab 3a). Compared with the haplotype observed among concordant sibling pairs, the haplotype observed among discordant sibling pairs had significantly more opportunities to interact with DNA-binding proteins (p = 0.048; in comparison to the expectancy value derived from 26 binding sites distributed equally between both haplotypes).
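The comparison step described above (allele-specific input sequences, then sites predicted for only one allele) can be sketched as follows. This does not call the Match tool; the flanking sequences, factor names and function names are hypothetical placeholders, and only the 20 bp flank length and the two-allele comparison come from the text.

```python
FLANK_BP = 20  # flanking sequence on each side of the SNP, as stated in the text


def allele_inputs(upstream: str, downstream: str, alleles: tuple) -> dict:
    """Build one input sequence per allele: 20 bp flank + allele + 20 bp flank."""
    if len(upstream) != FLANK_BP or len(downstream) != FLANK_BP:
        raise ValueError("each flank must be exactly 20 bp")
    return {allele: upstream + allele + downstream for allele in alleles}


def allele_exclusive_sites(sites_first: set, sites_second: set) -> tuple:
    """Return the predicted sites seen for only the first and only the second allele."""
    return sites_first - sites_second, sites_second - sites_first


# Placeholder flanks; real flanks would come from the reference sequence.
inputs = allele_inputs("A" * 20, "C" * 20, ("C", "T"))

# Hypothetical prediction results for the two input sequences.
sites_c = {"TF_shared", "TF_only_C"}
sites_t = {"TF_shared", "TF_only_T1", "TF_only_T2"}
only_c, only_t = allele_exclusive_sites(sites_c, sites_t)

print(len(inputs["C"]), sorted(only_c), sorted(only_t))
```

Sites predicted for both alleles drop out of the comparison, so only allele-dependent predictions are counted for each haplotype.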

Next, we evaluated whether predicted interacting proteins of the SCNN1B haplotype TTGACA (SupplTab 3a) are associated with the response to amiloride upon superfusion of the nasal epithelium. The function of ENaC in vivo can be assessed based on the potential difference between the nasal epithelium and the subcutaneous space¹⁵. According to the nasal potential difference with the use of amiloride (indicative of ENaC-mediated sodium transport), except for GATA2Sat (Praw = 0.0456), none of the genes encoding predicted interaction partners showed an association with ENaC function (SupplTab 3b).

Interaction partners of double-stranded DNA sequences can be captured with an electrophoretic mobility shift assay followed by protein sequencing (EMSA-PSeq)

As our in silico analysis did not extend to DNA-binding proteins with unknown binding motifs, we aimed to identify interacting proteins by performing a modified electrophoretic mobility shift assay followed by protein sequencing of the DNA–protein complex (EMSA-PSeq; see supplement for experimental details). Briefly, we used nuclear extracts derived from epithelial cells and biotinylated 35-mer dsDNA probes that centrally carry one of the contrasting alleles of the SNPs as bait. To separate the unbound probe from the probe-protein-complexes, we performed native polyacrylamide gel electrophoresis, and the probe-protein-complexes were visualized after transfer of the separated samples to a membrane. The gel fragment corresponding to the signal generated by the probe-protein-complex was excised, and proteins within the excised gel fragment were identified by protein mass spectrometry. We used the NFkappaB-P65-consensus sequence¹⁶ for optimization of the experimental setup. Protein mass spectrometry and MASCOT analysis identified several hundred proteins per high-molecular-weight complex. To enable the recognition of proteins that incidentally comigrate with the probe/protein complex, that bind nonspecifically to any DNA, or that are introduced to the sample as contaminants during handling of the gel fragment, a set of 25 EMSA-PSeq experiments was performed in parallel for noise filtering (SupplFig 3). In the EMSA-PSeq sample obtained with the NFkappaB-P65-consensus probe (Fig. 3), we detected P65 (score 78, tagged by three specific peptides) as a unique signal within a total of 25 evaluated EMSA-PSeq data sets. Additionally, the EMSA-PSeq sample baited with the NFkappaB-P65-consensus probe uniquely attracted STAT3 and STAT6, neither of which was observed in any other of the 25 EMSA-PSeq data sets.
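The uniqueness criterion used for noise filtering (a protein counts as specific when it appears in only one of the parallel data sets) can be sketched as below. This is a simplified illustration, not the full data evaluation strategy from the supplement; the data set names and all protein names other than P65, STAT3 and STAT6 are hypothetical.

```python
from collections import Counter
from typing import Dict, Set


def proteins_unique_to_one_dataset(datasets: Dict[str, Set[str]]) -> Dict[str, Set[str]]:
    """For each data set, keep only proteins that occur in no other data set."""
    counts = Counter(protein for proteins in datasets.values() for protein in proteins)
    return {
        name: {protein for protein in proteins if counts[protein] == 1}
        for name, proteins in datasets.items()
    }


# Toy input: four data sets instead of the 25 used in the study.
toy_datasets = {
    "p65_consensus": {"P65", "STAT3", "STAT6", "CONTAMINANT_1"},
    "probe_a": {"CONTAMINANT_1", "PROTEIN_A"},
    "probe_b": {"CONTAMINANT_1", "COMIGRATING_1"},
    "probe_c": {"COMIGRATING_1"},
}

unique = proteins_unique_to_one_dataset(toy_datasets)
print(sorted(unique["p65_consensus"]))
```

Proteins that recur across probes, such as handling contaminants or nonspecific DNA binders, are removed because they are counted more than once.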

Nucleic acid binding proteins attracted by probes of SNPs rs152730, rs152731 and rs152744 and identified as unique by EMSA-PSeq

To detect interaction partners of SCNN1B SNPs associated with intrapair discordance, we used the experimental conditions that had allowed us to detect NFkappaB-P65 using a probe with a p65 consensus sequence as bait (Fig. 3). Although we varied the conditions for electrophoresis, electrotransfer and detection (see supplement for details), we were not able to obtain a reproducible high-molecular-weight complex for either allele of rs152745, rs152741 and rs152740. However, high-molecular-weight DNA–protein-complexes were observed with probes representing the two contrasting alleles at SNPs rs152730, rs152731 and rs152744. As described in detail in the Supplemental material, we filtered the raw data set of 25 EMSA-PSeq experiments for low-expressed proteins annotated to have nucleic acid binding capabilities and attracted to only one SNP (SupplFig 3).

Even when considering the inaccuracy of complex sizes after separation on the native polyacrylamide gel, high-molecular-weight complexes were incompatible with the interaction of a single protein found by EMSA-PSeq (SupplTab 4). Thus, several independent comigrating protein–protein and protein–protein-nucleic acid complexes were likely subjected to protein sequencing. Among the proteins identified as specific for the P65 consensus probe, P65 was one of 11 proteins (Fig. 4, SupplTab 4). Sixteen, five and eight proteins were identified uniquely for SNPs rs152730, rs152731 and rs152744, respectively, by EMSA-PSeq and data mining (Fig. 4, SupplTab 4). Since P65 was found with the same experimental and data evaluation strategy used for the data sets for the SCNN1B SNPs, we assume that our true-positive protein of interest has been captured as well.

The ESRP2 genetic background is associated with the manifestation of the amiloride-sensitive sodium current in the nasal epithelium

To prove or disprove that a protein identified by EMSA-PSeq can influence ENaC function, we performed a candidate-gene-based association study among CF patients in which one of the studied phenotypes addresses ENaC function in the nasal epithelium in vivo¹⁷. To select a plausible genetic locus based on the list of 29 candidate proteins (Fig. 4, SupplTab 4), we excluded components of the spliceosome multiprotein complex. Among the remaining EMSA-PSeq-derived proteins, the epithelial-specific splicing regulatory protein 2 ESRP2^18,19 was the most reasonable candidate. First, CF is an epithelial disease and ESRP2 is consistent with this feature^20,21. Second, an alternative SCNN1B transcript was observed (Fig. 1C, SupplFig 2) and ESRP2 is plausible based on its role in transcript processing.

We found ESRP2 exclusively on probes representing rs152731 whereas ESRP1 was detected on probes derived from the three EMSA-PSeq SNPs (Fig. 5). In the association study, the ESRP1 marker ESRP1-Sat1 showed no association with the phenotype or severity of CF (Praw > 0.2). In contrast, ESRP2-Sat1 exhibited an association signal (Pbest = 0.04) that was confirmed with a second microsatellite and 5 SNPs (Pbest = 0.0131, Pcorr = 0.068 for multiple testing of 7 markers; Fig. 5) for the manifestation of amiloride-sensitive sodium conductance, a hallmark of ENaC function.
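For orientation, the best raw P value can be compared with two standard corrections for 7 tests. This is a hedged sketch: this section does not state which correction method produced Pcorr = 0.068, and Bonferroni and Šidák are shown only as familiar reference points that assume independent tests.

```python
def bonferroni(p_raw: float, n_tests: int) -> float:
    """Bonferroni-corrected P value, capped at 1."""
    return min(1.0, p_raw * n_tests)


def sidak(p_raw: float, n_tests: int) -> float:
    """Sidak-corrected P value for independent tests."""
    return 1.0 - (1.0 - p_raw) ** n_tests


P_BEST = 0.0131   # best raw P value reported for the ESRP2 markers
N_MARKERS = 7     # two microsatellites and five SNPs
P_CORR_REPORTED = 0.068

p_bonferroni = bonferroni(P_BEST, N_MARKERS)
p_sidak = sidak(P_BEST, N_MARKERS)

print(f"Bonferroni: {p_bonferroni:.4f}, Sidak: {p_sidak:.4f}, reported: {P_CORR_REPORTED}")
```

Both reference values (about 0.092 and 0.088) are larger than the reported 0.068, which would be consistent with a correction that accounts for correlation between markers at the same locus, although that is an inference rather than something stated here.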

The two rs152731 alleles are nonequivalent concerning ESRP2 binding

ESRP2 was detected as a factor binding to the C to T SNP rs152731 by EMSA-PSeq (Fig. 4) by using polyacrylamide gels with a separating distance of 4 cm. To detect ESRP2-rs152731 binding complexes, we employed EMSAs using polyacrylamide gels with a separating distance of 20 cm (Fig. 6A,B). To exclude nonspecific binding, signals obtained with an antibody directed against ESRP2 were compared to protein-DNA-complexes probed with IgG (isotype control). Signal intensities for rs152731-C were comparable in IgG and anti-ESRP2-Ab lanes. In contrast, signals obtained for rs152731-T were stronger with anti-ESRP2-Ab than with IgG. Normalized signal intensities for rs152731-T were higher than those for rs152731-C (p = 0.013).

Next, we addressed whether ESRP2 recognizes rs152731 directly. We performed a coimmunoprecipitation (co-IP) experiment using a biotinylated rs152731 EMSA probe as bait with a raw nuclear extract. After precipitation with an anti-biotin-Ab, detection of ESRP2 by Western blotting in co-IP samples verified that all components sufficient for ESRP2 to bind rs152731 are present within the nuclear extract and/or the components used for the EMSA preceding co-IP (Fig. 6C,D). In three independent co-IP experiments comparing the two contrasting rs152731 alleles, probes with rs152731-T yielded stronger ESRP2 signals than probes with rs152731-C. In summary, the results from two different techniques confirmed that rs152731-T binds ESRP2 better than does rs152731-C. Thus, the allele rs152731-T associated with intrapair discordance (Tables 1, 2) attracts ESRP2 to a greater extent and is more vulnerable to its genetic variations (Fig. 5).

ESRP2 knockdown alters global SCNN1B expression in 16HBE14o- cells

To determine whether ESRP2 influences the transcript species generated from SCNN1B, we downregulated ESRP2 by siRNA in T84 and 16HBE14o- cells and quantified the wild-type and the alternative transcripts by qPCR (Fig. 7). Silencing of ESRP2 in T84 cells resulted in highly variable changes in SCNN1B transcripts. Moreover, the observed changes were comparable to those using scrambled control siRNA. In contrast, no systematic effect of scrambled control siRNA was detected in 16HBE14o- cells (p = 0.50 for wild-type, p = 0.32 for alternative SCNN1B transcript; Wilcoxon signed-rank test). Moreover, the amounts of wild-type and alternative SCNN1B were increased in 16HBE14o- cells upon treatment with siRNA directed against ESRP2 (p = 0.015 for wild-type; p = 0.054 for alternative SCNN1B transcript). Comparison of changes in wild-type and alternative SCNN1B transcript levels assessed by paired ΔΔCt levels for both amplicons indicated that downregulation of ESRP2 induced expression of functional wild-type SCNN1B in 16HBE14o- cells (p = 0.081, Wilcoxon signed-rank test).
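The ΔΔCt quantity mentioned above can be illustrated with the commonly used comparative Ct calculation. This is a minimal sketch under stated assumptions: a reference gene is used for normalization and amplification efficiency is taken as 100%, neither of which is specified in this section, and all Ct values below are hypothetical.

```python
def delta_ct(ct_target: float, ct_reference: float) -> float:
    """Ct of the target amplicon normalized to a reference gene."""
    return ct_target - ct_reference


def delta_delta_ct(ct_target_treated: float, ct_ref_treated: float,
                   ct_target_control: float, ct_ref_control: float) -> float:
    """Difference in normalized Ct between treated (siRNA) and control samples."""
    return (delta_ct(ct_target_treated, ct_ref_treated)
            - delta_ct(ct_target_control, ct_ref_control))


def fold_change(ddct: float) -> float:
    """Relative expression, assuming the amount doubles with every cycle."""
    return 2.0 ** (-ddct)


# Hypothetical Ct values for one paired experiment.
ddct_wild_type = delta_delta_ct(24.0, 18.0, 25.0, 18.0)
ddct_alternative = delta_delta_ct(29.5, 18.0, 30.0, 18.0)

# Paired comparison of the two amplicons within the same experiment.
paired_difference = ddct_wild_type - ddct_alternative

print(ddct_wild_type, fold_change(ddct_wild_type), paired_difference)
```

A negative ΔΔCt corresponds to a higher transcript level after treatment; paired differences of this kind, collected across experiments, are the sort of values a Wilcoxon signed-rank test would then compare.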

The upper respiratory tract is the origin of 16HBE14o- cells while T84 cells are derived from intestinal cells; 16HBE14o- cells are virus-immortalized and T84 are colon cancer cells. Because ESRP2 plays a prominent role in cancer^18,19, it is not surprising that these two cell lines behaved differently in the ESRP2 gene silencing assay. Furthermore, the baseline expression of both SCNN1B transcripts was lower in 16HBE14o- than in T84 cells. Interestingly, T84 cells carry only one, but 16HBE14o- cells carry two of the ESRP2-receptive alleles T–G–A–C at markers rs152731–rs152745–rs152744–rs152741. This may explain the observed differences between 16HBE14o- and T84 cells in response to ESRP2 silencing. Homozygosity for T–G–A–C is associated with intrapair discordance among CF twins and siblings.

Discussion

The design of the European CF twin and sibling study was inspired by Risch and Zhang²² who proposed that the use of sibling pairs with extremely concordant or discordant phenotypes will advance the discovery of quantitative trait loci in humans^22,23,24. Due to the high power of this approach, it was estimated that the genotyping load for studies undertaken with an extreme sib-pair design, selecting for patient pairs who exhibit phenotypes below the 30th or above the 70th centile, can be reduced by up to 40-fold²². Based on this strategy for patient recruitment, 37 F508del-CFTR homozygous sibling pairs of 318 cystic fibrosis affected patient pairs were selected for the association study by a ranking algorithm¹². The selected sibling pairs were comparable in terms of their birth cohort¹¹. This strategy helped us to minimize the influence of a major nongenetic confounder²⁵, i.e., complex therapeutic management which has improved the life expectancy of CF patients by several decades. Pulmonary and gastrointestinal disease manifestations were assessed quantitatively by CF population centiles for the normalized forced expiratory volume in 1 s (FEV1) and by weight as a percentage of predicted weight for height. For these two parameters, we selected sibling pairs with extreme phenotypes in the upper and lower 25%^17,26. Our selection criteria were in line with recommendations proposed by Risch and Zhang²²; however, these criteria resulted in a small study population, thus limiting the power of the genetic association study. Moreover, since our study population is of white European descent, a group in which F508del-CFTR is the most common mutation causing CF, we cannot be certain that our findings can be applied to other populations.

Risch and Zhang concluded from their simulation studies that “extremely discordant sibling pairs represent a powerful design for the association studies of candidate genes”²², and our findings fully support this idea. The use of sibling pairs with extreme clinical phenotypes has been applied before^27,28, and our data support the notion that gene–gene interactions mediated by factors encoded in trans of the studied locus can be distinguished in an association study when mildly and severely affected siblings of discordant pairs are compared (Fig. 8).

Hence, in this study we identified a haplotype within SCNN1B associated with intrapair discordance in CF sibling pairs (Fig. 1, Tables 1, 2). We investigated in silico (Fig. 2) and experimentally (Figs. 3, 4) the occurrence of DNA-binding proteins interacting differentially with single or multiple SNPs within the haplotype. We were able to recognize ESRP2 as a candidate for validation among nucleic acid binding proteins, which showed an association with SCNN1B/ENaC function (Fig. 5). We further employed EMSA and co-IP as two different experimental approaches to support the allele-dependent interaction between rs152731 in SCNN1B and the nucleic acid binding protein ESRP2 (Fig. 6). Moreover, we demonstrated that siRNA mediated silencing of ESRP2 in respiratory epithelial cells causes an alteration in global SCNN1B expression (Fig. 7). Altogether, our findings consistently support the idea that SCNN1B and ESRP2 are interacting partners and that ESRP2 is capable of altering the SCNN1B transcript repertoire. It is plausible that this interaction can alter ENaC function and has an influence on the manifestation of CF.

Our proof-of-principle study has several limitations. Specifically, the findings cannot fully explain whether the regulatory element within SCNN1B leads to the alternative SCNN1B transcript, from which a severely truncated, 221 amino acid SCNN1B may be translated. Similar truncated SCNN1B isoforms of 217 and 306 amino acids have been observed in patients with systemic pseudohypoaldosteronism⁴⁸. In addition, heterologous expression of these truncated SCNN1B mutants in Xenopus oocytes showed that they can assemble with wild-type alpha- and gamma ENaC subunits⁴⁸. These SCNN1B mutants resulted in lower ENaC activity (by 3–7%) than the wild-type protein⁴⁸. Under physiological conditions, parallel expression of two SCNN1B transcript species, one of which yields a truncated SCNN1B isoform upon translation, can reduce SCNN1B function, but the extent remains unclear.

The use of EMSA-PSeq for the identification of DNA-binding proteins has been proposed previously^29,30,31. To examine whether our EMSA conditions allowed the formation of high-molecular-weight multiprotein complexes with coherent DNA–protein-interactions in vivo, we investigated protein-DNA-complexes using an NFkappaB-P65-consensus as bait. In line with published data^32,33,34,35, this probe attracted NFkappaB-p65 and its known interaction partners, such as STAT3 and STAT6. From the EMSA-PSeq of SCNN1B probes, we selected ESRP2 as a candidate for further validation experiments. This selection was based on the fact that this protein is characteristically expressed in epithelial cells and that, similar to other SNP-specific proteins recognized by EMSA-PSeq, ESRP2 has been well-characterized as an RNA-binding protein^18,19.

The defining border between RNA- and DNA-binding proteins has recently softened because typical DNA-binding proteins have become known to target long noncoding RNAs, defining dual-recognition nucleic acid binding proteins^36,37. A growing number of nucleic acid binding proteins have been recognized to interact with both nucleic acid species^37,38,39,40,41,42,43,44 and genomic DNA⁴⁵. The ability to bind to DNA and RNA simultaneously designates a dual recognition protein capable of shuttling between both nucleic acid types during transcription³⁸. During transcription, DNA and RNA are physically close, and therefore, cotranscriptional processes, such as pre-mRNA splicing, can be mediated by putative dual-recognition proteins, such as hnRNP splicing regulatory factors^46,47.

Nevertheless, we cannot exclude that SCNN1B may have other important interacting partners in addition to ESRP2 that were not discovered in this study. In this work, we analyzed only three of six SNPs by EMSA-PSeq. Furthermore, while we filtered our primary protein sequencing data using a positive control (NFkappaB-P65) and several technical controls to recognize contamination, we did not incorporate a protein–probe-interaction with low binding affinity, which might enable the detection of weak interacting partners. In the future, the resolution of EMSA-PSeq can be improved by using scrambled probes to control for nonspecific binding and by incorporating the false-positives captured in the data evaluation strategy. Moreover, EMSA-PSeq utilizes mass spectrometry to identify proteins. In contrast, nucleic acids such as long noncoding RNAs with the potential of interacting with the DNA sequence cannot be identified in this experimental setting, and thus, their relevance needs to be verified by other methods.

The haplotype associated with intrapair discordance covers a genomic segment of 8 kb, implying that a synergistic relationship of more than one interaction partner is responsible for the selective advantage that underlies the maintenance of linkage disequilibrium over such a distance. Thus, it is unlikely that the SCNN1B function can be fully understood based on studying single SNPs. Regardless, we are convinced that the methodology proposed herein (analysis of clinically discordant sibling pairs in combination with EMSA-PSeq) aids in our understanding of how some of the 10,000 SNPs identified by GWAS as being meaningful (www.genome.gov/gwastudies, accessed on 03.02.2015) contribute to the manifestation of phenotypes in humans. As gene–gene interactions have been suggested to account for the phenomenon termed “missing heritability”⁴⁹, the discovery of regulatory interactions such as those between ESRP2 and SCNN1B might help to annotate existing GWAS data sets that have been performed with sibling pairs^50,51,52.

Methods

Details on the experimental procedures are provided in the supplement.

Cell culture

Biomaterials were derived from T84 colon cancer cells and immortalized respiratory epithelial 16HBE14o-, CFBE41o- and CFTE29o- cells.

RNA preparation

For RNA isolation, cells were cultured in plates, grown to confluency, snap-frozen in the gaseous phase of liquid N₂ and stored at −80 °C. RNA was extracted using the QIAamp RNA Blood Mini Kit (52304, Qiagen, Hilden, Germany) and the RNase-free DNase Set (79254, Qiagen, Hilden, Germany).

Oligonucleotides for PCR

Sequences of primers used for genotyping and combinatorial PCR are listed in the supplement (SupplTab 5).

Extraction of nuclear proteins

Nuclear extracts were prepared according to published standard methods⁵³ (SupplTab 6). To prevent carryover of the high-salt buffer used for lysis of nuclei, nuclear proteins were dialyzed against low-salt HEPES buffer. The completeness of dialysis was verified by measuring the conductivity of the nuclear extract with a needle probe (customized, Technische Forschungswerkstätten of the Hannover Medical School). The quality of nuclear proteins was ascertained by verifying the conductivity of the final extract after dialysis and by noting the absence of degradation by SDS-electrophoresis followed by Coomassie staining.

EMSA-PSeq

To identify proteins that interact with a specific DNA sequence, we performed an EMSA experiment, visualized the shifted band, captured the DNA–protein complexes by excising the corresponding region of the polyacrylamide gel and then performed protein mass spectrometry. The composition of the EMSA binding buffer was adjusted to reflect the nuclear milieu⁵⁴ (SupplTab 7). All experimental details and data analysis methods are provided in the supplement (SupplTab 8, SupplTab 9, SupplFig4, SupplFig5, SupplFig6).

Coimmunoprecipitation

Biotinylated 35-mer double-stranded DNA probes for the rs152731 allele C and allele T were incubated with nuclear extract in an EMSA experiment. The DNA–protein-complexes were precipitated using an anti-biotin antibody and protein G agarose beads. The protein–DNA-complexes were eluted from the beads in three consecutive steps, and Western blotting with anti-ESRP2 was used to detect ESRP2 in the precipitated protein–DNA-complexes. IgG instead of the anti-biotin antibody served as a negative control in all experiments. ESRP2 was identified using a signal from an unpurified nuclear extract developed in parallel in each Western blot experiment.

siRNA-mediated downregulation of ESRP2 in epithelial model cell lines and SCNN1B transcript analysis by real-time RT-PCR

siRNA directed against ESRP2 and scrambled control siRNA were purchased from GE Healthcare (mixture of four siRNAs, on-target plus pool). T84 and 16HBE14o- cells were transfected for 24 h and 48 h using a protocol supplied by the manufacturer with 10 µl of Dharmafect 1 and 100 pmol siRNA in 2 ml of cell culture medium per well of a 6-well plate. Commercially available kits were used according to the manufacturer’s instructions. RNA was isolated using the RNeasy Mini Kit (Qiagen), reverse-transcribed into cDNA with the High-Capacity cDNA Reverse Transcription kit with RNase inhibitor (Applied Biosystems) and used as a template for real-time PCR with PowerUp SYBR Green Master Mix (Applied Biosystems) to target wild-type and read-through alternative SCNN1B transcripts with the StepOnePlus real-time PCR system (Thermo Fisher Scientific). The housekeeping gene aldolase was amplified from 5 ng cDNA with 400 nM forward and reverse primers. SCNN1B transcripts were amplified from 30 ng of cDNA with 400 nM (read-through alternative transcript) and 700 nM (wild-type transcript) forward and reverse primers, respectively. Amplification was carried out using annealing at 60 °C. Threshold cycle (Ct) values were retrieved using StepOne Software (Thermo Fisher Scientific).

Genetic markers

Except for 7 previously typed markers¹¹, genetic markers were developed de novo for this project. Genotyping was performed by the SNPstream assay (technology by Beckman Coulter, used at Cologne Center of Genomics, Cologne, Germany), by microsatellite genotyping using direct blotting electrophoresis¹⁷ or by PCR–RFLP (see SupplTab 5a).

Evaluation of genetic data in the association study on European cystic fibrosis twins and siblings

The work presented here derived data from an association study on European CF twins and siblings¹⁷. The study was approved by the ethics committee of Hannover Medical School and written informed consent was obtained from all participants or their parental guardians. All methods were performed in accordance with relevant guidelines and regulations. The clinical characteristics of the patients have been described in detail elsewhere^12,15,17. Briefly, the 12% most informative pairs from the entire sample of 318 CF twin and sibling pairs for whom pulmonary function data and weight and height were available in 1996 were selected by a ranking algorithm¹². To study genetic modifiers, we aimed to reduce the effect of the disease-causing CFTR gene on the disease phenotype, thus deciding to study only one CFTR mutation genotype. F508del-CFTR, present on 70% of CF chromosomes from white populations of Central and West-European countries, is the only CFTR mutation for which such an approach is feasible. Moreover, patient subsamples were examined to assess the manifestation of the basic defect of impaired ion conductance in the respiratory tissue, as determined in vivo by nasal potential difference measurement¹⁵, and in intestinal tissue, as determined ex vivo by intestinal current measurement¹⁵. Genetic information obtained from the case and reference populations with contrasting phenotypes was compared using the software package FAMHAP⁵⁵, which allows family-based analysis^56,57, accepts data evaluation in association studies on unrelated individuals as well as on affected sibling pairs⁵⁵ and is adapted to handle intrapair comparison of genotype data in sibling pairs⁵⁵. Correction for multiple testing at loci typed with more than one marker was performed by haplotype permutation⁵⁶. For this purpose, the entire data set of cases and references was used to estimate haplotype frequencies⁵⁵. 
To ensure a consistent assignment of rare haplotypes in small subsamples, the genotype data of 101 families with a total of 171 patients from the European CF twin and sibling study were used as a training set in all comparisons. Haplotype, or, in cases of noninformative phase or haplotype uncertainty, weighted haplotype explanation lists were assigned to each individual whereby the haplotype frequencies of the entire data set were taken into account to compute conditional likelihood weights⁵⁵. Permutation was performed by randomly assigning the affection status to the individuals in each replication⁵⁵. For the comparison of case sibling pairs to reference sibling pairs, the affection status was permuted or not with an equal chance for both siblings simultaneously^55,56,57. For all comparisons described herein, the phenotypes and sample sizes of the case and reference populations are detailed within the legends, in Figs. 1, 5 and 8 as well as in Table 1.
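The pair-level permutation described above can be illustrated with a much simplified sketch. It is not the FAMHAP implementation: instead of weighted haplotype explanation lists it uses a hypothetical per-sibling allele count, and the function name `pair_permutation_test`, the test statistic and all numbers are assumptions made for illustration only. The one feature it is meant to show is that the case or reference label is reassigned to both siblings of a pair simultaneously.

```python
import random


def pair_permutation_test(case_pairs, reference_pairs, n_perm=5000, seed=7):
    """Permutation test in which labels are shuffled per sibling pair.

    Each pair is a tuple (allele_count_sib1, allele_count_sib2); the counts
    are hypothetical stand-ins for the haplotype data used in the study.
    Returns the observed statistic and a Monte Carlo p-value.
    """
    rng = random.Random(seed)

    def mean_allele_count(pairs):
        # Average allele count per sibling within one group of pairs.
        return sum(a + b for a, b in pairs) / (2 * len(pairs))

    observed = abs(mean_allele_count(case_pairs) - mean_allele_count(reference_pairs))
    pooled = list(case_pairs) + list(reference_pairs)
    n_case = len(case_pairs)

    hits = 0
    for _ in range(n_perm):
        # Shuffling whole pairs keeps both siblings under the same label.
        rng.shuffle(pooled)
        diff = abs(mean_allele_count(pooled[:n_case]) - mean_allele_count(pooled[n_case:]))
        if diff >= observed - 1e-12:
            hits += 1
    # Add-one correction so the estimated p-value is never exactly zero.
    return observed, (hits + 1) / (n_perm + 1)


if __name__ == "__main__":
    # Hypothetical allele counts for case and reference sibling pairs.
    cases = [(2, 1), (2, 2), (1, 2), (2, 2)]
    references = [(0, 1), (1, 0), (0, 0), (1, 1)]
    print(pair_permutation_test(cases, references))
```

Keeping the pair as the unit of permutation preserves the within-pair correlation between siblings under the null hypothesis, which is the reason for permuting both siblings together.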

Statistical analyses

The algorithms of Sham and Curtis⁵⁸ were used to compare the observed occupancy with unique transcription factors of the SCNN1B haplotypes associated with concordance vs discordance against the expected value derived from binding partners distributed equally between both haplotypes.
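As a rough illustration of a Monte Carlo test on a contingency table, the following sketch simulates tables with the same row and column totals as the observed one and counts how often the simulated chi-square statistic reaches the observed value. It follows the general idea of such tests only; it is not the program of Sham and Curtis, and the function names and the example counts are hypothetical.

```python
import random


def chi_square(table):
    """Pearson chi-square statistic of a table given as a list of rows."""
    row_tot = [sum(row) for row in table]
    col_tot = [sum(col) for col in zip(*table)]
    n = sum(row_tot)
    stat = 0.0
    for i, row in enumerate(table):
        for j, obs in enumerate(row):
            expected = row_tot[i] * col_tot[j] / n
            if expected > 0:
                stat += (obs - expected) ** 2 / expected
    return stat


def monte_carlo_table_test(table, n_sim=2000, seed=1):
    """Estimate a p-value by simulating tables with fixed marginal totals."""
    rng = random.Random(seed)
    # One (row label, column label) entry per observation in the table.
    rows = [i for i, row in enumerate(table) for c in row for _ in range(c)]
    cols = [j for row in table for j, c in enumerate(row) for _ in range(c)]
    observed = chi_square(table)
    n_rows, n_cols = len(table), len(table[0])

    hits = 0
    for _ in range(n_sim):
        # Shuffling column labels keeps row and column totals unchanged.
        rng.shuffle(cols)
        sim = [[0] * n_cols for _ in range(n_rows)]
        for i, j in zip(rows, cols):
            sim[i][j] += 1
        if chi_square(sim) >= observed - 1e-12:
            hits += 1
    return observed, (hits + 1) / (n_sim + 1)


if __name__ == "__main__":
    # Hypothetical counts: rows are the two haplotypes, columns are
    # "unique binding partner" vs "shared binding partner".
    print(monte_carlo_table_test([[9, 3], [2, 10]]))
```

Because the p-value is estimated from simulated tables rather than from the asymptotic chi-square distribution, this kind of test remains usable when some cells contain few observations.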

EMSA band intensities obtained with the rs152731-C and rs152731-T probes were compared using a Mann–Whitney U test in technically (electrophoresis) and biologically (cell culture and preparation of nuclear extract) independent experiments.
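For small numbers of replicates, a Mann–Whitney U test can be computed exactly by enumerating all ways of splitting the pooled values into two groups. The sketch below does this with the standard library only; the function names and the band intensities are hypothetical, and in practice a statistics package would normally be used instead.

```python
from itertools import combinations


def u_statistic(x, y):
    """Number of (x, y) pairs with x > y, counting ties as one half."""
    return sum(1.0 if a > b else 0.5 if a == b else 0.0 for a in x for b in y)


def mann_whitney_exact(x, y):
    """Exact two-sided Mann-Whitney U test by full enumeration.

    Only practical for small samples, because the number of splits grows
    as the binomial coefficient C(len(x) + len(y), len(x)).
    """
    pooled = list(x) + list(y)
    n_x, n = len(x), len(pooled)
    center = n_x * len(y) / 2  # value of U expected under the null hypothesis
    u_obs = u_statistic(x, y)
    observed = abs(u_obs - center)

    hits = total = 0
    for idx in combinations(range(n), n_x):
        chosen = set(idx)
        group_x = [pooled[i] for i in idx]
        group_y = [pooled[i] for i in range(n) if i not in chosen]
        total += 1
        if abs(u_statistic(group_x, group_y) - center) >= observed - 1e-12:
            hits += 1
    return u_obs, hits / total


if __name__ == "__main__":
    # Hypothetical band intensities (arbitrary units) for the two probes.
    intensity_c = [0.82, 0.91, 0.77, 0.95, 0.88]
    intensity_t = [0.61, 0.70, 0.58, 0.79, 0.66]
    print(mann_whitney_exact(intensity_c, intensity_t))
```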

Changes in the expression levels of SCNN1B transcripts were judged from threshold cycle (Ct) values obtained by qPCR using the ΔΔCt method, comparing siRNA-treated cells to untreated controls. To test the null hypothesis that the SCNN1B transcript level did not change, the Wilcoxon signed rank test was used to assess whether equal proportions of samples showed an increase or a decrease in SCNN1B transcript species upon treatment with siRNA. For this analysis, technically (qPCR) and biologically (cell culture and experimental intervention) independent Ct values were used, and the ΔΔCt values derived from independent siRNA-treated or control samples entered the statistical analysis.
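The ΔΔCt calculation and a signed rank test on the resulting values can be sketched as follows. The example assumes that aldolase serves as the reference gene, as stated for the qPCR above, and that amplification efficiency is close to 100%, so that the fold change is 2 to the power of −ΔΔCt. All Ct values and function names are hypothetical, and the exact test shown enumerates sign patterns, which is only feasible for a small number of samples.

```python
from itertools import product


def delta_delta_ct(ct_target_treated, ct_ref_treated, ct_target_control, ct_ref_control):
    """Return (ddCt, fold change) for one treated/control comparison."""
    d_treated = ct_target_treated - ct_ref_treated   # dCt of siRNA-treated sample
    d_control = ct_target_control - ct_ref_control   # dCt of control sample
    ddct = d_treated - d_control
    return ddct, 2.0 ** (-ddct)  # assumes ~100% amplification efficiency


def average_ranks(values):
    """Ranks starting at 1, with tied values sharing their average rank."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        for k in range(i, j + 1):
            ranks[order[k]] = (i + j) / 2 + 1
        i = j + 1
    return ranks


def wilcoxon_signed_rank_exact(diffs):
    """Exact two-sided Wilcoxon signed rank test of diffs against zero."""
    nonzero = [d for d in diffs if d != 0]  # zero differences are dropped
    ranks = average_ranks([abs(d) for d in nonzero])
    w_plus = sum(r for d, r in zip(nonzero, ranks) if d > 0)
    center = sum(ranks) / 2
    observed = abs(w_plus - center)
    count = 0
    for signs in product((0, 1), repeat=len(nonzero)):
        w = sum(r for s, r in zip(signs, ranks) if s)
        if abs(w - center) >= observed - 1e-12:
            count += 1
    return w_plus, count / 2 ** len(nonzero)


if __name__ == "__main__":
    # Hypothetical Ct values: (target treated, aldolase treated,
    # target control, aldolase control) for independent experiments.
    runs = [(27.1, 18.0, 26.0, 18.1), (26.8, 17.9, 26.1, 18.0),
            (27.4, 18.2, 26.3, 18.0), (26.9, 18.1, 26.2, 18.2)]
    ddcts = [delta_delta_ct(*run)[0] for run in runs]
    print(ddcts, wilcoxon_signed_rank_exact(ddcts))
```

A positive ΔΔCt corresponds to a later threshold cycle in the treated sample and therefore to a lower transcript level relative to the control.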

Data availability

Supplemental Information is provided with this manuscript. Primary data will be shared with interested parties upon reasonable request.

References

Welter, D. et al. The NHGRI GWAS Catalog, a curated resource of SNP-trait associations. Nucleic Acids Res. 42, D1001-1006 (2014).
Zhang, F. & Lupski, J. R. Non-coding genetic variants in human disease. Hum. Mol. Genet. 24, R102-110 (2015).
Birney, E. et al. Identification and analysis of functional elements in 1% of the human genome by the ENCODE pilot project. Nature 447, 799–816 (2007).
Tak, Y. G. & Farnham, P. J. Making sense of GWAS: using epigenomics and genome engineering to understand the functional relevance of SNPs in non-coding regions of the human genome. Epigenetics Chromatin 8, 57 (2015).
Ratjen, F. et al. Cystic fibrosis. Nat. Rev. Dis. Primer 1, 15010 (2015).
Strausbaugh, S. D. & Davis, P. B. Cystic fibrosis: a review of epidemiology and pathobiology. Clin. Chest Med. 28, 279–288 (2007).
Elborn, J. S. Cystic fibrosis. Lancet Lond. Engl. 388, 2519–2531 (2016).
Berdiev, B. K., Qadri, Y. J. & Benos, D. J. Assessment of the CFTR and ENaC association. Mol. Biosyst. 5, 123–127 (2009).
Enuka, Y., Hanukoglu, I., Edelheit, O., Vaknine, H. & Hanukoglu, A. Epithelial sodium channels (ENaC) are uniformly distributed on motile cilia in the oviduct and the respiratory airways. Histochem. Cell Biol. 137, 339–353 (2012).
Tarran, R., Trout, L., Donaldson, S. H. & Boucher, R. C. Soluble mediators, not cilia, determine airway surface liquid volume in normal and cystic fibrosis superficial airway epithelia. J. Gen. Physiol. 127, 591–604 (2006).
Stanke, F. et al. The TNFalpha receptor TNFRSF1A and genes encoding the amiloride-sensitive sodium channel ENaC as modulators in cystic fibrosis. Hum. Genet. 119, 331–343 (2006).
Mekus, F. et al. Categories of deltaF508 homozygous cystic fibrosis twin and sibling pairs with distinct phenotypic characteristics. Twin Res. Off. J. Int. Soc. Twin Stud. 3, 277–293 (2000).
Stanke, F. et al. Hierarchical fine mapping of the cystic fibrosis modifier locus on 19q13 identifies an association with two elements near the genes CEACAM3 and CEACAM6. Hum. Genet. 127, 383–394 (2010).
Matys, V. et al. TRANSFAC and its module TRANSCompel: transcriptional gene regulation in eukaryotes. Nucleic Acids Res. 34, D108–D110 (2006).
Bronsveld, I. et al. Chloride conductance and genetic background modulate the cystic fibrosis phenotype of Delta F508 homozygous twins and siblings. J. Clin. Invest. 108, 1705–1715 (2001).
Schreck, R., Zorbas, H., Winnacker, E. L. & Baeuerle, P. A. The NF-kappa B transcription factor induces DNA bending which is modulated by its. Nucleic Acids Res. 18, 6497–6502 (1990).
Stanke, F. et al. Genes that determine immunology and inflammation modify the basic defect of impaired ion conductance in cystic fibrosis epithelia. J. Med. Genet. 48, 24–31 (2011).
Warzecha, C. C., Shen, S., Xing, Y. & Carstens, R. P. The epithelial splicing factors ESRP1 and ESRP2 positively and negatively regulate diverse types of alternative splicing events. RNA Biol. 6, 546–562 (2009).
Warzecha, C. C. et al. An ESRP-regulated splicing programme is abrogated during the epithelial-mesenchymal transition. EMBO J. 29, 3286–3300 (2010).
Haq, I. J., Gray, M. A., Garnett, J. P., Ward, C. & Brodlie, M. Airway surface liquid homeostasis in cystic fibrosis: pathophysiology and therapeutic targets. Thorax 71, 284–287 (2016).
Stoltz, D. A., Meyerholz, D. K. & Welsh, M. J. Origins of cystic fibrosis lung disease. N. Engl. J. Med. 372, 351–362 (2015).
Risch, N. & Zhang, H. Extreme discordant sib pairs for mapping quantitative trait loci in humans. Science 268, 1584–1589 (1995).
Eaves, L. & Meyer, J. Locating human quantitative trait loci: guidelines for the selection of sibling pairs for genotyping. Behav. Genet. 24, 443–455 (1994).
Risch, N. J. & Zhang, H. Mapping quantitative trait loci with extreme discordant sib pairs: sampling considerations. Am. J. Hum. Genet. 58, 836–843 (1996).
Stanke, F. et al. An informative intragenic microsatellite marker suggests the IL-1 receptor as a genetic modifier in cystic fibrosis. Eur. Respir. J. 50, 1700426 (2017).
Mekus, F., Laabs, U., Veeze, H. & Tummler, B. Genes in the vicinity of CFTR modulate the cystic fibrosis phenotype in highly concordant or discordant F508del homozygous sib pairs. Hum. Genet. 112, 1–11 (2003).
Rogus, J. J. et al. High-density single nucleotide polymorphism genome-wide linkage scan for susceptibility genes for diabetic nephropathy in type 1 diabetes: discordant sibpair approach. Diabetes 57, 2519–2526 (2008).
Santangelo, S. L. et al. A discordant sib-pair linkage analysis of age-related macular degeneration. Ophthalmic Genet. 26, 61–67 (2005).
Jiang, D., Jia, Y. & Jarrett, H. W. Transcription factor proteomics: identification by a novel gel mobility shift-three-dimensional electrophoresis method coupled with southwestern blot and high-performance liquid chromatography–electrospray-mass spectrometry analysis. J. Chromatogr. A 1218, 7003–7015 (2011).
Stead, J. A., Keen, J. N. & McDowall, K. J. The identification of nucleic acid-interacting proteins using a simple proteomics-based approach that directly incorporates the electrophoretic mobility shift assay. Mol. Cell. Proteomics MCP 5, 1697–1702 (2006).
Woo, A. J., Dods, J. S., Susanto, E., Ulgiati, D. & Abraham, L. J. A proteomics approach for the identification of DNA binding activities observed in the electrophoretic mobility shift assay. Mol. Cell. Proteomics MCP 1, 472–478 (2002).
Dhar, K., Rakesh, K., Pankajakshan, D. & Agrawal, D. K. SOCS3 promotor hypermethylation and STAT3-NF-kappaB interaction downregulate SOCS3 expression in human coronary artery smooth muscle cells. Am. J. Physiol. Heart Circ. Physiol. 304, H776-785 (2013).
Shen, C. H. & Stavnezer, J. Interaction of stat6 and NF-kappaB: direct association and synergistic activation of interleukin-4-induced transcription. Mol. Cell. Biol. 18, 3395–3404 (1998).
Yoshida, Y. et al. Interleukin 1 activates STAT3/nuclear factor-kappaB cross-talk via a unique. J. Biol. Chem. 279, 1768–1776 (2004).
Yu, Z. & Kone, B. C. The STAT3 DNA-binding domain mediates interaction with NF-kappaB p65 and inducible nitric oxide synthase transrepression in mesangial cells. J. Am. Soc. Nephrol. JASN 15, 585–591 (2004).
Connolly, K. M., Wojciak, J. M. & Clubb, R. T. Site-specific DNA binding using a variation of the double stranded RNA binding motif. Nat. Struct. Biol. 5, 546–550 (1998).
Hudson, W. H. & Ortlund, E. A. The structure, function and evolution of proteins that bind DNA and RNA. Nat. Rev. Mol. Cell Biol. 15, 749–760 (2014).
Gaillard, C., Cabannes, E. & Strauss, F. Identity of the RNA-binding protein K of hnRNP particles with protein H16, a sequence-specific single strand DNA-binding protein. Nucleic Acids Res. 22, 4183–4186 (1994).
Dempsey, L. A., Hanakahi, L. A. & Maizels, N. A specific isoform of hnRNP D interacts with DNA in the LR1 heterodimer: canonical RNA binding motifs in a sequence-specific duplex DNA binding protein. J. Biol. Chem. 273, 29224–29229 (1998).
Chennathukuzhi, V. M., Kurihara, Y., Bray, J. D., Yang, J. & Hecht, N. B. Altering the GTP binding site of the DNA/RNA-binding protein, Translin/TB-RBP, decreases RNA binding and may create a dominant negative phenotype. Nucleic Acids Res. 29, 4433–4440 (2001).
Zhao, H., Yang, Y. & Zhou, Y. Structure-based prediction of RNA-binding domains and RNA-binding sites and application to structural genomics targets. Nucleic Acids Res. 39, 3017–3025 (2011).
Shkreta, L. & Chabot, B. The RNA splicing response to DNA damage. Biomolecules 5, 2935–2977 (2015).
Norman, M., Rivers, C., Lee, Y.-B., Idris, J. & Uney, J. The increasing diversity of functions attributed to the SAFB family of. Biochem. J. 473, 4271–4288 (2016).
Ghosh, P. & Sowdhamini, R. Genome-wide survey of putative RNA-binding proteins encoded in the human proteome. Mol. Biosyst. 12, 532–540 (2016).
Mikula, M., Bomsztyk, K., Goryca, K., Chojnowski, K. & Ostrowski, J. Heterogeneous nuclear ribonucleoprotein (HnRNP) K genome-wide binding survey reveals its role in regulating 3′-end RNA processing and transcription termination at the early growth response 1 (EGR1) gene through XRN2 exonuclease. J. Biol. Chem. 288, 24788–24798 (2013).
Busch, A. & Hertel, K. J. Evolution of SR protein and hnRNP splicing regulatory factors. Wiley Interdiscip. Rev. RNA 3, 1–12 (2012).
Shipman, K. L., Robinson, P. J., King, B. R., Smith, R. & Nicholson, R. C. Identification of a family of DNA-binding proteins with homology to RNA splicing factors. Biochem. Cell Biol. Biochim. Biol. Cell. 84, 9–19 (2006).
Edelheit, O. et al. Truncated beta epithelial sodium channel (ENaC) subunits responsible for multi-system pseudohypoaldosteronism support partial activity of ENaC. J. Steroid Biochem. Mol. Biol. 119, 84–88 (2010).
Zuk, O., Hechter, E., Sunyaev, S. R. & Lander, E. S. The mystery of missing heritability: genetic interactions create phantom heritability. Proc. Natl. Acad. Sci. U.S.A. 109, 1193–1198 (2012).
Beekman, M. et al. Genome-wide linkage analysis for human longevity: genetics of healthy aging study. Aging Cell 12, 184–193 (2013).
Hemani, G. et al. Inference of the genetic architecture underlying BMI and height with the use of 20,240 sibling pairs. Am. J. Hum. Genet. 93, 865–875 (2013).
McQueen, M. B. et al. The National Longitudinal Study of Adolescent to Adult Health (Add Health) sibling pairs genome-wide data. Behav. Genet. 45, 12–23 (2015).
Pollock, R. M. DNA–protein interactions. In Current Protocols in Molecular Biology (ed. Ausubel, F. M.) 12.0.1-12.11 (Wiley, New York, 1997).
Century, T. J., Fenichel, I. R. & Horowitz, S. B. The concentrations of water, sodium and potassium in the nucleus and cytoplasm of amphibian oocytes. J. Cell Sci. 7, 5–13 (1970).
Herold, C. & Becker, T. Genetic association analysis with FAMHAP: a major program update. Bioinforma. Oxf. Engl. 25, 134–136 (2009).
Becker, T. & Knapp, M. A powerful strategy to account for multiple testing in the context of haplotype analysis. Am. J. Hum. Genet. 75, 561–570 (2004).
Knapp, M. & Becker, T. Family-based association analysis with tightly linked markers. Hum. Hered. 56, 2–9 (2003).
Sham, P. C. & Curtis, D. Monte Carlo tests for associations between disease and alleles at highly polymorphic loci. Ann. Hum. Genet. 59, 97–105 (1995).
Voilley, N. et al. Cloning, chromosomal localization, and physical linkage of the beta and gamma subunits (SCNN1B and SCNN1G) of the human epithelial amiloride-sensitive sodium channel. Genomics 28, 560–565 (1995).
Saxena, A. et al. Gene structure of the human amiloride-sensitive epithelial sodium channel beta subunit. Biochem. Biophys. Res. Commun. 252, 208–213 (1998).
Wang, M., Herrmann, C. J., Simonovic, M., Szklarczyk, D. & von Mering, C. Version 4.0 of PaxDb: protein abundance data, integrated across model organisms, tissues, and cell-lines. Proteomics 15, 3163–3168 (2015).


Acknowledgements

We thank the late Dieter Gruenert for providing the 16HBE14o-, CFBE41o- and CFTE29o- cell lines and Geoffrey Sargent for providing expert advice on the culture conditions. We are grateful to Melanie Lenz for assistance with preparing the nuclear extracts and to Jörg Viering for customizing the device that measures conductivity in volumes < 100 µl (quality control of nuclear extracts).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

Authors and Affiliations

Institute for Community Medicine, Ernst Moritz Arndt University, Greifswald, Germany
Tim Becker
xValue GmbH, Villich, Germany
Tim Becker
Research Core Unit Proteomics, Hannover Medical School, Hannover, Germany
Andreas Pich & Mohammed Ibrahim
Department of Paediatric Pneumology, Allergology and Neonatology, Hannover Medical School, Hannover, Germany
Stephanie Tamm, Silke Hedtfeld, Burkhard Tümmler & Frauke Stanke
Cologne Center for Genomics, University of Cologne, Cologne, Germany
Janine Altmüller, Nina Dalibor & Mohammad Reza Toliat
German Center for Lung Research (DZL), Partner site BREATH, Hannover, Germany
Sabina Janciauskiene, Burkhard Tümmler & Frauke Stanke
Department of Pneumology, Hannover Medical School, Hannover, Germany
Sabina Janciauskiene

Authors

Tim Becker
Andreas Pich
Stephanie Tamm
Silke Hedtfeld
Mohammed Ibrahim
Janine Altmüller
Nina Dalibor
Mohammad Reza Toliat
Sabina Janciauskiene
Burkhard Tümmler
Frauke Stanke

Contributions

B.T. and F.S. conceived and designed the project. Funding was acquired by F.S., T.B. and B.T. A.P., S.T., S.H., M.I., J.A. and N.D. performed the experiments. A.P., J.A., M.R.T., S.J. and F.S. analyzed the primary data. T.B. developed and implemented the program to perform the interpair and intrapair comparisons of genetic data of affected sib pairs. Formal analysis of genetic data was done by F.S. and T.B. T.B., A.P., S.T., S.H., M.I., J.A., M.R.T., S.J., B.T. and F.S. drafted the manuscript and/or revised the draft critically for content. F.S. visualized the data and wrote, edited and revised the final manuscript.

Corresponding author

Correspondence to Frauke Stanke.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary Information

Supplementary information 1.

Supplementary information 2.

Supplementary Data R1.

Supplementary Data R2.

Supplementary Data R3.

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.


About this article

Cite this article

Becker, T., Pich, A., Tamm, S. et al. Genetic information from discordant sibling pairs points to ESRP2 as a candidate trans-acting regulator of the CF modifier gene SCNN1B. Sci Rep 10, 22447 (2020). https://doi.org/10.1038/s41598-020-79804-y


Received: 08 August 2019
Accepted: 10 December 2020
Published: 31 December 2020
DOI: https://doi.org/10.1038/s41598-020-79804-y
