Introdution

Birt–Hogg–Dubé (BHD) syndrome (OMIM#135150) is a rare autosomal dominantly inherited condition caused by mutations in the FLCN gene encoding folliculin.1, 2, 3 BHD syndrome predisposes to the development of cutaneous hamartomas (fibrofolliculomas (FF)), multiple lung cysts (LCs), spontaneous pneumothoraces and renal cell cancer (RCC).4 The renal cancers associated with BHD are predominantly of the hybrid oncocytic/chromophobe subtype followed by the clear cell renal carcinomas.5 The phenotype is characterized in particular by multiple FF—predominantly facial and upper body—as up to 85% of all BHD syndrome patients suffer from these benign dermatological papules, which usually do not appear before the age of 20.6, 7, 8 Moreover, the dermatologic findings may be the only presenting signs in patients with BHD syndrome, thus making this recognition vital so that further systemic evaluation can be undertaken.

Currently, there is no international consensus for surveillance of BHD patients; hence the Danish recommendations consist of annually renal imaging either by MRI or ultrasound examination, depending on local competence. The onset of renal imaging is 10 years before the debut of the earliest renal cancer incidence in affected family members. The patients undergo examination by a dermatologist depending on individual requirements. Examination of the lungs is not a routine part of the Danish surveillance program.

Other clinical findings associated with BHD syndrome include a range of benign and malignant tumors, leading to a wide variability in the clinical features and consequently a high risk of underdiagnosing mutation carriers.8, 9 Although BHD syndrome was first described in 1977,1 the FLCN gene encoding folliculin was not identified until 2002.3 The FLCN gene is located on chromosome 17p11.2, contains 14 exons and encodes the 579-amino acid protein folliculin. Folliculin is widely conserved across species, indicating a crucial biological function, and recent data suggest that FLCN is involved in multiple biological pathways including metabolism, autophagy, cell-cell adhesion and apoptosis.10

So far, ~200 families with BHD syndrome caused by a pathogenic FLCN mutation have been reported worldwide,7, 8, 11, 12 and close to 150 unique DNA variants have been identified, including nonsense mutations, missense variants, frameshift mutations, intronic variants as well as large genomic rearrangements, of which ~110 are classified as pathogenic (https://grenada.lumc.nl/LOVD2/shared1/home.php?select_db=FLCN).

During the last decade, screening of patients suffering from FF, spontaneous pneumothoraces, kidney cancer as well as genetic susceptibility for BHD syndrome has been performed in Denmark. This study reports the variants identified in Danish patients with BHD syndrome or BHD syndrome-like symptoms and reveals six novel FLCN variants as well as a Danish founder mutation. Moreover, the study classifies the identified variants based on in silico analysis and functional splicing assays.

Materials and methods

Patients

Patients were referred to genetic counseling if there was a suspicion of familial BHD syndrome or if individuals had one of the following symptoms: FF, multiple LCs, spontaneous pneumothorax (SP) or RCC. In some cases, healthy individuals were included if there was a family history of any of the above symptoms. Following verbal and written consent, blood samples were collected from the proband for FLCN mutational screening. The patient/family history was verified using hospital medical records and pathology reports. The study was approved by the Capital Region Committee on Health Research Ethics (H-2-2014-088).

FLCN screening

Genomic DNA was purified from whole blood using the QIAamp DNA Mini Kit (Qiagen, Hilden, Germany) or ReliaPrep Large Volume HT gDNA Isolation Kit (Promega, Madison, WI, USA) using a Tecan Freedom EVO HSM2.0 Workstation (Tecan, Männedorf, Switzerland) according to the manufacturer’s instructions. FLCN screening was performed using intronic primers flanking each exon and the PCR products were sequenced as recently described.13 Moreover, genomic DNA was examined for large genomic rearrangements by multiplex ligation-dependent probe amplification (MLPA) analysis using the SALSA MLPA P256 FLCN Kit (MRC-Holland, Amsterdam, the Netherlands). Since 2014, the analysis has been performed using targeted next-generation sequencing and a library designed to capture all exons from the FLCN gene as previously described.14 Sequencing was performed on a MiSeq (Illumina, San Diego, CA, USA) to an average depth of at least × 100. Sequencing data were analyzed using Sequence Pilot (JSI medical systems, Ettenheim, Germany), where variants were called if the non-reference base frequency was above 25%. Sequence variants were verified in an independent blood sample by Sanger sequencing. FLCN variants are numbered according to accession number NM_144997.5 following the guidelines from the Human Genome Variation Societsy (http://www.hgvs.org/mutnomen).

In silico analysis

The integrated Alamut Visual software (v.2.6.1) (http://www.interactive-biosoftware.com) was used to predict the pathogenicity of specific protein variants using Align GVGD,15 PolyPhen-216 and SIFT,17 or to predict the effect of intronic variants on the efficiency of splicing using Splice Site Finder,18 GeneSplicer,19 Splice Site Prediction by Neural Network,20 MaxEntScan21 and Human Splicing Finder.22 The analysis was performed using default settings in all predictions. A variation in the splicing prediction of more than 10% in at least two algorithms was considered as having an effect on splicing.23 The frequency of the variants was obtained from the Exome Aggregation Consortium (ExAC) database. Classification of each variant was performed according to the five-tier scheme, where class 5 is pathogenic, class 4 is likely pathogenic, class 3 is uncertain, class 2 is likely benign and class 1 is benign.24

Mini-gene splicing assay

Wild-type FLCN exons 9 and 11 were cloned into the pSPL3 vector (Figure 1) and variations were introduced by DpnI-mediated mutagenesis performed using Phusion High-Fidelity Polymerase according to the accompanying instructions (Finnzymes, Waltham, MA, USA). Wild type and variant constructs were then transfected in duplicate into COS-7 cells and processed as recently described.25, 26, 27

Figure 1
figure 1

Mini-gene splicing analysis of two FLCN intronic variants. COS-7 cells were transfected with wild type or mutant vectors in duplicate. Total RNA was isolated, RT-PCR analysis was performed, and PCR products were separated by agarose gel electrophoresis and visualized by ethidium bromide staining. The sizes of the DNA marker (M) are indicated to the left. (a, b) The FLCN c.1062+2T>G variant generated a strong band of 177 bp lacking exon 9 and a very weak band of 498 bp using a cryptic splice site 130 bp upstream in intron 9. (c, d) The FLCN c.1177-5_1177-3del variant resulted in one strong band of 177 bp lacking exon 11. All PCR products were verified by Sanger sequencing. M, size marker; SA, splice acceptor site; SD, splice donor site.

SNP chip and identity by descent analysis

Nine individuals carrying the FLCN c.1062+2T>G mutation were genotyped for 934 968 SNPs using the Affymetrix 6.0 SNP chip as recently described.28 To obtain estimates of allele frequencies and linkage disequilibrium, the data were merged with the 180 European individuals from HapMap 3. Only the 724 772 SNPs that overlapped between the data sets were used. In addition, SNPs with a minor allele frequency below 5% and SNPs with missing information above 5% were removed. A simple method of moment estimator from PLINK was used to estimate coefficient of relationship between all pairs of individuals.29 To infer whether the c.1062+2T>G mutation is a founder mutation, we estimated relatedness locally on the genome using RELATE.30 To accommodate linkage disequilibrium in the analysis, we conditioned on the 50 previous single-nucleotide polymorphisms (SNPs). Physical positions were used assuming a uniform recombination rate of 1.3 cM/Mb (centiMorgans per megabase). Because we were only interested in whether the individuals share at least one allele IBD, we ignored the state of sharing two alleles IBD. This approach gave us a probability across chromosome 17 (location of the FLCN gene) of two individuals sharing at least one allele IBD. If the mutation is a founder mutation then all individuals should be related in the region around the mutation. The mean probability of sharing at least one allele IBD was estimated by taking the mean posterior probability of IBD sharing between all pairs of individuals for each site. We used a cut-off of 0.95 for the mean IBD sharing to infer the region shared IBD between all pairs of individuals.

Results

Since 2006, we have performed genetic FLCN screening of potential BHD syndrome individuals. Up until 1 November, 2015, 143 individuals or families have been examined and currently we have identified 13 different FLCN variants in 31 families, of which 6 are novel (Table 1). Moreover, we have identified three common polymorphisms (Table 2).

Table 1 FLCN germline variants identified in 31 Danish BHD syndrome families
Table 2 FLCN polymorphisms identified in putative Danish BHD syndrome families

We identified two novel frameshift mutations (c.158delA and c.233delA) as well as four known frameshift mutations (c.57_58delCT, c.584delG, c.1285delC and c.1285dupC) in the FLCN gene. The c.57_58delCT, c.233delA, c.584delG and c.1285dupC mutations were identified in one family each, the c.158delA mutation in two families, whereas the c.1285delC mutation was identified in seven distinct families. Moreover, in one family we identified a deletion of whole exon 9 (c.872-?_1062+?del). The mutations were all identified in families suffering from one or up to four of the symptoms of BHD syndrome, of which FF and spontaneous pneumothoraces were most common.

We also identified two intronic variants (c.1062+2T>G and c.1177-5_1177-3del) in the FLCN gene, present in the vicinity of exon 9 and 11, respectively. The c.1062+2T>G variant was identified in 11 putative BHD syndrome families, whereas the c.1177-5_1177-3del variant was identified in one family with RCC. FF was present in all of the 11 families with the c.1062+2T>G variant, whereas LC/SP and RCC was observed less frequently. The potential pathogenicity of the two intronic variants was investigated using five different in silico splicing prediction programs, which predict changes in splice-site strength (Table 3). The c.1062+2T>G variant was predicted to destroy the SD site by all five programs, whereas the c.1177-5_1177-3del variant was suggested by two programs to decrease the strength of the SA site by more than 10%. The functional effects of both variants were subsequently examined by mini-gene splicing assays cloning either exon 9 or 11 and surrounding intronic sequences into the pSPL3 vector (Figures 1a and c). In accordance with the in silico results, both variants revealed the presence of alternative gel bands compared with the corresponding wild types. The wild- type FLCN exon 9 construct generated a transcript comprising the expected 368 bp as well as a very weak band of 177 bp lacking exon 9. The c.1062+2T>G mutant yielded a strong band of 177 bp lacking exon 9 and a very weak band of 498 bp (Figure 1b). Sequencing revealed that the 498 bp band included an additional 130 bp of intron 9 using a cryptic splice site generating an out-of-frame mutation. Wild-type FLCN exon 11 generated two transcripts, one at the expected 301 bp and one band of 177 bp, indicating partial skipping of exon 11. The c.1177-5_1177-3del variant resulted in only one band of 177 bp lacking exon 11 (Figure 1d). We conclude that the c.1062+2T>G and c.1177-5_1177-3del mutations both result in aberrant splicing.

Table 3 Splicing prediction analysis of two FLCN intronic variants

As the c.1062+2T>G mutation was identified in 11 seemingly unrelated Danish families, we speculated that this mutation could be a Danish founder mutation. To investigate whether this was the case, we performed SNP array and IBD analysis based on one individual from 9 of the 11 families. If the FLCN mutation is a founder mutation, we would expect the carriers to share a common haplotype around the FLCN gene that is shared IBD. We compared the pairwise IBD sharing between each pair of individuals, both for the whole genome and locally at the FLCN locus. As shown in Figure 2, the coefficient of relationship based on the genome-wide data is estimated to be very low among all nine families, indicating that they are not closely related. However, when estimating the IBD sharing locally at the FLCN locus, we see that they are all related with very high probability. Accordingly, we infer that the FLCN c.1062+2T>G mutation is a Danish founder mutation. The mean IBD sharing between all pairs of individuals across chromosome 17 is shown in Supplementary Figure 1. From this, the region shared IBD among all nine individuals is suggested to be 2.95 Mb long.

Figure 2
figure 2

Global and local relatedness among members from nine different Danish families harboring the FLCN c.1062+2T>G intronic mutation. Each individual is indicated by their family ID (Table 1). The plot shows both the overall coefficient of relatedness between pairs of individuals based on all markers across the genome (see the upper triangle) and the local probability of relatedness at the FLCN locus (lower triangle). None of the families are closely related. However, at the FLCN gene, all of the individuals have high probability for sharing at least one allele identity by descent (IBD), which is consistent with a founder mutation.

We moreover identified a novel 21-bp deletion (c.1093_1113del) in exon 10 of the FLCN gene. The variant is an in-frame deletion leading to removal of seven amino acids (p.Ala365_Gly371del) from the FLCN protein. The deletion occurs in a highly conserved region of FLCN forming the second α-helix (Figure 3), suggesting that it is important for FLCN function. The variant was identified in two unrelated families, both of which presented with FF, as well as RCC in one family and spontaneous pneumothoraces and LCs in the other.

Figure 3
figure 3

Conservation of the second α-helix and first β-sheet in the C-terminal part of the FLCN protein. Human, monkey, rat, mouse, dog, platypus, chicken, frog, tetraodon and zebrafish were aligned. Highly conserved amino acids are shown in purple boxes, whereas conserved amino acids are shown in blue boxes. The Ala365-Gly371 amino acids are marked in brackets. α-helices (red cylinders) and β-sheet (yellow cylinders) are shown at the bottom and is based on data from Nookala et al.42 A full color version of this figure is available at the Journal of Human Genetics journal online.

Furthermore, we identified two missense variants (p.Asp33His and p.Ala90Ser) in the FLCN gene. The p.Asp33His variant was identified in a healthy individual whose daughter died of RCC at the age of 29, whereas the p.Ala90Ser variant was identified in a patient with FF, spontaneous pneumothoraces and LCs. The p.Asp33His variant is reported in two South Asian individuals in the ExAC database, of which one is homozygous. The p.Ala90Ser variant is reported at a frequency of 0.042% in the ExAC database. In silico protein analysis predicted that neither of the variants are pathogenic. Moreover, the p.Ala90Ser variant was identified in a patient with the disease-causing FLCN mutation p.Ala365_Gly371del. Finally, we identified one synonymous variant (p.Ile426Ile) in the FLCN gene, which is reported at a frequency of 0.01% in the ExAC database. The variant was found in an individual with RCC. It is currently unknown whether the variant has any effect on the RNA level.

Discussion

Mutations in the FLCN gene predispose to FF, LCs, spontaneous pneumothoraces and RCC—a syndrome known as BHD. In this study, we report the characterization of FLCN variants identified in 143 Danish patients or families with suspected BHD syndrome.

We identified six different frameshift mutations (c.57_58delCT, c.158delA, c.233delA, c.584delG, c.1285delC and c.1285dupC) in the FLCN gene, which are all classified as pathogenic (class 5). The c.57_58delCT mutation, resulting in a premature stop at codon 35, was found in a family with FF, spontaneous pneumothoraces and RCC. The mutation has recently been described in a French patient with spontaneous pneumothoraces and RCC as well as in a Japanese patient with FF and pneumothoraces.31, 32 Two of the frameshift mutations (c.158delA and c.233delA) are novel. The c.158delA mutation, which introduces a stop codon at position 54, was identified in a large pedigree. The five mutation carriers suffered from bilateral spontaneous pneumothoraces and one family member had fatal bilateral RCC. Moreover, the mutation was identified in an apparently unrelated family with a single affected member suffering from FF, LCs and spontaneous pneumothoraces. The c.233delA mutation, which results in a frameshift and a premature stop at codon 129, was recently found in a single patient with FF. At present, no other family members have been tested. The c.584delG mutation introduces a premature stop at codon 222. This mutation was identified in a family with LCs, spontaneous pneumothoraces and FF. The mutation has previously been described in three BHD syndrome patients with FF and LCs.7, 8 Two of these patients also had spontaneous pneumothoraces or RCC, respectively.8

The deletion or insertion of cytosine at position 1285 has previously been reported to be the cause of BHD syndrome in up to 75% of a cohort of affected families.3, 8, 33 The reported phenotype included the classical symptoms of FF, spontaneous pneumothoraces and RCC. However, other skin lesions (MM, perifollicular fibroma, angiofibroma and basal cell carcinoma) and colorectal lesions were listed among the clinical characteristics of these families. In our cohort, the c.1285delC and c.1285dupC mutations were identified in ~25% of mutation positive families (8/31) and presented with a variety of the classical BHD syndrome symptoms.

We moreover identified a novel deletion of exon nine. The deletion was identified in an individual suffering from LCs and moreover had RCC in the family. The deletion results in a frameshift and a premature stop codon and is therefore classified as pathogenic (class 5). LGRs have previously been described in the literature,34, 35, 36, 37 suggesting that examinations for LGRs in the FLCN gene should be performed routinely in a diagnostic setting.

Approximately 35% of the mutation positive BHD syndrome families in our cohort (11/31) carried the c.1062+2T>G intronic variant. In silico analysis and mini-gene splicing assay in combination with co-segregation data confirmed the pathogenic effect of this intronic mutation (class 5). This mutation was first reported in two apparently unrelated families.7 In one of these families, all five affected members suffered from FF, four also had LCs, three had renal tumors and one had SP. The other family consisted of three affected members all suffering from LCs and renal tumors. On the basis of these data, the c.1062+2T>G mutation was thought to be associated with a higher frequency of renal tumors.7 However, this was not supported by a later study reporting another family with the c.1062+2T>G mutation whose affected members only suffered from FF and LCs.8 Interestingly, in a study characterizing renal tumors in BHD syndrome patients, the c.1062+2T>G mutation was reported in a patient with RCC and FF.31 In the present study, SNP analysis revealed that nine of the Danish families (two families were just recently included in the study and therefore did not participate in the analysis) share a common haplotype IBD and hence are very distant relatives. This finding suggests that the mutation is a founder mutation shared among all the Danish c.1062+2T>G carriers. Intriguingly, we are aware that two of the families have relatives (Danish immigrants) in the USA and that they have also been tested for the mutation. However, whether these families participated in the previous studies of Schmidt et al.7 and Toro et al.8 is currently unknown. The phenotype of the carriers of the Danish founder mutation is predominantly characterized by FF. Although carriers also displayed a broad spectrum of the other classical BHD syndrome symptoms (RCC, LCs and spontaneous pneumothoraces), only 3 out of the 11 families suffered renal tumors. Hence, in line with the findings of Toro et al.8, we found no evidence of an association between the Danish founder mutation and a higher frequency of renal tumors. Moreover, MM was present in one family, whereas colon cancer was observed in another family as recently described.38

We also identified another intronic variant in the FLCN gene, namely a 3-bp deletion (c.1177-5_1177-3del) in intron 10 located just before the consensus SA site. The variant was found in two sisters with bilateral RCC. In silico and mini-gene splicing analyses showed that the variant leads to complete skipping of exon 11. The variant has previously been described in six BHD syndrome families with various classical BHD syndrome symptoms (FF, LCs and spontaneous pneumothoraces),32, 36, 39, 40, 41 and has been shown to co-segregate with the disease.40 On the basis of all the data, we classify the variant as pathogenic (class 5).

A novel in-frame deletion, c.1093_1113del, leading to removal of seven amino acids (p.Ala365_Gly371del) from the FLCN protein was found in two apparently unrelated Danish families. In one family, all mutation carriers suffered from FF and two members also had bilateral RCC of the chromophobe subtype. The other family consisted of only one affected member suffering from FF, LCs and spontaneous pneumothoraces. The p.Ala365_Gly371del variant deletes seven amino acids in the second α-helix, which is essential for maintain the proper structure of the C-terminal halves of FLCN.42 The variant has currently not been examined functionally, but several in-frame deletions in FLCN have previously been shown to disrupt the protein stability in vitro.43 With this in mind and due to the clear co-segregation in one of the families, we classify this novel mutation as likely pathogenic (class 4).

Finally, we identified two novel missense mutations (p.Asp33His and p.Ala90Ser) and one synonymous variant (p.Ile426Ile) in the FLCN gene. The p.Asp33His variant was identified in an individual with no BHD syndrome symptoms. She was tested as her daughter suffered from RCC and died at the age of 29. Homozygous loss of FLCN causes embryonic lethality in mice44 and as the p.Asp33His variant has been reported in homozygote state, we classify this variant as likely benign (class 2). The p.Ala90Ser variant was identified in a patient with the likely pathogenic p.Ala365_Gly371del mutation and the p.Ile426Ile synonymous variant was identified in a patient with RCC and no other classical BHD symptoms. Both variants are classified as likely benign (class 2).

In conclusion, we have identified 13 variants and 3 polymorphism in the FLCN gene in Danish patients with suspected BHD syndrome, 6 of which are novel. Of these variants, 6 frameshift mutations, 1 large genomic rearrangement, and 2 intronic mutations are classified as pathogenic, whereas 1 in-frame deletion is classified as likely pathogenic. Two missense variants and one synonymous variant are classified as likely benign, whereas two intronic variants and one synonymous variant are classified as polymorphisms. Moreover, SNP array analysis revealed that the c.1062+2T>G intronic variant is a Danish founder mutation. The findings presented here extend our knowledge of the FLCN mutation spectrum.