PKD2 founder mutation is the most common mutation of polycystic kidney disease in Taiwan

Yu, Chih-Chuan; Lee, An-Fu; Kohl, Stefen; Lin, Ming-Yen; Cheng, Siao Muk; Hung, Chi-Chih; Chang, Jer-Ming; Chiu, Yi-Wen; Hwang, Shang-Jyh; Otto, Edgar A.; Hildebrandt, Friedhelm; Hwang, Daw-Yang

doi:10.1038/s41525-022-00309-w

Download PDF

Article
Open access
Published: 01 July 2022

PKD2 founder mutation is the most common mutation of polycystic kidney disease in Taiwan

Chih-Chuan Yu^1,2^na1,
An-Fu Lee¹^na1,
Stefen Kohl³,
Ming-Yen Lin⁴,
Siao Muk Cheng¹,
Chi-Chih Hung⁴,
Jer-Ming Chang⁴,
Yi-Wen Chiu⁴,
Shang-Jyh Hwang⁴,
Edgar A. Otto⁵,
Friedhelm Hildebrandt⁶,
Taiwan PKD Consortium &
…
Daw-Yang Hwang ORCID: orcid.org/0000-0002-5139-3612^1,4,7

npj Genomic Medicine volume 7, Article number: 40 (2022) Cite this article

5735 Accesses
9 Citations
1 Altmetric
Metrics details

Subjects

Abstract

Autosomal Dominant polycystic kidney disease (ADPKD) is the most common inherited adult kidney disease. Although ADPKD is primarily caused by PKD1 and PKD2, the identification of several novel causative genes in recent years has revealed more complex genetic heterogeneity than previously thought. To study the disease-causing mutations of ADPKD, a total of 920 families were collected and their diagnoses were established via clinical and image studies by Taiwan PKD Consortium investigators. Amplicon-based library preparation with next-generation sequencing, variant calling, and bioinformatic analysis was used to identify disease-causing mutations in the cohort. Microsatellite analysis along with genotyping and haplotype analysis was performed in the PKD2 p.Arg803* family members. The age of mutation was calculated to estimate the time at which the mutation occurred or the founder arrived in Taiwan. Disease-causing mutations were identified in 634 families (68.9%) by detection of 364 PKD1, 239 PKD2, 18 PKHD1, 7 GANAB, and 6 ALG8 pathogenic variants. 162 families (17.6%) had likely causative but non-diagnostic variants of unknown significance (VUS). A single PKD2 p.Arg803* mutation was found in 17.8% (164/920) of the cohort in Taiwan. Microsatellite and array analysis showed that 80% of the PKD2 p.Arg803* families shared the same haplotype in a 250 kb region, indicating those families may originate from a common ancestor 300 years ago. Our findings provide a mutation landscape as well as evidence that a founder effect exists and has contributed to a major percentage of the ADPKD population in Taiwan.

The genetic landscape of polycystic kidney disease in Ireland

Article 16 January 2021

Genetic Characteristics of Korean Patients with Autosomal Dominant Polycystic Kidney Disease by Targeted Exome Sequencing

Article Open access 18 November 2019

Matching clinical and genetic diagnoses in autosomal dominant polycystic kidney disease reveals novel phenocopies and potential candidate genes

Article Open access 13 May 2020

Introduction

Polycystic kidney disease is genetically heterogeneous with dominant and recessive forms^1,2. Autosomal dominant polycystic kidney disease (ADPKD) is the most common inherited adult kidney disease that affects one in 500–2500 individuals worldwide³. ADPKD eventually end up in end stage kidney disease (ESKD), and 5–10% of ESKD worldwide is due to ADPKD⁴. ADPKD is mostly caused by mutations in PKD1 and PKD2^5,6, with the newly discovered GANAB and DNAJB11 contributing a small percentage^7,8. PKD1 truncating mutations are associated with more severe disease, and non-truncation PKD1 and mutations of other genes usually result in a slower disease progression, but with great intrafamilial variations^9,10,11,12. Six PKD1 pseudogenes (PKD1P1–PKD1P6), containing the first 33 exons and located 13 to 16 Mb from genuine PKD1, are highly homologous^5,6,13. This caused a diagnostic challenge for traditional PCR in targeting the genuine PKD1, but the challenge has been overcome by the use of long-range PCR with specific primers^14,15,16. Most identified families have private mutations and fewer than 2% of unrelated ADPKD-affected families carry the same mutation¹⁴.

ADPKD diagnosis is mostly based on imaging studies and family history, and it can be difficult to differentiate from other cystic kidney diseases when imaging results are atypical or in young individuals with negative family history. ADPKD individuals with early disease diagnosis, rapid disease progression, or intrafamilial variation may be due to biallelic mutations, hypomorphic mutation, or mutations in other cystic-related genes¹⁷. Genetic testing is providing a definitive diagnosis for patients and for individuals who are seeking genetic consultation and pre-implantation diagnosis. In Taiwan, the prevalence of ADPKD is unknown and ADPKD contributes only 2.25% of the ESKD population¹⁸. In this study we examined disease-causing genes of PKD1, PKD2, PKHD1, GANAB, DNAJB11, and ALG8 to understand the mutation spectrum of ADPKD patients in the Taiwan PKD Consortium.

Results

Mutation landscape of Taiwan ADPKD

The study comprised 920 clinically diagnosed ADPKD families of 99.7% Chinese descent (containing one Japanese patient, one Vietnamese patient, and one mixed Chinese and German patient). With a mean coverage of more than 500× for the panel, sufficient diagnostic sequencing depth was reached by our method. Disease-causing mutations were identified in 634 families (68.9%) by detection 364 PKD1, 239 PKD2, 18 PKHD1, 7 GANAB, and 6 ALG8 pathogenic variants. 162 families (17.6%) had likely causative but non-diagnostic variants of unknown significance (VUS). No class 3–5 ACMG variants were identified in the remaining 124 families (13.5%) (Fig. 1a). In families with pathogenic variants or VUS, PKD1 mutations contributed 50.4% and PKD2 represented 29.2% of the cohort, respectively. Mutations in PKHD1, GANAB, and ALG8 genes together accounted for 6.8% of the enrolled families, and no disease-causing mutation was found in DNAJB11. In terms of variants type, missense, frameshift indel, and truncating mutations were the most common changes in PKD1-associated families (Fig. 1b), while p.Arg803* represented 61% of PKD2 changes (Fig. 1c). Missense and splicing site variants were the most common changes in PKD2-associated families if p.Arg803* was not included. Allelic heterogeneity in the PKD1 gene was observed with a total of 253 (out of 328) different variants identified in single-family, which represents 54.4% of the PKD1-associated families. Allelic heterogeneity does not exist in the PKD2 gene. Only 44 (out of 59) unique variants, accounted for 16.3% (44/269) of PKD2-associated families. Variants in the PKD1 and PKD2 distributed evenly across genes with no single variant exceeding 1.2% of the cohort if the PKD2 p.Arg803* variant was excluded (Fig. 2).

**Fig. 1: Mutation Profile of Taiwan ADPKD cohort.**

**Fig. 2: Variants distribution of *PKD1* and *PKD2* genes.**

Identification of novel and recurrent mutations

A total of 152 novel pathogenic or likely pathogenic variants (PKD1: 120, PKD2: 12, PKHD1: 12, GANAB: 6, ALG8: 2) were not reported to be associated with PKD in the public disease databases and previous publications (Supplementary Table 1). Eight individuals in different families had two pathogenic variants or one pathogenic plus a VUS in trans or in different genes were identified, and most of them had severe phenotype with ESKD onset age between 31 to 49 (Table 1). Excluding the PKD2 p.Arg803* mutation, 35 different pathogenic variants were found in more than 2 families and accounted for 159 families in our cohort (Table 2). Detailed mutations of each family were listed in Supplementary Table 2.

Table 1 Individuals with variants identified in trans or in different genes.

Full size table

Table 2 Recurrent mutations identified in Taiwan PKD Cohort.

Full size table

Identification of PKD2 p.R803* as unique founder mutation

A single PKD2 p.Arg803* accounted for 17.8% (164/920, 196 patients from 164 families) of the cohort, which had not been reported in previous ADPKD cohorts. Geographically, PKD2 p.Arg803* was found all over Taiwan and represented 10–25% of the ADPKD collected from different areas (Fig. 3a). Extreme PKD2 p.Arg803* percentages (0 and 50%) were most likely due to small sampling sizes in that county. Microsatellite and microarray analysis were performed to understand the kinship of those PKD2 p.Arg803* families. Microsatellite analysis was first performed in 24 families and no makers can differentiate these individuals except D4S1563, showing a peak in the 230 bp position in most samples (Supplementary Fig. S1). D4S1563 was then used in the further analysis of another 87 more PKD2 p.Arg803* families. In total, it was found that 89 families (80.2%, 89/111) carried this D4S1563 unique peak. This result indicated those microsatellite markers, spanning ~600Kb up and down-stream of PKD2 was insufficient to show the existence of kinship. We next performed the Axiom microarray analysis with a higher resolution which enable us to identify if shared haplotype existed in these individuals. To determine whether all PKD2 p.Arg803* descended from a single ancestral mutation event or arose independently, we constructed haplotypes in the PKD2 region of 96 PKD2 p.Arg803* mutation carriers (78 families) and 480 individuals randomly selected from Taiwan Biobank as the representation of the normal population in Taiwan. Haplotype reconstruction, carried out by manually setting known family members and using the statistical software PHASE v2.1, suggested that more than 12 different haplotypes exist in the current sample. The software showed that PKD2 p.Arg803* individuals were predicted with a probability of 79.5% (62/78) to have a common mutation haplotype (G-G-C-C-T-A-A-T-ACAG-C-T-C-del-A-T-T-C-A-T-G-T-A-A-C-A-T-G-G-A) containing the p.Arg803* loci (Table 3). This haplotype had a very low probability in the control group (data not shown). In the control cases, 12.7% were predicted to carry the different haplotype (A-C-A-A-G-G-G-T-del-C-C-T-del-G-C-C-A-G-A-A-T-T-G-C-G-T-A-T-G). A population growth rate of 1.31% was calculated by averaging data from the Taiwan National Development Council from 1960 to 2020. With this 1.31% average growth rate and microarray data, the founder of PKD2 p.Arg803* probably appeared 12.6 generations ago (95% CI: 9.4–16.0, Fig. 3b). Assuming 25 years as an average generation, the PKD2 p.Arg803* mutation was introduced 300 years ago (95% CI: 235–398) in Taiwan. The PKD2 p.Arg803* in our cohort fulfills the criteria of founder mutation since the mutant allele is rare in the general population (minor allele frequency of 0.0001 and 0.00001 in the TOPMed and gnomAD database, respectively). Although this mutation was also identified in other ethnic populations, the mutation percentage is much higher in our cohort compared to previous publications (Supplementary Table 3). Principal component analysis showed these 96 PKD2 p.Arg803* carriers clustered tightly with 480 control individuals from Taiwan Biobank (Fig. 3c). Their genetic structure is close but different from Chinese from Denver and Beijing and different from the HapMap phase 3 reference population.

**Fig. 3: Geographic location, percentage, age of mutation, and principle component analysis of *PKD2* p.Arg803*.**

Table 3 Phasing of 250 kb Haplotype in 96 PKD2 p.Arg803* Individuals.

Full size table

Identification of genetic recombination in PKD2 region

In family DY1466 with 6 PKD2 p.Arg803* affected individuals, members III-1 and III-2 were found to carry different microsatellite markers and haplotypes. They had only 72 kb (10 SNPs) within this 250 kb region which was the same as their father (Fig. 4a). Although II-1 and II-2 carried the common p.Arg803* haplotype along with the D4S1563 marker, gene recombination most likely occurred in individual II-1, with a 72 kb allele crossover leading to both of his children having the PKD2 p.Arg803* mutation but different haplotypes (Fig. 4b).

**Fig. 4: Genetic recombination of the *PKD2* region in family DY1466.**

PKD2 p.Arg803* variant and renal function decline

The trajectory of estimated glomerular filtration rate (eGFR) across 20 years in 57 PKD2 p.Arg803* individuals was compared with 26 PKD2 non-p.Arg803* truncation individuals (Table 4). The eGFR trajectories between individuals were very heterogeneous in both groups. After considering individual eGFR change variance and correlation and adjusting baseline age and sex, the PKD2 non-p.Arg803* truncation individuals presented a more decline in predicted eGFR trajectories (on average by 19 ml/min/1.73 m²) than the PKD2 p.Arg803* (Supplementary Fig. S2 and Table 4). The mixed-effect model also identified the curvilinear relationships between eGFR decline and follow-up year (Supplementary Table 4). In general, the annual eGFR declines by an average of 2.09 ml/min/1.73 m² in the patients with Arg803* variant and more rapidly by 2.67 ml/min/1.73 m² in the non-Arg803* truncation patients. In terms of clinical endpoint, 2 cases (3.4%) in PKD2-Arg803* group and 4 (15.4%) in the non-PKD2-Arg803* group entering ESKD.

Table 4 Longitudinal renal function decline in individuals with PKD2 p.Arg803* and non-p.Arg803* truncation.

Full size table

Exemplary imaging in atypical cases of ADPKD

DY2242 II-1 had ALG8 c.175-2A>G variant and abdominal CT showing bilateral polycystic kidney disease with liver cysts at the age of 44 with preserved kidney function (Fig. 5a). DY1920 I-2 was diagnosed with ALG8 p.Gly275fs mutation, her kidney cysts rarely extended outside the kidney contour with no liver cyst at the age of 65 (Fig. 5b). She and one of her daughters (II-1) both had a pathogenic SPAST p.Gly382Cys variants which are compatible with their clinical features of spastic paraplegia. DY1591 I-2 had GANAB p.Val4_Ala5del variant and renal ultrasonography showed bilateral multiple kidney cysts without liver cysts as well as preserved kidney contour at the age of 38 (Fig. 5c). DY778 I-1 presented with rapid kidney function deterioration with the identification of GANAB p.Arg443* variant and he received continuous ambulatory peritoneal dialysis at the age of 47. An acute episode of hematoma occurred in the right kidney due to right renal artery pseudoaneurysms rupture 1 year after dialysis (Fig. 5d). In family DY95 where no family history can be retrieved, II-4 had abnormal kidney function since teenager and received renal replacement at the age of 33. Compound heterozygous PKD1 variants (p.Cys259Tyr and p.Ala1458fs) were identified and abdominal CT showed liver cysts and bilateral polycystic kidney disease at the age of 33, while only a total of 4 cysts (2 on each kidney) in the father (I-1, age 71) and no kidney cyst (image not shown, age 40) was detected in the sister II-2 (Fig. 5e). The p.Ala1458fs variant most likely occurred de novo in trans of the p.Cys259Tyr variant, leading to a very severe phenotype. DY95 I-1 and II-2 have eGFR ~60 ml/min/1.73 M² and ~100 ml/min/1.73 M² at the time of the CT survey, respectively. In family DY2061 where compound heterozygous PKD1 variants (p.Arg4150Cys and p.Thr3539Ile) in II-1 lead to severe phenotype with renal replacement therapy at the age of 31. Renal ultrasound shows enlarged kidneys more than 15 cm in diameter with bilateral multiple kidney cysts at the age of 29 (Fig. 5f). In family DY2184, a total of three kidney cysts (Fig. 5g, one in the right kidney and 2 in the left kidney at age 25) were identified in II-1 where a likely pathogenic PKHD1 variant (p.Arg375Gln) was detected. The mother (I-2) has only one cyst at the age of 55.

Discussion

Our study provided the ADPKD mutation landscape in Taiwan and extended the mutation spectrum with 151 novel pathogenic or likely pathogenic variants. Our study showed mutations in PKD2 are higher than previous studies, this is most likely due to the unique p.Arg803* founder effect in Taiwan. If PKD2 p.Arg803* was excluded, the PKD2 mutation represented only 11% of the cohort. The Toronto Genetic Epidemiology Study of PKD (TGESP) found that 30.3% was due to PKD2 mutation, but no founder effect was observed¹¹. ADPKD families usually had unique mutations and any single mutation had never been reported to contribute to more than 2% of the PKD populations. No mutation hotspots that would support the existence of a founder effect had been found, except in a smaller ADPKD study in Taiwan and four cases in the Alpujarra region of Granada^19,20. All other ADPKD studies had a total of 32 PKD2 p.Arg803* cases, along with a very low percentage in the population (Supplementary Table 3). Although PKD2 p.Arg803* mutation (CGA > TGA) is a classic CpG dinucleotide mutation and similar CpG mutation hotspots have been identified in many other hereditary diseases^21,22, we provide evidence that this mutation was due to the founder effect with at least 80% of families are related and share the same haplotype in the PKD2 region. Whether families with the same mutations other than PKD2 p.Arg803* (such as ten families with PKD2 c.1094 + 3_1094 + 6delAAGT variant) have distant kinship needs further studies, since Taiwan is a relatively small and secluded island in the history.

Microsatellite markers were used in linkage analysis to demonstrate the prevalence of the genotype and the correlation between phenotypes and genotypes²³. However, meiotic recombination not only increases genome diversity and leads to linkage disequilibrium^24,25. Microsatellite marker D4S1563 is located more than 500 kb from the PKD2 gene, and genetic recombination within this region may cause loss of linkage between PKD2 p.Arg803* and haplotype markers, as in family DY1466. This result indicates that the microsatellite or haplotype analysis may have under-estimated the prevalence of the founder effect in our PKD2 p.Arg803* population.

Taiwan is a geographically isolated island where most of the current population’s ancestry is Chinese but traces of Austronesian, Dutch and Japanese ancestry also exist. With such a high prevalence of PKD2 p.Arg803* mutation, we hypothesized that the mutation is either de novo in Taiwan or that of a founder who immigrated 300 years ago during the end of the Ming Dynasty or the early Qing Dynasty²⁶. The family identified in the Penghu Island (27 miles west of Taiwan with a population of 100,500), who lived there for generations, maybe the decedent of the PKD2 p.Arg803* founder or immigrant who first arrived Penghu from China, and their offspring later populated all over Taiwan. Analysis of PKD2 p.Arg803* in other ADPKD communities may provide more information on the origin and migration track of this unique population. The renal function decline in individuals with PKD2 p.Arg803* was relatively slower than the non-p.Arg803* truncation, where most (23 out of 26) of the non-p.Arg803* truncation had mutation before position Arg803. Larger numbers of PKD2 truncation patients with clinical endpoints are needed to show the renal survival benefit in individuals harboring p.Arg803* compared with non- p.Arg803* truncation, and whether the N-termini mutation of PKD2 lead to a more severe phenotype needs further studies to validate our finding. Furthermore, individuals with PKD2 mutations usually will not enter dialysis until the 7th or 8th decade, which may explain the lower ADPKD percentage in the dialysis population in Taiwan¹⁸.

The clinical phenotype of PKD can be complex and mixed, sometimes making accurate diagnosis difficult without genetic study, especially for patients with early disease onset, no family history, or limited family members^27,28. Genetic studies may partly explain the intrafamilial variation observed in 8 families where index cases with 2 pathogenic variants. Similar to previous reports²⁷, Individuals with 2 pathogenic variants had early kidney cysts development or rapid renal function deterioration compared to other affected family members with one mutation. Besides biallelic and hypomorphic mutations, mosaicism, common and rare variants of kidney disease-related and cystogenic related genes, concomitant clinical and environmental factors may lead to intrafamilial kidney disease severity discordance in ADPKD^17,29.

In our experience, the Pei-Ravine unified ultrasonographic criteria can be used to diagnose patients where ALG8 and GANAB are the disease-causing genes. Although the criteria were based on the PKD1 and PKD2 cohort, some PKHD1 carriers, and patients of advancing age present with multiple kidney cysts which are similar to the ADPKD pattern²⁷. However, the criteria may be insufficient to detect mosaicism or hypomorphic mutation where renal cysts developed very slowly, such as the DY95 I-1 and II-2. In patients with the diagnosis of PKHD1, ALG8, and GANAB variants, kidney cysts tend to be less destructive than PKD1 and PKD2 with relatively preserved kidney contours. Whether kidney contour preservation can be included in the diagnostic criteria for ALG8 and GANAB need further studies.

The limitation of our study includes that many genes that may phenocopy PKD are not included in our panel, including TSC, VHL, HNF1B, ALG9, FLCN, and IFT140^30,31 The library made by multiplex PCR can have amplicons lost. Large insertion, large deletion, translocation, or other complex genetic structural changes cannot be identified by our method. Multiplex ligation-dependent probe amplification should increase our diagnosis in un-identified families with indel mutation. The mosaic mutation may be under-estimated if the patient’s allele frequency was less than 25%. Filtering our data by setting lower allele frequency along with modified PCR methods, such as allele-specific PCR or COLD-PCR, should identify low-grade mosaicism³². Further comprehensive analysis of our cohort by exome sequencing, genome sequencing, long-read sequencing, and optical mapping techniques along with copy number variation analysis should provide a genetic diagnosis for those unsolved families and help clinicians provide better clinical care. The clinical database of this continuing expanding cohort is incomplete, and an official Taiwan PKD Registration is currently under establishment by the Taiwan PKD Consortium and Taiwan Society of Nephrology. Genotype (panel and exome sequencing data), phenotype (renal and extra-renal), and longitudinal biochemistry data will be included in the Taiwan PKD Registration which should better clarify the correlation of genotype-phenotype as well as inter- and intrafamilial variabilities in the PKD. Our findings provide a mutation landscape of ADPKD in Taiwan with a high frequency of PKD2 mutations. A unique PKD2 p.Arg803* founder mutation occurred 300 years ago and contributed to the single most common mutation in the Taiwan ADPKD community.

Methods

Human subjects

Blood samples, pedigree information, and access to results of laboratory work were obtained from individuals or parents/guardians if minors after informed consent was given. Patients were diagnosed with ADPKD according to the Pei-Ravine criteria³³. The radiographic diagnostic criteria were based on ultrasonography with unknown genotypes, including ≥3 cysts in one or both kidneys in age 15 to 39, ≥2 cysts in each kidney in age 40 to 59, and ≥4 cysts in each kidney in age ≥60. A total of 1421 individuals from 920 families (745 male, median age 44, interquartile range, IQR 33–56) were enrolled in this cohort. The study was approved by the institutional review boards of the National Health Research Institutes and Kaohsiung Medical University Hospital. All participants provided written informed consent to take part in the study.

Primer design, long-range PCR, and multiplex PCR

DNA was extracted according to the standard method from peripheral blood obtained from all study participants after informed consent. The panel is composed of polycystic-related genes, including ALG8, DNAJB11, GANAB, PKD1, PKD2, and PKHD1. A total of seven long-range PCRs of the PKD1 gene were designed to avoid amplification of the pseudogene-overlapping region in exon 1 to exon 33 of PKD1. The long-range PCR method and primers were modified from a previous publication¹⁶. Briefly, 100 ng of genomic DNA was used in a 10 μl PCR reaction. A simplified protocol consisting of a three-step touchdown PCR composed of the first step of 95 °C for 3 min, 24 cycles of 95 °C for 30 s, initial 70 °C for 30 s (with a decrease of 0.5 °C per cycle), and 72 °C for 3 min. A second step with 30 cycles of 95 °C for 30 s, 58 °C for 30 s, and 72 °C for 3 min, with a final extension step of 72 °C for 10 min. Q solution was added in the long-range PCR, except PKD1 exon 1 PCR where a 10% DMSO was used. Two microliters of long-range PCR product were mixed followed by a 4000-fold dilution to avoid genomic DNA carry-over. The final product was used as input for target DNA enrichment by the multiplex PCR. The Fluidigm 48.48. Access Array System was used for multiplex PCR as previously described³⁴. Two 48.48 Access Array chips were used, one for PKD1 exon 1 to exon 33 region and one for all other target regions. Primers were pooled to generate 2-plex (PKD1 pseudogene region) or 4-plex primer pools per multiplex PCR. Every sample master mix contained 50 ng DNA, 1X FastStart High Fidelity Reaction Buffer with MgCl2, 5 % DMSO, dNTPs (200 μM each), FastStart High Fidelity Enzyme Blend, and 1X Access Array loading reagent. 48 different DNA or long-range PCR samples were mixed with 48 different multiplex primer pools on one 48.48 Access Array followed by thermal cycling. Subsequently harvested amplicon pools were submitted to another PCR step to tag PCR products with 48 different barcodes and Illumina sequence-specific adaptors. Barcoded PCR products were pooled from 48 individuals and submitted to next-generation resequencing on an Illumina MimiSeq platform with 2 × 150 bp paired-end runs according to the manufacturer’s protocol. Exon 1 of PKD1 was amplified and Sanger sequenced separately if the read depth was insufficient for analysis. All primer sequences were listed in Supplementary Table 5.

Bioinformatics

CLCbio Genomic Workbench (Qiagen, USA) was used for analysis. Identified variants were labeled as pathogenic/likely pathogenic, VUS, or benign according to the guidelines of the American College of Medical Genetics and Genomics (ACMG) and analyzed with Varsome The Human Genomics Community^35,36. Variant pathogenicity was determined by the order of ACMG-Databases-family segregation. Variant not classified as pathogenic/likely pathogenic by ACMG guideline was considered pathogenic or likely pathogenic if the same variant segregated in the family and existed in the ClinVar Database³⁷, the Leiden Open Variation Database³⁸, or the ADPKD Variant Database³⁹. Classified pathogenic/likely pathogenic variants that did not segregate in the family were considered as VUS. Detected variants of pathogenic, likely pathogenic, and unknown significance were confirmed by Sanger sequencing. Segregation analysis was performed if DNA from family members were available.

Microsatellite analysis

Microsatellite analysis was performed in a total of 111 PKD2 p.Arg803* families. Five polymorphic markers located outside the PKD2 region, including D4S1534, D4S1542, D4S1563, D4S1544, and D4S414 were selected as previously described²³. Primer sequences were described in Supplementary Table 5. Microsatellite analysis was performed as a previous publication⁴⁰. Briefly, the 5′ end of each forward primer was tagged with the following universal tag sequence: 5′-GAGAGAAAGGGAAGGGAG-3′. A universal primer, consisting of the same sequence as the added tag, was fluorescently labeled with 6-FAM or TET. PCR products were separated on an automated capillary sequencer (3130XL, Applied Biosystems) and results were analyzed with the Peak Scanner 2 (Applied Biosystems).

Genotyping, haplotype, and principal component analysis

A total of 78 families (96 individuals) harboring PKD2 p.Arg803* were selected for haplotype analysis. A total of 480 health controls were selected from Taiwan Biobank (https://taiwanview.twbiobank.org.tw/data_appl). The Axiom Genome-Wide TWB 2.0 Array which contained 752,921 SNP probes was used for genotyping⁴¹. Data analysis was performed by using Axiom Analysis Suites 5.0.1 (Thermo Fisher Scientific). The CEL files from the microarray were converted to PLINK format via PLINK1.9 (www.cog-genomics.org/plink/1.9/). Haplotype reconstruction was first conducted by manually phasing the region in the families with affected members. Twelve families in the disease group were manually phased first, and their shared haplotypes were set as known phases and analyzed with other cases by PHASE2.1, a program based on a Bayesian statistical method using coalescent-based models that considers the joint distribution of haplotypes and infer loci from unphased genotype data^42,43. Posterior distributions in PHASE were estimated by Gibbs sampling, a Markov-Chain Monte Carlo algorithm⁴⁴, and the haplotype with the highest probability was chosen to represent each individual. For principal component analysis, the population structure of 96 PKD2 p.Arg803* carriers, 480 control individuals from Taiwan Biobank, and eleven population samples from the HapMap 3 project (https://www.sanger.ac.uk/resources/downloads/human/hapmap3.html) were analyzed by PLINK1.9. The SNPs selection for PCA were according to Wei et al.⁴¹ with the following criteria: minor allele frequency >5%, low inter-marker linkage disequilibrium (r² < 0.3), call-rate larger than 99%, and Hardy-Weinberg equilibrium (p > 10⁻⁴).

Age of mutation analysis

DMLE + 2.3 was used to estimate the age of mutation by comparing the linkage disequilibrium between mutation position and linked markers in unrelated health controls and affected cases⁴⁵. Multiple parameters were set as default for Bayesian estimation. The population growth rate was calculated by averaging Taiwan’s population growth rate from 1960 to 2020 in a medium-variant projection (1.31%). The proportion of PKD2 p.Arg803* disease allele frequency was obtained from TOPMed data in the dbSNP Database⁴⁶.

Analysis of PKD2 p.Arg803* variant associated with renal function decline

To explore the influences of the PKD2 p.Arg803* variant on the eGFR decline, we conducted a repeated measures mixed model incorporating random intercept and slope. A total of 57 PKD2 p.Arg803* individuals (28 male, 738 measurements) and 26 PKD2 non-p.Arg803* truncation individuals (20 male, 446 measurements) were included. The longitudinal eGFR in the study was calculated using the CKD-EPI creatinine equation⁴⁷. Each patient’s follow-up year for eGFR measurements was determined as the period from the date of the first eGFR measurement to the date of the subsequent measures. We put the age at baseline, sex, follow-up year, quadratic follow-up year, and the interaction term between time and group effect using the forced entry approach. The model was performed using SAS (version 9.4, SAS Institute, Cary, NC, USA). A p value < 0.05 was considered statistical significance.

Reference sequences and variant nomenclature

The following NCBI Ref sequences were used, ALG8: NM_024079.5, DNAJB11: NM_016306.6, GANAB: NM_198335.4, PKD1: NM_001009944.3, PKD2: NM_000297.4, PKHD1: NM_138694.4. The standard nomenclature recommended by Human Genome Variation Society was used to number nucleotides and name mutations or variants⁴⁸.

Reporting summary

Further information on research design is available in the Nature Research Reporting Summary linked to this article.

Data availability

The Taiwan biobank datasets are available through the TWB (https://taiwanview.twbiobank.org.tw/data_appl). Sequencing data generated or analyzed during this study are included in this published article and its Supplementary Information files. The microarray datasets of PKD2 p. Arg803* have been deposited in the ArrayExpress database at EMBL-EBI (www.ebi.ac.uk/arrayexpress) under accession number E-MTAB-11846.

References

Cornec-Le, G. E. et al. Autosomal dominant polycystic kidney disease. Lancet 393, 919–935 (2019).
Article Google Scholar
Torres, V. E. et al. Autosomal dominant polycystic kidney disease. Lancet 369, 1287–1301 (2007).
Article PubMed Google Scholar
Lanktree, M. B. et al. Prevalence estimates of polycystic kidney and liver disease by population sequencing. J. Am. Soc. Nephrol. 29, 2593–2600 (2018).
Article CAS PubMed PubMed Central Google Scholar
Chebib, F. T. et al. Autosomal dominant polycystic kidney disease: core curriculum 2016. Am. J. Kidney Dis. 67, 792–810 (2016).
Article PubMed Google Scholar
Consortium EPKD. The polycystic kidney disease 1 gene encodes a 14 kb transcript and lies within a duplicated region on chromosome 16. Cell 77, 881–894 (1994).
Article Google Scholar
Hughes, J. et al. The polycystic kidney disease 1 (PKD1) gene encodes a novel protein with multiple cell recognition domains. Nat. Genet. 10, 151–160 (1995).
Article CAS PubMed Google Scholar
Porath, B. et al. Mutations in GANAB, encoding the glucosidase IIα subunit, cause autosomal-dominant polycystic kidney and liver disease. Am. J. Hum. Genet. 98, 1193–1207 (2016).
Article CAS PubMed PubMed Central Google Scholar
Cornec-Le, G. E. et al. Monoallelic mutations to DNAJB11 cause atypical autosomal-dominant polycystic kidney disease. Am. J. Hum. Genet. 102, 832–844 (2018).
Article CAS Google Scholar
Cornec-Le, G. E. et al. Type of PKD1 mutation influences renal outcome in ADPKD. J. Am. Soc. Nephrol. 24, 1006–1013 (2013).
Article CAS Google Scholar
Heyer, C. M. et al. Predicted mutation strength of nontruncating PKD1 mutations aids genotype-phenotype correlations in autosomal dominant polycystic kidney disease. J. Am. Soc. Nephrol. 27, 2872–2884 (2016).
Article CAS PubMed PubMed Central Google Scholar
Hwang, Y. H. et al. Refining genotype-phenotype correlation in autosomal dominant polycystic kidney disease. J. Am. Soc. Nephrol. 27, 1861–1868 (2016).
Article CAS PubMed Google Scholar
Rossetti, S. et al. Genotype–phenotype correlations in autosomal dominant and autosomal recessive polycystic kidney disease. J. Am. Soc. Nephrol. 18, 1374–1380 (2007).
Article CAS PubMed Google Scholar
Bogdanova, N. et al. Homologues to the first gene for autosomal dominant polycystic kidney disease are pseudogenes. Genomics 74, 333–341 (2001).
Article CAS PubMed Google Scholar
Audrézet, M. P. et al. Autosomal dominant polycystic kidney disease: comprehensive mutation analysis of PKD1 and PKD2 in 700 unrelated patients. Hum. Mutat. 33, 1239–1250 (2012).
Article PubMed CAS Google Scholar
Rossetti, S. et al. Identification of gene mutations in autosomal dominant polycystic kidney disease through targeted resequencing. J. Am. Soc. Nephrol. 23, 915–933 (2012).
Article CAS PubMed PubMed Central Google Scholar
Tan, Y. C. et al. A novel long-range PCR sequencing method for genetic analysis of the entire PKD1 gene. J. Mol. Diagn. 14, 305–313 (2012).
Article CAS PubMed PubMed Central Google Scholar
Ong, A. C. et al. A polycystin-centric view of cyst formation and disease: the polycystins revisited. Kidney Int. 88, 699–710 (2015).
Article CAS PubMed PubMed Central Google Scholar
Lee, P. W. et al. Epidemiology and mortality in dialysis patients with and without polycystic kidney disease: a national study in Taiwan. J. Nephrol. 26, 755 (2013).
Article PubMed Google Scholar
Chang, M. Y. et al. Novel PKD1 and PKD2 mutations in Taiwanese patients with autosomal dominant polycystic kidney disease. J. Hum. Genet. 58, 720–727 (2013).
Article CAS PubMed Google Scholar
García-Rabaneda, C. et al. Nueva mutación asociada a poliquistosis renal autosómica dominante con efecto fundador localizada en la Alpujarra de Granada. Nefrología 40, 536–542 (2020).
Article PubMed Google Scholar
Cooper, D. N. et al. On the sequence-directed nature of human gene mutation: the role of genomic architecture and the local DNA sequence environment in mediating gene mutations underlying human inherited disease. Hum. Mutat. 32, 1075–1099 (2011).
Article CAS PubMed PubMed Central Google Scholar
Youssoufian, H. et al. Recurrent mutations in haemophilia A give evidence for CpG mutation hotspots. Nature 324, 380–382 (1986).
Article CAS PubMed Google Scholar
Mizoguchi, M. et al. Genotypes of autosomal dominant polycystic kidney disease in Japanese. J. Hum. Genet. 47, 51–54 (2002).
Article PubMed Google Scholar
Baudat, F. et al. Meiotic recombination in mammals: localization and regulation. Nat. Rev. Genet. 14, 794–806 (2013).
Article CAS PubMed Google Scholar
Thompson, E. A. Identity by descent: variation in meiosis, across genomes, and in populations. Genetics 194, 301–326 (2013).
Article CAS PubMed PubMed Central Google Scholar
Knapp, R. G. Chinese frontier settlement in Taiwan. Ann. Assoc. Am. Geogr. 66, 43–59 (1976).
Article Google Scholar
Bergmann, C. ARPKD and early manifestations of ADPKD: the original polycystic kidney disease and phenocopies. Pediatr. Nephrol. 230, 15–30 (2015).
Article Google Scholar
Lanktree, M. B. et al. Evolving role of genetic testing for the clinical management of autosomal dominant polycystic kidney disease. Nephrol. Dial. Transplant. 34, 1453–1460 (2019).
Article CAS PubMed Google Scholar
Lanktree, M. B. et al. Intrafamilial Variability of ADPKD. Kidney Int. Rep. 4, 995–1003 (2019).
Article PubMed PubMed Central Google Scholar
Schönauer, R. et al. Matching clinical and genetic diagnoses in autosomal dominant polycystic kidney disease reveals novel phenocopies and potential candidate genes. Genet. Med. 22, 1374–1383 (2022).
Article CAS Google Scholar
Senum, S. R. et al. Monoallelic IFT140 pathogenic variants are an important cause of the autosomal dominant polycystic kidney-spectrum phenotype. Am. J. Hum. Genet. 109, 136–156 (2022).
Article CAS PubMed Google Scholar
Gajecka, M. Unrevealed mosaicism in the next-generation sequencing era. Mol. Genet. Genomics. 291, 513–530 (2016).
Article CAS PubMed Google Scholar
Pei, Y. et al. Unified criteria for ultrasound diagnosis of autosomal dominant polycystic kidney disease. J. Am. Soc. Nephrol. 20, 205–212 (2009).
Article CAS PubMed PubMed Central Google Scholar
Hwang, D. Y. et al. Mutations in 12 known dominant disease-causing genes clarify many congenital anomalies of the kidney and urinary tract. Kidney Int. 85, 1429–1433 (2014).
Article CAS PubMed PubMed Central Google Scholar
Richards, S. et al. Standards and guidelines for the interpretation of sequence variants: a joint consensus recommendation of the American College of Medical Genetics and Genomics and the Association for Molecular Pathology. Genet. Med. 17, 405–424 (2015).
Article PubMed PubMed Central Google Scholar
Kopanos, C. et al. VarSome. the human genomic variant search engine. Bioinformatics 35, 1978–1980 (2019).
Article CAS PubMed Google Scholar
The ClinVar Database. https://www.ncbi.nlm.nih.gov/clinvar. (Accessed 1 Aug 2021).
The Leiden Open Variation Database. https://www.lovd.nl. (Accessed 1 Aug 2021).
The ADPKD Variant Database. https://pkdb.mayo.edu/welcome. (Accessed 1 Aug 2021).
Otto, E. A. et al. Mutation analysis in nephronophthisis using a combined approach of homozygosity mapping, CEL I endonuclease cleavage, and direct sequencing. Hum. Mutat. 29, 418–426 (2008).
Article CAS PubMed Google Scholar
Wei, C. Y. et al. Genetic profiles of 103,106 individuals in the Taiwan Biobank provide insights into the health and history of Han Chinese. npj. Genom. Med. 6, 10 (2021).
Article CAS PubMed PubMed Central Google Scholar
Stephens, M. et al. Accounting for decay of linkage disequilibrium in haplotype inference and missing-data imputation. Am. J. Hum. Genet. 76, 449–462 (2005).
Article CAS PubMed PubMed Central Google Scholar
Stephens, M. et al. A new statistical method for haplotype reconstruction from population data. Am. J. Hum. Genet. 68, 978–989 (2001).
Article CAS PubMed PubMed Central Google Scholar
Gilks, W. R. Markov Chain Monte Carlo. In: Encyclopedia of biostatistics. (John Wiley & Sons, Ltd. Hoboken 2005).
Reeve, J. P. et al. DMLE+: Bayesian linkage disequilibrium gene mapping. Bioinformatics 18, 894–895 (2002).
Article CAS PubMed Google Scholar
The dbSNP Database. https://www.ncbi.nlm.nih.gov/snp. (Accessed 1 Aug 2021).
Levey, A. S. et al. A new equation to estimate glomerular filtration rate. Ann. Intern. Med. 150, 604–612 (2009).
Article PubMed PubMed Central Google Scholar
den Dunnen, J. T. et al. HGVS recommendations for the description of sequence variants: 2016 update. Hum. Mutat. 37, 564–569 (2016).
Article CAS Google Scholar
The MutationMapper. https://www.cbioportal.org/mutation_mapper. (Accessed 1 Aug 2021).

Download references

Acknowledgements

We thank the members of the investigated families for participating in this study and the Taiwan PKD Consortium physicians in charge of these patients for providing samples. The authors acknowledge the technical services provided by the core facilities of the National Yang-Ming University Genome Research Center and the National Center for Genome Medicine, Academia Sinica. This study was supported in part by the National Health Research Institutes, Taiwan (CA-110-PP-08), and the Ministry of Science and Technology, Taiwan (MOST 104-2314-B-037-066, 107-2314-B-400-041, 108-2314-B-400-041-MY3). F.H. was supported by a grant from the National Institutes of Health (DK068306).

Author information

These authors contributed equally: Chih-Chuan Yu, An-Fu Lee.
A list of authors and their affiliations appears in the Supplementary Information.

Authors and Affiliations

National Institute of Cancer Research, National Health Research Institutes, Tainan, Taiwan
Chih-Chuan Yu, An-Fu Lee, Siao Muk Cheng & Daw-Yang Hwang
Department of Laboratory Medicine, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
Chih-Chuan Yu
Department of Pediatrics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Cologne, Germany
Stefen Kohl
Division of Nephrology, Department of Medicine, Kaohsiung Medical University Hospital, Kaohsiung Medical University, Kaohsiung, Taiwan
Ming-Yen Lin, Chi-Chih Hung, Jer-Ming Chang, Yi-Wen Chiu, Shang-Jyh Hwang & Daw-Yang Hwang
Michigan Medicine, Ann Arbor, MI, USA
Edgar A. Otto
Department of Pediatrics, Boston Children’s Hospital, Harvard Medical School, Boston, MA, USA
Friedhelm Hildebrandt
Center for Biomarkers and Biotech Drugs, Department of Biomedical Science and Environmental Biology, Kaohsiung Medical University, Kaohsiung, Taiwan
Daw-Yang Hwang

Authors

Chih-Chuan Yu
View author publications
You can also search for this author in PubMed Google Scholar
An-Fu Lee
View author publications
You can also search for this author in PubMed Google Scholar
Stefen Kohl
View author publications
You can also search for this author in PubMed Google Scholar
Ming-Yen Lin
View author publications
You can also search for this author in PubMed Google Scholar
Siao Muk Cheng
View author publications
You can also search for this author in PubMed Google Scholar
Chi-Chih Hung
View author publications
You can also search for this author in PubMed Google Scholar
Jer-Ming Chang
View author publications
You can also search for this author in PubMed Google Scholar
Yi-Wen Chiu
View author publications
You can also search for this author in PubMed Google Scholar
Shang-Jyh Hwang
View author publications
You can also search for this author in PubMed Google Scholar
Edgar A. Otto
View author publications
You can also search for this author in PubMed Google Scholar
Friedhelm Hildebrandt
View author publications
You can also search for this author in PubMed Google Scholar
Daw-Yang Hwang
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

Taiwan PKD Consortium

Contributions

Conceptualization: D.-Y.H., E.A.O, F.H.; methodology: C.-C.Y., A.-F.L., S.K., D.-Y.H.; clinical data collection: C.-C.H., J.-M.C., Y.-W.C., S.-J.H., D.-Y.H.; analysis and investigation: C.-C.Y., A.-F.L., S.M.C., D.-Y.H.; writing-original draft preparation: D.-Y.H.; statistical analysis: M.-Y Lin; writing-review and editing: all authors. All authors read and approved the final paper.

Corresponding author

Correspondence to Daw-Yang Hwang.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Material

Reporting Summary

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Yu, CC., Lee, AF., Kohl, S. et al. PKD2 founder mutation is the most common mutation of polycystic kidney disease in Taiwan. npj Genom. Med. 7, 40 (2022). https://doi.org/10.1038/s41525-022-00309-w

Download citation

Received: 10 November 2021
Accepted: 09 June 2022
Published: 01 July 2022
DOI: https://doi.org/10.1038/s41525-022-00309-w

This article is cited by

Exploring the clinical and genetical spectrum of ADPKD in Chile to assess ProPKD score as a risk prediction tool
- Esperanza Bayyad
- Anita Plaza
- Paola Krall
Translational Medicine Communications (2023)

Subjects

Abstract

Similar content being viewed by others

The genetic landscape of polycystic kidney disease in Ireland

Genetic Characteristics of Korean Patients with Autosomal Dominant Polycystic Kidney Disease by Targeted Exome Sequencing

Matching clinical and genetic diagnoses in autosomal dominant polycystic kidney disease reveals novel phenocopies and potential candidate genes

Introduction

Results

Mutation landscape of Taiwan ADPKD

Identification of novel and recurrent mutations

Identification of PKD2 p.R803* as unique founder mutation

Identification of genetic recombination in PKD2 region

PKD2 p.Arg803* variant and renal function decline

Exemplary imaging in atypical cases of ADPKD

Discussion

Methods

Human subjects

Primer design, long-range PCR, and multiplex PCR

Bioinformatics

Microsatellite analysis

Genotyping, haplotype, and principal component analysis

Age of mutation analysis

Analysis of PKD2 p.Arg803* variant associated with renal function decline

Reference sequences and variant nomenclature

Reporting summary

Data availability

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

Taiwan PKD Consortium

Contributions

Corresponding author

Ethics declarations

Competing interests

Additional information

Supplementary information

Supplementary Material

Reporting Summary

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Exploring the clinical and genetical spectrum of ADPKD in Chile to assess ProPKD score as a risk prediction tool

Search

Quick links