Evaluation of reported pathogenic variants and their frequencies in a Japanese population based on a whole-genome reference panel of 2049 individuals

Yamaguchi-Kabata, Yumi; Yasuda, Jun; Tanabe, Osamu; Suzuki, Yoichi; Kawame, Hiroshi; Fuse, Nobuo; Nagasaki, Masao; Kawai, Yosuke; Kojima, Kaname; Katsuoka, Fumiki; Saito, Sakae; Danjoh, Inaho; Motoike, Ikuko N.; Yamashita, Riu; Koshiba, Seizo; Saigusa, Daisuke; Tamiya, Gen; Kure, Shigeo; Yaegashi, Nobuo; Kawaguchi, Yoshio; Nagami, Fuji; Kuriyama, Shinichi; Sugawara, Junichi; Minegishi, Naoko; Hozawa, Atsushi; Ogishima, Soichi; Kiyomoto, Hideyasu; Takai-Igarashi, Takako; Kinoshita, Kengo; Yamamoto, Masayuki

doi:10.1038/s10038-017-0347-1

Article
Published: 01 December 2017

Evaluation of reported pathogenic variants and their frequencies in a Japanese population based on a whole-genome reference panel of 2049 individuals

Yumi Yamaguchi-Kabata^1,2,
Jun Yasuda^1,2,
Osamu Tanabe^1,2,
Yoichi Suzuki^1,2^nAff8,
Hiroshi Kawame^1,2,
Nobuo Fuse^1,2,
Masao Nagasaki^1,2,3,
Yosuke Kawai^1,2,
Kaname Kojima^1,2,
Fumiki Katsuoka^1,2,
Sakae Saito^1,2,
Inaho Danjoh^1,2,
Ikuko N. Motoike^1,2,3,
Riu Yamashita^1,2,3,
Seizo Koshiba^1,2,
Daisuke Saigusa^1,2,
Gen Tamiya^1,2,4,
Shigeo Kure^1,2,
Nobuo Yaegashi^1,2,
Yoshio Kawaguchi¹,
Fuji Nagami¹,
Shinichi Kuriyama^1,2,5,
Junichi Sugawara^1,2,
Naoko Minegishi^1,2,
Atsushi Hozawa^1,2,
Soichi Ogishima^1,2,
Hideyasu Kiyomoto^1,2,6,
Takako Takai-Igarashi^1,2,
ToMMo Study Group,
Kengo Kinoshita^1,3,7 &
…
Masayuki Yamamoto^1,2

Journal of Human Genetics volume 63, pages 213–230 (2018)Cite this article

1580 Accesses
31 Citations
Metrics details

Abstract

Clarifying allele frequencies of disease-related genetic variants in a population is important in genomic medicine; however, such data is not yet available for the Japanese population. To estimate frequencies of actionable pathogenic variants in the Japanese population, we examined the reported pathological variants in genes recommended by the American College of Medical Genetics and Genomics (ACMG) in our reference panel of genomic variations, 2KJPN, which was created by whole-genome sequencing of 2049 individuals of the resident cohort of the Tohoku Medical Megabank Project. We searched for pathogenic variants in 2KJPN for 57 autosomal ACMG-recommended genes responsible for 26 diseases and then examined their frequencies. By referring to public databases of pathogenic variations, we identified 143 reported pathogenic variants in 2KJPN for the 57 ACMG recommended genes based on a classification system. At the individual level, 21% of the individuals were found to have at least one reported pathogenic allele. We then conducted a literature survey to review the variants and to check for evidence of pathogenicity. Our results suggest that a substantial number of people have reported pathogenic alleles for the ACMG genes, and reviewing variants is indispensable for constructing the information infrastructure of genomic medicine for the Japanese population.

You have full access to this article via your institution.

Download PDF

Actionable secondary findings in the 73 ACMG-recommended genes in 1559 Thai exomes

Article 08 October 2021

Wanna Chetruengchai & Vorasuk Shotelersuk

Reduced penetrance of pathogenic ACMG variants in a deeply phenotyped cohort study and evaluation of ClinVar classification over time

Article Open access 15 July 2020

Jeroen van Rooij, Pascal Arp, … Annemieke J. M. H. Verkerk

A database of 5305 healthy Korean individuals reveals genetic and clinical implications for an East Asian population

Article Open access 02 November 2022

Jeongeun Lee, Jean Lee, … Murim Choi

Introduction

One of the main goals of medical genomics is the development of personalized medicine and personalized healthcare based on individual genomes. Large-scale sequencing of individual genomes from cohort participants [1,2,3] provides us with a catalog of numerous genomic variants whose frequencies range from rare to common, as well as a set of phased variants in individual haplotypes. Such genomic variation data of local populations are valuable resources. Thus, this data allows for genome-wide association studies aimed at finding disease-related variants by using phased variants for genotype imputation [4]. In addition to binary phenotypic traits, studying association with quantitative traits, such as metabolomics in plasma, is also valuable to reveal genetic susceptibility to proximal phenotypes at the molecular level [5]. Another important use of this data is to allow for detection of rare and pathogenic variants, and to estimate their population frequencies.

Large-scale genome sequencing of volunteers from general residents provides valuable data for the field of medical genomics; however, whole genome sequencing (WGS) or whole exome sequencing (WES) [6, 7] may uncover other clinically relevant variants in addition to specific findings intended by a particular project. Therefore, it raises an important problem: how can researchers use and manage secondary findings, which may be deliberately sought, or incidental findings (accidental discoveries) from WGS or WES studies? In this situation, the American Colleges of Medical Genetics and Genomics (ACMG) recommends that clinical sequencing laboratories return pathogenic variants of 24 conditions (56 genes) in 2013, and recently updated this list to include 26 diseases (59 genes) [8, 9] as a minimum set for returning secondary findings, which were selected from the viewpoint of medical actionability. Although medical actionability depends on the clinical systems in the society, it is important to estimate population frequencies of the genetic variants in the ACMG gene list to improve social welfare.

Since the ACMG released their recommendations, several groups have tried to estimate frequencies of actionable variants in 56 genes for diverse samples and by using different methods [10,11,12,13,14,15]. Using WGS or WES data, some studies [15, 16] tried to estimate the frequencies of pathogenic variants in the recommended genes for European and African ancestries, and target populations of the 1000 Genomes Project. Although East Asian populations were analyzed in the 1000 Genomes Project, the number of individuals for each population was not significant enough to detect rare pathogenic variants and frequency estimations. Therefore, the frequencies of low-frequency pathogenic variants in the Japanese population have not been characterized well.

Tohoku University Tohoku Medical Megabank Organization (ToMMo) initiated genome cohort studies [17] along with Iwate Medical University to promote research in medical genomics aiming to realize personalized healthcare. As the first step towards our goal, deep WGS of more than 2000 cohort participants was performed, and the reference panel for the Japanese population, 2KJPN, was constructed [18, 19]. In this study, we report the first examination of population frequencies of the responsible genomic variants in the actionable ACMG genes by using 2KJPN and public annotations in the Human Gene Mutation Database (HGMD) [20] and ClinVar [21]. We found that 21% of the individuals had at least one reported pathogenic variant for the 57 autosomal ACMG genes, suggesting that not a small proportion of individuals may have some risk allele for the actionable genes. In addition, we performed manual inspections of some variants through extensive literature surveys, and found that there were many discrepancies between the two public annotations. Some reported disease mutations can be benign variants, and a few variants were lacking enough evidence for the Japanese population. These results indicate that we need to construct an information infrastructure of pathogenic variants for the Japanese population through appropriate variant review and interpretations, ultimately allowing for personalized healthcare for the Japanese population.

Materials and Methods

Subjects and data for single nucleotide variation

We used the 2KJPN whole genome reference panel (2049 individuals) of the Tohoku Medical Megabank Project, which was created using the approach same as that used for the 1KJPN panel (1070 individuals) [18]. Briefly, the subjects were selected from the participants of the resident cohort study [17], and then the genomic DNA of the 2049 individuals obtained from peripheral blood samples was subjected to paired-end sequencing using the Illumina HiSeq 2500 platform (see details in Nagasaki et al. [18]).

This project was performed as a part of prospective cohort studies at ToMMo with the approval of the Ethical Committee of the Tohoku University School of Medicine and ToMMo. The samples used here were obtained from the cohort participants, all of whom gave their written consent. Under the terms of the informed consent provided by the participants in our cohort project, whole genome data including sequenced data, variant calls, and inferred genotypes are securely controlled under the Materials and Information Distribution Review Committee of Tohoku Medical Megabank Project, and the sharing of data with other researchers was discussed in each research proposal by the review committee.

There are two sets of variants in 2KJPN—a high confidence variant set and a high sensitivity variant set. The former set was created with high precision and the latter set was created by maximizing sensitivity. The allele frequency of the high confidence variant set is publicly available through a portal site, the integrative Japanese Genome Variation Database (iJGVD; http://ijgvd.megabank.tohoku.ac.jp/) [19]. In this study, we used both the variant sets (high confidence single nucleotide variations (SNVs) and high sensitive SNVs) for analyses, and the results from the high confidence set were primarily used for subsequent manual inspection. The results of the high sensitive set were also used when additional existing SNVs were suspected. We also used variant frequency data for di-allelic SNVs for 4300 European Americans (EAs) and 2203 African Americans (AAs) from the Exome Sequencing Project (ESP) [22] to compare the allele frequency of each SNV with the corresponding SNV in 2KJPN.

Variant annotation

SnpEff software (ver. 3.3c), which is based on the gene annotation model of GENCODE version 17, was used to predict the effects of a variant on its gene product. SNVs were classified into functional categories, such as synonymous, missense, nonsense, intron, 5′-untranslated region (UTR), and 3′-UTR. As a measure to predict pathogenicity, the combined annotation-dependent depletion (CADD) scores [23] were added to each SNV by intersecting the list of the precomputed scores with all possible SNVs. In addition, for low-frequency missense SNVs, the Mendelian Clinically Applicable Pathogenicity (M-CAP) scores [24] were annotated similarly. To identify which SNV is a reported pathogenic variant, we used the Human Gene Mutation Database (HGMD) Professional (2016.2) [20] and ClinVar (2016 September) [21] (Fig. 1). With the ClinVar database, we used entries that included annotations as “pathogenic” or “likely pathogenic.” Overlaps between the SNVs of 2KJPN and the reported pathological variants in HGMD and ClinVar were extracted. Possible pathological SNVs were identified based on the genomic coordinates and the consistency of the allele bases. We identified 6862 pathological SNVs that overlapped with HGMD or ClinVar variants that were annotated as “pathogenic” or “likely pathogenic.” Then we selected the variants for 57 autosomal genes (except for two X-linked genes: GLA and OTC) recommended by ACMG for the return of genomic results [8] with modifications in 2016 [9].

Filtering variants

To search for disease-causing variants for the 57 autosomal ACMG recommended genes, 2KJPN SNVs that matched the HGMD or ClinVar variants were extracted. Next, we defined the following three categories for these potentially pathogenic variants (Table 1): i) reported pathogenic (RP) variants that were already annotated as “disease-causing mutation (DM)” in HGMD or “pathogenic” in ClinVar; ii) candidates of pathogenic variants (canP) that were already annotated as “DM? (likely disease-causing mutation)” in HGMD or “likely pathogenic” in ClinVar; and iii) disease-associated variants and other types (assoV, etc.). RP and canP variants were filtered by minor allele frequency (MAF) < 0.5% in 2KJPN, and the pathologically annotated SNVs that existed at the higher frequency ( ≥ 0.5%) were classified in the third group (assoV, etc.).

Table 1 Automatic classification of pathogenically annotated variants

Full size table

Results

A total of 46,822 SNVs, including 1317 with protein-altering mutations and 386 that were identified in HGMD or ClinVar as “pathogenic” or “likely pathogenic”, in the 57 ACMG genes in 2KJPN were selected based on the genomic coordinates of the genes (see Table 2 for statistics, and a whole list of the variants is shown in Supplementary Table 1). After automatically classifying variants as either RP or canP, 143 SNVs with RP variants were detected with the MAF threshold of < 0.5% (156 RP variants when MAF < 1%) (Table 2). RP or canP variants were found in 47 genes, but not in PSM2, VHL, PTEN, SDHAF2, SDHC, TGFBR1, SMAD3, TNNI3, TPM1, MYL3, ACTC1, PRKAG2, MYL2, DSC2, and AMAD4. Using the allele frequencies of the RP variants, population frequencies of potential risk alleles were estimated (Table 2). Genes that showed relatively higher population frequencies were RYR2, MSH2, MYBPC3, ATP7B, APC, and BRCA2.

Table 2 Filtering candidate variants and total frequency of RP variants in 2KJPN for 57 autosomal ACMG genes

Full size table

At the individual level, 431 of the 2049 individuals had at least one RP variant for the 26 diseases (Fig. 2). This was based on the automatic classification, which may be overestimated in a proportion of individuals having a real risk allele. We then focused on several diseases, and the reported pathogenic variants were manually inspected through a literature survey. We manually reviewed the detected pathogenic variants that have been previously reported by focusing on the distinct phenotypic effects of variants in a single gene (if any), the allele frequencies, and incidence rates. In addition to the pathogenic variants that were previously reported, we also searched for candidates of expected pathogenic variants based on gene-based annotations and predicted scores of pathogenicity. With thresholds of CADD score >20 or M-CAP [24] score >0.025, 815 SNVs (including 709 missense SNVs) were detected as variants which satisfy a recommended threshold of pathogenicity but lack any pathological annotations in HGMD and ClinVar (Supplementary Table 2). For example, we found three nonsense variants and 37 missense variants in apolipoprotein B gene (APOB) that were not pathogenically annotated in HGMD and ClinVar. The three nonsense variants of APOB were p.Tyr1578*, p.Ser2128*, and p.Lys2376*, and were all found as singletons. Twenty-three of the 37 missense variants of APOB were also found as singletons.

Hereditary breast and ovarian cancer (HBOC)

BRCA1 and BRCA2 are the major susceptibility genes for HBOC. HBOC has been thought to be less prevalent in the East Asian countries, including Japan [25]. However, reports of germline genetic variations among patients with breast and ovarian cancers indicated that the population frequency of susceptible genetic variants of HBOC may be higher than that previously thought in the Japanese population [26, 27].

In 2KJPN, we identified three nonsense variants (one in BRCA1 and two in BRCA2) that were reported as responsible variants (Table 3). A nonsense variant, BRCA1 p.Leu63*, was found in a heterozygous individual in 2KJPN [28]. Interestingly, the BRCA1 p.Arg1699Gln variant is known to be one of the genetic risk factors for intermediate breast and ovarian cancers [29], and this variant was found in three heterozygous individuals in 2KJPN (MAF = 0.07%). A missense variant, BRCA1 p.Val271Met (rs80357244), was classified as DM in HGMD and was found in 27 heterozygous individuals (MAF = 0.7%) in 2KJPN. However, this variant was categorized as variant of uncertain significance (VUS) in ClinVar, and was also classified as “polymorphic” by FALCO biosynthesis [26], a private genetic testing company. Therefore, this allele may not have a strong effect on HBOC susceptibility. This allele was found only in East Asians (in the ExAC database) and not in EAs and AAs.

Table 3 Reported pathogenic variants for selected genes

Full size table

Additionally, two nonsense variants in BRCA2 were reported as pathogenic and were found in 2KJPN (Table 3). BRCA2 p.Arg2318* is also one of the known mutations of HBOC in Japan [30], and a heterozygous individual was identified in 2KJPN. The other nonsense variant (p.Arg3384*) in BRCA2 was identified in two heterozygous individuals. In a previous study [31], this variant (p.Arg3384*) did not result in any cancer predisposition and was classified as a possibly benign variation, probably because this nonsense variant is located at the C-terminus end of the gene product. On the contrary, a missense variant of BRCA2, p.Ile2675Val, was detected in a heterozygous individual in 2KJPN and is considered to be a pathogenic variant affecting splicing [32]. Additionally, BRCA2 p.Gly2044Val, also registered as DM in HGMD, was found in 59 heterozygous individuals in 2KJPN (MAF = 1.4%), and has been classified as “polymorphic” by FALCO biosynthesis [26].

These results suggest that the population frequencies of susceptible variants of HBOC might be much higher in the Japanese population than previously thought, even though we have not included insertions and deletions in our analysis. Because multiple structural variants of BRCA1 and BRCA2 associated with HBOC are reported [33], checking for the presence or absence of these variations is essential. Several missense variants with high scores of pathogenicity, such as BRCA2 p.Gly2508Ser (CADD phred = 35 and M-CAP score [24] = 0.204), would require further studies to determine its association with HBOC, although most of the reported variants with strong effects in the two genes are nonsense or frame-shifting variants.

Lynch syndrome genes

Lynch syndrome (LS) is known to have a familial predisposition of colon cancer accompanied with stomach and endometrial cancers; 3% of newly diagnosed colorectal cancers develop due to LS [34]. Most of the genes responsible for LS are related to the DNA mismatch repair function and are inherited in an autosomal dominant manner [34]. Molecular diagnoses of LS would contribute to the early diagnosis of cancers in multiple organs, and are critical for the treatment of cancers in patients with LS. MSH2 and MLH1 are the major susceptible genes for LS; around 70% of the reported mutations for LS were found in these genes [35]. Most of the reported susceptible variants of LS are nonsense mutations, out-of-frame indels, or splicing error-causing variants. One important feature of the mutations found in LS-susceptible variants is that most of them are unique for each family [35].

There are six RP variants in MSH2 in 2KJPN (Table 3). An RP variant, p. Leu811*, has been reported in a few Japanese families with LS [36,37,38]. The other five variants labeled as RP, such as rs138068023 (c.-68-157G > C) and p.Pro5Gln, are not listed in either ClinVar or InSiGHT databases (see review by Chao et al. 2008 [39]); many of them are not rare in 2KJPN so that most of them would not be pathogenic.

In MLH1, there are six RP variants in 2KJPN (Table 3). An intronic variant, c.1668-1 G > A, found in a heterozygous individual in 2KJPN, may cause exon skipping and is considered to be pathogenic [40]. Another missense variant, p.Arg687Trp, found in a heterozygous individual, was previously reported in two Japanese families with LS [41]. Other RP variants are considered to be not pathogenic or of unknown significance in other databases. For example, a missense variant, p.Arg385Cys, is annotated as “likely pathogenic” in the ClinVar database and a report shows co-segregation with cancer phenotype [42]. However, the same research group later stated that the variant was a “missense variant of unreported pathogenicity” [43]. This variant was found in five individuals in 2KJPN, and further studies are needed to clarify the pathogenic roles of this variant. Similarly, p. Leu582Val is classified as “reported pathogenic” but Takahashi et al. (2007) reported that this variant, which was found in six individuals in 2KJPN, had no functional significance [44]. Another missense variant, p.Gln701Lys, was classified as an RP variant, which is a rare variant, but is considered “likely benign” by InSiGHT database [45].

Retinoblastoma

Retinoblastoma (Rb) is the most frequent intraocular malignant tumor in children, with an incidence rate ranging from 1/15,000 to 1/18,000 live births [46]. Rb is caused by bi-allelic inactivation of RB1 located on chromosome 13q14 that encodes RB protein. In the dominantly inherited form, one mutation is inherited through the germline and the secondary mutation occurs in the somatic cells [47]. The Rb protein acts as a tumor suppressor, which regulates cell growth and stops cells from undergoing uncontrolled proliferation.

In 2KJPN, we found two missense variants of RB1, p.Arg621Ser (rs367578442, CADD phred score = 12.5) and p.Leu819Val (CADD phred score = 15.26) as canP variants, which are registered in HGMD in the “DM?” category (Table 3), and were originally reported by a previous study on Chinese patients with Rb [48]. The p.Arg621Ser variant was found in two heterozygous individuals, and p.Leu819Val was found in a heterozygous individual.

Furthermore, p.Arg621Ser is located between two RB1 pocket-domains, which feature an additional protein-binding site [49], and p.Leu819Val is located in the RB1 C-terminus region, which is involved in association with the dimer surface resulting from an association of the E2 factors (E2Fs) [50]. The p.Arg621Ser variant was also found in an AA subject in an ancestrally diverse cohort of 681 healthy individuals [51]. A previous study involving Japanese patients with Rb found that a majority of the somatic mutations were found in the adenovirus early region 1A (ElA) binding sites [52]; however, the p.Arg621Ser and p.Leu819Val variants have not been reported in any domestic report in patients with Rb. Further examination of genetic variants in the germline may be needed to assess the frequency of the risk variants in RB1 in the Japanese population. In addition to inactivation of Rb1 itself, deregulation of Rb1-related biological pathways has critical roles in most types of human cancer. A more precise annotation and identification of RB1 mutations could play a pivotal role in enhancing the clinical management of the risks for Rb.

Multiple endocrine neoplasia type 2 and familial medullary thyroid cancer (FMTC)

RET is an important gene related to several clinically distinct diseases. Gain-of-function mutations in RET cause multiple endocrine neoplasia (MEN) type 2 and FMTC [53]. On the contrary, loss-of-function mutations in RET are known to be risk factors for Hirschsprung disease (HSCR), which is caused by the congenital absence of parasympathetic ganglion cells in the intestinal tissues [54]. Furthermore, several mutations of this gene have also been reported in patients with congenital central hypoventilation syndrome (CCHS) [55]. In 2KJPN, known missense variants, p.Val292Met, for MEN type 2 (MAF = 0.0015), and p.Gly321Arg, for FMTC (MAF = 0.00049), were found as variants causing genetically dominant effects (Table 3). Our results suggest that screening the sequence of RET may be beneficial for early recognition of patients with MEN type 2 and FMTC in the Japanese population.

In addition, we found one missense variant, p.Arg114His, classified as DM in HGMD for CCHS as variants with loss-of-function effects. The allele frequency of this variant (MAF = 0.0056) in 2KJPN was higher than that of EAs (p < 0.00001) (see Supplementary Table 3 for inter-ethnic comparisons), and this variant was originally reported in a domestic study [56]. In ClinVar, this variant was annotated with “conflicting interpretations of pathogenicity.” Further studies are needed to clarify the pathogenic role of this variant. We also found another missense variant, p.Thr278Asn, for HSCR (MAF = 0.012) that was registered as DM in HGMD. However, this variant was annotated with “conflicting interpretations of pathogenicity in the ClinVar database.” This variant was originally reported in Asia [56], and the allele frequency in people with European and African ancestries is very low (not detected in ESP EA and AA). Further studies are needed to clarify the clinical impact of this variant.

Familial hypercholesterolemia

Familial hypercholesterolemia (FH) is a relatively common genetic disorder with a prevalence of 1:200–500. Generally, people with untreated FH are at a higher risk of coronary heart disease [57]. Genetic variants in three genes—low-density lipoprotein receptor (LDLR) [58], APOB, and proprotein convertase subtilisin/kexin type 9 (PCSK9) [59]—account for the majority of cases with autosomal dominant FH [60].

LDLR

The LDLR protein recognizes apolipoprotein B-100 (apo B-100) embedded in the outer phospholipid layer of low-density lipoproteins (LDLs), and mediates the endocytosis of LDL. After internalization of the LDLR-LDL complexes into the endosomes, the complexes dissociate and LDLR is either recycled or degraded, whereas LDL is taken into lysosomes where the protein moiety is degraded. LDLR variants that cause FH result in defective synthesis, assembly, LDL-binding, transport, or recycling of the protein, causing reduced clearance of LDL, the major plasma cholesterol-carrier, and thus, dramatically raising blood cholesterol levels.

In search for potentially pathological variants of FH in the 2KJPN reference panel, we identified four missense SNVs (p.Arg115His, p.Arg257Trp, p.Leu568Val, and p.Pro699Leu) classified as DM in HGMD (Table 3). Three of these variants, p.Arg115His, p.Leu568Val, and p.Pro699Leu, are classified as “likely pathogenic” in ClinVar. An additional missense SNV, p.Gly461Ser, was classified as “likely pathogenic” in ClinVar and was identified in 2KJPN, but no annotations were given to this variant in HGMD. Through a literature survey (Table 4), three of these variants (p.Arg257Trp, p.Leu568Val, and p.Pro699Leu found in two, one, and one heterozygous individuals, respectively) identified in 2KJPN were found to have strong evidence for the pathological significance for FH. This finding was based on multiple studies with patients of European or East Asian origin, including the Japanese, and significantly higher allele frequencies in these patients were identified over the population controls, including those in 2KJPN. In regards to another missense variant, p.Gly461Ser (found in two heterozygous individuals in 2KJPN), only one report of two probands with this variant in 262 Greek families with FH was found [61], suggesting a higher allele frequency of this variant in patients over the general population, as this variant was not found in over 70,000 Europeans in the Exome Aggregation Consortium (ExAC) [62]. However, the pathological significance of this variant should be confirmed by further studies. As for another missense variant, p.Arg115His, the allele frequency in 2KJPN was 0.0039, which was greater than expected for causative variants of FH based on the estimated prevalence of 1 in 200–500 in Japan, hence suggesting that these variants are benign or have mild effects. These two variants showed a higher allele frequency in 2KJPN than that of EAs (p < 0.00001), and both of them were originally reported from domestic studies [63, 64]. Thus, these variants could be classified as benign evaluated solely from the viewpoint of their relatively high frequencies.

Table 4 Review of pathologically annotated variants in LDLR, APOB, and PCSK9 for FH

Full size table

APOB

APOB mutations have been estimated to account for 1–5% of patients with FH, and are inherited in an autosomal dominant manner [65, 66]. From APOB, apo B-100 is synthesized exclusively in the liver as one of the two main protein isoforms, and is a major constituent of LDL and VLDL. Apo B-100 serves as a recognition signal for the LDL receptor to bind and internalize LDL particles. Furthermore, APOB pathogenic variants decrease the binding affinity of LDL particles for the LDL receptor, thus causing fewer LDL particles to be cleared from the blood, which then dramatically raises the plasma cholesterol levels.

In the 2KJPN reference panel, we identified two reported pathogenic missense variants (Table 3). One of the variants, p.Arg3527Gln, is a well-characterized pathogenic missense variant [67] in APOB, and was identified in only one heterozygous individual among the 2049 individuals (allele frequency = 0.0002). Another missense variant, p.Ile3768Thr [68], was registered as DM in HGMD, and was identified in 2KJPN in a heterozygous individual. However, no evidence for the pathogenicity of this variant has been presented; therefore, its clinical and functional significance must be scrutinized.

PCSK9

PCSK9 encodes neural apoptosis regulated convertase (NARC)-1, a 692 AA protein that is the ninth member of the secretory subtilase family [69]. The protein is synthesized as a soluble zymogen that undergoes autocatalytic cleavage in the endoplasmic reticulum. The mature protein binds to the EGF-A domain of lipoprotein receptors [70, 71], abolishes their functions, and raises the level of cholesterol in the blood stream. Furthermore, some gain-of-function mutations increase the binding affinity of PCSK9 and lipoprotein receptors, thus resulting in the degradation of LDLR, inefficient incorporation of cholesterol in the liver cells, and higher cholesterol levels in the blood stream. On the contrary, individuals having loss-of-function variants showed lower levels of LDL cholesterol [72].

In the 2KJPN reference panel, we identified six reported pathogenic SNVs for PCSK9—five missense variants (p.Glu54Ala, p.Arg104Cys, p.Arg215His, p.Ala514Thr, and p.Ser668Arg) and one nonsense variant (p.Trp428*) (Table 3). Since p.Ser668Arg and p.Trp428* were reported to be causative variants for low LDL cholesterol [63], we considered the other four missense variants (p.Glu54Ala, p.Arg104Cys, p.Arg215His, and p.Ala514Thr) as causative variants for hypercholesterolemia. The population frequency of the causative variants responsible for hypercholesterolemia in PCSK9 was estimated to be 0.001 in 2KJPN, which was based on the allele frequencies of the four responsible variants, all of which were singletons. There was a missense variant, p.Glu32Lys (rs564427867), which was registered as DM in HGMD, and this variant was found in 2KJPN in 44 heterozygous and one homozygous individual. Although this variant was discarded during variant filtering due to its relatively higher frequency (0.011) for assigning RP variants, we further reviewed this variant because it was reported in a domestic study [73]. In ClinVar, this variant was annotated as “conflicting interpretations of pathogenicity.” Because it showed a higher allele frequency in 2KJPN than that in non-Asian populations (p < 0.00001), the variant might not have been detected or reported by DNA sequencing of the patient samples from non-Asian population (see Supplementary Table 3). The variant could be classified as benign based on its high frequency; however, it may have mild effects on FH. Our results showed that phenotypic variants responsible for high and low levels of LDL cholesterol were found in 2KJPN. Further careful examination may be needed to assess the proportion of risk variants in LDLR, APOB, and PCSK9 in the Japanese population [74]. It is very important to know which genes have causative mutations in patients with FH to assess for appropriate therapeutic strategies [75] for personalized medicine.

Discussion

Here, we presented the estimation of pathogenic variant frequencies for actionable genes in the Japanese population for the first time, and showed that a substantial number of people had reported pathogenic variants of the ACMG-recommended genes. Although there have been numerous domestic reports on pathogenic variants of diseases detected in patient groups, it was not clear in what proportion the responsible variants exist among healthy individuals. Identification of potential risk alleles and their frequency estimation among healthy individuals are, thus, highly important for public health.

We also found that manually checking and reviewing variants are very critical to interpreting variants for its pathogenicity, although it is needless to say about distinguishing distinct phenotypic effects by single genes, such as RET and PCSK9. In this study, for several diseases, we manually reviewed pathogenic variants annotated in public databases (HGMD and ClinVar). We conducted a careful review of the variants, especially for the three genes (LDLR, APOB, PCSK9) responsible for FH, and the allelic frequencies of the risk alleles were compared with the prevalence data in the Japanese population for each condition. We found that evidence of pathogenicity in the Japanese population was lacking for some variants, even if they were reported as DMs, such as in the case of genes responsible for FH. In addition, we observed that some of the reported pathogenic variants could be benign after a review of the variants. The insufficiencies in the data of reported pathogenic variants may be due to an insufficient examination of allele frequency in healthy controls in the original studies or an inappropriate curation in the public databases.

We found that some of HGMD-DM variants existed at higher frequencies in 2KJPN than in other ethnic groups. The examples were RET (p.Arg114His and p.Thr278Asn), LDLR (p.Arg115His), and PCSK9 (p.Glu32Lys) among the genes in our variant review and inter-ethnic allele frequency comparison. These examples may suggest that protein-altering variants, such as missense or truncating variants, which exist in Asian populations, but are rarely detected in other ethnic populations, are more likely to be reported in the literature as novel DMs for Asians. Although these variants may have mild effects on phenotypes, we should re-review reported pathogenic variants to check whether protein-altering SNVs showing inter-ethnic frequency differences have been biasedly reported or registered in the databases as novel pathological variants.

Although public databases of pathogenic variants, such as HGMD and ClinVar, are useful as information resources, reviewing reported pathogenic variants for their pathogenicity in the target population is necessary and a challenging issue. In particular, this is critical for returning individual genomic results with clinical benefits and avoiding unnecessary psychosocial harm due to uncertain clinical validity. Our study suggested that some of the reported pathogenic variants should be re-reviewed, even though they were designated as “disease-causing variants”, especially when they are used for the purposes of identification of secondary findings in clinical settings under the ACMG recommendations.

Several previous reports have tried to overcome this issue, and one of these studies estimated the frequency of actionable variants in the diverse 1000 genomes [15]. They conducted an extensive literature survey by checking the population frequency, evidence for pathogenicity, and evaluations by expert physicians with medical specialties relevant to the conditions. Although 237 variants were annotated as disease-causing variants by HGMD, only seven variants remained to be likely pathogenic after the variant review.

Information of individual status of risk variants should be utilized for public health. The participants in our cohort studies were very interested in individual genomic results [76]. However, in the present situation in Japan, actual attempts or trials of returning individual genomic results in the research context have been very limited. This may be due to a number of medical, psychosocial, ethical, and financial issues, as well as the lack of experiences. Such situations may vary among countries, and real actionability depends on the medical systems in the society. Considering the current situation in Japan, we have a plan of action to return individual genomic results to the participants of our cohort studies (will be described elsewhere). We may be able to follow their medical conditions in the long-term after the participants receive their genomic results [77]. We expect that this kind of practice would contribute to the accumulation of case information about dealing with genetic results from the standpoints of scientific and practical aspects.

Furthermore, there are limitations in this study because 1) insertions and deletions were not analyzed; 2) actionable genes in chromosome X (GLA and OTC) were not analyzed; and 3) reported pathogenic variants were not detected for 15 genes, which may be due to the limited number of individuals or very low allele frequencies in the Japanese population. Although we obtained variants as candidates of expected pathogenic variants, further analysis and evaluation through appropriate filtering and interpretations are needed for selecting strong candidates of pathogenic variants. We would overcome these limitations as much as possible by including other types of variants and extending our analysis in the near future.

It may be not surprising that about 21% people have reported pathogenic variants of the 57 ACMG genes. In our previous study with 1KJPN, we showed that one individual had 11.2 HGMD-DM variants (9.6 as heterozygous and 1.6 as homozygous) on average [18]. In this study, a small fraction of the low-frequency HGMD-DM variants in the limited set of disease genes may have been detected. Such estimates may be lowered if variants were reviewed appropriately for all the target genes.

The recommended gene list for incidental findings, which were originally proposed by ACMG, may be modified by considering its practicality for East Asian populations. For example, a Korean group is considering inclusion of CDH1 for the risk of hereditary diffuse gastric cancer [11], based on its high penetrance. This kind of consideration would improve the quality of genetic medicine in East Asian countries. Other phenotypes not listed in the recommendations by ACMG may be taken into consideration if they are important for healthcare in the Japanese population. Based on this study, we should construct an information infrastructure of pathogenic variants for the Japanese population. Through appropriate variant interpretations, updated information of pathogenic variants would be useful for diagnostic strategies and subsequent personalized healthcare for the Japanese population.

References

Genome of the Netherlands Consortium. Whole-genome sequence variation, population structure and demographic history of the Dutch population. Nat Genet. 2014;46:818–25.
Article Google Scholar
Gudbjartsson DF, Helgason H, Gudjonsson SA, Zink F, Oddson A, Gylfason A, et al. Large-scale whole-genome sequencing of the Icelandic population. Nat Genet. 2015;47:435–44.
Article CAS PubMed Google Scholar
UK 10K Consortium, Walter K, Min JL, Huang J, Crooks L, Memari Y, et al. The UK10K project identifies rare variants in health and disease. Nature. 2015;526:82–90.
Article Google Scholar
Styrkarsdottir U, Thorleifsson G, Sulem P, Gudbjartsson DF, Sigurdsson A, Jonasdottir A, et al. Nonsense mutation in the LGR4 gene is associated with several human diseases and other traits. Nature. 2013;497:517–20.
Article CAS PubMed Google Scholar
Koshiba S, Motoike I, Kojima K, Hasegawa T, Shirota M, Saito T, et al. The structural origin of metabolic quantitative diversity. Sci Rep. 2016;6:31463.
Article CAS PubMed PubMed Central Google Scholar
Tennessen JA, Bigham AW, O’Connor TD, Fu W, Kenny EE, Gravel S, et al. Evolution and functional impact of rare coding variation from deep sequencing of human exomes. Science. 2012;337:64–9.
Article CAS PubMed PubMed Central Google Scholar
Higasa K, Miyake N, Yoshimura J, Okamura K, Niihori T, Saitsu H, et al. Human genetic variation database, a reference database of genetic variations in the Japanese population. J Hum Genet. 2016;61:547–53.
Article CAS PubMed PubMed Central Google Scholar
Green RC, Berg JS, Grody WW, Kalia SS, Korf BR, Martin CL, et al. ACMG recommendations for reporting of incidental findings in clinical exome and genome sequencing. Genet Med. 2013;15:565–74.
Article CAS PubMed PubMed Central Google Scholar
Kalia SS, Adelman K, Bale SJ, Chung WK, Eng C, Evans JP, et al. Recommendations for reporting of secondary findings in clinical exome and genome sequencing, 2016 update (ACMG SF v2.0): a policy statement of the American College of Medical Genetics and Genomics. Genet Med. 2016;19:249–55.
Article PubMed Google Scholar
Jurgens J, Ling H, Hetrick K, Pugh E, Schiettecatte F, Doheny K, et al. Assessment of incidental findings in 232 whole-exome sequences from the Baylor-Hopkins Center for Mendelian Genomics. Genet Med. 2015;17:782–88.
Article PubMed PubMed Central Google Scholar
Jang MA, Lee SH, Kim N, Ki CS. Frequency and spectrum of actionable pathogenic secondary findings in 196 Korean exomes. Genet Med. 2015;17:1007–11.
Article CAS PubMed Google Scholar
Gambin T, Jhangiani SN, Below JE, Campbell IM, Wiszniewski W, Muzny DM, et al. Secondary findings and carrier test frequencies in a large multiethnic sample. Genome Med. 2015;7:54.
Article PubMed PubMed Central Google Scholar
Yang Y, Muzny DM, Xia F, Niu Z, Person R, Ding Y, et al. Molecular findings among patients referred for clinical whole-exome sequencing. JAMA. 2014;312:1870–79.
Article CAS PubMed PubMed Central Google Scholar
Dorschner MO, Amendola LM, Turner EH, Robertson PD, Shirts BH, Gallego CJ, et al. Actionable, pathogenic incidental findings in 1,000 participants’ exomes. Am J Hum Genet. 2013;93:631–40.
Article CAS PubMed PubMed Central Google Scholar
Olfson E, Cottrell CE, Davidson NO, Gurnett CA, Heusel JW, Stitziel NO, et al. Identification of Medically Actionable Secondary Findings in the 1000 Genomes. PloS ONE. 2015;10:e0135193.
Article PubMed PubMed Central Google Scholar
Tabor HK, Auer PL, Jamal SM, Chong JX, Yu JH, Gordon AS, et al. Pathogenic variants for Mendelian and complex traits in exomes of 6,517 European and African Americans: implications for the return of incidental results. Am J Hum Genet. 2014;95:183–93.
Article CAS PubMed PubMed Central Google Scholar
Kuriyama S, Yaegashi N, Nagami F, Arai T, Kawaguchi Y, Osumi N, et al. The Tohoku Medical Megabank project: design and mission. J Epidemiol. 2016;26:493–511.
Article PubMed Google Scholar
Nagasaki M, Yasuda J, Katsuoka F, Nariai N, Kojima K, Kawai Y, et al. Rare variant discovery by deep whole-genome sequencing of 1,070 Japanese individuals. Nat Commun. 2015;6:8018.
Article CAS PubMed PubMed Central Google Scholar
Yamaguchi-Kabata Y, Nariai N, Kawai Y, Sato Y, Kojima K, Tateno M, et al. iJGVD: an integrative Japanese genome variation database based on whole-genome sequencing. Hum Genome Var. 2015;2:15050.
Article CAS PubMed PubMed Central Google Scholar
Stenson PD, Ball EV, Mort M, Phillips AD, Shiel JA, Thomas NS, et al. Human Gene Mutation Database (HGMD): 2003 update. Hum Mutat. 2003;21:577–81.
Article CAS PubMed Google Scholar
Landrum MJ, Lee JM, Benson M, Brown G, Chao C, Chitipiralla S, et al. ClinVar: public archive of interpretations of clinically relevant variants. Nucleic Acids Res. 2016;44:D862–68.
Article CAS PubMed Google Scholar
Fu W, O’Connor TD, Jun G, Kang HM, Abecasis G, Leal SM, et al. Analysis of 6,515 exomes reveals the recent origin of most human protein-coding variants. Nature. 2013;493:216–220.
Article CAS PubMed Google Scholar
Kircher M, Witten DM, Jain P, O’Roak BJ, Cooper GM, Shendure J. A general framework for estimating the relative pathogenicity of human genetic variants. Nat Genet. 2014;46:310–15.
Article CAS PubMed PubMed Central Google Scholar
Jagadeesh KA, Wenger AM, Berger MJ, Guturu H, Stenson PD, Cooper DN, et al. M-CAP eliminates a majority of variants of uncertain significance in clinical exomes at high sensitivity. Nat Genet. 2016;48:1581–86.
Article CAS PubMed Google Scholar
Liede A, Narod SA. Hereditary breast and ovarian cancer in Asia: genetic epidemiology of BRCA1 and BRCA2. Hum Mutat. 2002;20:413–24.
Article CAS PubMed Google Scholar
Hirotsu Y, Nakagomi H, Sakamoto I, Amemiya K, Mochizuki H, Omata M. Detection of BRCA1 and BRCA2 germline mutations in Japanese population using next-generation sequencing. Mol Genet Genomic Med. 2015;3:121–9.
Article CAS PubMed Google Scholar
Sakamoto I, Hirotsu Y, Nakagomi H, Ouchi H, Ikegami A, Teramoto K, et al. BRCA1 and BRCA2 mutations in Japanese patients with ovarian, fallopian tube, and primary peritoneal cancer. Cancer. 2016;122:84–90.
Article CAS PubMed Google Scholar
Sekine M, Nagata H, Tsuji S, Hirai Y, Fujimoto S, Hatae M, et al. Mutational analysis of BRCA1 and BRCA2 and clinicopathologic analysis of ovarian cancer in 82 ovarian cancer families: two common founder mutations of BRCA1 in Japanese population. Clin Cancer Res. 2001;7:3144–50.
CAS PubMed Google Scholar
Spurdle AB, Whiley PJ, Thompson B, Feng B, Healey S, Brown MA, et al. BRCA1 R1699Q variant displaying ambiguous functional abrogation confers intermediate breast and ovarian cancer risk. J Med Genet. 2012;49:525–32.
Article CAS PubMed Google Scholar
Ikeda N, Miyoshi Y, Yoneda K, Shiba E, Sekihara Y, Kinoshita M, et al. Frequency of BRCA1 and BRCA2 germline mutations in Japanese breast cancer families. Int J Cancer. 2001;91:83–8.
Article CAS PubMed Google Scholar
Borg A, Haile RW, Malone KE, Capanu M, Diep A, Torngren T, et al. Characterization of BRCA1 and BRCA2 deleterious mutations and variants of unknown clinical significance in unilateral and bilateral breast cancer: the WECARE study. Hum Mutat. 2010;31:E1200–40.
Article CAS PubMed PubMed Central Google Scholar
Bonnet C, Krieger S, Vezain M, Rousselin A, Tournier I, Martins A, et al. Screening BRCA1 and BRCA2 unclassified variants for splicing mutations using reverse transcription PCR on patient RNA and an ex vivo assay based on a splicing reporter minigene. J Med Genet. 2008;45:438–46.
Article CAS PubMed Google Scholar
Walsh T, Lee MK, Casadei S, Thornton AM, Stray SM, Pennil C, et al. Detection of inherited mutations for breast and ovarian cancer using genomic capture and massively parallel sequencing. Proc Natl Acad Sci USA. 2010;107:12629–33.
Article CAS PubMed PubMed Central Google Scholar
Tiwari AK, Roy HK, Lynch HT. Lynch syndrome in the 21st century: clinical perspectives. QJM. 2016;109:151–8.
Article CAS PubMed Google Scholar
Peltomaki P. Update on Lynch syndrome genomics. Fam Cancer. 2016;15:385–93.
Article Google Scholar
Lu SL, Akiyama Y, Nagasaki H, Nomizu T, Ikeda E, Baba S, et al. Loss or somatic mutations of hMSH2 occur in hereditary nonpolyposis colorectal cancers with hMSH2 germline mutations. Jpn J Cancer Res. 1996;87:279–87.
Article CAS PubMed Google Scholar
Konishi M, Kikuchi-Yanoshita R, Tanaka K, Muraoka M, Onda A, Okumura Y, et al. Molecular nature of colon tumors in hereditary nonpolyposis colon cancer, familial polyposis, and sporadic colon cancer. Gastroenterology. 1996;111:307–17.
Article CAS PubMed Google Scholar
Miyaki M, Konishi M, Muraoka M, Kikuchi-Yanoshita R, Tanaka K, Iwama T, et al. Germ line mutations of hMSH2 and hMLH1 genes in Japanese families with hereditary nonpolyposis colorectal cancer (HNPCC): usefulness of DNA analysis for screening and diagnosis of HNPCC patients. J Mol Med. 1995;73:515–20.
Article CAS PubMed Google Scholar
Chao EC, Velasquez JL, Witherspoon MS, Rozek LS, Peel D, Ng P, et al. Accurate classification of MLH1/MSH2 missense variants with multivariate analysis of protein polymorphisms-mismatch repair (MAPP-MMR). Hum Mutat. 2008;29:852–60.
Article CAS PubMed Google Scholar
Arnold S, Buchanan DD, Barker M, Jaskowski L, Walsh MD, Birney G, et al. Classifying MLH1 and MSH2 variants using bioinformatic prediction, splicing assays, segregation, and tumor characteristics. Hum Mutat. 2009;30:757–70.
Article CAS PubMed PubMed Central Google Scholar
Furukawa T, Konishi F, Shitoh K, Kojima M, Nagai H, Tsukamoto T. Evaluation of screening strategy for detecting hereditary nonpolyposis colorectal carcinoma. Cancer. 2002;94:911–20.
Article PubMed Google Scholar
Cravo M, Afonso AJ, Lage P, Albuquerque C, Maia L, Lacerda C, et al. Pathogenicity of missense and splice site mutations in hMSH2 and hMLH1 mismatch repair genes: implications for genetic testing. Gut. 2002;50:405–12.
Article CAS PubMed PubMed Central Google Scholar
Lage PA, Albuquerque C, Sousa RG, Cravo ML, Salazar M, Francisco I, et al. Association of colonic and endometrial carcinomas in Portuguese families with hereditary nonpolyposis colorectal carcinoma significantly increases the probability of detecting a pathogenic mutation in mismatch repair genes, primarily the MSH2 gene. Cancer. 2004;101:172–7.
Article CAS PubMed Google Scholar
Takahashi M, Shimodaira H, Andreutti-Zaugg C, Iggo R, Kolodner RD, Ishioka C. Functional analysis of human MLH1 variants using yeast and in vitro mismatch repair assays. Cancer Res. 2007;67:4595–604.
Article CAS PubMed Google Scholar
Thompson BA, Spurdle AB, Plazzer JP, Greenblatt MS, Akagi K, Al-Mulla F, et al. Application of a 5-tiered scheme for standardized classification of 2,360 unique mismatch repair gene variants in the InSiGHT locus-specific database. Nat Genet. 2014;46:107–15.
Article CAS PubMed Google Scholar
Bishop JO, Madson EC. Retinoblastoma. Review of the current status. Surv Ophthalmol. 1975;19:342–66.
CAS PubMed Google Scholar
Knudson AG Jr. Mutation and cancer: statistical study of retinoblastoma. Proc Natl Acad Sci USA. 1971;68:820–3.
Article PubMed PubMed Central Google Scholar
He MY, An Y, Gao YJ, Qian XW, Li G, Qian J. Screening of RB1 gene mutations in Chinese patients with retinoblastoma and preliminary exploration of genotype-phenotype correlations. Mol Vis. 2014;20:545–52.
PubMed PubMed Central Google Scholar
Lee JO, Russo AA, Pavletich NP. Structure of the retinoblastoma tumour-suppressor pocket domain bound to a peptide from HPV E7. Nature. 1998;391:859–65.
Article CAS PubMed Google Scholar
Rubin SM, Gall AL, Zheng N, Pavletich NP. Structure of the Rb C-terminal domain bound to E2F1-DP1: a mechanism for phosphorylation-induced E2F release. Cell. 2005;123:1093–106.
Article CAS PubMed Google Scholar
Bodian DL, McCutcheon JN, Kothiyal P, Huddleston KC, Iyer RK, Vockley JG, et al. Germline variation in cancer-susceptibility genes in a healthy, ancestrally diverse cohort: implications for individual genome sequencing. PloS ONE. 2014;9:e94554.
Article PubMed PubMed Central Google Scholar
Shimizu T, Toguchida J, Kato MV, Kaneko A, Ishizaki K, Sasaki MS. Detection of mutations of the RB1 gene in retinoblastoma patients by using exon-by-exon PCR-SSCP analysis. Am J Hum Genet. 1994;54:793–800.
CAS PubMed PubMed Central Google Scholar
Krampitz GW, Norton JA. RET gene mutations (genotype and phenotype) of multiple endocrine neoplasia type 2 and familial medullary thyroid carcinoma. Cancer. 2014;120:1920–31.
Article CAS PubMed Google Scholar
Pan ZW, Li JC. Advances in molecular genetics of Hirschsprung’s disease. Anat Rec. 2012;295:1628–38.
Article CAS Google Scholar
Sasaki A, Kanai M, Kijima K, Akaba K, Hashimoto M, Hasegawa H, et al. Molecular analysis of congenital central hypoventilation syndrome. Hum Genet. 2003;114:22–6.
Article CAS PubMed Google Scholar
Kanai M, Numakura C, Sasaki A, Shirahata E, Akaba K, Hashimoto M, et al. Congenital central hypoventilation syndrome: a novel mutation of the RET gene in an isolated case. Tohoku J Exp Med. 2002;196:241–6.
Article CAS PubMed Google Scholar
Emerging Risk Factors, C, Di Angelantonio E, Sarwar N, Perry P, Kaptoge S, Ray KK, et al. Major lipids, apolipoproteins, and risk of vascular disease. JAMA. 2009;302:1993–2000.
Article Google Scholar
Brown MS, Goldstein JL. A receptor-mediated pathway for cholesterol homeostasis. Science. 1986;232:34–47.
Article CAS PubMed Google Scholar
Abifadel M, Varret M, Rabes JP, Allard D, Ouguerram K, Devillers M, et al. Mutations in PCSK9 cause autosomal dominant hypercholesterolemia. Nat Genet. 2003;34:154–6.
Article CAS PubMed Google Scholar
Henderson R, O’Kane M, McGilligan V, Watterson S. The genetics and screening of familial hypercholesterolaemia. J Biomed Sci. 2016;23:39.
Article PubMed PubMed Central Google Scholar
Mollaki V, Progias P, Drogari E. Familial Hypercholesterolemia in Greek children and their families: genotype-to-phenotype correlations and a reconsideration of LDLR mutation spectrum. Atherosclerosis. 2014;237:798–804.
Article CAS PubMed Google Scholar
Lek M, Karczewski KJ, Minikel EV, Samocha KE, Banks E, Fennell T, et al. Analysis of protein-coding genetic variation in 60,706 humans. Nature. 2016;536:285–91.
Article CAS PubMed PubMed Central Google Scholar
Miyake Y, Yamamura T, Sakai N, Miyata T, Kokubo Y, Yamamoto A. Update of Japanese common LDLR gene mutations and their phenotypes: Mild type mutation L547V might predominate in the Japanese population. Atherosclerosis. 2009;203:153–60.
Article CAS PubMed Google Scholar
Varret M, Rabes JP, Thiart R, Kotze MJ, Baron H, Cenarro A, et al. LDLR Database (second edition): new additions to the database and the software, and results of the first molecular analysis. Nucleic Acids Res. 1998;26:248–52.
Article CAS PubMed PubMed Central Google Scholar
Soria LF, Ludwig EH, Clarke HR, Vega GL, Grundy SM, McCarthy BJ. Association between a specific apolipoprotein B mutation and familial defective apolipoprotein B-100. Proc Natl Acad Sci USA. 1989;86:587–91.
Article CAS PubMed PubMed Central Google Scholar
Vega GL, Grundy SM. In vivo evidence for reduced binding of low density lipoproteins to receptors as a cause of primary moderate hypercholesterolemia. J Clin Invest. 1986;78:1410–14.
Article CAS PubMed PubMed Central Google Scholar
Braenne I, Kleinecke M, Reiz B, Graf E, Strom T, Wieland T, et al. Systematic analysis of variants related to familial hypercholesterolemia in families with premature myocardial infarction. Eur J Hum Genet. 2016;24:191–7.
Article CAS PubMed Google Scholar
Johansen CT, Wang J, Lanktree MB, Cao H, McIntyre AD, Ban MR, et al. Excess of rare variants in genes identified by genome-wide association study of hypertriglyceridemia. Nat Genet. 2010;42:684–7.
Article CAS PubMed PubMed Central Google Scholar
Seidah NG, Benjannet S, Wickham L, Marcinkiewicz J, Jasmin SB, Stifani S, et al. The secretory proprotein convertase neural apoptosis-regulated convertase 1 (NARC-1): liver regeneration and neuronal differentiation. Proc Natl Acad Sci USA. 2003;100:928–33.
Article CAS PubMed PubMed Central Google Scholar
Kwon HJ, Lagace TA, McNutt MC, Horton JD, Deisenhofer J. Molecular basis for LDL receptor recognition by PCSK9. Proc Natl Acad Sci USA. 2008;105:1820–5.
Article CAS PubMed PubMed Central Google Scholar
Cunningham D, Danley DE, Geoghegan KF, Griffor MC, Hawkins JL, Subashi TA, et al. Structural and biophysical studies of PCSK9 and its mutants linked to familial hypercholesterolemia. Nat Struct Mol Biol. 2007;14:413–9.
Article CAS PubMed Google Scholar
Cohen JC, Boerwinkle E, Mosley TH Jr, Hobbs HH. Sequence variations in PCSK9, low LDL, and protection against coronary heart disease. N Engl J Med. 2006;354:1264–72.
Article CAS PubMed Google Scholar
Noguchi T, Katsuda S, Kawashiri MA, Tada H, Nohara A, Inazu A, et al. The E32K variant of PCSK9 exacerbates the phenotype of familial hypercholesterolaemia by increasing PCSK9 function and concentration in the circulation. Atherosclerosis. 2010;210:166–72.
Article CAS PubMed Google Scholar
Abul-Husn NS, Manickam K, Jones LK, Wright EA, Hartzel DN, Gonzaga-Jauregui C, et al. Genetic identification of familial hypercholesterolemia within a single U.S. health care system. Science. 2016;354:aaf7000.
Article PubMed Google Scholar
Fitzgerald K, Frank-Kamenetsky M, Shulga-Morskaya S, Liebow A, Bettencourt BR, Sutherland JE, et al. Effect of an RNA interference drug on the synthesis of proprotein convertase subtilisin/kexin type 9 (PCSK9) and the concentration of serum LDL cholesterol in healthy volunteers: a randomised, single-blind, placebo-controlled, phase 1 trial. Lancet. 2014;383:60–8.
Article CAS PubMed Google Scholar
Ishikuro M, Nakaya N, Obara T, Sato Y, Metoki H, Kikuya M, et al. Public attitudes toward an epidemiological study with genomic analysis in the Great East Japan Earthquake disaster area. Prehosp Disaster Med. 2016;31:330–4.
Article PubMed Google Scholar
Wald DS, Bestwick JP, Morris JK, Whyte K, Jenkins L, Wald NJ. Child-parent familial hypercholesterolemia screening in primary care. N Engl J Med. 2016;375:1628–37.
Article CAS PubMed Google Scholar

Download references

Acknowledgements

This work was supported by Tohoku Medical Megabank Project from MEXT and Japan Agency for Medical Research and Development (AMED), and by the grant “Advanced Genome Research and Bioinformatics Study to Facilitate Medical Innovation”, also by AMED. This research was also supported by the Center of Innovation Program from Japan Science and Technology Agency, JST. All computational resources were provided by the ToMMo supercomputer system. We are indebted to all volunteers who participated in this Tohoku Medical Megabank project. We would like to acknowledge all the members associated with this project; the member list is available at the following web site: http://www.megabank.tohoku.ac.jp/english/a161201/. We would like to thank Editage (www.editage.jp) for English language editing.

Author information

Yoichi Suzuki
Present address: Ageo Central General Hospital, Ageo, 362-8588, Japan

Authors and Affiliations

Tohoku Medical Megabank Organization, Tohoku University, Aoba-ku, Sendai, 980-8573, Japan
Yumi Yamaguchi-Kabata, Jun Yasuda, Osamu Tanabe, Yoichi Suzuki, Hiroshi Kawame, Nobuo Fuse, Masao Nagasaki, Yosuke Kawai, Kaname Kojima, Fumiki Katsuoka, Sakae Saito, Inaho Danjoh, Ikuko N. Motoike, Riu Yamashita, Seizo Koshiba, Daisuke Saigusa, Gen Tamiya, Shigeo Kure, Nobuo Yaegashi, Yoshio Kawaguchi, Fuji Nagami, Shinichi Kuriyama, Junichi Sugawara, Naoko Minegishi, Atsushi Hozawa, Soichi Ogishima, Hideyasu Kiyomoto, Takako Takai-Igarashi, Kengo Kinoshita & Masayuki Yamamoto
Graduate School of Medicine, Tohoku University, Aoba-ku, Sendai, 980-8575, Japan
Yumi Yamaguchi-Kabata, Jun Yasuda, Osamu Tanabe, Yoichi Suzuki, Hiroshi Kawame, Nobuo Fuse, Masao Nagasaki, Yosuke Kawai, Kaname Kojima, Fumiki Katsuoka, Sakae Saito, Inaho Danjoh, Ikuko N. Motoike, Riu Yamashita, Seizo Koshiba, Daisuke Saigusa, Gen Tamiya, Shigeo Kure, Nobuo Yaegashi, Shinichi Kuriyama, Junichi Sugawara, Naoko Minegishi, Atsushi Hozawa, Soichi Ogishima, Hideyasu Kiyomoto, Takako Takai-Igarashi & Masayuki Yamamoto
Graduate School of Information Sciences, Tohoku University, Aoba-ku, Sendai, 980-8579, Japan
Masao Nagasaki, Ikuko N. Motoike, Riu Yamashita & Kengo Kinoshita
Statistical Genetics Team, RIKEN Center for Advanced Intelligence Project, Chuo-ku, Tokyo, 103-0027, Japan
Gen Tamiya
International Research Institute of Disaster Science, Tohoku University, Aoba-ku, Sendai, 980-0845, Japan
Shinichi Kuriyama
Japan Agency for Medical Research and Development (AMED), Chiyodaku, Tokyo, 100-0004, Japan
Hideyasu Kiyomoto
Institute of Development, Aging and Cancer, Tohoku University, Aoba-ku, Sendai, 980-8575, Japan
Kengo Kinoshita

Authors

Yumi Yamaguchi-Kabata
View author publications
You can also search for this author in PubMed Google Scholar
Jun Yasuda
View author publications
You can also search for this author in PubMed Google Scholar
Osamu Tanabe
View author publications
You can also search for this author in PubMed Google Scholar
Yoichi Suzuki
View author publications
You can also search for this author in PubMed Google Scholar
Hiroshi Kawame
View author publications
You can also search for this author in PubMed Google Scholar
Nobuo Fuse
View author publications
You can also search for this author in PubMed Google Scholar
Masao Nagasaki
View author publications
You can also search for this author in PubMed Google Scholar
Yosuke Kawai
View author publications
You can also search for this author in PubMed Google Scholar
Kaname Kojima
View author publications
You can also search for this author in PubMed Google Scholar
Fumiki Katsuoka
View author publications
You can also search for this author in PubMed Google Scholar
Sakae Saito
View author publications
You can also search for this author in PubMed Google Scholar
Inaho Danjoh
View author publications
You can also search for this author in PubMed Google Scholar
Ikuko N. Motoike
View author publications
You can also search for this author in PubMed Google Scholar
Riu Yamashita
View author publications
You can also search for this author in PubMed Google Scholar
Seizo Koshiba
View author publications
You can also search for this author in PubMed Google Scholar
Daisuke Saigusa
View author publications
You can also search for this author in PubMed Google Scholar
Gen Tamiya
View author publications
You can also search for this author in PubMed Google Scholar
Shigeo Kure
View author publications
You can also search for this author in PubMed Google Scholar
Nobuo Yaegashi
View author publications
You can also search for this author in PubMed Google Scholar
Yoshio Kawaguchi
View author publications
You can also search for this author in PubMed Google Scholar
Fuji Nagami
View author publications
You can also search for this author in PubMed Google Scholar
Shinichi Kuriyama
View author publications
You can also search for this author in PubMed Google Scholar
Junichi Sugawara
View author publications
You can also search for this author in PubMed Google Scholar
Naoko Minegishi
View author publications
You can also search for this author in PubMed Google Scholar
Atsushi Hozawa
View author publications
You can also search for this author in PubMed Google Scholar
Soichi Ogishima
View author publications
You can also search for this author in PubMed Google Scholar
Hideyasu Kiyomoto
View author publications
You can also search for this author in PubMed Google Scholar
Takako Takai-Igarashi
View author publications
You can also search for this author in PubMed Google Scholar
Kengo Kinoshita
View author publications
You can also search for this author in PubMed Google Scholar
Masayuki Yamamoto
View author publications
You can also search for this author in PubMed Google Scholar

Consortia

ToMMo Study Group

Corresponding authors

Correspondence to Yumi Yamaguchi-Kabata or Kengo Kinoshita.

Ethics declarations

Conflict of interest

The authors declare that they have no competing interests.

Electronic Supplementary Material

Supplementary Table 1

Supplementary Table 2

Supplementary Table 3

Rights and permissions

Reprints and permissions

About this article

Cite this article

Yamaguchi-Kabata, Y., Yasuda, J., Tanabe, O. et al. Evaluation of reported pathogenic variants and their frequencies in a Japanese population based on a whole-genome reference panel of 2049 individuals. J Hum Genet 63, 213–230 (2018). https://doi.org/10.1038/s10038-017-0347-1

Download citation

Received: 04 April 2017
Revised: 25 August 2017
Accepted: 29 August 2017
Published: 01 December 2017
Issue Date: February 2018
DOI: https://doi.org/10.1038/s10038-017-0347-1

This article is cited by

Assessing the efficacy of target adaptive sampling long-read sequencing through hereditary cancer patient genomes
- Wataru Nakamura
- Makoto Hirata
- Yuichi Shiraishi
npj Genomic Medicine (2024)
Assessment of pathogenic variation in gynecologic cancer genes in a national cohort
- Urška Kotnik
- Aleš Maver
- Luca Lovrecic
Scientific Reports (2023)
Frequency of actionable secondary findings in 7472 Korean genomes derived from the National Project of Bio Big Data pilot study
- Youngjun Kim
- Jeong-Min Kim
- Mi-Hyun Park
Human Genetics (2023)
Comparison of the loci associated with HbA1c and blood glucose levels identified by a genome-wide association study in the Japanese population
- Takuya Sakashita
- Yasuyuki Nakamura
- Keitaro Matsuo
Diabetology International (2023)
Pathological variants in genes associated with disorders of sex development and central causes of hypogonadism in a whole-genome reference panel of 8380 Japanese individuals
- Naomi Shiga
- Yumi Yamaguchi-Kabata
- Junichi Sugawara
Human Genome Variation (2022)

Abstract

Similar content being viewed by others

Introduction

Materials and Methods

Subjects and data for single nucleotide variation

Variant annotation

Filtering variants

Results

Hereditary breast and ovarian cancer (HBOC)

Lynch syndrome genes

Retinoblastoma

Multiple endocrine neoplasia type 2 and familial medullary thyroid cancer (FMTC)

Familial hypercholesterolemia

LDLR

APOB

PCSK9

Discussion

References

Acknowledgements

Author information

Authors and Affiliations

Consortia

ToMMo Study Group

Corresponding authors

Ethics declarations

Conflict of interest

Electronic Supplementary Material

Rights and permissions

About this article

Cite this article

Share this article

This article is cited by

Search

Quick links