Introduction

Since the first description of the structure of DNA in 1953, great advances have been achieved in sequencing techniques, which have had an impact on human biology and medicine. Owing to such advances, comprehensive genotyping for exome-wide, genome-wide and targeted genomic regions has been widely used in research settings as well as in clinical settings.

Plasma lipid levels have been shown to be highly heritable as well as modifiable risk factors for coronary artery disease (CAD).1, 2, 3, 4 These findings have motivated researchers to perform genetic association studies, which could potentially define which risk factors are causal. In addition, such genetic association studies could identify pathways and therapeutic targets for dyslipidemias as well as CAD.5

In this review, we provide a current perspective on comprehensive genotyping in dyslipidemias, organizing the concepts on the basis of the allele frequencies and effect sizes of the genetic variants, and introducing the paradigm shift of innovative drug development targeting lipids to reduce CAD through genetics.

Advances in comprehensive genotyping

The structure of DNA was first described by Watson and Crick in 1953.6 Twenty-four years later, two papers introducing Gilbert and Sanger sequencing were published. This technique has become a research tool as well as a standard commercial procedure.7, 8 Recently, the so-called next-generation sequencing (NGS) technologies featuring massive parallel sequencing have been developed. NGS is widely used in a variety of applications, such as whole genome, whole exome and RNA sequencing.9 A third generation of sequencing (single-molecule sequencing) has also been developed, which avoids DNA/RNA amplification in a template library. The advantages of third-generation sequencing include avoiding polymerase chain reaction (PCR)-induced error and amplification bias with the use of less genomic DNA. Finally, a fourth-generation sequencing technology characterized by single-molecule real-time sequencing using electrical signals has been developed. Although it requires high technology, it is the simplest method of genotyping and could be the ‘ultimate technology’ for sequencing. Advances in sequencing technology are summarized in Table 1.

Table 1 Advances in sequencing technology

Clinical exome sequencing in Mendelian dyslipidemias

Whole-exome sequencing has been widely used to determine their genetic backgrounds in families with suspected monogenic inherited diseases, including dyslipidemias.10 With this approach, thousands of variants are usually detected in an individual regardless of phenotypes.11, 12 We typically exclude variants predicted to be benign or common and those with an unmatched segregation pattern within a family (Figure 1). To determine the pathogenicity of particular variants, in silico variant annotation prediction tools are quite useful.13 In addition, information on the allele frequency of certain variants from comprehensively sequenced public data can be used for this purpose.14 Phenotypic assessment, including segregation patterns, is the most important part of these procedures. It is laborious to define a segregation pattern, as much phenotypic information about the relatives, including those without the phenotype, must be collected.

Figure 1
figure 1

Typical bioinformatic variant filtering after whole-exome sequencing (WES). Thousands of variants are identified through WES. In general, benign variants, common variants and unmatched variants from the assumed segregation pattern are excluded from top to bottom. MAF, minor allele frequency.

Mendelian randomization studies: Causal factor or merely bystander?

To establish a causal relationship between a risk factor (usually a biomarker) and an outcome, randomized controlled trials (RCTs), which require a large amount of time and effort, are the gold standard. In contrast, the Mendelian randomization study is a technique that uses genotypes as instruments to assess a causal relationship between biomarkers and outcomes.15 In a Mendelian randomization study, a genetic variant associated with a particular biomarker is used as a proxy for the biomarker. Outcomes are compared between the group harboring the effect allele and a group with the reference allele. This approach can be considered a proxy for an RCT, in which the randomized groups have similar confounding variables (Figure 2). Accordingly, a Mendelian randomization study can be regarded as a natural RCT.

Figure 2
figure 2

Schema of randomized controlled trials (RCTs) and Mendelian randomization studies. Left panel indicates the conventional RCT. Right panel indicates Mendelian randomized study.

The LDL cholesterol story

Studies with increasing sample sizes have shown that a number of polymorphisms in multiple different genes are associated with plasma low-density lipoprotein (LDL) cholesterol levels.16, 17, 18 Inheriting an allele associated with lower LDL cholesterol can be considered as equivalent to being randomly allocated to an LDL cholesterol-lowering therapy from birth, whereas inheriting the reference allele can be considered as equivalent to being randomly allocated to usual care. Comparing the rate of cardiovascular events between individuals with and without such an LDL cholesterol-lowering polymorphism should, therefore, provide an unconfounded causal estimate of the effect of long-term exposure to lower LDL cholesterol on the risk of CAD, similar to the result obtained from a long-term RCT. Mendelian randomization studies have repeatedly shown that polymorphisms that are associated with lower LDL cholesterol are also associated with a lower risk of CAD.19, 20, 21 The magnitude of the reduction of the risk of CAD is linearly correlated with the magnitude of the reduction of LDL cholesterol as a result of polymorphisms (Figure 3). This relationship between genetically mediated LDL cholesterol and CAD risk reduction seems to be similar to that observed in RCTs using statins, although the magnitude of the association is stronger in individuals with lifelong exposure from genetic variants (Figure 3).

Figure 3
figure 3

Linear relationship between coronary artery disease (CAD) risk reduction and lowering of low-density lipoprotein (LDL) cholesterol. Blue line indicates the relationship between CAD risk reduction and genetic (lifelong) LDL cholesterol lowering. Pink line indicates the relationship between CAD risk reduction and pharmacological lowering of LDL cholesterol.

The HDL cholesterol story

Numerous epidemiological studies have shown that high-density lipoprotein (HDL) cholesterol levels are inversely associated with CAD.22, 23, 24 Such observations have motivated physicians and researchers to develop HDL cholesterol-raising therapies to reduce CAD. In 1990, Inazu et al.25 reported a family exhibiting extremely high HDL cholesterol levels caused by a loss-of-function mutation in the cholesteryl ester transfer protein (CETP) gene. Development of CETP inhibitors began after this discovery, and several large RCTs have been conducted aiming to clarify the beneficial effect of increasing HDL cholesterol with the use of these drugs in the prevention of CAD.26, 27 RCTs have also been conducted to determine if niacin, which raises HDL cholesterol, reduces the risk of CAD. The predicted risk reduction was not obtained, in contrast to the ‘LDL cholesterol story.’28 In 2010, it was shown that a common genetic variant in apolipoprotein A1 (ApoA1), which increases levels of apoA-I and HDL cholesterol, was not associated with a decreased risk of CAD.29 Kathiresan and co-workers30 reported that having an allele or alleles that raise HDL cholesterol did not reduce the risk of CAD. These studies cast doubt on the concept of HDL cholesterol as ‘good’ cholesterol and have led us to regard HDL cholesterol as merely a marker.

The triglyceride story

How about triglycerides? Triglycerides have also been shown to be associated with CAD in numerous epidemiological studies.31, 32 Establishing triglycerides as a causal factor of CAD has been difficult, as it is difficult to distinguish the effects of triglycerides on CAD from those of LDL cholesterol and HDL cholesterol by RCTs or by Mendelian randomization studies. Do et al.33 performed an interesting study creating a framework to see if triglyceride levels causally influence the risk of CAD. They constructed a model adjusting the effects of LDL cholesterol and/or HDL cholesterol levels on the risk of CAD, and found that the genetic impact of single-nucleotide polymorphisms on triglyceride levels was independently associated with the risk of CAD.33 Those facts imply that plasma triglyceride levels causally affect the risk of CAD.

Novel pharmacological targets for the treatment of dyslipidemias identified through genetics

Primary hypobetalipoproteinemia includes abetalipoproteinemia (ABL) and chylomicron retention disease, which have a recessive form of inheritance, and familial hypobetalipoproteinemia, which has a dominant form of inheritance.34 ABL is an extremely rare recessive disease caused by mutations in the microsomal triglyceride transfer protein (MTP) gene, and chylomicron retention disease is also an extremely rare recessive disease caused by mutations in SARA2 genes. The genetic backgrounds of familial hypobetalipoproteinemia have been shown to be loss-of-function mutations in the genes for apolipoprotein B (ApoB) and proprotein convertase subtilisin/kexin type 9 (PCSK9). It is of note that investigations of such rare variants in extreme cases have led to the development of novel drugs targeting the general population.

ApoB

ApoB is the primary apolipoprotein of very-low-density lipoprotein, intermediate-density lipoprotein, LDL and lipoprotein(a) particles. Mipomersen, which inhibits the synthesis of ApoB, has been developed based on those backgrounds.35 In addition to a number of clinical trials of the effect of mipomersen in patients with heterozygous familial hypercholesterolemia,36, 37 it has been shown that mipomersen is effective even in patients with homozygous familial hypercholesterolemia, where the effect of statins is limited.38 In 2013, the US Food and Drug Administration approved this drug for homozygous familial hypercholesterolemia, despite its relatively frequent adverse effect of elevated liver enzymes associated with fatty liver. From the viewpoint of genetics, it had already been shown that patients with familial hypobetalipoproteinemia caused by ApoB loss-of-function mutations have low LDL cholesterol as well as elevated liver enzymes associated with fatty liver.39, 40 Accordingly, such on-target/off-target effects had been already estimated based on observations of patients with familial hypobetalipoproteinemia and ApoB mutations.

MTP

MTP has a pivotal role in the assembly and secretion of apoB-containing lipoproteins in the liver and intestine. Lomitapide binds MTP and inhibits the synthesis of apoB-containing lipoproteins in the intestine and the liver, leading to a reduction of plasma LDL cholesterol levels.41, 42 Deleterious mutations in the MTP gene cause ABL, a rare, autosomal recessive inherited disease.43 In patients with ABL, very-low-density lipoprotein and LDL levels are significantly reduced. Mimicking this disease by inhibiting MTP has attracted a lot of attention as a novel pharmacological strategy. However, this drug seems to cause adverse effects, including diarrhea, steatorrhea, and fatty liver, as seen in patients with ABL harboring mutations in the MTP gene.

PCSK9

In 2003, PCSK9 was recognized from the discovery of causal mutations in this gene in families with severe hypercholesterolemia.44 Since its recognition, PCSK9 has been regarded as a third cause of autosomal dominant hypercholesterolemia. Loss-of-function mutations in the PCSK9 gene have been shown to be associated with lowering of LDL cholesterol and reduction of CAD risk.45, 46 These findings have facilitated the development of PCSK9 inhibitors aiming to reduce plasma LDL cholesterol and CAD.47 Considering the robust evidence from human genetics, including the fact that the subject harboring loss-of-function mutations in the PCSK9 gene is apparently healthy without liver dysfunction,48 this drug seems to be promising in terms of reducing the risk of CAD with fewer adverse effects than those from ApoB/MTP inhibitors.

Rare variants (family-based studies), common variants (population-based studies) and clinical trials

As has been repeatedly shown in the success stories of the discoveries of rare variants in extreme cases that have led to the development of novel drugs for the general population, clinical trials using drugs with consistent directions of on-target/off-target effects between rare variants and common variants are likely to succeed (Figure 4).

Figure 4
figure 4

Schema of directions of on-target/off-target effects among rare inherited diseases, common heritable diseases and randomized controlled trials. Blue arrow indicates the on-target/off-target effects of rare inherited diseases, which are often large enough to observe in a single case (family). Red arrows indicate the on-target/off-target effects of common heritable diseases, which often require large sample sizes to determine their direction. Green arrows indicate the on-target/off-target effects observed with the use of a drug in a randomized controlled trial.

Contribution of rare variants to lipids and CAD

In addition to the simplified classification between rare variants with critical effects and common variants with small effects, many rare variants with modest effects contribute to lipids and CAD. To identify such contributions, several rare variant association studies investigating rare variants and lipids/CAD using whole-exome sequencing,49 whole-genome sequencing50 and imputation-based approach51 have been performed. These studies have found that (1) rare and low-frequency variants account for 3–4% of the variance explained in lipids, (2) rare and low-frequency variants in ApoB-related genes are associated with CAD and (3) rare and low-frequency variants in LDL receptor gene and apolipoprotein A5 gene contribute to the development of early-onset myocardial infarction. These results collectively suggest that rare and low-frequency variants with modest effects contribute to lipids and CAD, although their amounts are smaller than those of common variants.

Different ethnicity, different variants

Based on the recent growth in the worldwide population, there are several rare and private variants. Such rare variants tend to be specific to ethnicity compared with common variants. It is true that the distribution of lipid trait-related variants could be different among different ethnic groups, but the major driver of this difference could be allele frequency as reported in previous association studies using common and rare variants in different ethnic groups.52, 53 Accordingly, we believe that it is still possible to discover novel genes associated with lipids by well-designed rare variant association studies, especially in non-European populations.

Identifying lipid-related causal genes

Genome-wide association studies using common variants have identified more than 100 different loci associated with lipids till date. However, it is still challenging to detect causal genes/variants identified by genome-wide association studies located outside the coding region. In this regard, referring to a Mendelian type of disease or to a follow-up study focusing on rare coding variants seems to be a good approach.

Conclusion

Dyslipidemias, especially hyper-LDL cholesterolemia and hypertriglyceridemia, are important causal factors of CAD. Comprehensive genotyping in Mendelian dyslipidemias as well as Mendelian randomization studies using common genetic variants have contributed to our understandings of lipids and CAD. Investigation of extreme cases caused by rare genetic variants can lead to the development of novel drugs for the general population, especially when the on-target/off-target effects of rare variants and common variants exhibit the same direction. Such a strategy could aid in the search for drug discovery seeds other than those for dyslipidemias.