Introduction

The novel coronavirus disease 2019 (COVID-19), caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has spread rapidly since its emergence in December 2019 and has affected hundreds of millions of people worldwide1. The development and spread of safe and efficacious vaccines are expected to be the key to controlling the COVID-19 pandemic. According to the World Health Organization, several vaccines have been developed worldwide to prevent the spread of COVID-192. mRNA-based vaccines made by Pfizer/BioNTech (BNT162b1) and Moderna (mRNA-1273) are among the most widely used in Japan and other parts of the world. As of November 29, 2021, the Japanese government estimated that 97.2 million people had been fully vaccinated (received two doses of either vaccine) in Japan, representing 76.7% of the country’s population3.

The most common side effects reported following the COVID-19 mRNA vaccination were injection site reactions, fatigue, headache, and myalgia. These side effects are reported to be mild to moderate and last for a couple of days4. Other local and systemic side effects of the vaccination include swelling, chills, joint pain, fever, redness, itching, nausea, diarrhea, abdominal pain, rash outside the injection site, and vomiting4. Serious side effects such as mild allergic reactions and anaphylaxis are rare but have been reported5,6. Side effects are signs of a common immune response to the vaccine and vary with age, sex, and ethnicity7,8. Interestingly, an elevated risk of myocarditis following the COVID-19 mRNA vaccination was observed in males aged 12–29 years9. HLA alleles have also been reported to be associated with adverse events following the COVID-19 mRNA vaccination10,11. However, to date, the mechanism underlying the development of side effects remains unclear, and there is still a lack of reports from non-European ethnicities.

One crucial challenge is to elucidate the factors that induce adverse events in response to the COVID-19 vaccine to understand its detailed mechanism of action. In this study, we hypothesized that individual differences in response to the COVID-19 vaccine may be explained, in part, by genetic differences. Therefore, to clarify the possible mechanisms by which host genetic variations might affect the COVID-19 vaccine treatment response, a genome-wide association study (GWAS) was performed in the Japanese population.

Materials and methods

Study subjects

The data were obtained through the Japanese direct-to-consumer (DTC) genetic testing services “Genequest ALL” and “Euglena MyHealth”, which are provided by Genequest Inc. (Tokyo, Japan) and Euglena Co., Ltd. (Tokyo, Japan), respectively. We asked subjects who were aged ≥ 18 years and who gave consent to participate in the study to answer internet-based questionnaires about COVID-19 vaccine adverse events. All the participants provided written informed consent for the general use of their genetic data for research. Before participating in this study, information on the study’s aim was sent to the participants and an additional study-specific agreement was obtained by opt-in. The study was conducted in accordance with the principles of the Declaration of Helsinki. The study protocol was approved by the Ethics Committees of Genequest Inc. (IRB no. 2021-0633-4) and Tohoku University Graduate School of Medicine (IRB no. 2021-1-469).

DNA sampling, genotyping, quality control, and genotype imputation

Saliva samples were collected, stabilized, and transported using an Oragene DNA Collection Kit (DNA Genotek Inc., Ottawa, Ontario, Canada) or GeneFix Saliva DNA Collection (Cell Projects Ltd., Harrietsham, Kent, UK). Genotype analysis was performed using the Illumina Infinium Global Screening Array v1 + Customs BeadChip (Illumina, San Diego, CA, USA), which contains 704,589 markers; Infinium Global Screening Array-24 v3.0 + Customs BeadChip, which contains 655,471 markers; HumanCore-12 + Customs BeadChip, which contains 302,073 markers; HumanCore-24 + Customs BeadChip, which contains 309,725 markers; and InfiniumCore-24 + Customs BeadChip, which contains 308,500 markers. Because the analyzed single nucleotide polymorphism (SNP) sets were very different among the genotyping chips used, the subjects were divided into two groups depending on the type of genotyping chip: those analyzed using the former two chips (595,105 common markers) and those analyzed using the latter three chips (289,930 common markers). These are referred to as populations A and B, respectively. Quality control and association analysis procedures were performed separately for each cohort.

For the quality control analysis, we filtered out the SNP markers. The parameters were as follows: call rate per SNP < 0.95; Hardy–Weinberg equilibrium exact test p-value < 1 × 10−6; minor allele frequency < 0.01; SNPs not in autosomes. We also excluded subjects based on the following parameters: inconsistent sex information between the genotype and the questionnaire, call rate per subject < 0.95, closely related pairs determined using the identity-by-descent method (PI_HAT > 0.1875), and estimated non-Japanese ancestry. Quality control analyses were performed using PLINK12,13 (version 1.90b3.42) and Eigensoft14 (version 6.1.3).

Genome-wide genotype imputation was performed using a pre-phasing/imputation stepwise approach implemented in EAGLE215 (version 2.4) and Minimac316 (version 2.0.1). The imputation reference panel was 1000 Genomes Phase 317 (version 5). Variants with low imputation quality (R2 < 0.3) and minor allele frequency (< 0.05) were excluded from further analyses. Finally, we used dosage data for the common 5,930,410 variants for the GWAS in populations A and B.

Adverse events measurement

We provided internet-based questionnaires regarding the COVID-19 vaccine to the study participants. First, they were asked for the manufacturer’s name of the COVID-19 vaccine. Next, they answered questions regarding the local or systemic reactions they experienced after the first and/or second COVID-19 vaccination. The questionnaire provides 53 options of adverse events reported to occur at a frequency of 0.1% or more in the Japanese subjects18; the study participants selected all options that applied to themselves. For each listed adverse event, participants who selected an option were categorized as 'cases', while those who did not select it were categorized ‘controls’. Detailed information on the questionnaires is provided in Table S1.

Genome-wide association and meta-analysis

The association between genotype dosage and the occurrence of COVID-19 vaccine adverse events was examined using a logistic regression model under the assumption of additive genetic effects. For each population, GWAS was performed, with adjustments for age and sex, using PLINK (version 2.00a3).

We combined the statistical data from both populations using a fixed-effects model and the inverse-variance weighting method with the METAL software19 (version 2011-03-25). Variants achieving genome-wide significance (p < 5.0 × 10−8) in the meta-analysis were considered to be associated with the occurrence of COVID-19 vaccine adverse events.

HLA imputation and association analysis

HLA types for A, B, C, DPB1, DQA1, DQB1, and DRB1 were imputed using the HIBAG software20,21 with default recommendations using the HIBAG Asian reference. Posterior probabilities > 0.5 were used as genotype calls. The association between HLA type and the occurrence of COVID-19 vaccine adverse events was examined using the PyHLA software22 (version 1.0.0) with a logistic regression model under the assumption of additive genetic effects with adjustments for age and sex. We performed an HLA association analysis for each population and combined statistical data from both populations using a fixed-effects model and the inverse-variance weighting method with the METAL software19. We analyzed 95 HLA alleles at a frequency > 5%. For the multiple testing correction, HLA alleles achieving p < 5.26 × 10−4 (0.05/95) were considered to be associated with the occurrence of COVID-19 vaccine adverse events.

Results and discussion

Study subjects and the occurrence of COVID-19 vaccine adverse events

The participants’ characteristics are shown in Table 1. The population vaccinated with the mRNA-1273 vaccine was older and had a greater prevalence of females when compared to the population vaccinated with BNT162b1. This difference may be a result of the Japanese vaccination circumstances, where the BNT162b1 vaccine was approved first and administered to vaccination-priority targets such as elderly people and healthcare workers. The occurrence of adverse events following COVID-19 vaccination is shown in Table1 and Table S2. Systemic reactions were more prevalent after the 2nd vaccination dose than after the 1st dose (44% and 71% for BNT162b1 and 57% and 96% for mRNA-1273, for the 1st and 2nd doses, respectively). This is consistent with the results of previous studies8,23. Compared to the report from the Japanese Ministry of Health, Labour, and Welfare18, most of the adverse events had occurrences with a range of difference of ˂ 5%; 88% and 82% for the BNT162b1 vaccine, and 79% and 62% for the mRNA-1273 vaccine, for the 1st and 2nd dose, respectively (Table S2). The adverse event occurrences were very different between BNT162b1 1st dose, 2nd dose and mRNA-1273 1st dose, 2nd dose; the following analyses were performed after stratification by the kind of vaccine and the dose. Table S3S6 showed that adverse events were correlated with each other and were not independent.

Table 1 Characteristics of the subjects included in the entire present study.

Incidence of COVID-19 vaccine adverse events with respect to sex and age differences

Previous studies7,23 reported that women and younger people have a higher risk of adverse events following COVID-19 vaccination. Consistent with these studies, the present study also showed a higher risk in females (p-value < 0.05; 51% and 63% in BNT162b1, 39% and 53% in mRNA-1273, for the 1st and 2nd doses, respectively) and younger people (p-value < 0.05; 51% and 63% in BNT162b1, 31% and 41% in mRNA-1273, for the 1st and 2nd doses, respectively) (Tables S7 and S8).

GWAS for COVID-19 vaccine adverse events

We performed a GWAS for each population and a meta-analysis of the adverse events. Due to the absence of cases in either population, we were unable to perform GWAS for the following conditions: constipation at BNT162b1 1st dose, dyspnea at BNT162b1 1st and 2nd dose and mRNA-1273 1st dose, eczema at BNT162b1 1st dose, and sneeze at mRNA-1273 1st dose. We identified 14 loci associated with the adverse events in response to the COVID-19 vaccine at the genome-wide significance level (p-value < 5 × 10−8), for the 1st or 2nd dose of the BNT162b1 or mRNA-1273 vaccines (Table 2, Table S9, Figure S1). Associations between rs9266082 and higher fever and rs13279405 and chest pain were found with a p-value < 0.05, for both the BNT162b1 and mRNA-1273 vaccines. However, other variants were differentially associated with the BNT162b1 and mRNA-1273 vaccines. Two hypotheses could explain this: genetic susceptibility to adverse events may differ between the two COVID-19 mRNA vaccines or the populations may have differential statistical power because of differential sample size and adverse event occurrence. In these associated loci, 6p21 (rs551634406, rs183300, rs9266082, rs375726766, rs3135408) was associated with a 37.5 °C or higher fever, 38 °C or higher fever, and muscle pain. Since 37.5 °C or higher fever and 38 °C or higher fever were correlated (Peason correlation coefficients = 0.578, Table S4), the lead variants associated with these two adverse events, despite being different, were located in the same genetic loci.

Table 2 Loci identified by the meta-analysis of COVID-19 vaccine adverse events.

The annotations of the associated variants are listed in Table 324,25. The occurrence of adverse events in response to COVID-19 vaccines was previously reported to differ among ethnicities. The prevalence of fatigue as a reaction to the BNT162b2 vaccine was 59% and 69%, and fever was 16% and 38% in European and Japanese populations, respectively8,18,23. The prevalence of fatigue as a reaction to the mRNA-1273 vaccine was 68% and 80%, and fever was 17% and 77% in European and Japanese populations, respectively8,18,23. In our study, the allele frequencies of eight out of the 14 variants (rs10744866, rs146922515, rs551634406, rs183300, rs375726766, rs13279405, rs34086990, and rs3135408) differed by more than 10% between European and Japanese populations, suggesting the possibility that a difference in genetic backgrounds may influence the occurrence of adverse events in response to COVID-19 vaccines. These loci, especially 6p21, are associated with the expression of many genes, according to the Genotype Tissue-Expression (GTEx) database25. Therefore, these genes may influence the mechanism of action of COVID-19 vaccines. HLA genes’ (HLA-B, C, DPA1) mRNA expressions differed among the genotypes of the associated loci. HLA alleles have also been associated with adverse events following COVID-19 mRNA vaccination10. HLA genes have been reported to be associated with adverse events after the administration of various vaccines26 and drugs27. Increased NOTCH4 expression in the circulating regulatory T cells of COVID-19 patients was associated with disease severity and predicted mortality28. The expression of RPS18 was previously found to increase in isolated T cells upon stimulation with the live influenza virus29. A variant in BAK1 and A haplotypes of MICB have been associated with dengue hemorrhagic fever caused by the dengue virus30,31. A variant in PSORS1C1 has been associated with severe allopurinol-induced adverse reactions32,33. Variants in HSP70, TAPBP, and WDR46 were found to be associated with aspirin-exacerbated respiratory disease34,35,36.

Table 3 Allele frequency and expression quantitative trait locus (eQTL) genes of the loci associated with the COVID-19 vaccine adverse events.

HLA association with the COVID-19 vaccine adverse events

We performed an HLA imputation analysis and used high quality alleles (posterior probabilities > 0.5). The frequency of HLA alleles and sample size in our data were shown in Table S10. Subsequently, we performed an HLA association analysis of the adverse events. We identified 14 HLA alleles associated with the adverse events in response to the COVID-19 vaccine at the significance level (p-value < 5.26 × 10−4), for the 1st or 2nd dose of the BNT162b1 or mRNA-1273 vaccines (Table 4, Table S11). HLA-DQA1*03:01 was associated with chills following the 2nd dose of mRNA-1273 and BNT162b1 in our study and was also reported to be associated with the adverse events of BNT162b1 in a European study11. This allele was considered replicated and more reliably associated. However, our study did not analyze HLA-A*03:01, the top-associated allele in a European study11, because of its low allele frequency (less than 0.005). As allele frequencies are far different between ethnicities, there may have been several alleles that were significantly associated with our study but were not replicated in other ethnicities. HLA-A*11:01 was associated with sneeze following the 1st dose of BNT162b1 and mRNA-1273 in our study and was also reported to be associated with severe COVID-19 in a Japanese study37. The risk allele was shared between the adverse events of the COVID-19 vaccine and severe COVID-19, which suggests that the allele and COVID-19 are strongly related. HLA-B*15:02 and HLA-B*15:35 were associated with adverse events in our study, and HLA-B*15:01 was associated with asymptomatic COVID-19 in a European study38. Therefore, HLA-B*15 might play an important role in COVID-19. However, HLA-B*15:01 was not significantly associated with adverse events in our study (Table S11). We confirmed the association between two reliable HLA alleles (HLA-DQA1*03:01 and HLA-A*11:01) and all adverse events (Fig. 1, Figure S2). The risk for several adverse events were similarly elevated; however, due to the low frequency of the HLA alleles and the infrequent occurrence of adverse events, the associations between the two HLA allele and adverse events had wide confidence intervals and were not statistically significant. To examine whether associated HLA alleles and variants were correlated, we analyzed the correlation coefficients between HLA alleles and variants (Table S12). As a result, these variants and DPB1, DQA1, DQB1 alleles were moderately correlated and the top correlation coefficient was -0.461 between rs375726766 and DPB1*09:01.

Table 4 HLA alleles identified by the HLA association analysis of the COVID-19 vaccine adverse events.
Figure 1
figure 1figure 1

Odds ratios and 95% confidence intervals of the two reliable HLA alleles (HLA-DQA1*03:01 and HLA-A*11:01) for the occurrence of all adverse events.

Limitations

This study had some limitations. Initially, 53 types of adverse events were included; however, multiple test corrections were not performed because the adverse events were correlated. Therefore, the associated loci that we identified could include false positives, and replicate studies are required. Second, our data on the adverse events were based on web-based self-reports and might have been affected by recall bias. Third, our data might include subjects who responded to the questionnaire before experiencing any adverse events. However, the occurrence of adverse events was similar to that reported in a large-scale survey performed in Japan18. Thus, our adverse event data are considered reliable. Finally, our GWAS was based only on the Japanese population. Therefore, our results may not be directly applicable to individuals of other ethnicities.

Conclusions

In this study, we performed GWAS for adverse events following the COVID-19 vaccination. To the best of our knowledge, this study is the first of its kind to focus on East Asian populations. We identified 14 loci associated with the adverse effects of the COVID-19 vaccines in the Japanese population. HLA allele association analysis revealed that various HLA alleles were associated with these adverse effects; HLA-DQA1*03:01 and HLA-A*11:01 were reliably associated with the adverse events. We discovered that genetic background is associated with the susceptibility to experiencing adverse events following the COVID-19 vaccination. Our results may enable the preparation and management of the occurrence of adverse events based on their susceptibility. Furthermore, we obtained valuable basic data that can be used to investigate the mechanisms of action of COVID-19 vaccines.