Introduction

Recent large-scale association studies identified a number of novel susceptibility genes to systemic lupus erythematosus (SLE),1, 2, 3, 4, 5 among which many have been replicated in independent studies. However, genetic effect of each susceptibility allele is generally modest; with the exception of HLA-DRB1, none reached the allelic odds ratio (OR) of 2. Thus, the effect of each susceptibility allele is not sufficient to identify the high-risk individuals when used alone.

One possible approach to solve this problem is to combine multiple risk allele genotypes of each individual into a genetic risk score. Such an approach has been employed in type 2 diabetes,6, 7 cholesterol levels and cardiovascular diseases,8, 9 prostate cancer10 and rheumatic diseases such as gout,11 rheumatoid arthritis12 and systemic sclerosis.13

Our group has been intensively working on the susceptibility genes to SLE in a Japanese population. Thus far, we replicated association of HLA-DRB1*15:01, STAT4 rs7574865T, BLK rs13277113A, TNFAIP3 rs2230926G and TNIP1 rs7708392C in Japanese, which had previously been reported in the Caucasian or Chinese populations.14, 15, 16, 17, 18 In the process of these studies, we noted that the genetic contribution of STAT4, BLK and TNIP1 was greater in the Japanese compared with the Caucasian populations, due to population differences in the risk allele frequencies.15, 16, 18

With respect to IRF5, we found that the initially reported Caucasian risk haplotype carrying rs2004640T, exon 6 insertion and rs10954213A19 was nearly absent in Japanese20 Moreover, although the association of rs2004640 was also observed in Japanese, another single nucleotide polymorphism (SNP) rs41298401, which is monomorphic in Caucasians, was more strongly associated with SLE.20

In FCGR2B, we identified association of a SNP rs1050501, which alters Ile at position 232 to Thr in the transmembrane region.21 This association was replicated in multiple Asian populations.22, 23, 24 This amino acid substitution was associated with reduced localization of inhibitory Fcγ receptor IIb molecule in the lipid raft, thereby resulting in hyperactivation of B cells25 and monocytes.26

In the case of TNFSF13 (APRIL), Koyama et al.27 and our group28 independently demonstrated association of rs11552708 (Gly67Arg) with SLE in Japanese. This association was also replicated in a study on mixed European-American, African-American and Hispanic populations.29 Thus, TNFSF13 is also an established susceptibility gene to SLE.

All of the above mentioned eight susceptibility genes have been independently replicated, and ORs of the risk genotypes are greater than 1.3 in Japanese.14, 15, 16, 17, 18, 20, 21, 28 In the case of SLE, a recent study from the United States reported association of cumulative risk allele number at 19 susceptibility loci with the age of onset among the patients;30 however, association with genetic risk for the occurrence of SLE as compared with healthy controls was not analyzed.

In this study, we combined the genotype data at the eight susceptibility loci established in the Japanese, and tested the association of the cumulative risk allele number with susceptibility to SLE. When the subjects carrying seven risk alleles were used as a reference, the OR for individuals carrying 10 and 11–13 risk alleles were 4.17 and 8.77, respectively. In contrast, subjects with four or less risk alleles were significantly decreased in SLE (OR 0.15). In addition, the proportion of the patients with neurologic disorder was significantly increased in those carrying 10 risk alleles than those with <10. Thus, this study suggested that the cumulative number of risk alleles could potentially be useful for distinguishing groups with high and low genetic predisposition to SLE and its severe manifestation.

Materials and methods

Patients and controls

A total of 282 Japanese female patients with SLE and 222 healthy female controls were recruited at University of Tsukuba, Juntendo University and the University of Tokyo. To exclude potential bias caused by gender-associated difference in the genetic factors for SLE, male patients and controls were excluded from this study. All patients and healthy individuals were native Japanese living in the central part of Japan. All patients fulfilled the American College of Rheumatology criteria for SLE.31 Presence or absence of renal and neurologic disorders was also classified based on the American College of Rheumatology criteria.31 This study was reviewed and approved by the research ethics committees of University of Tsukuba, Juntendo University and the University of Tokyo. Informed consent was obtained from all study participants. This study was conducted in accordance with the principles expressed in the Declaration of Helsinki.

Genotyping

Genotyping of HLA-DRB1 was performed by PCR-microtiter plate hybridization (MPH) technique or by Luminex Multi-Analyte Profiling system (xMAP) with WAKFlow HLA typing kit (Wakunaga, Hiroshima, Japan) as described elsewhere.32 IRF5 rs41298401 genotype was determined by direct sequencing.20 FCGR2B rs1050501 was genotyped using nested PCR and oligonucleotide hybridization as previously described,21 or direct sequencing using the sequencing primer annealing at intron 5 (5′-AATACGGGCCTAGATCTGAATGTG-3′) described by Li et al.22 after PCR using primers placed at exon 4 (5′-AAGGACAAGCCTCTGGTCAA-3′) and exon 7 (5′-CCCAACTTTGTCAGCCTCAT-3′).21 TNFSF13 rs11552708 was genotyped using PCR-single strand conformation polymorphism or direct sequencing, as previously described.28 Other SNPs were genotyped using the TaqMan genotyping assay (Applied Biosystems, Foster City, CA, USA), as previously described.15, 16, 17, 18 Identity of the genotyping results were confirmed when two genotyping systems were used for a locus.

The genotyping data of the cases and controls were derived from our previous studies.14, 15, 16, 17, 18, 20, 21, 28 In addition, when necessary, additional genotyping was performed for individuals in whom any of the eight loci had not been genotyped.

Statistical analysis

Association of each susceptibility allele with SLE in the cases and controls analyzed this study was performed by Fisher’s exact test using 3 × 2 contingency table. Gene–gene interaction was examined between each pair of genes using logistic regression analysis under the co-dominant model, where risk allele homozygotes xi=2, heterozygotes xi=1 and non-risk allele homozygotes xi=0 for gene i. The logistic regression model for interaction between gene i and gene j was given by

The deviation from 0 was evaluated for βij by the Wald test.

The increase in the risk by carrying an additional risk allele was calculated using logistic regression analysis. The risk allele count and the disease status were used as independent and dependent variables, respectively.

The statistical significance of the difference in the average number of the risk alleles in cases and controls was examined using Student's t-test (two-tailed). When the OR and 95% confidence interval (CI) was calculated for individuals carrying different number of risk alleles, subjects carrying seven alleles, the median of the risk allele number in healthy controls, were used as a reference. The statistical significance was analyzed using Fisher's exact test.

The proportion of the patients with renal disorder, neurologic disorder or early onset (<20 years) was compared between those with 10 and <10 risk alleles using χ2-test.

Results

Association between cumulative risk allele number and susceptibility to SLE

We selected eight established susceptibility alleles, HLA-DRB1*15:01, IRF5 rs41298401C, STAT4 rs7574865T, BLK rs13277113A, TNFAIP3 rs2230926G, TNIP1 rs7708392C, FCGR2B rs1050501C and TNFSF13 rs11552708G, based on the following criteria: (1) at least one report of significant association in Japanese, (2) OR of the risk genotype >1.3 and (3) at least one independent replication.

These alleles were genotyped in 282 Japanese female patients with SLE and 222 healthy female controls. Table 1 shows the genotype frequencies of the examined loci in the patients and healthy controls. To exclude potential influence from gender-associated difference in the genetic background, we focused on female patients and controls. As a result, the sample size of this study was smaller than our previous studies where each locus was examined individually.14, 15, 16, 17, 18, 20, 21, 28 This resulted in lack of statistically significant association of TNFAIP3, TNIP1 and TNFSF13 in the subjects analyzed here.

Table 1 Genotype frequencies of each susceptibility gene with systemic lupus erythematosus in the Japanese female patients and controls examined in this study

The gene–gene interaction was examined between each pair of two loci (28 pairs) using logistic regression analysis under a co-dominant model. Statistically significant gene–gene interaction was not observed, except for a marginal interaction between BLK and FCGR2B (uncorrected P=0.04).

Table 2 shows the distribution of the cumulative risk allele number of each subject in the patients and healthy controls. Risk allele number was distributed between 4–13 in the patients and 3–11 in the controls. The average number of the risk alleles was significantly increased in the patients (8.07±1.60 alleles) as compared with controls (7.02±1.64 alleles) (P=1.63 × 10−12, Student's t-test).

Table 2 Numbers and percentages of the patients and healthy controls carrying each cumulative risk allele number

Figure 1 shows the OR and 95% CI for each group with increasing number of risk alleles, as compared with the individuals carrying seven risk alleles, the median risk allele count in the control group. In this analysis, groups with 11–13 and 3–4 risk alleles were combined due to the small number of the subjects in each group (Table 2). Because apparent gene–gene interaction was not observed, the increase in the risk by carrying an additional risk allele was estimated using logistic regression analysis, where the risk allele count and the disease status were used as independent and dependent variables, respectively. This analysis indicated that each additional risk allele increased the OR by 1.50 fold (95% CI: 1.33–1.69, P=4.5 × 10−11).

Figure 1
figure 1

Odds ratios and 95% confidence intervals of the individuals carrying increasing numbers of risk alleles in eight SLE susceptibility loci. The reference group was the individuals with seven risk alleles, which was the median of the healthy controls. Statistical significance was tested using Fisher's exact test. Power calculation using computer simulation (1000 runs) demonstrated that the power to detect statistical significance at the significance level of 0.05 was 87.4% (3–4 alleles), 26.8% (5 alleles), 13.0% (6 alleles), 69.3% (8 alleles), 67.9% (9 alleles), 98.3% (10 alleles) and 94.9% (11–13 alleles), respectively.

When compared with individuals carrying seven risk alleles, those with eight or more risk alleles was associated with significantly elevated OR (P<0.05, Fisher's exact test) (Figure 1). Specifically, the ORs of individuals with 10 and 11–13 risk alleles were 4.17 (CI 1.89–9.19, P=0.0002) and 8.77 (CI 1.92–40.0, P=0.0016), respectively.

On the other hand, the OR in individuals having four or less risk alleles was significantly decreased in SLE (OR 0.15, CI 0.03–0.67, P=0.0070). When individuals carrying 11 or more risk alleles were compared with those carrying four or less alleles, the OR was 60 (CI 7.48–481.6, P=5.2 × 10−6).

Power calculation was performed using computer simulation (1000 runs). This sample size provided 95.7% (seven versus eight or more alleles), 98.3% (seven versus 10 alleles), 94.9% (seven versus 11–13 alleles), 87.4% (seven versus 3–4 alleles) and 99.8% (3–4 versus 11–13 alleles) power to detect statistical difference at the significance level of 0.05.

Association between cumulative risk allele number and clinical subsets

We next examined whether the cumulative risk allele number is associated with severe clinical phenotypes such as renal disorder and neurologic disorder, as well as with early onset of SLE. First, we compared the mean risk allele number between the patients with and without each clinical feature. The mean number of risk alleles showed a tendency towards increase in the patients with renal disorder (8.21±1.57 alleles, n=156) as compared with those without (7.90±1.64 alleles, n=125), in those with neurologic disorder (8.39±1.96, n=41) than those without (8.02±1.53, n=232) and in the patients with early onset (<20 years) (8.10±1.42, n=72) than those with later onset (20 years) (8.07±1.66, n=208), although none of the differences reached statistical significance (P>0.05, Student's t-test).

We then compared the proportion of each clinical characteristics between the patients carrying 10 risk alleles and those with <10. As shown in Table 3, significant increase of neurologic disorder was observed in the patients carrying 10 risk alleles.

Table 3 Comparison of clinical phenotypes between SLE patients with 10 and <10 risk alleles

Discussion

Attempts to develop a marker for estimating the risk of an individual by using cumulative risk allele number at multiple loci have been reported in various complex diseases.6, 7, 8, 9, 10, 11, 12, 13 In this study, by combining the number of risk alleles at eight susceptibility loci, each having rather small allelic OR, it became possible to distinguish high and low risk groups with up to 60-fold difference in the genetic predisposition to SLE. However, it should be noted that the model proposed in this study is, at this point, not sufficient for the purpose of predicting disease onset or identifying a high-risk group that requires preventive intervention. This is because of the low incidence of SLE, lack of measures for prevention, as well as the limitations of the study approach as described later.

In this study, we simply counted the number of risk alleles of multiple susceptibility loci, Our model could potentially be improved by weighting each risk allele according to the size of genetic effects (ORs) as described in the study of RA,12 or taking the appropriate genetic model (for example dominant, co-dominant or recessive) into account. However, the genetic effect of each allele and the model that provides the best fit sometimes vary considerably among studies. In addition, in most of the susceptibility genes, the causative allele, which accounts for the molecular mechanism of SLE has yet to be identified. The ORs of the unidentified causative alleles may be different from those observed for the ‘proxy’ SNPs. Therefore, we employed a model-free approach to simply count the number of risk alleles. Our data suggest that even this simple approach could potentially provide a useful parameter to estimate genetic predisposition to SLE of each individual. With establishment of many more susceptibility genes, this model is expected to improve its power to identify individuals with elevated genetic risk to develop SLE.

There are several limitations in this study. Because of the low prevalence of SLE, the sample size of this study is small, which resulted in the wide range of CIs in the subjects carrying 10 or more risk alleles. Although significant gene–gene interaction was not detected in every possible combination of two genes, it would be intriguing to examine the combined effects of specific risk genotypes at three or more susceptibility loci. However, such a study requires much larger number of subjects than this study. For these reasons, we admit that this study is somewhat preliminary and the results should be validated in a larger sample set.

One approach to increase the sample size in the study of rare diseases is to organize a large-scale international collaboration. However, we focused on the Japanese population, because we wanted to find a biomarker useful in clinical practice or preventive medicine in Japanese. The genetic background has been shown to be slightly different even among the closely-related East Asian populations.33, 34 Therefore, the next step will be to validate our results in a larger sample size through collaboration with multiple SLE centers in Japan.

Another possibility is the use of GWAS data of Japanese SLE. Although GWAS of Japanese SLE has not been reported yet, once the data become available, it might be possible to impute the genotypes at the susceptibility genes analyzed in this study to compare cumulative risk allele numbers between a large number of patients and controls.

It should be noted that the set of eight genes in this study was based on the Japanese data, and is not directly applicable to other populations. For example, the risk alleles of PTPN2235 and ITGAM1, 2 as well as HLA-DRB1*03:01,36 established risk factors for SLE in the Caucasian populations, are absent or very rare in the Japanese population. Thus, similar analysis should be carried out in each population, using the set of established susceptibility genes in the target population.

It should also be emphasized that this study considered only the SLE susceptibility genes that have already been replicated in the Japanese population. Recently reported SLE susceptibility genes, which have yet to be replicated in Japanese such as TNFSF44, 5, 37 were not included. For the same reason, we did not include the multiple SNPs in the HLA region, which were shown to be independently associated with SLE in other populations.38 Therefore, our current model represents only a proportion of the whole genetic contribution to SLE, and should be improved in the future by incorporating additional susceptibility genes when their association was replicated in the Japanese.

The cumulative risk allele number might potentially be useful for the prediction of disease phenotype among the patients. In a recent study in the United States, the risk allele number was associated with early onset of SLE, which in turn was associated with severe disease.30 In our data, the association between the risk allele number and early onset was not detected. However, of particular interest, the patients carrying 10 risk alleles exhibited more than twofold risk for developing neurologic disorder, one of the severe manifestations of the disease, than those with 10 risk alleles. Such information would be useful for personalized medicine; for example, more aggressive treatment may be considered for the patients with increased number of risk alleles.

The cumulative risk allele number may also be useful for research purpose. For example, longitudinal studies of healthy individuals carrying an increased number of risk alleles may efficiently identify environmental triggers or epigenetic modifiers that lead to the development of SLE. It would also be of interest to test association with the presence of antinuclear antibodies among healthy individuals.39