Introduction

IgG4-related disease (IgG4-RD) is a systemic disorder involving a spectrum of multiple indications, distinguished by often elevated levels of serum IgG4, infiltration of IgG4+ plasma cells into target tissues, and diffuse swelling, mass formation, or fibrosis of affected organs1. This disease affects men approximately two-fold more often than women and age at diagnosis ranges from 50 to 70 years2. Most patients do respond to steroids initially, although relapse is observed in up to 47% cases3,4,5. Various histopathological features are shared among different IgG4-RD indications, which challenge diagnosis, although certain syndromes have organ-specific involvement6. Some examples include: Mikulicz’s disease affecting the salivary and lacrimal glands, autoimmune pancreatitis affecting the pancreas, Riedel’s thyroiditis affecting the thyroid, and Morbus Ormond or retroperitoneal fibrosis (RF), affecting tissue in the retroperitoneum, to name a few2.

Beyond the evidence of certain genetic risk factors7,8,9,10,11, IgG4-RD is mechanistically thought to be activated by the innate response to pathogens that mimic self-antigen, leading to an autoimmune response2,6. Type 1 helper T cells (Th1) are thought to support innate immune response to infection, which then shifts to type 2 helper T cells (Th2) involvement with increases in expression of IL-4, IL-5, and IL-13 mRNA and protein in both the affected tissue and peripheral compartments12,13,14,15. Th2 adaptive response can affect Th1 response, thus this Th1/Th2 balance is important in regulation1. Regulatory T cells (Tregs) are also activated, with accumulation of CD4+CD25+ T cell infiltrates and abundance of IL-10, FOXP3, and TGF-β112,16,17. The increase of these cytokines promotes eosinophilia in the serum or tissue, high levels of IgG4-producing plasma cells, elevated production of IgE, and fibrosis, with inflammatory cell infiltrates ultimately causing organ damage6.

Recently, studies have utilized transcript profiling in labial salivary glands (LSGs) to identify distinguishing molecular features between IgG4-RD and Sjögren’s syndrome (SS), a disease with common phenotypic elements18,19,20. Among other findings, active involvement of Th2- (IL-4, IL-5, and IL-21), T follicular helper cell (Tfh)- (BCL-6 and CXCR5) and T-reg- (IL-10, FOXP3, CCL18, and TGF-β1) related transcripts in patients with IgG4-RD was observed. These data showed how elevated levels of such cytokines and chemokines can induce IgG4 plasma cell infiltration, high IgG4 levels in the periphery, and impact tissue fibrosis in the LSG of IgG4-RD patients19. However, no studies to date have assessed the differences in molecular pathways or cell populations among IgG4-related disease with salivary gland lesions (RD-SG), without SG lesions (RD-nonSG), and IgG4-related retroperitoneal fibrosis (RF), in the peripheral blood, as well as the effects of corticosteroids on these signaling pathways.

In this study, we used whole transcriptomic sequencing to identify and distinguish both cell and pathway-associated activation in the blood of healthy subjects or those with RD-SG, RD-nonSG, or RF. A large cohort of patients was transcript profiled at a relative baseline time point, with two patients providing additional post baseline flare specimens. To better understand the possible mechanism(s) implicated in the treatments of IgG4-RD, we evaluated the effects of prednisone on the molecular pathways most relevant to disease activity. Additionally, cell-specific gene signatures linking the B and T cell axes were assessed to elucidate cellular involvement, as well as the correlation with IgG4 mRNA levels across the three diseases.

Results

Transcriptome profiles in patients with RD-SG, RD-nonSG, or RF and healthy controls using principal components analysis

Principal components analysis (PCA) was used to elucidate the whole transcriptome profile among the three diseases in relation to healthy subjects (Fig. 1). Though the plot displayed an overlap in disease and control cohorts, there was an apparent difference between the control subjects and disease subjects. Specifically, along the x-axis (principal component 1), controls (red) were the leftmost cohort, followed by the other disease groups. More relevant was the smaller within-disease variability that was apparent in the control and RD-SG (blue) compared to the RD-nonSG cohort (green). The RF cohort (purple) was very small (n = 3), thus the distribution of these points were difficult to interpret.

Figure 1
figure 1

Principal components analysis plot of RD-SG, RD-nonSG, RF, and control subjects using the whole transcriptome.

IgG4 and IgE are the most over-expressed transcripts in RD-SG and RD-nonSG patients, and suppressed by prednisone in RD-SG patients

IgG4 and IgE were identified as two of the most over-expressed transcripts in both RD-nonSG and RD-SG compared to the control cohort (Supplementary Table 1). These two cohorts were stratified by patients who were currently being treated with prednisone. All four patient cohorts had significantly higher mRNA expression of IgG4/Total IgGs and IgE (p ≤ 0.001 for all cohorts; Fig. 2A,B). RD-SG patients treated with prednisone had significant suppression of IgG4/Total IgGs (p = 0.01) and IgE (p = 0.003) mRNAs compared to those not treated, while RF patients showed difference in IgE from controls, though the sample size was small (p = 0.04). IgG4/Total IgGs and IgE mRNAs were highly correlated across the diseases (Fig. 2C; rho = 0.66, p < 9.78 × 10−6).

Figure 2
figure 2

Distribution of IgG4 and IgE mRNAs. (A) Expression of IgG4 scaled by all IgG1, IgG2, and IgG3 transcripts and (B) IgE across control subjects, RD-SG patients on prednisone treatment, RD-SG patients not on prednisone treatment, RD-nonSG patients on prednisone treatment, RD-nonSG patients not on prednisone treatment, RF patients, all patients on predisone treatment, and all patients not on prednisone treatment. (C) Correlation between IgE and IgG4/Total IgG mRNAs for all three diseases. P-values under each disease group indicate comparisons to control and are adjusted by age. Pred +  = currently treated with prednisone; Pred− = not currently treated with prednisone.

A linear model was constructed to identify genes across the transcriptome most correlated with IgG4 mRNA levels. This approach was used to 1) adjust for transcripts modulated by prednisone treatment, and 2) distinguish transcripts unique to one of RD-SG, RD-nonSG, or RF cohorts. Among the top 50 positively and negatively correlated transcripts with IgG4 (p < 0.01), 39/100 were associated with RD-SG (p < 0.01), 28/100 with RD-nonSG (p < 0.01), and 3 with RF (p < 0.01), with 23 being shared between RD-SG and RD-nonSG and 2 associated with RF shared with RD-SG and RD-nonSG cohorts (CCL23 was unique to RF; Supplementary Table 2). This indicates that similar transcripts correlate with IgG4 in all three diseases. Among these top 50 most positively correlated genes with IgG4, regardless of disease (and adjusting for prednisone treatment), the most activated biological categories were immunoglobulin (IGHG1, IGHG3, IGHV1-69, IGHV3-30, IGKV3-20, IGKV3D-15, IGLV2-11, and IGLV3-21), followed by nuclear division/mitosis/replication (CDK1, CDC20, CDC6, DLGAP5, MCM10, RRM2, KIF4A, and TOP2A).

Treg, Th2, eosinophil, and neutrophil gene signatures are over-expressed in RD-SG and RD-nonSG, with a B cell signature suppressed in all diseases

Various cell-specific gene signatures were used to evaluate cell population involvement in the diseases studied here. Interestingly, a Treg gene signature showed significant over-expression in RD-SG and RD-nonSG patients without treatment with prednisone compared to controls (Fig. 3A). Regarding the Th2 cytokine signature, while IL-13 gene showed significant over-expression in only RD-SG patients compared to controls regardless of prednisone treatment, IL-4 gene showed significant elevation in both RD-SG and RD-nonSG patients compared to controls (Fig. 3B and C). The B cell signature was significantly suppressed in the majority of patient cohorts (including RF) (Fig. 3D). The eosinophil and neutrophil gene signatures showed opposite effects between patients with or without prednisone treatment, respectively, in RD-SG. Specifically, compared to patients without prednisone treatment, the eosinophil gene signature was significantly suppressed (Fig. 3E, p = 0.0001 in RD-SG,), whereas the neutrophil gene signature was significantly elevated in RD-SG patients treated with prednisone (Fig. 3F, p = 0.01). Interestingly, a plasma cell gene signature showed no changes in any of the diseases with or without treatment, compared to controls (data not shown).

Figure 3
figure 3

Distribution of cell-specific gene signatures. Expression of gene signatures of (A) Treg, (B) IL-13, (C) IL-4, (D) B cell, (E) eosinophil, and (F) neutrophil across control subjects, RD-SG patients on prednisone, RD-SG patients not on prednisone, RD-nonSG patients on prednisone, RD-nonSG patients not on prednisone, RF patients, all patients on prednisone combined, and all patients not on prednisone. P-values under each disease group indicate comparisons to control and are adjusted by age. Pred +  = currently treated with prednisone; Pred− = not currently treated with prednisone.

A molecular characterization of cell-specific gene signatures in RD-SG, RD-nonSG, and RF patients and controls

The molecular characterization of the cell-specific gene signatures, i.e. B, Th1, Th2, Treg, Tfh, and eosinophil cells, were analyzed across case and control cohorts (Fig. 4). From the heatmap, the effect of prednisone on all T cell sub-populations is evident. In general, the gene signatures were down-regulated in all patients treated with prednisone. The pattern seemed more apparent in T cell sub-populations of Th1, Treg, and Tfh in RD-SG patients. A similar pattern was seen in the eosinophil gene signature in both RD-SG and RD-nonSG patients. For the B cell signature, most genes were suppressed in RD-SG and RD-nonSG patients regardless of treatment, compared to healthy controls, though a few patients with active disease showed elevated expression across all genes (red vertical stripes in the B cell signature).

Figure 4
figure 4

Heatmap of cell-specific gene signatures across the control, RD-SG, RD-nonSG, and RF cohorts. Ctrl = control subjects; Pred +  = currently treated with prednisone.

IgG4 and IgE mRNAs, T, B, and eosinophil cell-specific genes/gene signatures differed among RD-SG/RD-nonSG patients with flare or stable status

In addition to a baseline time point, blood was procured at a second time point from six RD-SG/RD-nonSG patients, two of whom experienced a flare. Though the exact time differences between the relative baseline and post baseline visit were not identical for each patient, the general molecular patterns were consistent for the two flare patients and differed for the four stable patients at the second visit (Fig. 5). For each RD-SG/RD-nonSG patient, there was induction at the flare time point in Th1, Th2, Treg, and eosinophil gene signatures. Similar induction at the flare time point was observed in IgG4 and IgE mRNA levels (Fig. 5A–F). In contrast, the B cell signature, as well as two genes associated with Tfh cells (BCL6 and CXCR5), all showed suppressed profiles at the flare time point(Fig. 5G–I). The control cohort was provided in each plot to indicate relative similarity of expression levels to a normal healthy population.

Figure 5
figure 5

Longitudinal gene or gene signature profiles of two patients at relative baseline and post relative baseline visits when a flare was observed. All plots include the control cohort and all the 6 patients for (A) IgG4/Total IgG, (B) IgE, (C) eosinophil gene signature, (D) Th1 gene signature, (E) Th2 gene signature, (F) Treg gene signature, (G) BCL6, (H) CXCR5, and (I) B cell gene signature. Pink = patient experienced flare at visit 2; Blue = patient had stable disease at visit 2.

IgG4 serum levels correlate with cell signatures

Serum levels were correlated with cytokine and cell signatures described previously in RD-SG and RD-nonSG patients. IgG4 correlated with eosinophil and neutrophil gene signatures (rho = 0.44, p = 0.03 and rho = −0.48, p = 0.02, respectively) in RD-SG patients. In RD-nonSG, IgG4 levels correlated only with a B cell signature (rho = 0.74, p = 0.01) (Fig. 6). No other associations between serum levels of IgG4 and gene signatures were observed.

Figure 6
figure 6

Correlation between serum levels of IgG4 and (A) Treg signature, (B) IL13 signature, (C) IL4 signature, (D) B cell signature, (E) eosinophil signature, (F) neutrophil signature in RD-SG, RD-nonSG, and RF patients.

Discussion

We used RNA sequencing to molecularly profile a large cohort of RD-SG, RD-nonSG, and RF patients. We showed that IgG4 and IgE are among the most expressed transcripts in the blood of RD-SG or RD-nonSG patients, though not RF compared to controls, and both genes are highly correlated with each other. We also demonstrate that prednisone suppresses the levels of these genes in the blood in RD-SG, but not RD-nonSG patients. Reduction in serum IgG4 protein following steroid therapy has been observed previously in various studies as a result of immune suppression; glucocorticoid treatment is a well-known regimen to help attain remission in MD patients21,22. Among the 25 RD-SG patients, 17 of them in this study only have sialadenitis and dacryoadenitis involvement, which may explain the homogeneity of dramatic reduction in both IgG4 and IgE levels in the blood from prednisone treatment. The higher intra-cohort variability observed in RD-nonSG compared to RD-SG patients in PCA across the whole transcriptome may also support this finding. Th2 gene signatures were generally increased in both RD-SG and RD-nonSG patients, and Treg gene signature was significantly reduced in patients treated with prednisone. Activation of Th2 cytokines and blood eosinophilia in RD patients has been previously reported, suggesting an allergy response mechanism in IgG4-RD, and eosinophilia is often treated with corticosteroids to promote cell death and clearance23,24,25. The increased neutrophil signature in RD-SG patients treated with prednisone may be explained by the well-known phenomena of glucocorticoid-induced granulocytosis, where leukocytes have increased release from the bone marrow, and reduced migration out of the circulation26. Within IgG4-RD specifically, a microarray study observed neutrophil-specific genes (DEFA3 and DEFA4) significantly over-expressed in peripheral blood mononuclear cells (PBMCs) of patients on steroid therapy compared to those not27.

Genes associated with mitosis, cell cycle, and replication were most correlated with IgG4 expression. A previous study evaluating circulating autoantibodies in sera from IgG4-RD patients identified high levels of antibodies against prohibitin in patient subsets of autoimmune pancreatitis, MD, RF, IgG4-RD, and Sjögren’s syndrome (not healthy donors)28. The prevalence of anti-prohibitin auto-antibodies in IgG4-RD patients was hypothesized to increase cell proliferation, ultimately driving tissue enlargement. Additionally, a microarray study evaluating labial salivary glands in RD patients identified regulation of cell proliferation among the top enriched biological categories19. As increased IgG4 is a hallmark of this disease, the association between the phenotype and cell cycle processes is supported by these previous studies.

For RD patients with a longitudinal time point, there was an association between two patients that flared at the second visit (both on prednisone) and induction of Th1, Th2, Treg, and eosinophil gene signatures as well as IgG4 and IgE mRNAs. In contrast, this pattern across these gene signatures/genes was not consistently observed in the four patients with stable disease status at the second visit. That is to say, no stable patient had multiple induced Th1, Th2, Treg, or eosinophil gene signatures at the second time point. The balance between cell-mediated immunity (Th1 cells), humoral immunity (Th2 cells), and maintenance of immune homoeostasis (Tregs) and how this correlates with disease pathogenesis or activity have been investigated in rheumatoid arthritis and SLE, though conclusions have varied29,30. Immune-activated over-expression of IL-4, -5, -10, -13, and TGF-β1 drives eosinophilia and increased IgG4 and IgE levels in IgG4-RD, thus this suggests that at states of increased disease activity, Th1, Th2, Treg, and eosinophil involvement would be greater6.

An inverse relationship was observed between the flare visit and both the B cell signature and Tfh genes, where the gene signature/genes were suppressed at the flare visit. Similar to those gene signatures/genes showing induction at the flare visit, there was no consistent pattern of agreement in the four patients with stable disease across these gene signatures/genes at the second time point. At baseline, the B cell signature was significantly reduced in all patients compared to controls and was even more pronounced at the longitudinal flare time point in the two patients, suggesting cell infiltration to the disease tissue from the periphery in increased disease activity states. Tfh cells are located within germinal centers and secrete IL-21, driving differentiation of B cells to produce antibodies, thus the pattern showed by these Tfh-associated genes at the flare time point is consistent with that of the B cell signature31. A study in PBMCs of SLE patients showed that flares may be positively correlated with expansion of both Tfh and regulatory B cells through a regulatory feedback mechanism31. Another study in SLE found that a peripheral subset of CD27-IgD-CD97 + memory B cells were increased with disease flare, though the entire subset of CD27-IgD- B cells had no correlation with disease activity32. These results are in contrast to the pattern observed in this study at the flare visit in the two patients, though the B cell signature used here is not specific to either regulatory or memory B cells, and as indicated in the study by Jacobi et al.32, differences in B cell subsets can greatly vary with respect to phenotype.

In summary, we show the importance of the T and B cell axis with molecular profiling across RD-SG, RD-nonSG, and RF as well as features that distinguish these three diseases. Future work seeks to better understand the molecular mechanisms at relapse or recurrence following steroid reduction in these patients.

Methods

39 patients fulfilled the 2011 comprehensive IgG4-RD diagnostic criteria were involved in this study33. Among them, 26 patients were classified as definite IgG4-RD, 6 patients were classified as probable IgG4-RD and 7 patients were classified as possible IgG4-RD. Blood was procured from 25 RD-SG (ages 32–81; 12 Males), 11 RD-nonSG (ages 48–80; 9 Males), 3 RF (ages 48–65; 3 Males) and 10 control (ages 30–57; 7 Males) Chinese subjects (Table 1

Table 1 Subject summary: Control, RD-SG, RD-nonSG, and RF.

). Any organ with the salivary and lacrimal gland was involved in RD-SG patients. Except the salivary and lacrimal gland, other organs were involved in RD-nonSG patients. However, only retroperitoneal fibrosis was found in RF patients. Twenty patients were treated with prednisone (≤ 60 mg), with or without another glucocorticoid-sparing agent (cyclophosphamide, ursodeoxycholic acid, azathioprine, tamoxifen, hydroxychloroquine, methotrexate, and/or mycophenolate mofetil). Six patients had one longitudinal time point: two patients exhibited a flare at this second time point, while four did not. All participants provided written informed consent, in accordance with the Declaration of Helsinki. The study was approved by the Ethical Committee of Peking University People’s Hospital.

In this study, stable or active condition was defined for every subject at the first visit. Stable condition was defined as the disappearance of clinical symptoms, normalization or stabilization of serum IgG or IgG4, and resolution of organ manifestations on imaging. Or else, it was defined as active condition. At the longitudinal time point, we defined flare condition as a recurrence of symptoms with the development or reappearance of organ involvement or abnormalities on imaging studies and elevation of serum IgG or IgG4 level.

The IL-13 and IL-4 gene signatures were identified as what have been described previously34,35. The B and plasma cell gene signatures were developed using experiments as described in Streicher et al.36. The eosinophil and neutrophil gene signatures were identified from a phase 1 clinical trial in systemic lupus erythematous (SLE)37. Baseline blood cell counts of SLE patients were correlated with whole genome microarray transcript profiles measured in the blood of the same patients. Th1, Th2, Treg, and Tfh gene signatures were taken from Dong et al.38. The genes that compose each gene signature are provided in Supplementary Table 3.

The methods for RNA sequence read mapping and differential expression analysis are provided in the Supplementary Methods.