Introduction

Immunoglobulin G (IgG) 4-related disease (IgG4-RD) was first described in Japan. It is characterised by extensive organ involvement with tissue fibrosis and IgG4-positive plasma cell infiltration1. Although the Japanese IgG4-RD team established comprehensive diagnostic (CD) criteria for IgG4-RD in 20112, several issues with the 2011 CD criteria have been raised, such as difficulty in performing tissue biopsies in some patients with suspected IgG4-RD and concerns about the usefulness of serum IgG4 levels for diagnosis3,4. Because IgG4-RD and other diseases have overlapping clinical features, it is important to distinguish between them accurately. It is well known that some patients without IgG4-RD have high serum IgG4 levels and more IgG4-positive cells in affected tissues5,6,7.

In 2019, the American College of Rheumatology (ACR) and European League Against Rheumatism (EULAR) proposed criteria that can classify IgG4-RD with high specificity, without tissue biopsy8. However, there is concern about the exclusion criteria; patients positive for disease-specific antibodies are excluded from analysis for IgG4-RD before they are assessed for other autoimmune diseases. The revised comprehensive diagnostic (RCD) criteria for IgG4-RD were proposed by a team from the Ministry of Health, Labour and Welfare of Japan in 20209. The 2020 RCD criteria were a revision of the 2011 CD criteria. Definite IgG4-RD, which includes patients who meet the organ-specific criteria for IgG4-RD and pathological diagnoses including storiform fibrosis and obliterative phlebitis, was added. However, there are still few studies about the performance of the 2020 RCD criteria in diagnosing IgG4-RD.

In this study, we evaluated the performance of both the 2019 ACR/EULAR classification criteria and the 2020 RCD criteria in diagnosing IgG4-RD. We also evaluated clinical differences between IgG4-RD and other diseases.

Methods

Patients

We retrospectively evaluated patients who were admitted to Juntendo University Hospital with suspected IgG4-RD from April 2010 to March 2022. Rheumatologists certified by the Japan College of Rheumatology diagnosed IgG4-RD based on the 2011 CD criteria2 or respective organ-specific criteria for IgG4-RD10,11,12,13,14,15,16 after excluding other diseases. We also did not include patients who were initially diagnosed with IgG4-RD but ultimately diagnosed with another disease. This study was approved by the ethics committee of Juntendo University Hospital (approval number E22-0363) and conducted in accordance with the principles of the Declaration of Helsinki. The need to obtain informed consent was waived by the ethics committee of Juntendo University Hospital because of the retrospective design.

Clinical evaluation

We collected clinical data, including patient demographics, laboratory data at diagnosis, and outcomes, from medical records. The number of involved organs was counted from 11 lesions, based on a previous study17. The sensitivity and specificity of the 2019 ACR/EULAR classification criteria and the 2020 RCD criteria were evaluated by rheumatologists.

The 2019 ACR/EULAR classification criteria consist of a three-step process. Step 1 involved the entry criteria; a patient met the entry criteria if there were characteristic clinical or radiologic involvement in a typical organ. Step 2 involved the exclusion criteria; a patient met the exclusion criteria if they had any clinical, serologic, or pathologic exclusion items. If the patient met entry criteria and did not meet any exclusion criteria, Step 3 was initiated. Step 3 involved the inclusion criteria; a patient met the classification criteria for IgG4-RD if the total number of points over 8 domains was ≥ 20.

The 2020 RCD criteria contained three items: (1) clinical and radiological features, (2) serological diagnosis, and (3) pathological diagnosis. Patients who fulfilled all three items were diagnosed with definite IgG4-RD. Those who fulfilled items (1) and (3) were diagnosed with probable IgG4-RD. Those who fulfilled items (1) and (2) were diagnosed with possible IgG4-RD. Patients with a diagnosis of possible or probable IgG4-RD based on the CD criteria who fulfilled organ-specific criteria for IgG4-RD10,11,12,13,14,15,16 were also regarded as having definite IgG4-RD. In addition to definite IgG4-RD, probable IgG4-RD and possible IgG4-RD also met the 2020 RCD criteria. The details of both criteria were provided in previous publications8,9.

Statistical analysis

Statistical analysis was performed using SPSS version 23.0 software (IBM Corp.; Armonk, NY). We used the Mann–Whitney U test to analyse continuous variables and Fisher’s exact test to evaluate categorical variables. Sensitivity referred to the proportion of patients with IgG4-RD who met the diagnostic criteria. Specificity referred to the proportion of patients with other diseases who did not meet the diagnostic criteria. The Kappa coefficient was used for assessing agreement between the criteria (> 0.80 was considered almost perfect reliability, 0.61–0.80 substantial, 0.41–0.60 moderate, 0.21–0.40 fair, 0.00–0.20 slight, and < 0.00 poor)18. To evaluate the predictive performance of clinical data, receiver operating characteristic (ROC) analysis was performed. We calculated the cut-off point using Youden’s index. P-values < 0.05 were considered statistically significant.

Results

Diagnostic performance of both criteria

A total of 50 patients with suspected IgG4-RD were admitted to our hospital and we diagnosed 42 patients with IgG4-RD. Patients who were not diagnosed with IgG4-RD had multicentric Castleman disease (n = 2), non–IgG4-related idiopathic retroperitoneal fibrosis (n = 2), malignant lymphoma (n = 1), dedifferentiated liposarcoma (n = 1), urothelial carcinoma (n = 1), and undiagnosed disease (n = 1). The ACR/EULAR classification criteria had 88.1% sensitivity and 87.5% specificity for IgG4-RD diagnosis. On the other hand, the 2020 RCD criteria had 100% sensitivity and 50% specificity for IgG4-RD diagnosis. There were 21 patients diagnosed with definite IgG4-RD, 1 patient with probable IgG4-RD, and 20 patients with possible IgG4-RD. The kappa coefficient for the criteria was 0.43.

Clinical data of patients with versus without IgG4-RD

We compared the clinical characteristics and laboratory data at admission of patients with versus without IgG4-RD. Only the number of affected organs differed significantly between these two groups (Table 1). Figure 1a shows the number of patients by the number of affected organs. Figure 1b shows the number of patients for each affected organ. In addition, we compared the clinical data of patients with versus without IgG4-RD who had a single affected organ (Table 2). Among patients with only a single affected organ, patients with IgG4-RD had significantly lower serum C-reactive protein (CRP) levels (median 0.10 mg/dL vs 0.31 mg/dL; p = 0.020), higher serum IgG4/IgG ratios (22% vs 12%; p = 0.027), and lower total haemolytic complement activity (CH50) (38.0 U/mL vs 48.9 U/mL; p = 0.044) than patients without IgG4-RD. When we examined ROC curves for serum IgG4/IgG ratio in patients with a single affected organ, we obtained an optimal cutoff value of 14.7% for differentiating between patients with versus without IgG4-RD (Fig. 2).

Table 1 Characteristics of patients with versus without IgG4-RD.
Figure 1
figure 1

Affected organs by IgG4-RD status. (a) Number of patients by the number of affected organs. (b) Number of patients by affected organ. IgG4-RD immunoglobulin G4-related disease.

Table 2 Characteristics of patients with a single affected organ by IgG4-RD status.
Figure 2
figure 2

ROC curve for defining the optimal cut-off IgG4/IgG ratio in patients with a single affected organ. The ROC curve for distinguishing IgG4-RD from other diseases in patients with a single organ lesion. The cut-off value was 14.7%, AUC was 0.82, sensitivity was 90.9%, and specificity was 71.4%. AUC area under the curve, IgG4-RD immunoglobulin G4-related disease, ROC receiver operating characteristic.

Discussion

In this study, we evaluated the performance of the classification and diagnostic criteria for IgG4-RD in patients admitted to hospital with suspected IgG4-RD. We confirmed that the 2019 ACR/EULAR classification criteria enabled disease classification with high specificity, similar to the results of a previous study8. We also found that the 2020 RCD criteria enabled diagnosis with high sensitivity. The kappa coefficient between the 2011 CD criteria and the 2019 ACR/EULAR classification criteria showed poor reliability in a previous study19; however, this study showed moderate reliability between the 2020 RCD criteria and 2019 ACR/EULAR classification criteria. One possible explanation is that the 2020 RCD criteria allowed for the diagnosis of definite IgG4-RD with organ-specific findings or additional pathological findings, which allowed for the diagnosis to be made regardless of serum IgG4 levels, thereby increasing the agreement between the two sets of criteria.

There were no significant differences in patients with versus without IgG4-RD in this study, other than in the number of affected organs. Serum IgG4 levels and IgG4/IgG ratio were considered useful for diagnosing IgG4-RD20 according to one study, but another study found them to be less useful4. These results also showed that it was difficult to differentiate IgG4-RD from other diseases based serum IgG4 levels and the IgG4/IgG ratio. However, we confirmed the usefulness of the IgG4/IgG ratio in a sub-analysis limited to patients with a single affected organ. Previous studies showed that many patients with IgG4-RD have multiple affected organs21 and as the number of affected organs increases, so do serum IgG4 levels22. However, patients with IgG4-RD who have relatively small and restricted lesions have been reported to have higher IgG4/IgG ratios, even in the absence of elevated serum IgG levels20. CH50 and serum CRP levels were also significantly lower in patients with IgG4-RD than patients without IgG4-RD among patients with a single affected organ. A previous study reported hypocomplementemia and low serum CRP levels in IgG4-RD23; we had similar results in this study. The mechanism underlying hypocomplementemia in IgG4-RD remains unclear, but it has been reported that IgG4-RD with hypocomplementemia is associated with more kidney or lung involvement and relapse17. Since serum CRP levels are often elevated in mimickers such as multicentric Castleman disease5, we might have identified a significant difference in serum CRP levels among patients with versus without IgG4-RD with a single affected organ. For the diagnosis of IgG4-RD, focusing on laboratory data such as the serum IgG4/IgG ratio relative to the number of affected organs is expected to help increase diagnostic accuracy.

This study has several limitations. It used a single-centre, retrospective design. Interpreting results for classification criteria may require caution, because the purpose of classification criteria is to create homogenous cohorts for clinical research and not for diagnosis. Due to the limited number of patients, multivariate analysis was not performed. The results of this study might need to be confirmed in larger studies.

In conclusion, the 2019 ACR/EULAR classification criteria have high specificity and the 2020 RCD criteria have high sensitivity for diagnosing IgG4-RD. The number of affected organs is also important for diagnosing IgG4-RD, with particular attention to the serum IgG4/IgG ratio in the case of single organ involvement.