Introduction

Membranous nephropathy (MN) is one of the most common glomerular diseases causing primary nephrotic syndrome (NS) in China. The prevalence of MN has increased greatly in Central China during the past two decades1,2. Minimal change disease (MCD) is another primary glomerular disease with high prevalence characterized by rapid onset and development. Current studies suggest these two diseases account for about 60% NS patients in China3,4.

Renal biopsy is still the gold standard for diagnosing and distinguishing these two diseases. However, considering the poor physical condition and contraindications, many patients are unsuitable to undergo renal biopsy in daily clinical practice5. In addition, it does take a period of time to wait for the result. The onset of MCD is usually utterly urgent which may cause irreversible damage to renal function if the treatment is not timely and correct.

M-type phospholipase A2 receptor (PLA2R) is considered as a specific marker of idiopathic membranous nephropathy. Present evidences reveal PLA2R test has strong specificity, while its sensitivity is insufficient, 0.7 approximately6,7,8,9. Hence, quite amount of PLA2R-negative MN patients cannot be screened out. More importantly, the accuracy of PLA2R detection may be influenced by other diseases such as tumor-related diseases7,10. Thrombospondin type-I domain-containing 7A (THSD7A) was identified as the target antigen in about 10% of patients with PLA2R-negative MN in European and North American populations, but THSD7A-associated membranous nephropathy has a low prevalence in Chinese patients11. Recently, multiple novel proteins/target antigens have been identified in MN, including exostosin 1/2, neural epidermal growth-like 1 protein, semaphorin 3B, protocadherin 7 and neural cell adhesion molecule 112,13. Nevertheless, they are not still largely implemented in the routine clinical practice. Thus, it is urgent to develop a new convenient and noninvasive method to distinguish MN from MCD.

The aim of our study is to develop and validate a model to distinguish MCD and MN, especially PlA2R-negative MN. The model can be used to the patients who are unsuitable or unwilling to undergo renal biopsy. We believe our model will help clinicians treat these patients in a timely manner and improve their prognosis.

Methods

Study population and ethical approval

In this population-based retrospective analysis, we screened all the patients with NS who were hospitalized in the First Affiliated Hospital of Zhengzhou University from January 2017 to August 2019. The inclusion criteria were as follows: (1) aged 18–80 years, (2) diagnosed as idiopathic MN or primary MCD by renal biopsy, (3) experienced a PLA2R test. The exclusion criteria included application history of corticosteroid or immunosuppressant prior to renal biopsy. A total of 949 patients including 805 idiopathic MN (605 PLA2R-positive MN and 200 PLA2R-negative MN) and 144 primary MCD were enrolled in this study and defined as potentially relevant cases. Among them, patients with PLA2R-negative MN and MCD were used to develop and validate the discrimination model. In the end, we also tested the differential ability of the model in all idiopathic MN and MCD. Moreover, we also validated the model among 60 cases containing 42 PLA2R-negative MN and 18 primary NS caused by other etiology in real life (14 MCD, 3 focal segmental glomerulosclerosis, 1 mesangial capillary glomerulonephritis). Anti-PLA2R antibodies were determined by an ELISA assay. A negative serology was defined as an ELISA titer < 14 RU/mL. The enrollment flowchart of the participants in this study was shown in Fig. 1.

Figure 1
figure 1

Enrollment flowchart of participants used for model development and validation. MCD minimal change disease, MN membranous nephropathy.

The First Affiliated Hospital of Zhengzhou University Ethics Review Committee granted ethical approval for the study and the ethics review approval ID was “ZY-2021-0008”, and the requirement for informed consent was waived by the ethics commission. All methods were performed in accordance with the relevant guidelines and regulations.

Data collection

We collected the basic information and laboratory examination from all patients recruited at the time of renal biopsy, which might be involved in distinguishing the two diseases. The basic information included age, gender, onset time, systolic blood pressure (SBP) and diastolic blood pressure (DBP). The laboratory indices included red blood cell (RBC) count, white blood cell (WBC) count, platelet (PLT) count, eosinophil (EOS) count, percentage of eosinophils (EOS%), hemoglobin (Hb) levels, mean corpuscular hemoglobin (MCH), mean corpuscular hemoglobin concentration (MCHC), total protein (TP) levels, albumin (ALB) levels; total cholesterol (TC) levels, triglyceride (TC) levels, low density lipoprotein (LDL) levels, high density lipoprotein (HDL) levels, estimated glomerular filtration rate (eGFR); serum creatinine (SCr) levels; uric acid (UA) levels, C reactive protein (CRP) levels, erythrocyte sedimentation rate (ESR) levels, complement 3 (C3) levels, complement 4 (C4) levels, 24 h uric total protein (24 h TP) levels , 24 h urine volume and venous thrombosis of lower extremity.

The urine and venous blood samples of all participants were collected after 12 h of overnight fasting (except 24-h urine samples) and sent to laboratory for testing straight away.

Statistical analysis

The PLA2R-negetive MN and all MCD patients (n = 344) were randomly divided into the training group (n = 241) and the test group (n = 103).

There were a few missing data of several variables. For example, the HLD and LDL levels had 3.5% missing values, Scr and Urea levels had 0.9% missing values. To deal with these missing data, multivariate multiple imputation with chained equations was used to impute missing values so that we could maximize the statistical power and diminish bias14. Descriptive statistics of all variables including means, medians, and proportions are used to describe the characteristics of two groups. The categorical data expressed as the percentages, and means ± standard deviation (SD) or medians (quartile 1, quartile 3) were described as continuous variables satisfying or not satisfying the normal distribution, respectively. We used the univariate logistic regression to calculate the OR values of the variables, selected potential variables to perform multiple logistic regression subsequently, and calculated the collinearity of the variables to remove the colinear factors. The candidate variables with a p value < 0.05 in the univariate analysis were enrolled to develop the multivariable model.

Based on the clinical features and variables with statistical sense, we attempted to develop the discrimination model for distinguishing PlA2R-negative MN from primary MCD patients.

Next, we drew a receiver operating characteristic curve (ROC) and used the area under the receiver operating characteristic curve (AUC) to evaluate the verification efficiency of the model. The calibration was assessed by constructing the calibration curve. The fitting degree of the model was assessed by the Akaike information criterion (AIC). After comprehensively evaluating the performance of each model, we obtained the best model and constructed a nomogram to make it convenient for the clinical application.

At last, we constructed the decision curve analysis to determine the clinical utility of the discrimination model by quantifying the net benefits at different threshold probabilities15.

All the statistical analysis processes involved were completed by R software, version 4.0.2. P value < 0.05 was considered statistically significant. Model validation and evaluation processes were independently performed on the training group and test group, respectively. In order to assess the diagnostic ability of the model distinguishing all idiopathic MN and MCD, we also performed validation processes on 949 potentially relevant cases.

Ethics approval and consent to participate

The First Affiliated Hospital of Zhengzhou University Ethics Review Committee granted ethical approval for the study and the ethics review approval ID was “ZY-2021-0008”. Informed consent was waived because of the retrospective analysis.

Results

Baseline characteristics

The baseline characteristics of PLA2R-negative MN (n = 200) and MCD (n = 144) were shown in Table 1. The results suggested that PLA2R-negative MN patients tended to be older and have higher TP, ALB levels and greater urine volume. While, the RBC and PLT count, median Hb, TCHO, TG, LDL level, HLD, Scr, Urea, ESR, C3, C4 and 24hTP levels of PLA2R-negative MN were lower than those of MCD. The baseline characteristics of all idiopathic MN and MCD were shown in Table S1.

Table 1 Baseline characteristics of PLA2R-negative MN and MCD.

All the participants were randomly divided into training group (n = 241) and test group (n = 103). The levels of these variables were similar and had no significant statistical difference, which represented similar clinical profiles between the two groups (Table 2).

Table 2 Baseline characteristics showed there was no statistical difference in the training group and test group.

Six potential predictors were selected to develop the discrimination model

By means of univariate logistic regression, 17 potential predictors from 29 candidates were considered to have statistical significance (p < 0.05) in training group. After implementing the multivariable logistic regression analysis and removing the collinear candidates, six potential predictors including age, ALB levels, HDL levels, Urea levels, C3 levels and RBC counts were used to establish the discrimination model to distinguish PLA2R-negative MN from MCD (Table 3). We created a probability equation based on the above six predictors (Supplementary Equation). The results indicated that patients with older ages, higher ALB levels, lower HDL levels, lower serum Urea levels, lower C3 levels and lower RBC count were more likely suffering PLA2R-negative MN.

Table 3 Potential risk factors identified by univariate and multivariate logistic regression analysis.

Great discrimination and calibration capability of the model

We drew the ROC to evaluate the diagnostic effectiveness of the model (Fig. 2). The area under the ROC (AUC), which is referred to as the C-statistic, is considered to be an indicator for evaluating the effectiveness of the model. We found that the discrimination model had a high efficiency with an AUC of 0.904 (cut-off value: 0.511, sensitivity: 0.832, specificity: 0.886; Fig. 2A) in training group. We subsequently verified the effectiveness in the test group and the result showed a superior efficiency with an AUC of 0.886 (cut-off value: 0.693, sensitivity: 0.837, specificity: 0.850; Fig. 2B). The high value of AUC showed that the model had a great ability for discrimination the two diseases.

Figure 2
figure 2

Differential capability of the nomogram. (A) ROC curve based on obtained potential risk factors identified by multivariate logistic regression analysis showing discrimination rate for PLA2R negative MN and MCD in the training group. (B) ROC curve based on obtained potential risk factors identified by multivariate logistic regression analysis showing discrimination rate for PLA2R negative MN and MCD in the test group.

The calibration curve was plotted to evaluate the calibration of the model and it demonstrated a good agreement between prediction and observation both in training group (mean absolute error = 0.017) and test group (mean absolute error = 0.016, Fig. 3 A,B). The calibration curve indicated that the model had a great calibration capability.

Figure 3
figure 3

Calibration curve of the discrimination nomogram in the (A) training group or (B) test group. The x-axis represents the predicted probability of MN. The y-axis represents the actual pathologically diagnosed MN. The diagonal dotted line represents a perfect prediction by an ideal model. The solid line represents the performance of the nomogram, of which a closer fit to the diagonal dotted line represents a better prediction.

Construction and usage of the nomogram

In order to make the model convenient to use, we constructed the nomogram of our discrimination model based on six obtained predictive variables including age, HDL levels, ALB levels, Urea levels, C3 levels and RBC count (Fig. 4). The value of each variable represented as a score by drawing a straight line upward from the corresponding value to the “Points” line. Sum the total points and mark it at “Total points” line. Draw down a straight line to the corresponding “MN probability” axis and obtain the possibility of MN. In order to show the score intuitively, we have selected several representative values as examples (Table S2).

Figure 4
figure 4

Nomogram based on the laboratory model; ALB albumin, HDL high density lipoprotein, C3 complement 3, RBC red blood cell, MN membranous nephropathy.

Decision curve showed it would add more net benefits for clinical decision

The result of decision curve analysis for the nomogram was shown in Fig. 5. In training group, the decision curve showed that if the threshold probability of a patient is between 0.02 and 0.91, using the nomogram in the present study to predict MN adds more benefits than performing renal biopsy on none or all patients.

Figure 5
figure 5

Decision curve for the nomogram predicting MN in (A) training group or (B) test group. Net benefit is shown on the y-axis. The thick red line represents the model; the thin gray line represents the assumption that all patients have MN; the thin black line represents the assumption that all patients have MCD. In training group, the decision curve showed that if the threshold probability of a patient is between 0.02 and 0.91, using the nomogram in the present study to predict MN adds more benefit than performing biopsy on all or no patients.

Diagnosis efficiency testing in potentially relevant cases

In order to determine whether we can expand the application scope of our model, we also performed diagnosis efficiency testing in potentially relevant cases (including 605 PLA2R-positive MN, 200 PLA2R-negative MN and 144 MCD, Fig. 6). The ROC showed the great diagnostic performance with an AUC of 0.867 (cut-off value: 0.509, sensitivity: 0.785, specificity: 0.861), suggesting great discrimination ability for all idiopathic MN and MCD patients. The performances of calibration curve and decision were also good (Figure S1, S2). In addition, we performed diagnosis efficiency test in PLA2R-negative MN and primary NS caused by other etiology (42 PLA2R-negative MN, 14 MCD, 3 focal segmental glomerulosclerosis, 1 mesangial capillary glomerulonephritis, Fig. 7). The ROC showed the diagnostic performance with an AUC of 0.836 (cut-off value: 0.639, sensitivity: 0.778, specificity: 0.857).

Figure 6
figure 6

ROC curve showing discrimination effective of nomogram used in all MN and MCD.

Figure 7
figure 7

ROC curve showing discrimination effective of nomogram used in PLA2R-negative MN and primary NS caused by other etiology.

Discussion

In this retrospective case–control study, we attempted to develop a discrimination model used to distinguish patients with idiopathic MN and MCD. We collected and analyzed the basic information and laboratory examination of 949 patients with MN or MCD. Based on 200 PLA2R negative patients and 144 MCD patients, we developed a discrimination model to differentiate the two diseases. The results showed great diagnostic effectiveness with an AUC of 0.904 in training group and an AUC of 0.886 in test group as well as high calibration capability. To the best of our knowledge, it is the first study aiming to develop a discrimination model based on the basic information and the laboratory examination of participants to distinguish primary PLA2R-negative MN and MCD. In addition, our model showed great diagnostic effectiveness with an AUC of 0.867 in all idiopathic MN patients (either PLA2R-negative or PLA2R-positive MN) and MCD patients. It is an attempt at translational medicine of our study, which can aid clinicians to treat patients with different methods in a timely manner and thus improve their prognosis.

Currently, it is difficult to distinguish MN and MCD patients by a noninvasive tool in clinical practice. A study tried to use soluble urokinase-type plasminogen activator receptor (suPAR) level to distinguish idiopathic focal segmental glomerulosclerosis (FSGS), MN and MCD. However, the study revealed that the three types of glomerulopathy cannot be distinguished using suPAR solely 16. Therefore, there is no miraculous indicator or model to identify these two primary glomerular diseases currently.

Prediction and discrimination models based on clinical data have been developed increasingly in a wide variety of diseases recent years17,18. In terms of kidney diseases, prediction and discrimination models are also rapidly growing owing to its scientific nature and accuracy19,20,21. The appearance of clinical models gave us great inspiration.

Present evidences suggested that MN had the largest proportion of morbidity in elderly patients, while MCD accounts for the highest proportion of primary nephrotic syndrome in young patients3,22,23. Our experiments reached similar results that the age at biopsy had a certain influence on the nomogram. In our study, patients with older age were more likely to be considered as PLA2R-negative MN, while younger age at onset was considered to be a higher risk of MCD.

ALB level was one of the predictive factors in this model. Some studies showed MN patients always had higher ALB levels, which was consistent with our results24,25. One of the most important clinical manifestations of nephrotic syndrome is increased urinary protein and decreased albumin level. Larger amounts of glomerular albumin filtration will also make serum albumin and serum total protein at a low level. MCD patients always presents as an acute onset, severe illness, and a greater amount of urinary protein, and rapid decline in renal function may result in increased Scr levels and decreased eGFR and 24 h urine volume. Some patients will even progresses to AKI, which is relatively common actually, due to high-grade proteinuria26. The slit diaphragm between foot processes is regarded as a fine filter27. There is a common assumption that proteins leak from the slit pores due to reduced nephrin expression, leading to larger amounts of glomerular albumin filtration in MCD patients28. In our study, we chose ALB levels as one of the variables of the model by multivariable regression analysis to exclude the effect of collinearity.

HDL level is another variable that can be used to distinguish between two diseases according to our results. Nephrotic syndrome could cause upregulation of HDL endocytic receptor and downregulation of HDL docking receptor, causing dysregulation of lipid/lipoprotein metabolism29. MCD is also known as lipoid nephropathy because steatosis can be observed in epithelial cells of proximal convoluted tubules under light microscopy. In addition, increased hepatic lipoprotein synthesis and reduced lipoprotein degradation are also thought to be responsible for elevated blood lipid profiles. Takeshi Fujita compared lipid and fatty acid metabolism between 7 MCD and 11 MN patients. The results showed that the patients with MCD had higher level of blood lipids than MN30. Although the mean HDL level was much higher in patients with MCD, there was no statistical significance between the two groups, which was not exactly the same as our results. The reason might be that their sample size was not large enough, leading to the unobvious statistical significance.

Increased urea level was usually observed in a high protein decomposition status. After the renal filtration barrier disrupted, large amount of protein will leak into Bowman’s space and renal tubules through glomerular barrier to form crude urine. When proximal tubules enhance the reabsorption of filtered proteins, the protein decomposition is also increased at the same time, resulting in elevated serum Urea level. Serum Urea level were also found different in MN patients and MCD patients in Jin Dong’s research31, which was consistent with our results. The Urea level plays an important role in our discrimination model.

C3 level is also a biomarker for distinguishing PLA2R-negative MN and MCD. Complement is a group of glycoproteins with enzyme-like activity that exists in human serum and tissue fluid, together with its regulatory factors and related membrane proteins to form the complement system32. C3 is the largest content in each component of the complement system and the key substance of the classical pathway and the alternative pathway. A variety of glomerulonephritis showed evidence of complement activation. However, the role of complement in the pathogenesis of these kidney diseases remains not fully understood33. From the study of combining histopathological examination and blood test to identify different types of glomerulonephritis, the C3 level of MCD is higher than that of MN31, although there is no statistical significance which may be due to insufficient cases.

RBC count and Hb levels had statistical significance by means of univariate regression in our study. They are usually recognized as the indices to evaluate anemia. Compared with younger patients, idiopathic membranous nephropathy patients over 65 years old were found to have lower Hb level than patients less than 65 years old in Choi JY’s study34. However, the results in Yaeni Kim’s study showed there was no difference of Hb levels between elderly patients and young patients35. The reason for this ambiguity might be different gender and illness state of included patients. In our study, univariate regression showed there was no statistical difference in gender. And the data we collected was from the time of renal biopsy, reducing influence of the illness state.

The decision curve showed the clinical utility of our model, indicating it may be beneficial for clinicians to distinguish the two diseases by using our model. And using the nomogram to distinguish the two diseases added more benefits than either all or no patients who underwent a renal biopsy if the threshold probability of a patient was between 0.02 and 0.91. The results of decision curves suggest the good clinical application value of our model, reflecting the thinking mode of translational medicine.

The results of diagnosis efficiency test in potentially relevant cases suggested that our model is applicable to all idiopathic MN and MCD patients. Some hospitals are unable to perform PLA2R test, and our model might provide an alternative tool for these hospitals to distinguish MN and MCD.

Our study is an attempt in translational medicine and has a number of strengths. It had a large sample size with 949 idiopathic MN and MCD patients confirmed by renal biopsy. And the 6 items in the nomogram are routine clinical variables that can easily obtained by clinicians. We chose to collect the information and examination results at the time of renal biopsy, and excluded the influence of corticosteroid or immunosuppressive agents. What is more, our discrimination model has excellent diagnostic effectiveness with an AUC of 0.904 in training group and an AUC of 0.886 in test group. The outstanding discrimination ability for all idiopathic MN and MCD patients even showed wider application prospects of our model. The operation of the model is simple and fast, which can help doctors diagnose patients timely. Unlike renal biopsy, our model does not have any contraindications so that it can be used more widely.

However, there are also several limitations in our study. First, we still need to expand the sample size for further reducing the heterogeneity. In addition, all the patients came from the First Affiliated Hospital of Zhengzhou University and we did not conduct multicenter external validation. Besides, the parameters of our model mainly came from laboratory test (e.g. red blood cell count and albumin). These parameters are non-specific and may be affected by many factors, which is one of our limitations. Last, our model is only suitable for the identification of MN and MCD. The incidence of MN and MCD is high, while IgA nephropathy is the most common pattern of primary glomerular disease worldwide. Even through IgA nephropathy always presents the clinical features as nephritis with minor proteinuria rather than nephrotic syndrome, unlike MN and MCD3,36, there is still a lack of differential ability of our model for other types of nephrotic syndrome. Corrections to these shortcomings will be made in our subsequent research.

Conclusion

In this study, we developed and validated a discrimination model used for distinguishing PLA2R-negative MN and MCD patients. We further presented a nomogram including age, ALB levels, HDL levels, urea levels, C3 levels and RBC counts. The model showed good discrimination and calibration ability both in training group and test group. It also had a great diagnostic performance in all MN patients and MCD patients. Hopefully, it could provide a practical and convenient tool for clinicians to distinguish these two diseases.