Introduction

Chronic kidney disease (CKD) is a burden for Public Health and concerns millions of individuals worldwide. Kidney transplantation remains the optimal treatment for CKD, offering a better survival than dialysis and being cost-effectiveness1,2,3.

In order to increase the pool of available donors in a setting of organ shortage, grafts from uncontrolled donors after circulatory death (uDCD) have been used in France since 2006. Several studies have shown that kidneys from DCD provide almost an equal function as kidneys from donation after brain death (DBD)4,5,6,7,8,9,10,11,12,13. Moreover, as DBD kidney transplantation, DCD kidney transplantation is associated with increased survival of patients who have end-stage renal disease (ESRD) and are on the transplant waiting list14. Whereas early reports of uDCD showed good renal outcome and no increase in IF/TA compared to extended criteria DBD, other reported a very early and more severe development of IF/TA15 than in DBD.

Whatever the donor’s status, chronic allograft dysfunction (CAD), which is the final result of different etiological and pathogenetic conditions, remains the first cause of graft loss16,17,18,19,20. CAD corresponds to the irreversible replacement of functional renal tissue by extracellular matrix (ECM) proteins, leading to the progressive impairment of renal graft function. Among the pathological lesions observed during CAD, one of the most prominent is interstitial fibrosis and tubular atrophy (IF/TA). Other histological damages include glomerulosclerosis, splitting of glomerular capillary basement membranes and vascular intimal hyperplasia21. Many mechanisms which are of immunological origins or not, are involved in this multifactorial and complex process22,23,24,25,26. The current concept states that many processes leading to graft fibrosis, i.e. IF/TA, occur early after the transplantation, especially within the first few months27. IF/TA involves about 40% of kidney grafts at 3–6 months and up to 50% at 1 year, while renal function remains stable28,29, which suggests that renal function is a suboptimal and late marker of kidney graft dysfunction. Up to now, the only reliable method to assess IF/TA is the histological evaluation of a graft biopsy30,31 which also allows to determine specific lesions and pathogenic processes affecting the graft22,23. It is also well-known that early IF correlates with long-term graft dysfunction32,33,34, highlighting the need for accurate quantification of IF to better identify patients requiring specific therapeutic interventions and to determine the efficacity of such interventions. IF in kidney graft is usually graded using the Banff classification35. However, the Banff classification exhibits some limitations. The semi-quantitative evaluation in only 4 grades (0 to 3) prevents the precise evaluation of IF evolution and may be not sensitive enough in the early stages. Furthermore, studies have shown a wide interobserver variation in the assessment of renal graft biopsies using the Banff classification28,36,37,38.

The aim of the study was to assess the initial progression of interstitial fibrosis and tubular atrophy in kidney grafts from uDCD formerly classified as “Maastricht II” non heart beating donors, using a new image analysis method based on Sirius Red staining.

In the first part of our work, we will evaluate the accuracy, the robustness and reproducibility of our computerized analysis method (mIF) to assess IF. In the second part, mIF will be applied to assess IF during the first year after kidney transplantation from uDCD and mIF score will be correlated with clinico-biological data.

Results

Validation and reliability tests of the image analysis method for the quantification of interstitial fibrosis (mIF)

Among the 166 graft biopsies, 15 were rejected for image analysis because specimen adequacy was unsatisfactory (<7 glomeruli and/or no artery on the sample) according to the Banff criteria39 or because the paraffin block was worn out. Measurement of interstitial fibrosis by image analysis (mIF) was performed on the 151 biopsies independently by two pathologists (CP and SF) (Fig. 1A). The value of inter-operator intraclass correlation coefficient (ICC) was 0.75 with its 95% confidence interval (95% CI) equal to [0.67–0.81] (p = 8.4.10-29) (Fig. 1B). To assess the intra-observer reproducibility, 21 consecutive graft biopsies were analyzed again by one of the pathologists (CP) 6 months later. The value of ICC was 0.88 with 95% CI of [0.72–0.95] (p = 5.5.10-8) (Fig. 1C). Analysis of interstitial fibrosis (ci) according to the Banff criteria (using Masson’s trichrome staining) exhibited inter-operator ICC of 0.78 with 95% CI of [0.70–0.84] (p = 5.6.10−25) (Fig. 1D). Then, we compared morphometric quantification of interstitial fibrosis and semi-quantitative analysis performed by an expert pathologist (SF) according to Banff criteria (Fig. 1E). The mIF score significantly increased (p < 0.05) between the four groups defined by the Banff classification. Mean mIF (%) was respectively 8.3 ± 2.4 (ranges: 4.3–15.8) in the ci0 group (n = 40), 10.7 ± 3.6 (ranges: 4.0–25.2) in the ci1 group (n = 82), 17.1 ± 6.4 (ranges: 9.8–36.6) in the ci2 group (n = 18) and 20.1 ± 8.0 (ranges: 5.7–32.7) in the ci3 group (n = 11). The correlation between mIF and Banff ci was 0.62 with 95% CI of [0.51–0.71] (p < 0.001). Discrepancies between ci and mIF were observed in only 2 cases. These cases were carefully reanalyzed: interstitial edema and inflammation were the main drawback to accurately quantify ci in Massons’ trichrome staining (Supplemental Fig. 1).

Figure 1
figure 1

Quantification of interstitial fibrosis by image analysis: validation and reliability tests of the morphometric quantification of IF (mIF). (A) Morphometric analysis. Renal biopsy sections stained with Sirius red were captured by a ScanScope Aperio scanner (CS), using 20X objective. For each biopsy, the cortical section was manually selected. Glomeruli and medium-sized arteries were deleted by the operator. The red positive area was expressed as a percentage of the entire cortical kidney section using a computer-based morphometric analysis software (Calopix, Tribvn, Montrouge, France). (B) The value of inter-operator intraclass correlation coefficient (ICC) using mIF was 0.75 with its 95% confidence interval (95% CI) equal to [0.67–0.81] (p < 10−3) (n = 151). (C) The value of intra-observer ICC of mIF was 0.88 with 95% CI of [0.72–0.95] (p < 10−3) (21 consecutive graft biopsies were analyzed again 6 months later). (D) The value of inter-operator ICC using Banff criteria was 0.78 with 95% CI of [0.70–0.84] (p < 10−3) (n = 151). (E) mIF according to Banff ci. Pearson correlation between mIF and Banff ci was 0.62 with 95% CI of [0.51–0.71] (p < 10−3). Abbreviations: ci = interstitial cortical fibrosis; mIF, morphometric interstitial fibrosis; sd, standard deviation.

Clinical characteristics of the patients

The 66 patients who received kidney graft from uDCD included 14 females and 52 males. Mean age at the time of kidney transplantation was 44.7 ± 9.8 years old (median: 46; range: 20–59). Indications for kidney transplantation were hypertensive nephroangiosclerosis (n = 16), IgA nephropathy (n = 7), autosomal dominant polycystic kidney disease (n = 6), Alport syndrome (n = 4), diabetic nephropathy (n = 3), other glomerulopathies (n = 5), tubulo-interstitial nephritis (n = 3) and uropathy (n = 4). Nephropathy remained undetermined in 18 patients. All patients received induction therapy with anti-lymphocyte serum. They also received mycophenolate mofetil, corticosteroids and tacrolimus per local practice. Characteristics of the recipient patients are summarized in Table 1.

Table 1 Characteristics of donors, recipients and grafts.

Uncontrolled DCD: Baseline characteristics, transplantation and clinical outcome

The 66 patients received kidney graft from 48 uDCD (Table 1). Donors included 9 females and 39 males and were 39.7 ± 8.5 years old (median: 40.5; range: 19–53). None of them had history of arterial hypertension or diabetes mellitus. Mean total warm ischemic time (tWIT) from cardiac arrest to in situ preservation was 145.9 ± 15.2 minutes (median: 148.5; ranges: 115–188). Mean cold ischemia was 16.2 ± 3.7 hours (median: 17.0, ranges: 10–23.8). Delayed graft function occurred in 54/66 patients (82%). Mean time between transplantation and renal graft recovery was 22.6 ± 9.8 days (median: 20, ranges: 1–58). Non-primary graft function occurred in 2 patients. Biopsy-proven acute rejection occurred in eight patients (12%). Mean LDH level reached a pic at Day 3 post transplantation (2539 ± 1089 IU/L) which was significantly associated with time to graft recovery (time to recovery according to LDH at D3: y = 15.2 + 0.0025x, p = 0.016, R² = 0.1028).

Histological features

Among the 48 uDCD, 43 underwent kidney biopsies (so called D0) before organ donation. Thereafter, 20 patients underwent kidney graft biopsy between 15 and 30 days (D15–30) post-transplantation for delayed graft function, 28 at 3 months (M3) and at 12 months (M12) for routine evaluation. Socio-demographic and clinical data were not significantly different between the patients who underwent biopsies at D15–30, M3 and M12. Histological features of biopsies are summarized in Table 2. Acute tubular injury significantly decreased from D0 to D15-D30 (63% ± 28% versus 47% ± 22%, p = 0.036), then to M3 (9% ± 11%, p < 10-3) and M12 (4% ± 6%, p = 0.016). Conversely, chronic lesions, especially interstitial fibrosis and tubular atrophy, significantly increased over time. According to the Banff staging system, interstitial fibrosis remained unchanged from D0 to D15/30 (0.6 ± 0.5 at D0, 0.7 ± 0.6 at D15/30), increased from D0 to M3 (0.6 ± 0.5 versus 1.5 ± 0.8, p < 10−3) and remained stable between M3 and M12 (1.7 ± 0.9 versus 1.5 ± 0.8, p = 0.37) (Fig. 2A).

Table 2 Histological features of renal biopsies at D0, D15-D30, M3 and M12.
Figure 2
figure 2

Interstitial fibrosis (IF) significantly increased after renal graft transplantation. (A) According to the Banff criteria, ci remained unchanged from D0 to D15/30, increased from D0 to M3 (p < 10−3) and remained stable between M3 and M12 (p = 0.37). (B) Using morphometry analysis, mIF tended to increase as early as the first month after renal transplant (p = 0.056), increase was significant from D0 to M3 (p < 10−3) and remained significant from M3 to M12 (p = 0.021). Abbreviations: ci = interstitial cortical fibrosis; mIF, morphometric interstitial fibrosis; sd, standard deviation; tx, renal transplantation; *p < 0,05; ***p< 10−3. Figure was performed using R software58.

mIF significantly increased after the transplantation and correlated with clinical outcome

Image analysis IF was 8.4% ± 3.8% at D0, 10.0% ± 3.9% at D15-D30, 13.6% ± 6.7% at M3 and 15.9% ± 5.2% at M12 (Fig. 2B). Increase was significant from D0 to M3 (p < 10−3). Conversely to ci according to the Banff scoring system, increase of mIF remained significant from M3 to M12 (p = 0.021). Fibrosis tended to increase as early as the first month after renal transplant (p = 0.056). Most of the patients underwent more than one kidney biopsy. Among them, seven patients underwent early kidney graft biopsy between D7 and D30 and protocol biopsies at M3 and at M12 (Supplemental Fig. 2). In these paired cases, mIF significantly increased from first month to M3 (respectively 7.8% ± 1.8% and 11.4% ± 2.7%, p < 0.01) then from M3 to M12 (respectively 11.4% ± 2.7% and 17.7 ± 4.8%, p = 0.02). In these cases, Banff classification failed to demonstrate early increase of ci from first month to M3 (respectively 0.88 ± 0.35 and 1.3 ± 0.46, p = 0.08).

Baseline characteristics of uDCD (sex, age), tWIT and cold ischemia were not associated with mIF at any time. mIF at M12 (R² = 0.29, p = 0.005) but not at D0, D15–30 and M3 correlated with mean time to renal graft function recovery. In addition, mIF at M12 correlated with increase of creatininemia at M12 (R² = 0.32, p = 0.013) and at last follow-up (R² = 0.29, p = 0.005). We found the same correlation using the Banff scoring system, however, whereas there was no correlation between ci according to the Banff scoring system at M3 and creatininemia at M12 (R² = 0.15, p = 0.18), mIF at M3 tended to be associated with increase of creatininemia at M12 (R² = 0.31, p = 0.057).

Discussion

The aim of this study was to precisely describe the evolution of IF after renal transplantation with uDCD, using a new computerized image analysis method. Indeed, IF is one of the most prominent pathological lesions of CAD and correlated with renal graft prognosis32,33,34 but the currently used Banff classification is a semi-quantitative system which prevents precise description of IF evolution and may not be sensitive enough in early stages. Slight but significant increase of IF on renal graft may be missed. Grimm et al. demonstrated that a precise quantification of IF by computerized image analysis provides a better surrogate marker for time to graft failure than any combination of chronic lesion scoring using the Banff schema34. Moreover, some studies also exhibited a wide inter-observer variation in the IF assessment according to the Banff classification36,37. The morphometric analysis we developed exhibited high inter-observer and intra-observer reproducibility, with ICC [95% IC] of respectively 0.79 [0.74–0.83] and 0.88 [0.72–0.95]. A variety of techniques have been used to measure renal fibrosis, each with its own advantages and disadvantages. We decided to assess IF with unpolarized Sirius red staining, firstly because Sirius red staining is commonly used in pathological laboratories since the 1980s40, especially to assess liver fibrosis41,42,43,44,45, secondly because to its high specificity for collagen fibers and its previously demonstrated superiority compared to polarized Sirius Red and trichrome staining46,47,48,49. Indeed, IF is typically considered to be an excess accumulation of fibrillary collagen. Types I and III collagen represent almost all collagens synthetized by fibroblasts and often predominate in fibrotic scars. Sirius red dye molecules intercalate into the tertiary groove in the structure of types I and III collagen and are strongly birefringent when observed under polarized light. Polarized Sirius Red analysis exhibited good results in IF assessment34,49,50 but it requires individual fields to be photographed in polarized light and analyzed or cost-expensive whole slide scanners capable of polarized light. Similarly, trichrome staining, that is commonly used to the visual assessment of collagen content in the interstitium, has also be used to quantify IF by image analysis, with good results51,52. However, studies demonstrated than unpolarized Sirius Red had not only the best correlation with estimated glomerular filtration rate (GFR)47,48 compared to polarized Sirius Red and trichrome staining, but also significant correlation with the decrease of GFR33.

In our uDCD cohort, both mIF and ci according to the Banff classification significantly increased the first three months after renal transplantation From M3 to M12, mIF significantly increased whereas Banff classification failed to highlight increase of ci. Then, mIF may be more accurate than semiquantitative Banff evaluation to identify the effects of antifibrotic therapeutic interventions. Morphometric IF tended to increase in the first month after renal transplantation but without reaching significance, due to the limited size of sample, graft biopsies being not systematically performed during the first month. However, this early and sensitive quantification of IF may be a very valuable surrogate marker to study therapeutic intervention aiming the early development if CAD in kidney transplantation. Moreover, whereas increase of creatininemia at M12 was not associated with ci according to the Banff scoring system at M3, it tended to be associated with mIF at M3, suggesting usefulness of a more sensitive quantification analysis.

In our study, mIF at M12 was significantly associated with increase of creatininemia at M12 and at last follow-up, which confirms the need for a precise assessment of IF.

IF, which corresponds to the replacement of renal functional tissue by ECM, is not only a major concern after kidney transplantation, but also during CKD in native kidneys. Indeed, renal fibrosis still represents the final target to treat CKD53. Surprisingly, few studies have focused on morphometric quantification of IF on native kidneys54,55. Hunter et al.54 demonstrated than high collagen matrix index and fibrillary collagen index, both assessed by quantitative morphometry after Sirius Red staining, predicted relapse and progression to ESRD during lupus nephritis. Similarly, Gibyeli Genek et al.55 performed a quantitative evaluation of IF with Sirius Red in IgA nephritis and highlighted that such evaluation might serve as an effective novel method to determine the prognosis in IgA nephritis.

To conclude, the new morphometric image analysis method we developed, using a routine and cheap staining, may provide valuable tool to assess IF during chronic allograft dysfunction and thus to evaluate new sources of grafts. The use of this method to describe CKD on native kidney should also be evaluated.

Methods

Patients

After institutional review board, we retrospectively included all the 66 patients who received a kidney graft from uDCD between 2007 and 2012 in Bicêtre hospital. Kidney donation followed the 2008 Declaration of Istanbul principles and the French Agence Nationale de la Biomédecine regulation. Research was approved by the committee of the Centre de Ressources Biologiques (CRB, Paris Sud University). Informed written consent was given by all the patients for the scientific use of the graft biopsies in the CRB, Paris Sud University. All procedures and the use of tissues were performed in accordance with the Declaration of Helsinki principles. Clinical reports and biological data were collected from the associated clinical database.

Histological review of kidney graft biopsies

A total of 166 kidney graft biopsies was performed in the 66 patients. The biopsies were processed for routine light microscopy, as we previously described56. Biopsy samples were fixed in formalin, acetic acid and alcohol (AFA), paraffin-embedded and sliced 2.5 µm thick. Slides were stained with HES (hematoxylin, eosin and saffron), Masson’s trichrome (3 sections), periodic acid Schiff, Jones methenamine silver and Sirius red. Third Masson’s trichrome and Sirius red staining were consecutive. Kidney graft biopsies were reviewed independently by 2 pathologists (CP and SF) for histological features according to Banff recommendations35 in a blind manner from clinical data. Immunostaining for C4d was performed using a rabbit monoclonal A24-T anti-human C4d antibody (DB Biotech, Kosice, Slovak Republic; dilution 1/100) and a Leica BOND-MAX™ autostainer (Leica Biosystems Newcastle Ltd, UK). Epitope retrieval was achieved using the ready-to-use Bond Epitope Retrieval Solution 1 (Leica Biosystems Newcastle Ltd, UK) after 30 min heating.

Quantification of interstitial fibrosis by image analysis

Renal biopsy sections sliced 2.5 µm sliced and stained with Sirius red were digitalized by a ScanScope Aperio scanner (CS), using 20X objective. For each biopsy, the cortical section, defined as the part inside the renal capsule and outside the medulla, was manually selected on digital slides. Glomeruli and medium-sized arteries were deleted by the operator.

Renal-cortex fibrosis was quantified using a computer-based morphometric analysis software (Calopix, Tribvn Healthcare, Châtillon France) as previously published56,57. Briefly, the operator manually selects internal Sirius Red negative areas and positive areas and next runs the software that show the final selection of the Red area. The internal negative control step allows the comparison of slides that are sequentially stained in various batches of Sirius Red since staining usually may vary. Finally, the result was expressed as a percentage of the red positive area on the total cortical surface. Morphometric analyses were performed twice for each biopsy in an independant manner by two pathologists (CP and SF) who had no knowledge of the clinical data.

Statistical analyzes

Descriptive statistical methods (means, medians, standard deviations and ranges) were used to assess the distributions of variables. Wilcoxon rank sum and t tests for continuous variables, Fisher’s exact and chi-squared tests for categorical variables were performed. The intraclass correlation coefficient (ICC) with its 95% CI was used to study the reproducibility of morphometric IF (mIF) measures. Correlations between quantitative variables were assessed with Pearson product-moment correlation coefficient. For all analyses, a p value <0.05 was regarded as significant. Analyses were performed using R software (version 3.2.0)58,59, InStat 3 software (GraphPad Software, San Diego, CA) and Prism 4 (GraphPad Software).