Comparing distress of mouse models for liver damage

Tang, Guanglin; Seume, Nico; Häger, Christine; Kumstel, Simone; Abshagen, Kerstin; Bleich, André; Vollmar, Brigitte; Talbot, Steven R.; Zhang, Xianbin; Zechner, Dietmar

doi:10.1038/s41598-020-76391-w

Download PDF

Article
Open access
Published: 13 November 2020

Comparing distress of mouse models for liver damage

Guanglin Tang¹,
Nico Seume¹,
Christine Häger²,
Simone Kumstel¹,
Kerstin Abshagen¹,
André Bleich²,
Brigitte Vollmar¹,
Steven R. Talbot²^na1,
Xianbin Zhang¹^na1 &
…
Dietmar Zechner¹^na1

Scientific Reports volume 10, Article number: 19814 (2020) Cite this article

4347 Accesses
13 Citations
1 Altmetric
Metrics details

Subjects

Abstract

In order to foster animal welfare as well as high quality of research, many countries regulate by law that the severity of animal experiments must be evaluated and considered when performing biomedical research. It is well accepted that multiple parameters rather than a single readout parameter should be applied to describe animal distress or suffering. However, since the performance of readout parameters for animal distress is rarely defined and methods for multivariate analysis have only in rare cases been used, it is not known which methodology is most appropriate to define animal distress. This study used receiver operating characteristic curve analysis to quantify the performance of burrowing activity, body weight change and a distress score of mice after induction of liver damage by bile duct ligation or carbon tetrachloride. In addition, Support Vector Machine classification was used to compare the distress of these mouse models. This approach demonstrated that bile duct ligation causes much more distress than carbon tetrachloride-induced liver damage. This study, therefore, provides a prototype how to compare two animal models by considering several readout parameters. In the future these or similar methods for multivariate analysis will be necessary, when assessing and comparing the severity of animal models.

A novel multi-parametric analysis of non-invasive methods to assess animal distress during chronic pancreatitis

Article Open access 01 October 2019

Ahmed Abdelrahman, Simone Kumstel, … Dietmar Zechner

Robustness of a multivariate composite score when evaluating distress of animal models for gastrointestinal diseases

Article Open access 14 February 2023

Steven R. Talbot, Simone Kumstel, … Dietmar Zechner

Machine learning prediction models for prognosis of critically ill patients after open-heart surgery

Article Open access 09 February 2021

Zhihua Zhong, Xin Yuan, … Fanna Liu

Introduction

Public discussions on animal welfare have caused the implementation of laws and guidelines to regulate experiments on animals in most countries^1,2. This made animal welfare a top priority when conducting and publishing in vivo studies^3,4,5. Thus, when pursuing animal experiments, scientists have to balance two goals: animal welfare and the potential benefit of research. While this objective is self-evident and coherent, a detailed concept what needs to be done to balance both goals is more difficult to define. In many countries a prospective and often also actual severity assessment of animal experiments are legally required⁶. This should provide the basis for an ethical evaluation and the conclusion, if an animal experiment is justified and, therefore, should be allowed to be conducted.

Thus, an evidence-based analysis of animal distress is often legally required and is also essential for a realistic harm/benefit analysis, a sensible selection of an animal model and the development of refinement strategies. Scientists have primarily used non-invasive methods to assess animal distress. For example, many distress scores based on appearance, behaviour and physical parameters of rodents have been developed^7,8,9. In addition, natural behaviour of animals such as burrowing activity has been explored to assess distress^10,11,12. One of the most popular parameters to evaluate suffering from animals is body weight which has the distinct advantage that it can be easily and objectively measured^7,13,14,15.

While many distinct readout parameters for measuring distress are available, very little is known about how these methods can be compared. The performance of a method or a diagnostic test is usually evaluated by receiver operating characteristic (ROC) curve analysis. The area under the curve (AUC) quantifies this performance and indicates how accurately a test discriminates between two states, typically referred to as diseased and non-diseased state¹⁶. However, it is well accepted that multiple parameters rather than a single readout parameter should be applied to describe and compare animal distress^7,17,18. Many studies indeed evaluate several readout parameters for distress, but do not combine these parameters by a statistical procedure to reach a holistic conclusion^{13,19,20,21,22}. To facilitate such an integrated conclusion, a multivariate analysis, which combines different readout parameters when analysing animal distress, is necessary. Such analyses are often performed in clinical situations in form of a binary logistic regression in order to test whether a combination of biomarkers has higher discriminatory power to differentiate between diseased and non-diseased states than single biomarkers^23,24. Another option to analyse more than one readout parameter simultaneously is clustering, followed by Support Vector Machine (SVM) classification. For example, clustering was used to differentiate between subgroups of patients with irritable bowel syndrome²⁵ or to compare distinct distress levels of mice during colitis¹⁵.

Thus, it was one aim of this study to evaluate, if ROC curve analysis and binary logistic regression be used to describe the performance of single or multiple readout parameters for defining distress in animals. Moreover, it was the aim to assess whether SVM classification can be used to compare the severity of two animal models. We compared distress caused by bile duct ligation (BDL) to distress caused by carbon tetrachloride (CCl₄). These two animal models are widely used for studying liver damage and fibrosis^{26,27,28,29,30}.

Results

Characterisation of parameters measuring distress after BDL

Mice were evaluated before and after BDL during the early, middle and late phases of cholestasis by assessing a distress score, burrowing activity and body weight (Fig. 1). First of all, we aspired to evaluate the suitability of these parameters to measure distress of mice. We hypothesized that parameters, which are suitable to measure distress should be able to differentiate between healthy and diseased mice as well as between mice which survived and non-survivors.

Thus, we first analysed mice, which survived until day 14 (survivors), in order to explore, if these read out parameters could differentiate between healthy and diseased mice. While the distress score increased continuously after BDL, the burrowing activity and body weight of mice rather decreased after this intervention (supplementary Fig. S1). No significant change in any of these parameters was observed when treating the mice with the NLRP3 inflammasome inhibitor MCC950 (supplementary Fig. S1), although previous studies suggested that this inhibitor can have analgesic function³¹. Thus, all BDL cohort mice were pooled and distress before BDL (pre) was compared to distress after BDL (post). We observed that BDL led to a significant increase of the distress score (Fig. 2a). It caused a significant decrease of burrowing activity (Fig. 2b) and a reduction of body weight (Fig. 2c). This suggests that distress score, burrowing activity and change in body weight are sensitive parameters that can differentiate between distress before (level 0) and after BDL (level 1). To evaluate the performance of these parameters in distinguishing between these two distress levels, we used ROC curves. We observed that all parameters, distress score, burrowing activity and body weight, can discriminate between these two distress levels (Fig. 2d). Combining multiple distress parameters with binary logistic regression revealed that the combination of distress score plus burrowing activity, distress score plus body weight and the combination of all three parameters produced a very high AUC indicating a very good performance in defining distress (Fig. 2e–g).

We also evaluated, if distress parameters could differentiate between different magnitudes of cholestasis. ALP activity has been demonstrated to increase with the progression of cholestasis³². Therefore, we evaluated ALP activity of mice after 2, 5 or 14 days of cholestasis and used k-means clustering to discretize the data into two categories: Low ALP and high ALP. Surprisingly, we observed that neither the distress score nor the burrowing activity could differentiate between low ALP and high ALP animals (supplementary Fig. S2). However, body weight change could differentiate well (AUC = 0.79) between these two clusters (supplementary Fig. S2). In order to analyse, if other parameters measuring distress would improve the differentiation between low ALP and high ALP animals, we determined the corticosterone concentration in the blood plasma (supplementary Fig. S2). Indeed, the corticosterone concentration in the blood plasma of animals could also differentiate well (AUC = 0.72) between low and high ALP animals (supplementary Fig. S2). However, when combining body weight change and corticosterone concentration in a logistic regression the discriminatory power of the combination was not higher than the discriminatory power of only the body weight change (supplementary Fig. S2). Thus, for differentiating between low and high ALP animals analysing body weight change is sufficient. Possibly, a combination with yet unknown additional distress parameters might be needed to predict the magnitude of cholestasis with an even higher discriminatory power.

We then explored, if mice which did not survive until day 14 (non-survivors) reached a different distress level before death when compared to mice that survived after BDL. We observed that the distress score of non-survivors measured before death is significantly higher than the distress score of survivors (Fig. 3a). The burrowing activity (Fig. 3b) and body weight (Fig. 3c) of non-survivors were significantly lower than those of surviving mice. These data suggest that non-survivors experience increased distress before death (level 2) when compared to surviving mice (level 1). In order to evaluate the performance of the readout parameters in distinguishing between these two distress levels, we used ROC curves. All single readout parameters such as distress score, burrowing activity and change in body weight had discriminatory power to differentiate between survivors and non-survivors (Fig. 3d). After combining multiple distress parameters with binary logistic regression, we observed that combination of two or three parameters also had a high discriminatory power (Fig. 3e,f). The combination of all three parameters (distress score plus burrowing activity plus body weight) produced the largest AUC, suggesting that the combination of all readout parameters allows the best differentiation between survivors and non-survivors (Fig. 3g). These data, therefore, suggest that the distress score, burrowing activity and body weight are suitable parameters to describe distinct distress levels.

Considering multiple parameters when differentiating between two distress levels

Next, we evaluated whether all three parameters can be used together to discriminate between the distress of healthy (pre-intervention) against the distress of diseased animals (post-intervention). We used machine learning to address this question: more specifically, we used a Support Vector Machine (SVM) to classify samples. Class-labels were obtained by labelling pre- against post-intervention data. For subsequent classification, we first split the data randomly into a training (containing 70% of data) and a test data set (containing 30% of data). The model was then built using the training data (Fig. 4a). Within the SVM, a linear kernel function was used to find the classifier. This tuned and optimized discriminator was visualized in the plots as a hyperplane, separating two putative levels of distress, which were defined as distress level 0 or distress level 1 (Fig. 4b).

For internal model optimization, and to address potential sampling bias we used hyper-parameter tuning and fivefold repeated tenfold cross-validation. The mean accuracies, sensitivities and specificities from this process were reported for the model (Fig. 4c shows results for both, the optimized and non-optimized model). The model itself was validated using the excluded (and labelled) test data (Fig. 4c). We observed high accuracy, sensitivity, and specificity for training as well as test data (Fig. 4c). This suggests that the combination of all three parameters (distress score, burrowing activity, bodyweight) exhibits a high diagnostic ability for the differentiation between distress level 0 and distress level 1. The rigorous model design and cross-validation process further ensured that these results are not based on potential sampling bias. Also, the optimized model shows lower accuracies for the external test data (accuracy optimized model: 0.80; accuracy not optimized model: 1). This was expected as the not-optimized models tend to overfit the data.

Comparing distress of the BDL to the CCl₄ animal model

Next we pursued the question if and how we can compare the distress between two animal models. In order to compare the BDL model to another animal model widely used for studying liver damage and fibrosis, mice were repetitively injected with CCl₄ (Fig. 5a). These mice were also either treated with MCC950 or a vehicle control and the distress of these animals was analysed before any intervention and during the early, middle and late phases of disease progression by assessing the distress score, burrowing activity and body weight (Fig. 5a). Again, no significant change in distress score, burrowing activity and body weight was observed when treating the mice with MCC950 or a vehicle control (data not shown). Thus, all CCl₄ cohort mice were pooled and post-CCl₄ and post-BDL data were then compared (Fig. 5b–d). We observed that CCl₄-treated mice had a significantly decreased distress score (Fig. 5b), increased burrowing activity (Fig. 5c) and significantly less body weight reduction (Fig. 5d), when compared to BDL mice. Thus, all three read out parameters indicate that CCl₄ causes less distress than BDL.

We then compared these two animal models by using the optimized training model based on 70% of the BDL data. We then classified the post-CCl₄ data according to this training model (see blue crosses in Fig. 6a). In addition, we classified the post-BDL data of the test data set (see blue crosses in Fig. 6b). Only 2 out of 30 post-CCl₄ data points were assigned to distress level 1, whereas 12 out of 15 post-BDL data points were correctly assigned to distress level 1 (Fig. 6c). Using Fisher’s exact test, a significant difference in the distress levels distribution between BDL and the CCl₄ cohort was observed (P < 0.001). This multivariate analysis suggests that at most time points CCl₄-treated animals experience less distress than animals after BDL.

In order to compare liver damage in both animal models, we assessed the activity of aspartate aminotransferase (AST), alanine aminotransferase (ALT) and glutamate dehydrogenase (GLDH) in blood plasma. AST and ALT activity was significantly increased in cholestatic as well as CCl₄-treated mice, when compared to heathy control animals (supplementary Figure S3). GLDH was significantly increased in cholestatic animals when compared to healthy or CCl₄-treated mice (supplementary Figure S3). In addition, we also evaluated oxidative stress by measuring malondialdehyde in liver tissue. Malondialdehyde was significantly increased after repetitive CCl₄-treatment when compared to cholestatic or healthy mice (supplementary Figure S3). These results demonstrate that the liver is damaged after cholestasis and toxic liver injury, but that specific pathophysiological features such as the induction of oxidative stress differs between these two animal models.

Discussion

There is an urgent need to evaluate the feasibility of methods to compare distress caused by different animal models³³. The present study compared BDL to CCl₄-induced liver damage and evaluated animal distress based on three distinct readout parameters. The multivariate analysis using SVM clearly demonstrated that BDL caused more distress than the treatment with CCl₄.

No direct multivariate comparison of distress between BDL and CCl₄-induced liver damage has been published to our knowledge. However, publications describe an average body weight loss of 15–20%, 18%, or 20–30% after BDL^34,35,36 or a transient body weight loss of approximately 8% or 10% during repetitive CCl₄ injection^37,38. This supports our conclusion that BDL causes more distress than CCl₄. However, the BDL animal model will still be needed for the following reasons. Distinct animal models are necessary to address the central principle of science that robust research needs many independent lines of evidence³⁹. Indeed, BDL and CCl₄-induced liver damage are often used in one study to prove a scientific conclusion in two independent animal models^40,41. In addition, there are also some differences between these two animal models. BDL causes an increase in biliary pressure, inflammation and cytokine secretion resulting in proliferation of biliary epithelial cells and portal fibrosis⁴². BDL therefore mimics cholestatic injury, which is, for example, observed during autoimmune diseases (primary biliary cirrhosis and primary sclerosing cirrhosis) and obstructive conditions such as cholelithiasis and tumour compression of bile ducts⁴³. In contrast, metabolites of CCl₄, such as trichloromethyl radicals, induce oxidative stress, centrilobular liver necrosis, an inflammatory response and liver fibrosis^42,44. In many aspects, it mimics liver damage in humans by different toxins⁴². These distinct pathophysiological features and mechanisms of animal models will remain to be of utterly importance, when deciding which animal model will be used for addressing a specific scientific hypothesis. However, at least for the BDL animal model, the use of analgesics should be essential⁴. It is especially necessary to mention this point, if one considers that only 3.4% of studies, which describe experiments using BDL in mice, specified the administration of a systemic analgesic⁴⁵. This is surprising, considering that it was already demonstrated decades ago that animals experience post-operative pain after BDL⁴⁶. However, analgesia can also interfere with disease mechanisms and can actually be harmful to animals when applied in high doses^47,48.

The most important prerequisite for being able to judge animal distress are methods with high discriminatory power to differentiate between distinct distress levels. This study used ROC curve analysis to evaluate the discriminatory power of readout parameters. This tool has been widely used to define the diagnostic ability of methods in a clinical situation. For example, ROC curve analysis helped to define which biomarker in the blood has the best discriminatory power to predict pancreatic cancer²³ or which biochemical marker is suitable to predict increased risk of stillbirth in women with intrahepatic cholestasis of pregnancy⁴⁹. In our study ROC curve analysis judged the suitability of readout parameters to differentiate between healthy mice and diseased mice or between diseased mice, which survive, and diseased mice, which will succumb to their disease. All readout parameters: distress score, burrowing activity and body weight change had discriminatory power to differentiate between animals before and after induction of cholestasis (Fig. 2d). However, burrowing activity was the parameter with the lowest performance (performance of parameters: distress score > body weight change > burrowing activity). When differentiating between survivors and non-survivors all readout parameters had again a high discriminatory power (Fig. 3d), but body weight change was the parameter with the lowest performance (performance of parameters: burrowing activity > distress score > body weight change). In addition to assessing the discriminatory power, one can determine the optimal cut-off of a diagnostic method by Youden’s index and calculate the positive predictive value (PPV)⁵⁰. We, therefore, also calculated the PPV using the combination of all three parameters. An optimal cut-off calculated by Youden’s index lead to 5 false positive and 10 true positive predictions, resulting in a PPV of 67%. Thus, it is not practical to use this method for deciding, if animals should be euthanized, because one would kill too many animals, which would otherwise survive. However, the combination of all three parameters is useful in describing distinct distress levels and can be used to compare 2 different animal models. These experiments also demonstrate that not a single readout parameter can be used as the gold standard for all situations.

This need for considering multiple parameters to assess animal welfare was often postulated^7,17,18. However, in many studies several parameters are evaluated, but these parameters are often not combined by a statistical procedure to reach a holistic conclusion^{13,19,20,21,22}. Only very few studies exist, which use biostatistical methods to combine distinct readout parameters for defining animal distress. For example, Peng et al. have used composite z scores to compare the results of several behavioural tests between control mice and mice after surgery²⁰. Häger et al. have used k-means clustering to compare distinct distress levels during colitis¹⁵. Möller et al. have used principal component analysis to describe many behavioural and biochemical variables supporting the conclusion that there is no major difference in distress between rats after electrode implantation and rats after electrode implantation plus kindling of epilepsy⁵¹. In our study we plotted three parameters and defined distress levels by SVM classification. This method had a high specificity, sensitivity and accuracy when validated with test data (Fig. 4c). However, we also want to emphasize that ROC curve analysis indicated that single read out parameters or two read out parameters, which were combined by multiple logistic regression, have also a very high discriminatory power to differentiate between distress levels in the BDL animal model (Fig. 2g). This indicates that less than three readout parameters might suffice to define the distress of animals and to compare animal models. However, we propose that substantiating a conclusion by considering several readout parameter is better to than relying on only one single parameter. Such a multivariate conclusion reduces arbitrariness when choosing a readout parameter and therefore diminishes bias when comparing animal models.

Although this publication suggests that SVMs can be used to compare the distress of two animal models, it is premature to claim that this method will allow us to determine the severity of all animal models in a scientific and rational manner. First, distinct research facilities will have to test if this or similar methods can be applied to many different animal models to compare distress between distinct models. Second, accessible tools to assess and compare distress have to be provided for the scientific community. Talbot and colleagues have started to explore such a tool, and recommend the use of a Relative Severity Assessment (RELSA) score for comparing animal models⁵². It will be important for the research community to make such tools accessible online. Third, the scientific community will have to provide a network of comparing distress between the most essential animal models. Only if this network allows an arrangement of animal models according to their distress level, one could start grading evidence-based severity into categories (e.g. mild, moderate or severe) as demanded by the legislation of many countries.

Methods

Animals

This study was conducted in accordance with the European directive 2010/63/EU and national law. All experiments were approved by the local ethics committee of the public authority (Landesamt für Landwirtschaft, Lebensmittelsicherheit und Fischerei Mecklenburg-Vorpommern, 7221.3-1-002/17). Because female mice were used to expand the mouse strain, surplus male BALB/cANCrl mice were used for this study. Please note that the focus on male mice might be a limitation of this study. A few mice of this mouse strain were purchased from Charles River (Wilmington, MA USA) and bred in the central animal facility of the Rostock University Medical Center (the health of the animal stock is routinely checked according to FELASA guidelines). Before the experiment the mice had more than 2 days for acclimatization. Animals were allocated in a non-random manner matching the age of both treatment groups and the experimenters were not blinded when injecting drugs. Distress was evaluated by two people (GT, NS), and in case of difficulties, in addition by another person (DZ). The required number of animals was calculated before starting the experiments by sample size calculation (alpha = 0.05, power = 0.8). Mice were group housed during breeding and the first few days before the actual experiments. Afterwards they were single housed in Eurostandard Type III clear plastic cages with wire lid, light/dark cycle of 12 h/12 h (dawn: 6:30–7:00 am) at a temperature of 21 ± 2 °C, with a relative humidity of 60 ± 20%. Autoclaved bedding (Bedding Espe Max 3–5 mm granulate, H 0234-500, Abedd, Vienna, Austria), shredded tissue paper (PZN03058052, FSMED Verbandmittel GmbH, Frankenberg, Deutschland), one paper tunnel (75 × 38 mm, H 0528-151, ssniff) and a wooden enrichment tool (Espe size S, 40 × 16 × 10 mm), H0234.NSG, Abedd). Food (pellets, V1534.000, 10 mm, ssniff) and tap water ad libitum were provided. Mice were euthanized by quickly anaesthetizing them with 5 vol % isoflurane and killing them with cervical dislocation.

Induction of liver damage

For inducing cholestasis by BDL on day 0, mice were quickly anaesthetized by 5 vol % isoflurane (CP-pharma, Burgdorf, Germany) and placed on a heating plate (37 °C). Then the laparotomy was performed under anesthesia (1.2–2.5 vol % isoflurane). As described in a previous study⁵³, the common bile duct was ligated by three surgical knots and was then transected between the two distal ligations. After closing the abdominal cavity, each mouse was allowed to recover from anesthesia in a single cage in front of a red warming lamp. The surgical procedure took 25–40 min. To relieve pain, 5 mg/kg carprofen (Pfizer GmbH, Berlin, Germany) was injected (sc) before operation and 0.25 ml metamizol (500 mg/ml, Ratiopharm GmbH, Ulm, Germany) was added to the drinking water (100 ml, drinking water was changed daily) until euthanasia of the mice. Supportive care was given after BDL by offering soaked food to all animals until euthanasia. In order to evaluate, if the NLRP3 inflammasome inhibitor MCC950 (Sigma Aldrich, St. Louis, USA, code PZ0280) could impair distress, 20 mg/kg MCC950 or aqua dest. ad inj. (Sham) was ip injected daily from day 1 before BDL to day 13 after BDL. For inducing liver damage by CCl₄ (Merck Millipore, Eschborn, Germany, code 1.02209.1000), this substance was diluted fourfold with corn oil (Sigma-Aldrich, code C8267). Per g body weight 1 µl of this solution (dose of CCl₄: 0.25 ml/kg body weight) was injected (ip) between 14:40–15:00 into the mice twice per week until day 42 (on day 0, 4, 7, 11, 14, 18, 21, 25, 28, 32, 35, 39). To relieve pain, 0.25 ml metamizol (500 mg/mL, Ratiopharm GmbH, Ulm, Germany) was added to the drinking water (100 ml) until euthanasia of the mice. 20 mg/kg MCC950 or aqua dest. ad inj. (Sham) was injected (ip) daily from day 28 to day 41 after first CCl₄ injection. The sixteen BDL mice (survivors) were at the beginning of the experiment 10.29/8.07–18.61 (median/interquartile range) weeks old and had 27.11/21.80–29.68 (median/interquartile range) g body weight, whereas ten BDL mice (non-survivors) were at the beginning of the experiment 9.79/8.36–12.20 weeks old and had 24.90/23.83–26.23 g body weight. The ten CCl₄-treated mice were at the beginning of the experiment 7.86/7.86–8.14 weeks old and had 24.52/22.99–24.97 g body weight.

Evaluation of animal distress

Burrowing

To evaluate burrowing activity of mice, a tube (length: 15 cm, diameter: 6.5 cm) filled with 200 g of food pellets was placed into the cage 2–3 h before the dark phase⁵⁴. The remaining pellets in the burrowing tube were weighed after 17 ± 2 h and the weight of the burrowed pellets was calculated. Burrowing activity was measured before the first intervention (pre) and during the acute (day 0), early (BDL: day 1, CCl₄: day 4), middle (BDL: day 4, CCl₄: day 18) and late (BDL: day 13, CCl₄: day 39) phase of liver damage. The burrowing tube was always placed into the cage 1 ± 0.5 h after CCl₄ injection. Changes in burrowing activity were calculated by using the weight of burrowed pellets on day 7 before BDL and on day 8 before CCl₄ injection as a reference for the respective cohort.

Distress score

The wellbeing of mice was assessed by evaluating multiple parameters with the help of a distress score⁵⁵. When the total score was higher than 15, the affected mouse was euthanized in order to avoid further deterioration of health. Distress was assessed before the first intervention (pre) and during the acute (day 0), early (BDL: day 1, CCl₄: day 4), middle (BDL: day 4, CCl₄: day 18) and late (BDL: day 13, CCl₄: day 39) phase of liver damage. The distress was always evaluated 30 ± 5 min after CCl₄ injection.

Body weight

The body weight of mice was assessed before the first intervention (pre) and during the acute (day 1), early (BDL: day 2, CCl₄: day 5), middle (BDL: day 5, CCl₄: day 19) and late (BDL: day 14, CCl₄: day 40) phase of liver damage. Thus, in all experiments the body weight was determined 1 day after measuring distress by a score sheet or by burrowing activity. This allows enough time for a body weight adjustment to a specific distress level (e.g. after injection of CCl₄).

Blood plasma and tissue analysis

AST, ALT, GLDH and ALP activity were spectrophotometrically assessed in blood plasma using the Cobas c111 analyser (Roche GmbH, Mannheim, Germany). For determining the corticosterone concentration in blood plasma the mouse and rat ELISA-Kit (DEV9922, Demeditec Diagnostics GmbH, Erfurt, Germany) was used according to the instructions of the manufacturer. Oxidative stress was evaluated by measuring the total malondialdehyde concentration after hydrolysing liver tissue at pH 1–2 and using the BIOXYTEC MDA-586 kit from OxisResearch (OXIS Health Products Inc. Portland, OR, USA).

Data presentation and statistical analysis

In line graphs data are presented as mean value ± standard deviation, whereas box plots indicate median interquartile range as well as 90% percentile and 10% percentile in form of whiskers. The characteristics of data were assessed by Shapiro–Wilk normality test and by Levene median equal variance test. Student’s t-test (based on normal distribution and equal variance of data) or the Mann–Whitney Rank Sum test were used to determine the significance of differences. When comparing two groups, differences with P ≤ 0.05 were considered to be significant. When comparing treatment groups at several time points, differences were only considered to be significant, when the P-value was lower than 0.05 divided by the number of meaningful comparisons (Bonferroni correction for multiple comparisons). These evaluations were done using SigmaPlot 12.0 (SYSTAT Software Inc., San Jose, USA; https://systatsoftware.com/products/sigmaplot/). For box plots, ROC curves, logistic regressions and Support Vector Machine classification, data of the pre- and post-intervention phase (all data from the acute, early and middle phase) were used to differentiate between healthy and diseased animals. For differentiating between post-BDL survivors and non-survivors, all data of surviving mice of the acute, early and middle phase after BDL were compared to data measured 0–2 days before death or euthanasia of non-survivors.

ROC curve analysis (using SigmaPlot 12.0, SYSTAT Software Inc.) determined the area under the curve (AUC) with the respecting 95% confidence intervals (CI) as a measurement for the performance of the readout parameters⁵⁶. In addition, this analysis gives the asymptotic P-value that determines if the AUC is significantly different from AUC = 0.5. To analyse the efficacy of the combination of two or three parameters, the data sets were combined by binary logistic regression using SigmaPlot 12.0 and the ROC curves were calculated afterwards.

In order to analyse distress considering all three readout parameters simultaneously, a Support Vector Machine was built on a 64-bit computer with 32 GB RAM using the R software⁵⁷ with the following packages: caret⁵⁸ and e1071⁵⁹. Prior to model building, samples were class-labelled using the experimental time phases (pre- vs. post-intervention). Categories were labelled as level 0 (pre) and 1 (post) and used in the classification process. Samples were randomized into 70% training and 30% test data prior to model building. A linear kernel function (u'∙v) was then used to construct the SVM-classifier with the training data. Data were scaled for the building process. The non-optimised fit was then tuned for the hyper-parameter cost function to optimise the SVM margin width for the classifier. In parallel, the tuning process was stratified using fivefold repeated tenfold cross-validation. The mean from all internal validation runs was then used to construct the optimised classifier. Model performance was reported in two stages: (a) re-classification (prediction) of the training data against the model (non-generalizable internal performance check) and (b) classification of the external test data (validation). In each case, data from a confusion matrix (accuracy, sensitivity, specificity) was reported for both, the optimised and the non-optimised model. The resulting values reflect model stability and also compensate for low sample sizes via repeated cross-validation. The externalised test data further assess the generalisability of the model. Finally, the hyperplane was constructed by coefficient extraction and grid extension of the optimised SVM model. When comparing CCl₄ cohorts to BDL, the optimized model was used to predict severity classes for post-intervention BDL data from the externalized test set as well as post-intervention CCl₄ data. The predictions were plotted in a scatterplot and class differences analyzed by Fisher’s Exact Test.

Data availability

The authors declare that all data supporting the findings of this study are available within the paper and its supplementary information file.

References

1National Research Council (US) Committee for the Update of the Guide for the Care and Use of Laboratory Animals. Guide for the Care and Use of Laboratory Animals (Eighth Edition). (National Academy of Sciences, 2011).
2Directive 2010/63/EU of the European Parliament and of the Council of 22 September 2010 on the protection of animals used for scientific purposes (Text with EEA relevance). Available from: https://eur-lex.europa.eu/LexUriServ/LexUriServ.do?uri=OJ:L:2010:276:0033:0079:en:PDF. (2019).
Diaz, S. L. Conducting and reporting animal experimentation: Quo vadis?. Eur. J. Neurosci. https://doi.org/10.1111/ejn.14091 (2018).
Article PubMed Google Scholar
Smith, A. J., Clutton, R. E., Lilley, E., Hansen, K. E. A. & Brattelid, T. PREPARE: Guidelines for planning animal research and testing. Lab. Anim. 52, 135–141. https://doi.org/10.1177/0023677217724823 (2018).
Article CAS PubMed Google Scholar
Kilkenny, C., Browne, W. J., Cuthill, I. C., Emerson, M. & Altman, D. G. Improving bioscience research reporting: the ARRIVE guidelines for reporting animal research. PLoS Biol. 8, e1000412. https://doi.org/10.1371/journal.pbio.1000412 (2010).
Article CAS PubMed PubMed Central Google Scholar
Smith, D. et al. Classification and reporting of severity experienced by animals used in scientific procedures: FELASA/ECLAM/ESLAV Working Group report. Lab. Anim. 52, 5–57. https://doi.org/10.1177/0023677217744587 (2018).
Article CAS PubMed PubMed Central Google Scholar
Morton, D. B. & Griffiths, P. H. Guidelines on the recognition of pain, distress and discomfort in experimental animals and an hypothesis for assessment. Vet. Rec. 116, 431–436 (1985).
Article CAS PubMed Google Scholar
Roughan, J. V. & Flecknell, P. A. Evaluation of a short duration behaviour-based post-operative pain scoring system in rats. Eur. J. Pain 7, 397–406. https://doi.org/10.1016/S1090-3801(02)00140-4 (2003).
Article PubMed Google Scholar
Graf, R., Cinelli, P. & Arras, M. Morbidity scoring after abdominal surgery. Lab. Anim. 50, 453–458. https://doi.org/10.1177/0023677216675188 (2016).
Article CAS PubMed Google Scholar
Deacon, R. M. Burrowing in rodents: A sensitive method for detecting behavioral dysfunction. Nat. Protoc. 1, 118–121. https://doi.org/10.1038/nprot.2006.19 (2006).
Article CAS PubMed Google Scholar
Jirkof, P. et al. Burrowing behavior as an indicator of post-laparotomy pain in mice. Front. Behav. Neurosci. 4, 165. https://doi.org/10.3389/fnbeh.2010.00165 (2010).
Article PubMed PubMed Central Google Scholar
Shepherd, A. J., Cloud, M. E., Cao, Y. Q. & Mohapatra, D. P. Deficits in burrowing behaviors are associated with mouse models of neuropathic but not inflammatory pain or migraine. Front. Behav. Neurosci. 12, 124. https://doi.org/10.3389/fnbeh.2018.00124 (2018).
Article CAS PubMed PubMed Central Google Scholar
Lofgren, J. et al. Analgesics promote welfare and sustain tumour growth in orthotopic 4T1 and B16 mouse cancer models. Lab. Anim. 52, 351–364. https://doi.org/10.1177/0023677217739934 (2018).
Article CAS PubMed Google Scholar
Hohlbaum, K. et al. Severity classification of repeated isoflurane anesthesia in C57BL/6JRj mice-assessing the degree of distress. PLoS ONE 12, e0179588. https://doi.org/10.1371/journal.pone.0179588 (2017).
Article CAS PubMed PubMed Central Google Scholar
Hager, C. et al. Running in the wheel: Defining individual severity levels in mice. PLoS Biol. 16, e2006159. https://doi.org/10.1371/journal.pbio.2006159 (2018).
Article CAS PubMed PubMed Central Google Scholar
Hajian-Tilaki, K. Receiver operating characteristic (ROC) curve analysis for medical diagnostic test evaluation. Caspian J. Intern. Med. 4, 627–635 (2013).
PubMed PubMed Central Google Scholar
Hawkins, P. et al. A guide to defining and implementing protocols for the welfare assessment of laboratory animals: Eleventh report of the BVAAWF/FRAME/RSPCA/UFAW Joint Working Group on Refinement. Lab. Anim. 45, 1–13. https://doi.org/10.1258/la.2010.010031 (2011).
Article CAS PubMed Google Scholar
Baumans, V. Science-based assessment of animal welfare: Laboratory animals. Rev. Sci. Tech. 24, 503–513 (2005).
Article CAS PubMed Google Scholar
Harikrishnan, V. S., Hansen, A. K., Abelson, K. S. & Sorensen, D. B. A comparison of various methods of blood sampling in mice and rats: Effects on animal welfare. Lab. Anim. 52, 253–264. https://doi.org/10.1177/0023677217741332 (2018).
Article CAS PubMed Google Scholar
Peng, M. et al. Battery of behavioral tests in mice to study postoperative delirium. Sci. Rep. 6, 29874. https://doi.org/10.1038/srep29874 (2016).
Article ADS CAS PubMed PubMed Central Google Scholar
Moore, E. S. et al. Comparing phlebotomy by tail tip amputation, facial vein puncture, and tail vein incision in C57BL/6 mice by using physiologic and behavioral metrics of pain and distress. J. Am. Assoc. Lab. Anim. Sci. 56, 307–317 (2017).
PubMed PubMed Central Google Scholar
Hurst, J. L. & West, R. S. Taming anxiety in laboratory mice. Nat. Methods 7, 825–826. https://doi.org/10.1038/nmeth.1500 (2010).
Article CAS PubMed Google Scholar
Kim, J. et al. Detection of early pancreatic ductal adenocarcinoma with thrombospondin-2 and CA19–9 blood markers. Sci. Transl. Med. https://doi.org/10.1126/scitranslmed.aah5583 (2017).
Article PubMed PubMed Central Google Scholar
Booken, N. et al. Sezary syndrome is a unique cutaneous T-cell lymphoma as identified by an expanded gene signature including diagnostic marker molecules CDO1 and DNM3. Leukemia 22, 393–399. https://doi.org/10.1038/sj.leu.2405044 (2008).
Article CAS PubMed Google Scholar
Guthrie, E. et al. Cluster analysis of symptoms and health seeking behaviour differentiates subgroups of patients with severe irritable bowel syndrome. Gut 52, 1616–1622. https://doi.org/10.1136/gut.52.11.1616 (2003).
Article CAS PubMed PubMed Central Google Scholar
Giebeler, A. et al. c-Met confers protection against chronic liver tissue damage and fibrosis progression after bile duct ligation in mice. Gastroenterology 137, 297–308. https://doi.org/10.1053/j.gastro.2009.01.068 (2009).
Article CAS PubMed Google Scholar
Modica, S. et al. Selective activation of nuclear bile acid receptor FXR in the intestine protects mice against cholestasis. Gastroenterology 142, 355–356. https://doi.org/10.1053/j.gastro.2011.10.028 (2012).
Article CAS PubMed Google Scholar
Ding, B. S. et al. Divergent angiocrine signals from vascular niche balance liver regeneration and fibrosis. Nature 505, 97–102. https://doi.org/10.1038/nature12681 (2014).
Article ADS CAS PubMed Google Scholar
Scholten, D., Trebicka, J., Liedtke, C. & Weiskirchen, R. The carbon tetrachloride model in mice. Lab. Anim. 49, 4–11. https://doi.org/10.1177/0023677215571192 (2015).
Article CAS PubMed Google Scholar
Cubero, F. J. et al. Combined activities of JNK1 and JNK2 in hepatocytes protect against toxic liver injury. Gastroenterology 150, 968–981. https://doi.org/10.1053/j.gastro.2015.12.019 (2016).
Article CAS PubMed Google Scholar
Khan, N., Kuo, A., Brockman, D. A., Cooper, M. A. & Smith, M. T. Pharmacological inhibition of the NLRP3 inflammasome as a potential target for multiple sclerosis induced central neuropathic pain. Inflammopharmacology 26, 77–86. https://doi.org/10.1007/s10787-017-0401-9 (2018).
Article CAS PubMed Google Scholar
Ghallab, A. et al. Influence of liver fibrosis on lobular zonation. Cells 8, 1556. https://doi.org/10.3390/cells8121556 (2019).
Article CAS PubMed Central Google Scholar
Bleich, A. & Tolba, R. H. How can we assess their suffering? German research consortium aims at defining a severity assessment framework for laboratory animals. Lab. Anim. 51, 667. https://doi.org/10.1177/0023677217733010 (2017).
Article CAS PubMed Google Scholar
Ezure, T. et al. The development and compensation of biliary cirrhosis in interleukin-6-deficient mice. Am. J. Pathol. 156, 1627–1639. https://doi.org/10.1016/S0002-9440(10)65034-1 (2000).
Article CAS PubMed PubMed Central Google Scholar
Georgiev, P. et al. Characterization of time-related changes after experimental bile duct ligation. Br. J. Surg. 95, 646–656. https://doi.org/10.1002/bjs.6050 (2008).
Article CAS PubMed Google Scholar
Gabele, E. et al. TNFalpha is required for cholestasis-induced liver fibrosis in the mouse. Biochem. Biophys. Res. Commun. 378, 348–353. https://doi.org/10.1016/j.bbrc.2008.10.155 (2009).
Article CAS PubMed Google Scholar
Yi, H. S. et al. Treatment with 4-methylpyrazole modulated stellate cells and natural killer cells and ameliorated liver fibrosis in mice. PLoS ONE 10, e0127946. https://doi.org/10.1371/journal.pone.0127946 (2015).
Article CAS PubMed PubMed Central Google Scholar
Yoshioka, H. et al. Vitamin D3-induced hypercalcemia increases carbon tetrachloride-induced hepatotoxicity through elevated oxidative stress in mice. PLoS ONE 12, e0176524. https://doi.org/10.1371/journal.pone.0176524 (2017).
Article CAS PubMed PubMed Central Google Scholar
Munafo, M. R. & Davey Smith, G. Robust research needs many lines of evidence. Nature 553, 399–401. https://doi.org/10.1038/d41586-018-01023-3 (2018).
Article ADS CAS PubMed Google Scholar
Habib, A. et al. Inhibition of monoacylglycerol lipase, an anti-inflammatory and antifibrogenic strategy in the liver. Gut https://doi.org/10.1136/gutjnl-2018-316137 (2018).
Article PubMed Google Scholar
Zhang, K. et al. The liver-enriched lnc-LFAR1 promotes liver fibrosis by activating TGFbeta and Notch pathways. Nat. Commun. 8, 144. https://doi.org/10.1038/s41467-017-00204-4 (2017).
Article ADS CAS PubMed PubMed Central Google Scholar
Yanguas, S. C. et al. Experimental models of liver fibrosis. Arch. Toxicol. 90, 1025–1048. https://doi.org/10.1007/s00204-015-1543-4 (2016).
Article CAS PubMed Google Scholar
Liedtke, C. et al. Experimental liver fibrosis research: Update on animal models, legal issues and translational aspects. Fibrogen. Tissue Repair 6, 19. https://doi.org/10.1186/1755-1536-6-19 (2013).
Article Google Scholar
Manibusan, M. K., Odin, M. & Eastmond, D. A. Postulated carbon tetrachloride mode of action: A review. J. Environ. Sci. Health C Environ. Carcinog. Ecotoxicol. Rev. 25, 185–209. https://doi.org/10.1080/10590500701569398 (2007).
Article CAS PubMed Google Scholar
Secklehner, J. & Richardson, C. A. The reporting of animal welfare details in liver research: A review of studies describing bile duct ligation in mice (2011–2013). J. Hepatol. 62, 250–251. https://doi.org/10.1016/j.jhep.2014.09.029 (2015).
Article PubMed Google Scholar
Liles, J. H. & Flecknell, P. A. The influence of buprenorphine or bupivacaine on the post-operative effects of laparotomy and bile-duct ligation in rats. Lab. Anim. 27, 374–380. https://doi.org/10.1258/002367793780745552 (1993).
Article CAS PubMed Google Scholar
Jirkof, P. Side effects of pain and analgesia in animal experimentation. Lab. Anim. (NY) 46, 123–128. https://doi.org/10.1038/laban.1216 (2017).
Article Google Scholar
Jirkof, P. et al. Administration of tramadol or buprenorphine via the drinking water for post-operative analgesia in a mouse-osteotomy model. Sci. Rep. 9, 10749. https://doi.org/10.1038/s41598-019-47186-5 (2019).
Article ADS CAS PubMed PubMed Central Google Scholar
Ovadia, C. et al. Association of adverse perinatal outcomes of intrahepatic cholestasis of pregnancy with biochemical markers: Results of aggregate and individual patient data meta-analyses. Lancet 393, 899–909. https://doi.org/10.1016/S0140-6736(18)31877-4 (2019).
Article PubMed PubMed Central Google Scholar
Molinaro, A. M. Diagnostic tests: How to estimate the positive predictive value. Neurooncol. Pract. 2, 162–166. https://doi.org/10.1093/nop/npv030 (2015).
Article PubMed PubMed Central Google Scholar
Moller, C. et al. Toward evidence-based severity assessment in rat models with repeated seizures: I. Electrical kindling. Epilepsia 59, 765–777. https://doi.org/10.1111/epi.14028 (2018).
Article CAS PubMed Google Scholar
Talbot, S. R. et al. One score to rule them all: severity assessment in laboratory mice. bioRxiv https://doi.org/10.1101/2020.06.23.166801 (2020).
Article PubMed PubMed Central Google Scholar
Abshagen, K. et al. Pathobiochemical signatures of cholestatic liver disease in bile duct ligated mice. BMC Syst. Biol. 9, 83. https://doi.org/10.1186/s12918-015-0229-0 (2015).
Article CAS PubMed PubMed Central Google Scholar
Deacon, R. Assessing burrowing, nest construction, and hoarding in mice. J. Vis. Exp. https://doi.org/10.3791/2607 (2012).
Article PubMed PubMed Central Google Scholar
Kumstel, S. et al. Grading distress of different animal models for gastrointestinal diseases based on plasma corticosterone kinetics. Animals (Basel) https://doi.org/10.3390/ani9040145 (2019).
Article Google Scholar
Bewick, V., Cheek, L. & Ball, J. Statistics review 13: Receiver operating characteristic curves. Crit. Care 8, 508–512. https://doi.org/10.1186/cc3000 (2004).
Article PubMed PubMed Central Google Scholar
R Core Team. The R project for statistical computing. https://www.R-project.org/. (2019).
Kuhn, M. Caret: Classification and regression training. R package version 6.0–84. https://CRAN.R-project.org/package=caret. (2019).
David, M., Evgenia, D., Kurt, H., Andreas, W. & Leisch., F. e1071: Misc functions of the department of statistics, probability theory group (Formerly: E1071), TU Wien. R package version 1.7–2. https://CRAN.R-project.org/package=e1071. (2019).

Download references

Acknowledgements

Guanglin Tang and Xianbin Zhang were supported by the China Scholarship Council (Grant number: 201808080167 and 201608080159). The study was supported by the Deutsche Forschungsgemeinschaft (DFG research group for 2591, Grant number: 321137804, ZE 712/1-1, and VO 450/15-1).

Funding

Open Access funding enabled and organized by Projekt DEAL.

Author information

These authors jointly supervised this work: Steven R. Talbot, Xianbin Zhang and Dietmar Zechner.

Authors and Affiliations

Rudolf-Zenker, Institute of Experimental Surgery, Rostock University Medical Center, Rostock, Germany
Guanglin Tang, Nico Seume, Simone Kumstel, Kerstin Abshagen, Brigitte Vollmar, Xianbin Zhang & Dietmar Zechner
Institute for Laboratory Animal Science, Hannover Medical School, Hanover, Germany
Christine Häger, André Bleich & Steven R. Talbot

Authors

Guanglin Tang
View author publications
You can also search for this author in PubMed Google Scholar
Nico Seume
View author publications
You can also search for this author in PubMed Google Scholar
Christine Häger
View author publications
You can also search for this author in PubMed Google Scholar
Simone Kumstel
View author publications
You can also search for this author in PubMed Google Scholar
Kerstin Abshagen
View author publications
You can also search for this author in PubMed Google Scholar
André Bleich
View author publications
You can also search for this author in PubMed Google Scholar
Brigitte Vollmar
View author publications
You can also search for this author in PubMed Google Scholar
Steven R. Talbot
View author publications
You can also search for this author in PubMed Google Scholar
Xianbin Zhang
View author publications
You can also search for this author in PubMed Google Scholar
Dietmar Zechner
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Study design: D.Z., B.V. and K.A.; Data collection and analysis or interpretation: all authors; Drafting manuscript: G.T., S.R.T., X.Z., and D.Z.; Revising manuscript: all authors. Approved final version of manuscript: all authors. G.T, S.R.T., X.Z. and D.Z. are responsible for the integrity of the manuscript.

Corresponding authors

Correspondence to Steven R. Talbot, Xianbin Zhang or Dietmar Zechner.

Ethics declarations

Competing interests

The authors declare no competing interests.

Additional information

Publisher's note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Supplementary information

Supplementary Information

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons licence, and indicate if changes were made. The images or other third party material in this article are included in the article's Creative Commons licence, unless indicated otherwise in a credit line to the material. If material is not included in the article's Creative Commons licence and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this licence, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Tang, G., Seume, N., Häger, C. et al. Comparing distress of mouse models for liver damage. Sci Rep 10, 19814 (2020). https://doi.org/10.1038/s41598-020-76391-w

Download citation

Received: 12 November 2019
Accepted: 01 October 2020
Published: 13 November 2020
DOI: https://doi.org/10.1038/s41598-020-76391-w

This article is cited by

Continuous monitoring of physiological data using the patient vital status fusion score in septic critical care patients
- Philipp L. S. Ohland
- Thomas Jack
- Steven R. Talbot
Scientific Reports (2024)
Liver fibrosis pathologies and potentials of RNA based therapeutics modalities
- Rimpy Diwan
- Samantha Lynn Gaytan
- Md Nurunnabi
Drug Delivery and Translational Research (2024)
Combating lead and cadmium exposure with an orally administered chitosan-based chelating polymer
- Jordyn Ann Howard
- Halyna Kuznietsova
- Olivier Tillement
Scientific Reports (2023)
Robustness of a multivariate composite score when evaluating distress of animal models for gastrointestinal diseases
- Steven R. Talbot
- Simone Kumstel
- Dietmar Zechner
Scientific Reports (2023)
Development of behavioral patterns in young C57BL/6J mice: a home cage-based study
- Maria Reiber
- Ines Koska
- Heidrun Potschka
Scientific Reports (2022)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.