Abstract
In sub-Saharan Africa, simple biomarkers of liver fibrosis are needed to scale-up hepatitis B treatment. We conducted an individual participant data meta-analysis of 3,548 chronic hepatitis B patients living in eight sub-Saharan African countries to assess the World Health Organization-recommended aspartate aminotransferase-to-platelet ratio index and two other fibrosis biomarkers using a Bayesian bivariate model. Transient elastography was used as a reference test with liver stiffness measurement thresholds at 7.9 and 12.2kPa indicating significant fibrosis and cirrhosis, respectively. At the World Health Organization-recommended cirrhosis threshold (>2.0), aspartate aminotransferase-to-platelet ratio index had sensitivity (95% credible interval) of only 16.5% (12.5–20.5). We identified an optimised aspartate aminotransferase-to-platelet ratio index rule-in threshold (>0.65) for liver stiffness measurement >12.2kPa with sensitivity and specificity of 56.2% (50.5–62.2) and 90.0% (89.0–91.0), and an optimised rule-out threshold (<0.36) with sensitivity and specificity of 80.6% (76.1–85.1) and 64.3% (62.8–65.8). Here we show that the World Health Organization-recommended aspartate aminotransferase-to-platelet ratio index threshold is inappropriately high in sub-Saharan Africa; improved rule-in and rule-out thresholds can optimise treatment recommendations in this setting.
Introduction
Worldwide, an estimated 316 million people live with chronic hepatitis B virus infection (CHB)1. The natural course of infection is variable, ranging from an inactive carrier with an excellent long-term prognosis, to progressive hepatic necroinflammation leading to cirrhosis and/or hepatocellular carcinoma (HCC)2. Antiviral therapy effectively reduces the risk of these complications3,4, and the challenge in clinical practice is to identify patients at risk of progressive liver disease who should start timely antiviral therapy.
International treatment guidelines recommend antiviral therapy for patients with: cirrhosis; those with elevated hepatitis B virus (HBV) viral load and significant liver fibrosis; or those with high viraemia and inflammation5,6,7,8. Cirrhosis can be diagnosed clinically mostly at an advanced, decompensated phase. Earlier stages of liver fibrosis have traditionally been assessed by liver biopsy, which recently has been largely replaced by transient elastography (TE)9. In resource-limited settings, however, these fibrosis assessment tools are rarely available, and antiviral treatment is therefore often delayed until the patients have developed symptoms of advanced chronic liver disease (CLD).
The first World Health Organization (WHO)’s guidelines for CHB published in 2015 recommended the use of non-invasive fibrosis markers based on low-cost routinely available laboratory tests in resource-limited settings, and this was adapted into national guidelines in many low- and middle-income countries. Specifically, the aspartate aminotransferase-to-platelet ratio index (APRI) at a threshold of 2.0 was recommended to identify patients with cirrhosis, although the lack of data from sub-Saharan Africa (sSA) was acknowledged as an important knowledge gap8. APRI, FIB-4 and most other liver fibrosis markers were developed and validated in Caucasian and Asian cohorts10,11, where environmental exposures, endemic infections, and host/viral genetic factors differ from sSA.
With an estimated 82 million people living with CHB, sSA is in desperate need of locally-adapted treatment guidelines for CHB that consider operational constraints and potential differences in the natural history of infection in the region12. To meet this demand, we conducted a systematic review and individual patient data (IPD) meta-analysis to evaluate the performance of APRI, FIB-4, and gamma-glutamyl transferase-to-platelet ratio (GPR), a simple liver fibrosis biomarker developed in West Africa13, for the diagnosis of significant fibrosis and cirrhosis in CHB patients living in sSA. Although liver biopsy is considered the gold standard to diagnose these conditions, we used TE as a reference test in this analysis since liver biopsy is rarely performed in sSA and TE is now widely recognised as a reliable alternative to liver histology. TE has been recommended as a decision tool for HBV treatment in international HBV guidelines, and liver stiffness measured by TE predicts the risk of hepatic decompensation, incident HCC, oesophageal varices, and mortality14. The use of TE has been well validated compared to liver histopathology in several studies of CHB patients in sSA13,15,16.
Results
Study population
Database searches identified 1478 articles following removal of duplicates. After screening of title and abstract, 90 potentially eligible articles were fully reviewed (Supplementary Fig. 1). Finally, 30 articles met our inclusion criteria, and all the authors of these articles agreed to share the IPD13,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43. We additionally included IPD data from yet unpublished cohorts from South Africa (Spearman & Sonderup, 2021) and Dakar, Senegal (Seydi & Wandeler, 2021), and from one cohort which has subsequently been published (Thiès, Senegal)44.
Overall, we obtained IPD of 3960 hepatitis B surface-antigen (HBsAg)-positive patients from 12 distinct cohorts in eight countries: Burkina Faso, Ethiopia, The Gambia, Malawi, Nigeria, Senegal, South Africa, and Zambia (Fig. 1). We excluded 412 ineligible participants (Fig. 2). Characteristics of individual cohorts are reported in Supplementary Table 1.
n the number of hepatitis B patients included from each site in the current analysis. Area of circles is proportionate to cohort size. Attribution to iStockphoto (https://www.istockphoto.com/vector/africa-map-gm517581017-49316640).
Assessment of study quality indicated an overall low risk of bias according to QUADAS-2 criteria (Supplementary Table 2). All studies systematically performed both index tests and reference tests, and >90% of study participants had blood tests and TE the same week. Two cohorts had specific criteria that could reduce applicability: one restricted to asymptomatic patients with HBV DNA > 3.2 log10 IU/ml16, and another excluded patients with body mass index (BMI) > 28 kg/m2 23. Three studies performed TE in a subset of the overall cohort with non-random selection: two due to equipment availability15,36, and one at clinician’s discretion without specifying criteria30. Two studies had significant loss to follow-up between community diagnosis and evaluation at clinic37,38.
Table 1 summarises characteristics of the 3548 study participants. Median age was 33 years (interquartile range [IQR] 28–41), 60% were male, and median BMI was 22.4 kg/m2 (IQR 20.0–25.5). In >80% of cases (2824/3497), the reason for HBsAg testing was asymptomatic screening, as part of population-based screening, antenatal care, blood donation, or because of a family contact with HBV, whereas 673 individuals (19%) were tested due to suspected CLD, viz symptoms or clinical signs, or abnormal liver enzymes. The prevalence of liver stiffness measurement (LSM) > 12.2 kPa (LSM > 12.2; associated with cirrhosis) was 7.3% overall: 2.5% among asymptomatic screening participants and 26.4% among patients with suspected CLD. LSM > 12.2 was more prevalent among men relative to women (9.9 vs. 3.4%; P < 0.001) and its prevalence increased with age (Supplementary Fig. 2). LSM > 7.9 kPa (LSM > 7.9; associated with significant liver fibrosis (≥F2)) was observed in 17.4% overall; 11.5% among asymptomatic screening participants and 40.7% among patients with suspected CLD (P < 0.001).
In a multivariable model, increased likelihood of LSM > 12.2 was observed with increasing age (per 1-year increment; Adjusted Odds Ratio [AOR] 1.03; 95% CI 1.01–1.04), male sex (AOR 3.28; 95% CI 2.17–4.96), and suspected CLD (vs. asymptomatic screening participants; AOR 55.3; 95% CI 27.9–109.3). Male sex and suspected CLD were also significantly associated with LSM > 7.9 (Supplementary Table 3).
Diagnostic performance of APRI, GPR and FIB-4
APRI and GPR had the best discriminant performance, with area under the receiver operating curve (AUROC) of 0.81 (95% credible interval (CrI) 0.81–0.82) and 0.82 (95% CrI 0.81–0.83) for LSM > 12.2 and 0.75 (95% CrI 0.74–0.75) and 0.76 (95% CrI 0.75–0.77) for LSM > 7.9, respectively (Supplementary Fig. 3). FIB-4 had relatively lower AUROC of 0.77 (95% CrI 0.76–0.78) for LSM > 12.2 and 0.68 (95% CrI 0.68–0.69) for LSM > 7.9. ALT as a standalone marker was associated with the lowest performance, both for LSM > 7.9 and LSM > 12.2 (Fig. 3). Performance was significantly better for LSM > 12.2 relative to LSM > 7.9 for all evaluated biomarkers (Fig. 3).
The diagnosis of A LSM > 12.2 kPa (associated with cirrhosis) and B LSM > 7.9 kPa (associated with significant fibrosis): Bayesian random effects modela. Point estimates are shown as red circles with 95% credible intervals shown as error bars. CrI credible interval, APRI aspartate aminotransferase-to-platelet ratio index, FIB-4 fibrosis-4 score, GPR gamma[1]glutamyl transferase-to-platelet ratio, ALT alanine aminotransferase, LSM liver stiffness measurement. aBayesian bivariate random effects model (n = 3548 biologically independent samples) adjusted for sex, study, hazardous alcohol consumption, reason for testing (suspected liver disease or asymptomatic screening) and categorical body mass index; summary statistics show population average. Youden’s J = maximisation of Youden’s J statistic, with equal weight to sensitivity and specificity (J = sensitivity + specificity − 1). Rule-in models were chosen where specificity exceeds 90%; rule-out models where sensitivity >80%.
By applying the WHO-recommended APRI thresholds (2.0 for cirrhosis and 1.5 for significant fibrosis), the sensitivity was 16.5% (95% CrI 12.5–20.5) for the diagnosis of LSM > 12.2 and 11.8% (95% CrI 9.4–14.2) for LSM > 7.9, whereas the specificity was 99.5% (95% CrI 99.2–99.7) for LSM > 12.2 and 99.2% (95% CrI 98.9–99.5) for LSM > 7.9.
We then developed rule-out thresholds aiming for a test sensitivity of ≥80% (Fig. 3). For APRI the optimised threshold to rule-out LSM > 12.2 was 0.36 with a sensitivity of 80.6% (95% CrI 76.1–85.1) and a specificity of 64.3% (95% CrI 62.8–65.8). For GPR the optimised rule-out threshold for LSM > 12.2 was 0.23 with a sensitivity of 80.6% (95% CrI 75.1–86.2) and a specificity of 66.7% (95% CrI 64.1–69.5).
The optimised rule-in thresholds for LSM > 12.2, where specificity exceeded 90%, were 0.65 for APRI with a sensitivity of 56.2% (95% CrI 50.5–62.2) and a specificity of 90.0% (95% CrI 89.0–91.0); and 0.47 for GPR with a sensitivity of 58.6% (95% CrI 50.7–66.3) and a specificity of 90.9% (95% CrI 88.7–92.9). We assessed the proposed cut-offs in each individual site (Supplementary Table 4). Significant inter-site heterogeneity was observed; however, due to a small number of patients with LSM > 12.2 at most sites, confidence intervals were wide at the level of individual centres.
As expected, predictive values were strongly associated with disease prevalence (Table 2). In asymptomatic screening populations, using the new rule-in thresholds for APRI (0.65) and GPR (0.47), positive predictive values for LSM > 12.2 were 17.0% for APRI and 20.1% for GPR, while negative predictive values were 98.4% and 98.5%, respectively. Among patients with suspected CLD, the positive predictive values using the same thresholds were 70.5% and 74.9%, whereas negative predictive values were 77.3% and 73.0%, respectively. The trade-off between optimising test sensitivity and specificity is illustrated in Fig. 4. Lower biomarker thresholds were associated with substantial over-diagnosis of LSM > 12.2 (Fig. 5).
Relationship among A all participants; B asymptomatic screening populations and C patients with suspected liver diseasea. Point estimates at different APRI thresholds are shown as blue (specificity) or orange (sensitivity) dots, connected by straight line segments with 95% credible intervals shown as transparent error bands. aBayesian bivariate random effects model fitted for different thresholds of APRI using 60 equally spaced quantiles. Specificity is defined as the probability of a negative test given the absence of LSM > 12.2; sensitivity is defined as the probability of a positive test, given the presence of LSM > 12.2. Source data are provided as a Source Data file.
Using A rule-in and rule-out thresholds, B the WHO recommended threshold and C Youden J derived thresholda. aA single grey dot represents one person with liver stiffness measurement (LSM) < 7.9 kPa, one orange dot represents a person with LSM > 7.9 kPa associated with significant fibrosis, and one red dot represents a person with LSM > 12.2 kPa associated with cirrhosis (F4). Individual proportions rounded to nearest whole individual and represent the effect of applying APRI thresholds to the average populations included in the cohorts, stratified by reason for hepatitis B testing. Source data are provided as a Source Data file.
The following patient level co-variates were significantly associated with model performance for the APRI and GPR rule-in thresholds: hazardous alcohol consumption, suspected CLD as the reason for HBsAg testing, and female sex (Supplementary Table 5). Hazardous alcohol consumption reduced test specificity for both biomarkers. Specificity was improved for APRI and GPR for women relative to men with no significant effect of sex on sensitivity. For the GPR model, being overweight reduced test specificity. The effect of subgroup characteristics on diagnostic sensitivity and specificity is illustrated in Supplementary Fig. 4.
We performed a sensitivity analysis using centre-specific upper limit of normal (ULN) thresholds; these ranged from 25 to 60 U/L for aspartate aminotransferase (AST) and 24 to 85 U/L for gamma-glutamyl transferase (GGT). This was associated with reduced diagnostic performance. We then considered the effect of using a lower LSM cut-off (9.5 kPa) for the diagnosis of suspected cirrhosis. At this threshold, relative to 12.2 kPa, for both APRI and GPR, test performance was poorer, with lower sensitivity and specificity at rule-in and rule-out thresholds (Supplementary Fig. 5).
In a sensitivity analysis, among a subset of 134 patients who underwent pre-therapy evaluation with a liver biopsy, we assessed the diagnostic characteristics of APRI and GPR using METAVIR histological fibrosis scores as a reference test (Supplementary Table 6). Consistent with the findings from the main analysis, the WHO-recommended threshold of 2.0 was associated with a sensitivity of 11.1% (95% CI 0.3–48.2) and specificity of 99.2 (95% CI 95.6–100) for the diagnosis of cirrhosis (F4). The derived rule-in threshold of 0.65 for APRI was associated with a sensitivity of 100% (95% CI 66.4–100) and a specificity of 73.6% (95% CI 65.0–81.1), whereas the rule-out threshold was associated with a sensitivity of 100% (95% CI 66.4–100) and a specificity of 36.0% (95% CI 27.6–45.1). Similar findings, consistent with the main analysis were observed for GPR, and for the diagnosis of significant fibrosis (F2), as shown in Supplementary Table 6.
In an exploratory analysis, we compared patients with LSM > 12.2 who had false negative APRI classification results with those correctly classified to identify factors associated with impaired test sensitivity. The lowest sensitivity for APRI was observed among patients with LSM just above the threshold (12.2 kPa) for cirrhosis, and diagnostic sensitivity increased with increasing LSM with the effect plateauing above 30 kPa (Supplementary Fig. 6).
Discussion
In this study including more than 3500 patients living with CHB in sSA, we evaluated the diagnostic performance of three inexpensive, simple liver fibrosis biomarkers (APRI, GPR and FIB-4). Our analysis included all identified existing published data from sSA, together with novel data, offering the most comprehensive evaluation of the performance of these diagnostic tools in sSA to date. We found that the sensitivity of APRI to diagnose LSM >12.2 kPa (associated with cirrhosis) at the WHO-recommended threshold of 2.0 was only 16.5%, consistent with previous reports from the region13,15,17. Importantly, we also identified new rule-in and rule-out thresholds suitable for sSA. The best test characteristics were observed for APRI and GPR for the detection of LSM > 12.2; an APRI rule-in threshold of 0.65 yielded 56.2% sensitivity and 90.0% specificity, whereas a rule-out threshold of 0.36 yielded 80.6% sensitivity and 64.3% specificity.
Our findings compare well with a recent large IPD analysis of mainly Asian CHB patients using liver biopsy as a reference standard45. The authors found that the conventional APRI and FIB-4 thresholds were unsuitable for patient management and identified new, lower rule-out cirrhosis thresholds for both APRI (0.45) and FIB-4 (0.70), in line with our findings. Of note, both our study and the study by Sonneveld and colleagues applied a fixed ULN for AST. In a sensitivity analysis, we observed a poorer performance of APRI when using assay-specific ULN. It is likely that the variable performance of APRI in previous studies could be, at least partly, explained by the differing definitions of ULN for AST. Moreover, previous meta-analyses have often been restricted to analysis of pre-defined biomarker thresholds whereas a strength of using IPD is that it can facilitate analysis across the continuum of biomarker thresholds. It should be noted that the original study that developed APRI and defined its thresholds was performed among active chronic hepatitis C patients in the USA10, much older than our HBV population, with higher liver transaminases levels, and mainly Caucasian (only 8% African Americans), a very different patient population compared to African CHB patients.
The new thresholds identified in our study for APRI and GPR were particularly suitable at ruling out disease (i.e. they had high negative predictive values). The ability to rule in disease (i.e. positive predictive value) was much poorer. This phenomenon reflects a common problem with screening tests, particularly in a community setting, resulting in significant overdiagnosis when the prevalence and pre-test probability of disease is low46. Indeed, the reference test used in this study, TE at specific LSM thresholds, may be associated with false-positive results if liver biopsy is considered the reference standard, and particularly in a low pre-test probability population such as asymptomatic screening. One might argue that some degree of overdiagnosis of cirrhosis is acceptable since many of those with a high APRI score (in the absence of cirrhosis) have F2/F3 fibrosis or active hepatic inflammation and need antiviral therapy to prevent progression to cirrhosis or HCC, as illustrated in Fig. 5. Overdiagnosis would lead to wider use of nucleoside analogues, which are now accessible at very low cost (<40 USD per year), are generally safe and well-tolerated, and offer the benefit of pre-exposure prophylaxis for HIV, of particular importance in African countries with generalised HIV epidemics47,48. On the other hand, adherence to life-long antiviral therapy might be challenging and excessive overtreatment could overstretch health budgets. We aim to perform a cost-effectiveness analysis using the present data to shed light on the balance between over-treatment and widening access to care. The optimal strategy is likely to vary in different settings depending on disease prevalence and available resources.
The intended use of these biomarkers should be kept in mind when interpreting our results. Importantly, liver fibrosis assessment is not the only criterion to initiate antiviral therapy. Clinical assessment, alanine aminotransferase (ALT), age, and family history of HCC are available in most settings, and hepatitis B e-antigen (HBeAg) and HBV DNA might be available at larger centres in low- and middle-income countries. Thus, a full patient assessment may perform better than an evaluation considering APRI or GPR alone, as presented in this study. In future work we aim to optimise treatment eligibility criteria for sSA, beyond the present question of liver fibrosis assessment.
Our analysis relies on the validity of TE as a reference standard. TE is a widely used diagnostic tool, well established in high-income countries where it has largely replaced liver biopsy in routine patient care over the past decade and has been incorporated into international HBV treatment guidelines as an accurate tool to guide treatment decisions for CHB patients5,6,7,8. It is worth noting that TE has prognostic properties for predicting liver-related events and death, independent of its association with liver biopsy14. Notably, the Baveno VII 2021 consensus recommends the use of liver stiffness to define advanced CLD, and to prognosticate risks of decompensation and liver-related death49. Numerous cross-sectional studies have compared TE with liver histology and found good agreement, particularly for cirrhosis. In a meta-analysis from 2016, Li et al. analysed data from 27 studies with a total of 4386 CHB patients and found an AUROC of 0.88 for F ≥ 2 and 0.93 for F450. In our own study, we performed a sensitivity analysis among a subset of patients who underwent liver biopsy and found that the performance characteristics of the WHO-recommended thresholds and our newly derived rule-in and rule-out thresholds were consistent with findings from the main analysis. Indeed, liver biopsy also has its limitations due to sampling error and inter-observer variability of histological specimens, and it has been shown that AUROC > 0.90 is unachievable even for a perfect surrogate marker51.
This study had some limitations. First, APRI and GPR displayed reduced specificity in patients with alcohol abuse. It is well-known that alcohol may cause elevations of both AST and GGT and reductions in platelet count; hence, fibrosis scores based on AST and GGT must be used with caution in patients with known alcohol misuse52. Second, although there was an overall low risk of bias associated with the studies, a small number had non-random selection criteria or specific exclusions that could limit applicability. Third, not all participants had TE and blood tests performed the same day; however, the majority (>90%) had both tests done within the same week. Finally, TE has been associated with technical limitations including unsuccessful measurements reported in patients with ascites, obesity, or narrow intercostal spaces. We did not ascertain rates of TE test failure, and this may affect the overall applicability of our findings to the entire population with HBV.
The study had significant strengths. This is the largest and most comprehensive analysis from sSA, the most endemic region for HBV worldwide, and we had good geographical representation from East, West, and Southern Africa. We therefore believe our results are generalisable. The bivariate random effects meta-analysis (BRMA) modelling framework we used has been widely advocated for, in diagnostic accuracy meta-analysis53,54. The BRMA framework accounts for both within study precision of estimates and heterogeneity between studies. The model allowed us to model the effect of patient-level covariates on diagnostic accuracy. Further, the models jointly model sensitivity and specificity, capturing the dependence between these two parameters. Finally, the Bayesian estimation framework we used allows a principled treatment of missing data.
In conclusion, APRI, at the current WHO-recommended threshold of 2.0, had poor sensitivity for the diagnosis of LSM >12.2 kPa in sSA. We developed new rule-in and rule-out thresholds for APRI and GPR, which had better discriminatory properties. The non-invasive biomarkers were better at ruling out than ruling in disease, and when applied as screening tools on a general population a certain degree of over-diagnosis must be assumed. Our data are important for informing clinical practice in sSA and should be considered in the next revision of the WHO hepatitis B guidelines.
Methods
Systematic review for relevant cohorts
We searched PubMed, Scopus, the African Index Medicus and African Journals Online for articles published until 12th July 2022 using a search strategy covering the synonyms of “hepatitis B” AND (“TE” OR “liver biopsy”) AND “sSA” (Supplementary Table 7). Two investigators independently screened all identified articles and reviewed potentially eligible full-text articles for eligibility without language restrictions.
We included studies reporting on HBsAg-positive adults or adolescents (≥13 years) living in sSA who had pre-therapy LSM using TE and measurement of ALT, AST, GGT and platelet count. TE and blood tests were usually performed within the same week but an interval of up to 3 months was accepted. We excluded studies on migrants of African origin. Two investigators (AS and NR) independently extracted the variables listed in Supplementary Table 8 and evaluated the risk of bias using the QUADAS-2 assessment tool with disagreement resolved by consensus55.
Individual patient data
We contacted authors of all publications which met our study inclusion criteria, as well as researchers active in this area, to share IPD. For the current analysis, we excluded patients who had received anti-HBV treatment within the preceding 6 months, or who had started therapy for at least 7 days prior to the time of evaluation, patients with co-infection with hepatitis C (anti-HCV) or hepatitis D (anti-HDV) or human immunodeficiency virus (HIV), pregnant women, patients with ALT or AST exceeding five times the ULN (in accordance with European Association for the Study of the Liver [EASL] reliability recommendations for TE)9, and those with suspected or confirmed HCC. Centre-specific definitions were used to categorise patients with hazardous alcohol consumption as described in Supplementary Table 1; the majority used the WHO AUDIT tool56.
Reference test
TE was performed (using the M probe) on patients who fasted for at least 2 h. The median of 10 readings was reported. The result was discarded if the IQR divided by the median exceeded 30% when the median LSM was ≥7.1 kPa57. We used TE thresholds of 7.9 and 12.2 kPa, associated with significant fibrosis (equivalent to METAVIR ≥ F2) and cirrhosis (F4) in cross-sectional studies, as the reference standard for assessment of the biomarkers13,50. In a sensitivity analysis, we explored an alternative LSM threshold associated with cirrhosis (9.5 kPa) derived from a study of 135 Gambian CHB patients who underwent liver biopsy13.
Index tests
We assessed the following biomarkers:
-
i.
APRI = [(AST (U/L)/ULN of AST) × 100]/platelet count (×109/L)10;
-
ii.
GPR = [GGT (U/L)/ULN of GGT × 100]/platelet count (×109/L)13;
-
iii.
FIB-4 = (age (years) × AST (U/L))/(platelet count (×109/L) × (ALT (U/L))1/2)11;
-
iv.
ALT (U/L), as a standalone marker.
An ULN of 40 U/L for AST and 61 U/L for GGT were used in the models.
Statistical analysis
First, we evaluated the WHO recommended rule-in thresholds for APRI (1.5 for ≥F2 and 2.0 for F4)8. We then optimised the rule-in thresholds by giving priority to specificity (≥90%), and the rule-out thresholds by giving priority to sensitivity (≥80%). Finally, we assessed thresholds defined by maximising Youden’s J, equivalent to maximising the sum of sensitivity and specificity without priority to any of the two.
To calculate sensitivity and specificity, data were pooled using a single-stage IPD meta-analysis approach. We used a bivariate Bayesian random-effects meta-analysis model for sensitivity and specificity using patient-level covariates with study-level random effects to account for anticipated variability between sites (full model details are presented in Supplementary Methods 1)54.
Validation
We validated the model using bootstrap resampling, as widely recommended over split-sample approaches for internal validation58. We obtained 500 bootstrap samples from the original dataset, then fitted the model and estimated rule-in, rule-out and Youden thresholds, sensitivity and specificity at each different thresholds, and overall AUROC. Validation results for the APRI model for cirrhosis are shown in Supplementary Methods 2. Model parameters show unimodal distributions with narrow spread indicating good stability of parameter estimates.
Sensitivity analyses
First, we assessed the effect of using assay-specific ULN for AST, as reported by each centre. Second, we assessed the effect of using a lower liver stiffness threshold associated with cirrhosis (9.5 kPa)13. Third, we assessed the performance of new optimised rule-in and rule-out thresholds on a subset of patients who underwent pre-therapy liver biopsy using the METAVIR histological fibrosis scores as a reference test.
To explore variables associated with LSM thresholds indicative of cirrhosis (LSM > 12.2) and significant fibrosis (LSM > 7.9), we used mixed effects logistic regression models with study site-specific random effects by including variables that were anticipated a priori to be clinically important: age, sex, BMI category, and reason for hepatitis B testing (asymptomatic vs. suspected CLD). To examine an association between APRI test sensitivity and LSM among patients with cirrhosis, we used the Wilcoxon rank sum test and plotted the distribution of LSM and APRI classification using a kernel density plot and a restricted cubic spline plot for LSM against test sensitivity. Analyses were conducted in R version 4.1.0 (R Foundation for Statistical Computing, Austria), JAGS 4.3.0, using the rjags package (v4.10), and Stata v17 (Statacorp, USA). All R and JAGS code is available from GitHub (https://github.com/gitMarcH/HEPSANET)59.
Ethical review
The study protocol was registered in PROSPERO (CRD42020218043) and was reported in accordance with the PRISMA-IPD guidelines60. Each participating centre obtained permission from local research ethical review committees.
Reporting summary
Further information on research design is available in the Nature Portfolio Reporting Summary linked to this article.
Data availability
We searched PubMed, Scopus, the African Index Medicus, and African Journals Online for relevant papers. The data that supports the findings of this study are available from the corresponding author upon reasonable request. The source data underlying Figures and Supplementary Figures are provided as a separate Source Data file. Source data are provided with this paper.
Code availability
All the codes supporting the results of the bioinformatics analysis are available in the Github according to the link: (https://github.com/gitMarcH/HEPSANET).
References
Global Burden of Disease Collaborative Network. Global, regional, and national burden of hepatitis B, 1990-2019: a systematic analysis for the Global Burden of Disease Study 2019. Lancet Gastroenterol. Hepatol. 7, 796–829 (2022).
Trepo, C., Chan, H. L. & Lok, A. Hepatitis B virus infection. Lancet 384, 2053–2063 (2014).
Kim, W. R. et al. Impact of long-term tenofovir disoproxil fumarate on incidence of hepatocellular carcinoma in patients with chronic hepatitis B. Cancer 121, 3631–3638 (2015).
Marcellin, P. et al. Regression of cirrhosis during treatment with tenofovir disoproxil fumarate for chronic hepatitis B: a 5-year open-label follow-up study. Lancet 381, 468–475 (2013).
European Association for the Study of the Liver. EASL 2017 Clinical Practice Guidelines on the management of hepatitis B virus infection. J. Hepatol. 67, 370–398 (2017).
Sarin, S. K. et al. Asian-Pacific clinical practice guidelines on the management of hepatitis B: a 2015 update. Hepatol. Int. 10, 1–98 (2016).
Terrault, N. A. et al. Update on prevention, diagnosis, and treatment of chronic hepatitis B: AASLD 2018 hepatitis B guidance. Hepatology 67, 1560–1599 (2018).
World Health Organisation. WHO Guidelines Approved by the Guidelines Review Committee (World Health Organization, 2015).
European Association for Study of Liver and Latin American Association for the Study of the Liver. EASL-ALEH Clinical Practice Guidelines: Non-invasive tests for evaluation of liver disease severity and prognosis. J. Hepatol. 63, 237–264 (2015).
Wai, C. T. et al. A simple noninvasive index can predict both significant fibrosis and cirrhosis in patients with chronic hepatitis C. Hepatology 38, 518–526 (2003).
Sterling, R. K. et al. Development of a simple noninvasive index to predict significant fibrosis in patients with HIV/HCV coinfection. Hepatology 43, 1317–1325 (2006).
Spearman, C. W. et al. Hepatitis B in sub-Saharan Africa: strategies to achieve the 2030 elimination targets. Lancet Gastroenterol. Hepatol. 2, 900–909 (2017).
Lemoine, M. et al. The gamma-glutamyl transpeptidase to platelet ratio (GPR) predicts significant liver fibrosis and cirrhosis in patients with chronic HBV infection in West Africa. Gut 65, 1369–1376 (2016).
Singh, S. et al. Liver stiffness is associated with risk of decompensation, liver cancer, and death in patients with chronic liver diseases: a systematic review and meta-analysis. Clin. Gastroenterol. Hepatol. 11, 1573–1584 (2013).
Bonnard, P. et al. Comparison of elastography, serum marker scores, and histology for the assessment of liver fibrosis in hepatitis B virus (HBV)-infected patients in Burkina Faso. Am. J. Trop. Med. Hyg. 82, 454–458 (2010).
Mbaye, P. S. et al. Liver stiffness measurement and biochemical markers in Senegalese chronic hepatitis B patients with normal ALT and high viral load. PLoS ONE 6, e22291 (2011).
Desalegn, H., Aberra, H., Berhe, N., Gundersen, S. G. & Johannessen, A. Are non-invasive fibrosis markers for chronic hepatitis B reliable in sub-Saharan Africa. Liver Int. 37, 1461–1467 (2017).
Aberra, H. et al. Early experiences from one of the first treatment programs for chronic hepatitis B in sub-Saharan Africa. BMC Infect. Dis. 17, 438 (2017).
Aberra, H. et al. The WHO guidelines for chronic hepatitis B fail to detect half of the patients in need of treatment in Ethiopia. J. Hepatol. 70, 1065–1071 (2019).
Cohen, D. et al. Hepatitis B virus preS2Δ38-55 variants: a newly identified risk factor for hepatocellular carcinoma. JHEP Rep. 2, 100144 (2020).
Desalegn, H. et al. Predictors of mortality in patients under treatment for chronic hepatitis B in Ethiopia: a prospective cohort study. BMC Gastroenterol. 19, 74 (2019).
Desalegn, H. et al. Treatment of chronic hepatitis B in sub-Saharan Africa: 1-year results of a pilot program in Ethiopia. BMC Med. 16, 234 (2018).
Ameh, O. A. et al. Use of fibroscan in assessment of hepatic fibrosis in patients with chronic hepatitis B infection. Jos J. Med. 12, 9–14 (2018).
Diallo, I. et al. Evaluation of fibrosis in chronic hepatitis B: comparison between liver biopsy and elastography (Fibroscanï¿1/2). J. Africain d’Hepato-Gastroenterologie 10, 132–137 (2016).
Diallo, I. et al. Inactive hepatitis B carriers: outcomes of patients followed at Hopital Principal de Dakar, Senegal. Pan Afr. Med. J. 31, 49 (2018).
Johannessen, A., Aberra, H., Desalegn, H., Gordien, E. & Berhe, N. A novel score to select patients for treatment in chronic hepatitis B: Results from a large Ethiopian cohort. J. Hepatol. 71, 840–841 (2019).
Lemoine, M. et al. Food intake increases liver stiffness measurements and hampers reliable values in patients with chronic hepatitis B and healthy controls: the PROLIFICA experience in The Gambia. Aliment. Pharmacol. Ther. 39, 188–196 (2014).
Lemoine, M. et al. Acceptability and feasibility of a screen-and-treat programme for hepatitis B virus infection in The Gambia: the Prevention of Liver Fibrosis and Cancer in Africa (PROLIFICA) study. Lancet Glob. Health 4, e559–e567 (2016).
Maponga, T. G. et al. HBV and HIV viral load but not microbial translocation or immune activation are associated with liver fibrosis among patients in South Africa. BMC Infect. Dis. 18, 214 (2018).
Maponga, T. G. et al. Treatment advantage in HBV/HIV coinfection compared to HBV monoinfection in a South African cohort. J. Infect. 81, 121–130 (2020).
Nsokolo, B. et al. Stage of disease in hepatitis B virus infection in Zambian adults is associated with large cell change but not well defined using classic biomarkers. Trans. R. Soc. Trop. Med. Hyg. 111, 425–432 (2017).
Shimakawa, Y., Boucheron, P., Luong Nguyen, L. B., Lemoine, M. & Sombié, R. Performance of two simplified HBV treatment criteria (TREAT-B score and WHO guidelines) in Burkina Faso, West Africa. J. Hepatol. 71, 842–844 (2019).
Shimakawa, Y. et al. Natural history of chronic HBV infection in West Africa: a longitudinal population-based study from The Gambia. Gut 65, 2007–2016 (2016).
Shimakawa, Y. et al. Hepatitis B Core-related Antigen: An Alternative to Hepatitis B Virus DNA to Assess Treatment Eligibility in Africa. Clin. Infect. Dis. 70, 1442–1452 (2020).
Shimakawa, Y. et al. Development of a simple score based on HBeAg and ALT for selecting patients for HBV treatment in Africa. J. Hepatol. 69, 776–784 (2018).
Vinikoor, M. J. et al. Chronic hepatitis B virus monoinfection at a university hospital in Zambia. World J. Hepatol. 10, 622–628 (2018).
Vinikoor, M. J. et al. Eligibility for hepatitis B antiviral therapy among adults in the general population in Zambia. PLoS ONE 15, e0227041 (2020).
Stockdale, A. J. et al. Hepatitis B Vaccination Impact and the Unmet Need for Antiviral Treatment in Blantyre, Malawi. J. Infect. Dis. 226, 871–880 (2021).
Stockdale, A. J. et al. Diagnostic performance evaluation of hepatitis B e antigen rapid diagnostic tests in Malawi. BMC Infect. Dis. 21, 487 (2021).
Johannessen, A. et al. Mother-to-Child Transmission of Hepatitis B Virus in Ethiopia. Vaccines (Basel) 9, https://doi.org/10.3390/vaccines9050430 (2021).
Riebensahm, C. et al. Screening for hepatocellular carcinoma among adults with HIV/HBV co-infection in Zambia: a pilot study. Int. J. Infect. Dis. 116, 391–396 (2022).
Somé, E. N. et al. Epidemiological and Clinical Characteristics of Inactive Carriers of Hepatitis B Virus: A Descriptive Cross-Sectional Study in Burkina Faso. Infect. Epidemiol. Microbiol. 7, 237–244 (2021).
Somé, E. N. et al. Chronic viral HBeAg-negative hepatitis B: Epidemiological, clinical and biochemical characteristics in an outpatient descriptive cohort in Burkina Faso. Clin. Epidemiol. Global Health 11, https://doi.org/10.1016/j.cegh.2021.100799 (2021).
Sow, A. et al. HBV continuum of care using community- and hospital-based screening interventions in Senegal: Results from the PROLIFICA programme. JHEP Rep. 4, 100533 (2022).
Sonneveld, M. J. et al. Optimisation of the use of APRI and FIB-4 to rule out cirrhosis in patients with chronic hepatitis B: results from the SONIC-B study. Lancet Gastroenterol. Hepatol. 4, 538–544 (2019).
Manski, C. F. Bounding the Accuracy of Diagnostic Tests, With Application to COVID-19 Antibody Tests. Epidemiology 32, 162–167 (2021).
Marcellin, P. et al. Ten-year efficacy and safety of tenofovir disoproxil fumarate treatment for chronic hepatitis B virus infection. Liver Int. 39, 1868–1875 (2019).
Fonner, V. A. et al. Effectiveness and safety of oral HIV preexposure prophylaxis for all populations. AIDS 30, 1973–1983 (2016).
de Franchis, R., Bosch, J., Garcia-Tsao, G., Reiberger, T. & Ripoll, C. Baveno VII - Renewing consensus in portal hypertension. J. Hepatol. 76, 959–974 (2022).
Li, Y. et al. Systematic review with meta-analysis: the diagnostic accuracy of transient elastography for the staging of liver fibrosis in patients with chronic hepatitis B. Aliment. Pharmacol. Ther. 43, 458–469 (2016).
Mehta, S. H., Lau, B., Afdhal, N. H. & Thomas, D. L. Exceeding the limits of liver histology markers. J. Hepatol. 50, 36–41 (2009).
Gelsi, E. et al. Effect of detoxification on liver stiffness assessed by Fibroscan® in alcoholic patients. Alcohol. Clin. Exp. Res. 35, 566–570 (2011).
Reitsma, J. B. et al. Bivariate analysis of sensitivity and specificity produces informative summary measures in diagnostic reviews. J. Clin. Epidemiol. 58, 982–990 (2005).
Riley, R. D., Dodd, S. R., Craig, J. V., Thompson, J. R. & Williamson, P. R. Meta-analysis of diagnostic test studies using individual patient data and aggregate data. Stat. Med. 27, 6111–6136 (2008).
Whiting, P. F. et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann. Intern. Med. 155, 529–536 (2011).
Saunders, J. B., Aasland, O. G., Babor, T. F., de la Fuente, J. R. & Grant, M. Development of the Alcohol Use Disorders Identification Test (AUDIT): WHO Collaborative Project on Early Detection of Persons with Harmful Alcohol Consumption–II. Addiction 88, 791–804 (1993).
Boursier, J. et al. Determination of reliability criteria for liver stiffness evaluation by transient elastography. Hepatology 57, 1182–1191 (2013).
Steyerberg, E. W. & Harrell, F. E. Jr Prediction models need appropriate internal, internal-external, and external validation. J. Clin. Epidemiol. 69, 245–247 (2016).
Johannessen, A. et al. Non-invasive fibrosis markers for chronic hepatitis B in sub-Saharan Africa: systematic review and individual-patient-data meta-analysis. Zenodo, https://doi.org/10.5281/zenodo.7406523, 2022.
Stewart, L. A. et al. Preferred Reporting Items for Systematic Review and Meta-Analyses of individual participant data: the PRISMA-IPD Statement. JAMA 313, 1657–1665 (2015).
Acknowledgements
We would like to thank the staff at all participating centres who contributed to patient follow-and data collection. The research groups of M.L. (PROLIFICA) in The Gambia and Senegal are supported by the European Commission (FP7-HEALTH, grant ID: 265994), the Medical Research Council (MRC) in the UK (grant ID: MR/R011117/1) and Gilead Sciences (Inc., Foster City, CA, USA). A.J.S. was funded by the National Institute for Health Research (UK) with an Academic Clinical Lectureship at the University of Liverpool, UK. The statistical work was undertaken on Barkla, part of the High Performance Computing facilities at the University of Liverpool, UK. The authors assume full responsibility for analyses and interpretation of these data.
Author information
Authors and Affiliations
Contributions
A.J., A.J.S., H.D., M.S., C.W.S., M.V., Y.S. and M.L. conceptualised the study. A.J., A.J.S., E.O., M.Se, G.W., M.So, C.W.S., M.V., E.S., H.D., F.F., N.R., P.D., M.D., T.M., J.T., P.C.M., M.A., S.M., R.S., Y.S. and M.L. collected data. A.J.S. and M.Y.R.H. did the data analysis. A.J., A.J.S. and M.Y.R.H. drafted the first paper. All authors reviewed the paper and approved the final version. A.J., A.J.S. and M.Y.R.H. had access to and verified the data. A.J., A.J.S., M.Y.R.H., Y.S. and M.L. had final responsibility for the decision to submit for publication.
Corresponding author
Ethics declarations
Competing interests
The authors declare the following competing interests: M.L. and Y.S. have received consultancy fees and research funding from Gilead Sciences, USA. The other authors declare no competing interests.
Peer review
Peer review information
Nature Communications thanks the other anonymous reviewer(s) for their contribution to the peer review of this work. Peer review reports are available.
Additional information
Publisher’s note Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Source data
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Johannessen, A., Stockdale, A.J., Henrion, M.Y.R. et al. Systematic review and individual-patient-data meta-analysis of non-invasive fibrosis markers for chronic hepatitis B in Africa. Nat Commun 14, 45 (2023). https://doi.org/10.1038/s41467-022-35729-w
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41467-022-35729-w
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.