Introduction

Definitive diagnosis of tuberculosis (TB) rests on detection of M. tuberculosis from clinical specimens, but is difficult in children. While sputum smear microscopy underpins the WHO DOTS and Stop TB strategies for TB diagnosis in adults1, obtaining sputum is more difficult in children, and the yield of microscopy alone is very poor due to small numbers of bacilli in clinical specimens. Bacteriological diagnosis therefore depends on mycobacterial culture2.

Strengthening laboratory capacity is a key component of the Stop TB Strategy and Global Plan to Stop TB1,3. Commercial liquid culture methods like the BACTEC Mycobacteria Growth Indicator Tube (MGIT) system (BD Diagnostics, Sparks, MD, USA) offer higher sensitivity and more rapid results than traditional solid media. However, barriers to their uptake include cost, a lack of technically trained personnel, and the need for biosafety level 3 facilities3.

Microcolony culture techniques such as the Microscopic Observation Drug Susceptibility (MODS) assay have several potential advantages over conventional culture methods in low resource settings, including high sensitivity; more rapid results that include drug susceptibility testing (DST); lower cost; and less stringent biosafety requirements making it applicable in biosafety level 2 facilities4. However only limited data exist on their use in children5,6,7, and no studies have compared MODS with MGIT for diagnosis of TB in young children; nor evaluated MODS yield from induced sputum samples, despite good evidence and international recommendations supporting sputum induction for childhood TB diagnosis2,8.

The Xpert MTB/RIF real-time PCR assay (Cepheid, Sunnyvale, CA, USA) offers a rapid and operationally simpler alternative to culture for detection of M. tuberculosis and rifampicin resistance9,10. A recent meta-analysis suggests modest sensitivity and high specificity for paediatric TB diagnosis11, and the latest WHO childhood TB guidelines (2014) recommend Xpert MTB/RIF as an alternative to conventional microscopy and culture - but acknowledge the ā€œvery low quality of evidenceā€2.

We compared microscopy, MGIT and MODS culture, and the Xpert MTB/RIF assay for diagnosis of childhood TB in Kenya.

Participants

We prospectively assessed each diagnostic method among children presenting to Coast Provincial General Hospital and Kilifi County Hospital in Coast Province, Kenya. Between July 2010 and December 2011 children aged <15 years were investigated for TB if one or more of the following features of suspected TB were present and they were not already on TB treatment: unexplained persistent cough for >2 weeks; pneumonia not responding to first line antibiotics; unexplained fever for >2 weeks; unexplained progressive weight loss or failure to thrive for >4 weeks; a history of close contact with a suspected or confirmed case of pulmonary TB; or clinical suspicion of TB for any other reason.

Clinical procedures

Each child underwent a structured history and examination, chest radiography and tuberculin skin testing (TST) according to WHO guidelines2. Severe malnutrition was defined as weight for age z-score of <āˆ’3 or the presence of nutritional oedema12. Provider initiated testing and counseling for HIV was performed according to Kenyan national guidelines, which recommend testing for all inpatients, and for all patients investigated for TB, on an opt-out basis.

Children who were able to expectorate provided up to three spontaneous sputum samples. Sputum induction was performed on the remainder. If sputum induction was contraindicated (e.g. due to severe respiratory distress), gastric aspiration was performed. Sputum induction and gastric aspiration were performed according to international recommendations2. Further investigations including extra-pulmonary or repeat sputum sampling were performed at the discretion of the clinical team caring for the patient.

We classified children as Confirmed TB, Highly Probable TB, Possible TB or Not TB according to their clinical, radiological and microbiological findings. Categories were defined a priori and based closely on published definitions (Fig.Ā 1)13,14. Treatment protocols followed Kenyan national guidelines. Children treated for TB were followed up until completion of treatment at 6 months. Other children were also followed up for 6 months, or until TB could be confidently excluded. Final diagnostic assignments were revised in the light of follow up data.

Figure 1
figure 1

Childhood TB Case Definitions.

Laboratory procedures

Specimens were transported to the laboratory at 2ā€“8ā€‰Ā°C and processed the same day. Sputum specimens were decontaminated using the modified Petroffā€™s method and 4% sodium hydroxide15; centrifuged; and the sediment re-suspended in 2ā€‰ml culture broth containing Middlebrook 7H9 medium, oxalic acid, albumin, dextrose, and catalase (OADC, Becton Dickinson), and polymyxin, amphotericin B, nalidixic acid, trimethoprim, and azlocillin (PANTA, Becton Dickinson)16. After vigorous vortexing to ensure sample homogenization, two drops were used to make a smear for Ziehl Neelsen staining, and the remainder divided equally for MGIT, MODS, and Xpert MTB/RIF. Samples for Xpert MTB/RIF analysis were promptly frozen at āˆ’80ā€‰Ā°C for analysis at the end of the study.

Liquid mycobacterial culture was performed using the MGIT 960 system and MODS assay. A laboratory technician (RM) first received one month MODS training at the Universidad Peruana Cayetano Heredia in Lima, Peru. Standard protocols were followed15,16, except that MODS plates were examined 3 times a week rather than daily for logistic reasons.

Positive cultures from both methods were identified as M. tuberculosis complex (MTBC) or non-MTBC by the BD MGIT TBc Identification test (BD Diagnostics, Sparks, MD, USA). They were then further speciated and probed for isoniazid and rifampicin resistance mutations by PCR, using the Hain GenotypeĀ® line probe assay platform (Hain Lifescience GmbH, Nehren, Germany). The Xpert MTB/RIF assay (version G4) was performed at the end of the study on specimens from all children treated for confirmed, highly probable or possible TB, and from children in whom TB had been excluded. Laboratory procedures were externally monitored through the UK NEQAS quality assurance scheme and annual Good Clinical Laboratory Practice audits (Qualogy, UK).

In order to compare the performance of MGIT and MODS, and to prevent observer bias in the interpretation of either result, laboratory technologists were blinded to the identity of the MODS portion of each specimen. Specimens were instead identified by an electronically generated random numeric code, the key to which was held by the Principal Investigator.

Statistical analysis

Data were analyzed at both the patient and the specimen level. We first performed a per patient analysis to compare the sensitivity of smear microscopy, MGIT, MODS and Xpert MTB/RIF for identification of culture confirmed pulmonary TB cases. We then included all specimens (pulmonary and extra-pulmonary) in a per specimen analysis to calculate the sensitivity of each method against the existing reference standard of a positive M. tuberculosis culture (by MGIT and/or MODS).

To explore the specificity of Xpert MTB/RIF we calculated the proportion of specimens from Not TB cases that were Xpert MTB/RIF positive. Having established the very high specificity of the Xpert MTB/RIF assay, we then repeated the per specimen analysis using a composite reference standard incorporating Xpert MTB/RIF, such that a specimen was considered positive if either culture or Xpert MTB/RIF identified M. tuberculosis.

We used McNemarā€™s Ļ‡2 test to compare proportions between tests on paired aliquots of the same sample. To investigate any learning effect as laboratory staff became more experienced with MODS we performed the Ļ‡2 test for trend in the proportion of M. tuberculosis culture positive specimens identified by MODS in each quarter, and the Wilcoxon rank sum test to compare MODS time to detection (TTD) in the first and second years of the study. We used the Wilcoxon signed rank test to compare TTD of MTBC by MODS and MGIT. Associations with a positive Xpert MTB/RIF assay were explored using univariable and multivariable logistic regression models adjusted for clustering at the patient level.

Ethical approval and informed consent

The study was approved by the Kenya National Ethics Committee. All study procedures were performed in accordance with relevant guidelines and regulations. A parent or guardian provided written informed consent.

Data availability

The datasets generated during and/or analysed during the current study are available from the corresponding author on reasonable request.

Results

We identified 1500 children with features of suspected TB, 1442 of whom were investigated for TB (Fig.Ā 2). Active TB was diagnosed in 212 (14.7%) children, including 54 (25.5%) with Culture-confirmed TB, 63 (29.7%) with Highly Probable TB, and 95 (44.8%) with Possible TB. Most cases (185, 87%) had pulmonary TB (PTB). Baseline demographic and clinical characteristics are summarized in TableĀ 1. In keeping with previous studies, the proportion of cases that were bacteriologically confirmed was higher among older children.

Figure 2
figure 2

Patient enrolment and disease assignments.

Table 1 Baseline characteristics of children with and without TB.

Per patient analysis of pulmonary TB cases

There were 51 cases of Culture-confirmed pulmonary TB. Smear microscopy, MGIT and MODS on the initial sputum specimen identified M. tuberculosis in 13 (25%), 40 (78%) and 33 (65%) cases, respectively (TableĀ 2). Sensitivity was highest for each method among smear positive samples; Xpert MTB/RIF identified 100% of smear-positive cases and 53% of smear-negative cases on the first sputum specimen (TableĀ 2).

Table 2 Sensitivity of each method for detection of M. tuberculosis in the initial sputum specimen from patients with culture confirmed pulmonary TB.

Second and third sputum specimens were obtained from 237 (16%) and 102 (7%) children, respectively. Eight (16%) culture confirmed PTB cases were culture negative from the initial sample but grew M. tuberculosis from a subsequent sputum sample. MGIT, MODS and Xpert MTB/RIF each demonstrated a modest incremental yield from additional specimens (TableĀ 3). None of the 8 initial culture-negative specimens from confirmed PTB cases were Xpert MTB/RIF positive. However Xpert MTB/RIF did detect M. tuberculosis in sputum from 5 additional children with culture negative pulmonary TB (3 with clinically Highly Probable TB and 2 who were treated for Possible TB).

Table 3 Cumulative sensitivity (incremental yield) of serial sputum specimens from patients with culture confirmed pulmonary TB.

Per specimen analysis of pulmonary and extra-pulmonary specimens combined

A total of 2085 specimens were obtained for mycobacterial culture. We were unable to do MODS on 182 samples due to holiday staffing shortages (169) or insufficient sample (13). A total of 1903 specimens were therefore included in the analysis, of which 1802 (95%) were sputum specimens, 33 (2%) were smear positive, and 97 (5%) were M. tuberculosis culture positive (Supplementary Appendix, TableĀ S1). MGIT was more sensitive than MODS overall (93.8% vs 76.3%, pā€‰=ā€‰0.002), among all sputum specimens, among induced sputum specimens, and among specimens from both younger and older children (TableĀ 4).

Table 4 Comparison of MGIT and MODS culture sensitivity against culture reference standard incorporating both MGIT and MODS.

There was weak evidence of an increase in MODS sensitivity (Ļ‡2 test for trend pā€‰=ā€‰0.038) and a decrease in TTD over time, with a median TTD of 16 (IQR 7 to 21) days in 2010 compared with 10 (7 to 14) days in 2011 (pā€‰=ā€‰0.052; Supplementary Appendix, Fig.Ā S1). Overall, TTD was slightly shorter for MODS than MGIT (11 [IQR 6 to 15] days versus 12 [7 to 17] days, pā€‰=ā€‰0.001).

Discordant culture results were obtained for 29 specimens. Independent bacteriological confirmation of M. tuberculosis was available for 21/29 of these specimens, either by the Xpert MTB/RIF assay on the original specimen (15/29) and/or by isolation of M. tuberculosis from an independent specimen from the same patient (Supplementary Appendix, TableĀ S2). The clinical picture strongly supported the presence of M. tuberculosis complex in the remaining 8 culture-discordant samples: 5 MGIT+/MODSāˆ’ specimens came from children with clinically Highly Probable TB prior to culture results; 2 MGITāˆ’/MODS+ specimens came from children treated empirically for TB prior to the culture result although they did not meet the stringent definition of Highly Probable TB; and one MGIT+/MODSāˆ’ specimen that grew M. bovis BCG came from a 4 month old HIV-infected child who had a clinical syndrome consistent with disseminated BCG disease (Supplementary Appendix, Table S2). All were the only positive cultures in their batch, arguing against cross-contamination causing the discordance. Even if cross-contamination were the cause and these specimens considered culture negative, MGIT would remain more sensitive than MODS (95.5% vs 80.9%, pā€‰=ā€‰0.005).

Xpert MTB/RIF identified 72/97M. tuberculosis culture positive specimens, giving a sensitivity compared to culture of 74.2% (95% CI 64.3 to 82.6). Xpert MTB/RIF was also positive on 5/86 (6%) specimens from patients with culture-negative but clinically highly probable TB, and 2/179 (1%) specimens from children treated for possible TB. No other specimens were Xpert MTB/RIF positive, including none of 1164 specimens obtained from children in whom TB was subsequently excluded - giving a specificity of 100% (1 sided 97.5% CI 99.7 to 100%) and strongly suggesting positive Xpert MTB/RIF results should be interpreted as true positives.

Against the composite reference standard of a positive TB culture and/or Xpert MTB/RIF result, the sensitivity of TB culture was 93.3% (86.6 to 97.3), compared with 76.0% (66.6 to 83.8) for Xpert MTB/RIF (pā€‰=ā€‰0.002; TableĀ 5). The sensitivity of MGIT and MODS culture alone were 87.5% (79.6 to 93.2) and 71.2% (61.4 to 79.6), respectively.

Table 5 Comparison of culture and Xpert MTB/RIF sensitivity against composite reference standard of both culture and Xpert MTB/RIF.

TableĀ 6 shows Xpert MTB/RIF sensitivity against the composite reference standard, broken down by smear status, specimen type, age and HIV status. Sensitivity was higher for smear positive specimens (100% compared with 65% for smear negative specimens, pā€‰<ā€‰0.001), with a trend towards higher sensitivity among sputum samples. Neither age nor HIV status were associated with Xpert MTB/RIF yield in univariable or multivariable analyses.

Table 6 Sensitivity of the Xpert MTB/RIF assay against the composite reference standard.

Xpert MTB/RIF identified rifampicin resistance in 4 specimens from the same child, all of which were confirmed resistant to rifampicin by MODS and the GenoType MTBDRplus assay. MODS did not identify any other rifampicin resistant isolates.

Discussion

The lack of reliable diagnostic tools is the single greatest obstacle to improved childhood TB case management, particularly in low resource settings where the disease burden is greatest. Expansion of TB laboratory diagnostic capacity provides an opportunity to improve both diagnosis and surveillance in many settings3. However good quality data on the performance of each method are required to optimize their use and interpretation.

Few studies have compared laboratory methods for childhood TB diagnosis. Limitations of existing studies include small numbers of confirmed TB cases5,6,7,14,17,18,19,20,21,22, particularly among young children in whom diagnosis is most challenging5,6,7,14,19,22, and exclusion of ā€˜gold standardā€™ methods for specimen collection and/or mycobacterial culture5,6,7,17,20,23,24,25,26. Strengths of this study include the use of both sputum induction and liquid mycobacterial culture in keeping with current international recommendations2, and rigorous diagnostic assignments based on detailed clinical assessment and follow up. Prospective recruitment of all children who met broad, pre-defined inclusion criteria also ensures generalizability to a wide range of settings and clinical syndromes, including among children under 5 years who accounted for 81% of those investigated and half of all confirmed cases.

While comparison of culture yield between studies is hampered by widely varying clinical definitions, specimen types and culture methods, the proportion of culture-confirmed cases in our study is consistent with other studies using liquid mycobacterial culture for paediatric diagnosis5,8,14,18,20,26,27,28,29,30.

In our study MGIT was more sensitive than MODS culture. This is in keeping with the only other published study comparing MGIT with MODS for paediatric TB diagnosis5. Importantly we were also able to demonstrate the higher yield of MGIT for culture of paediatric induced sputum samples. These results contrast with other studies in adults in which MODS compared more favourably with MGIT4. The paucibacillary nature of childhood TB may have a role, and variation between centres also suggests MODS performance is operator dependent4. In support of this, even after one month residential training with an experienced team, we were able to demonstrate a learning effect with increasing MODS yield and reducing time to detection during the two years of the study. We did not compare MODS with culture on solid media, however other studies have shown MODS to be superior to Lowenstein Jensen media for diagnosis of childhood TB6,7,26.

Against a reference standard of mycobacterial culture, Xpert MTB/RIF sensitivity was similar to other paediatric studies11, and comparable to MODS - in keeping with the only other study to compare the two methods29. Significantly, a combination of culture and Xpert MTB/RIF increased bacteriological yield without compromising specificity.

Importantly, we were able to confirm the very high specificity of Xpert MTB/RIF in a large sample of specimens from children in whom TB had been excluded by careful clinical, radiological and laboratory assessment and follow up. Positive Xpert MTB/RIF results have been reported in a small number of child TB suspects who did well at follow up without TB treatment11,24. However the very high precision with which we were able to demonstrate 100% specificity of Xpert MTB/RIF in a carefully characterized group of children without TB suggests that these apparent false positive results from TB suspects in very high burden settings may have represented self-limiting pulmonary tuberculosis. This is well recognized in the pre-chemotherapy literature31.

Our primary analysis compared Xpert MTB/RIF sensitivity with the current culture reference standard. Having demonstrated the very high specificity of Xpert MTB/RIF we then repeated the analysis using a composite reference standard of positive culture or Xpert MTB/RIF. Composite reference standards should be used with care32. However, given the equivalent high specificities of Xpert MTB/RIF and mycobacterial culture, incorporation bias in favour of Xpert MTB/RIF is unlikely. Furthermore, misclassification bias arising from the imperfect sensitivity of culture may bias estimates of the accuracy of Xpert MTB/RIF culture alone is used as the reference33,34. We believe the composite reference standard therefore provides a fairer comparison of the two methods, and a better estimate of the true sensitivity of the Xpert MTB/RIF assay.

One potential limitation of our study is that molecular typing was not performed on MGIT-MODS discordant culture isolates to exclude cross-contamination. Nevertheless there was strong microbiological and/or clinical evidence to corroborate the positive culture in every case. In addition, negative controls in every MGIT batch and MODS plate provided no evidence of cross-contamination throughout the study, consistent with the very low prevalence and concentrations of M. tuberculosis in these paediatric specimens.

Due to finite resources, most children only had a single sputum sample obtained. Although this and other studies show the incremental yield of additional specimens where obtained, analysis of a single specimen is likely to reflect practice in most low resource settings due to the incremental cost of processing additional specimens.

In conclusion, our results underline the imperfect sensitivity of all currently available methods for bacteriological diagnosis of childhood TB. The choice of method in any particular setting depends on several factors including prevalence of TB and drug resistance; resources available to risk stratify patients for testing; test sensitivity, cost35, time to detection; and the availability of trained staff and laboratory biosafety facilities. Our data confirm the superior sensitivity of MGIT compared with MODS and Xpert MTB/RIF. Although MODS may have a role in some settings, the operationally simpler Xpert MTB/RIF assay was as sensitive as MODS, demonstrated equivalent specificity to MGIT, and combined with MGIT it optimized bacteriological yield. Together these results strengthen the evidence base for inclusion of Xpert MTB/RIF in the reference standard for bacteriologically confirmed childhood TB in both WHO clinical guidelines35 and research.

Role of the funding source

The study was supported by the Wellcome Trust in the form of research fellowships to AJB (081697) and JAGS (098532), and a core grant to the KEMRI-Wellcome Trust research programme (077092); and by a project grant from the Pneumonia Etiology Research for Child Health (PERCH) project funded by the Bill & Melinda Gates Foundation at Johns Hopkins Bloomberg School of Public Health. The funders had no role in data analysis or in the decision to publish.