A Commentary on

Mainkar A, Kim SG.

Diagnostic accuracy of 5 dental pulp tests: A systematic review and meta-analysis. J Endod 2018 44: 694-702. doi: 10.1016/j.joen.2018.01.021. Epub 2018 Mar 20. Review. PubMed PMID: 29571914.

figure 1

GRADE rating

Commentary

A pulp test's Sn and Sp are inherent properties of a diagnostic tool, and not a function of the pre-test probability of a non-vital pulp.1 Since any diagnostic tool is used in a variety of clinical situations, where the pre-test probability is scenario dependent, systematic reviews and meta-analyses of diagnostic tools typically focus on generating estimates for Sn and Sp.2,3 The clinician then uses these estimates to determine the predictive values (post-test probabilities) based on the specific clinical pre-test probability they are faced with chairside, which is not always the prevalence of the disease.1 Hence, I will focus this commentary on the review's Sn and Sp results.

Table 1 Summary of pooled diagnostic accuracy values for tools to test pulp vitality

Although two previous systematic reviews4,5 on the same clinical question were published, the authors suggest that '… [the] pooled values [meta-analysis] presented in this review may provide clinicians with a more accurate understanding for the sensitivity and specificity … of each pulp testing method'. I dispute this statement on the grounds that the high heterogeneity (I2> 85%) amongst the included cold pulp test (CPT), heat pulp test (HPT) and electric pulp test (EPT) studies makes the meta-analysis of the respective test's sensitivity (Sn) and specificity (Sp) of little merit.

Although generating a meta-analysis from a systematic review may be useful at increasing the precision of estimated results (ie narrowing the confidence interval by combining data from many studies), such analysis on diagnostic test studies poses a greater challenge for generating meaningful numbers compared to intervention or observational studies.6 Specifically, meta-analysis of diagnostic tool studies must deal with a pair of co-related outcomes (Sn and Sp) simultaneously, while other types of clinical studies typically deal with one independent outcome. Also, the validity of such analysis on diagnostic studies is often threatened by the variation in the threshold effect between studies.2 The threshold effect refers to the point that a non-response by a patient to a sensibility test (eg CPT, HPT and EPT) is considered to be positive. For example, in one of the included studies of this review,7 the response thresholds to the CPT and HPT were assessed at a maximum of 18 seconds after the respective application of cold and hot stimuli to the coronal third of a permanent tooth. On the other hand, another study's CPT and HPT thresholds were set at ten seconds for primary teeth.8 Furthermore, other included studies were not specific regarding the threshold time.9,10 Further confusing matters is the varying sources of cold (eg propane-butane mixture, CO2, dichlorotetrafluoroethane, ethyl chloride) and sources of heat (eg heated gutta percha, rubber cup) used amongst included studies. Also, the amperage threshold of the EPT studies was not standardised amongst included studies. For these and other reasons it is inappropriate, in the case of the three sensibility tests, to report separate pooled estimates for Sn and Sp. A more appropriate meta-analysis would have been a joint modeling of Sn and Sp using a receiver-operating characteristic model.3,11 In such a model, the probability of the tool's true positive (Sn) is analysed against the test's probability of a false positive (1-Sp) on a single graph, hence reflecting the covariance between Sn and Sp.

On the other hand, the PO and LDR are less prone to the same systematic errors as the sensibility tests because they do not rely on the subjective response of a patient, but rather on the relatively objective reading of the pulp's blood O2 saturation and pulpal blood flow respectively. This is reflected in the significantly lower level of heterogeneity found amongst these studies and the more precise pooled estimates for these devices compared to the sensibility tests. Unfortunately, these devices are currently fraught with other technical problems that make them impractical in clinical practice.4,5

Despite the limitation of the review's meta-analysis, its results and conclusion are consistent with the other systematic reviews. Specifically, the CPT is the simplest and most accurate pulpal sensibility test available to clinicians and hence should be the primary diagnostic tool to assess pulpal status. The EPT's low sensitivity but high specificity makes it a good tool to rule in a vital pulp given a negative result (ie patient response to it).