Diagnostic accuracy of salivary gland ultrasonography with different scoring systems in Sjögren’s syndrome: a systematic review and meta-analysis

Zhou, Mingzhu; Song, Shuju; Wu, Shanshan; Duan, Ting; Chen, Letian; Ye, Jingyi; Xiao, Jun

doi:10.1038/s41598-018-35288-5

Download PDF

Article
Open access
Published: 20 November 2018

Diagnostic accuracy of salivary gland ultrasonography with different scoring systems in Sjögren’s syndrome: a systematic review and meta-analysis

Mingzhu Zhou¹,
Shuju Song¹,
Shanshan Wu²,
Ting Duan¹,
Letian Chen¹,
Jingyi Ye³ &
…
Jun Xiao³

Scientific Reports volume 8, Article number: 17128 (2018) Cite this article

2381 Accesses
22 Citations
Metrics details

Subjects

Abstract

Noninvasive objective salivary gland ultrasonography (SGU) had been widely used to evaluate major salivary gland involvement in primary Sjögren’s syndrome (pSS) and treatment responses. However, the evaluation score, diagnostic sensitivity, and diagnostic specificity significantly varied among clinical studies. We conducted this meta-analysis to assess the diagnostic accuracy of different SGU scoring systems using the American-European Consensus Group criteria. Of the 1301 articles retrieved from six databases, 24 met the criteria for quality assessment and 14 for meta-analyses. The pooled sensitivities were 75% (0–4) with I² = 92.0%, 84% (0–16) with I² = 63.6%, and 75% (0–48) with I² = 90.9%; the pooled specificities were 93% (0–4) with I² = 71.5%, 88% (0–16) with I² = 65.4%, and 95% (0–48) with I² = 83.9%; the pooled diagnostic odds ratios were 71.26 (0–4) with I² = 0%, 46.3 (0–16) with I² = 73.8%, and 66.07 (0–48) I² = 0%; the areas under the SROC curves were 0.95 (0–4), 0.93 (0–16), and 0.94 (0–48). These results indicated that the 0–4 scoring system has a higher specificity and a less heterogeneity than other systems, and could be used as a universal SGU diagnostic standard.

Salivary ultrasonography and histopathologic evaluation of secondary Sjögren’s syndrome in rheumatoid arthritis patients

Article Open access 13 July 2023

Youngjae Park, Minae Oh, … Wan-Uk Kim

Anti-centromere antibody positivity is an independent variable associated with salivary gland ultrasonography score in Sjögren’s syndrome

Article Open access 04 March 2024

Toshimasa Shimizu, Shin-ya Nishihata, … Atsushi Kawakami

Whole-Volume ADC Histogram Analysis in Parotid Glands to Identify Patients with Sjögren’s Syndrome

Article Open access 03 July 2019

Chen Chu, Qianqian Feng, … Zhengyang Zhou

Introduction

Sjögren’s syndrome (SS) is the second most prevalent autoimmune rheumatic disease with a prevalence of 0.05%^1,2. SS can affect any body organ or system such as interstitial lung disease, pulmonary hypertension, amyloidosis, and mucosa-associated lymphoid tissue lymphoma^3,4,5,6. SS patients constantly suffer dry mouths, dry eyes, dry skin, a chronic cough, vaginal dryness, numbness in the arms and legs, feeling tired, muscle and joint pains, and thyroid problems.

The pathophysiology of SS has not been fully understood. It is believed to involve a combination of genetics and an environmental trigger such as exposure to a virus or bacteria^7,8. Due to the vast range of SS symptoms and the similarity between symptoms of SS and those of other conditions, diagnosis of SS is complicated and difficult. Also, since the SS symptoms such as dry eyes and dry mouth are very common, especially among patients over 40 years old, it is often mistaken as age-related, thus ignored. In addition, some medications can also cause symptoms similar to those of this autoimmune disorder. These unspecific and common symptoms make the objective diagnosis crucial.

Sialography and biopsy of the labial minor salivary glands (“lip biopsy”) are the established and objective examinations in diagnosing SS. However, the invasiveness and the complications from these procedures limit their clinical uses. Recently, noninvasive objective salivary gland ultrasonography (SGU) had been widely used to evaluate major salivary gland involvement in primary SS (pSS) and treatment responses⁹. Plenty of clinical studies demonstrated that SGU is sensitive and specific to pSS^{10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33}. Some studies showed that the results from SGU were highly consistent to those from Sialography and lip biopsy^16,17,23. It was recommend that SGU be used as a SS diagnostic tool²⁴. However, the evaluation score, diagnostic sensitivity, and diagnostic specificity significantly varied among these clinical studies^{10,11,12,13,14,15,16,17,18,19,20,21,22,23,24,25,26,27,28,29,30,31,32,33}. Therefore, a meta-analysis of these exiting clinical studies is needed to evaluate which scoring system has lower heterogeneity.

To our knowledge, the meta-analysis conducted by Delli et al. is the only meta-analysis that assessed the diagnostic properties of SGU in the diagnosis of SS². It has been established that a single gold standard should be used in meta-analysis. However, multiple gold standards i.e., FC, JDC, CC, TC, ECSG, AECG, RJDC, were used by Delli et al.². In addition, Delli et al. did not performed subgroup analysis, likely introducing bias. Therefore, a meta-analysis of these exiting studies by subgroups using a single gold standard is urgently needed to recommend a guideline regarding whether SGU is a highly specific pSS diagnostic tool and which SGU scoring system can be used as an universal diagnostic standard. To that end, we used the American-European Consensus Group (AECG) criteria as the gold standard and performed subgroup analyses per SGU scoring system.

Results

Study identification and selection

A total of 1301 studies were identified in the six databases. One thousand one hundreds and eighty-five studies were excluded per titles and abstracts; 92 per the exclusion criteria. The quality assessment was performed using QUSDAS-2 in the remaining 24 studies (Fig. 1), all of which used the AECG criteria for diagnosis of SS. Of the 24 studies, one study didn’t report about the scoring system clearly¹⁷, one used a self-defined complex scoring system²¹. Four scoring systems were used in 22 studies. Because 0–12 scoring system was used in only two studies, the final meta-analysis focused on the included 14 studies with three scoring systems as subgroups (0–4, 0–16, and 0–48).

Study characteristics

A total of 3360 patients were enrolled in the 24 studies, including 1976 SS patients and 1384 control subjects (Table 1). Fifteen studies only included pSS patients, 3 studies included both pSS and secondary SS (sSS) patients, and 6 studies didn’t specify the type of the disease (pSS or sSS); 4 studies used sSS patients as control, 18 studies used subjects with sicca symptoms but not SS as control, 12 studies used healthy controls as control. Overall, 12 studies had only one control group; 10 studies had more than one control groups; and 2 studies had no control group.

Table 1 Characteristics of the 24 included studies (AECG as the diagnostic criteria).

Full size table

Ultrasonography scoring systems and the subgroups

Fourteen studies used 0–4 scoring system including 0–3 scoring system (Table 1)³⁴. The positive criteria is mild parenchymal inhomogeneity seen as multiple hypoechogenic areas measuring <2 mm with blurred borders. Eight Studies used the scores 0–4 for counting. Six studies used the scores 0–3 for counting (the positive criteria was mentioned above). Seven studies were excluded including one study using AECG criteria partly as gold standard²⁴, two using the positive criteria lower than the above-mentioned criteria^12,15, two having no control groups^14,26, one using a self-defined complicated scoring system²¹, and one using the same patient population³⁰. The remaining seven studies were included as the 0–4 scoring subgroup in the meta-analysis^{10,11,25,27,31,32,33}.

Two studies used 0–12 scoring system^19,29. The scores ranged from 0 to 12, and ≥6 score was considered as positive criteria. The 0–12 scoring system was excluded from meta-analysis due to small sampling.

Six studies used 0–16 scoring system, which was first reported by Salaffi et al.³⁵. The scores ranged from 0 to 16. One study considered ≥5 as positive criteria³³; two studies ≥6^16,29; two studies ≥7^23,28; and one study ≥8²⁰. These six studies included as the 0–16 scoring subgroup in the meta-analysis.

Five studies used 0–48 scoring system, which was first reported by Hocevar et al.¹³. The scores ranged from 0 to 48. Two studies considered ≥17 as positive criteria^13,29; one study ≥15²⁸; one study ≥19¹⁸; and one study didn’t describe the cut-off value, which was excluded²². The four studies, which described the cuff-off values, were included as the 0–48 scoring subgroup in the meta-analysis.

Taken all together, 14 studies of the three subgroups were included in the meta-analysis, including 11 studies used only one scoring system, 2 studies used two scoring systems, and 1 study used three scoring systems (Tables 1 and 2).

Table 2 The sensitivity, specificity, and diagnostic OR of the three scoring systems (AECG as the diagnostic criteria).

Full size table

Diagnostic properties

The diagnostic properties of ultrasonography were compared among the 14 studies using the AECG criteria (Table 2). In particular, for the 14 studies included in our meta-analysis, the cut-off values ranged 1–2 (0–4 scoring system), 5–8 (0–16 scoring system), and 15–19 (0–48 scoring system), respectively. The sensitivity ranged 52–94% (0–4 scoring system), 75–93% (0–16 scoring system), and 59–91% (0–48 scoring system), respectively. The diagnostic specificity ranged 83–98% (0–4 scoring system), 78–97% (0–16 scoring system), and 84–100% (0–48 scoring system), respectively. The diagnostic OR ranged 47.06–278.67 (0–4 scoring system), 13.79–389.50 (0–16 scoring system), and 41.33–115.00 (0–48 scoring system), respectively.

The 0–4 scoring system has the least variations in specificity and diagnostic OR (0.90–0.95 and 42.29–120.09, respectively) when compared with the 0–16 scoring system (0.85–0.91 and 19.95–107.44, respectively) and the 0–48 scoring system (0.91–0.97 and 33.73–129.42, respectively) while the three systems have similar variations in sensitivity (0.71–0.79 in 0–4, 0.81–0.87 in 0–16, 0.70–0.80 in 0–48). In addition, the 0–4 scoring system had a universal cut-off value of 1 or 2 while the other two scoring systems did not. These results indicated that 0–4 scoring system is a more consistent scoring system.

Diagnostic accuracy

In the 0–4 scoring system for sensitivity, the I² index was 92.0%, (df = 6, p < 0.001) with a pooled sensitivity was 75% (71–79%) (Table 3); for specificity, the I² index was 71.5%, (df = 6, p = 0.0018) with a pooled specificity was 93% (90–95%); for DOR, the I² index was 0%, (df = 6, p = 0.643) with the pooled DOR was 71.26 (42.29–120.09). In the 0–16 scoring system for sensitivity, the I² index was 63.6%, (df = 5, p = 0.0174) with a pooled sensitivity was 84% (81–87%); for specificity, the I² index was 65.4%, (df = 5, p = 0.0129) with a pooled specificity was 88% (85–91%); for DOR, the I² index was 73.8%, (df = 5, p = 0.0019) with the pooled DOR was 46.3 (19.95–107.44). In the 0–48 scoring system for sensitivity, the I² index was 90.9%, (df = 3, p < 0.001) with a pooled sensitivity was 75% (70–80%); for specificity, the I² index was 83.9%, (df = 3, p = 0.0003) with a pooled specificity was 95% (91–97%); for DOR, the I² index was 0%, (df = 3, p = 0.551) with the pooled DOR was 66.07 (33.73–129.42). In summary, 0–16 scoring system had the highest sensitivity (84%) with relatively small I² (63.6%); 0–48 scoring system had the highest specificity (95%), which was similar to that of 0–4 (93%); 0–4 and 0–48 scoring systems had the best DOR (I² = 0%).

Table 3 The meta-analysis results of three scoring systems (AECG as the diagnostic criteria).

Full size table

Due to the different cut-off values of the scoring systems, SROC analyses were performed (Fig. 2). The summary operating sensitivities were 78% (65–88%) (0–4 scoring system), 85% (79–89%) (0–16 scoring system), and 78% (61–89%) (0–48 scoring system), respectively; the summary operating specificities were 95% (89–98%) (0–4), 89% (83–93%) (0–16), and 95% (86–98%) (0–48), respectively; the areas under curves (AUC) were 0.95 (0.93–0.97) (0–4), 0.93 (0.91–0.95) (0–16), and 0.94 (0.92–0.96) (0–48), respectively, indicating that these three systems were accurate diagnostic systems.

Taken all together, the heterogeneities of the pooled DOR for 0–4 and 0–48 scoring systems was 0%, or no heterogeneities, suggesting that these two scoring systems be reliable. However, it seemed that the 0–4 scoring system was the best among the three scoring systems because (i) the cut-off value was pre-specified in the 0–4 scoring system while the cut-off values in both 0–16 and 0–48 scoring system were different among the studies, and (ii) the 0–4 scoring system has the least variations in specificity and diagnostic OR (0.90–0.95 and 42.29–120.09, respectively) when compared with the 0–16 scoring system (0.85–0.91 and 19.95–107.44, respectively) and the 0–48 scoring system (0.91–0.97 and 33.73–129.42, respectively).

Quality assessment and risk of bias of the studies

High risk of bias was observed in “patient selection” due to the variations of inclusion and exclusion criteria (e.g., whether a case-control study was included or excluded) as well as the patient selection criteria (i.e., whether patients were enrolled consecutively or randomly) (Fig. 3). High risk of bias was also observed in the “conduct and interpretation of index test” due to the designs of the original studies (e.g., whether the SGU results were interpreted with the knowledge of the SS diagnosis; whether a threshold was pre-specified).

Twenty-four studies were included in the QUADAS-2 quality assessment, including 21 studies used only one scoring system, 2 studies used two scoring systems, and 1 study used three scoring systems (Table 1). The most frequent high risks of bias were biases due to patient selection and index test. In particular, 14 (58.3%) studies and 10 (41.7%) studies were rated as “high risk” in terms of the biases due to patient selection and due to index test (Table 4). In contrast, all the studies were rated as “low risk” in terms of the biases due to reference standard and due to flow and timing. More studies had low concerns in the applicability of patient selection (58.3%) than in the applicability of index test (41.7%) and the applicability of reference standard (0%). These results indicate that the applicability of SGU was high.

Table 4 Risk of bias and applicability of the studies included.

Full size table

An ultrasound picture scored with different scoring systems

Direct comparisons among different scoring systems on a same patient was performed (Fig. 4). 0–4 system is significantly distinguished from the other 3 systems while the 3 systems proportionally project among each other in essence (Fig. 4, lower right).

Discussion

Plenty of clinical researches using different diagnostic criteria and scoring systems indicated that SGU is sensitive and specific to pSS. In contrast, the meta-analyses on these studies were scarce. Only two meta-analysis studies have published regarding the diagnostic value of SGU in SS patients^2,36. One study compared the diagnostic properties of ultrasonography and sialography in SS, demonstrating that ultrasonography was comparable with sialography³⁶. However, this meta-analysis only included six studies, and could not explain the diagnostic value of ultrasonography in SS. In addition, the assessment of research methodology was less rigorous, with high risk of bias in all QUADAS-2 domains, resulting in concerns regarding the outcomes². The other study performed a good quality assessment². However, this meta-analysis did not distinguish the diagnostic criteria and the scoring systems. In addition, the studies included were not rigidly designed and performed as their results showed significant heterogeneity. Therefore, quality of the pooled outcomes (sensitivity, specificity, and diagnostic odds ratio) was low. The likely source of this heterogeneity was the ultrasonography scoring systems. To our knowledge, the current study is the first meta-analysis to perform subgroup analyses regarding different scoring system using only one diagnostic criterion.

The different cut-off values in the 0–16 and 0–48 systems resulted in relative large heterogeneity of sensitivity and specificity. To decrease this heterogeneity, we conducted SROC curve analysis in the three subgroups. Our results indicted all the three systems are reliable diagnostic tools with similar accuracy (SROC AUC 0.95 (0–4), 0.93 (0–16), and 0.94 (0–48)).

In the 0–4 system, the sensitivity was 75%, specificity 93%, diagnostic DOR heterogeneity 0%, cut-off pre-specific. In addition, the operation was simple and the operation time was shorter. These advantages allowed the 0–4 system to outperform 0–16 and 0–48 systems. In contrast, although both 0–16 and 0–48 systems were reliable scoring systems with similar AUC, the cut-off values were not pre-specified indicating that these scoring systems could not be used as SGU diagnostic standard. Taken all together, the 0–4 scoring system seems to be a better scoring system being used as a universal SGU diagnostic standard with a higher specificity and a less heterogeneity than the other scoring systems (0–16 and 0–48). Actually, the 0–4 system is significantly distinguished from the other 3 systems (Fig. 4, lower right).

This study has several strengths. First, mainly four scoring systems are used clinically, each of which has it own advantages. It is of clinical significance to meta-analyze different scoring systems as subgroups, respectively, to decrease possible heterogeneity and establish which scoring system is overall the best. Our results indicted that the 0–4 scoring system was the best among the three scoring systems as the diagnostic criterion. In particular, the heterogeneity of the pooled DOR for 0–4 and 0–48 scoring systems was 0%, or no heterogeneity, indicating that these scoring systems are reliable. Between the two scoring systems, we think 0–4 scoring system is better, because the cut-off value is pre-specified. In contrast, the cut-off value of 0–48 scoring system is different among studies. In addition, the heterogeneity of the pooled specificity was high in 0–48 system. The heterogeneity of the pooled sensitivity of 0–4 and 0–48 scoring system was both very high. This might relate with the selection of patients and control groups.

Second, diagnostic criteria are clinically important as different criteria might result in different diagnosis. However, previous meta-analysis included studies using different diagnostic criteria, i.e., FC, JDC, CC, TC, ECSG, AECG, RJDC². This meta-analysis only included studies that used AECG as a single criteria decreasing possible heterogeneity. Our results were consistent with clinical practice that AECG could be considered as an established diagnostic criteria for pSS.

Third, QUADAS-2 is the best quality assessment tool for diagnostic research. The quality of recent studies were higher than the past ones as showed by QUADAS-2³⁷. In contrast to studies enrolled consecutive subjects or subjects with suspected SS, and divided into groups after the index and reference test^{13,14,16,17,18,19,21,23,24,27,29}, this up-to-date meta-analysis included more recent rigidly designed studies and avoided the case-control type.

This study also has some limitations. First, the studies included varied in terms of (a) patient enrollment (random or not), which could cause the selection bias; (b) the blindness of the SGU examination to the diagnostic procedures, which could cause confirmatory bias (specifically, not blindness could cause confirmatory bias). This might be the major reason that the heterogeneity of sensitivity and specificity was high (I² of the sensitivity is 92.0% (0–4), 63.6% (0–16), and 90.9% (0–48), respectively; I² of the specificity is 71.5% (0–4), 65.4% (0–16), and 83.9% (0–48), respectively). SGU is highly specific in pSS. However, some studies only enrolled pSS patients while some enrolled both pSS and sSS patients. Regarding the control, it was ideal that control enrolls either sicca or healthy patients. However, some studies failed to do so directly resulting in the heterogeneity of sensitivity. In addition, the threshold of SGU was pre-specified only in 0–4 system while not pre-specified in other systems. However, these limitations unlikely could change our conclusion, which was drawn from the comparisons of subgroups.

Second, due to our rigid selection criteria, studies included in this meta-analysis used one SS diagnosis standard. However, two problems still existed: First, regarding the threshold of the SGU scores, the cut-off value was not pre-specified except in the 0–4 scoring systems; Second, most studies didn’t mention the interval between SGU and AECG diagnosis. Only in 7 studies that the SGU was performed simultaneously with the diagnostic procedures^{13,16,17,18,23,27,29}. Regarding subgroup analysis, less than 8 studies were included in each subgroup so that the funnel plot is insignificant.

In conclusion, SGU is a highly specific pSS diagnostic tool. The 0–4 scoring system is a better scoring system as a universal SGU diagnostic standard in terms of specificity and heterogeneity than the other scoring systems (0–16 and 0–48).

Methods

This review followed the guidelines of the Preferred Reporting Items for Systematic Reviews³⁸ and Meta-Analysis and the Meta-Analysis of Observational Studies in Epidemiology³⁹.

Literature Search

Six databases (Embase, Pubmed, Cochrane library, China National Knowledge Infrastructure (CNKI), WanFang databases, and WeiPu Periodical Resource Integration Service Platform from September 1, 1982 until April 15, 2018) were searched with the keywords (“salivary gland”, “parotid gland”, or “submandibular gland”) and (“ultrasonography”, “ultrasound”, or “sonography”), and (“Sjögren’s syndrome”, “Sjögren syndrome”, “sicca syndrome”, or “sicca”).

Study Selection

Inclusion criteria were studies containing data on the diagnostic value of SGU for pSS, using AECG criteria as the diagnostic criteria, and including more than 20 cases. Exclusion criteria for titles and abstracts included: case reports, case series with fewer than 20 cases, letters to the editor, experts’ opinions, review articles, studies without diagnostic value of SGU, and studies used non-AECG criteria as diagnostic criteria. The studies were fully assessed if the title and abstract only provided limited information or in case of doubt.

Two independent researchers (M.Z. and S.S.) initially evaluated the titles and abstracts for eligibility per inclusion and exclusion criteria. Disagreements were resolved through consensus. The full texts of eligible studies were screened by the diagnosis criteria of SS. The studies using the AECG criteria as the golden standard were finally selected for this study.

Data Extraction

Two researchers (M.Z. and S.S.) extracted the data independently. Disagreements were resolved through consensus. Extracted information included description of population, publication year, study type, study design, diagnosis criteria for SS, the definition of the scoring systems in studies (Supplemental Materials STable 1), and ultrasonographic scoring system as well as true positive, true negative, false positive, and false negative.

Quality Assessment

Two researchers (M.Z. and S.S.) assess the quality of the studies per QUADAS-2 (the revised Quality Assessment of Diagnostic Accuracy Studies) tool. Disagreements were resolved by discussion.

Statistical Analysis

Selected studies were further divided into three subgroups, 0–4, 0–16, and 0–48 ultrasonographic scoring systems. The pooled diagnostic sensitivity, specificity, and odds ratio (DOR) were calculated for each subgroups. The heterogeneity of the pooled sensitivity, specificity, and DOR were measured by the inconsistency (I²) and Cochran Q test. The heterogeneity was a measure of the between-study variations and was used to assess whether the studies in a meta-analysis represented a single population or several different populations. The percentage measures of the heterogeneity among the enrolled articles were calculated as I² index. Small heterogeneity in the enrolled articles was defined as I² < 25%, moderate heterogeneity was defined as I² 25–50%, obvious heterogeneity was defined as I² > 50%. The Cochran Q test was used for calculating heterogeneity (P < 0.05). The random effects model was used for data analysis.

The risk of bias of the included studies was assessed by QUADAS-2 tool³⁷. Quality assessment was performed with Review Manager software (version 5.3, The Nordic Cochrane Centre, The Cochrane Collaboration). Pooling of sensitivity, specificity, DOR, and heterogeneity test were performed with Meta-Disc software (version 1.4, Madrid, Spain). The summary receiver operating characteristic (SROC) curves were produced in STATA13.0.

Date Availability Statement

The authors declare that the data in this research is available.

References

Pillemer, S., Matteson, E. & Jacobsson, L. Incidence of physician-diagnosed primary Sjogren syndrome in residents of Olmsted County, MN. Mayo Clin Proc. 76, 593–599 (2001).
Article CAS Google Scholar
Delli, K. et al. Diagnostic properties of ultrasound of major salivary glands in Sjögren’s syndrome: a meta-analysis. Oral Dis. 21, 792–800 (2015).
Article CAS Google Scholar
Bertoni, M. et al. Pulmonary hypertension in primary Sjögren’s syndrome: report of a case and review of the literature. Clin Rheumatol. 24, 431–4 (2005).
Article CAS Google Scholar
Shahane, A. Pulmonary hypertension in rheumatic diseases: epidemiology and pathogenesis. Rheumatol Int. 33, 1655–67 (2013).
Article CAS Google Scholar
Brito-Zerón, P. et al. Sjögren syndrome. Nat Rev. Disease Primers. 2, 16047 (2016).
Article Google Scholar
Klippel, J. Primer on the rheumatic diseases, 13th ed. New York, NY, Springer. 389 (2008).
Takeda, K., Kaisho, T. & Akira, S. Toll-like receptors. Annu Rev Immunol. 21, 335–76 (2003).
Article CAS Google Scholar
Pflugfelder, S. C. et al. Epstein-Barr virus and the lacrimal gland pathology of Sjögren’s syndrome. Am J Pathol. 143, 49–64 (1993).
CAS PubMed PubMed Central Google Scholar
Jousse-Joulin, S. et al. Ultrasound assessment of salivary glands in patients with primary Sjögren’s syndrome treated with rituximab: Quantitative and Doppler waveform analysis. Biologics. 1, 311–319 (2007).
CAS PubMed PubMed Central Google Scholar
El Miedany, Y. M. et al. Quantitative ultrasonography and magnetic resonance imaging of the parotid gland: can they replace the histopathologic studies in patients with Sjögren’s syndrome? Joint Bone Spine. 71, 29–38 (2004).
Article Google Scholar
Niemela, R. K. et al. Ultrasonography of salivary glands in primary Sjögren’s syndrome. A comparison with magnetic resonance imaging and magnetic resonance sialography of parotid glands. Rheumatol. 43, 875–879 (2004).
Article CAS Google Scholar
Su, Y. J., Du, L. F. & Shi, L. L. Comparative study on parotid ultrasonography and sialography in Sjögren’s syndrome. Chin J Ultrasonogr. 13, 588–590 (2004).
Google Scholar
Hocevar, A. et al. Ultrasonographic changes of major salivary glands in primary Sjögren’s syndrome. Diagnostic value of a novel scoring system. Rheumatol. 44, 768–772 (2005).
Article CAS Google Scholar
Yang, B. et al. Comparative study on salivary gland SPECT and ultrasonography in diagnosis of Sjögren’s syndrome. China Medical Herald. 3, 148–149 (2006).
ADS Google Scholar
Song, S. J. et al. Comparative study on parotid ultrasonography and scintigraphy and labial gland biopsy in Sjögren’s syndrome. Chin J Med Ultrasound. 4, 159–161 (2007).
ADS Google Scholar
Salaffi, F. et al. Ultrasonography of salivary glands in primary Sjögren’s syndrome: a comparison with contrast sialography and scintigraphy. Rheumatol. 47, 1244–1249 (2008).
Article CAS Google Scholar
Poul, J. H., Brown, J. E. & Davies, J. Retrospective study of the effectiveness of high-resolution ultrasound compared with sialography in the diagnosis of Sjögren’s syndrome. Dentomaxillofac Radiol. 37, 392–397 (2008).
Article CAS Google Scholar
Milic, V. D. et al. Diagnostic value of salivary gland ultrasonographic scoring system in primary Sjögren’s syndrome: a comparison with scintigraphy and biopsy. J Rheumatol. 36, 1495–1500 (2009).
Article Google Scholar
Milic, V. D. et al. Major salivary gland sonography in Sjögren’s syndrome: diagnostic value of a novel ultrasonography score (0–12) for parenchymal inhomogeneity. Scand J Rheumatol. 39, 160–166 (2010).
Article Google Scholar
Xu, Z. H. et al. Value of sonographic score in the diagnosis of salivary gland involvement in patients with Sjögren’s syndrome. Chin J Ultrasonogr. 19, 977–980 (2010).
Google Scholar
Takagi, Y. et al. Salivary gland ultrasonography: can it be an alternative to sialography as an imaging modality for Sjögren’s syndrome? Ann Rheum Dis. 69, 1321–1324 (2010).
Article Google Scholar
Kong, L. H. et al. The comparative study of salivary gland scintigraphy quantitative analysis and high frequency ultrasound and labial gland biopsy in patients with Sjögren’s syndrome. Med J of Commun. 25, 133–136 (2011).
Google Scholar
Milic, V. et al. Ultrasonography of major salivary glands could be an alternative tool to sialoscintigraphy in the American-European classification criteria for primary Sjögren’s syndrome. Rheumatol. 51, 1081–1085 (2012).
Article Google Scholar
Cornec, D. et al. Contribution of salivary gland ultrasonography to the diagnosis of Sjögren’s syndrome: toward new diagnostic criteria? Arthritis Rheum. 65, 216–225 (2013).
Article Google Scholar
Theander, E. & Mandl, T. Primary Sjögren’s syndrome: The diagnostic and prognostic value of salivary gland ultrasonography using a simplified scoring system. Arthritis Care Res. 66, 1102–1107 (2014).
Article Google Scholar
Hammenfors, D. S. et al. Diagnostic utility of major salivary gland ultrasonography in primary Sjögren’s syndrome. Clin Exp Rheumatol. 33, 56–62 (2015).
PubMed Google Scholar
Baldini, C. et al. Salivary gland ultrasonography: a highly specific tool for the early diagnosis of primary Sjögren’s syndrome. Arthritis Res Ther. 17, 146 (2015).
Article Google Scholar
Zhang, X. et al. Ultrasonographic evaluation of major salivary glands in primary Sjögren’s syndrome: comparison of two scoring systems. Rheumatol. 54, 1680–1687 (2015).
Article CAS Google Scholar
Lin, D. F. et al. Cross-sectional comparison of ultrasonography scoring systems for primary Sjögren’s syndrome. Int J Clin Exp Med. 8, 19065–19071 (2015).
PubMed PubMed Central Google Scholar
Zhou, M. et al. The comparative study of diagnositic value for primary Sjögren’s syndrome between salivary gland ultrasonography and scintigraphy. J Clin Exp Med. 15, 87–90 (2016).
Google Scholar
Zhou, M. et al. Study on the value of salivary gland ultrasonography in Sjögren’s syndrome. Chin J Rheumatol. 20, 317–320 (2016).
Google Scholar
Chen, S. et al. Combination of Salivary Gland Ultrasonography and Virtual Touch Quantification for Diagnosis of Sjögren’s Syndrome: A Preliminary Study. BioMed Res Int. 2016, 2793898 (2016).
PubMed PubMed Central Google Scholar
Qi, X. et al. Comparison of the diagnostic value of four scoring systems in primary sjögren’s syndrome patients. Immunol Lett. 188, 9–12 (2017).
Article CAS Google Scholar
De Vita, S. et al. Salivary gland echography in primary and secondary Sjögren’s syndrome. Clin Exp Rheumatol. 10, 351–356 (1992).
PubMed Google Scholar
Salaffi, F. et al. Salivary gland ultrasonography in the evaluation of primary Sjogren’s syndrome. Comparison with minor salivary gland biopsy. J Rheumatol. 27, 1229–1236 (2000).
CAS PubMed Google Scholar
Song, G. G. & Lee, Y. H. Diagnostic accuracies of sialography and salivary ultrasonography in Sjogren’s syndrome patients: a meta-analysis. Clin Exp Rheumatol. 32, 516–522 (2014).
PubMed Google Scholar
Whiting, P. F. et al. QUADAS-2: a revised tool for the quality assessment of diagnostic accuracy studies. Ann Intern Med. 155, 529–536 (2011).
Article Google Scholar
Moher, D., Liberati, A., Tetzlaff, J. & Altman, D. G., PRISMA Group. Preferred reporting items for systematic reviews and meta-analyses: the PRISMA statement. Ann Intern Med. 151, 264–269 (2009).
Article Google Scholar
Stroup, D. F. et al. Meta-analysis of observational studies in epidemiology: a proposal for reporting. Meta-analysis Of Observational Studies in Epidemiology (MOOSE) group. JAMA. 283, 2008–2012 (2000).
Article CAS Google Scholar

Download references

Author information

Authors and Affiliations

Department of Rheumatology and Immunology, Beijing Friendship Hospital, Capital Medical University, Beijing, P. R. China
Mingzhu Zhou, Shuju Song, Ting Duan & Letian Chen
Department of National Clinical Center of Digestive Diseases, Beijing Friendship Hospital, Capital Medical University, Beijing, P. R. China
Shanshan Wu
The Macrohard Institute of Health, Michigan, MI, 48148, USA
Jingyi Ye & Jun Xiao

Authors

Mingzhu Zhou
View author publications
You can also search for this author in PubMed Google Scholar
Shuju Song
View author publications
You can also search for this author in PubMed Google Scholar
Shanshan Wu
View author publications
You can also search for this author in PubMed Google Scholar
Ting Duan
View author publications
You can also search for this author in PubMed Google Scholar
Letian Chen
View author publications
You can also search for this author in PubMed Google Scholar
Jingyi Ye
View author publications
You can also search for this author in PubMed Google Scholar
Jun Xiao
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

M.Z. and J.X. designed the study, interpreted the data, and wrote the manuscript; M.Z. and S.S. extracted the data independently; M.Z. and T.D. conducted the meta-analysis; S.W. and L.C. provided important insights into interpreting the data; J.Y. provided important insights into the manuscript. All authors read and approved the final manuscript.

Corresponding authors

Correspondence to Mingzhu Zhou or Jun Xiao.

Ethics declarations

Competing Interests

The authors declare no competing interests.

Additional information

Publisher’s note: Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Electronic supplementary material

Supplementary materials

Rights and permissions

Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the article’s Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the article’s Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.

Reprints and permissions

About this article

Cite this article

Zhou, M., Song, S., Wu, S. et al. Diagnostic accuracy of salivary gland ultrasonography with different scoring systems in Sjögren’s syndrome: a systematic review and meta-analysis. Sci Rep 8, 17128 (2018). https://doi.org/10.1038/s41598-018-35288-5

Download citation

Received: 29 December 2017
Accepted: 01 November 2018
Published: 20 November 2018
DOI: https://doi.org/10.1038/s41598-018-35288-5

Keywords

This article is cited by

Sonographic Features of Salivary Glands in Sjögren’s Syndrome and its Mimics
- Tracian James-Goulbourne
- Vagishwari Murugesan
- Eugene Y. Kissin
Current Rheumatology Reports (2020)

Comments

By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.

Subjects

Abstract

Similar content being viewed by others

Salivary ultrasonography and histopathologic evaluation of secondary Sjögren’s syndrome in rheumatoid arthritis patients

Anti-centromere antibody positivity is an independent variable associated with salivary gland ultrasonography score in Sjögren’s syndrome

Whole-Volume ADC Histogram Analysis in Parotid Glands to Identify Patients with Sjögren’s Syndrome

Introduction

Results

Study identification and selection

Study characteristics

Ultrasonography scoring systems and the subgroups

Diagnostic properties

Diagnostic accuracy

Quality assessment and risk of bias of the studies

An ultrasound picture scored with different scoring systems

Discussion

Methods

Literature Search

Study Selection

Data Extraction

Quality Assessment

Statistical Analysis

Date Availability Statement

References

Author information

Authors and Affiliations

Contributions

Corresponding authors

Ethics declarations

Competing Interests

Additional information

Electronic supplementary material

Supplementary materials

Rights and permissions

About this article

Cite this article

Share this article

Keywords

This article is cited by

Sonographic Features of Salivary Glands in Sjögren’s Syndrome and its Mimics

Comments

Search

Quick links