Abstract
We compared the diagnostic performances and unnecessary FNA rates of several guidelines and modified versions using the size threshold of the ACR TIRADS. Our Institutional Review Board approved this retrospective study and waived the requirement for informed consent and all methods were performed in accordance with the Declaration of Helsinki. A total of 1,384 thyroid nodules in 1,301 patients with definitive cytopathologic findings were included. US categories were assigned according to each guideline. We applied the size threshold suggested by the ACR TIRADS for FNA to the Kwak, ATA and EU guidelines and defined these modified guidelines as the modified Kwak (mKwak), modified ATA (mATA) and modified EU (mEU) guidelines. Diagnostic performances and unnecessary FNA rates of all guidelines were evaluated. Of 1,384 thyroid nodules, 291 (21%) were malignant. Among the original guidelines, the ACR TIRADS had the highest specificity, accuracy, LR and AUC (62.2%, 66%, 2.128 and 0.713). The mKwak, mATA and mEU guidelines had higher specificity, accuracy, LR and AUC (Pā<ā0.001 for all), and fewer unnecessary FNAs, compared with their original guidelines. Among all original and modified guidelines, the mKwak guideline had the highest specificity, accuracy, LR and AUC (64%, 68.6%, 2.389 and 0.75). The unnecessary FNA rate was the lowest with the mKwak guideline (61.1%). The highest sensitivity was observed with the ATA guideline (98.6%). After incorporating the size threshold of the ACR TIRADS to other TIRADS, all guidelines showed higher diagnostic accuracy and lower unnecessary FNA rates than their original versions. The mKwak guideline showed the best diagnostic performances.
Similar content being viewed by others
Introduction
Thyroid ultrasonography (US) is now regularly performed in clinical practice and thyroid nodules are exceedingly common on US with as many as 68% of adults having one, leading to issues of overdiagnosis and overtreatment1,2. Many guidelines recommend fine-needle aspiration (FNA) based on several risk stratification systems which use different US features and even different size thresholds3,4,5,6,7. Current risk stratification systems using US features can be broadly divided into two types: the point-scale Thyroid Imaging Reporting and Data System (TIRADS) suggested by Kwak et al. 8, Park et al. 9 and the American College of Radiology (ACR)3 and the pattern-recognition TIRADS suggested by Horvath et al. 10, the 2015 American Thyroid Association (ATA)7, and European Thyroid Association (EU)11. Different size criteria have been suggested by the ATA guideline, ACR and EU TIRADS3,7,11. Although there are many guidelines for recommending FNA for thyroid nodules on US, a worldwide communicable system does not presently exist.
Recently, Grani et al. 12 demonstrated that the ACR TIRADS reduced unnecessary FNAs more than other international guidelines with a very low false-negative rate (2.2%, 6/268). The ACR TIRADS suggests a higher size threshold for FNA than other guidelines while still recommending similar malignancy risks for each final assessment category3,7,11, and this higher size threshold is thought to explain the decrease in unnecessary FNAs3. However, physicians may need more time to classify a nodule on US when using the ACR TIRADS because each US feature is weighted differently3. On the other hand, one of other point-scale risk stratification systems proposed by Kwak et al. (Kwak TIRADS) has been proven to be practical and easily applicable in the assessment of thyroid nodules8,13,14,15,16,17,18,19,20, and can be performed by simply counting the number of suspicious US features without considering the malignancy probability of each US feature. One recent study compared the diagnostic efficiency of Kwak and ACR TIRADS and found the former to have higher AUC and accuracy19. However, the study did not consider the size threshold for recommending FNA19. We assumed that if they have similar diagnostic performances with the same size threshold for thyroid nodules, radiologists and clinicians can choose the more convenient risk stratification system for daily practice.
To find an effective guideline for recommending FNA for thyroid nodules, we investigated the diagnostic performances and unnecessary FNA rates of several guidelines in their original form, and their modified versions using the size threshold proposed by the ACR TIRADS.
Results
Baseline clinicopathological characteristics
Of 1,384 thyroid nodules, 1,093 (79%) were benign and 291 (21%) were malignant (Fig.Ā 1, Table 1). 397 nodules (28.7%) underwent surgery, 10 nodules (0.7%) were diagnosed by core needle biopsy and the last 977 (70.6%) nodules were diagnosed by cytologic findings from FNA. Among the 397 nodules which underwent surgery, 264 (66.5%, 264/397) were diagnosed as malignant and 133 (33.5%, 133/397) as benign. The malignant nodules were comprised of 234 papillary thyroid carcinomas (197 conventional, 33 follicular, 2 solid, 1 columnar and 1 oncocytic variant), 21 minimally invasive follicular carcinomas, 5 medullary carcinomas, 3 anaplastic carcinomas and 1 metastatic nasopharyngeal carcinoma. The most frequently excised benign nodules were follicular adenoma (nā=ā70) followed by adenomatous hyperplasia (nā=ā59), Hurthle cell adenoma (nā=ā3), and fibrotic nodule (nā=ā1). Demographics and US features of the patients and nodules are summarized in Table 1. The mean age (mean 51.1āĀ±ā13.4; range, 18ā90) was significantly higher in patients with benign nodules than patients with malignant nodules (mean 47āĀ±ā13.7Ā years; range, 18ā85Ā years) (Pā<ā0.001). Malignant thyroid nodules were significantly smaller than benign nodules (mean diameter 20.3āĀ±ā12.9Ā mm and 24āĀ±ā12.3Ā mm, respectively) (Pā<ā0.001). The malignant thyroid nodules had significantly higher rates of solid composition, hypoechogenicity or marked hypoechogenicity, microlobulated or irregular margins, microcalcifications or mixed calcifications, and nonparallel shape than benign nodules (Pā<ā0.001 for all).
Malignancy rates according to categories in the risk stratification systems
Each risk stratification system had significantly different malignancy rates according to categories (Table 2, Pā<ā0.001 for all). Most of the categorized lesions according to ACR and EU TIRADS were all in the range of the recommended risks of malignancy except for the not suspicious lesions (category 2) of ACR TIRADS and low risk (category 3) lesions of EU TIRADS. All categories except nodules of intermediate suspicion (category 4) in the ATA guideline were outside the recommended range.
Diagnostic performances of the guidelines
Among the original guidelines we evaluated, the ACR TIRADS had highest specificity, accuracy, LR and AUC (62.2%, 66%, 2.128 and 0.713, respectively) (Pā<ā0.001 for all, Tables 3 and 4, Figs.Ā 2 and 3) followed by Kwak guideline (35%, 47.5%, 1.458 and 0.649, respectively), EU guideline (28.1%, 42.2%, 1.324 and 0.616, respectively) and ATA guideline (19.9%, 36.4%, 1.231 and 0.592, respectively). Sensitivity was the highest with the ATA guideline (98.6%) and the lowest with the ACR guideline (80.4%, Pā=ā0.011 comparing ATA and Kwak, Pā=ā0.001 comparing the ATA and EU guidelines, Pā<ā0.001 for the other guidelines).
When the size threshold of ACR TIRADS was applied to the original TIRADS, the diagnostic ability increased in terms of specificity, accuracy, LR and AUC for all guidelines (Tables 3 and 4, Figs.Ā 2 and 3). The modified Kwak (mKwak) guideline had a specificity of 64%, accuracy of 68.6%, LR of 2.389 and AUC of 0.75 while the Kwak guideline had a specificity of 35%, accuracy of 47.5%, LR of 1.458 and AUC of 0.649 (Pā<ā0.001 for all). The modified ATA (mATA) guideline had a specificity of 57.2%, accuracy of 63.2%, LR of 1.998 and AUC of 0.714, while the original ATA guideline had a specificity of 19.9%, accuracy of 36.4%, LR of 1.231 and AUC of 0.592 (Pā<ā0.001 for all). The modified EU (mEU) guideline had a specificity of 40.1%, accuracy of 51.4%, LR of 1.565 and AUC of 0.669, while the EU guideline had a specificity of 28.1%, accuracy of 42.2%, LR of 1.324 and AUC of 0.616 (Pā<ā0.001 for all). However, the sensitivities of the modified guidelines were lower than their original versions. The sensitivity of the original guidelines was 94.8%, 98.6%, 95.2% for the Kwak, ATA and EU guidelines, respectively, while the modified versions showed a sensitivity of 85.9%, 85.6% and 93.8% for the mKwak, mATA and mEU guidelines, respectively. Among all the original and modified guidelines, the mKwak guideline had the highest specificity, accuracy, LR and AUC (64%, 68.6%, 2.389 and 0.75, respectively) (Pā=ā0.014 comparing the specificity of with ACR and Pā<ā0.001 for the others).
The unnecessary FNA rate was the lowest with the mKwak guideline (61.1%, 393/643) followed by the ACR (63.8%, 413/647), mATA (65.3%, 468/717), mEU (70.6%, 655/928), Kwak (72%, 711/987), EU (73.9%, 786/1,063) and ATA guidelines (75.3%, 876/1,163) (Table 5, Fig.Ā 3). In all modified guidelines, the unnecessary FNA rate decreased comparing to the original guidelines when the size threshold of the ACR TIRADS was applied.
Discussion
Currently, many guidelines composed of various TIRADS and size thresholds exist for further work-up such as FNA or follow-up US3,4,7,11. However, there has been no proven universal guideline proposed to reduce unnecessary FNAs and to find as many thyroid cancers as possible. It has also been difficult to compare the risk stratification systems themselves as each uses a different size threshold to recommend FNA although many studies have compared the diagnostic performances and unnecessary FNA rates of these guidelines12,20,21,22,23,24,25. To overcome this problem, we applied the size threshold of the ACR guideline to the Kwak, ATA and EU guidelines by matching the recommended malignancy rates. After applying the ACR TIRADS size threshold in the modified guidelines, diagnostic ability increased in terms of specificity, accuracy, LR and AUC compared with the original guidelines and the unnecessary FNA rates were also lower. The mKwak guideline which incorporated the ACR size threshold showed the best diagnostic results among the original and modified guidelines in terms of specificity, accuracy, LR and AUC.
Recently, many researchers demonstrated that the ACR TIRADS had superior diagnostic performance compared to other guidelines and reduced larger number of unnecessary FNAs (compared with guidelines from ATA, EU, American Association of Clinical Endocrinologists/American College of Endocrinology/Associazione Medici Endocrinologi, National Comprehensive Cancer Network, French Society of Endocrinology, Society of Radiology in Ultrasound and Korean Thyroid Association/Korean Society of Thyroid)12,21,22,23,25. Considering that the ACR incorporates a larger size threshold for FNA despite using similar recommended malignancy risks, the better diagnostic ability of the ACR guidelines can be explained by the size criteria for FNA and not the complicated US risk stratification system itself26. In this study, the ACR guideline showed better diagnostic accuracy than the original Kwak guideline which uses a 10Ā mm size threshold to recommend US-guided FNA (US-FNA) regardless of the number of suspicious US features. However, the mKwak guideline showed higher diagnostic accuracy than the original ACR guideline after the size threshold of the ACR guideline was applied. When US risk stratification systems are compared between the ACR and Kwak guidelines, the Kwak guideline is more straightforward and practical to use than the ACR guideline which uses a different point system for individual US features as they are assigned different weights3,8. Therefore, a combination of the easier US risk stratification system of the Kwak guideline and the size threshold of the ACR guideline can help clinicians in daily practice.
Increasing the size threshold of US-FNA resulted in decreasing the unnecessary FNA rate in all the guidelines we evaluated, which was the trade-off for lower sensitivity. In our study, the unnecessary FNA rate decreased more than sensitivity did for both the Kwak and EU guidelines. Size modification reduced the unnecessary FNA rate of the Kwak and EU guidelines by 10.9% and 3.3%, respectively while reducing sensitivity by 8.9% and 1.4%, respectively. When the ATA and mATA guidelines were compared, sensitivity decreased by 13% and the unnecessary FNA rate decreased by 10% with the mATA guidelines. As the only difference between the modified and original guidelines was size criteria, we can assume that the size threshold proposed by the ACR guideline increased diagnostic accuracy and reduced the unnecessary FNA rates. In one recent study, diagnostic performance and the unnecessary biopsy rate were evaluated with simulations using various nodule size cutoffs applied to the ATA and Korean Thyroid Association/Korean Society of Thyroid Radiology guidelines (KTA/KSThR)22. Among the various simulations, the 15Ā mm cutoff for intermediate suspicion, 25Ā mm cutoff for low suspicion and eliminating FNA for nodules of very low suspicion in the ATA guideline showed the highest specificity, accuracy and the lowest unnecessary biopsy rate22. These results suggest that the high specificity and low unnecessary FNA rate of the ACR guideline was due to the larger size cutoff which is in line with our study results22.
There are several limitations to this study. First, 1,244 of the 1,384 thyroid nodules (89.9%) were diagnosed based on cytologic findings alone, which could have resulted in some missed malignancies. We only included the nodules with definitive diagnostic cytopathologic findings (benign or malignant) at US-FNA, core needle biopsy, or surgery. Also, 5.2% (21/396) of the follicular carcinomas were diagnosed after surgery. Thus, a selection bias exists. Second, an experienced radiologist retrospectively re-assigned categories to thyroid nodules according to different risk stratification systems using US features prospectively recorded by 14 radiologists who were familiar with point-scale risk stratification. When US descriptors were recorded in this study, they could not be defined with the exact same definitions used in the other original guidelines, an issue which was not considered during data analysis, and this might have led to differences in the final assessments made in real-time examinations. Reassigning categories previously assigned according to the point-scale system to categories based on the pattern-recognition system might have also affected the results of this study. Third, the 14 radiologists performing the prospective imaging acquisition and analysis had variable levels of experience. Although interobserver variability and consistency are important considerations for choosing appropriate guidelines27,28, our study is reflective of actual clinical practice. Forth, the relatively high malignancy rate of thyroid nodules in our study is probably because we only included thyroid nodules which underwent FNA, which would naturally lead to a higher number of malignant nodules. Also, our institution is a tertiary referral center and that itself is a reason for the high malignancy rate of the study population.
In conclusion, application of the larger US-FNA size threshold of the ACR guideline resulted in increased diagnostic accuracy and decreased unnecessary FNA rates at the expense of decreased sensitivity. The mKwak guideline which is practical and easy to use showed superior diagnostic accuracy than the other guidelines, both original and modified. Further longitudinal multicenter studies with larger data are needed in the future to choose an accurate and effective risk stratification system for daily practice.
Methods
The institutional review board (IRB) of the Yonsei University College of Medicine approved this retrospective study and the requirement for informed consent for review of images and medical records was waived. And all methods were performed in accordance with the Declaration of Helsinki.
Study cohort
This study was performed from December 2015 to November 2016, during which 2,179 patients underwent US-FNA to diagnose thyroid nodules at our institution, a tertiary referral center. Among them, a total of 1704 thyroid nodules in 1602 patients were 10Ā mm or larger on US. 320 nodules were excluded because of a lack of definitive cytopathologic results after being initially diagnosed as nondiagnostic (nā=ā176), atypia or follicular lesion of undetermined significance (nā=ā110), follicular neoplasm or suspicion of follicular neoplasm (nā=ā27), or suspicion of malignancy (nā=ā7). Nodules were included if they had definitive diagnostic cytopathologic findings (benign or malignant) at US-FNA, core needle biopsy, or surgery. Finally, 1,384 thyroid nodules in 1,301 patients were included (Fig.Ā 1).
Mean age of the 1,301 patients was 50.2āĀ±ā13.6Ā years old (range 18ā90Ā years). Mean size of the 1,384 thyroid nodules was 23.2āĀ±ā12.6Ā mm (range 10-100Ā mm). Of the total patients, 1,062 (81.6%) were women and 239 (18.4%) were men. Of the total patients, 77 had two nodules and three had three nodules.
US examinations
Thyroid US was performed with a 5ā12Ā MHz linear array transducer (iU22; Philips Medical Systems). US examinations were performed by one of 14 board-certified radiologists (5 faculties and 9 fellows) with 1ā20Ā years of experience in thyroid imaging. US-FNAs were subsequently performed by the same radiologist who performed the thyroid US examination.
US features of thyroid nodules which underwent US-FNA were prospectively described and recorded in our institutional database at the time of US-FNA by the radiologist who performed the US and US-FNA according to composition, echogenicity, margin, calcifications, and shape. The composition was classified as solid, predominantly solid, predominantly cyst, spongiform nodule and cyst, the echogenicity was classified as hyperechogenicity, isoechogenicity, hypoechogenicity and marked hypoechogenicity, the margin was classified as well-defined, microlobulated and irregular margin, the calcification was classified as negative, egg-shell calcification, macrocalcification, microcalcification and mixed calcification. And the shape was classified as parallel and non-parallel. At our institution, US findings of solid composition, hypoechogenicity or marked hypoechogenicity, microlobulated or irregular margins, microcalcifications, and nonparallel shape were considered to be suspicious features for malignancy29.
Data and statistical analysis
Cytopathology results from FNA and surgery were considered as the standard reference. One radiologist (J.Y.K) with 17Ā years of experience in thyroid imaging, blind to the patientsā clinical data and pathological results, retrospectively re-assigned the TIRADS categories of each thyroid nodule using our institutional database which was made up of data collected by the radiologists who performed the US-FNAs. Ninety thyroid nodules (6.5%, 90/1,384) unspecified according to the ATA guideline including isoechoic or hyperechoic nodules with suspicious US features7 were regarded as intermediate suspicion as the calculated malignancy rates of these nodules were within the range of 10ā20%30.
Indications for FNA were based on US features and lesion size according to the various guidelines we used in this study3,7,11. A size threshold of 10Ā mm was used to indicate US-FNA in all thyroid nodules with suspicious US features in the Kwak TIRADS because the Kwak TIRADS recommends US-FNA when thyroid nodules more than 10Ā mm in size have suspicious US features rather than applying different size thresholds according to the final assessment category8,29. We applied the size criteria of the ACR TIRADS to the Kwak, ATA and EU guidelines according to similar recommended malignancy risk of each category3,7,8,11, and defined the new guidelines as the mKwak, mATA and mEU guidelines, respectively (Supplementary Table S1 online). The ACR TIRADS recommends no FNA for not suspicious thyroid nodules with recommended risk of malignancy of 2%3. The same strategy was applied for very low suspicion category of ATA guideline with recommended risk of malignancy of less than 3%7. For mildly suspicious thyroid nodules with a recommended malignancy risk of 5% in the ACR TIRADS, FNA was recommended when the nodule was 25Ā mm or larger3. The same size threshold was applied for nodules of low risk according to the EU guideline rather than the present size threshold of 20Ā mm because the recommended risks of malignancy was 2ā4%11. The recommended malignancy risk was 5ā20% for moderately suspicious nodules in the ACR TIRADS and FNA was recommended when the nodule was 15Ā mm or larger3. A size threshold of 15Ā mm was applied instead of 10Ā mm for nodules of intermediate suspicion according to the ATA guideline with a recommended malignancy risk of 10ā20%7. We also used a size threshold proposed by the ACR TIRADS to the Kwak guideline3,8: 25Ā mm size threshold for category 4a, 15Ā mm for category 4b and 10Ā mm for category 4c and 5. As the spongiform nodule and isolated macrocalcifications have no suspicious US feature according to Kwak TIRADS, they are considered as category 38.
Thyroid nodules were classified as nodules for which US-FNA was indicated and those for which it was not, according to the FNA criteria provided by each guideline3,7,8,11.
To compare the demographics between benign and malignant nodules, the independent two sample t-test was used to compare continuous data including patient age and the Chi-square test was used to compare categorical data including patient sex. Since some patients had more than one nodule, the generalized estimated equation (GEE) was used to compare both continuous and categorical data between benign and malignant nodules. Malignancy rates according to the final assessment by each system were calculated and compared with GEE. We also evaluated diagnostic performances including sensitivity, specificity, accuracy, negative predictive value (NPV), positive predictive value (PPV), likelihood ratio (LR) and area under the receiver operating characteristic curve (AUC) along with 95% confidence intervals (CI). The sensitivity, specificity, accuracy, NPV, PPV and LR were compared with GEE. The Delong method was used to compare AUC. The unnecessary biopsy rate for the diagnosis of thyroid cancer was defined as the number of benign nodules among the biopsy-required nodules. Statistical analysis was performed with SAS software (version 9.4, SAS Inc.). A two-sided Pā<ā0.05 was considered to indicate statistical significance.
References
Vaccarella, S. et al. Worldwide thyroid-cancer epidemic? The increasing impact of overdiagnosis. N. Engl. J. Med. 375, 614ā617 (2016).
Guth, S., Theune, U., Aberle, J., Galach, A. & Bamberger, C. Very high prevalence of thyroid nodules detected by high frequency (13 MHz) ultrasound examination. Eur. J. Clin. Invest. 39, 699ā706 (2009).
Tessler, F. N. et al. ACR thyroid imaging, reporting and data system (TI-RADS): white paper of the ACR TI-RADS committee. J. Am. College Radiol. 14, 587ā595 (2017).
Gharib, H. et al. American association of clinical endocrinologists, American College of endocrinology, and associazione Medici Endocrinologi medical guidelines for clinical practice for the diagnosis and management of thyroid nodulesā2016 update. Endocrine Pract. 22, 1ā60 (2016).
Frates, M. C. et al. Management of thyroid nodules detected at US: Society of radiologists in ultrasound consensus conference statement. Radiology 237, 794ā800 (2005).
Network, N. C. C. NCCN clinical practice guidelines in oncology. Thyroid carcinoma V. 2 2017. National Comprehensive Cancer Network website. https://www.nccn.org/professionals/physician_gls/ (2017).
Haugen, B. R. et al. 2015 American Thyroid Association Management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: The American Thyroid Association Guidelines task force on thyroid nodules and differentiated thyroid cancer. Thyroid 26, 1ā133 (2016).
Kwak, J. Y. et al. Thyroid imaging reporting and data system for US features of nodules: a step in establishing better stratification of cancer risk. Radiology 260, 892ā899 (2011).
Park, J.-Y. et al. A proposal for a thyroid imaging reporting and data system for ultrasound features of thyroid carcinoma. Thyroid 19, 1257ā1264 (2009).
Horvath, E. et al. An ultrasonogram reporting system for thyroid nodules stratifying cancer risk for clinical management. J. Clin. Endocrinol. Metab. 94, 1748ā1751 (2009).
Russ, G. et al. European thyroid Association guidelines for ultrasound malignancy risk stratification of thyroid nodules in adults: The EU-TIRADS. Eur. Thyroid J. 6, 225ā237 (2017).
Grani, G. et al. Reducing the number of unnecessary thyroid biopsies while improving diagnostic accuracy: toward the ārightā TIRADS. J. Clin. Endocrinol. Metab. 104, 95ā102 (2018).
Wang, Y. et al. Malignancy risk stratification of thyroid nodules: comparisons of four ultrasound Thyroid imaging reporting and data systems in surgically resected nodules. Sci. Rep. 7, 11560 (2017).
Bartosz Migda, M. M. et al. Evaluation of four variants of the thyroid imaging reporting and data system (TIRADS) classification in patients with multinodular goitreāinitial study. Endokrynologia Polska 69, 156ā162 (2018).
Migda, B., Migda, M., Migda, M. S. & Slapa, R. Z. Use of the Kwak thyroid image reporting and data system (K-TIRADS) in differential diagnosis of thyroid nodules: Systematic review and meta-analysis. Eur. Radiol. 28, 2380ā2388 (2018).
Chandramohan, A. et al. Is TIRADS a practical and accurate system for use in daily clinical practice?. Indian J Radiol Imaging 26, 145 (2016).
Srinivas, M. N. S. et al. A prospective study to evaluate the reliability of thyroid imaging reporting and data system in differentiation between benign and malignant thyroid lesions. J. Clin. Imaging Sci. 6, 5ā5 (2016).
Schenke, S. & Zimny, M. Combination of Sonoelastography and TIRADS for the diagnostic assessment of thyroid nodules. Ultrasound Med. Biol. 44, 575ā583 (2018).
Gao, L. et al. Comparison among TIRADS (ACR TI-RADS and KWAK-TI-RADS) and 2015 ATA guidelines in the diagnostic efficiency of thyroid nodules. Endocrine 64, 90ā96 (2019).
Li, J., Li, H., Yang, Y., Zhang, X. & Qian, L. The KWAK TI-RADS and 2015 ATA guidelines for medullary thyroid carcinoma: Combined with cell block-assisted ultrasound-guided thyroid fine-needle aspiration. Clin. Endocrinol. 00, 1ā11 (2019).
Ruan, J.-L. et al. Fine needle aspiration biopsy indications for thyroid nodules: Compare a point-based risk stratification system with a pattern-based risk stratification system. Eur. Radiol. 29, 4871ā4878 (2019).
Ha, S. M. et al. Diagnostic performance of practice guidelines for thyroid nodules: Thyroid nodule size versus biopsy rates. Radiology 291, 92ā99 (2019).
Ha, E. J. et al. US fine-needle aspiration biopsy for thyroid malignancy: diagnostic performance of seven society guidelines applied to 2000 thyroid nodules. Radiology 287, 893ā900 (2018).
Yoon, J. H., Lee, H. S., Kim, E.-K., Moon, H. J. & Kwak, J. Y. J. R. Malignancy risk stratification of thyroid nodules: Comparison between the thyroid imaging reporting and data system and the 2014 American thyroid association management guidelines. Natl Lab Med 278, 917ā924 (2015).
Middleton, W. D. et al. Comparison of performance characteristics of american college of radiology TI-RADS, Korean Society of thyroid radiology TIRADS, and American Thyroid Association guidelines. Am. J. Roentgenol. 210, 1148ā1154 (2018).
Fradin, J. M. ACR TI-RADS: an advance in the management of thyroid nodules or Pandoraās box of surveillance?. J. Clin. Ultrasound 48, 3ā6 (2020).
Grani, G. et al. Interobserver agreement of various thyroid imaging reporting and data systems. Endocrine Connect. 7, 1ā7 (2018).
Grani, G. et al. Sonographically estimated risks of malignancy for thyroid nodules computed with five standard classification systems: changes over time and their relation to malignancy. Thyroid 28, 1190ā1197 (2018).
Kim, E.-K. et al. New sonographic criteria for recommending fine-needle aspiration biopsy of nonpalpable solid nodules of the thyroid. Am. J. Roentgenol. 178, 687ā691 (2002).
Yoon, J. H., Lee, H. S., Kim, E.-K., Moon, H. J. & Kwak, J. Y. Malignancy risk stratification of thyroid nodules: Comparison between the thyroid imaging reporting and data system and the 2014 American Thyroid Association management guidelines. Radiology 278, 917ā924 (2015).
Author information
Authors and Affiliations
Contributions
S.H. and J.Y.K. designed the study. S.H. wrote the manuscript. All authors (S.H., H.S.L., J.Y., E.-K.K., H.J.M., J.H.Y., V.Y.P. and J.Y.K.) contributed to the discussions and revisions of the manuscript. H.S.L. carried out the statistical calculations.
Corresponding author
Ethics declarations
Competing interests
The authors declare no competing interests.
Additional information
Publisher's note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Supplementary information
Rights and permissions
Open Access This article is licensed under a Creative Commons Attribution 4.0 International License, which permits use, sharing, adaptation, distribution and reproduction in any medium or format, as long as you give appropriate credit to the original author(s) and the source, provide a link to the Creative Commons license, and indicate if changes were made. The images or other third party material in this article are included in the articleās Creative Commons license, unless indicated otherwise in a credit line to the material. If material is not included in the articleās Creative Commons license and your intended use is not permitted by statutory regulation or exceeds the permitted use, you will need to obtain permission directly from the copyright holder. To view a copy of this license, visit http://creativecommons.org/licenses/by/4.0/.
About this article
Cite this article
Huh, S., Lee, H.S., Yoon, J. et al. Diagnostic performances and unnecessary US-FNA rates of various TIRADS after application of equal size thresholds. Sci Rep 10, 10632 (2020). https://doi.org/10.1038/s41598-020-67543-z
Received:
Accepted:
Published:
DOI: https://doi.org/10.1038/s41598-020-67543-z
Comments
By submitting a comment you agree to abide by our Terms and Community Guidelines. If you find something abusive or that does not comply with our terms or guidelines please flag it as inappropriate.