Evaluation of thyroid ultrasound reports’ quality in the Community of Madrid, Spain

in European Thyroid Journal
Authors:
Pablo Valderrabano Department of Endocrinology and Nutrition, Hospital Universitario Ramón y Cajal, IRYCIS, Madrid, Spain

Search for other papers by Pablo Valderrabano in
Current site
Google Scholar
PubMed
Close
https://orcid.org/0000-0001-8379-2709
,
Jhonatan Boris Quiñones Silva Department of Endocrinology and Nutrition, Hospital Universitario Ramón y Cajal, IRYCIS, Madrid, Spain

Search for other papers by Jhonatan Boris Quiñones Silva in
Current site
Google Scholar
PubMed
Close
https://orcid.org/0000-0001-6061-9751
,
Sandra Campos Mena Department of Endocrinology and Nutrition, Hospital Universitario Ramón y Cajal, IRYCIS, Madrid, Spain

Search for other papers by Sandra Campos Mena in
Current site
Google Scholar
PubMed
Close
,
Cristina Familiar Casado Department of Endocrinology and Nutrition, Hospital Clínico San Carlos, Madrid, Spain

Search for other papers by Cristina Familiar Casado in
Current site
Google Scholar
PubMed
Close
,
María Paz de Miguel Novoa Department of Endocrinology and Nutrition, Hospital Clínico San Carlos, Madrid, Spain

Search for other papers by María Paz de Miguel Novoa in
Current site
Google Scholar
PubMed
Close
,
María Concepción Sanabria Pérez Department of Endocrinology and Nutrition, Hospital Clínico San Carlos, Madrid, Spain

Search for other papers by María Concepción Sanabria Pérez in
Current site
Google Scholar
PubMed
Close
,
Elisa Fernández Fernández Department of Endocrinology and Nutrition, Hospital General Universitario Gregorio Marañón, Madrid, Spain

Search for other papers by Elisa Fernández Fernández in
Current site
Google Scholar
PubMed
Close
,
Aurelio López Guerra Department of Endocrinology and Nutrition, Hospital General Universitario Gregorio Marañón, Madrid, Spain

Search for other papers by Aurelio López Guerra in
Current site
Google Scholar
PubMed
Close
,
Marcel Sambo Department of Endocrinology and Nutrition, Hospital General Universitario Gregorio Marañón, Madrid, Spain

Search for other papers by Marcel Sambo in
Current site
Google Scholar
PubMed
Close
https://orcid.org/0000-0003-4487-6525
,
Patricia Martín Rojas-Marcos Department of Endocrinology and Nutrition, Hospital Universitario La Paz, Madrid, Spain

Search for other papers by Patricia Martín Rojas-Marcos in
Current site
Google Scholar
PubMed
Close
,
Paola Parra Ramírez Department of Endocrinology and Nutrition, Hospital Universitario La Paz, Madrid, Spain

Search for other papers by Paola Parra Ramírez in
Current site
Google Scholar
PubMed
Close
,
Clara Tasende Fernández Department of Endocrinology and Nutrition, Hospital Universitario Príncipe de Asturias, Madrid, Spain

Search for other papers by Clara Tasende Fernández in
Current site
Google Scholar
PubMed
Close
,
María Jesús Rodríguez Troyano Department of Endocrinology and Nutrition, Hospital Universitario Príncipe de Asturias, Madrid, Spain

Search for other papers by María Jesús Rodríguez Troyano in
Current site
Google Scholar
PubMed
Close
,
Victoria Alcázar Lázaro Department of Endocrinology and Nutrition, Hospital Universitario Severo Ochoa, Madrid, Spain

Search for other papers by Victoria Alcázar Lázaro in
Current site
Google Scholar
PubMed
Close
,
Marcos Lahera Vargas Department of Endocrinology and Nutrition, Hospital Universitario La Princesa, Madrid, Spain

Search for other papers by Marcos Lahera Vargas in
Current site
Google Scholar
PubMed
Close
, and
Nuria Palacios García Department of Endocrinology and Nutrition, Hospital Universitario Puerta de Hierro Majadahonda, Madrid, Spain

Search for other papers by Nuria Palacios García in
Current site
Google Scholar
PubMed
Close

Correspondence should be addressed to P Valderrabano: pablo.valderrabano@salud.madrid.org
Open access

Sign up for journal news

Objective

To evaluate the current quality of thyroid ultrasound reports in the Community of Madrid.

Methods

Consecutive thyroid ultrasound reports from patients evaluated in the endocrine outpatient clinics of eight academic hospitals in the Community of Madrid were assessed for quality during 2021 and 2022. Descriptions of eight different features were evaluated: number and axes of dimensions, composition, echogenicity, margins, shape, calcifications and category of suspicion. Features were considered adequately reported if described for all nodules ≥1 cm. The number of correctly reported features was compared by year of data capture (2021 vs 2022), specialty of the informant (radiologist vs endocrinologist), and origin of the report (in-house vs outsourced center). The quality of reports for assessing the need for cytological evaluation and/or growth during follow-up was evaluated.

Results

A total of 1234 reports were included, 63% from 2021; 82% were issued by radiologists and 89% were issued in-house. Composition and echogenicity were the most frequently reported (79% and 72%, respectively). The rest of the features were appropriately described in less than half of the reports. Forty percent of the reports were good to select nodules for biopsy, 23% had sufficient data to assess growth during follow-up, and only 13% met both quality criteria. The overall quality of reports was worse in outsourced centers (median number of described features 2 vs 4, P < 0.001) and better when issued by endocrinologists (median number of described features 6 vs 3, P < 0.001).

Conclusions

Most thyroid ultrasound reports issued in the Community of Madrid provide insufficient data to make management decisions regarding thyroid nodules.

Abstract

Objective

To evaluate the current quality of thyroid ultrasound reports in the Community of Madrid.

Methods

Consecutive thyroid ultrasound reports from patients evaluated in the endocrine outpatient clinics of eight academic hospitals in the Community of Madrid were assessed for quality during 2021 and 2022. Descriptions of eight different features were evaluated: number and axes of dimensions, composition, echogenicity, margins, shape, calcifications and category of suspicion. Features were considered adequately reported if described for all nodules ≥1 cm. The number of correctly reported features was compared by year of data capture (2021 vs 2022), specialty of the informant (radiologist vs endocrinologist), and origin of the report (in-house vs outsourced center). The quality of reports for assessing the need for cytological evaluation and/or growth during follow-up was evaluated.

Results

A total of 1234 reports were included, 63% from 2021; 82% were issued by radiologists and 89% were issued in-house. Composition and echogenicity were the most frequently reported (79% and 72%, respectively). The rest of the features were appropriately described in less than half of the reports. Forty percent of the reports were good to select nodules for biopsy, 23% had sufficient data to assess growth during follow-up, and only 13% met both quality criteria. The overall quality of reports was worse in outsourced centers (median number of described features 2 vs 4, P < 0.001) and better when issued by endocrinologists (median number of described features 6 vs 3, P < 0.001).

Conclusions

Most thyroid ultrasound reports issued in the Community of Madrid provide insufficient data to make management decisions regarding thyroid nodules.

Introduction

Up to 5% of the general population has palpable thyroid nodules (1). However, the prevalence of thyroid nodules is significantly higher when evaluated with ultrasound (1, 2). The increasing use of imaging techniques is uncovering a large reservoir of asymptomatic thyroid nodules that has acquired an epidemic magnitude (3, 4, 5, 6, 7). Consequently, thyroid nodules constitute nowadays one of the most frequent reasons for referral to endocrine outpatient clinics.

Although most thyroid nodules are benign, most thyroid cancers present as thyroid nodules (8). Therefore, an adequate differential diagnosis is essential. Ultrasound characterization is instrumental to select which nodules require fine-needle aspiration biopsy (9). Several classification systems have been developed in recent years to homogenize ultrasound characterization. These systems classify nodules into different categories that are associated with an estimated probability of malignancy and with a recommended threshold for biopsy (9, 10, 11). All of them consider the composition, echogenicity, shape, presence of calcifications and margins of the nodule. Therefore, knowing all these characteristics is necessary to determine the need to perform a cytological evaluation. When faced with incomplete ultrasound reports, the treating physician must decide whether to repeat the ultrasound, often prematurely hoping to obtain the missing information in the following report, or to request a fine-needle aspiration biopsy without knowing if it is really indicated (12, 13, 14). Both scenarios result in health care system inefficiency.

Furthermore, knowing the maximum dimensions of the nodule in the anteroposterior, transverse and craniocaudal axes are necessary to follow up thyroid nodules correctly. The absence of this information can mistakenly lead to thinking that a nodule is growing significantly when in fact it is stable, for example if a single different dimension is given in two successive reports. This situation may trigger unnecessary invasive procedures, including surgery.

The objective of this study was to evaluate the current quality of thyroid ultrasound reports in the Community of Madrid to identify areas for improvement to be addressed by the Regional Endocrine Society (Society of Endocrinology, Nutrition and Diabetes of the Community of Madrid – SENDIMAD)’s thyroid cancer working group.

Methods

This transversal study was approved by the ethics committee of the Ramón y Cajal University Hospital (#178-21). Study data were collected and managed using REDCap electronic data capture tools hosted at Ramón y Cajal Health Research Institute (15, 16). REDCap (Research Electronic Data Capture) is a secure, web-based software platform designed to support data capture for research studies, providing i) an intuitive interface for validated data capture, ii) audit trails for tracking data manipulation and export procedures, iii) automated export procedures for seamless data downloads to common statistical packages, and iv) procedures for data integration and interoperability with external sources. For this study, no personal identifiers or demographic data were collected. Two different forms with predefined fields were created in the database: one for reports from 2021 and another one for reports from 2022. In both forms, the specialty of the physician issuing the report, the origin of the report (own center or outsourced test) and the hospital where the report had been evaluated were registered. The presence or absence of information on eight descriptive variables for nodules 1 cm or larger was collected: number of dimensions reported (0, 1, 2 or 3); identification of corresponding axes for the given measurements; composition; echogenicity; margins; shape; calcifications and category of sonographic suspicion. The specific ultrasound classification system used in the report was also registered. Each of the variables was categorized as consistently reported (for all nodules 1 cm or larger); reported sometimes (in some but not all nodules 1 cm or larger) or absent (not described for any of the nodules). To calculate the average of variables correctly described in each report, those described ‘sometimes’ (not reported in all nodules >1 cm) were considered as incorrectly reported. The number of dimensions was considered correctly reported if all three dimensions were given. The shape of the nodule was considered as correctly described if it was either specified in the report or it could be derived from the dimensions and axes detailed in the report. The quality of the report was considered good for assessing the need for cytological evaluation if either size in one dimension (assuming the measurement of largest dimension was provided) plus category of suspicion or size in one dimension plus composition, echogenicity, margins, shape and calcifications were described. The quality of the report was considered good to assess growth during follow-up if three dimensions with corresponding axes were given.

Analysis of the thyroid ultrasound reports’ quality in 2021

The quality of thyroid ultrasound reports was evaluated by 14 investigators from eight different hospitals (six of eight third-level hospitals and two of 12 second-level hospitals) that provide care for 46.5% population of the Community of Madrid (3,189,125 of 6,863,539 inhabitants) (https://gestiona3.madrid.org/bvirtual/BVCM050988.pdf; Accessed 11/07/2024) . Each investigator retrieved at least the first 50 consecutive thyroid ultrasound reports with at least one thyroid nodule 1 cm or larger previously evaluated in their outpatient clinic during 2021. To do that, participants retrieved clinic appointments and reviewed the medical records of all patients evaluated in their outpatient clinics starting on January 1st, 2021, until at least 50 ultrasound reports consecutively evaluated were retrieved.

Analysis of the thyroid ultrasound reports’ quality in 2022

The quality of thyroid ultrasound reports was evaluated by 10 investigators from six different hospitals that provide care to 32.8% population of the Community of Madrid (2,253,270 of 6,863,539 inhabitants) (https://gestiona3.madrid.org/bvirtual/BVCM050988.pdf; Accessed 11/07/2024). Each investigator collected at least the first 50 consecutive thyroid ultrasound reports with at least one thyroid nodule 1 cm or larger that was being evaluated in their outpatient clinic during 2022. Reports issued by the investigators in their thyroid nodule clinics were excluded to avoid information bias.

Statistical analysis

All reports evaluated were included in the analysis. There was one report with missing data for the evaluation of composition, no other missing data were found. The frequency of reports contributed by each participating institution, the year of data collection (2021 or 2022), the specialty of the informant (physician issuing the evaluated reports), the origin of the report (in-house vs outsourced) and each of the descriptive variables comprising the quality criteria were collected. The proportion of each descriptive variable and the classification system used (for those with available data) was compared by year of data collection, origin of the report, and specialty of the informant using the Chi-squared test or the Fisher’s exact test if the expected values were lower than five in at least one cell of the contingency table. The association between axes description and the number of dimensions reported (1, 2, or 3) was assessed using logistic regression to determine the trend across dimensions. We also determined the sum of the number of nodule features comprising the quality of the report contained in each form and compared whether it was different according to the year of data collection, origin of the report, and specialty of the informant using the Mann–Whitney test, as it was a discrete variable with just nine possible values (from zero to eight). Differences in the proportion of reports considered good for assessing the need for cytological evaluation and/or for assessing growth during follow-up were analyzed by type of data collection, the origin of the report, and the specialty of the informant using the Chi-squared test. A significance level of 0.05 was chosen for all analyses. Statistical analyses were done with the R software (https://www.r-project.org) environment for statistical computing (17).

Results

A total of 1234 reports were included in the analyses. The distribution of such reports by institution and year of data capture (2021 vs 2022) is shown in Fig. 1. Most reports (774, 63%) evaluated were from 2021, and two participating institutions did not contribute to reports evaluated in 2022. Reports were issued by radiologists (n = 1015, 82%), endocrinologists (n = 207, 17%) or other specialists (n = 12, 1%; eight by nuclear medicine and four by primary care physicians). Most reports were issued by in-house specialists (n = 1097, 89%), whereas the rest were issued by specialists from outsourced centers (n = 137, 11%).

Figure 1
Figure 1

Distribution of reports according to institution and year of data capture.

Citation: European Thyroid Journal 14, 1; 10.1530/ETJ-24-0390

The overall quality of the reports is described in Table 1. Only 43.6% reports provided the three dimensions, and 32% identified the corresponding axes of the dimensions given. The frequency of axes reporting was associated with the number of dimensions provided (ORtrend: 2.8; 95% CI: 2.3–3.3; P < 0.001). Nodule composition and echogenicity were the features most frequently reported (79% and 72%, respectively). The rest of the evaluated features were appropriately described in less than half of the reports: margins in 40%, calcifications in 28% and shape in 32%. Only 471 (38%) reports categorized the sonographic suspicion of the nodules, and among them, the classification system most frequently used was ACR-TIRADS (63%), followed by EU-TIRADS (12%) and ATA (American Thyroid Association) (8%).

Table 1

Overall report quality and comparisons by year of data capture, by origin of the report, and by specialty of the informant. Data are presented as n (%).

Characteristics Overall 2021 2022 P In-house Outsourced P Radiologist Endocrinologist P
# dimensions 0.005 <0.001* <0.001*
 None 27 (2.2) 14 (1.8) 13 (2.8) 27 (2.5) 0 (0.0) 25 (2.5) 2 (1.0)
 One 390 (31.6) 228 (29.5) 162 (35.2) 340 (31) 50 (36.5) 384 (37.8) 4 (1.9)
 Two 279 (22.6) 165 (21.3) 114 (24.8) 229 (20.9) 50 (36.5) 274 (27.0) 3 (1.4)
 Three 538 (43.6) 367 (47.4) 171 (37.2) 501 (45.7) 37 (27.0) 332 (32.7) 198 (95.7)
Axes 0.8 <0.001 <0.001
 No 841 (68.2) 525 (67.8) 316 (68.7) 727 (66.3) 114 (83.2) 756 (74.5) 79 (38.2)
 Yes 393 (31.8) 249 (32.2) 144 (31.3) 370 (33.7) 23 (16.8) 259 (25.5) 128 (61.8)
Shape 0.08 <0.001* <0.001
 No 800 (64.8) 497 (64.2) 303 (65.9) 689 (62.8) 111 (81.0) 724 (71.3) 70 (33.8)
 Yes 398 (32.3) 260 (33.6) 138 (30.0) 372 (33.9) 26 (19.0) 258 (25.4) 134 (64.7)
 Sometimes 36 (2.9) 17 (2.2) 19 (4.1) 36 (3.3) 0 (0.0) 33 (3.3) 3 (1.5)
Composition 0.6 0.121 <0.001
 No 148 (12.0) 88 (11.3) 60 (13.0) 126 (11.5) 22 (16.1) 143 (14.1) 3 (1.5)
 Yes 980 (79.5) 621 (80.2) 359 (78.0) 872 (79.6) 108 (78.8) 770 (75.9) 201 (97.1)
 Sometimes 105 (8.5) 65 (8.4) 40 (8.7) 98 (8.9) 7 (5.1) 101 (10.0) 3 (1.5)
Echogenicity 0.006 <0.001 <0.001
 No 226 (18.3) 121 (15.6) 105 (22.8) 175 (16) 51 (37.2) 215 (21.2) 9 (4.4)
 Yes 889 (72.0) 573 (74.0) 316 (68.7) 808 (73.7) 81 (59.1) 687 (67.7) 193 (93.2)
 Sometimes 119 (9.7) 80 (10.3) 39 (8.5) 114 (10.4) 5 (3.7) 113 (11.1) 5 (2.4)
Margins 0.006 <0.001 <0.001
 No 677 (54.9) 403 (52.1) 274 (59.6) 579 (52.8) 98 (71.5) 648 (63.8) 18 (8.7)
 Yes 499 (40.4) 339 (43.8) 160 (34.8) 463 (42.1) 36 (26.3) 315 (31.0) 183 (88.4)
 Sometimes 57 (4.7) 32 (4.1) 25 (5.4) 55 (5.0) 3 (2.2) 52 (5.1) 6 (2.9)
Calcifications 0.62 0.001 <0.001
 No 819 (66.3) 521 (67.3) 298 (64.8) 710 (64.7) 109 (79.6) 709 (69.9) 100 (48.3)
 Yes 345 (28.0) 209 (27.0) 136 (29.6) 319 (29.1) 26 (19.0) 249 (24.5) 94 (45.4)
 Sometimes 70 (5.7) 44 (5.7) 26 (5.7) 68 (6.2) 2 (1.5) 57 (5.6) 13 (6.3)
US pattern 0.08 0.005 0.025*
 No 747 (60.5) 484 (62.5) 263 (57.2) 647 (59.0) 100 (73.0) 627 (61.8) 115 (55.6)
 Yes 471 (38.2) 278 (35.9) 193 (42.0) 435 (39.6) 36 (26.3) 372 (36.7) 92 (44.4)
 Sometimes 16 (1.3) 12 (1.6) 4 (0.9) 15 (1.4) 1 (0.7) 16 (1.6) 0 (0.0)
Classification <0.001 0.5 <0.001
 ACR-TIRADS 307 (63.0) 187 (64.5) 120 (60.9) 282 (62.7) 25 (67.6) 236 (60.8) 64 (59.6)
 ATA 2015 40 (8.2) 38 (13.1) 2 (1.0) 39 (8.7) 1 (2.7) 14 (3.6) 26 (28.3)
 EU-TIRADS 57 (11.7) 27 (9.3) 30 (15.2) 51 (11.3) 6 (16.2) 55 (14.2) 2 (2.2)
 Other/unspec 83 (17.0) 38 (13.1) 45 (22.8) 78 (17.3) 5 (13.5) 83 (21.4) 0 (0.0)

Unspec, unspecified; US pattern, sonographic pattern.

Fisher’s exact test.

12 reports issued by nuclear medicine and primary care physicians were excluded from the analysis.

Information regarding composition was not registered in one report from 2022.

Comparison of report quality by year of data capture (2021 vs 2022)

The comparison by year of data capture is reported in Table 1. There was no difference in the median number interquartile range (IQR) of features appropriately reported between 2021 and 2022 (4 (2–6) vs 3 (2–5), P = 0.1). The distributions of the number of features appropriately reported and their Kernel density plots (overlapped) by year of data capture are shown in Fig. 2A. Only 6% and 4% reports had all the nodule features reported in 2021 and 2022, respectively. The same figures were observed for reports without any features appropriately reported. Among 2021 reports, three dimensions, echogenicity, and margins were more frequently reported (47% vs 37%, P = 0.005; 74% vs 69%, P = 0.06 and 44% vs 35%, P = 0.006, respectively). There were no differences in the reporting frequency of axes, shape, composition, or presence of calcifications. Reports from 2022 tended to categorize the sonographic suspicion of the nodules more frequently (42% vs 36%, P = 0.08). The most frequently used classification system for reporting sonographic suspicion was ACR-TIRADS in both groups (65% vs 61% in 2021 and 2022, respectively); however, the EU-TIRADS classification system was more frequently used in the 2022 reports (15% vs 9%, P < 0.001).

Figure 2
Figure 2

Distribution of the number of features appropriately reported and their Kernel density plots (overlapped) by year of data capture (A), origin of the report (B) and specialty of the informant (C).

Citation: European Thyroid Journal 14, 1; 10.1530/ETJ-24-0390

Comparison of report quality by origin of the report (in-house vs outsourced centers)

The overall quality of the reports issued in outsourced centers was significantly worse (Table 1), with lower rates of correct description of all evaluated items except for composition. The median IQR number of features correctly reported by in-house specialists and specialists from outsourced centers was 4 (2–5) and 2 (1–4), respectively (P < 0.001). The distributions of the number of features appropriately reported and their Kernel density plots (overlapped) by year of data capture are shown in Fig. 2B. All features were appropriately reported in 5% and 2% reports issued by in-house specialists and specialists from outsourced centers, respectively, whereas none of the features were appropriately reported in 5% and 7%, respectively. Reports issued by in-house specialists were more likely to categorize the sonographic suspicion of nodules than reports issued by specialists from outsourced centers (40% vs 26%, P = 0.005). The use of different classification systems was not significantly different by origin of the report, with ACR-TIRADS being the most frequently used classification in both groups.

Comparison of report quality by specialty of informant (radiology vs endocrinology)

Reports issued by endocrinologists were more detailed than reports issued by radiologists, with a median IQR of 6 (5–7) vs 3 (2–5) items correctly reported, respectively (P < 0.001) (Table 1). Reports issued by other specialists were excluded from the analysis due to low counts (n = 12). The distributions of the number of features appropriately reported and their Kernel density plots (overlapped) by specialty of the informant are shown in Fig. 2C. Up to 23% reports issued by endocrinologists had all eight features appropriately reported, as opposed to 1% of the reports issued by radiologists. Furthermore, none of the reports issued by endocrinologists had zero features appropriately described, whereas this situation was found in 6% of the reports issued by radiologists. Almost all reports issued by endocrinologists described three dimensions (96% vs 33%, P < 0.001). Reports issued by endocrinologists were more likely to categorize the sonographic suspicion of nodules than reports issued by radiologists (44% vs 37%, P = 0.03). ACR-TIRADS was the most frequently used classification system for both endocrinologists and radiologists (70% vs 61%, respectively), although radiologists used EU-TIRADS more frequently than endocrinologists (14% vs 2.2%, P < 0.001).

The number of items correctly reported by radiologists was in general better in reports collected in 2022 than in reports collected in 2021, with an average of 3.4 vs 3.0 (P = 0.002). Similar results were obtained when outsourced reports were excluded from the comparison (average 3.5 vs 3.1, P = 0.003). The improvement was due to an increased rate in the reporting of axes (30.7% vs 21.7%, P = 0.001), shape (29.1% vs 22.7%, P = 0.009), calcifications (24.5% vs 21.6%, P = 0.047) and category of suspicion (40.4% vs 33.9%, P = 0.046). No significant differences were observed in the reporting of the number of dimensions, composition, echogenicity or margin.

Quality of the reports for assessing the need for cytological evaluation and growth during follow-up

Overall, 40% (95% CI: 37%–43%) of the reports were considered of sufficient quality to assess the need for cytological evaluation. There was no difference by year of data capture (42% in 2021 vs 39% in 2022; P = 0.4). However, the quality of the reports in this regard was better among those issued by in-house specialists compared to outsourced centers’ specialists (42% vs 27%, P < 0.001); and among those issued by endocrinologists compared to radiologists (52% vs 37%, P < 0.001).

Overall, 23% (95% CI: 20%–25%) of the reports were considered of sufficient quality to assess growth during follow-up. This proportion was higher for 2021 reports (25% vs 18% in 2022 reports, P = 0.008), and for those issued by in-house specialists (24% vs 12% issued by specialists of outsourced centers, P = 0.002); and issued by endocrinologists (60% vs 15% issued by radiologists, P < 0.001).

Only 13% (95% CI: 11%–15%) of the reports evaluated met both quality criteria and would therefore allow assessing the need for cytological evaluation and growth during follow-up.

Discussion

The quality of the reports was considered good to appropriately select nodules for biopsy (either size in one dimension plus category of suspicion or size in one dimension plus composition, echogenicity, margins, shape and calcifications) in 40% of cases. In addition, the quality of the reports was considered good to appropriately assess growth during follow-up (three dimensions with axes) in 23% of cases. Therefore, most reports’ quality was insufficient to inform on the need for biopsy (60%) and/or to assess growth during follow-up (77%). Both quality criteria were met in only 13% (159) of the reports. It is unknown whether missing information was assessed by informants but omitted from the reports because of negative findings (non-suspicious features). However, omitting such information in the report leaves treating physicians with the same uncertainty as if it had not been evaluated, prompting management decisions based on insufficient data, leading to unnecessary repeated ultrasounds, biopsies, surgeries and ultimately to system inefficiency.

Thyroid nodule management is mostly decided by endocrinologists with clinical information such as the presence of compressive symptoms, lab test results, ultrasound report and, if available, cytological diagnosis and/or scintigraphy. The quality of ultrasound reports is instrumental in the management decision process. As described in previous studies, the quality of thyroid ultrasound reports before implementing report-standardization strategies was poor in our setting (18, 19, 20, 21, 22). Perhaps the poor quality of such an instrumental tool has driven endocrinologists and other specialists dealing with thyroid nodule management to train in thyroid ultrasound. In Spain, endocrinologists started to perform thyroid ultrasound a little over 15 years ago. Since then, the number of endocrinologists trained in thyroid ultrasound has been increasing, but most thyroid ultrasounds are still being done by radiologists.

In our cohort, most reports (82%) were issued by radiologists, with a median of three out of eight items correctly reported for all thyroid nodules. The quality of the reports was worse for outsourced ultrasounds and better for reports issued by endocrinologists. A previous study also found higher-quality reports among provider-performed ultrasounds than in ultrasounds performed in imaging centers or in radiology departments (21). However, our data might be biased, as endocrinologists’ reports in our 2021 cohort were often issued by the investigators of this study, who have a particular interest in thyroid nodules and thyroid cancer (members of the SENDIMAD’s thyroid cancer working group). Thus, it is possible that the quality of thyroid ultrasound reports issued by other endocrinologists is not as good. To limit bias in the endocrinologists’ reports due to awareness of quality assessment, inclusion of their own reports was only allowed in 2021 (issued before the study design) but forbidden in 2022. This limited the number of reports issued by endocrinologists in our 2022 cohort and prevented comparison with 2021 data due to low counts. The number of features correctly described in the reports issued by radiologists in 2022 improved by an average of 0.4 items over 2021 reports (3.4 vs 3.0). This observation could represent a selection bias of our 2021 cohort or a true improvement of reports over time. Anyhow, the quality of most reports was still insufficient to make clinical decisions.

We did not collect information about the expertise of the informants or select which radiologists issued the reports. Furthermore, radiologists were unaware of the report quality evaluation. Thus, our study represents real-world data from mostly tertiary academic institutions that provide care to more than half of the population of the Community of Madrid. Most reports were issued by either endocrinologists or radiologists working for the public National Health System, who have a fixed monthly salary independent of the number or quality of their reports. However, thyroid ultrasounds ordered by primary care physicians are frequently outsourced to private imaging centers, which get reimbursed per patient. No minimum quality of the report is required for reimbursement. Thus, there is a lack of incentive to provide good quality reports in private imaging centers at this time. We found that report quality was particularly poor in this setting.

We did not collect the number of nodules described in the reports or the size of the largest nodule but rather aggregate data for all nodules >1 cm. Prior studies found no association between the quality of the description and the size of the nodule (18). Furthermore, several studies considered reporting the maximum diameter of the nodule as a quality measurement. It is usually assumed that whenever a single dimension is provided, it must be the largest one. Although this sounds reasonable, it is often not true. But even if it were the largest dimension, it is still insufficient to assess growth during follow-up. Like in our series, most reports in previous studies failed to report three dimensions (18, 19, 21).

A good-quality report can precisely rule out malignancy in 50% of the thyroid nodules (23), select nodules requiring cytological evaluation (9, 10, 11), allow risk stratification of thyroid nodules with indeterminate cytology (24, 25), and provide relevant information for the use and interpretation of the results of molecular testing (26). Thus, improving the quality of ultrasound reports is key for the optimization of the institutional thyroid nodule evaluation pathway (26, 27). Previous studies have demonstrated that implementing report-standardization strategies improves the quality of thyroid ultrasound reports (12, 14, 20, 22, 28). However, more detailed reports may take longer time to evaluate and describe all relevant features (13). Thus, guideline implementation might require scheduling adjustments.

In summary, our study found that most thyroid ultrasound reports issued during 2021 and 2022 in the public health care system of the Community of Madrid provided insufficient data to assess the need for cytological evaluation and/or growth during follow-up of thyroid nodules. Such reports likely led to health care system inefficiency. The quality of outsourced thyroid ultrasound reports was particularly poor. Quality metrics need to be enforced for thyroid ultrasounds covered by the public health care system. To facilitate this, the SENDIMAD’s thyroid cancer working group is developing an application intended to generate semiautomatic standardized thyroid ultrasound reports. Future studies will address whether reporting time is reduced with the application and whether the quality of thyroid ultrasound reports improves with its use in the Community of Madrid.

Declaration of interest

The authors declare that there is no conflict of interest that could be perceived as prejudicing the impartiality of the research reported.

Funding

This work was supported by a 2021 Research in Endocrinology, Nutrition and Diabetes grant (IPI/2021/N2) from the Society of Endocrinology, Nutrition and Diabetes of the Community of Madrid (SENDIMAD).

Author contribution statement

PV was responsible for conceptualization, methodology, investigation, writing the original draft, supervision and funding acquisition. JBQS contributed to investigation, formal analysis, data curation, writing the original draft and visualization. SCM, CFC, MPMN, MCSP, EFF, ALG, MESS, PMR-M, PPR, CTF, MJRT, VAL, MLV and NPG were involved in investigation, as well as writing, reviewing and editing.

References

  • 1

    Mazzaferri EL . Management of a solitary thyroid nodule. N Engl J Med 1993 328 553559. (https://doi.org/10.1056/nejm199302253280807)

  • 2

    Guth S , Theune U , Aberle J , et al. Very high prevalence of thyroid nodules detected by high frequency (13 MHz) ultrasound examination. Eur J Clin Invest 2009 39 699706. (https://doi.org/10.1111/j.1365-2362.2009.02162.x)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 3

    Ahn HS , Kim HJ & Welch HG . Korea's thyroid-cancer “epidemic”--screening and overdiagnosis. N Engl J Med 2014 371 17651767. (https://doi.org/10.1056/nejmp1409841)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 4

    Vaccarella S , Franceschi S , Bray F , et al. Worldwide thyroid-cancer epidemic? The increasing impact of overdiagnosis. N Engl J Med 2016 375 614617. (https://doi.org/10.1056/nejmp1604412)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 5

    Nagar S , Aschebrook-Kilfoy B , Kaplan EL , et al. Age of diagnosing physician impacts the incidence of thyroid cancer in a population. Cancer Causes Control 2014 25 16271634. (https://doi.org/10.1007/s10552-014-0467-2)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 6

    Udelsman R & Zhang Y . The epidemic of thyroid cancer in the United States: the role of endocrinologists and ultrasounds. Thyroid 2014 24 472479. (https://doi.org/10.1089/thy.2013.0257)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 7

    Zevallos JP , Hartman CM , Kramer JR , et al. Increased thyroid cancer incidence corresponds to increased use of thyroid ultrasound and fine-needle aspiration: a study of the Veterans Affairs health care system. Cancer 2015 121 741746. (https://doi.org/10.1002/cncr.29122)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 8

    Werk EE Jr , Vernon BM , Gonzalez JJ , et al. Cancer in thyroid nodules. A community hospital survey. Arch Intern Med 1984 144 474476. (https://doi.org/10.1001/archinte.144.3.474)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 9

    Haugen BR , Alexander EK , Bible KC , et al. 2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: the American thyroid association guidelines task force on thyroid nodules and differentiated thyroid cancer. Thyroid 2016 26 1133. (https://doi.org/10.1089/thy.2015.0020)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 10

    Tessler FN , Middleton WD , Grant EG , et al. ACR thyroid imaging, reporting and data system (TI-RADS): white paper of the ACR TI-RADS committee. J Am Coll Radiol 2017 14 587595. (https://doi.org/10.1016/j.jacr.2017.01.046)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 11

    Russ G , Bonnema SJ , Erdogan MF , et al. European thyroid association guidelines for ultrasound malignancy risk stratification of thyroid nodules in adults: the EU-TIRADS. Eur Thyroid J 2017 6 225237. (https://doi.org/10.1159/000478927)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 12

    Ghazizadeh S , Kelly TL , Khajanchee YS , et al. Standardization of thyroid ultrasound reporting in the community setting decreases biopsy rates. Clin Endocrinol 2021 94 10351042. (https://doi.org/10.1111/cen.14431)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 13

    Chan AJ , Sarrazin J , Halperin IJ , et al. Quality improvement initiative to standardise thyroid ultrasound reports and reduce unnecessary fine-needle aspiration biopsies of thyroid nodules. BMJ Open Qual 2022 11 e001769. (https://doi.org/10.1136/bmjoq-2021-001769)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 14

    Hu XY , Wu J , Seal P , et al. Improvement in thyroid ultrasound report quality with radiologists' adherence to 2015 ATA or 2017 TIRADS: a population study. Eur Thyroid J 2022 11 e220035. (https://doi.org/10.1530/etj-22-0035)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 15

    Harris PA , Taylor R , Minor BL , et al. The REDCap consortium: building an international community of software platform partners. J Biomed Inform 2019 95 103208. (https://doi.org/10.1016/j.jbi.2019.103208)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 16

    Harris PA , Taylor R , Thielke R , et al. Research electronic data capture (REDCap)--a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform 2009 42 377381. (https://doi.org/10.1016/j.jbi.2008.08.010)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 17

    R Core Team . R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing, 2023. (https://www.R-project.org/)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 18

    Symonds CJ , Seal P , Ghaznavi S , et al. Thyroid nodule ultrasound reports in routine clinical practice provide insufficient information to estimate risk of malignancy. Endocrine 2018 61 303307. (https://doi.org/10.1007/s12020-018-1634-0)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 19

    Raposo L , Freitas C , Martins R , et al. Malignancy risk of thyroid nodules: quality assessment of the thyroid ultrasound report. BMC Med Imaging 2022 22 61. (https://doi.org/10.1186/s12880-022-00789-3)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 20

    Botha M , Kisansa M & Greeff W . American college of radiology thyroid imaging reporting and data system standardises reporting of thyroid ultrasounds. SA J Radiol 2020 24 1804. (https://doi.org/10.4102/sajr.v24i1.1804)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 21

    Jiang L , Lee CY , Sloan DA , et al. Variation in the quality of thyroid nodule evaluations before surgical referral. J Surg Res 2019 244 914. (https://doi.org/10.1016/j.jss.2019.06.024)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 22

    Griffin AS , Mitsky J , Rawal U , et al. Improved quality of thyroid ultrasound reports after implementation of the ACR thyroid imaging reporting and data system nodule lexicon and risk stratification system. J Am Coll Radiol 2018 15 743748. (https://doi.org/10.1016/j.jacr.2018.01.024)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 23

    Russ G , Royer B , Bigorgne C , et al. Prospective evaluation of thyroid imaging reporting and data system on 4550 nodules with and without elastography. Eur J Endocrinol 2013 168 649655. (https://doi.org/10.1530/eje-12-0936)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 24

    Valderrabano P , McGettigan MJ , Lam CA , et al. Thyroid nodules with indeterminate cytology: utility of the American thyroid association sonographic patterns for cancer risk stratification. Thyroid 2018 28 10041012. (https://doi.org/10.1089/thy.2018.0085)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 25

    Lam CA , McGettigan MJ , Thompson ZJ , et al. Ultrasound characterization for thyroid nodules with indeterminate cytology: inter-observer agreement and impact of combining pattern-based and scoring-based classifications in risk stratification. Endocrine 2019 66 278287. (https://doi.org/10.1007/s12020-019-02000-0)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 26

    Valderrabano P , Eszlinger M , Stewardson P , et al. Clinical value of molecular markers as diagnostic and prognostic tools to guide treatment of thyroid cancer. Clin Endocrinol 2023 98 753762. (https://doi.org/10.1111/cen.14882)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 27

    Stewardson P , Eszlinger M & Paschke R . Diagnosis of endocrine disease: usefulness of genetic testing of fine-needle aspirations for diagnosis of thyroid cancer. Eur J Endocrinol 2022 187 R41R52. (https://doi.org/10.1530/eje-21-1293)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 28

    Atweh L , Al-Hadidi A , Singh J , et al. Quality improvement methodology to improve standardized reporting of pediatric thyroid ultrasounds using TI-RADS. J Pediatr Surg 2024 59 731736. (https://doi.org/10.1016/j.jpedsurg.2023.12.009)

    • PubMed
    • Search Google Scholar
    • Export Citation

 

  • Collapse
  • Expand
  • Figure 1

    Distribution of reports according to institution and year of data capture.

  • Figure 2

    Distribution of the number of features appropriately reported and their Kernel density plots (overlapped) by year of data capture (A), origin of the report (B) and specialty of the informant (C).

  • 1

    Mazzaferri EL . Management of a solitary thyroid nodule. N Engl J Med 1993 328 553559. (https://doi.org/10.1056/nejm199302253280807)

  • 2

    Guth S , Theune U , Aberle J , et al. Very high prevalence of thyroid nodules detected by high frequency (13 MHz) ultrasound examination. Eur J Clin Invest 2009 39 699706. (https://doi.org/10.1111/j.1365-2362.2009.02162.x)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 3

    Ahn HS , Kim HJ & Welch HG . Korea's thyroid-cancer “epidemic”--screening and overdiagnosis. N Engl J Med 2014 371 17651767. (https://doi.org/10.1056/nejmp1409841)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 4

    Vaccarella S , Franceschi S , Bray F , et al. Worldwide thyroid-cancer epidemic? The increasing impact of overdiagnosis. N Engl J Med 2016 375 614617. (https://doi.org/10.1056/nejmp1604412)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 5

    Nagar S , Aschebrook-Kilfoy B , Kaplan EL , et al. Age of diagnosing physician impacts the incidence of thyroid cancer in a population. Cancer Causes Control 2014 25 16271634. (https://doi.org/10.1007/s10552-014-0467-2)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 6

    Udelsman R & Zhang Y . The epidemic of thyroid cancer in the United States: the role of endocrinologists and ultrasounds. Thyroid 2014 24 472479. (https://doi.org/10.1089/thy.2013.0257)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 7

    Zevallos JP , Hartman CM , Kramer JR , et al. Increased thyroid cancer incidence corresponds to increased use of thyroid ultrasound and fine-needle aspiration: a study of the Veterans Affairs health care system. Cancer 2015 121 741746. (https://doi.org/10.1002/cncr.29122)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 8

    Werk EE Jr , Vernon BM , Gonzalez JJ , et al. Cancer in thyroid nodules. A community hospital survey. Arch Intern Med 1984 144 474476. (https://doi.org/10.1001/archinte.144.3.474)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 9

    Haugen BR , Alexander EK , Bible KC , et al. 2015 American thyroid association management guidelines for adult patients with thyroid nodules and differentiated thyroid cancer: the American thyroid association guidelines task force on thyroid nodules and differentiated thyroid cancer. Thyroid 2016 26 1133. (https://doi.org/10.1089/thy.2015.0020)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 10

    Tessler FN , Middleton WD , Grant EG , et al. ACR thyroid imaging, reporting and data system (TI-RADS): white paper of the ACR TI-RADS committee. J Am Coll Radiol 2017 14 587595. (https://doi.org/10.1016/j.jacr.2017.01.046)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 11

    Russ G , Bonnema SJ , Erdogan MF , et al. European thyroid association guidelines for ultrasound malignancy risk stratification of thyroid nodules in adults: the EU-TIRADS. Eur Thyroid J 2017 6 225237. (https://doi.org/10.1159/000478927)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 12

    Ghazizadeh S , Kelly TL , Khajanchee YS , et al. Standardization of thyroid ultrasound reporting in the community setting decreases biopsy rates. Clin Endocrinol 2021 94 10351042. (https://doi.org/10.1111/cen.14431)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 13

    Chan AJ , Sarrazin J , Halperin IJ , et al. Quality improvement initiative to standardise thyroid ultrasound reports and reduce unnecessary fine-needle aspiration biopsies of thyroid nodules. BMJ Open Qual 2022 11 e001769. (https://doi.org/10.1136/bmjoq-2021-001769)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 14

    Hu XY , Wu J , Seal P , et al. Improvement in thyroid ultrasound report quality with radiologists' adherence to 2015 ATA or 2017 TIRADS: a population study. Eur Thyroid J 2022 11 e220035. (https://doi.org/10.1530/etj-22-0035)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 15

    Harris PA , Taylor R , Minor BL , et al. The REDCap consortium: building an international community of software platform partners. J Biomed Inform 2019 95 103208. (https://doi.org/10.1016/j.jbi.2019.103208)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 16

    Harris PA , Taylor R , Thielke R , et al. Research electronic data capture (REDCap)--a metadata-driven methodology and workflow process for providing translational research informatics support. J Biomed Inform 2009 42 377381. (https://doi.org/10.1016/j.jbi.2008.08.010)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 17

    R Core Team . R: A Language and Environment for Statistical Computing. Vienna, Austria: R Foundation for Statistical Computing, 2023. (https://www.R-project.org/)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 18

    Symonds CJ , Seal P , Ghaznavi S , et al. Thyroid nodule ultrasound reports in routine clinical practice provide insufficient information to estimate risk of malignancy. Endocrine 2018 61 303307. (https://doi.org/10.1007/s12020-018-1634-0)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 19

    Raposo L , Freitas C , Martins R , et al. Malignancy risk of thyroid nodules: quality assessment of the thyroid ultrasound report. BMC Med Imaging 2022 22 61. (https://doi.org/10.1186/s12880-022-00789-3)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 20

    Botha M , Kisansa M & Greeff W . American college of radiology thyroid imaging reporting and data system standardises reporting of thyroid ultrasounds. SA J Radiol 2020 24 1804. (https://doi.org/10.4102/sajr.v24i1.1804)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 21

    Jiang L , Lee CY , Sloan DA , et al. Variation in the quality of thyroid nodule evaluations before surgical referral. J Surg Res 2019 244 914. (https://doi.org/10.1016/j.jss.2019.06.024)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 22

    Griffin AS , Mitsky J , Rawal U , et al. Improved quality of thyroid ultrasound reports after implementation of the ACR thyroid imaging reporting and data system nodule lexicon and risk stratification system. J Am Coll Radiol 2018 15 743748. (https://doi.org/10.1016/j.jacr.2018.01.024)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 23

    Russ G , Royer B , Bigorgne C , et al. Prospective evaluation of thyroid imaging reporting and data system on 4550 nodules with and without elastography. Eur J Endocrinol 2013 168 649655. (https://doi.org/10.1530/eje-12-0936)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 24

    Valderrabano P , McGettigan MJ , Lam CA , et al. Thyroid nodules with indeterminate cytology: utility of the American thyroid association sonographic patterns for cancer risk stratification. Thyroid 2018 28 10041012. (https://doi.org/10.1089/thy.2018.0085)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 25

    Lam CA , McGettigan MJ , Thompson ZJ , et al. Ultrasound characterization for thyroid nodules with indeterminate cytology: inter-observer agreement and impact of combining pattern-based and scoring-based classifications in risk stratification. Endocrine 2019 66 278287. (https://doi.org/10.1007/s12020-019-02000-0)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 26

    Valderrabano P , Eszlinger M , Stewardson P , et al. Clinical value of molecular markers as diagnostic and prognostic tools to guide treatment of thyroid cancer. Clin Endocrinol 2023 98 753762. (https://doi.org/10.1111/cen.14882)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 27

    Stewardson P , Eszlinger M & Paschke R . Diagnosis of endocrine disease: usefulness of genetic testing of fine-needle aspirations for diagnosis of thyroid cancer. Eur J Endocrinol 2022 187 R41R52. (https://doi.org/10.1530/eje-21-1293)

    • PubMed
    • Search Google Scholar
    • Export Citation
  • 28

    Atweh L , Al-Hadidi A , Singh J , et al. Quality improvement methodology to improve standardized reporting of pediatric thyroid ultrasounds using TI-RADS. J Pediatr Surg 2024 59 731736. (https://doi.org/10.1016/j.jpedsurg.2023.12.009)

    • PubMed
    • Search Google Scholar
    • Export Citation