Skip to main content

11.09.2024 | Original Article

Evaluating the accuracy and adequacy of ChatGPT in responding to queries of diabetes patients in primary healthcare

verfasst von: İrem Şenoymak, Nuriye Hale Erbatur, Mustafa Can Şenoymak, Memet Taşkın Egici

Erschienen in: International Journal of Diabetes in Developing Countries

Einloggen, um Zugang zu erhalten

Abstract

Objective

This study evaluates the accuracy and adequacy of Chat Generative Pre-trained Transformer (ChatGPT) in responding to common queries formulated by primary care physicians based on their interactions with diabetic patients in primary healthcare settings.

Methods

Thirty-two frequently asked questions were identified by experienced primary care physicians and presented systematically to ChatGPT. Responses underwent evaluation by two endocrinology and metabolism physicians which utilized a 3-point Likert scale for accuracy (1, inaccurate; 2, partially accurate; 3, accurate) and a 6-point Likert scale for adequacy (1, completely inadequate to 6, completely adequate). Questions were categorized into groups including general information, diagnostic processes, treatment procedures, and complications.

Results

The median accuracy score was 3.0 (IQR, 3.0–3.0), and the adequacy score was 4.5 (IQR, 4.0–5.8). None of the questions received an inaccurate rating, and the lowest accuracy score assigned by both evaluators was 3. Significant agreement was observed between the evaluators, demonstrated by a weighted κ of 0.61 (p < .0001) for accuracy and substantial agreement with a weighted κ of 0.62 (p < 0.0001) for adequacy. The Kruskal–Wallis tests revealed no statistically significant differences among the groups for both accuracy (p = .71) and adequacy (p = .57).

Conclusions

ChatGPT demonstrated commendable accuracy and adequacy in addressing diabetes-related queries in primary healthcare.
Literatur
1.
Zurück zum Zitat GBD 2021 Diabetes Collaborators. Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the Global Burden of Disease Study 2021 [published correction appears in Lancet. 2023 Sep 30;402(10408):1132]. Lancet. 2023;402(10397):203–234. https://doi.org/10.1016/S0140-6736(23)01301-6. GBD 2021 Diabetes Collaborators. Global, regional, and national burden of diabetes from 1990 to 2021, with projections of prevalence to 2050: a systematic analysis for the Global Burden of Disease Study 2021 [published correction appears in Lancet. 2023 Sep 30;402(10408):1132]. Lancet. 2023;402(10397):203–234. https://​doi.​org/​10.​1016/​S0140-6736(23)01301-6.
2.
Zurück zum Zitat Da Rocha RB, Silva CS, Cardoso VS. Self-care in adults with type 2 diabetes mellitus: a systematic review. CDR. 2020;16:598–607.CrossRef Da Rocha RB, Silva CS, Cardoso VS. Self-care in adults with type 2 diabetes mellitus: a systematic review. CDR. 2020;16:598–607.CrossRef
3.
Zurück zum Zitat American Diabetes Association Professional Practice Committee. 5. Facilitating positive health behaviors and well-being to improve health outcomes: standards of care in diabetes-2024 [published correction appears in Diabetes Care. 2024 Feb 05;:]. Diabetes Care. 2024;47(Suppl 1):S77-S110. https://doi.org/10.2337/dc24-S005. American Diabetes Association Professional Practice Committee. 5. Facilitating positive health behaviors and well-being to improve health outcomes: standards of care in diabetes-2024 [published correction appears in Diabetes Care. 2024 Feb 05;:]. Diabetes Care. 2024;47(Suppl 1):S77-S110. https://​doi.​org/​10.​2337/​dc24-S005.
13.
Zurück zum Zitat Meo SA, Al-Khlaiwi T, AbuKhalaf AA, Meo AS, Klonoff DC. The scientific knowledge of bard and ChatGPT in endocrinology, diabetes, and diabetes technology: multiple-choice questions examination-based performance. J Diabetes Sci Technol 19322968231203987 (2023) https://doi.org/10.1177/19322968231203987. Meo SA, Al-Khlaiwi T, AbuKhalaf AA, Meo AS, Klonoff DC. The scientific knowledge of bard and ChatGPT in endocrinology, diabetes, and diabetes technology: multiple-choice questions examination-based performance. J Diabetes Sci Technol 19322968231203987 (2023) https://​doi.​org/​10.​1177/​1932296823120398​7.
Metadaten
Titel
Evaluating the accuracy and adequacy of ChatGPT in responding to queries of diabetes patients in primary healthcare
verfasst von
İrem Şenoymak
Nuriye Hale Erbatur
Mustafa Can Şenoymak
Memet Taşkın Egici
Publikationsdatum
11.09.2024
Verlag
Springer India
Erschienen in
International Journal of Diabetes in Developing Countries
Print ISSN: 0973-3930
Elektronische ISSN: 1998-3832
DOI
https://doi.org/10.1007/s13410-024-01401-w