Assessing the role of advanced artificial intelligence as a tool in multidisciplinary tumor board decision-making for recurrent/metastatic head and neck cancer cases - the first study on ChatGPT 4o and a comparison to ChatGPT 4.0.

ChatGPT; HNSCC; artificial intelligence; multidisciplinary tumorboard; salvage surgery

Journal

Frontiers in oncology
ISSN: 2234-943X
Abbreviated title: Front Oncol
Country: Switzerland
NLM ID: 101568867

Publication information

Publication date:
2024
History:
received: 26 06 2024
accepted: 21 08 2024
medline: 20 09 2024
pubmed: 20 09 2024
entrez: 20 09 2024
Status: epublish

Abstract

Recurrent and metastatic head and neck squamous cell carcinoma (HNSCC) requires complex therapeutic management that needs to be discussed in multidisciplinary tumor boards (MDT). While artificial intelligence (AI) has improved significantly in assisting healthcare professionals with informed treatment decisions for primary cases, its application in the even more complex recurrent/metastatic setting has not yet been evaluated. This study also represents the first evaluation of the recently released large language model (LLM) ChatGPT 4o, compared with ChatGPT 4.0, for providing therapy recommendations. Each LLM generated therapy recommendations for 100 HNSCC cases (50 cases of recurrence and 50 cases of distant metastasis), which were evaluated by two independent reviewers. The primary outcome was the quality of the therapy recommendations, measured by three parameters: clinical recommendation, explanation, and summarization. ChatGPT 4o and 4.0 provided mostly general answers for surgery, palliative care, or systemic therapy. ChatGPT 4o was 48.5% faster than ChatGPT 4.0. Both LLMs obtained high scores for clinical recommendation, explanation, and summarization, with no significant differences between them. However, both proved to be mostly an assisting tool that requires validation by an experienced clinician, owing to a lack of transparency and occasional recommendations of treatment modalities that are not part of current treatment guidelines. This research demonstrates that ChatGPT 4o and 4.0 perform similarly, while ChatGPT 4o is significantly faster. Because the current versions cannot tailor therapy recommendations, sometimes recommend incorrect treatment options, and lack information on the source material, advanced AI models can at present merely assist in the MDT setting for recurrent/metastatic HNSCC.

Identifiers

pubmed: 39301542
doi: 10.3389/fonc.2024.1455413
pmc: PMC11410764

Publication types

Journal Article

Languages

eng

Pagination

1455413

Copyright information

Copyright © 2024 Schmidl, Hütten, Pigorsch, Stögbauer, Hoch, Hussain, Wollenberg and Wirth.

Conflict of interest statement

The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.

Authors

Benedikt Schmidl (B)

Department of Otolaryngology Head and Neck Surgery, Technical University Munich, Munich, Germany.

Tobias Hütten (T)

Department of Otolaryngology Head and Neck Surgery, Technical University Munich, Munich, Germany.

Steffi Pigorsch (S)

Department of RadioOncology, Technical University Munich, Munich, Germany.

Fabian Stögbauer (F)

Institute of Pathology, Technical University Munich, Munich, Germany.

Cosima C Hoch (CC)

Department of Otolaryngology Head and Neck Surgery, Technical University Munich, Munich, Germany.

Timon Hussain (T)

Department of Otolaryngology Head and Neck Surgery, Technical University Munich, Munich, Germany.

Barbara Wollenberg (B)

Department of Otolaryngology Head and Neck Surgery, Technical University Munich, Munich, Germany.

Markus Wirth (M)

Department of Otolaryngology Head and Neck Surgery, Technical University Munich, Munich, Germany.

MeSH classifications