A hybrid machine learning model combining association rule mining and classification algorithms to predict differentiated thyroid cancer recurrence.
associative classification
differentiated thyroid cancer
machine learning
personalized medicine
recurrence prediction
Journal
Frontiers in medicine
ISSN: 2296-858X
Titre abrégé: Front Med (Lausanne)
Pays: Switzerland
ID NLM: 101648047
Informations de publication
Date de publication:
2024
2024
Historique:
received:
08
07
2024
accepted:
23
09
2024
medline:
21
10
2024
pubmed:
21
10
2024
entrez:
21
10
2024
Statut:
epublish
Résumé
Differentiated thyroid cancer (DTC) is the most prevalent endocrine malignancy with a recurrence rate of about 20%, necessitating better predictive methods for patient management. This study aims to create a relational classification model to predict DTC recurrence by integrating clinical, pathological, and follow-up data. The balanced dataset comprises 550 DTC samples collected over 15 years, featuring 13 clinicopathological variables. To address the class imbalance in recurrence status, the Synthetic Minority Over-sampling Technique for Nominal and Continuous (SMOTE-NC) was utilized. A hybrid model combining classification algorithms with association rule mining was developed. Two relational classification approaches, regularized class association rules (RCAR) and classification based on association rules (CBAR), were implemented. Binomial logistic regression analyzed independent predictors of recurrence. Model performance was assessed through accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score. The RCAR model demonstrated superior performance over the CBAR model, achieving accuracy, sensitivity, and F1 score of 96.7%, 93.1%, and 96.7%, respectively. Association rules highlighted that papillary pathology with an incomplete response strongly predicted recurrence. The combination of incomplete response and lymphadenopathy was also a significant predictor. Conversely, the absence of adenopathy and complete response to treatment were linked to freedom from recurrence. Incomplete structural response was identified as a critical predictor of recurrence risk, even with other low-recurrence conditions. This study introduces a robust and interpretable predictive model that enhances personalized medicine in thyroid cancer care. The model effectively identifies high-risk individuals, allowing for tailored follow-up strategies that could improve patient outcomes and optimize resource allocation in DTC management.
Sections du résumé
Background
UNASSIGNED
Differentiated thyroid cancer (DTC) is the most prevalent endocrine malignancy with a recurrence rate of about 20%, necessitating better predictive methods for patient management. This study aims to create a relational classification model to predict DTC recurrence by integrating clinical, pathological, and follow-up data.
Methods
UNASSIGNED
The balanced dataset comprises 550 DTC samples collected over 15 years, featuring 13 clinicopathological variables. To address the class imbalance in recurrence status, the Synthetic Minority Over-sampling Technique for Nominal and Continuous (SMOTE-NC) was utilized. A hybrid model combining classification algorithms with association rule mining was developed. Two relational classification approaches, regularized class association rules (RCAR) and classification based on association rules (CBAR), were implemented. Binomial logistic regression analyzed independent predictors of recurrence. Model performance was assessed through accuracy, sensitivity, specificity, positive predictive value, negative predictive value, and F1 score.
Results
UNASSIGNED
The RCAR model demonstrated superior performance over the CBAR model, achieving accuracy, sensitivity, and F1 score of 96.7%, 93.1%, and 96.7%, respectively. Association rules highlighted that papillary pathology with an incomplete response strongly predicted recurrence. The combination of incomplete response and lymphadenopathy was also a significant predictor. Conversely, the absence of adenopathy and complete response to treatment were linked to freedom from recurrence. Incomplete structural response was identified as a critical predictor of recurrence risk, even with other low-recurrence conditions.
Conclusion
UNASSIGNED
This study introduces a robust and interpretable predictive model that enhances personalized medicine in thyroid cancer care. The model effectively identifies high-risk individuals, allowing for tailored follow-up strategies that could improve patient outcomes and optimize resource allocation in DTC management.
Identifiants
pubmed: 39430590
doi: 10.3389/fmed.2024.1461372
pmc: PMC11486678
doi:
Types de publication
Journal Article
Langues
eng
Pagination
1461372Informations de copyright
Copyright © 2024 Firat Atay, Yagin, Colak, Elkiran, Mansuri, Ahmad and Ardigò.
Déclaration de conflit d'intérêts
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.