A Neoteric Feature Extraction Technique to Predict the Survival of Gastric Cancer Patients.
K-Nearest Neighbors (KNN)
Naïve Bayes (NB)
area under ROC curve (AUC)
feature extraction
gastric cancer
radiomics features
random forest (RF) classifier
support vector machine (SVM)
Journal
Diagnostics (Basel, Switzerland)
ISSN: 2075-4418
Titre abrégé: Diagnostics (Basel)
Pays: Switzerland
ID NLM: 101658402
Informations de publication
Date de publication:
01 May 2024
01 May 2024
Historique:
received:
10
01
2024
revised:
26
04
2024
accepted:
28
04
2024
medline:
11
5
2024
pubmed:
11
5
2024
entrez:
11
5
2024
Statut:
epublish
Résumé
At the time of cancer diagnosis, it is crucial to accurately classify malignant gastric tumors and the possibility that patients will survive. This study aims to investigate the feasibility of identifying and applying a new feature extraction technique to predict the survival of gastric cancer patients. A retrospective dataset including the computed tomography (CT) images of 135 patients was assembled. Among them, 68 patients survived longer than three years. Several sets of radiomics features were extracted and were incorporated into a machine learning model, and their classification performance was characterized. To improve the classification performance, we further extracted another 27 texture and roughness parameters with 2484 superficial and spatial features to propose a new feature pool. This new feature set was added into the machine learning model and its performance was analyzed. To determine the best model for our experiment, Random Forest (RF) classifier, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Naïve Bayes (NB) (four of the most popular machine learning models) were utilized. The models were trained and tested using the five-fold cross-validation method. Using the area under ROC curve (AUC) as an evaluation index, the model that was generated using the new feature pool yields AUC = 0.98 ± 0.01, which was significantly higher than the models created using the traditional radiomics feature set ( This study demonstrated that although radiomics features produced good classification performance, creating new feature sets significantly improved the model performance.
Sections du résumé
BACKGROUND
BACKGROUND
At the time of cancer diagnosis, it is crucial to accurately classify malignant gastric tumors and the possibility that patients will survive.
OBJECTIVE
OBJECTIVE
This study aims to investigate the feasibility of identifying and applying a new feature extraction technique to predict the survival of gastric cancer patients.
METHODS
METHODS
A retrospective dataset including the computed tomography (CT) images of 135 patients was assembled. Among them, 68 patients survived longer than three years. Several sets of radiomics features were extracted and were incorporated into a machine learning model, and their classification performance was characterized. To improve the classification performance, we further extracted another 27 texture and roughness parameters with 2484 superficial and spatial features to propose a new feature pool. This new feature set was added into the machine learning model and its performance was analyzed. To determine the best model for our experiment, Random Forest (RF) classifier, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Naïve Bayes (NB) (four of the most popular machine learning models) were utilized. The models were trained and tested using the five-fold cross-validation method.
RESULTS
RESULTS
Using the area under ROC curve (AUC) as an evaluation index, the model that was generated using the new feature pool yields AUC = 0.98 ± 0.01, which was significantly higher than the models created using the traditional radiomics feature set (
CONCLUSIONS
CONCLUSIONS
This study demonstrated that although radiomics features produced good classification performance, creating new feature sets significantly improved the model performance.
Identifiants
pubmed: 38732368
pii: diagnostics14090954
doi: 10.3390/diagnostics14090954
pii:
doi:
Types de publication
Journal Article
Langues
eng
Subventions
Organisme : NIGMS NIH HHS
ID : P20 GM135009
Pays : United States