A Neoteric Feature Extraction Technique to Predict the Survival of Gastric Cancer Patients.

K-Nearest Neighbors (KNN) Naïve Bayes (NB) area under ROC curve (AUC) feature extraction gastric cancer radiomics features random forest (RF) classifier support vector machine (SVM)

Journal

Diagnostics (Basel, Switzerland)
ISSN: 2075-4418
Titre abrégé: Diagnostics (Basel)
Pays: Switzerland
ID NLM: 101658402

Informations de publication

Date de publication:
01 May 2024
Historique:
received: 10 01 2024
revised: 26 04 2024
accepted: 28 04 2024
medline: 11 5 2024
pubmed: 11 5 2024
entrez: 11 5 2024
Statut: epublish

Résumé

At the time of cancer diagnosis, it is crucial to accurately classify malignant gastric tumors and the possibility that patients will survive. This study aims to investigate the feasibility of identifying and applying a new feature extraction technique to predict the survival of gastric cancer patients. A retrospective dataset including the computed tomography (CT) images of 135 patients was assembled. Among them, 68 patients survived longer than three years. Several sets of radiomics features were extracted and were incorporated into a machine learning model, and their classification performance was characterized. To improve the classification performance, we further extracted another 27 texture and roughness parameters with 2484 superficial and spatial features to propose a new feature pool. This new feature set was added into the machine learning model and its performance was analyzed. To determine the best model for our experiment, Random Forest (RF) classifier, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Naïve Bayes (NB) (four of the most popular machine learning models) were utilized. The models were trained and tested using the five-fold cross-validation method. Using the area under ROC curve (AUC) as an evaluation index, the model that was generated using the new feature pool yields AUC = 0.98 ± 0.01, which was significantly higher than the models created using the traditional radiomics feature set ( This study demonstrated that although radiomics features produced good classification performance, creating new feature sets significantly improved the model performance.

Sections du résumé

BACKGROUND BACKGROUND
At the time of cancer diagnosis, it is crucial to accurately classify malignant gastric tumors and the possibility that patients will survive.
OBJECTIVE OBJECTIVE
This study aims to investigate the feasibility of identifying and applying a new feature extraction technique to predict the survival of gastric cancer patients.
METHODS METHODS
A retrospective dataset including the computed tomography (CT) images of 135 patients was assembled. Among them, 68 patients survived longer than three years. Several sets of radiomics features were extracted and were incorporated into a machine learning model, and their classification performance was characterized. To improve the classification performance, we further extracted another 27 texture and roughness parameters with 2484 superficial and spatial features to propose a new feature pool. This new feature set was added into the machine learning model and its performance was analyzed. To determine the best model for our experiment, Random Forest (RF) classifier, Support Vector Machine (SVM), K-Nearest Neighbors (KNN), and Naïve Bayes (NB) (four of the most popular machine learning models) were utilized. The models were trained and tested using the five-fold cross-validation method.
RESULTS RESULTS
Using the area under ROC curve (AUC) as an evaluation index, the model that was generated using the new feature pool yields AUC = 0.98 ± 0.01, which was significantly higher than the models created using the traditional radiomics feature set (
CONCLUSIONS CONCLUSIONS
This study demonstrated that although radiomics features produced good classification performance, creating new feature sets significantly improved the model performance.

Identifiants

pubmed: 38732368
pii: diagnostics14090954
doi: 10.3390/diagnostics14090954
pii:
doi:

Types de publication

Journal Article

Langues

eng

Subventions

Organisme : NIGMS NIH HHS
ID : P20 GM135009
Pays : United States

Auteurs

Warid Islam (W)

School of Electrical and Computer Engineering, University of Oklahoma, Norman, OK 73019, USA.

Neman Abdoli (N)

School of Electrical and Computer Engineering, University of Oklahoma, Norman, OK 73019, USA.

Tasfiq E Alam (TE)

School of Industrial and Systems Engineering, University of Oklahoma, Norman, OK 73019, USA.

Meredith Jones (M)

Stephenson School of Biomedical Engineering, University of Oklahoma, Norman, OK 73019, USA.

Bornface M Mutembei (BM)

Stephenson School of Biomedical Engineering, University of Oklahoma, Norman, OK 73019, USA.

Feng Yan (F)

Stephenson School of Biomedical Engineering, University of Oklahoma, Norman, OK 73019, USA.

Qinggong Tang (Q)

Stephenson School of Biomedical Engineering, University of Oklahoma, Norman, OK 73019, USA.

Classifications MeSH