Multi-centre radiomics for prediction of recurrence following radical radiotherapy for head and neck cancers: Consequences of feature selection, machine learning classifiers and batch-effect harmonization.
Head-and-neck cancer
Loco-regional recurrence
Machine learning
Multi-institutional
Prognosis
Radiomics
Journal
Physics and imaging in radiation oncology
ISSN: 2405-6316
Titre abrégé: Phys Imaging Radiat Oncol
Pays: Netherlands
ID NLM: 101704276
Informations de publication
Date de publication:
Apr 2023
Apr 2023
Historique:
received:
28
10
2022
revised:
01
05
2023
accepted:
02
05
2023
medline:
1
6
2023
pubmed:
1
6
2023
entrez:
1
6
2023
Statut:
epublish
Résumé
Radiomics models trained with limited single institution data are often not reproducible and generalisable. We developed radiomics models that predict loco-regional recurrence within two years of radiotherapy with private and public datasets and their combinations, to simulate small and multi-institutional studies and study the responsiveness of the models to feature selection, machine learning algorithms, centre-effect harmonization and increased dataset sizes. 562 patients histologically confirmed and treated for locally advanced head-and-neck cancer (LA-HNC) from two public and two private datasets; one private dataset exclusively reserved for validation. Clinical contours of primary tumours were not recontoured and were used for Pyradiomics based feature extraction. ComBat harmonization was applied, and LASSO-Logistic Regression (LR) and Support Vector Machine (SVM) models were built. 95% confidence interval (CI) of 1000 bootstrapped area-under-the-Receiver-operating-curves (AUC) provided predictive performance. Responsiveness of the models' performance to the choice of feature selection methods, ComBat harmonization, machine learning classifier, single and pooled data was evaluated. LASSO and SelectKBest selected 14 and 16 features, respectively; three were overlapping. Without ComBat, the LR and SVM models for three institutional data showed AUCs (CI) of 0.513 (0.481-0.559) and 0.632 (0.586-0.665), respectively. Performances following ComBat revealed AUCs of 0.559 (0.536-0.590) and 0.662 (0.606-0.690), respectively. Compared to single cohort AUCs (0.562-0.629), SVM models from pooled data performed significantly better at AUC = 0.680. Multi-institutional retrospective data accentuates the existing variabilities that affect radiomics. Carefully designed prospective, multi-institutional studies and data sharing are necessary for clinically relevant head-and-neck cancer prognostication models.
Sections du résumé
Background and purpose
UNASSIGNED
Radiomics models trained with limited single institution data are often not reproducible and generalisable. We developed radiomics models that predict loco-regional recurrence within two years of radiotherapy with private and public datasets and their combinations, to simulate small and multi-institutional studies and study the responsiveness of the models to feature selection, machine learning algorithms, centre-effect harmonization and increased dataset sizes.
Materials and methods
UNASSIGNED
562 patients histologically confirmed and treated for locally advanced head-and-neck cancer (LA-HNC) from two public and two private datasets; one private dataset exclusively reserved for validation. Clinical contours of primary tumours were not recontoured and were used for Pyradiomics based feature extraction. ComBat harmonization was applied, and LASSO-Logistic Regression (LR) and Support Vector Machine (SVM) models were built. 95% confidence interval (CI) of 1000 bootstrapped area-under-the-Receiver-operating-curves (AUC) provided predictive performance. Responsiveness of the models' performance to the choice of feature selection methods, ComBat harmonization, machine learning classifier, single and pooled data was evaluated.
Results
UNASSIGNED
LASSO and SelectKBest selected 14 and 16 features, respectively; three were overlapping. Without ComBat, the LR and SVM models for three institutional data showed AUCs (CI) of 0.513 (0.481-0.559) and 0.632 (0.586-0.665), respectively. Performances following ComBat revealed AUCs of 0.559 (0.536-0.590) and 0.662 (0.606-0.690), respectively. Compared to single cohort AUCs (0.562-0.629), SVM models from pooled data performed significantly better at AUC = 0.680.
Conclusions
UNASSIGNED
Multi-institutional retrospective data accentuates the existing variabilities that affect radiomics. Carefully designed prospective, multi-institutional studies and data sharing are necessary for clinically relevant head-and-neck cancer prognostication models.
Identifiants
pubmed: 37260438
doi: 10.1016/j.phro.2023.100450
pii: S2405-6316(23)00041-6
pmc: PMC10227455
doi:
Types de publication
Journal Article
Langues
eng
Pagination
100450Subventions
Organisme : DBT-Wellcome Trust India Alliance
ID : IA/E/18/1/504306
Pays : India
Informations de copyright
© 2023 The Authors. Published by Elsevier B.V. on behalf of European Society of Radiotherapy & Oncology.
Déclaration de conflit d'intérêts
The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
Références
Oncotarget. 2017 Mar 17;8(33):55600-55612
pubmed: 28903447
PLoS One. 2019 Sep 19;14(9):e0222509
pubmed: 31536526
Front Oncol. 2019 Mar 27;9:174
pubmed: 30972291
Transl Cancer Res. 2016 Aug;5(4):371-382
pubmed: 30627523
J Clin Med. 2022 Dec 24;12(1):
pubmed: 36614941
Front Oncol. 2021 Mar 29;11:633176
pubmed: 33854969
J Nucl Med. 2022 Feb;63(2):172-179
pubmed: 34531263
Oral Oncol. 2017 Feb;65:8-15
pubmed: 28109473
Insights Imaging. 2020 Aug 12;11(1):91
pubmed: 32785796
Cell Death Dis. 2019 Jul 15;10(8):540
pubmed: 31308358
Sci Rep. 2015 Jun 05;5:11044
pubmed: 26251068
Sci Rep. 2022 Jul 26;12(1):12762
pubmed: 35882891
Cancer Epidemiol Biomarkers Prev. 2021 Jun;30(6):1035-1047
pubmed: 33849968
Med Phys. 2020 Nov;47(11):5931-5940
pubmed: 32521049
Radiother Oncol. 2020 Mar;144:148-151
pubmed: 31812929
J Med Phys. 2021 Jul-Sep;46(3):181-188
pubmed: 34703102
Radiology. 2019 Apr;291(1):53-59
pubmed: 30694160
PLoS One. 2021 Jul 1;16(7):e0253653
pubmed: 34197503
Sci Rep. 2017 Aug 31;7(1):10117
pubmed: 28860628
Insights Imaging. 2022 Feb 24;13(1):28
pubmed: 35201534
Cancer Res. 2017 Nov 1;77(21):e104-e107
pubmed: 29092951
Invest Radiol. 2022 Jul 1;57(7):433-443
pubmed: 35045555
Sci Rep. 2019 Feb 26;9(1):2764
pubmed: 30809047