Dimensional reduction based on peak fitting of Raman micro spectroscopy data improves detection of prostate cancer in tissue specimens.
Raman micro-spectroscopy
feature reduction
feature selection
machine learning
prostate cancer
Journal
Journal of biomedical optics
ISSN: 1560-2281
Titre abrégé: J Biomed Opt
Pays: United States
ID NLM: 9605853
Informations de publication
Date de publication:
11 2021
11 2021
Historique:
received:
30
06
2021
accepted:
04
10
2021
entrez:
7
11
2021
pubmed:
8
11
2021
medline:
18
11
2021
Statut:
ppublish
Résumé
Prostate cancer is the most common cancer among men. An accurate diagnosis of its severity at detection plays a major role in improving their survival. Recently, machine learning models using biomarkers identified from Raman micro-spectroscopy discriminated intraductal carcinoma of the prostate (IDC-P) from cancer tissue with a ≥85 % detection accuracy and differentiated high-grade prostatic intraepithelial neoplasia (HGPIN) from IDC-P with a ≥97.8 % accuracy. To improve the classification performance of machine learning models identifying different types of prostate cancer tissue using a new dimensional reduction technique. A radial basis function (RBF) kernel support vector machine (SVM) model was trained on Raman spectra of prostate tissue from a 272-patient cohort (Centre hospitalier de l'Université de Montréal, CHUM) and tested on two independent cohorts of 76 patients [University Health Network (UHN)] and 135 patients (Centre hospitalier universitaire de Québec-Université Laval, CHUQc-UL). Two types of engineered features were used. Individual intensity features, i.e., Raman signal intensity measured at particular wavelengths and novel Raman spectra fitted peak features consisting of peak heights and widths. Combining engineered features improved classification performance for the three aforementioned classification tasks. The improvements for IDC-P/cancer classification for the UHN and CHUQc-UL testing sets in accuracy, sensitivity, specificity, and area under the curve (AUC) are (numbers in parenthesis are associated with the CHUQc-UL testing set): +4 % (+8 % ), +7 % (+9 % ), +2 % (6%), +9 (+9) with respect to the current best models. Discrimination between HGPIN and IDC-P was also improved in both testing cohorts: +2.2 % (+1.7 % ), +4.5 % (+3.6 % ), +0 % (+0 % ), +2.3 (+0). While no global improvements were obtained for the normal versus cancer classification task [+0 % (-2 % ), +0 % (-3 % ), +2 % (-2 % ), +4 (+3)], the AUC was improved in both testing sets. Combining individual intensity features and novel Raman fitted peak features, improved the classification performance on two independent and multicenter testing sets in comparison to using only individual intensity features.
Identifiants
pubmed: 34743445
pii: JBO-210212R
doi: 10.1117/1.JBO.26.11.116501
pmc: PMC8571651
doi:
Types de publication
Journal Article
Multicenter Study
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Références
Am J Surg Pathol. 2020 May;44(5):673-680
pubmed: 31876580
J Phys Chem B. 2006 Oct 5;110(39):19625-31
pubmed: 17004830
Analyst. 2019 Nov 4;144(22):6517-6532
pubmed: 31647061
Br J Cancer. 2003 Jul 7;89(1):106-8
pubmed: 12838309
Appl Spectrosc. 2010 Jun;64(6):595-600
pubmed: 20537226
BJU Int. 2010 Aug;106(4):484-8
pubmed: 20201840
Phys Med Biol. 2016 Dec 7;61(23):R370-R400
pubmed: 27804917
J Biomed Opt. 2013 Aug;18(8):87001
pubmed: 23907278
Cancer Metastasis Rev. 2014 Sep;33(2-3):673-93
pubmed: 24510129
Br J Cancer. 2005 Jun 20;92(12):2166-70
pubmed: 15928665
Analyst. 2010 Dec;135(12):3060-9
pubmed: 20949203
Biometrics. 1988 Sep;44(3):837-45
pubmed: 3203132
PLoS Med. 2020 Aug 14;17(8):e1003281
pubmed: 32797086
Eur Urol. 2016 Jul;70(1):106-119
pubmed: 26996659