Multi-group diagnostic classification of high-dimensional data using differential scanning calorimetry plasma thermograms.
Journal
PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081
Informations de publication
Date de publication:
2019
2019
Historique:
received:
19
03
2019
accepted:
23
07
2019
entrez:
21
8
2019
pubmed:
21
8
2019
medline:
24
3
2020
Statut:
epublish
Résumé
The thermoanalytical technique differential scanning calorimetry (DSC) has been applied to characterize protein denaturation patterns (thermograms) in blood plasma samples and relate these to a subject's health status. The analysis and classification of thermograms is challenging because of the high-dimensionality of the dataset. There are various methods for group classification using high-dimensional data sets; however, the impact of using high-dimensional data sets for cancer classification has been poorly understood. In the present article, we proposed a statistical approach for data reduction and a parametric method (PM) for modeling of high-dimensional data sets for two- and three- group classification using DSC and demographic data. We compared the PM to the non-parametric classification method K-nearest neighbors (KNN) and the semi-parametric classification method KNN with dynamic time warping (DTW). We evaluated the performance of these methods for multiple two-group classifications: (i) normal versus cervical cancer, (ii) normal versus lung cancer, (iii) normal versus cancer (cervical + lung), (iv) lung cancer versus cervical cancer as well as for three-group classification: normal versus cervical cancer versus lung cancer. In general, performance for two-group classification was high whereas three-group classification was more challenging, with all three methods predicting normal samples more accurately than cancer samples. Moreover, specificity of the PM method was mostly higher or the same as KNN and DTW-KNN with lower sensitivity. The performance of KNN and DTW-KNN decreased with the inclusion of demographic data, whereas similar performance was observed for the PM which could be explained by the fact that the PM uses fewer parameters as compared to KNN and DTW-KNN methods and is thus less susceptible to the risk of overfitting. More importantly the accuracy of the PM can be increased by using a greater number of quantile data points and by the inclusion of additional demographic and clinical data, providing a substantial advantage over KNN and DTW-KNN methods.
Identifiants
pubmed: 31430304
doi: 10.1371/journal.pone.0220765
pii: PONE-D-19-07952
pmc: PMC6701772
doi:
Substances chimiques
Blood Proteins
0
Types de publication
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Langues
eng
Sous-ensembles de citation
IM
Pagination
e0220765Subventions
Organisme : NIGMS NIH HHS
ID : P20 GM103482
Pays : United States
Organisme : NCRR NIH HHS
ID : P20 RR018733
Pays : United States
Organisme : NIAID NIH HHS
ID : R01 AI129959
Pays : United States
Organisme : NCI NIH HHS
ID : R21 CA187345
Pays : United States
Déclaration de conflit d'intérêts
NCG is a co-inventor on a patent application describing approaches for the analysis of DSC plasma thermogram data and their use for diagnostic classification (Garbett, N.C., and Brock, G.N. “Methods of Characterizing and/or Predicting Risk Associated with a Biological Sample Using Thermal Stability Profiles,” U.S. PCT Application PCT/US16/57416, Oct. 2016). NCG is a consultant for TA Instruments, Inc., a supplier of calorimetry instrumentation but not the supplier of the DSC instrument used to collect data for this study. This does not alter the authors’ adherence to all journal policies on sharing data and materials.
Références
IEEE Trans Pattern Anal Mach Intell. 2007 Jun;29(6):1035-51
pubmed: 17431301
Biophys J. 2008 Feb 15;94(4):1377-83
pubmed: 17951300
Clin Chem. 2007 Nov;53(11):2012-4
pubmed: 18030697
Semin Nephrol. 2007 Nov;27(6):621-6
pubmed: 18061844
Exp Mol Pathol. 2009 Jun;86(3):186-91
pubmed: 19146849
Biophys Chem. 2010 Nov;152(1-3):184-90
pubmed: 20961680
Anal Chem. 2011 Oct 15;83(20):7992-8
pubmed: 21928840
Neurosurgery. 2013 Aug;73(2):289-95; discussion 295
pubmed: 23624408
Biochim Biophys Acta. 2013 Oct;1830(10):4675-80
pubmed: 23665587
PLoS One. 2014 Jan 08;9(1):e84710
pubmed: 24416269
Sci Rep. 2015 Jan 23;5:7988
pubmed: 25614381
Biochim Biophys Acta. 2016 May;1860(5):981-989
pubmed: 26459005
PLoS One. 2017 Nov 9;12(11):e0186232
pubmed: 29121669
Biochim Biophys Acta Gen Subj. 2018 Aug;1862(8):1701-1710
pubmed: 29705200