Eigen-entropy based time series signatures to support multivariate time series classification.

Correlation coefficient Dense multi scale entropy Eigen-entropy Eigenvalue Multivariate time series classification Time series signatures

Journal

Scientific reports

ISSN: 2045-2322

Titre abrégé: Sci Rep

Pays: England

ID NLM: 101563288

Informations de publication

Date de publication:
12 Jul 2024

Historique:

received: 24 03 2024

accepted: 05 07 2024

medline: 12 7 2024

pubmed: 12 7 2024

entrez: 11 7 2024

Statut: epublish

Résumé

Most current algorithms for multivariate time series classification tend to overlook the correlations between time series of different variables. In this research, we propose a framework that leverages Eigen-entropy along with a cumulative moving window to derive time series signatures to support the classification task. These signatures are enumerations of correlations among different time series considering the temporal nature of the dataset. To manage dataset's dynamic nature, we employ preprocessing with dense multi scale entropy. Consequently, the proposed framework, Eigen-entropy-based Time Series Signatures, captures correlations among multivariate time series without losing its temporal and dynamic aspects. The efficacy of our algorithm is assessed using six binary datasets sourced from the University of East Anglia, in addition to a publicly available gait dataset and an institutional sepsis dataset from the Mayo Clinic. We use recall as the evaluation metric to compare our approach against baseline algorithms, including dependent dynamic time warping with 1 nearest neighbor and multivariate multi-scale permutation entropy. Our method demonstrates superior performance in terms of recall for seven out of the eight datasets.

Identifiants

DOI: 10.1038/s41598-024-66953-7 PMID: 38992044

pubmed: 38992044

doi: 10.1038/s41598-024-66953-7

pii: 10.1038/s41598-024-66953-7

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

16076

Subventions

Organisme : National Science Foundation (NSF) - Partnership for Innovation - "Avoiding Kidney Injuries"

ID : 2122901

Informations de copyright

Références

Keogh, E. & Ratanamahatana, C. A. Exact indexing of dynamic time warping. Knowl. Inf. Syst. 7, 358–386 (2005).

doi: 10.1007/s10115-004-0154-9

Wang, Z., Yan, W. & Oates, T. Time series classification from scratch with deep neural networks: A strong baseline. In 2017 International Joint Conference on Neural Networks (IJCNN), 1578–1585 (IEEE, 2017).

Ruiz, A. P., Flynn, M., Large, J., Middlehurst, M. & Bagnall, A. The great multivariate time series classification bake off: a review and experimental evaluation of recent algorithmic advances. Data Min. Knowl. Discov. 35, 401–449 (2021).

pubmed: 33679210 doi: 10.1007/s10618-020-00727-3

Shokoohi-Yekta, M., Hu, B., Jin, H., Wang, J. & Keogh, E. Generalizing dtw to the multi-dimensional case requires an adaptive approach. Data Min. Knowl. Discov. 31, 1–31 (2017).

pubmed: 29104448 doi: 10.1007/s10618-016-0455-0

Lin, J., Keogh, E., Wei, L. & Lonardi, S. Experiencing sax: a novel symbolic representation of time series. Data Min. Knowl. Discov. 15, 107–144 (2007).

doi: 10.1007/s10618-007-0064-z

Baydogan, M. G., Runger, G. & Tuv, E. A bag-of-features framework to classify time series. IEEE Trans. Pattern Anal. Mach. Intell. 35, 2796–2802 (2013).

pubmed: 24051736 doi: 10.1109/TPAMI.2013.72

Schäfer, P. The boss is concerned with time series classification in the presence of noise. Data Min. Knowl. Discov. 29, 1505–1530 (2015).

doi: 10.1007/s10618-014-0377-7

Lines, J. & Bagnall, A. Time series classification with ensembles of elastic distance measures. Data Min. Knowl. Discov. 29, 565–592 (2015).

doi: 10.1007/s10618-014-0361-2

Bagnall, A., Lines, J., Hills, J. & Bostrom, A. Time-series classification with cote: the collective of transformation-based ensembles. IEEE Trans. Knowl. Data Eng. 27, 2522–2535 (2015).

doi: 10.1109/TKDE.2015.2416723

Middlehurst, M., Large, J. & Bagnall, A. The canonical interval forest (cif) classifier for time series classification. In 2020 IEEE International Conference on Big Data (Big Data), 188–195 (IEEE, 2020).

Deng, H., Runger, G., Tuv, E. & Vladimir, M. A time series forest for classification and feature extraction. Inf. Sci. 239, 142–153 (2013).

doi: 10.1016/j.ins.2013.02.030

Lubba, C. H. et al. catch22: Canonical time-series characteristics: Selected through highly comparative time-series analysis. Data Min. Knowl. Discov. 33, 1821–1852 (2019).

doi: 10.1007/s10618-019-00647-x

Cai, F. et al. Stride: Systematic radar intelligence analysis for adrd risk evaluation with gait signature simulation and deep learning. IEEE Sens. J. (2023).

He, K., Zhang, X., Ren, S. & Sun, J. Deep residual learning for image recognition. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 770–778 (2016).

Szegedy, C. et al. Going deeper with convolutions. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, 1–9 (2015).

IsmailFawaz, H. et al. Finding alexnet for time series classification. Inceptiontime. Data Min. Knowl. Discov. 34, 1936–1962 (2020).

doi: 10.1007/s10618-020-00710-y

Dempster, A., Petitjean, F. & Webb, G. I. Rocket: exceptionally fast and accurate time series classification using random convolutional kernels. Data Min. Knowl. Discov. 34, 1454–1495 (2020).

doi: 10.1007/s10618-020-00701-z

Bandt, C. & Pompe, B. Permutation entropy: A natural complexity measure for time series. Phys. Rev. Lett. 88, 174102 (2002).

pubmed: 12005759 doi: 10.1103/PhysRevLett.88.174102

Pincus, S. M., Gladstone, I. M. & Ehrenkranz, R. A. A regularity statistic for medical data analysis. J. Clin. Monit. 7, 335–345 (1991).

pubmed: 1744678 doi: 10.1007/BF01619355

CuestaFrau, D. et al. Classification of fever patterns using a single extracted entropy feature: A feasibility study based on sample entropy. Math. Biosci. Eng. 17, 235–249 (2019).

pubmed: 31731349 doi: 10.3934/mbe.2020013

Rostaghi, M. & Azami, H. Dispersion entropy: A measure for time-series analysis. IEEE Signal Process. Lett. 23, 610–614 (2016).

doi: 10.1109/LSP.2016.2542881

Chen, W., Wang, Z., Xie, H. & Yu, W. Characterization of surface emg signal based on fuzzy entropy. IEEE Trans. Neural Syst. Rehabil. Eng. 15, 266–272 (2007).

pubmed: 17601197 doi: 10.1109/TNSRE.2007.897025

Goldberger, A. L., Peng, C.-K. & Lipsitz, L. A. What is physiologic complexity and how does it change with aging and disease?. Neurobiol. Aging 23, 23–26 (2002).

pubmed: 11755014 doi: 10.1016/S0197-4580(01)00266-4

Costa, M., Goldberger, A. L. & Peng, C.-K. Multiscale entropy analysis of complex physiologic time series. Phys. Rev. Lett. 89, 068102 (2002).

pubmed: 12190613 doi: 10.1103/PhysRevLett.89.068102

Zhao, D. et al. Dense multi-scale entropy and it’s application in mechanical fault diagnosis. Meas. Sci. Technol. 31, 125008 (2020).

doi: 10.1088/1361-6501/aba4da

Keller, K. & Lauffer, H. Symbolic analysis of high-dimensional time series. Int. J. Bifurcation Chaos 13, 2657–2668 (2003).

doi: 10.1142/S0218127403008168

He, S., Sun, K. & Wang, H. Multivariate permutation entropy and its application for complexity analysis of chaotic systems. Physica A 461, 812–823 (2016).

doi: 10.1016/j.physa.2016.06.012

Weerakody, P. B., Wong, K. W., Wang, G. & Ela, W. A review of irregular time series data handling with gated recurrent neural networks. Neurocomputing 441, 161–178. https://doi.org/10.1016/j.neucom.2021.02.046 (2021).

doi: 10.1016/j.neucom.2021.02.046

Liu, Z. Time Series Modeling of iIrregularly Sampled Multivariate Clinical Data. Ph.D. thesis, University of Pittsburgh (2016).

Liu, Z. & Hauskrecht, M. Clinical time series prediction: Toward a hierarchical dynamical system framework. Artif. Intell. Med. 65, 5–18. https://doi.org/10.1016/j.artmed.2014.10.005 (2015).

doi: 10.1016/j.artmed.2014.10.005 pubmed: 25534671

Wei, S. J., Al Riza, D. F. & Nugroho, H. Comparative study on the performance of deep learning implementation in the edge computing: Case study on the plant leaf disease identification. J. Agric. Food Res. 10, 100389 (2022).

Gupta, A., Anand, A. & Hasija, Y. Recall-based machine learning approach for early detection of cervical cancer. In 2021 6th International Conference for Convergence in Technology (I2CT), 1–5, https://doi.org/10.1109/I2CT51068.2021.9418099 (2021).

Lines, J., Taylor, S. & Bagnall, A. Hive-cote: The hierarchical vote collective of transformation-based ensembles for time series classification. In 2016 IEEE 16th International Conference on Data Mining (ICDM), 1041–1046 (IEEE, 2016).

Cabello, N., Naghizade, E., Qi, J. & Kulik, L. Fast, accurate and explainable time series classification through randomization. Data Min. Knowl. Discov. 1, 1–64 (2023).

Reyna, M. A. et al. Early prediction of sepsis from clinical data: The physionet/computing in cardiology challenge 2019. Crit. Care Med. 48, 210–217 (2020).

pubmed: 31939789 pmcid: 6964870 doi: 10.1097/CCM.0000000000004145

Huang, J. et al. Eigen-entropy: A metric for multivariate sampling decisions. Inf. Sci. 619, 84–97 (2023).

doi: 10.1016/j.ins.2022.11.023

Bagnall, A. et al. The uea multivariate time series classification archive, 2018. arXiv:1811.00075 (2018).

Goldberger, A. L. et al. Physiobank, physiotoolkit, and physionet: Components of a new research resource for complex physiologic signals. Circulation 101, e215–e220 (2000).

pubmed: 10851218 doi: 10.1161/01.CIR.101.23.e215

Gantmacher, F. The Theory of Matrices Vol. 1 (Chelsea Publishing Company, 1977).

Bier, A., Jastrzębska, A. & Olszewski, P. Variable-length multivariate time series classification using rocket: A case study of incident detection. IEEE Access 10, 95701–95715 (2022).

doi: 10.1109/ACCESS.2022.3203523

Pedregosa, F. Scikit-learn: Machine learning in python fabian. J. Mach. Learn. Res. 12, 2825 (2011).

Khanam, J. J. & Foo, S. Y. A comparison of machine learning algorithms for diabetes prediction. Ict Express 7, 432–439 (2021).

doi: 10.1016/j.icte.2021.02.004

Chen, R.-C., Dewi, C., Huang, S.-W. & Caraka, R. E. Selecting critical features for data classification based on machine learning methods. J. Big Data 7, 52 (2020).

doi: 10.1186/s40537-020-00327-4

Osisanwo, F. et al. Supervised machine learning algorithms: Classification and comparison. Int. J. Comput. Trends Technol. (IJCTT) 48, 128–138 (2017).

doi: 10.14445/22312803/IJCTT-V48P126

Hashim, A. S., Awadh, W. A. & Hamoud, A. K. Student performance prediction model based on supervised machine learning algorithms. in IOP Conference Series: Materials Science and Engineering, vol. 928, 032019 (IOP Publishing, 2020).

Uddin, S., Khan, A., Hossain, M. E. & Moni, M. A. Comparing different supervised machine learning algorithms for disease prediction. BMC Med. Inform. Decis. Mak. 19, 1–16 (2019).

doi: 10.1186/s12911-019-1004-8

Morabito, F. C. et al. Multivariate multi-scale permutation entropy for complexity analysis of alzheimer’s disease eeg. Entropy 14, 1186–1202 (2012).

doi: 10.3390/e14071186

Huang, J. & Ling, C. X. Using auc and accuracy in evaluating learning algorithms. IEEE Trans. Knowl. Data Eng. 17, 299–310 (2005).

doi: 10.1109/TKDE.2005.50

Sambasivam, G. & Opiyo, G. D. A predictive machine learning application in agriculture: Cassava disease detection and classification with imbalanced dataset using convolutional neural networks. Egypt. Inform. J. 22, 27–34 (2021).

doi: 10.1016/j.eij.2020.02.007

Chertow, G. M., Burdick, E., Honour, M., Bonventre, J. V. & Bates, D. W. Acute kidney injury, mortality, length of stay, and costs in hospitalized patients. J. Am. Soc. Nephrol. 16, 3365–3370 (2005).

pubmed: 16177006 doi: 10.1681/ASN.2004090740

Luo, X. et al. A comparison of different diagnostic criteria of acute kidney injury in critically ill patients. Crit. Care 18, 1–8 (2014).

doi: 10.1186/cc13977

Aronson, S., Fontes, M. L., Miao, Y. & Mangano, D. T. Risk index for perioperative renal dysfunction/failure: Critical dependence on pulse pressure hypertension. Circulation 115, 733–742 (2007).

pubmed: 17283267 doi: 10.1161/CIRCULATIONAHA.106.623538

Lewington, A. J., Cerdá, J. & Mehta, R. L. Raising awareness of acute kidney injury: A global perspective of a silent killer. Kidney Int. 84, 457–467 (2013).

pubmed: 23636171 pmcid: 3758780 doi: 10.1038/ki.2013.153

Bagshaw, S. M., George, C., Dinu, I. & Bellomo, R. A multi-centre evaluation of the rifle criteria for early acute kidney injury in critically ill patients. Nephrol. Dial. Transplant. 23, 1203–1210 (2008).

pubmed: 17962378 doi: 10.1093/ndt/gfm744

Lind, M. et al. Rapid detection of acute kidney injury (aki) in hospitalized patients. in Proceedings of the 5th Annual ABRC-Flinn Research Conference (2020).

Delacre, M., Lakens, D. & Leys, C. Why psychologists should by default use welch’s t-test instead of student’s t-test. Int. Rev. Soc. Psychol. 30, 92–101 (2017).

doi: 10.5334/irsp.82

Eigen-entropy based time series signatures to support multivariate time series classification.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Subventions

Informations de copyright

Références

Auteurs

Abhidnya Patharkar (A)

Jiajing Huang (J)

Teresa Wu (T)

Erica Forzani (E)

Leslie Thomas (L)

Marylaura Lind (M)

Naomi Gades (N)

Classifications MeSH