Optimal feature selection for heart disease prediction using modified Artificial Bee colony (M-ABC) and K-nearest neighbors (KNN).

Feature selection Healthcare Heart disease prediction Privacy awareness

Journal

Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288

Informations de publication

Date de publication:
31 Oct 2024
Historique:
received: 23 06 2024
accepted: 28 10 2024
medline: 1 11 2024
pubmed: 1 11 2024
entrez: 1 11 2024
Statut: epublish

Résumé

Heart disease is a complex and widespread illness that affects a significant number of people worldwide. Machine learning provides a way forward for early heart disease diagnosis. A classification model has been developed for the present study to predict heart disease. The attribute selection was done using a modified bee algorithm. Using the proposed model, practitioners can accurately predict heart disease and make informed decisions about patient health. In our study, we have proposed a framework based on Modified Artificial Bee Colony (M-ABC) and k-Nearest Neighbors (KNN) for predicting the optimal feature selection to obtain better accuracy. Using a modified bee algorithm, this paper focuses on identifying the optimal subset of attributes from the dataset. Specifically, during the classification-training phase, only the features that provide significant information are retained. The proposed study not only improves classification accuracy but also reduces training time for classifiers.

Identifiants

pubmed: 39482391
doi: 10.1038/s41598-024-78021-1
pii: 10.1038/s41598-024-78021-1
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

26241

Informations de copyright

© 2024. The Author(s).

Références

Khan, J., Khan, M. A., Jhanjhi, N. Z., Humayun, M. & Alourani, A. Smart-city-based data fusion algorithm for internet of things. Comput. Mater. Contin. 73, 2407–2421 (2022).
Dash, S., Shakyawar, S. K., Sharma, M. & Kaushik, S. Big data in healthcare: management, analysis and future prospects. J. Big Data 6, 54 (2019).
doi: 10.1186/s40537-019-0217-0
Khan, M. A. An IoT framework for heart disease prediction based on MDCNN classifier. IEEE Access 8, 34717–34727 (2020).
doi: 10.1109/ACCESS.2020.2974687
Mohan, S., Thirumalai, C. & Srivastava, G. Effective heart disease prediction using hybrid machine learning techniques. IEEE Access 7, 81542–81554 (2019).
doi: 10.1109/ACCESS.2019.2923707
Anbarasi, M., Anupriya, E. & Iyengar, N. C. S. N. Enhanced prediction of heart disease with feature subset selection using genetic algorithm. Int. J. Eng. Sci. Technol. 2 (10), 5370–5376 (2010).
Liu, X. et al. A hybrid classification system for heart disease diagnosis based on the RFRS method. Comput. Math. Methods Med. https://doi.org/10.1155/2017/8272091 (2017).
doi: 10.1155/2017/8272091 pubmed: 29348781 pmcid: 5734001
Tomar, D. & Agarwal, S. Feature selection based least square twin support vector machine for diagnosis of heart disease. Int. J. Bio-Sci. Bio-Technol. 6 (2), 69–82 (2014).
doi: 10.14257/ijbsbt.2014.6.2.07
Karayılan, T. & Kılıç, Ö. Prediction of heart disease using neural network. In Computer Science and Engineering (UBMK), 2017 International Conference, 719–723 (IEEE, 2017). https://doi.org/10.1109/UBMK.2017.8093512 .
Polat, K. & Güneş, S. A new feature selection method on classification of medical datasets: Kernel F-score feature selection. Expert Syst. Appl. 36 (7), 10367–10373 (2009).
doi: 10.1016/j.eswa.2009.01.041
Li, J., et al. Heart disease identification method using machine learning classification in e-healthcare. IEEE Access 8, 107562–107582 (2020).
doi: 10.1109/ACCESS.2020.3001149
Li, Y., Li, T. & Liu, H. Recent advances in feature selection and its applications. Knowl. Inf. Syst., vol. 53 (3), 551–577 (2017).
Raschka, S. Model evaluation model selection and algorithm selection in machine learning. http://arxiv.org/abs/1811.12808. (2018).
Khan, M. A. & Algarni, F. A healthcare monitoring system for the diagnosis of heart disease in the IoMT cloud environment using MSSO-ANFIS. IEEE Access 8, 122259–122269 (2020).
doi: 10.1109/ACCESS.2020.3006424
Mukherjee, R., Sahana, S. K., Kumar, S., Agrawal, S. & Singh, S. Application of different decision tree classifier for diabetes prediction: a machine learning approach. In Proc. of 4th International Conference on Frontiers in Computing and Systems. COMSYS 2023. Lecture Notes in Networks and Systems, (Kole, D.K., Roy Chowdhury, S., Basu, S., Plewczynski, D., Bhattacharjee, D. eds) vol 974 (Springer, 2024). https://doi.org/10.1007/978-981-97-2611-0_4 .
Haq, A. U. et al. Intelligent machine learning approach for effective recognition of diabetes in E-healthcare using clinical data. Sensors 20 (9), 2649 (2020).
doi: 10.3390/s20092649 pubmed: 32384737 pmcid: 7249007
Dhilsath, F. M. & Samuel, S. J. Hyperparameter tuning of ensemble classifiers using grid search and random search for prediction of heart disease. In Computational Intelligence and Healthcare Informatics (O.P. Jena, A.R. Tripathy, A.A. Elngar and Z. Polkowski eds). https://doi.org/10.1002/9781119818717.ch8 (2021).
Meng, Y. et al. A machine learning approach to classifying self-reported health status in a cohort of patients with heart disease using activity tracker data. IEEE J. Biomed. Health Inf. 24 (3), 878–884 (2020).
doi: 10.1109/JBHI.2019.2922178
Boateng, E., Otoo, J. & Abaye, D. Basic tenets of classification algorithms K-Nearest-Neighbor, Support Vector Machine, Random Forest and Neural Network: a review. J. Data Anal. Inform. Process. 8, 341–357. https://doi.org/10.4236/jdaip.2020.84020 (2020).
doi: 10.4236/jdaip.2020.84020
Hashi, E. K. & Zaman, M. S. U. Developing a hyperparameter tuning based machine learning approach of heart disease prediction. J. Appl. Sci. Process. Eng. 7 (2), 631–647 (2020).
doi: 10.33736/jaspe.2639.2020
Modak, S., Abdel-Raheem, E. & Rueda, L. Heart disease prediction using adaptive infinite feature selection and deep neural networks. In 2022 International Conference on Artificial Intelligence in Information and Communication (ICAIIC), 235–240. https://doi.org/10.1109/ICAIIC54071.2022.9722652 (2022).
Kishor, A. & Jeberson, W. Diagnosis of heart disease using Internet of things and machine learning algorithms. In Proc. of the Second International Conference on Computing, Communications, and Cyber-Security, 691–702 (Springer, 2021).
Dun, B., Wang, E. & Majumder, S. Heart disease diagnosis on medical data using ensemble learning. Comput. Sci. 1 (1), 1–5 (2016).
Rabbi, M. F., Uddin, M. P., Ali, M. A. & Kibria, M. F. Performance evaluation of data mining classification techniques for heart disease prediction. Amer. J. Eng. Res. 7 (2), 278–283 (2018).
Ramalingam, V. V., Dandapath, A. & Raja, M. K. Heart disease prediction using machine learning techniques: a survey. Int. J. Eng. Technol. 7 (2.8), 684–687 (2018).
Pouriyeh, S. et al. A comprehensive investigation and comparison of machine learning techniques in the domain of heart disease. In 2017 IEEE Symposium on Computers and Communications (ISCC),204–207 (2017).
Fix, E. & Hodges, J. L. Discriminatory analysis. Nonparametric discrimination: consistency properties. Int. Stat. Rev. Rev. Int. Stat. 57 (3), 238–247 (1989).
Palaniappan, S. & Awang, R. Intelligent heart disease prediction system using data mining techniques. In 2008 IEEE/ACS International Conference on Computer Systems and Applications, 108–115 (2008).
Rabbi, M. F. et al. Performance evaluation of data mining classification techniques for heart disease prediction. Am. J. Eng. Res. 7 (2), 278–283 (2018).
Haq, A. U., Li, J. P., Memon, M. H., Nazir, S. & Sun, R. A hybrid intelligent system framework for the prediction of heart disease using machine learning algorithms. Mobile Inf. Syst. 2018, 21 (2018).
Amin, M. S., Chiam, Y. K. & Varathan, K. D. Identification of significant features and data mining techniques in predicting heart disease. Telemat. Inform 36, 82–93 (2019).
Ahmed, M. H., Hongou, F., Mohamed, L. & Khan, A. Evaluating the efficacy of Deep Learning architectures in Predicting Traffic patterns for Smart City Development. J. Artif. Intell. Metaheurist. 26–35. https://doi.org/10.54216/JAIM.060203 (2023).
Towfek, S., Khodadadi, N., Abualigah, L. & Rizk, F. AI in higher education: insights from student surveys and predictive analytics using PSO-guided WOA and linear regression. J. Artif. Intell. Eng. Pract. 1 (1), 1–17. https://doi.org/10.21608/jaiep.2024.354003 (2024).
doi: 10.21608/jaiep.2024.354003
El-Kenawy, E. S. M. et al. Greylag goose optimization: nature-inspired optimization algorithm. Expert Syst. Appl. 238, 122147 (2024).
doi: 10.1016/j.eswa.2023.122147
Abdollahzadeh, B. et al. Puma optimizer (PO): a novel metaheuristic optimization algorithm and its application in machine learning. Cluster Comput. 27, 5235–5283. https://doi.org/10.1007/s10586-023-04221-5 (2024).
doi: 10.1007/s10586-023-04221-5
Khan, M. A. et al. . Asynchronous federated learning for improved cardiovascular disease prediction using artificial intelligence. Diagnostics 13, 2340. https://doi.org/10.3390/diagnostics13142340 (2023).
doi: 10.3390/diagnostics13142340 pubmed: 37510084 pmcid: 10377760
Panniem, A. & Puphasuk, P. A modified artificial bee colony algorithm with firefly algorithm strategy for continuous optimization problems. J. Appl. Math. 2018, 1237823 (2018).

Auteurs

Muhammad Amir Khan (MA)

School of Computing Sciences, College of Computing, Informatics, and Mathematics, Universiti Teknologi MARA, 40450, Shah Alam, Selangor, Malaysia.

Tehseen Mazhar (T)

Department of Computer Science and Information Technology, School Education Department,Government of Punjab, Layyah 31200, Pakistan. tehseenmazhar719@gmail.com.

Muhammad Mateen Yaqoob (M)

Department of AI and Data Science, FAST-National university of Computer and emerging sciences, Islamabad, Pakistan.

Muhammad Badruddin Khan (M)

Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), 11432, Riyadh, Saudi Arabia. mbkhan@imamu.edu.sa.

Abdul Khader Jilani Saudagar (AK)

Information Systems Department, College of Computer and Information Sciences, Imam Mohammad Ibn Saud Islamic University (IMSIU), 11432, Riyadh, Saudi Arabia.

Yazeed Yasin Ghadi (YY)

Department of Computer Science and Software Engineering, Al Ain University, Abu Dhabi, 12555, United Arab Emirates.

Umar Farooq Khattak (UF)

School of Information Technology, UNITAR International University, Kelana Jaya, 47301, Petaling Jaya, Malaysia.

Mohammad Shahid (M)

Department of Computer Science, IIC University of Technology, Phnom Penh, 121206, Cambodia.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH