Machine learning-based heart disease prediction system for Indian population: An exploratory study done in South India.
Affordable healthcare
Cardiovascular diseases
Early diagnosis
Machine learning
Journal
Medical journal, Armed Forces India
ISSN: 0377-1237
Titre abrégé: Med J Armed Forces India
Pays: India
ID NLM: 7602492
Informations de publication
Date de publication:
Jul 2021
Jul 2021
Historique:
received:
29
01
2020
accepted:
14
10
2020
entrez:
26
7
2021
pubmed:
27
7
2021
medline:
27
7
2021
Statut:
ppublish
Résumé
In India, huge mortality occurs due to cardiovascular diseases (CVDs) as these diseases are not diagnosed in early stages. Machine learning (ML) algorithms can be used to build efficient and economical prediction system for early diagnosis of CVDs in India. A total of 1670 anonymized medical records were collected from a tertiary hospital in South India. Seventy percent of the collected data were used to train the prediction system. Five state-of-the-art ML algorithms (k-Nearest Neighbours, Naïve Bayes, Logistic Regression, AdaBoost and Random Forest [RF]) were applied using Python programming language to develop the prediction system. The performance was evaluated over remaining 30% of data. The prediction system was later deployed in the cloud for easy accessibility via Internet. ML effectively predicted the risk of heart disease. The best performing (RF) prediction system correctly classified 470 out of 501 medical records thus attaining a diagnostic accuracy of 93.8%. Sensitivity and specificity were observed to be 92.8% and 94.6%, respectively. The prediction system attained positive predictive value of 94% and negative predictive value of 93.6%. The prediction model developed in this study can be accessed at http://das.southeastasia.cloudapp.azure.com/predict/. ML-based prediction system developed in this study performs well in early diagnosis of CVDs and can be accessed via Internet. This study offers promising results suggesting potential use of ML-based heart disease prediction system as a screening tool to diagnose heart diseases in primary healthcare centres in India, which would otherwise get undetected.
Sections du résumé
BACKGROUND
BACKGROUND
In India, huge mortality occurs due to cardiovascular diseases (CVDs) as these diseases are not diagnosed in early stages. Machine learning (ML) algorithms can be used to build efficient and economical prediction system for early diagnosis of CVDs in India.
METHODS
METHODS
A total of 1670 anonymized medical records were collected from a tertiary hospital in South India. Seventy percent of the collected data were used to train the prediction system. Five state-of-the-art ML algorithms (k-Nearest Neighbours, Naïve Bayes, Logistic Regression, AdaBoost and Random Forest [RF]) were applied using Python programming language to develop the prediction system. The performance was evaluated over remaining 30% of data. The prediction system was later deployed in the cloud for easy accessibility via Internet.
RESULTS
RESULTS
ML effectively predicted the risk of heart disease. The best performing (RF) prediction system correctly classified 470 out of 501 medical records thus attaining a diagnostic accuracy of 93.8%. Sensitivity and specificity were observed to be 92.8% and 94.6%, respectively. The prediction system attained positive predictive value of 94% and negative predictive value of 93.6%. The prediction model developed in this study can be accessed at http://das.southeastasia.cloudapp.azure.com/predict/.
CONCLUSIONS
CONCLUSIONS
ML-based prediction system developed in this study performs well in early diagnosis of CVDs and can be accessed via Internet. This study offers promising results suggesting potential use of ML-based heart disease prediction system as a screening tool to diagnose heart diseases in primary healthcare centres in India, which would otherwise get undetected.
Identifiants
pubmed: 34305284
doi: 10.1016/j.mjafi.2020.10.013
pii: S0377-1237(20)30214-8
pmc: PMC8282535
doi:
Types de publication
Journal Article
Langues
eng
Pagination
302-311Informations de copyright
© 2020 Director General, Armed Forces Medical Services. Published by Elsevier, a division of RELX India Pvt. Ltd.
Déclaration de conflit d'intérêts
The authors have none to declare.
Références
Indian Pediatr. 2019 Sep 15;56(9):796
pubmed: 31638016
Circ Res. 2017 Oct 13;121(9):1092-1101
pubmed: 28794054
Lancet Glob Health. 2018 Dec;6(12):e1339-e1351
pubmed: 30219317
Indian J Public Health. 2019 Apr-Jun;63(2):151-153
pubmed: 31219066
PLoS One. 2017 Apr 4;12(4):e0174944
pubmed: 28376093
Indian J Community Med. 2018 Jul-Sep;43(3):141-143
pubmed: 30294075
Acta Ophthalmol. 2018 Feb;96(1):63-68
pubmed: 29178249