Machine learning-based heart disease prediction system for Indian population: An exploratory study done in South India.

Affordable healthcare Cardiovascular diseases Early diagnosis Machine learning

Journal

Medical journal, Armed Forces India
ISSN: 0377-1237
Titre abrégé: Med J Armed Forces India
Pays: India
ID NLM: 7602492

Informations de publication

Date de publication:
Jul 2021
Historique:
received: 29 01 2020
accepted: 14 10 2020
entrez: 26 7 2021
pubmed: 27 7 2021
medline: 27 7 2021
Statut: ppublish

Résumé

In India, huge mortality occurs due to cardiovascular diseases (CVDs) as these diseases are not diagnosed in early stages. Machine learning (ML) algorithms can be used to build efficient and economical prediction system for early diagnosis of CVDs in India. A total of 1670 anonymized medical records were collected from a tertiary hospital in South India. Seventy percent of the collected data were used to train the prediction system. Five state-of-the-art ML algorithms (k-Nearest Neighbours, Naïve Bayes, Logistic Regression, AdaBoost and Random Forest [RF]) were applied using Python programming language to develop the prediction system. The performance was evaluated over remaining 30% of data. The prediction system was later deployed in the cloud for easy accessibility via Internet. ML effectively predicted the risk of heart disease. The best performing (RF) prediction system correctly classified 470 out of 501 medical records thus attaining a diagnostic accuracy of 93.8%. Sensitivity and specificity were observed to be 92.8% and 94.6%, respectively. The prediction system attained positive predictive value of 94% and negative predictive value of 93.6%. The prediction model developed in this study can be accessed at http://das.southeastasia.cloudapp.azure.com/predict/. ML-based prediction system developed in this study performs well in early diagnosis of CVDs and can be accessed via Internet. This study offers promising results suggesting potential use of ML-based heart disease prediction system as a screening tool to diagnose heart diseases in primary healthcare centres in India, which would otherwise get undetected.

Sections du résumé

BACKGROUND BACKGROUND
In India, huge mortality occurs due to cardiovascular diseases (CVDs) as these diseases are not diagnosed in early stages. Machine learning (ML) algorithms can be used to build efficient and economical prediction system for early diagnosis of CVDs in India.
METHODS METHODS
A total of 1670 anonymized medical records were collected from a tertiary hospital in South India. Seventy percent of the collected data were used to train the prediction system. Five state-of-the-art ML algorithms (k-Nearest Neighbours, Naïve Bayes, Logistic Regression, AdaBoost and Random Forest [RF]) were applied using Python programming language to develop the prediction system. The performance was evaluated over remaining 30% of data. The prediction system was later deployed in the cloud for easy accessibility via Internet.
RESULTS RESULTS
ML effectively predicted the risk of heart disease. The best performing (RF) prediction system correctly classified 470 out of 501 medical records thus attaining a diagnostic accuracy of 93.8%. Sensitivity and specificity were observed to be 92.8% and 94.6%, respectively. The prediction system attained positive predictive value of 94% and negative predictive value of 93.6%. The prediction model developed in this study can be accessed at http://das.southeastasia.cloudapp.azure.com/predict/.
CONCLUSIONS CONCLUSIONS
ML-based prediction system developed in this study performs well in early diagnosis of CVDs and can be accessed via Internet. This study offers promising results suggesting potential use of ML-based heart disease prediction system as a screening tool to diagnose heart diseases in primary healthcare centres in India, which would otherwise get undetected.

Identifiants

pubmed: 34305284
doi: 10.1016/j.mjafi.2020.10.013
pii: S0377-1237(20)30214-8
pmc: PMC8282535
doi:

Types de publication

Journal Article

Langues

eng

Pagination

302-311

Informations de copyright

© 2020 Director General, Armed Forces Medical Services. Published by Elsevier, a division of RELX India Pvt. Ltd.

Déclaration de conflit d'intérêts

The authors have none to declare.

Références

Indian Pediatr. 2019 Sep 15;56(9):796
pubmed: 31638016
Circ Res. 2017 Oct 13;121(9):1092-1101
pubmed: 28794054
Lancet Glob Health. 2018 Dec;6(12):e1339-e1351
pubmed: 30219317
Indian J Public Health. 2019 Apr-Jun;63(2):151-153
pubmed: 31219066
PLoS One. 2017 Apr 4;12(4):e0174944
pubmed: 28376093
Indian J Community Med. 2018 Jul-Sep;43(3):141-143
pubmed: 30294075
Acta Ophthalmol. 2018 Feb;96(1):63-68
pubmed: 29178249

Auteurs

Ekta Maini (E)

Research Scholar (Computer Science & Engineering), Dayananda Sagar University, Bengaluru, India.

Bondu Venkateswarlu (B)

Associate Professor (Computer Science & Engineering), Dayananda Sagar University, Bengaluru, India.

Baljeet Maini (B)

Professor Pediatrics, Teerthanker Mahaveer Medical College & Research Centre, Moradabad, India.

Dheeraj Marwaha (D)

Senior Software Engineer, Microsoft India, Hyderabad, India.

Classifications MeSH