Built Environment Features Obtained from Google Street View Are Associated with Coronary Artery Disease Prevalence: A Deep-Learning Framework.
Cardiovascular risk
machine learning
neighborhood
phenotypes
socio-environmental
Journal
medRxiv : the preprint server for health sciences
Titre abrégé: medRxiv
Pays: United States
ID NLM: 101767986
Informations de publication
Date de publication:
29 Mar 2023
29 Mar 2023
Historique:
pubmed:
11
4
2023
medline:
11
4
2023
entrez:
10
4
2023
Statut:
epublish
Résumé
Built environment plays an important role in development of cardiovascular disease. Tools to evaluate the built environment using machine vision and informatic approaches has been limited. We sought to investigate the association between machine vision-based built environment and prevalence of cardiometabolic disease in urban cities. This cross-sectional study used features extracted from Google Street view (GSV) images to measure the built environment and link them with prevalence of cardiometabolic disease. Convolutional neural networks, light gradient boosting machines and activation maps were utilized to predict health outcomes and identify feature associations with coronary heart disease (CHD). The study obtained 0.53 million GSV images covering 789 census tracts in 7 cities (Cleveland, OH; Fremont, CA; Kansas City, MO; Detroit, MI; Bellevue, WA; Brownsville, TX; and Denver, CO). Analyses were conducted from February 2022 to December 2022. We used census tract-level data from the Centers for Disease Control and Prevention's PLACES dataset. Main outcomes included census tract-level estimated prevalence of CHD based on GSV built environment features. Built environment features extracted from GSV using deep learning predicted 63% of the census tract variation in CHD prevalence. The ExtraTrees Regressor achieved the best result among all models with the lowest average mean absolute error of 1.11% and Root mean square of error of 1.58. The addition of GSV features outperformed and improved a model that only included census-tract level age, sex, race, income and education. Activation maps from the features revealed a set of neighborhood features represented by buildings and roads associated with CHD prevalence. In this cross-sectional study, a significant portion of CHD prevalence were explained by GSV-based built environment factors analyzed using deep learning, independent of census tract demographics. Machine vision enabled assessment of the built environment could help play a significant role in designing and improving heart-heathy cities.
Sections du résumé
Background
UNASSIGNED
Built environment plays an important role in development of cardiovascular disease. Tools to evaluate the built environment using machine vision and informatic approaches has been limited. We sought to investigate the association between machine vision-based built environment and prevalence of cardiometabolic disease in urban cities.
Methods
UNASSIGNED
This cross-sectional study used features extracted from Google Street view (GSV) images to measure the built environment and link them with prevalence of cardiometabolic disease. Convolutional neural networks, light gradient boosting machines and activation maps were utilized to predict health outcomes and identify feature associations with coronary heart disease (CHD). The study obtained 0.53 million GSV images covering 789 census tracts in 7 cities (Cleveland, OH; Fremont, CA; Kansas City, MO; Detroit, MI; Bellevue, WA; Brownsville, TX; and Denver, CO). Analyses were conducted from February 2022 to December 2022. We used census tract-level data from the Centers for Disease Control and Prevention's PLACES dataset. Main outcomes included census tract-level estimated prevalence of CHD based on GSV built environment features.
Results
UNASSIGNED
Built environment features extracted from GSV using deep learning predicted 63% of the census tract variation in CHD prevalence. The ExtraTrees Regressor achieved the best result among all models with the lowest average mean absolute error of 1.11% and Root mean square of error of 1.58. The addition of GSV features outperformed and improved a model that only included census-tract level age, sex, race, income and education. Activation maps from the features revealed a set of neighborhood features represented by buildings and roads associated with CHD prevalence.
Conclusions
UNASSIGNED
In this cross-sectional study, a significant portion of CHD prevalence were explained by GSV-based built environment factors analyzed using deep learning, independent of census tract demographics. Machine vision enabled assessment of the built environment could help play a significant role in designing and improving heart-heathy cities.
Identifiants
pubmed: 37034698
doi: 10.1101/2023.03.28.23287888
pmc: PMC10081432
pii:
doi:
Types de publication
Preprint
Langues
eng
Déclaration de conflit d'intérêts
Disclosures: None of the authors have conflicts of interest relevant to the contents of this manuscript.