Equitable machine learning counteracts ancestral bias in precision medicine, improving outcomes for all.

ancestry artificial intelligence cancer equitable AI genomics

Journal

Research square
Titre abrégé: Res Sq
Pays: United States
ID NLM: 101768035

Informations de publication

Date de publication:
27 Jul 2023
Historique:
pubmed: 7 8 2023
medline: 7 8 2023
entrez: 7 8 2023
Statut: epublish

Résumé

Gold standard genomic datasets severely under-represent non-European populations, leading to inequities and a limited understanding of human disease [1-8]. Therapeutics and outcomes remain hidden because we lack insights that we could gain from analyzing ancestry-unbiased genomic data. To address this significant gap, we present PhyloFrame, the first-ever machine learning method for equitable genomic precision medicine. PhyloFrame corrects for ancestral bias by integrating big data tissue-specific functional interaction networks, global population variation data, and disease-relevant transcriptomic data. Application of PhyloFrame to breast, thyroid, and uterine cancers shows marked improvements in predictive power across all ancestries, less model overfitting, and a higher likelihood of identifying known cancer-related genes. The ability to provide accurate predictions for underrepresented groups, in particular, is substantially increased. These results demonstrate how AI can mitigate ancestral bias in training data and contribute to equitable representation in medical research.

Identifiants

pubmed: 37546907
doi: 10.21203/rs.3.rs-3168446/v1
pmc: PMC10402189
pii:
doi:

Types de publication

Preprint

Langues

eng

Subventions

Organisme : NCI NIH HHS
ID : R01 CA259396
Pays : United States

Déclaration de conflit d'intérêts

Competing interests Authors declare that they have no competing interests.

Auteurs

Leslie A Smith (LA)

Department of Computer & Information Science & Engineering, University of Florida, 432 Newell Dr, Gainesville, 32611, FL, USA.

James A Cahill (JA)

Environmental Engineering Sciences Department, University of Florida, 432 Newell Dr, Gainesville, 32611, FL, USA.

Kiley Graim (K)

Department of Computer & Information Science & Engineering, University of Florida, 432 Newell Dr, Gainesville, 32611, FL, USA.

Classifications MeSH