Equitable machine learning counteracts ancestral bias in precision medicine, improving outcomes for all.
ancestry
artificial intelligence
cancer
equitable AI
genomics
Journal
Research square
Titre abrégé: Res Sq
Pays: United States
ID NLM: 101768035
Informations de publication
Date de publication:
27 Jul 2023
27 Jul 2023
Historique:
pubmed:
7
8
2023
medline:
7
8
2023
entrez:
7
8
2023
Statut:
epublish
Résumé
Gold standard genomic datasets severely under-represent non-European populations, leading to inequities and a limited understanding of human disease [1-8]. Therapeutics and outcomes remain hidden because we lack insights that we could gain from analyzing ancestry-unbiased genomic data. To address this significant gap, we present PhyloFrame, the first-ever machine learning method for equitable genomic precision medicine. PhyloFrame corrects for ancestral bias by integrating big data tissue-specific functional interaction networks, global population variation data, and disease-relevant transcriptomic data. Application of PhyloFrame to breast, thyroid, and uterine cancers shows marked improvements in predictive power across all ancestries, less model overfitting, and a higher likelihood of identifying known cancer-related genes. The ability to provide accurate predictions for underrepresented groups, in particular, is substantially increased. These results demonstrate how AI can mitigate ancestral bias in training data and contribute to equitable representation in medical research.
Identifiants
pubmed: 37546907
doi: 10.21203/rs.3.rs-3168446/v1
pmc: PMC10402189
pii:
doi:
Types de publication
Preprint
Langues
eng
Subventions
Organisme : NCI NIH HHS
ID : R01 CA259396
Pays : United States
Déclaration de conflit d'intérêts
Competing interests Authors declare that they have no competing interests.