HIP: a method for high-dimensional multi-view data integration and prediction accounting for subgroup heterogeneity.
COPD
multi-view data
multi-view learning
one-step methods
subgroup heterogeneity
Journal
Briefings in bioinformatics
ISSN: 1477-4054
Titre abrégé: Brief Bioinform
Pays: England
ID NLM: 100912837
Informations de publication
Date de publication:
23 Sep 2024
23 Sep 2024
Historique:
received:
10
03
2024
revised:
09
07
2024
accepted:
13
09
2024
medline:
30
9
2024
pubmed:
30
9
2024
entrez:
30
9
2024
Statut:
ppublish
Résumé
Epidemiologic and genetic studies in many complex diseases suggest subgroup disparities (e.g. by sex, race) in disease course and patient outcomes. We consider this from the standpoint of integrative analysis where we combine information from different views (e.g. genomics, proteomics, clinical data). Existing integrative analysis methods ignore the heterogeneity in subgroups, and stacking the views and accounting for subgroup heterogeneity does not model the association among the views. We propose Heterogeneity in Integration and Prediction (HIP), a statistical approach for joint association and prediction that leverages the strengths in each view to identify molecular signatures that are shared by and specific to a subgroup. We apply HIP to proteomics and gene expression data pertaining to chronic obstructive pulmonary disease (COPD) to identify proteins and genes shared by, and unique to, males and females, contributing to the variation in COPD, measured by airway wall thickness. Our COPD findings have identified proteins, genes, and pathways that are common across and specific to males and females, some implicated in COPD, while others could lead to new insights into sex differences in COPD mechanisms. HIP accounts for subgroup heterogeneity in multi-view data, ranks variables based on importance, is applicable to univariate or multivariate continuous outcomes, and incorporates covariate adjustment. With the efficient algorithms implemented using PyTorch, this method has many potential scientific applications and could enhance multiomics research in health disparities. HIP is available at https://github.com/lasandrall/HIP, a video tutorial at https://youtu.be/O6E2OLmeMDo and a Shiny Application at https://multi-viewlearn.shinyapps.io/HIP_ShinyApp/ for users with limited programming experience.
Identifiants
pubmed: 39344710
pii: 7790998
doi: 10.1093/bib/bbae470
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : National Center For Advancing Translational Science
ID : 5KL2TR002492
Organisme : NIGMS NIH HHS
ID : 1R35GM142695
Pays : United States
Organisme : NHLBI NIH HHS
ID : U01 HL089897
Pays : United States
Informations de copyright
© The Author(s) 2024. Published by Oxford University Press.