Probabilistic Approaches to Overcome Content Heterogeneity in Data Integration: A Study Case in Systematic Lupus Erythematosus.

Probabilistic data integration biomedical data harmonisation content heterogeneity missing data

Journal

Studies in health technology and informatics
ISSN: 1879-8365
Titre abrégé: Stud Health Technol Inform
Pays: Netherlands
ID NLM: 9214582

Informations de publication

Date de publication:
16 Jun 2020
Historique:
entrez: 24 6 2020
pubmed: 24 6 2020
medline: 26 8 2020
Statut: ppublish

Résumé

Integrating data from different sources into homogeneous dataset increases the opportunities to study human health. However, disparate data collections are often heterogeneous, which complicates their integration. In this paper, we focus on the issue of content heterogeneity in data integration. Traditional approaches for resolving content heterogeneity map all source datasets to a common data model that includes only shared data items, and thus omit all items that vary between datasets. Based on an example of three datasets in Systemic Lupus Erythematosus, we describe and experimentally evaluate a probabilistic data integration approach which propagates the uncertainty resulting from content heterogeneity into statistical inference, avoiding the need to map to a common data model.

Identifiants

pubmed: 32570412
pii: SHTI200188
doi: 10.3233/SHTI200188
doi:

Types de publication

Journal Article

Langues

eng

Pagination

387-391

Auteurs

Alexia Sampri (A)

Division of Informatics, Imaging and Data Sciences University of Manchester, Manchester, UK.

Nophar Geifman (N)

Division of Informatics, Imaging and Data Sciences University of Manchester, Manchester, UK.

Helen Le Sueur (H)

Division of Informatics, Imaging and Data Sciences University of Manchester, Manchester, UK.

Patrick Doherty (P)

Division of Musculoskeletal & Dermatological Sciences, University of Manchester, Manchester, UK.

Philip Couch (P)

Division of Informatics, Imaging and Data Sciences University of Manchester, Manchester, UK.

Ian Bruce (I)

Division of Musculoskeletal & Dermatological Sciences, University of Manchester, Manchester, UK.
NIHR Manchester Biomedical Research Centre, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK.

Niels Peek (N)

Division of Informatics, Imaging and Data Sciences University of Manchester, Manchester, UK.
NIHR Manchester Biomedical Research Centre, University of Manchester, Manchester Academic Health Science Centre, Manchester, UK.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH