Advancing data science in drug development through an innovative computational framework for data sharing and statistical analysis.


Journal

BMC medical research methodology
ISSN: 1471-2288
Titre abrégé: BMC Med Res Methodol
Pays: England
ID NLM: 100968545

Informations de publication

Date de publication:
14 11 2021
Historique:
received: 27 07 2020
accepted: 22 09 2021
entrez: 14 11 2021
pubmed: 15 11 2021
medline: 24 11 2021
Statut: epublish

Résumé

Novartis and the University of Oxford's Big Data Institute (BDI) have established a research alliance with the aim to improve health care and drug development by making it more efficient and targeted. Using a combination of the latest statistical machine learning technology with an innovative IT platform developed to manage large volumes of anonymised data from numerous data sources and types we plan to identify novel patterns with clinical relevance which cannot be detected by humans alone to identify phenotypes and early predictors of patient disease activity and progression. The collaboration focuses on highly complex autoimmune diseases and develops a computational framework to assemble a research-ready dataset across numerous modalities. For the Multiple Sclerosis (MS) project, the collaboration has anonymised and integrated phase II to phase IV clinical and imaging trial data from ≈35,000 patients across all clinical phenotypes and collected in more than 2200 centres worldwide. For the "IL-17" project, the collaboration has anonymised and integrated clinical and imaging data from over 30 phase II and III Cosentyx clinical trials including more than 15,000 patients, suffering from four autoimmune disorders (Psoriasis, Axial Spondyloarthritis, Psoriatic arthritis (PsA) and Rheumatoid arthritis (RA)). A fundamental component of successful data analysis and the collaborative development of novel machine learning methods on these rich data sets has been the construction of a research informatics framework that can capture the data at regular intervals where images could be anonymised and integrated with the de-identified clinical data, quality controlled and compiled into a research-ready relational database which would then be available to multi-disciplinary analysts. The collaborative development from a group of software developers, data wranglers, statisticians, clinicians, and domain scientists across both organisations has been key. This framework is innovative, as it facilitates collaborative data management and makes a complicated clinical trial data set from a pharmaceutical company available to academic researchers who become associated with the project. An informatics framework has been developed to capture clinical trial data into a pipeline of anonymisation, quality control, data exploration, and subsequent integration into a database. Establishing this framework has been integral to the development of analytical tools.

Sections du résumé

BACKGROUND
Novartis and the University of Oxford's Big Data Institute (BDI) have established a research alliance with the aim to improve health care and drug development by making it more efficient and targeted. Using a combination of the latest statistical machine learning technology with an innovative IT platform developed to manage large volumes of anonymised data from numerous data sources and types we plan to identify novel patterns with clinical relevance which cannot be detected by humans alone to identify phenotypes and early predictors of patient disease activity and progression.
METHOD
The collaboration focuses on highly complex autoimmune diseases and develops a computational framework to assemble a research-ready dataset across numerous modalities. For the Multiple Sclerosis (MS) project, the collaboration has anonymised and integrated phase II to phase IV clinical and imaging trial data from ≈35,000 patients across all clinical phenotypes and collected in more than 2200 centres worldwide. For the "IL-17" project, the collaboration has anonymised and integrated clinical and imaging data from over 30 phase II and III Cosentyx clinical trials including more than 15,000 patients, suffering from four autoimmune disorders (Psoriasis, Axial Spondyloarthritis, Psoriatic arthritis (PsA) and Rheumatoid arthritis (RA)).
RESULTS
A fundamental component of successful data analysis and the collaborative development of novel machine learning methods on these rich data sets has been the construction of a research informatics framework that can capture the data at regular intervals where images could be anonymised and integrated with the de-identified clinical data, quality controlled and compiled into a research-ready relational database which would then be available to multi-disciplinary analysts. The collaborative development from a group of software developers, data wranglers, statisticians, clinicians, and domain scientists across both organisations has been key. This framework is innovative, as it facilitates collaborative data management and makes a complicated clinical trial data set from a pharmaceutical company available to academic researchers who become associated with the project.
CONCLUSIONS
An informatics framework has been developed to capture clinical trial data into a pipeline of anonymisation, quality control, data exploration, and subsequent integration into a database. Establishing this framework has been integral to the development of analytical tools.

Identifiants

pubmed: 34773974
doi: 10.1186/s12874-021-01409-4
pii: 10.1186/s12874-021-01409-4
pmc: PMC8590765
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

250

Informations de copyright

© 2021. The Author(s).

Références

J Am Med Inform Assoc. 2010 Mar-Apr;17(2):169-77
pubmed: 20190059
Neuroimage. 2018 Feb 1;166:400-424
pubmed: 29079522
J R Stat Soc Ser A Stat Soc. 1991;154(2):305-40
pubmed: 12159132
Neuroimage. 2012 Aug 15;62(2):782-90
pubmed: 21979382
Neurology. 2014 Jul 15;83(3):278-86
pubmed: 24871874
BMC Med Inform Decis Mak. 2012 Jul 09;12:66
pubmed: 22776564

Auteurs

Ann-Marie Mallon (AM)

MRC Harwell Institute, Harwell Campus, Oxfordshire, OX11 0RD, UK. a.mallon@har.mrc.ac.uk.

Dieter A Häring (DA)

Novartis Pharma AG, Basel, Switzerland.

Frank Dahlke (F)

Novartis Pharma AG, Basel, Switzerland.

Piet Aarden (P)

Novartis Pharma AG, Basel, Switzerland.

Soroosh Afyouni (S)

Big Data Institute, University of Oxford Li Ka Shing Centre for Health Information and Discovery, Old Road Campus, Oxford, OX3 7LF, UK.

Daniel Delbarre (D)

MRC Harwell Institute, Harwell Campus, Oxfordshire, OX11 0RD, UK.

Khaled El Emam (K)

Children's Hospital of Eastern Ontario Research Institute, 401 Smyth Road, Ottawa, Ontario, K1J 8 L1, Canada.

Habib Ganjgahi (H)

Department of Statistics, University of Oxford, 24-29 St Giles', OX1 3LB, Oxford, UK.

Stephen Gardiner (S)

MRC Harwell Institute, Harwell Campus, Oxfordshire, OX11 0RD, UK.

Chun Hei Kwok (CH)

MRC Harwell Institute, Harwell Campus, Oxfordshire, OX11 0RD, UK.

Dominique M West (DM)

MRC Harwell Institute, Harwell Campus, Oxfordshire, OX11 0RD, UK.

Ewan Straiton (E)

MRC Harwell Institute, Harwell Campus, Oxfordshire, OX11 0RD, UK.

Sibylle Haemmerle (S)

Novartis Pharma AG, Basel, Switzerland.

Adam Huffman (A)

Big Data Institute, University of Oxford Li Ka Shing Centre for Health Information and Discovery, Old Road Campus, Oxford, OX3 7LF, UK.

Tom Hofmann (T)

Novartis Pharma AG, Basel, Switzerland.

Luke J Kelly (LJ)

Big Data Institute, University of Oxford Li Ka Shing Centre for Health Information and Discovery, Old Road Campus, Oxford, OX3 7LF, UK.
Department of Statistics, University of Oxford, 24-29 St Giles', OX1 3LB, Oxford, UK.

Peter Krusche (P)

Novartis Pharma AG, Basel, Switzerland.

Marie-Claude Laramee (MC)

Novartis Pharma AG, Basel, Switzerland.

Karine Lheritier (K)

Novartis Pharma AG, Basel, Switzerland.

Greg Ligozio (G)

Novartis Pharma AG, East Hanover, NJ, USA.

Aimee Readie (A)

Novartis Pharma AG, East Hanover, NJ, USA.

Luis Santos (L)

MRC Harwell Institute, Harwell Campus, Oxfordshire, OX11 0RD, UK.

Thomas E Nichols (TE)

Big Data Institute, University of Oxford Li Ka Shing Centre for Health Information and Discovery, Old Road Campus, Oxford, OX3 7LF, UK.

Janice Branson (J)

Novartis Pharma AG, Basel, Switzerland.

Chris Holmes (C)

Department of Statistics, University of Oxford, 24-29 St Giles', OX1 3LB, Oxford, UK.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH