Visually guided preprocessing of bioanalytical laboratory data using an interactive R notebook (pguIMP).


Journal

CPT: pharmacometrics & systems pharmacology
ISSN: 2163-8306
Titre abrégé: CPT Pharmacometrics Syst Pharmacol
Pays: United States
ID NLM: 101580011

Informations de publication

Date de publication:
11 2021
Historique:
revised: 06 07 2021
received: 15 04 2021
accepted: 10 08 2021
pubmed: 2 10 2021
medline: 5 4 2022
entrez: 1 10 2021
Statut: ppublish

Résumé

The evaluation of pharmacological data using machine learning requires high data quality. Therefore, data preprocessing, that is, cleaning analytical laboratory errors, replacing missing values or outliers, and transforming data adequately before actual data analysis, is crucial. Because current tools available for this purpose often require programming skills, preprocessing tools with graphical user interfaces that can be used interactively are needed. In collaboration between data scientists and experts in bioanalytical diagnostics, a graphical software package for data preprocessing called pguIMP is proposed, which contains a fixed sequence of preprocessing steps to enable reproducible interactive data preprocessing. As an R-based package, it also allows direct integration into this data science environment without requiring any programming knowledge. The implementation of contemporary data processing methods, including machine-learning-based imputation techniques, ensures the generation of corrected and cleaned bioanalytical data sets that preserve data structures such as clusters better than is possible with classical methods. This was evaluated on bioanalytical data sets from lipidomics and drug research using k-nearest-neighbors-based imputation followed by k-means clustering and density-based spatial clustering of applications with noise. The R package provides a Shiny-based web interface designed to be easy to use for non-data analysis experts. It is demonstrated that the spectrum of methods provided is suitable as a standard pipeline for preprocessing bioanalytical data in biomedical research domains. The R package pguIMP is freely available at the comprehensive R archive network (https://cran.r-project.org/web/packages/pguIMP/index.html).

Identifiants

pubmed: 34598320
doi: 10.1002/psp4.12704
pmc: PMC8592507
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1371-1381

Subventions

Organisme : Landesoffensive zur Entwicklung wissenschaftlich-ökonomischer Exzellenz (LOEWE),
Organisme : LOEWE-Zentrum für Translationale Medizin und Pharmakologie, project "Reproducible cleaning of biomedical laboratory data using methods of visualization, error correction and transformation implemented as interactive R-notebooks" (JL).

Informations de copyright

© 2021 The Authors. CPT: Pharmacometrics & Systems Pharmacology published by Wiley Periodicals LLC on behalf of American Society for Clinical Pharmacology and Therapeutics.

Références

Biometrika. 1968 Mar;55(1):1-17
pubmed: 5661047
Int J Mol Sci. 2015 Oct 28;16(10):25897-911
pubmed: 26516852
Bioanalysis. 2016 Feb;8(4):351-64
pubmed: 26856187
CPT Pharmacometrics Syst Pharmacol. 2021 Apr;10(4):291-308
pubmed: 33715307
Pharmacol Res Perspect. 2017 Dec;5(6):
pubmed: 29226627
Regul Toxicol Pharmacol. 2017 Oct;89:20-25
pubmed: 28713068
Clin Pharmacol Ther. 2020 Apr;107(4):871-885
pubmed: 32128792
Expert Opin Drug Discov. 2016;11(3):241-56
pubmed: 26689499
Pharmacol Res Perspect. 2015 Mar;3(2):e00131
pubmed: 26038706
Metabolites. 2014 Jun 16;4(2):433-52
pubmed: 24957035
Front Psychiatry. 2019 Feb 11;10:41
pubmed: 30804821
CPT Pharmacometrics Syst Pharmacol. 2021 Nov;10(11):1371-1381
pubmed: 34598320
J Pharmacokinet Pharmacodyn. 2008 Aug;35(4):401-21
pubmed: 18686017
Nat Rev Neurol. 2020 Jul;16(7):381-400
pubmed: 32541893
J Biopharm Stat. 1997 Mar;7(1):171-8
pubmed: 9056596
AAPS J. 2009 Jun;11(2):371-80
pubmed: 19452283
Front Big Data. 2021 Jul 08;4:693674
pubmed: 34308343
J Pharmacokinet Pharmacodyn. 2001 Oct;28(5):481-504
pubmed: 11768292
PLoS Comput Biol. 2020 Aug 27;16(8):e1008126
pubmed: 32853229
Science. 2011 Dec 2;334(6060):1226-7
pubmed: 22144613
Anal Chim Acta. 2015 Jul 23;885:1-16
pubmed: 26231889
Pain. 2009 Jul;144(1-2):119-24
pubmed: 19395173
Sri Lankan J Appl Stat. 2014;5(4):227-246
pubmed: 27110215
Stat Med. 2012 Dec 30;31(30):4280-95
pubmed: 22825800

Auteurs

Sebastian Malkusch (S)

Institute of Clinical Pharmacology, Goethe-University, Frankfurt am Main, Germany.

Lisa Hahnefeld (L)

Institute of Clinical Pharmacology, Goethe-University, Frankfurt am Main, Germany.

Robert Gurke (R)

Institute of Clinical Pharmacology, Goethe-University, Frankfurt am Main, Germany.
Fraunhofer Institute for Translational Medicine and Pharmacology ITMP, Frankfurt am Main, Germany.

Jörn Lötsch (J)

Institute of Clinical Pharmacology, Goethe-University, Frankfurt am Main, Germany.
Fraunhofer Institute for Translational Medicine and Pharmacology ITMP, Frankfurt am Main, Germany.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH