Detection of atypical data in multicenter clinical trials using unsupervised statistical monitoring.


Journal

Clinical trials (London, England)
ISSN: 1740-7753
Titre abrégé: Clin Trials
Pays: England
ID NLM: 101197451

Informations de publication

Date de publication:
10 2019
Historique:
pubmed: 25 7 2019
medline: 1 9 2020
entrez: 24 7 2019
Statut: ppublish

Résumé

A risk-based approach to clinical research may include a central statistical assessment of data quality. We investigated the operating characteristics of unsupervised statistical monitoring aimed at detecting atypical data in multicenter experiments. The approach is premised on the assumption that, save for random fluctuations and natural variations, data coming from all centers should be comparable and statistically consistent. Unsupervised statistical monitoring consists of performing as many statistical tests as possible on all trial data, in order to detect centers whose data are inconsistent with data from other centers. We conducted simulations using data from a large multicenter trial conducted in Japan for patients with advanced gastric cancer. The actual trial data were contaminated in computer simulations for varying percentages of centers, percentages of patients modified within each center and numbers and types of modified variables. The unsupervised statistical monitoring software was run by a blinded team on the contaminated data sets, with the purpose of detecting the centers with contaminated data. The operating characteristics (sensitivity, specificity and Youden's J-index) were calculated for three detection methods: one using the The operating characteristics of the three methods were satisfactory in situations of data contamination likely to occur in practice, specifically when a single or a few centers were contaminated. As expected, the sensitivity increased for increasing proportions of patients and increasing numbers of variables contaminated. The three methods showed a specificity better than 93% in all scenarios of contamination. The method based on the Data Inconsistency Score and individual The use of brute force (a computer-intensive approach that generates large numbers of statistical tests) is an effective way to check data quality in multicenter clinical trials. It can provide a cost-effective complement to other data-management and monitoring techniques.

Sections du résumé

BACKGROUND/AIMS
A risk-based approach to clinical research may include a central statistical assessment of data quality. We investigated the operating characteristics of unsupervised statistical monitoring aimed at detecting atypical data in multicenter experiments. The approach is premised on the assumption that, save for random fluctuations and natural variations, data coming from all centers should be comparable and statistically consistent. Unsupervised statistical monitoring consists of performing as many statistical tests as possible on all trial data, in order to detect centers whose data are inconsistent with data from other centers.
METHODS
We conducted simulations using data from a large multicenter trial conducted in Japan for patients with advanced gastric cancer. The actual trial data were contaminated in computer simulations for varying percentages of centers, percentages of patients modified within each center and numbers and types of modified variables. The unsupervised statistical monitoring software was run by a blinded team on the contaminated data sets, with the purpose of detecting the centers with contaminated data. The operating characteristics (sensitivity, specificity and Youden's J-index) were calculated for three detection methods: one using the
RESULTS
The operating characteristics of the three methods were satisfactory in situations of data contamination likely to occur in practice, specifically when a single or a few centers were contaminated. As expected, the sensitivity increased for increasing proportions of patients and increasing numbers of variables contaminated. The three methods showed a specificity better than 93% in all scenarios of contamination. The method based on the Data Inconsistency Score and individual
CONCLUSIONS
The use of brute force (a computer-intensive approach that generates large numbers of statistical tests) is an effective way to check data quality in multicenter clinical trials. It can provide a cost-effective complement to other data-management and monitoring techniques.

Identifiants

pubmed: 31331195
doi: 10.1177/1740774519862564
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

512-522

Auteurs

Laura Trotta (L)

CluePoints S.A., Louvain-la-Neuve, Belgium.

Yuusuke Kabeya (Y)

Department of Biostatistics, The University of Tokyo, Tokyo, Japan.
EPS Corporation, Tokyo, Japan.

Marc Buyse (M)

International Drug Development Institute (IDDI), San Francisco, CA, USA.
CluePoints, Wayne, PA, USA.

Erik Doffagne (E)

CluePoints S.A., Louvain-la-Neuve, Belgium.

David Venet (D)

Institut de Recherches Interdisciplinaires et de Développements en Intelligence Artificielle (IRIDIA), University of Brussels, Brussels, Belgium.

Lieven Desmet (L)

Institute of Statistics, Biostatistics and Actuarial Sciences (ISBA), University of Louvain, Louvain-la-Neuve, Belgium.

Tomasz Burzykowski (T)

International Drug Development Institute (IDDI), Louvain-la-Neuve, Belgium.
Interuniversity Institute for Biostatistics and Statistical Bioinformatics (I-BioStat), University of Hasselt, Hasselt, Belgium.

Akira Tsuburaya (A)

Department of Surgery, Jizankai Medical Foundation, Tsuboi Cancer Center Hospital, Koriyama, Japan.

Kazuhiro Yoshida (K)

Department of Surgical Oncology, Graduate School of Medicine, Gifu University, Gifu, Japan.

Yumi Miyashita (Y)

Epidemiological and Clinical Research Information Network (ECRIN), Okazaki, Japan.

Satoshi Morita (S)

Department of Biomedical Statistics and Bioinformatics, Graduate School of Medicine, Kyoto University, Kyoto, Japan.

Junichi Sakamoto (J)

Epidemiological and Clinical Research Information Network (ECRIN), Okazaki, Japan.
Tokai Central Hospital, Kakamigahara, Japan.

Paurush Praveen (P)

CluePoints S.A., Louvain-la-Neuve, Belgium.

Koji Oba (K)

Department of Biostatistics, The University of Tokyo, Tokyo, Japan.
Interfaculty Initiative in Information Studies, The University of Tokyo, Tokyo, Japan.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH