Comparison of multiple modalities for drug response prediction with learning curves using neural networks and XGBoost.

Journal

Bioinformatics advances

ISSN: 2635-0041

Titre abrégé: Bioinform Adv

Pays: England

ID NLM: 9918282081306676

Informations de publication

Date de publication:
2024

Historique:

received: 22 09 2023

revised: 19 12 2023

accepted: 22 12 2023

medline: 29 1 2024

pubmed: 29 1 2024

entrez: 29 1 2024

Statut: epublish

Résumé

Anti-cancer drug response prediction is a central problem within stratified medicine. Transcriptomic profiles of cancer cell lines are typically used for drug response prediction, but we hypothesize that proteomics or phosphoproteomics might be more suitable as they give a more direct insight into cellular processes. However, there has not yet been a systematic comparison between all three of these datatypes using consistent evaluation criteria. Due to the limited number of cell lines with phosphoproteomics profiles we use learning curves, a plot of predictive performance as a function of dataset size, to compare the current performance and predict the future performance of the three omics datasets with more data. We use neural networks and XGBoost and compare them against a simple rule-based benchmark. We show that phosphoproteomics slightly outperforms RNA-seq and proteomics using the 38 cell lines with profiles of all three omics data types. Furthermore, using the 877 cell lines with proteomics and RNA-seq profiles, we show that RNA-seq slightly outperforms proteomics. With the learning curves we predict that the mean squared error using the phosphoproteomics dataset would decrease by See https://github.com/Nik-BB/Learning-curves-for-DRP for the code used.

Identifiants

DOI: 10.1093/bioadv/vbad190 PMID: 38282976 PMC: PMC10812874

pubmed: 38282976

doi: 10.1093/bioadv/vbad190

pii: vbad190

pmc: PMC10812874

doi:

Types de publication

Journal Article

Langues

eng

Pagination

vbad190

Informations de copyright

Déclaration de conflit d'intérêts

None declared.

Comparison of multiple modalities for drug response prediction with learning curves using neural networks and XGBoost.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Pagination

Informations de copyright

Déclaration de conflit d'intérêts

Auteurs

Nikhil Branson (N)

Pedro R Cutillas (PR)

Conrad Bessant (C)

Classifications MeSH