CiRCus: A Framework to Enable Classification of Complex High-Throughput Experiments.
classification
competition binding
kinobeads
labeling
machine learning
proteomics
Journal
Journal of proteome research
ISSN: 1535-3907
Titre abrégé: J Proteome Res
Pays: United States
ID NLM: 101128775
Informations de publication
Date de publication:
05 04 2019
05 04 2019
Historique:
pubmed:
26
2
2019
medline:
2
6
2020
entrez:
26
2
2019
Statut:
ppublish
Résumé
Despite the increasing use of high-throughput experiments in molecular biology, methods for evaluating and classifying the acquired results have not kept pace, requiring significant manual efforts to do so. Here, we present CiRCus, a framework to generate custom machine learning models to classify results from high-throughput proteomics binding experiments. We show the experimental procedure that guided us to the layout of this framework as well as the usage of the framework on an example data set consisting of 557 166 protein/drug binding curves achieving an AUC of 0.9987. By applying our classifier to the data, only 6% of the data might require manual investigation. CiRCus bundles two applications, a minimal interface to label a training data set (CindeR) and an interface for the generation of random forest classifiers with optional optimization of pretrained models (CurveClassification). CiRCus is available on https://github.com/kusterlab accompanied by an in-depth user manual and video tutorial.
Identifiants
pubmed: 30799618
doi: 10.1021/acs.jproteome.8b00724
doi:
Substances chimiques
Proteins
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM