PhenoComb: a discovery tool to assess complex phenotypes in high-dimensional single-cell datasets.


Journal

Bioinformatics advances
ISSN: 2635-0041
Titre abrégé: Bioinform Adv
Pays: England
ID NLM: 9918282081306676

Informations de publication

Date de publication:
2022
Historique:
received: 22 04 2022
revised: 28 06 2022
accepted: 28 07 2022
entrez: 26 1 2023
pubmed: 27 1 2023
medline: 27 1 2023
Statut: epublish

Résumé

High-dimensional cytometry assays can simultaneously measure dozens of markers, enabling the investigation of complex phenotypes. However, as manual gating relies on previous biological knowledge, few marker combinations are often assessed. This results in complex phenotypes with the potential for biological relevance being overlooked. Here, we present PhenoComb, an R package that allows agnostic exploration of phenotypes by assessing all combinations of markers. PhenoComb uses signal intensity thresholds to assign markers to discrete states (e.g. negative, low, high) and then counts the number of cells per sample from all possible marker combinations in a memory-safe manner. Time and disk space are the only constraints on the number of markers evaluated. PhenoComb also provides several approaches to perform statistical comparisons, evaluate the relevance of phenotypes and assess the independence of identified phenotypes. PhenoComb allows users to guide analysis by adjusting several function arguments, such as identifying parent populations of interest, filtering of low-frequency populations and defining a maximum complexity of phenotypes to evaluate. We have designed PhenoComb to be compatible with a local computer or server-based use. In testing of PhenoComb's performance on synthetic datasets, computation on 16 markers was completed in the scale of minutes and up to 26 markers in hours. We applied PhenoComb to two publicly available datasets: an HIV flow cytometry dataset (12 markers and 421 samples) and the COVIDome CyTOF dataset (40 markers and 99 samples). In the HIV dataset, PhenoComb identified immune phenotypes associated with HIV seroconversion, including those highlighted in the original publication. In the COVID dataset, we identified several immune phenotypes with altered frequencies in infected individuals relative to healthy individuals. Collectively, PhenoComb represents a powerful discovery tool for agnostically assessing high-dimensional single-cell data. The PhenoComb R package can be downloaded from https://github.com/SciOmicsLab/PhenoComb. Supplementary data are available at

Identifiants

pubmed: 36699375
doi: 10.1093/bioadv/vbac052
pii: vbac052
pmc: PMC9710698
doi:

Types de publication

Journal Article

Langues

eng

Pagination

vbac052

Informations de copyright

© The Author(s) 2022. Published by Oxford University Press.

Références

Bioinformatics. 2012 Apr 1;28(7):1009-16
pubmed: 22383736
Cytometry B Clin Cytom. 2013 Jul-Aug;84(4):207-17
pubmed: 23576305
Immunology. 2008 Dec;125(4):441-9
pubmed: 19137647
Curr Opin Immunol. 2013 Oct;25(5):556-63
pubmed: 24148236
JCI Insight. 2020 Jun 4;5(11):
pubmed: 32369447
Cytometry. 1997 Jan 1;27(1):71-6
pubmed: 9000587
Nat Biotechnol. 2018 Dec 03;:
pubmed: 30531897
Cell. 2015 Jul 2;162(1):184-97
pubmed: 26095251
Cell Rep. 2021 Aug 17;36(7):109527
pubmed: 34348131
Blood. 2010 Mar 4;115(9):1678-89
pubmed: 19903895
J Acquir Immune Defic Syndr. 2008 Sep 1;49(1):40-7
pubmed: 18667932

Auteurs

David M Woods (DM)

Division of Medical Oncology, Department of Medicine, University of Colorado School of Medicine, Aurora, CO 80045, USA.

Classifications MeSH