Detection of viral infection in cell lines using ViralCellDetector.

Bacterial infection Cell lines Differentially expressed genes Machine learning RNA-seq data Random Forest Viral infection

Journal

bioRxiv : the preprint server for biology
Titre abrégé: bioRxiv
Pays: United States
ID NLM: 101680187

Informations de publication

Date de publication:
25 Jul 2023
Historique:
pubmed: 7 8 2023
medline: 7 8 2023
entrez: 7 8 2023
Statut: epublish

Résumé

Cell lines are commonly used in research to study biology, including gene expression regulation, cancer progression, and drug responses. However, cross-contaminations with bacteria, mycoplasma, and viruses are common issues in cell line experiments. Detection of bacteria and mycoplasma infections in cell lines is relatively easy but identifying viral infections in cell lines is difficult. Currently, there are no established methods or tools available for detecting viral infections in cell lines. To address this challenge, we developed a tool called ViralCellDetector that detects viruses through mapping RNA-seq data to a library of virus genome. Using this tool, we observed that around 10% of experiments with the MCF7 cell line were likely infected with viruses. Furthermore, to facilitate the detection of samples with unknown sources of viral infection, we identified the differentially expressed genes involved in viral infection from two different cell lines and used these genes in a machine learning approach to classify infected samples based on the host response gene expression biomarkers. Our model reclassifies the infected and non-infected samples with an AUC of 0.91 and an accuracy of 0.93. Overall, our mapping- and marker-based approaches can detect viral infections in any cell line simply based on readily accessible RNA-seq data, allowing researchers to avoid the use of unintentionally infected cell lines in their studies.

Identifiants

pubmed: 37546847
doi: 10.1101/2023.07.21.550094
pmc: PMC10401957
pii:
doi:

Types de publication

Preprint

Langues

eng

Subventions

Organisme : NIGMS NIH HHS
ID : R01 GM134307
Pays : United States

Déclaration de conflit d'intérêts

Conflict of Interest: None

Auteurs

Rama Shankar (R)

Department of Pediatrics and Human Development, College of Human Medicine, Michigan State University, Grand Rapids, MI 49503, USA.

Shreya Paithankar (S)

Department of Pediatrics and Human Development, College of Human Medicine, Michigan State University, Grand Rapids, MI 49503, USA.

Suchir Gupta (S)

Department of Pediatrics and Human Development, College of Human Medicine, Michigan State University, Grand Rapids, MI 49503, USA.

Bin Chen (B)

Department of Pediatrics and Human Development, College of Human Medicine, Michigan State University, Grand Rapids, MI 49503, USA.
Department of Pharmacology and Toxicology, College of Human Medicine, Michigan State University, Grand Rapids, Michigan, USA.
Department of Computer Science and Engineering, College of Engineering, Michigan State University, East Lansing, Michigan, USA.

Classifications MeSH