Functional module detection through integration of single-cell RNA sequencing data with protein-protein interaction networks.


Journal

BMC genomics
ISSN: 1471-2164
Titre abrégé: BMC Genomics
Pays: England
ID NLM: 100965258

Informations de publication

Date de publication:
02 Nov 2020
Historique:
received: 14 11 2019
accepted: 12 10 2020
entrez: 3 11 2020
pubmed: 4 11 2020
medline: 15 5 2021
Statut: epublish

Résumé

Recent advances in single-cell RNA sequencing have allowed researchers to explore transcriptional function at a cellular level. In particular, single-cell RNA sequencing reveals that there exist clusters of cells with similar gene expression profiles, representing different transcriptional states. In this study, we present SCPPIN, a method for integrating single-cell RNA sequencing data with protein-protein interaction networks that detects active modules in cells of different transcriptional states. We achieve this by clustering RNA-sequencing data, identifying differentially expressed genes, constructing node-weighted protein-protein interaction networks, and finding the maximum-weight connected subgraphs with an exact Steiner-tree approach. As case studies, we investigate two RNA-sequencing data sets from human liver spheroids and human adipose tissue, respectively. With SCPPIN we expand the output of differential expressed genes analysis with information from protein interactions. We find that different transcriptional states have different subnetworks of the protein-protein interaction networks significantly enriched which represent biological pathways. In these pathways, SCPPIN identifies proteins that are not differentially expressed but have a crucial biological function (e.g., as receptors) and therefore reveals biology beyond a standard differential expressed gene analysis. The introduced SCPPIN method can be used to systematically analyse differentially expressed genes in single-cell RNA sequencing data by integrating it with protein interaction data. The detected modules that characterise each cluster help to identify and hypothesise a biological function associated to those cells. Our analysis suggests the participation of unexpected proteins in these pathways that are undetectable from the single-cell RNA sequencing data alone. The techniques described here are applicable to other organisms and tissues.

Sections du résumé

BACKGROUND BACKGROUND
Recent advances in single-cell RNA sequencing have allowed researchers to explore transcriptional function at a cellular level. In particular, single-cell RNA sequencing reveals that there exist clusters of cells with similar gene expression profiles, representing different transcriptional states.
RESULTS RESULTS
In this study, we present SCPPIN, a method for integrating single-cell RNA sequencing data with protein-protein interaction networks that detects active modules in cells of different transcriptional states. We achieve this by clustering RNA-sequencing data, identifying differentially expressed genes, constructing node-weighted protein-protein interaction networks, and finding the maximum-weight connected subgraphs with an exact Steiner-tree approach. As case studies, we investigate two RNA-sequencing data sets from human liver spheroids and human adipose tissue, respectively. With SCPPIN we expand the output of differential expressed genes analysis with information from protein interactions. We find that different transcriptional states have different subnetworks of the protein-protein interaction networks significantly enriched which represent biological pathways. In these pathways, SCPPIN identifies proteins that are not differentially expressed but have a crucial biological function (e.g., as receptors) and therefore reveals biology beyond a standard differential expressed gene analysis.
CONCLUSIONS CONCLUSIONS
The introduced SCPPIN method can be used to systematically analyse differentially expressed genes in single-cell RNA sequencing data by integrating it with protein interaction data. The detected modules that characterise each cluster help to identify and hypothesise a biological function associated to those cells. Our analysis suggests the participation of unexpected proteins in these pathways that are undetectable from the single-cell RNA sequencing data alone. The techniques described here are applicable to other organisms and tissues.

Identifiants

pubmed: 33138772
doi: 10.1186/s12864-020-07144-2
pii: 10.1186/s12864-020-07144-2
pmc: PMC7607865
doi:

Substances chimiques

RNA 63231-63-0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

756

Subventions

Organisme : Engineering and Physical Sciences Research Council
ID : EP/R513295/1
Organisme : Engineering and Physical Sciences Research Council
ID : EP/R018472/1
Organisme : Engineering and Physical Sciences Research Council
ID : EP/N014529/1

Références

Nature. 2008 Mar 27;452(7186):423-8
pubmed: 18344981
Cell Syst. 2017 Sep 27;5(3):251-267.e3
pubmed: 28957658
Comput Struct Biotechnol J. 2014 Sep 03;11(18):22-7
pubmed: 25379140
J Biol Chem. 2017 Jun 23;292(25):10444-10454
pubmed: 28473467
Bioinformatics. 2017 Jul 15;33(14):i190-i198
pubmed: 28881986
Nature. 2019 Nov;575(7783):512-518
pubmed: 31597160
Genome Biol. 2015 Nov 02;16:241
pubmed: 26527291
Proc Natl Acad Sci U S A. 2011 Jan 4;108(1):308-13
pubmed: 21173249
Biochim Biophys Acta Mol Basis Dis. 2018 Jun;1864(6 Pt B):2349-2359
pubmed: 29466699
Proteomics. 2006 Jan;6(1):35-40
pubmed: 16281187
Nature. 2001 May 3;411(6833):41-2
pubmed: 11333967
Nephrol Dial Transplant. 2016 Feb;31(2):206-13
pubmed: 25550448
Nat Commun. 2017 Jan 16;8:14049
pubmed: 28091601
OMICS. 2014 Feb;18(2):155-65
pubmed: 24404838
Mol Syst Biol. 2019 Oct;15(10):e9005
pubmed: 31657111
Science. 2015 Jan 23;347(6220):1260419
pubmed: 25613900
Exp Mol Med. 2018 Aug 7;50(8):96
pubmed: 30089861
Nat Rev Genet. 2013 Oct;14(10):719-32
pubmed: 24045689
Nucleic Acids Res. 2002 Jan 1;30(1):207-10
pubmed: 11752295
Nat Commun. 2016 Feb 01;7:10331
pubmed: 26831545
J Proteome Res. 2019 Mar 1;18(3):1218-1227
pubmed: 30592618
Proc Natl Acad Sci U S A. 2012 Feb 28;109(9):3510-5
pubmed: 22308347
Nucleic Acids Res. 2006 Jan 1;34(Database issue):D535-9
pubmed: 16381927
BMC Bioinformatics. 2019 Aug 28;20(1):446
pubmed: 31462221
Cell Syst. 2019 Jan 23;8(1):43-52.e5
pubmed: 30638811
Nucleic Acids Res. 2014 Aug;42(14):8845-60
pubmed: 25053837
Int J Mol Sci. 2015 Dec 29;17(1):
pubmed: 26729094
J Hepatol. 2015 Jan;62(1):156-64
pubmed: 25111176
Nat Commun. 2018 Mar 15;9(1):1090
pubmed: 29545622
Autophagy. 2018;14(12):2083-2103
pubmed: 30081711
Genome Res. 2015 Oct;25(10):1491-8
pubmed: 26430159
Mol Aspects Med. 2018 Feb;59:114-122
pubmed: 28712804
Brief Funct Genomics. 2018 Jul 1;17(4):240-245
pubmed: 29236955
Nat Rev Mol Cell Biol. 2007 Feb;8(2):101-12
pubmed: 17245412
Nat Methods. 2017 Nov;14(11):1083-1086
pubmed: 28991892
Methods Mol Biol. 2015;1346:151-68
pubmed: 26542721
BMC Bioinformatics. 2012 Jul 28;13:182
pubmed: 22838965
Bioinformatics. 2018 Sep 1;34(17):i972-i980
pubmed: 30423088
Bioinformatics. 2008 Jul 1;24(13):i223-31
pubmed: 18586718
Annu Rev Biomed Eng. 2007;9:205-28
pubmed: 17341157
Nat Biotechnol. 2018 Jun;36(5):411-420
pubmed: 29608179
BMC Bioinformatics. 2019 Jan 18;20(1):40
pubmed: 30658573
Nat Rev Neurol. 2017 Sep 29;13(10):612-623
pubmed: 28960209
Nature. 2013 Jun 13;498(7453):236-40
pubmed: 23685454
Nat Commun. 2018 Oct 22;9(1):4383
pubmed: 30348985
Bioinformatics. 2015 May 15;31(10):1632-9
pubmed: 25609797
Science. 2015 Feb 20;347(6224):1257601
pubmed: 25700523
Cancer Inform. 2008;6:257-73
pubmed: 19259413
Nature. 2019 Feb;566(7745):496-502
pubmed: 30787437
Bioinformatics. 2002;18 Suppl 1:S233-40
pubmed: 12169552
Nat Protoc. 2018 Apr;13(4):599-604
pubmed: 29494575
Nature. 2019 Aug;572(7768):199-204
pubmed: 31292543
BMC Syst Biol. 2010 Jul 22;4:100
pubmed: 20649971
PLoS One. 2014 Aug 22;9(8):e104993
pubmed: 25148538
Nature. 2017 Jun 15;546(7658):431-435
pubmed: 28607484
Addiction. 2011 Oct;106(10):1718-24
pubmed: 21819471
Mol Syst Biol. 2019 Jun 19;15(6):e8746
pubmed: 31217225
Bioinformatics. 2018 Mar 15;34(6):994-1000
pubmed: 29112702
FEBS Lett. 2017 Aug;591(15):2213-2225
pubmed: 28524227
Nat Biotechnol. 2015 May;33(5):495-502
pubmed: 25867923

Auteurs

Florian Klimm (F)

Department of Mathematics, Imperial College London, London, SW7 2AZ, UK. f.klimm@gmail.com.
Mitochondrial Biology Unit, University of Cambridge, Cambridge, CB2 0XY, UK. f.klimm@gmail.com.

Enrique M Toledo (EM)

Discovery Technology and Genomics, Novo Nordisk Research Centre Oxford, Oxford, OX3 7FZ, UK.

Thomas Monfeuga (T)

Discovery Technology and Genomics, Novo Nordisk Research Centre Oxford, Oxford, OX3 7FZ, UK.

Fang Zhang (F)

Discovery Technology and Genomics, Novo Nordisk Research Centre Oxford, Oxford, OX3 7FZ, UK.

Charlotte M Deane (CM)

Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK.

Gesine Reinert (G)

Department of Statistics, University of Oxford, Oxford, OX1 3LB, UK.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH