CirGO: an alternative circular way of visualising gene ontology terms.
Bioinformatics
Data organisation
Gene ontology terms
Python
Visualisation
Journal
BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194
Informations de publication
Date de publication:
18 Feb 2019
18 Feb 2019
Historique:
received:
25
10
2018
accepted:
07
02
2019
entrez:
20
2
2019
pubmed:
20
2
2019
medline:
8
3
2019
Statut:
epublish
Résumé
Prioritisation of gene ontology terms from differential gene expression analyses in a two-dimensional format remains a challenge with exponentially growing data volumes. Typically, gene ontology terms are represented as tree-maps that enclose all data into defined space. However, large datasets make this type of visualisation appear cluttered and busy, and often not informative as some labels are omitted due space limits, especially when published in two-dimensional (2D) figures. Here we present an open source CirGO (Circular Gene Ontology) software that visualises non-redundant two-level hierarchically structured ontology terms from gene expression data in a 2D space. Gene ontology terms based on statistical significance were summarised with a semantic similarity algorithm and grouped by hierarchical clustering. This software visualises the most enriched gene ontology terms in an informative, comprehensive and intuitive format that is achieved by organising data from the most relevant to the least, as well as the appropriate use of colours and supporting information. Additionally, CirGO is an easy to use software that supports researchers with little computational background to present their gene ontology data in a publication ready format. Our easy to use open source CirGO Python software package provides biologists with a succinct presentation of terms and functions that are most represented in a specific gene expression data set in a visually appealing 2D format (e.g. for reporting research results in scientific articles). CirGO is freely available at https://github.com/IrinaVKuznetsova/CirGO.git .
Sections du résumé
BACKGROUND
BACKGROUND
Prioritisation of gene ontology terms from differential gene expression analyses in a two-dimensional format remains a challenge with exponentially growing data volumes. Typically, gene ontology terms are represented as tree-maps that enclose all data into defined space. However, large datasets make this type of visualisation appear cluttered and busy, and often not informative as some labels are omitted due space limits, especially when published in two-dimensional (2D) figures.
RESULTS
RESULTS
Here we present an open source CirGO (Circular Gene Ontology) software that visualises non-redundant two-level hierarchically structured ontology terms from gene expression data in a 2D space. Gene ontology terms based on statistical significance were summarised with a semantic similarity algorithm and grouped by hierarchical clustering. This software visualises the most enriched gene ontology terms in an informative, comprehensive and intuitive format that is achieved by organising data from the most relevant to the least, as well as the appropriate use of colours and supporting information. Additionally, CirGO is an easy to use software that supports researchers with little computational background to present their gene ontology data in a publication ready format.
CONCLUSIONS
CONCLUSIONS
Our easy to use open source CirGO Python software package provides biologists with a succinct presentation of terms and functions that are most represented in a specific gene expression data set in a visually appealing 2D format (e.g. for reporting research results in scientific articles). CirGO is freely available at https://github.com/IrinaVKuznetsova/CirGO.git .
Identifiants
pubmed: 30777018
doi: 10.1186/s12859-019-2671-2
pii: 10.1186/s12859-019-2671-2
pmc: PMC6380029
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
84Subventions
Organisme : National Health and Medical Research Council
ID : APP1154646
Références
Genome Biol. 2003;4(4):R28
pubmed: 12702209
Genome Res. 2003 Sep;13(9):2129-41
pubmed: 12952881
J Biomed Inform. 2008 Apr;41(2):282-92
pubmed: 17921072
Science. 2008 Jun 6;320(5881):1344-9
pubmed: 18451266
Nat Rev Genet. 2008 Jul;9(7):509-15
pubmed: 18475267
Nature. 2008 Jun 26;453(7199):1239-43
pubmed: 18488015
Nat Methods. 2008 Jul;5(7):621-8
pubmed: 18516045
Immunology. 2008 Oct;125(2):154-60
pubmed: 18798919
Bioinformatics. 2009 Jan 15;25(2):288-9
pubmed: 19033274
Nat Protoc. 2009;4(1):44-57
pubmed: 19131956
BMC Bioinformatics. 2009 Feb 03;10:48
pubmed: 19192299
PLoS Comput Biol. 2009 Jul;5(7):e1000443
pubmed: 19649320
PLoS One. 2011;6(7):e21800
pubmed: 21789182
Nucleic Acid Ther. 2012 Aug;22(4):271-4
pubmed: 22830413
Bioinformatics. 2013 Mar 1;29(5):666-8
pubmed: 23297033
J Biomed Inform. 2014 Apr;48:38-53
pubmed: 24269894
Cell Rep. 2016 Aug 16;16(7):1874-90
pubmed: 27498866
IEEE Trans Vis Comput Graph. 2017 Jan;23(1):521-530
pubmed: 27875168
Nucleic Acids Res. 2017 Jan 4;45(D1):D331-D338
pubmed: 27899567
PLoS One. 2017 Dec 21;12(12):e0190152
pubmed: 29267363
Mol Biol Evol. 1987 Jul;4(4):406-25
pubmed: 3447015