CirGO: an alternative circular way of visualising gene ontology terms.

Bioinformatics Data organisation Gene ontology terms Python Visualisation

Journal

BMC bioinformatics
ISSN: 1471-2105
Titre abrégé: BMC Bioinformatics
Pays: England
ID NLM: 100965194

Informations de publication

Date de publication:
18 Feb 2019
Historique:
received: 25 10 2018
accepted: 07 02 2019
entrez: 20 2 2019
pubmed: 20 2 2019
medline: 8 3 2019
Statut: epublish

Résumé

Prioritisation of gene ontology terms from differential gene expression analyses in a two-dimensional format remains a challenge with exponentially growing data volumes. Typically, gene ontology terms are represented as tree-maps that enclose all data into defined space. However, large datasets make this type of visualisation appear cluttered and busy, and often not informative as some labels are omitted due space limits, especially when published in two-dimensional (2D) figures. Here we present an open source CirGO (Circular Gene Ontology) software that visualises non-redundant two-level hierarchically structured ontology terms from gene expression data in a 2D space. Gene ontology terms based on statistical significance were summarised with a semantic similarity algorithm and grouped by hierarchical clustering. This software visualises the most enriched gene ontology terms in an informative, comprehensive and intuitive format that is achieved by organising data from the most relevant to the least, as well as the appropriate use of colours and supporting information. Additionally, CirGO is an easy to use software that supports researchers with little computational background to present their gene ontology data in a publication ready format. Our easy to use open source CirGO Python software package provides biologists with a succinct presentation of terms and functions that are most represented in a specific gene expression data set in a visually appealing 2D format (e.g. for reporting research results in scientific articles). CirGO is freely available at https://github.com/IrinaVKuznetsova/CirGO.git .

Sections du résumé

BACKGROUND BACKGROUND
Prioritisation of gene ontology terms from differential gene expression analyses in a two-dimensional format remains a challenge with exponentially growing data volumes. Typically, gene ontology terms are represented as tree-maps that enclose all data into defined space. However, large datasets make this type of visualisation appear cluttered and busy, and often not informative as some labels are omitted due space limits, especially when published in two-dimensional (2D) figures.
RESULTS RESULTS
Here we present an open source CirGO (Circular Gene Ontology) software that visualises non-redundant two-level hierarchically structured ontology terms from gene expression data in a 2D space. Gene ontology terms based on statistical significance were summarised with a semantic similarity algorithm and grouped by hierarchical clustering. This software visualises the most enriched gene ontology terms in an informative, comprehensive and intuitive format that is achieved by organising data from the most relevant to the least, as well as the appropriate use of colours and supporting information. Additionally, CirGO is an easy to use software that supports researchers with little computational background to present their gene ontology data in a publication ready format.
CONCLUSIONS CONCLUSIONS
Our easy to use open source CirGO Python software package provides biologists with a succinct presentation of terms and functions that are most represented in a specific gene expression data set in a visually appealing 2D format (e.g. for reporting research results in scientific articles). CirGO is freely available at https://github.com/IrinaVKuznetsova/CirGO.git .

Identifiants

pubmed: 30777018
doi: 10.1186/s12859-019-2671-2
pii: 10.1186/s12859-019-2671-2
pmc: PMC6380029
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

84

Subventions

Organisme : National Health and Medical Research Council
ID : APP1154646

Références

Genome Biol. 2003;4(4):R28
pubmed: 12702209
Genome Res. 2003 Sep;13(9):2129-41
pubmed: 12952881
J Biomed Inform. 2008 Apr;41(2):282-92
pubmed: 17921072
Science. 2008 Jun 6;320(5881):1344-9
pubmed: 18451266
Nat Rev Genet. 2008 Jul;9(7):509-15
pubmed: 18475267
Nature. 2008 Jun 26;453(7199):1239-43
pubmed: 18488015
Nat Methods. 2008 Jul;5(7):621-8
pubmed: 18516045
Immunology. 2008 Oct;125(2):154-60
pubmed: 18798919
Bioinformatics. 2009 Jan 15;25(2):288-9
pubmed: 19033274
Nat Protoc. 2009;4(1):44-57
pubmed: 19131956
BMC Bioinformatics. 2009 Feb 03;10:48
pubmed: 19192299
PLoS Comput Biol. 2009 Jul;5(7):e1000443
pubmed: 19649320
PLoS One. 2011;6(7):e21800
pubmed: 21789182
Nucleic Acid Ther. 2012 Aug;22(4):271-4
pubmed: 22830413
Bioinformatics. 2013 Mar 1;29(5):666-8
pubmed: 23297033
J Biomed Inform. 2014 Apr;48:38-53
pubmed: 24269894
Cell Rep. 2016 Aug 16;16(7):1874-90
pubmed: 27498866
IEEE Trans Vis Comput Graph. 2017 Jan;23(1):521-530
pubmed: 27875168
Nucleic Acids Res. 2017 Jan 4;45(D1):D331-D338
pubmed: 27899567
PLoS One. 2017 Dec 21;12(12):e0190152
pubmed: 29267363
Mol Biol Evol. 1987 Jul;4(4):406-25
pubmed: 3447015

Auteurs

Irina Kuznetsova (I)

Harry Perkins Institute of Medical Research, Nedlands, Western Australia, 6009, Australia. irina.kuznetsova@perkins.uwa.edu.au.

Artur Lugmayr (A)

Visualisation and Interactive Media (VisLab), Curtin University, Perth, 6102, Australia.

Stefan J Siira (SJ)

Harry Perkins Institute of Medical Research, Nedlands, Western Australia, 6009, Australia.

Oliver Rackham (O)

Harry Perkins Institute of Medical Research, Nedlands, Western Australia, 6009, Australia.
School of Molecular Sciences, University of Western Australia, Nedlands, 6009, Australia.

Aleksandra Filipovska (A)

Harry Perkins Institute of Medical Research, Nedlands, Western Australia, 6009, Australia. aleksandra.filipovska@perkins.uwa.edu.au.
School of Molecular Sciences, University of Western Australia, Nedlands, 6009, Australia. aleksandra.filipovska@perkins.uwa.edu.au.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH