Linearization of genome sequence graphs revisited.

Bioinformatics Computer science Genomics

Journal

iScience
ISSN: 2589-0042
Titre abrégé: iScience
Pays: United States
ID NLM: 101724038

Informations de publication

Date de publication:
23 Jul 2021
Historique:
received: 23 03 2021
revised: 21 05 2021
accepted: 15 06 2021
entrez: 19 7 2021
pubmed: 20 7 2021
medline: 20 7 2021
Statut: epublish

Résumé

The need to include the genetic variation within a population into a reference genome led to the concept of a genome sequence graph. Nodes of such a graph are labeled with DNA sequences occurring in represented genomes. Due to double-stranded nature of DNA, each node may be oriented in one of two possible ways, resulting in marking one end of the labeling sequence as in-side and the other as out-side. Edges join pairs of sides and reflect adjacency between node sequences in genomes constituting the graph. Linearization of a sequence graph aims at orienting and ordering graph nodes in a way that makes it more efficient for visualization and further analysis, e.g. access and traversal. We propose a new linearization algorithm, called ALIBI - Algorithm for Linearization by Incremental graph BuIlding. The evaluation shows that ALIBI is computationally very efficient and generates high-quality results.

Identifiants

pubmed: 34278263
doi: 10.1016/j.isci.2021.102755
pii: S2589-0042(21)00723-9
pmc: PMC8264155
doi:

Types de publication

Journal Article

Langues

eng

Pagination

102755

Informations de copyright

© 2021 The Authors.

Déclaration de conflit d'intérêts

The authors declare no competing interests.

Références

Brief Bioinform. 2018 Jan 1;19(1):118-135
pubmed: 27769991
Genome Res. 2017 May;27(5):665-676
pubmed: 28360232
Nat Biotechnol. 2018 Oct;36(9):875-879
pubmed: 30125266
G3 (Bethesda). 2015 Mar 17;5(5):931-41
pubmed: 25787242
BMC Bioinformatics. 2014 Apr 09;15:99
pubmed: 24712884
Bioinformatics. 2013 Jul 01;29(13):1679-81
pubmed: 23620362
BMC Genomics. 2020 Apr 16;21(Suppl 2):274
pubmed: 32299360
J Comput Biol. 2018 Jul;25(7):664-676
pubmed: 29792514
Immunogenetics. 2008 Jan;60(1):1-18
pubmed: 18193213

Auteurs

Anna Lisiecka (A)

Institute of Informatics, University of Warsaw, Banacha 2, 02-097 Warsaw, Poland.

Norbert Dojer (N)

Institute of Informatics, University of Warsaw, Banacha 2, 02-097 Warsaw, Poland.

Classifications MeSH