Identifying similarities at metabolic pathways with a strategy of Enzymatic Step Sequences.

Comparative genomics Enzymatic Step Sequences (ESS) comparative program Enzymatic step sequences Enzyme commission number KEGG Metabolic pathways

Journal

MethodsX
ISSN: 2215-0161
Titre abrégé: MethodsX
Pays: Netherlands
ID NLM: 101639829

Informations de publication

Date de publication:
2023
Historique:
received: 15 11 2022
accepted: 07 03 2023
entrez: 27 3 2023
pubmed: 28 3 2023
medline: 28 3 2023
Statut: epublish

Résumé

An easy and fast strategy to compare functionally the metabolic maps is described. The KEGG metabolic maps are transformed into linear Enzymatic Step Sequences (ESS) using the Breadth First Search (BFS) algorithm. To do this, the KGML files are retrieved, and directed graph representations are created; where the nodes represent enzymes or enzymatic complexes, and the edges represent a compound, that is the 'product' from one reaction and a 'substrate' for the next. Then, a set of initialization nodes are selected, and used as the root for the construction of the BFS tree. This tree is used as a guide to the construction of the ESS. From each leaf (terminal node), the path is traced backwards until it reaches the root metabolic map and with two or fewer neighbors in the graph. In a second step, the ESS are compared with a Dynamic Programing algorithm, considering an "ad hoc" substitution matrix, and minimizing the global score. The dissimilarity values between two EC numbers ranged from 0 to 1, where 0 indicates similar EC numbers, and 1 indicates different EC numbers. Finally, the alignment is evaluated by using the normalized entropy-based function, considering a threshold of ≤ 0.27 as significant.•The KEGG metabolic maps are transformed into linear Enzymatic Step Sequences (ESS) using the Breadth First Search (BFS) algorithm.•Nodes represent enzymes or enzymatic complexes, and the edges represent a compound, that is 'product' from one reaction and a 'substrate' for the next.•The ESS are compared with a Dynamic Programing algorithm, considering an "ad hoc" substitution matrix, and minimizing the global score.

Identifiants

pubmed: 36970029
doi: 10.1016/j.mex.2023.102118
pii: S2215-0161(23)00119-X
pmc: PMC10034429
doi:

Types de publication

Journal Article

Langues

eng

Pagination

102118

Informations de copyright

© 2023 Published by Elsevier B.V.

Déclaration de conflit d'intérêts

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Références

Comput Struct Biotechnol J. 2015 Apr 09;13:277-85
pubmed: 25973143
BMC Genomics. 2015 Nov 17;16:957
pubmed: 26578309

Auteurs

Augusto Cesar Poot-Hernandez (AC)

Unidad de Bioinformática y Manejo de la Información. Instituto de Fisiología Celular. Universidad Nacional Autónoma de México, Ciudad Universitaria, México, Mexico.

Katya Rodriguez-Vazquez (K)

Departamento de Ingeniería de Sistemas Computacionales y Automatización, Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Ciudad Universitaria, México, Mexico.

Ernesto Perez-Rueda (E)

Instituto de Investigaciones en Matemáticas Aplicadas y en Sistemas, Universidad Nacional Autónoma de México, Unidad Académica del Estado de Yucatán. Mérida, Yucatán. Mexico.

Classifications MeSH