Family-Free Genome Comparison.

Common intervals Conserved adjacencies Double-cut-and-join Gene family-free Gene order analysis Gene orthology inference Genome distance Genome median Genome similarity

Journal

Methods in molecular biology (Clifton, N.J.)
ISSN: 1940-6029
Titre abrégé: Methods Mol Biol
Pays: United States
ID NLM: 9214969

Informations de publication

Date de publication:
2024
Historique:
medline: 31 5 2024
pubmed: 31 5 2024
entrez: 31 5 2024
Statut: ppublish

Résumé

The comparison of large-scale genome structures across distinct species offers valuable insights into the species' phylogeny, genome organization, and gene associations. In this chapter, we review the family-free genome comparison tool FFGC that, relying on built-in interfaces with a sequence comparison tool (either BLAST+ or DIAMOND) and with an ILP solver (either CPLEX or Gurobi), provides several methods for analyses that do not require prior classification of genes across the studied genomes. Taking annotated genome sequences as input, FFGC is a complete workflow for genome comparison allowing not only the computation of measures of similarity and dissimilarity but also the inference of gene families, simultaneously based on sequence similarities and large-scale genomic features.

Identifiants

pubmed: 38819556
doi: 10.1007/978-1-0716-3838-5_3
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

57-72

Informations de copyright

© 2024. The Author(s), under exclusive license to Springer Science+Business Media, LLC, part of Springer Nature.

Références

Braga MDV, Chauve C, Doerr D, Jahn K, Stoye J, Thévenin A, Wittler R (2013) The potential of family-free genome comparison. In: Models and Algorithms for Genome Evolution. Volume 19 of Comp. Biol. Springer London, pp 287–323
Doerr D, Stoye J, Böcker S, Jahn K (2014) Identifying gene clusters by discovering common intervals in indeterminate strings. BMC Genom 15(Suppl 6):S2
doi: 10.1186/1471-2164-15-S6-S2
Doerr D, Thévenin A, Stoye J (2012) Gene family assignment-free comparative genomics. BMC Bioinform 13(Suppl 19):S3
doi: 10.1186/1471-2105-13-S19-S3
Doerr D, Balaban M, Feijão P, Chauve C (2017) The gene family-free median of three. Algorithms Mol Biol 12(1):14
doi: 10.1186/s13015-017-0106-z pubmed: 28559921 pmcid: 5446766
Martinez FV, Feijão P, Braga MDV, Stoye J (2015) On the family-free DCJ distance and similarity. Algorithms Mol Biol 10(1):13
doi: 10.1186/s13015-015-0041-9 pubmed: 25859276 pmcid: 4391664
Rubert DP, Martinez FV, Braga MDV (2021) Natural family-free genomic distance. Algorithms Mol Biol 16(4)
Rubert DP, Doerr D, Braga MDV (2021) The potential of family-free rearrangements towards gene orthology inference. J Bioinform Comput Biol 19(6):2140014
doi: 10.1142/S021972002140014X pubmed: 34775922
Rubert DP, Braga MDV (2023) Efficient gene orthology inference via large-scale rearrangements. Algorithms Mol Biol, 18(14)
Uno T, Yagiura M (2000) Fast algorithms to enumerate all common intervals of two permutations. Algorithmica 26(2):290–309
doi: 10.1007/s004539910014
Yancopoulos S, Attie O, Friedberg R (2005) Efficient sorting of genomic permutations by translocation, inversion and block interchange. Bioinformatics 21(16):3340–3346
doi: 10.1093/bioinformatics/bti535 pubmed: 15951307
Bergeron A, Mixtacki J, Stoye J (2006) A unifying view of genome rearrangements. In: Proc. of WABI 2006. Volume 4175 of LNCS, pp 163–173
Braga MDV, Willing E, Stoye J (2011) Double cut and join with insertions and deletions. J Comput Biol 18(9):1167–1184
doi: 10.1089/cmb.2011.0118 pubmed: 21899423
van Dongen S (2008) Graph clustering via a discrete uncoupling process. SIAM J Matrix Anal Appl 30(1):121–141
doi: 10.1137/040608635
Buchfink B, Xie C, Huson DH (2015) Fast and sensitive protein alignment using DIAMOND. Nat Methods 12:59–60
doi: 10.1038/nmeth.3176 pubmed: 25402007
Camacho C, Coulouris G, Avagyan V, Ma N, Papadopoulos J, Bealer K, Madden TL (2008) BLAST+: architecture and applications. BMC Bioinform 10:421–421
doi: 10.1186/1471-2105-10-421
Pesquita C, Faria D, Bastos H, Ferreira AE, Falcão AO, Couto FM (2008) Metrics for GO based protein semantic similarity: a systematic evaluation. BMC Bioinform 9(Suppl 5):S4
doi: 10.1186/1471-2105-9-S5-S4
Lechner M, Findeiß S, Steiner L, Marz M, Stadler PF, Prohaska SJ (2011) Proteinortho: Detection of (co-)orthologs in large-scale analysis. BMC Bioinform 12:124
doi: 10.1186/1471-2105-12-124

Auteurs

Marilia D V Braga (MDV)

Faculty of Technology and Center for Biotechnology, Bielefeld University, Bielefeld, Germany.

Daniel Doerr (D)

Department for Endocrinology and Diabetology, Medical Faculty and University Hospital Düsseldorf, German Diabetes Center (DDZ), Leibniz Institute for Diabetes Research, and Center for Digital Medicine, Heinrich Heine University, Düsseldorf, Germany.

Diego P Rubert (DP)

Faculdade de Computacão, Universidade Federal de Mato Grosso do Sul, Campo Grande, MS, Brazil.

Jens Stoye (J)

Faculty of Technology and Center for Biotechnology, Bielefeld University, Bielefeld, Germany. jens.stoye@uni-bielefeld.de.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH