VOGDB-Database of Virus Orthologous Groups.
comparative Genomics
genome analysis
genome annotation
orthologous groups
protein families
virus genomes
Journal
Viruses
ISSN: 1999-4915
Titre abrégé: Viruses
Pays: Switzerland
ID NLM: 101509722
Informations de publication
Date de publication:
25 Jul 2024
25 Jul 2024
Historique:
received:
01
07
2024
revised:
21
07
2024
accepted:
23
07
2024
medline:
1
9
2024
pubmed:
31
8
2024
entrez:
29
8
2024
Statut:
epublish
Résumé
Computational models of homologous protein groups are essential in sequence bioinformatics. Due to the diversity and rapid evolution of viruses, the grouping of protein sequences from virus genomes is particularly challenging. The low sequence similarities of homologous genes in viruses require specific approaches for sequence- and structure-based clustering. Furthermore, the annotation of virus genomes in public databases is not as consistent and up to date as for many cellular genomes. To tackle these problems, we have developed VOGDB, which is a database of virus orthologous groups. VOGDB is a multi-layer database that progressively groups viral genes into groups connected by increasingly remote similarity. The first layer is based on pair-wise sequence similarities, the second layer is based on the sequence profile alignments, and the third layer uses predicted protein structures to find the most remote similarity. VOGDB groups allow for more sensitive homology searches of novel genes and increase the chance of predicting annotations or inferring phylogeny. VOGD B uses all virus genomes from RefSeq and partially reannotates them. VOGDB is updated with every RefSeq release. The unique feature of VOGDB is the inclusion of both prokaryotic and eukaryotic viruses in the same clustering process, which makes it possible to explore old evolutionary relationships of the two groups. VOGDB is freely available at vogdb.org under the CC BY 4.0 license.
Identifiants
pubmed: 39205165
pii: v16081191
doi: 10.3390/v16081191
pii:
doi:
Substances chimiques
Viral Proteins
0
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : FWF Austrian Science Fund
ID : I1303
Organisme : Marie Skłodowska-Curie Actions Innovative Training Networks grant agreement
ID : 955974 (VIROINF)