VOGDB-Database of Virus Orthologous Groups.

comparative Genomics genome analysis genome annotation orthologous groups protein families virus genomes

Journal

Viruses
ISSN: 1999-4915
Titre abrégé: Viruses
Pays: Switzerland
ID NLM: 101509722

Informations de publication

Date de publication:
25 Jul 2024
Historique:
received: 01 07 2024
revised: 21 07 2024
accepted: 23 07 2024
medline: 1 9 2024
pubmed: 31 8 2024
entrez: 29 8 2024
Statut: epublish

Résumé

Computational models of homologous protein groups are essential in sequence bioinformatics. Due to the diversity and rapid evolution of viruses, the grouping of protein sequences from virus genomes is particularly challenging. The low sequence similarities of homologous genes in viruses require specific approaches for sequence- and structure-based clustering. Furthermore, the annotation of virus genomes in public databases is not as consistent and up to date as for many cellular genomes. To tackle these problems, we have developed VOGDB, which is a database of virus orthologous groups. VOGDB is a multi-layer database that progressively groups viral genes into groups connected by increasingly remote similarity. The first layer is based on pair-wise sequence similarities, the second layer is based on the sequence profile alignments, and the third layer uses predicted protein structures to find the most remote similarity. VOGDB groups allow for more sensitive homology searches of novel genes and increase the chance of predicting annotations or inferring phylogeny. VOGD B uses all virus genomes from RefSeq and partially reannotates them. VOGDB is updated with every RefSeq release. The unique feature of VOGDB is the inclusion of both prokaryotic and eukaryotic viruses in the same clustering process, which makes it possible to explore old evolutionary relationships of the two groups. VOGDB is freely available at vogdb.org under the CC BY 4.0 license.

Identifiants

pubmed: 39205165
pii: v16081191
doi: 10.3390/v16081191
pii:
doi:

Substances chimiques

Viral Proteins 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : FWF Austrian Science Fund
ID : I1303
Organisme : Marie Skłodowska-Curie Actions Innovative Training Networks grant agreement
ID : 955974 (VIROINF)

Auteurs

Lovro Trgovec-Greif (L)

Centre for Microbiology and Environmental Systems Science, University of Vienna, 1030 Vienna, Austria.
Doctoral School of Microbiology and Environmental Systems Science, University of Vienna, 1030 Vienna, Austria.

Hans-Jörg Hellinger (HJ)

Doctoral School of Microbiology and Environmental Systems Science, University of Vienna, 1030 Vienna, Austria.
Armaments and Defence Technology Agency, Austria.

Jean Mainguy (J)

Genoscope, 91000 Evry Cedex, France.

Alexander Pfundner (A)

Centre for Microbiology and Environmental Systems Science, University of Vienna, 1030 Vienna, Austria.
Doctoral School of Microbiology and Environmental Systems Science, University of Vienna, 1030 Vienna, Austria.

Dmitrij Frishman (D)

Department of Bioinformatics, School of Life Sciences, Technical University Munich, 85350 Freising, Germany.

Michael Kiening (M)

Department of Bioinformatics, School of Life Sciences, Technical University Munich, 85350 Freising, Germany.

Nicole Suzanne Webster (NS)

Australian Institute of Marine Science, PMB no3 Townsville MC, Townsville 4810, Australia.
Institute for Marine and Antarctic Studies, University of Tasmania, Hobart 7000, Australia.
Australian Centre for Ecogenomics, University of Queensland, Brisbane 4072, Australia.

Patrick William Laffy (PW)

Australian Institute of Marine Science, PMB no3 Townsville MC, Townsville 4810, Australia.

Michael Feichtinger (M)

Centre for Microbiology and Environmental Systems Science, University of Vienna, 1030 Vienna, Austria.

Thomas Rattei (T)

Centre for Microbiology and Environmental Systems Science, University of Vienna, 1030 Vienna, Austria.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Animals Hemiptera Insect Proteins Phylogeny Insecticides
Amaryllidaceae Alkaloids Lycoris NADPH-Ferrihemoprotein Reductase Gene Expression Regulation, Plant Plant Proteins
Drought Resistance Gene Expression Profiling Gene Expression Regulation, Plant Gossypium Multigene Family

Classifications MeSH