A hepatitis B virus (HBV) sequence variation graph improves sequence alignment and sample-specific consensus sequence construction for genetic analysis of HBV.


Journal

bioRxiv : the preprint server for biology
Titre abrégé: bioRxiv
Pays: United States
ID NLM: 101680187

Informations de publication

Date de publication:
12 Jan 2023
Historique:
pubmed: 31 1 2023
medline: 31 1 2023
entrez: 30 1 2023
Statut: epublish

Résumé

Hepatitis B virus (HBV) remains a global public health concern, with over 250 million individuals living with chronic HBV infection (CHB) and no curative therapy currently available. Viral diversity is associated with CHB pathogenesis and immunological control of infection. Improved methods to characterize the viral genome at both the population and intra-host level could aid drug development efforts. Conventionally, HBV sequencing data are aligned to a linear reference genome and only sequences capable of aligning to the reference are captured for analysis. Reference selection has additional consequences, including sample-specific 'consensus' sequence construction. It remains unclear how to select a reference from available sequences and whether a single reference is sufficient for genetic analyses. Using simulated short-read sequencing data generated from full-length publicly available HBV genome sequences and HBV sequencing data from a longitudinally sampled individual with CHB, we investigate alternative graph-based alignment approaches. We demonstrate that using a phylogenetically representative 'genome graph' for alignment, rather than linear reference sequences, avoids issues of reference ambiguity, improves alignment, and facilitates the construction of sample-specific consensus sequences genetically similar to an individual's infection. Graph-based methods can therefore improve efforts to characterize the genetics of viral pathogens, including HBV, and may have broad implications in host pathogen research.

Identifiants

pubmed: 36711598
doi: 10.1101/2023.01.11.523611
pmc: PMC9882026
pii:
doi:

Types de publication

Preprint

Langues

eng

Subventions

Organisme : NIDA NIH HHS
ID : DP2 DA056130
Pays : United States
Organisme : NIDA NIH HHS
ID : L60 DA056996
Pays : United States

Déclaration de conflit d'intérêts

Declaration of interests None

Auteurs

Dylan Duchen (D)

Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, 21205, USA.

Steven Clipman (S)

Division of Infectious Diseases, Johns Hopkins School of Medicine, Baltimore, MD, 21205, USA.

Candelaria Vergara (C)

Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, 21205, USA.

Chloe L Thio (CL)

Division of Infectious Diseases, Johns Hopkins School of Medicine, Baltimore, MD, 21205, USA.

David L Thomas (DL)

Division of Infectious Diseases, Johns Hopkins School of Medicine, Baltimore, MD, 21205, USA.

Priya Duggal (P)

Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, 21205, USA.

Genevieve L Wojcik (GL)

Department of Epidemiology, Johns Hopkins Bloomberg School of Public Health, Baltimore, MD, 21205, USA.

Classifications MeSH