Phased nanopore assembly with Shasta and modular graph phasing with GFAse.


Journal

Genome research
ISSN: 1549-5469
Abbreviated title: Genome Res
Country: United States
NLM ID: 9518021

Publication information

Publication date:
16 Apr 2024
History:
received: 19 07 2023
accepted: 19 03 2024
medline: 17 4 2024
pubmed: 17 4 2024
entrez: 16 4 2024
Status: aheadofprint

Abstract

Reference-free genome phasing is vital for understanding allele inheritance and the impact of single-molecule DNA variation on phenotypes. To achieve thorough phasing across homozygous or repetitive regions of the genome, long-read sequencing technologies are often used to perform phased de novo assembly. As a step toward reducing the cost and complexity of this type of analysis, we describe new methods for accurately phasing Oxford Nanopore Technologies (ONT) sequence data with the Shasta genome assembler and a modular tool for extending phasing to the chromosome scale called GFAse. We test using new variants of ONT PromethION sequencing, including those using proximity ligation, and show that newer, higher accuracy ONT reads substantially improve assembly quality.

Identifiers

pubmed: 38627094
pii: gr.278268.123
doi: 10.1101/gr.278268.123

Publication types

Journal Article

Languages

eng

Citation subsets

IM

Copyright information

© 2024 Lorig-Roach et al.; Published by Cold Spring Harbor Laboratory Press.

Authors

Ryan Lorig-Roach (R)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA; rlorigro@ucsc.edu.

Melissa Meredith (M)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA.

Jean Monlong (J)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA.

Miten Jain (M)

Department of Bioengineering, Department of Physics, Northeastern University, Boston, Massachusetts 02120, USA.

Hugh Olsen (H)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA.

Brandy McNulty (B)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA.

David Porubsky (D)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.

Tessa Montague (T)

The Mortimer B. Zuckerman Mind Brain Behavior Institute, Department of Neuroscience, Columbia University, New York, New York 10027, USA.
Howard Hughes Medical Institute, Columbia University, New York, New York 10032, USA.

Julian Lucas (J)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA.

Chris Condon (C)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA.

Jordan M Eizenga (JM)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA.

Sissel Juul (S)

Oxford Nanopore Technologies Incorporated, New York, New York 10013, USA.

Sean McKenzie (S)

Oxford Nanopore Technologies Incorporated, New York, New York 10013, USA.

Sara E Simmonds (SE)

Chan Zuckerberg Initiative Foundation, Redwood City, California 94063, USA.

Jimin Park (J)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA.

Mobin Asri (M)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA.

Sergey Koren (S)

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland 20894, USA.

Evan Eichler (E)

Department of Genome Sciences, University of Washington School of Medicine, Seattle, Washington 98195, USA.
Howard Hughes Medical Institute, University of Washington, Seattle, Washington 98195, USA.

Richard Axel (R)

The Mortimer B. Zuckerman Mind Brain Behavior Institute, Department of Neuroscience, Columbia University, New York, New York 10027, USA.
Howard Hughes Medical Institute, Columbia University, New York, New York 10032, USA.

Bruce Martin (B)

Chan Zuckerberg Initiative Foundation, Redwood City, California 94063, USA.

Paolo Carnevali (P)

Chan Zuckerberg Initiative Foundation, Redwood City, California 94063, USA; pacarnev@ucsc.edu.

Karen Miga (K)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA.

Benedict Paten (B)

UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, California 95060, USA; bpaten@ucsc.edu.

MeSH classifications