An improved pig reference genome sequence to enable pig genetics and genomics research.


Journal

GigaScience
ISSN: 2047-217X
Abbreviated title: Gigascience
Country: United States
NLM ID: 101596872

Publication information

Publication date:
01 06 2020
History:
received: 28 10 2019
revised: 12 03 2020
accepted: 22 04 2020
entrez: 17 6 2020
pubmed: 17 6 2020
medline: 5 10 2021
Status: ppublish

Abstract

The domestic pig (Sus scrofa) is important both as a food source and as a biomedical model given its similarity in size, anatomy, physiology, metabolism, pathology, and pharmacology to humans. The draft reference genome (Sscrofa10.2) of a purebred Duroc female pig established using older clone-based sequencing methods was incomplete, and unresolved redundancies, short-range order and orientation errors, and associated misassembled genes limited its utility. We present 2 annotated highly contiguous chromosome-level genome assemblies created with more recent long-read technologies and a whole-genome shotgun strategy, 1 for the same Duroc female (Sscrofa11.1) and 1 for an outbred, composite-breed male (USMARCv1.0). Both assemblies are of substantially higher (>90-fold) continuity and accuracy than Sscrofa10.2. These highly contiguous assemblies plus annotation of a further 11 short-read assemblies provide an unprecedented view of the genetic make-up of this important agricultural and biomedical model species. We propose that the improved Duroc assembly (Sscrofa11.1) become the reference genome for genomic research in pigs.

Identifiers

pubmed: 32543654
pii: 5858065
doi: 10.1093/gigascience/giaa051
pmc: PMC7448572

Publication types

Journal Article
Research Support, N.I.H., Intramural
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.

Languages

eng

Citation subsets

IM

Grants

Agency: Wellcome Trust
ID: WT108749/Z/15/Z
Country: United Kingdom
Agency: Biotechnology and Biological Sciences Research Council
ID: BB/M011461/1
Country: United Kingdom
Agency: Biotechnology and Biological Sciences Research Council
ID: BBS/E/D/20211550
Country: United Kingdom
Agency: Biotechnology and Biological Sciences Research Council
ID: BB/M01844X/1
Country: United Kingdom
Agency: Biotechnology and Biological Sciences Research Council
ID: BB/M011615/1
Country: United Kingdom
Agency: Wellcome Trust
Country: United Kingdom
Agency: Biotechnology and Biological Sciences Research Council
ID: BBS/E/D/10002070
Country: United Kingdom
Agency: Biotechnology and Biological Sciences Research Council
ID: BB/F021372/1
Country: United Kingdom

Copyright information

© The Author(s) 2020. Published by Oxford University Press.

Authors

Amanda Warr (A)

The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK.

Nabeel Affara (N)

Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge CB2 1QP, UK.

Bronwen Aken (B)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK.

Hamid Beiki (H)

Department of Animal Science, 2255 Kildee Hall, Iowa State University, Ames, IA 50011-3150, USA.

Derek M Bickhart (DM)

Dairy Forage Research Center, USDA-ARS, 1925 Linden Drive, Madison, WI 53706, USA.

Konstantinos Billis (K)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK.

William Chow (W)

Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK.

Lel Eory (L)

The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK.

Heather A Finlayson (HA)

The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK.

Paul Flicek (P)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK.

Carlos G Girón (CG)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK.

Darren K Griffin (DK)

School of Biosciences, University of Kent, Giles Lane, Canterbury CT2 7NJ, UK.

Richard Hall (R)

Pacific Biosciences, 1305 O'Brien Drive, Menlo Park, CA 94025, USA.

Greg Hannum (G)

Denovium Inc., San Diego, CA, USA.

Thibaut Hourlier (T)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK.

Kerstin Howe (K)

Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge CB10 1SA, UK.

David A Hume (DA)

The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK.
Mater Research Institute-University of Queensland, Translational Research Institute, Brisbane QLD 4104, Australia.

Osagie Izuogu (O)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK.

Kristi Kim (K)

Pacific Biosciences, 1305 O'Brien Drive, Menlo Park, CA 94025, USA.

Sergey Koren (S)

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA.

Haibou Liu (H)

Department of Animal Science, 2255 Kildee Hall, Iowa State University, Ames, IA 50011-3150, USA.

Nancy Manchanda (N)

Bioinformatics and Computational Biology Program, Iowa State University, 2014 Molecular Biology Building, Ames, IA 50011, USA.

Fergal J Martin (FJ)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK.

Dan J Nonneman (DJ)

USDA-ARS U.S. Meat Animal Research Center, 844 Road 313, Clay Center, NE 68933, USA.

Rebecca E O'Connor (RE)

School of Biosciences, University of Kent, Giles Lane, Canterbury CT2 7NJ, UK.

Adam M Phillippy (AM)

Genome Informatics Section, Computational and Statistical Genomics Branch, National Human Genome Research Institute, National Institutes of Health, 9000 Rockville Pike, Bethesda, MD 20892, USA.

Gary A Rohrer (GA)

USDA-ARS U.S. Meat Animal Research Center, 844 Road 313, Clay Center, NE 68933, USA.

Benjamin D Rosen (BD)

Animal Genomics and Improvement Laboratory, USDA-ARS, 10300 Baltimore Avenue, Beltsville, MD 20705-2350, USA.

Laurie A Rund (LA)

Department of Animal Sciences, University of Illinois, 1201 West Gregory Drive, Urbana, IL 61801, USA.

Carole A Sargent (CA)

Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge CB2 1QP, UK.

Lawrence B Schook (LB)

Department of Animal Sciences, University of Illinois, 1201 West Gregory Drive, Urbana, IL 61801, USA.

Steven G Schroeder (SG)

Animal Genomics and Improvement Laboratory, USDA-ARS, 10300 Baltimore Avenue, Beltsville, MD 20705-2350, USA.

Ariel S Schwartz (AS)

Denovium Inc., San Diego, CA, USA.

Ben M Skinner (BM)

Department of Pathology, University of Cambridge, Tennis Court Road, Cambridge CB2 1QP, UK.

Richard Talbot (R)

Edinburgh Genomics, University of Edinburgh, Charlotte Auerbach Road, Edinburgh EH9 3FL, UK.

Elizabeth Tseng (E)

Pacific Biosciences, 1305 O'Brien Drive, Menlo Park, CA 94025, USA.

Christopher K Tuggle (CK)

Department of Animal Science, 2255 Kildee Hall, Iowa State University, Ames, IA 50011-3150, USA.
Bioinformatics and Computational Biology Program, Iowa State University, 2014 Molecular Biology Building, Ames, IA 50011, USA.

Mick Watson (M)

The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK.

Timothy P L Smith (TPL)

USDA-ARS U.S. Meat Animal Research Center, 844 Road 313, Clay Center, NE 68933, USA.

Alan L Archibald (AL)

The Roslin Institute and Royal (Dick) School of Veterinary Studies, The University of Edinburgh, Easter Bush Campus, Midlothian EH25 9RG, UK.
