Facilitating accessible, rapid, and appropriate processing of ancient metagenomic data with AMDirT.


Journal

F1000Research
ISSN: 2046-1402
Titre abrégé: F1000Res
Pays: England
ID NLM: 101594320

Informations de publication

Date de publication:
2023
Historique:
accepted: 07 05 2024
medline: 12 9 2024
pubmed: 12 9 2024
entrez: 12 9 2024
Statut: epublish

Résumé

Access to sample-level metadata is important when selecting public metagenomic sequencing datasets for reuse in new biological analyses. The Standards, Precautions, and Advances in Ancient Metagenomics community (SPAAM, https://spaam-community.org) has previously published AncientMetagenomeDir, a collection of curated and standardised sample metadata tables for metagenomic and microbial genome datasets generated from ancient samples. However, while sample-level information is useful for identifying relevant samples for inclusion in new projects, Next Generation Sequencing (NGS) library construction and sequencing metadata are also essential for appropriately reprocessing ancient metagenomic data. Currently, recovering information for downloading and preparing such data is difficult when laboratory and bioinformatic metadata is heterogeneously recorded in prose-based publications. Through a series of community-based hackathon events, AncientMetagenomeDir was updated to provide standardised library-level metadata of existing and new ancient metagenomic samples. In tandem, the companion tool 'AMDirT' was developed to facilitate rapid data filtering and downloading of ancient metagenomic data, as well as improving automated metadata curation and validation for AncientMetagenomeDir. AncientMetagenomeDir was extended to include standardised metadata of over 6000 ancient metagenomic libraries. The companion tool 'AMDirT' provides both graphical- and command-line interface based access to such metadata for users from a wide range of computational backgrounds. We also report on errors with metadata reporting that appear to commonly occur during data upload and provide suggestions on how to improve the quality of data sharing by the community. Together, both standardised metadata reporting and tooling will help towards easier incorporation and reuse of public ancient metagenomic datasets into future analyses.

Sections du résumé

Background UNASSIGNED
Access to sample-level metadata is important when selecting public metagenomic sequencing datasets for reuse in new biological analyses. The Standards, Precautions, and Advances in Ancient Metagenomics community (SPAAM, https://spaam-community.org) has previously published AncientMetagenomeDir, a collection of curated and standardised sample metadata tables for metagenomic and microbial genome datasets generated from ancient samples. However, while sample-level information is useful for identifying relevant samples for inclusion in new projects, Next Generation Sequencing (NGS) library construction and sequencing metadata are also essential for appropriately reprocessing ancient metagenomic data. Currently, recovering information for downloading and preparing such data is difficult when laboratory and bioinformatic metadata is heterogeneously recorded in prose-based publications.
Methods UNASSIGNED
Through a series of community-based hackathon events, AncientMetagenomeDir was updated to provide standardised library-level metadata of existing and new ancient metagenomic samples. In tandem, the companion tool 'AMDirT' was developed to facilitate rapid data filtering and downloading of ancient metagenomic data, as well as improving automated metadata curation and validation for AncientMetagenomeDir.
Results UNASSIGNED
AncientMetagenomeDir was extended to include standardised metadata of over 6000 ancient metagenomic libraries. The companion tool 'AMDirT' provides both graphical- and command-line interface based access to such metadata for users from a wide range of computational backgrounds. We also report on errors with metadata reporting that appear to commonly occur during data upload and provide suggestions on how to improve the quality of data sharing by the community.
Conclusions UNASSIGNED
Together, both standardised metadata reporting and tooling will help towards easier incorporation and reuse of public ancient metagenomic datasets into future analyses.

Identifiants

pubmed: 39262445
doi: 10.12688/f1000research.134798.2
pmc: PMC11387932
doi:

Substances chimiques

DNA, Ancient 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

926

Informations de copyright

Copyright: © 2024 Borry M et al.

Déclaration de conflit d'intérêts

No competing interests were disclosed.

Auteurs

Maxime Borry (M)

Cluster of Excellence "Balance of the Microverse", Leibniz Institute for Natural Product Research and Infection Biology Hans Knöll Institute, Adolf-Reichwein-Straße 23, Jena, Thuringia, 07745, Germany.
Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.

Adrian Forsythe (A)

Department of Animal Zoology, Uppsala Universitet, Norbyvägen 18D, Uppsala, 752 36, Sweden.

Aida Andrades Valtueña (A)

Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.

Alexander Hübner (A)

Cluster of Excellence "Balance of the Microverse", Leibniz Institute for Natural Product Research and Infection Biology Hans Knöll Institute, Adolf-Reichwein-Straße 23, Jena, Thuringia, 07745, Germany.
Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.
Associated Research Group of Archaeogenetics, Leibniz Institute for Natural Product Research and Infection Biology Hans Knöll Institute, Adolf-Reichwein-Straße 23, Jena, Thuringia, 07745, Germany.

Anan Ibrahim (A)

Department of Paleobiotechnology, Leibniz Institute for Natural Product Research and Infection Biology Hans Knöll Institute, Adolf-Reichwein-Straße 23, Jena, Thuringia, 07745, Germany.

Andrea Quagliariello (A)

Department of Comparative Biomedicine and Food Science, Universita degli Studi di Padova, Viale dell'Università 16, Legnaro, Padova, 350250, Italy.

Anna E White (AE)

Section for Molecular Ecology and Evolution, Globe Institute, Faculty of Health and Medical Sciences, Københavns Universitet, Øster Farimagsgade 5, Copenhagen K, 1353, Denmark.
BioArCh, Department of Archaeology, University of York, York, England, YO10 5DD, UK.

Arthur Kocher (A)

Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.
Transmission, Infection, Diversification and Evolution Group, Max Planck Institute for Geoanthropology, Kahlaische Str. 10, Jena, Thuringia, 07745, Germany.

Åshild J Vågene (ÅJ)

Section for Hologenomics, Globe Institute, Faculty of Health and Medical Sciences, Københavns Universitet, Oester Voldgade 44747, Copenhagen K, 1350, Denmark.

Bjørn Peare Bartholdy (BP)

Department of Archaeological Sciences, Universiteit Leiden, Einsteinweg 2, Leiden, 2333 CC, The Netherlands.

Diāna Spurīte (D)

Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.
Institute of Ecology and Evolution, Friedrich-Schiller-Universität Jena, Jena, Thuringia, 07743, Germany.

Gabriel Yaxal Ponce-Soto (GY)

Microbial Paleogenomics Unit, Institut Pasteur, Université Paris Cité, CNRS UMR 2000, Rue du Docteur Roux 25-28, Paris, Île-de-France, F-75015, France.

Gunnar Neumann (G)

Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.
Max Planck-Harvard Research Center for the Archaeoscience of the Ancient Mediterranean (MHAAM), Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.

I-Ting Huang (IT)

Department of Organismic and Evolutionary Biology, Harvard University, 26 Oxford St., Cambridge, Massachusetts, 02138, USA.

Ian Light (I)

Max Planck Institute for Infection Biology, Virchowweg 12, Berlin, Berlin, 10117, Germany.

Irina M Velsko (IM)

Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.

Iseult Jackson (I)

SFI Centre for Research Training in Genomics Data Science, University of Galway, Galway, H91 TK33, Ireland.
Smurfit Institute of Genetics, The University of Dublin Trinity College, Dublin, Leinster, D02 VF25, Ireland.

Jasmin Frangenberg (J)

Department of Paleobiotechnology, Leibniz Institute for Natural Product Research and Infection Biology Hans Knöll Institute, Adolf-Reichwein-Straße 23, Jena, Thuringia, 07745, Germany.

Javier G Serrano (JG)

Department of Biochemistry, Microbiology, Cell Biology and Genetics, Universidad de La Laguna, San Cristóbal de La Laguna, Santa Cruz de Tenerife, 38200, Spain.

Julien Fumey (J)

Microbial Paleogenomics Unit, Institut Pasteur, Université Paris Cité, CNRS UMR 2000, Rue du Docteur Roux 25-28, Paris, Île-de-France, F-75015, France.
Bioinformatics and Biostatistics Hub, Institut Pasteur, Université Paris Cité, Rue du Docteur Roux 25-28, Paris, Île-de-France, F-75015, France.

Kadir T Özdoğan (KT)

Animal Ecology, Wageningen Environmental Research, P.O box 47, Wageningen, Gelderland, 6700 AA, The Netherlands.
Department of History and Art History, Universiteit Utrecht, Drift 6, Utrecht, Utrecht, 3512 BS, The Netherlands.

Kelly E Blevins (KE)

Center for Bioarchaeological Research, Arizona State University, Candy Mall, Tempe, Arizona, 85281, USA.
Department of Archaeology, Durham University, South Road, Durham, County Durham, England, DH1 3LE, UK.

Kevin G Daly (KG)

Smurfit Institute of Genetics, The University of Dublin Trinity College, Dublin, Leinster, D02 VF25, Ireland.

Maria Lopopolo (M)

Microbial Paleogenomics Unit, Institut Pasteur, Université Paris Cité, CNRS UMR 2000, Rue du Docteur Roux 25-28, Paris, Île-de-France, F-75015, France.

Markella Moraitou (M)

Institute of Ecology and Evolution, School of Biological Sciences, The University of Edinburgh, Charlotte Auerbach Road, Edinburgh, Scotland, EH9 3FL, UK.

Megan Michel (M)

Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.
Max Planck-Harvard Research Center for the Archaeoscience of the Ancient Mediterranean (MHAAM), Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.
Department of Human Evolutionary Biology, Harvard University, Divinity Avenue 11, Cambridge, Massachusetts, 02138, USA.

Meriam van Os (M)

Department of Anatomy, University of Otago, 270 Great King St, Dunedin, Otago, 9016, New Zealand.

Miriam J Bravo-Lopez (MJ)

International Laboratory for Human Genome Research (LIIGH), Universidad Nacional Autonoma de Mexico, La Mesa 3001, Juriquilla, Queretaro, 76230, Mexico.
Center for Genomic Sciences (CCG), Universidad Nacional Autonoma de Mexico, Cuernavaca, Morelos, 62210, Mexico.

Mohamed S Sarhan (MS)

Institute for Mummy Studies, Eurac Research, Drususallee 1, Bolzano/Bozen, Autonome Provinz Bozen, 39100, Italy.
Centre for Integrative Biology (CIBIO), Universita degli Studi di Trento, Via Sommarive 9, Povo, Trentino, 38123, Italy.

Nihan D Dagtas (ND)

Department of Anatomy and Anthropology and Department of Human Molecular Genetics and Biochemistry, Faculty of Medicine, Tel Aviv University, Ramat Aviv, Tel Aviv-Yafo, 69978, Israel.

Nikolay Oskolkov (N)

National Bioinformatics Infrastructure Sweden, Science for Life Laboratory, Tomtebodavägen 23, Stockholm, 17165, Sweden.
Department of Biology, Lunds Universitet, Sölvegatan 35, Lund, 223 62, Sweden.

Olivia S Smith (OS)

Department of Integrative Biology, The University of Texas at Austin, Speedway 2415, Austin, Texas, 78712, USA.

Ophélie Lebrasseur (O)

Instituto Nacional de Antropología y Pensamiento Latinoamericano, 3 de Febrero 1370 (1426), Ciudad Autónoma de Buenos Aires, C1426BJN CABA, Argentina.
Center for Anthropobiology and Genomics of Toulouse, CNRS/Universite Toulouse III Paul Sabatier, Allées Jules Guesde 37, Toulouse, Occitanie, 31000, France.

Piotr Rozwalak (P)

Department of Computational Biology, Adam Mickiewicz University, Poznań, Uniwersytetu Poznanskiego 6, Poznań, Wielkopolska, 61-614, Poland.

Raphael Eisenhofer (R)

Center for Evolutionary Hologenomics, Globe Institute, Københavns Universitet, ester Voldgade 44747, Copenhagen, Copenhagen K, 1350, Denmark.

Sally Wasef (S)

Defence Genomics, Centre for Genomics and Personalised Health, Queensland University of Technology, Musk Ave 60, Kelvin Grove, Queensland, 4059, Australia.

Shreya L Ramachandran (SL)

Department of Human Genetics, The University of Chicago, E. 58th St. 920, Chicago, Illinois, 60637, USA.

Valentina Vanghi (V)

Department of Anatomy and Anthropology and Department of Human Molecular Genetics and Biochemistry, Faculty of Medicine, Tel Aviv University, Ramat Aviv, Tel Aviv-Yafo, 69978, Israel.

Christina Warinner (C)

Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.
Associated Research Group of Archaeogenetics, Leibniz Institute for Natural Product Research and Infection Biology Hans Knöll Institute, Adolf-Reichwein-Straße 23, Jena, Thuringia, 07745, Germany.
Department of Anthropology, Harvard University, Divinity Avenue 11, Cambridge, Massachusetts, 02138, USA.
Faculty of Biological Sciences, Institute of Microbiology, Friedrich-Schiller-Universität Jena, Neugasse 25, Jena, Thuringia, 07743, Germany.

James A Fellows Yates (JA)

Department of Archaeogenetics, Max Planck Institute for Evolutionary Anthropology, Deutscher Pl. 6, Leipzig, Saxony, 04103, Germany.
Associated Research Group of Archaeogenetics, Leibniz Institute for Natural Product Research and Infection Biology Hans Knöll Institute, Adolf-Reichwein-Straße 23, Jena, Thuringia, 07745, Germany.
Department of Paleobiotechnology, Leibniz Institute for Natural Product Research and Infection Biology Hans Knöll Institute, Adolf-Reichwein-Straße 23, Jena, Thuringia, 07745, Germany.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH