Automated identification of sequence-tailored Cas9 proteins using massive metagenomic data.


Journal

Nature communications
ISSN: 2041-1723
Titre abrégé: Nat Commun
Pays: England
ID NLM: 101528555

Informations de publication

Date de publication:
29 10 2022
Historique:
received: 31 08 2022
accepted: 17 10 2022
pubmed: 31 10 2022
medline: 2 11 2022
entrez: 30 10 2022
Statut: epublish

Résumé

The identification of the protospacer adjacent motif (PAM) sequences of Cas9 nucleases is crucial for their exploitation in genome editing. Here we develop a computational pipeline that was used to interrogate a massively expanded dataset of metagenome and virome assemblies for accurate and comprehensive PAM predictions. This procedure allows the identification and isolation of sequence-tailored Cas9 nucleases by using the target sequence as bait. As proof of concept, starting from the disease-causing mutation P23H in the RHO gene, we find, isolate and experimentally validate a Cas9 which uses the mutated sequence as PAM. Our PAM prediction pipeline will be instrumental to generate a Cas9 nuclease repertoire responding to any PAM requirement.

Identifiants

pubmed: 36309502
doi: 10.1038/s41467-022-34213-9
pii: 10.1038/s41467-022-34213-9
pmc: PMC9617884
doi:

Substances chimiques

CRISPR-Associated Protein 9 EC 3.1.-
RNA, Guide 0
Endonucleases EC 3.1.-

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

6474

Subventions

Organisme : NCI NIH HHS
ID : U01 CA230551
Pays : United States

Informations de copyright

© 2022. The Author(s).

Références

PeerJ. 2019 Jul 26;7:e7359
pubmed: 31388474
Nucleic Acids Res. 2014 Dec 16;42(22):e168
pubmed: 25300484
Microbiology (Reading). 2009 Mar;155(Pt 3):733-740
pubmed: 19246744
J Bacteriol. 2008 Feb;190(4):1401-12
pubmed: 18065539
Nature. 2020 Feb;578(7794):229-236
pubmed: 32051598
Nat Biotechnol. 2013 Sep;31(9):827-32
pubmed: 23873081
Nucleic Acids Res. 2022 Apr 8;50(6):3523-3534
pubmed: 35258601
Biotechnol J. 2018 Sep;13(9):e1700595
pubmed: 30076736
Genome Res. 2017 May;27(5):824-834
pubmed: 28298430
Genome Biol. 2021 Jul 14;22(1):209
pubmed: 34261503
PLoS Genet. 2013;9(3):e1003312
pubmed: 23516369
PeerJ. 2016 Oct 18;4:e2584
pubmed: 27781170
Cell. 2021 Feb 18;184(4):1098-1109.e9
pubmed: 33606979
Nature. 2015 Mar 12;519(7542):199-202
pubmed: 25707807
Genome Biol. 2016 Jun 20;17(1):132
pubmed: 27323842
Trends Genet. 2021 Dec;37(12):1053-1055
pubmed: 34563399
Nature. 2015 Apr 9;520(7546):186-91
pubmed: 25830891
Mol Biol Evol. 2013 Apr;30(4):772-80
pubmed: 23329690
Bioinformatics. 2015 May 15;31(10):1674-6
pubmed: 25609793
Methods Enzymol. 2019;616:219-240
pubmed: 30691644
Nat Biotechnol. 2019 Dec;37(12):1408-1412
pubmed: 31748692
Science. 2020 Apr 17;368(6488):290-296
pubmed: 32217751
Nat Biotechnol. 2020 Jul;38(7):824-844
pubmed: 32572269
Nat Commun. 2021 Jan 22;12(1):555
pubmed: 33483498
PLoS Pathog. 2020 Mar 9;16(3):e1008344
pubmed: 32150575
RNA Biol. 2013 May;10(5):817-27
pubmed: 23492433
Genome Biol. 2021 Sep 30;22(1):281
pubmed: 34593010
Bioinformatics. 2020 Apr 1;36(7):2272-2274
pubmed: 31821414
Bioinformatics. 2010 Oct 1;26(19):2460-1
pubmed: 20709691
PLoS One. 2019 Aug 22;14(8):e0221068
pubmed: 31437182
RNA Biol. 2013 May;10(5):891-9
pubmed: 23403393
CRISPR J. 2020 Dec;3(6):462-469
pubmed: 33275853
Nat Microbiol. 2021 Jul;6(7):960-970
pubmed: 34168315
Orphanet J Rare Dis. 2006 Oct 11;1:40
pubmed: 17032466
Nat Methods. 2012 Mar 04;9(4):357-9
pubmed: 22388286
Nat Rev Microbiol. 2020 Feb;18(2):67-83
pubmed: 31857715
Nature. 2010 Nov 4;468(7320):67-71
pubmed: 21048762
PLoS One. 2010 Mar 10;5(3):e9490
pubmed: 20224823
Cell. 2019 Jan 24;176(3):649-662.e20
pubmed: 30661755
Science. 2012 Aug 17;337(6096):816-21
pubmed: 22745249
Nucleic Acids Res. 2015 Jan;43(Database issue):D571-7
pubmed: 25428358
Nucleic Acids Res. 2011 Aug;39(14):5845-52
pubmed: 21478170
Nat Commun. 2020 Nov 2;11(1):5512
pubmed: 33139742
Proc Natl Acad Sci U S A. 1991 Oct 15;88(20):9370-4
pubmed: 1833777
BMC Bioinformatics. 2015 Nov 17;16:387
pubmed: 26577052
Nucleic Acids Res. 2004 Mar 19;32(5):1792-7
pubmed: 15034147
Nucleic Acids Res. 2018 Jan 4;46(D1):D1062-D1067
pubmed: 29165669
Sci Adv. 2018 Oct 24;4(10):eaau0766
pubmed: 30397647
J Mol Biol. 1990 Oct 5;215(3):403-10
pubmed: 2231712
Nucleic Acids Res. 2018 Jan 4;46(D1):D41-D47
pubmed: 29140468
Nature. 2016 Dec 1;540(7631):144-149
pubmed: 27851729

Auteurs

Matteo Ciciani (M)

Department of Computational, Cellular and Integrative Biology, University of Trento, Trento, Italy.

Michele Demozzi (M)

Department of Computational, Cellular and Integrative Biology, University of Trento, Trento, Italy.

Eleonora Pedrazzoli (E)

Department of Computational, Cellular and Integrative Biology, University of Trento, Trento, Italy.

Elisabetta Visentin (E)

Department of Computational, Cellular and Integrative Biology, University of Trento, Trento, Italy.

Laura Pezzè (L)

Alia Therapeutics, Trento, Italy.

Lorenzo Federico Signorini (LF)

Department of Computational, Cellular and Integrative Biology, University of Trento, Trento, Italy.
Shmunis School of Biomedicine and Cancer research, Tel Aviv University, Tel Aviv, Israel.

Aitor Blanco-Miguez (A)

Department of Computational, Cellular and Integrative Biology, University of Trento, Trento, Italy.

Moreno Zolfo (M)

Department of Computational, Cellular and Integrative Biology, University of Trento, Trento, Italy.

Francesco Asnicar (F)

Department of Computational, Cellular and Integrative Biology, University of Trento, Trento, Italy.

Antonio Casini (A)

Alia Therapeutics, Trento, Italy.

Anna Cereseto (A)

Department of Computational, Cellular and Integrative Biology, University of Trento, Trento, Italy. anna.cereseto@unitn.it.

Nicola Segata (N)

Department of Computational, Cellular and Integrative Biology, University of Trento, Trento, Italy. nicola.segata@unitn.it.

Articles similaires

Coal Metagenome Phylogeny Bacteria Genome, Bacterial
Prader-Willi Syndrome Humans Angelman Syndrome CRISPR-Cas Systems Human Embryonic Stem Cells
Gene Editing Climate Change Africa South of the Sahara Crops, Agricultural Agriculture
Animals Humans CRISPR-Cas Systems Mice Vascular Endothelial Growth Factor A

Classifications MeSH