Classification and phylogeny for the annotation of novel eukaryotic GNAT acetyltransferases.


Journal

PLoS computational biology
ISSN: 1553-7358
Titre abrégé: PLoS Comput Biol
Pays: United States
ID NLM: 101238922

Informations de publication

Date de publication:
12 2020
Historique:
received: 21 05 2020
accepted: 16 10 2020
revised: 07 01 2021
pubmed: 29 12 2020
medline: 20 2 2021
entrez: 28 12 2020
Statut: epublish

Résumé

The enzymes of the GCN5-related N-acetyltransferase (GNAT) superfamily count more than 870 000 members through all kingdoms of life and share the same structural fold. GNAT enzymes transfer an acyl moiety from acyl coenzyme A to a wide range of substrates including aminoglycosides, serotonin, glucosamine-6-phosphate, protein N-termini and lysine residues of histones and other proteins. The GNAT subtype of protein N-terminal acetyltransferases (NATs) alone targets a majority of all eukaryotic proteins stressing the omnipresence of the GNAT enzymes. Despite the highly conserved GNAT fold, sequence similarity is quite low between members of this superfamily even when substrates are similar. Furthermore, this superfamily is phylogenetically not well characterized. Thus functional annotation based on sequence similarity is unreliable and strongly hampered for thousands of GNAT members that remain biochemically uncharacterized. Here we used sequence similarity networks to map the sequence space and propose a new classification for eukaryotic GNAT acetyltransferases. Using the new classification, we built a phylogenetic tree, representing the entire GNAT acetyltransferase superfamily. Our results show that protein NATs have evolved more than once on the GNAT acetylation scaffold. We use our classification to predict the function of uncharacterized sequences and verify by in vitro protein assays that two fungal genes encode NAT enzymes targeting specific protein N-terminal sequences, showing that even slight changes on the GNAT fold can lead to change in substrate specificity. In addition to providing a new map of the relationship between eukaryotic acetyltransferases the classification proposed constitutes a tool to improve functional annotation of GNAT acetyltransferases.

Identifiants

pubmed: 33362253
doi: 10.1371/journal.pcbi.1007988
pii: PCOMPBIOL-D-20-00852
pmc: PMC7790372
doi:

Substances chimiques

Acetyltransferases EC 2.3.1.-

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

e1007988

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Références

J Biol Chem. 2001 Jun 8;276(23):20154-9
pubmed: 11274203
Mol Cell Biol. 2009 Jul;29(13):3569-81
pubmed: 19398576
Mol Syst Biol. 2020 Jul;16(7):e9464
pubmed: 32633465
Hum Mol Genet. 2019 Sep 1;28(17):2900-2919
pubmed: 31127942
J Biol Chem. 2011 Oct 21;286(42):37002-10
pubmed: 21900231
Syst Biol. 2000 Jun;49(2):225-32
pubmed: 12118406
EMBO J. 1992 Jun;11(6):2087-93
pubmed: 1600941
EMBO J. 1999 Nov 1;18(21):6155-68
pubmed: 10545125
Hum Mol Genet. 2015 Apr 1;24(7):1956-76
pubmed: 25489052
Trends Biochem Sci. 2016 Sep;41(9):746-760
pubmed: 27498224
Bioinformatics. 1998;14(1):48-54
pubmed: 9520501
Nucleic Acids Res. 2019 Jan 8;47(D1):D427-D432
pubmed: 30357350
Annu Rev Biophys Biomol Struct. 2000;29:81-103
pubmed: 10940244
Structure. 2016 Jul 6;24(7):1044-56
pubmed: 27320834
Molecules. 2020 Jan 29;25(3):
pubmed: 32013195
J Comput Biol. 2000;7(6):761-76
pubmed: 11382360
Plant Physiol. 2020 Aug;183(4):1838-1854
pubmed: 32457093
Structure. 2017 Apr 4;25(4):641-649.e3
pubmed: 28380339
Nat Commun. 2015 Jul 17;6:7640
pubmed: 26184543
Nucleic Acids Res. 2016 Jul 8;44(W1):W351-5
pubmed: 27131377
Nature. 1998 Jun 4;393(6684):440-2
pubmed: 9623998
J Biol Chem. 2011 Oct 14;286(41):36132-41
pubmed: 21865168
PLoS Genet. 2011 Jul;7(7):e1002169
pubmed: 21750686
Biochim Biophys Acta. 2015 Aug;1854(8):1019-37
pubmed: 25900361
Carbohydr Res. 2019 Jan 15;472:72-75
pubmed: 30500476
EMBO J. 1989 Jul;8(7):2067-75
pubmed: 2551674
Proc Natl Acad Sci U S A. 2009 May 19;106(20):8157-62
pubmed: 19420222
J Mol Biol. 2002 Apr 26;318(2):595-608
pubmed: 12051862
Mol Syst Biol. 2011 Oct 11;7:539
pubmed: 21988835
J Biol Chem. 2001 Jun 29;276(26):24097-107
pubmed: 11313340
Brief Bioinform. 2002 Sep;3(3):265-74
pubmed: 12230035
Nat Commun. 2014 Jul 15;5:4383
pubmed: 25023910
Proteomics. 2015 Jul;15(14):2426-35
pubmed: 25951519
Mol Cell Proteomics. 2011 May;10(5):M110.004580
pubmed: 21383206
Biochim Biophys Acta. 2009 Jan;1789(1):58-68
pubmed: 18722564
J Biol Chem. 2014 Feb 7;289(6):3652-65
pubmed: 24338013
Mol Cell. 2011 Oct 7;44(1):39-50
pubmed: 21981917
Mol Biol Evol. 1995 Jul;12(4):546-57
pubmed: 7659011
Bioinformatics. 2014 May 1;30(9):1312-3
pubmed: 24451623
J Mol Biol. 1999 Dec 17;294(5):1311-25
pubmed: 10600387
J Mol Biol. 2003 Oct 31;333(4):863-82
pubmed: 14568541
Mol Biol Evol. 2008 Jul;25(7):1307-20
pubmed: 18367465
Nat Methods. 2012 Mar 18;9(5):471-2
pubmed: 22426491
J Biol Chem. 2009 Nov 6;284(45):31122-9
pubmed: 19744929
Bioinformatics. 2001 Mar;17(3):282-3
pubmed: 11294794
Nucleic Acids Res. 2016 Jan 4;44(D1):D7-19
pubmed: 26615191
Proc Natl Acad Sci U S A. 2018 Apr 24;115(17):4405-4410
pubmed: 29581307
Exp Mol Med. 2018 Jul 27;50(7):1-13
pubmed: 30054468
Trends Biochem Sci. 2012 Apr;37(4):152-61
pubmed: 22405572
Sci Rep. 2016 Aug 23;6:31425
pubmed: 27550639
Cell. 2001 Apr 20;105(2):257-67
pubmed: 11336675
Biopolymers. 2013 Feb;99(2):98-111
pubmed: 23175385
Bioinformatics. 2008 Dec 1;24(23):2780-1
pubmed: 18818215
Bioinformatics. 2019 Dec 15;35(24):5326-5327
pubmed: 31263867
Biochem J. 2005 Mar 15;386(Pt 3):433-43
pubmed: 15496142
Proc Int Conf Intell Syst Mol Biol. 1994;2:28-36
pubmed: 7584402
FEBS J. 2018 Sep;285(17):3299-3316
pubmed: 30028079
Nucleic Acids Res. 2013 Jan;41(Database issue):D344-7
pubmed: 23161676
Nat Cell Biol. 2004 May;6(5):405-13
pubmed: 15077113
J Biol Chem. 1992 Oct 5;267(28):20277-81
pubmed: 1400344
PLoS Biol. 2009 Nov;7(11):e1000236
pubmed: 19885390
J Proteomics. 2011 Apr 1;74(4):431-41
pubmed: 21184851
Plant Physiol. 2020 Aug;183(4):1502-1516
pubmed: 32461302
J Biol Chem. 2016 Mar 4;291(10):5270-7
pubmed: 26755727
Gene. 2006 Apr 26;371(2):291-5
pubmed: 16507339
Oncogene. 2013 Jan 17;32(3):269-76
pubmed: 22391571
Proteins. 2000 Oct 1;41(1):98-107
pubmed: 10944397
Proteomics. 2015 Jul;15(14):2436-46
pubmed: 25886145
Genome Res. 2003 Nov;13(11):2498-504
pubmed: 14597658
Structure. 2015 Feb 3;23(2):332-41
pubmed: 25619998
J Biol Chem. 2017 Apr 21;292(16):6821-6837
pubmed: 28196861
PLoS One. 2011;6(9):e24713
pubmed: 21935442
J Mol Evol. 1994 Sep;39(3):306-14
pubmed: 7932792
Apoptosis. 2016 Mar;21(3):298-311
pubmed: 26666750
Nat Commun. 2017 Jun 06;8:15726
pubmed: 28585574
Mol Biol Evol. 2001 May;18(5):691-9
pubmed: 11319253
Sci Rep. 2016 Feb 10;6:21304
pubmed: 26861501
Nucleic Acids Res. 2017 Jan 4;45(D1):D289-D295
pubmed: 27899584
Mol Cell Proteomics. 2012 Jun;11(6):M111.015131
pubmed: 22223895
J Med Genet. 2014 Mar;51(3):185-96
pubmed: 24431331
Int J Cancer. 2010 Dec 15;127(12):2777-89
pubmed: 21351257
Bioinformatics. 1998;14(9):755-63
pubmed: 9918945
Bioinformatics. 2011 Apr 15;27(8):1164-5
pubmed: 21335321
Arch Biochem Biophys. 2005 Jan 1;433(1):212-26
pubmed: 15581578
Am J Hum Genet. 2018 May 3;102(5):985-994
pubmed: 29656860
Cell Rep. 2015 Mar 3;10(8):1362-74
pubmed: 25732826
Proc Natl Acad Sci U S A. 2013 Sep 3;110(36):14652-7
pubmed: 23959863
Plant Physiol. 2020 Feb;182(2):792-806
pubmed: 31744933
Bioinformatics. 2010 Mar 1;26(5):680-2
pubmed: 20053844
J Biol Chem. 2001 Dec 14;276(50):47239-47
pubmed: 11559708
PLoS Biol. 2011 May;9(5):e1001073
pubmed: 21655302
Proc Natl Acad Sci U S A. 2012 Jul 31;109(31):12449-54
pubmed: 22814378
Mol Biol Evol. 2005 Feb;22(2):193-9
pubmed: 15483331
Brief Bioinform. 2019 Jul 19;20(4):1160-1166
pubmed: 28968734
J Biol Chem. 2005 Apr 29;280(17):16659-64
pubmed: 15723835
Protein Sci. 2012 Jul;21(7):911-7
pubmed: 22573613
Structure. 2019 Jul 2;27(7):1057-1070.e4
pubmed: 31155310
Nat Struct Mol Biol. 2013 Sep;20(9):1098-105
pubmed: 23912279
J Exp Bot. 2018 Aug 31;69(19):4555-4568
pubmed: 29945174
Mol Cell. 2019 Mar 21;73(6):1097-1114
pubmed: 30878283
Cell Mol Life Sci. 2011 Apr;68(7):1147-56
pubmed: 21132344
Gene. 2005 Dec 19;363:15-23
pubmed: 16289629
Comput Struct Biotechnol J. 2020 Mar 03;18:532-547
pubmed: 32206212
FEBS Lett. 2008 Sep 3;582(20):2973-8
pubmed: 18675810
Mol Cell Biol. 2003 Oct;23(20):7403-14
pubmed: 14517307
Plant Cell. 2003 Aug;15(8):1817-32
pubmed: 12897255
J Biol Chem. 2003 Oct 3;278(40):38109-12
pubmed: 12915400
Methods Mol Biol. 2017;1574:9-15
pubmed: 28315240
Biochim Biophys Acta. 2016 Oct;1864(10):1372-401
pubmed: 27296530
J Biol Chem. 1996 Aug 2;271(31):18920-4
pubmed: 8702554
Proc Natl Acad Sci U S A. 2018 Apr 24;115(17):4399-4404
pubmed: 29581253
Science. 2010 Feb 19;327(5968):973-7
pubmed: 20110468
J Comput Biol. 2010 Mar;17(3):337-54
pubmed: 20377449
Nucleic Acids Res. 2009 Jul;37(Web Server issue):W202-8
pubmed: 19458158
Mol Cell Proteomics. 2016 Nov;15(11):3361-3372
pubmed: 27694331
Comput Appl Biosci. 1992 Jun;8(3):275-82
pubmed: 1633570
Mol Cell Proteomics. 2010 May;9(5):928-39
pubmed: 20061308
Nature. 2018 Apr;556(7702):452-456
pubmed: 29670290
Plant Cell. 2015 May;27(5):1547-62
pubmed: 25966763
Sci Adv. 2020 Apr 08;6(15):eaay8793
pubmed: 32284999

Auteurs

Bojan Krtenic (B)

Department of Biological Sciences, University of Bergen, Norway.
Computational Biology Unit, Department of Informatics, University of Bergen, Norway.

Adrian Drazic (A)

Department of Biomedicine, University of Bergen, Norway.

Thomas Arnesen (T)

Department of Biological Sciences, University of Bergen, Norway.
Department of Biomedicine, University of Bergen, Norway.
Department of Surgery, Haukeland University Hospital, Norway.

Nathalie Reuter (N)

Computational Biology Unit, Department of Informatics, University of Bergen, Norway.
Department of Chemistry, University of Bergen, Norway.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Animals Hemiptera Insect Proteins Phylogeny Insecticides
Amaryllidaceae Alkaloids Lycoris NADPH-Ferrihemoprotein Reductase Gene Expression Regulation, Plant Plant Proteins
Drought Resistance Gene Expression Profiling Gene Expression Regulation, Plant Gossypium Multigene Family

Classifications MeSH