Positions of cysteine residues reveal local clusters and hidden relationships to Sequons and Transmembrane domains in Human proteins.

Cysteine residues Disulfide bonds N-glycosylation Posttranslational modifications Protein sequence Protein structure and function Sequons Transmembrane domains

Journal

Scientific reports
ISSN: 2045-2322
Titre abrégé: Sci Rep
Pays: England
ID NLM: 101563288

Informations de publication

Date de publication:
29 10 2024
Historique:
received: 21 06 2024
accepted: 18 10 2024
medline: 29 10 2024
pubmed: 29 10 2024
entrez: 29 10 2024
Statut: epublish

Résumé

Membrane proteins often possess critical structural features, such as transmembrane domains (TMs), N-glycosylation, and disulfide bonds (SS bonds), which are essential to their structure and function. Here, we extend the study of the motifs carrying N-glycosylation, i.e. the sequons, and the Cys residues supporting the SS bonds, to the whole human proteome with a particular focus on the Cys positions in human proteins with respect to those of sequons and TMs. As the least abundant amino acid residue in protein sequences, the positions of Cys residues in proteins are not random but rather selected through evolution. We discovered that the frequency of Cys residues in proteins is length dependent, and the frequency of CC gaps formed between adjacent Cys residues can be used as a classifier to distinguish proteins with special structures and functions, such as keratin-associated proteins (KAPs), extracellular proteins with EGF-like domains, and nuclear proteins with zinc finger C2H2 domains. Most importantly, by comparing the positions of Cys residues to those of sequons and TMs, we discovered that these structural features can form dense clusters in highly repeated and mutually exclusive modalities in protein sequences. The evolutionary advantages of such complementarity among the three structural features are discussed, particularly in light of structural dynamics in proteins that are lacking from computational predictions. The discoveries made here highlight the sequence-structure-function axis in biological organisms that can be utilized in future protein engineering toward synthetic biology.

Identifiants

pubmed: 39468182
doi: 10.1038/s41598-024-77056-8
pii: 10.1038/s41598-024-77056-8
doi:

Substances chimiques

Cysteine K848JZ4886
Membrane Proteins 0
Proteome 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

25886

Subventions

Organisme : The National Sciences and Engineering Research Council of Canada
ID : RGPIN06073
Organisme : The National Sciences and Engineering Research Council of Canada
ID : RGPIN06073

Informations de copyright

© 2024. The Author(s).

Références

Bakshi, T., et al., Hidden Relationships between N-Glycosylation and Disulfide Bonds in Individual Proteins. Int J Mol Sci, 2022. 23(7).
Desai, M., et al., Discovery and Visualization of the Hidden Relationships among N-Glycosylation, Disulfide Bonds, and Membrane Topology. Int J Mol Sci, 2023. 24(22).
Petersen, M.T., P.H. Jonson, and S.B. Petersen, Amino acid neighbours and detailed conformational analysis of cysteines in proteins. Protein Eng, 1999. 12(7): p. 535 − 48.
Gupta, R. and S. Brunak, Prediction of glycosylation across the human proteome and the correlation to protein function. Pac Symp Biocomput, 2002: p. 310 − 22.
Pakhrin, S.C., et al., DeepNGlyPred: A Deep Neural Network-Based Approach for Human N-Linked Glycosylation Site Prediction. Molecules, 2021. 26(23).
Pitti, T., et al., N-GlyDE: a two-stage N-linked glycosylation site prediction incorporating gapped dipeptides and pattern-based encoding. Sci Rep, 2019. 9(1): p. 15975.
Eitner, K., et al., Statistical distribution of amino acid sequences: a proof of Darwinian evolution. Bioinformatics, 2010. 26(23): p. 2933–2935.
Miseta, A. and P. Csutora, Relationship between the occurrence of cysteine in proteins and the complexity of organisms. Mol Biol Evol, 2000. 17(8): p. 1232-9.
Doring, V. and P. Marliere, Reassigning cysteine in the genetic code of Escherichia coli. Genetics, 1998. 150(2): p. 543 − 51.
Marino, S.M. and V.N. Gladyshev, Cysteine function governs its conservation and degeneration and restricts its utilization on protein surfaces. J Mol Biol, 2010. 404(5): p. 902 − 16.
Gould, N.S., et al., Site-Specific Proteomic Mapping Identifies Selectively Modified Regulatory Cysteine Residues in Functionally Distinct Protein Networks. Chem Biol, 2015. 22(7): p. 965 − 75.
Bhopatkar, A.A., V.N. Uversky, and V. Rangachari, Disorder and cysteines in proteins: A design for orchestration of conformational see-saw and modulatory functions. Prog Mol Biol Transl Sci, 2020. 174: p. 331–373.
Williams, R.M., et al., The protein non-folding problem: amino acid determinants of intrinsic order and disorder. Pac Symp Biocomput, 2001: p. 89–100.
Schwarz, F. and M. Aebi, Mechanisms and principles of N-linked protein glycosylation. Curr Opin Struct Biol, 2011. 21(5): p. 576 − 82.
Ruiz-Canada, C., D.J. Kelleher, and R. Gilmore, Cotranslational and posttranslational N-glycosylation of polypeptides by distinct mammalian OST isoforms. Cell, 2009. 136(2): p. 272 − 83.
Desai, M., S.R. Chowdhury, and B. Sun, A quest for cytosolic sequons and their functions. Sci Rep, 2024. 14(1): p. 7736.
Vogt, G., et al., Gains of glycosylation comprise an unexpectedly large group of pathogenic mutations. Nat Genet, 2005. 37(7): p. 692–700.
Varki, A., Nothing in glycobiology makes sense, except in the light of evolution. Cell, 2006. 126(5): p. 841-5.
Feng, X. and P. Barth, A topological and conformational stability alphabet for multipass membrane proteins. Nature Chemical Biology, 2016. 12(3): p. 167–173.
White, S.H. and W.C. Wimley, MEMBRANE PROTEIN FOLDING AND STABILITY: Physical Principles. Annual Review of Biophysics and Biomolecular Structure, 1999. 28(1): p. 319–365.
Cui, J., et al., Darwinian selection for sites of Asn-linked glycosylation in phylogenetically disparate eukaryotes and viruses. Proc Natl Acad Sci U S A, 2009. 106(32): p. 13421-6.
Nurk, S., et al., The complete sequence of a human genome. Science, 2022. 376(6588): p. 44–53.
Adhikari, S., et al., A high-stringency blueprint of the human proteome. Nat Commun, 2020. 11(1): p. 5301.
Jumper, J., et al., Highly accurate protein structure prediction with AlphaFold. Nature, 2021. 596(7873): p. 583–589.
Tunyasuvunakool, K., et al., Highly accurate protein structure prediction for the human proteome. Nature, 2021. 596(7873): p. 590–596.
Krogh, A., et al., Deep TMHMM predicts alpha and beta transmembrane proteins using deep neural networks. biorxiv, 2022.
Sherman, B.T., et al., DAVID: a web server for functional enrichment analysis and functional annotation of gene lists (2021 update). Nucleic Acids Res, 2022. 50(W1): p. W216-W221.
Sidak, Z., Rectangular confidence regions for the means of multivariate normal distributions. J. Am. Stat. Assoc., 1967. 62: p. 626–633.
Benjamini, Y. and Y. Hochberg, Controlling the false discovery rate: A practical and powerful approach to multiple testing. J. Royal Stat. Soc. Series B. Stat. Methodology, 1995. 57: p. 289–300.
30. Saeed, A.I., et al., TM4: a free, open-source system for microarray data management and analysis. Biotechniques, 2003. 34(2): p. 374-8.
Devabhaktuni, A., et al., TagGraph reveals vast protein modification landscapes from large tandem mass spectrometry datasets. Nat Biotechnol, 2019. 37(4): p. 469–479.
Cuthbert, G.L., et al., Histone deimination antagonizes arginine methylation. Cell, 2004. 118(5): p. 545 − 53.
Wang, Y., et al., Histone hypercitrullination mediates chromatin decondensation and neutrophil extracellular trap formation. J Cell Biol, 2009. 184(2): p. 205 − 13.
Ramazi, S., A. Allahverdi, and J. Zahiri, Evaluation of post-translational modifications in histone proteins: A review on histone modification defects in developmental and neurological disorders. J Biosci, 2020. 45.
Dunker, A.K., et al., Intrinsically disordered protein. J Mol Graph Model, 2001. 19(1): p. 26–59.
Peng, Z., et al., A creature with a hundred waggly tails: intrinsically disordered proteins in the ribosome. Cell Mol Life Sci, 2014. 71(8): p. 1477 − 504.
Wright, P.E. and H.J. Dyson, Intrinsically disordered proteins in cellular signalling and regulation. Nat Rev Mol Cell Biol, 2015. 16(1): p. 18–29.
Garcia-Gimenez, J.L., et al., Histone h3 glutathionylation in proliferating mammalian cells destabilizes nucleosomal structure. Antioxid Redox Signal, 2013. 19(12): p. 1305-20.
Lin, X., et al., Role of cysteine amino acid residues on the RNA binding activity of human thymidylate synthase. Nucleic Acids Res, 2003. 31(16): p. 4882-7.
Stutzer, A., et al., Analysis of protein-DNA interactions in chromatin by UV induced cross-linking and mass spectrometry. Nat Commun, 2020. 11(1): p. 5250.
Muller, M.M. and T.W. Muir, Histones: at the crossroads of peptide and protein chemistry. Chem Rev, 2015. 115(6): p. 2296 − 349.
Laughrea, M., et al., Mistranslation in twelve Escherichia coli ribosomal proteins. Cysteine misincorporation at neutral amino acid residues other than tryptophan. Eur J Biochem, 1987. 169(1): p. 59–64.
Moosmann, B. and C. Behl, Mitochondrially encoded cysteine predicts animal lifespan. Aging Cell, 2008. 7(1): p. 32–46.
Schindeldecker, M., et al., Differential cysteine depletion in respiratory chain complexes enables the distinction of longevity from aerobicity. Mech Ageing Dev, 2011. 132(4): p. 171-9.
Schindeldecker, M. and B. Moosmann, Cysteine Is the Only Universally Affected and Disfavored Proteomic Amino Acid under Oxidative Conditions in Animals. Antioxidants (Basel), 2024. 13(3).
Andrews, C., et al., Structural Aspects and Prediction of Calmodulin-Binding Proteins. Int J Mol Sci, 2020. 22(1).
Tan, R.Y., Y. Mabuchi, and Z. Grabarek, Blocking the Ca2+-induced conformational transitions in calmodulin with disulfide bonds. J Biol Chem, 1996. 271(13): p. 7479-83.
Sepulveda, V., et al., SCO-spondin, a giant matricellular protein that regulates cerebrospinal fluid activity. Fluids Barriers CNS, 2021. 18(1): p. 45.
Meiniel, O. and A. Meiniel, The complex multidomain organization of SCO-spondin protein is highly conserved in mammals. Brain Res Rev, 2007. 53(2): p. 321-7.
Wheelan, S.J., A. Marchler-Bauer, and S.H. Bryant, Domain size distributions can predict domain boundaries. Bioinformatics, 2000. 16(7): p. 613-8.
Xu, D. and R. Nussinov, Favorable domain size in proteins. Folding and Design, 1997. 3: p. 11–17.
Parry, D.A., et al., Human hair keratin-associated proteins: sequence regularities and structural implications. J Struct Biol, 2006. 155(2): p. 361-9.
Klaassen, C.D., J. Liu, and S. Choudhuri, Metallothionein: an intracellular protein to protect against cadmium toxicity. Annu Rev Pharmacol Toxicol, 1999. 39: p. 267 − 94.
Richardson, J.S., et al., Broad Analysis of Vicinal Disulfides: Occurrences, Conformations with Cis or with Trans Peptides, and Functional Roles Including Sugar Binding. J Mol Biol, 2017. 429(9): p. 1321–1335.
Kolsek, K., C. Aponte-Santamaria, and F. Grater, Accessibility explains preferred thiol-disulfide isomerization in a protein domain. Sci Rep, 2017. 7(1): p. 9858.
Reddy, K.K.A., et al., Ligand-induced transition in conformations of vicinal cysteine disulfides in proteins. Proteins, 2021. 89(6): p. 599–613.
Fraser, R.D.B., et al., Disulphide bonding in alpha-keratin. 1987.
Thornton, J.M., Disulphide bridges in globular proteins. J Mol Biol, 1981. 151(2): p. 261 − 87.
Beeby, M., et al., The genomics of disulfide bonding and protein stabilization in thermophiles. PLoS Biol, 2005. 3(9): p. e309.
Mallick, P., et al., Genomic evidence that the intracellular proteins of archaeal microbes contain disulfide bonds. Proc Natl Acad Sci U S A, 2002. 99(15): p. 9679-84.
Tudos, E., et al., A word of caution about biological inference - Revisiting cysteine covalent state predictions. FEBS Open Bio, 2014. 4: p. 310-4.
Sun, B., et al., N-Glycoproteome of E14.Tg2a mouse embryonic stem cells. PLOS ONE, 2013. 8(2): p. e55722.
Englander, S.W. and L. Mayne, The nature of protein folding pathways. Proc Natl Acad Sci U S A, 2014. 111(45): p. 15873-80.
Dombkowski, A.A., K.Z. Sultana, and D.B. Craig, Protein disulfide engineering. FEBS Lett, 2014. 588(2): p. 206 − 12.
Pai, P.P. and S. Mondal, Intriguing cystine-knot miniproteins in drug design and therapeutics, in Toxins and drug discovery, L. Cruz and S. Luo, Editors. 2017, Springer: Dordrecht.
Moradi, S.V., et al., Glycosylation, an effective synthetic strategy to improve the bioavailability of therapeutic peptides. Chem Sci, 2016. 7(4): p. 2492–2500.
Seeberger, P.H., D.I. Freedberg, and R.D. Cummings, Glycans in biotechnology and the pharmaceutical industry, in Essentials of glycobiology, A. Varki, et al., Editors. 2022, Cold spring harbor laboratory press: Cold Spring Harbor.
Donato, R.K. and A. Mija, Keratin Associations with Synthetic, Biosynthetic and Natural Polymers: An Extensive Review. Polymers (Basel), 2019. 12(1).
Lazarus, B.S., et al., Engineering with keratin: A functional material and a source of bioinspiration. iScience, 2021. 24(8): p. 102798.
Lu, P., et al., Accurate computational design of multipass transmembrane proteins. Science, 2018. 359(6379): p. 1042–1046.
Teng, F., et al., Programmable synthetic receptors: the next-generation of cell and gene therapies. Signal Transduct Target Ther, 2024. 9(1): p. 7.

Auteurs

Manthan Desai (M)

Department of Molecular Biology and Biochemistry, Simon Fraser University, Burnaby, BC, Canada.
Department of Computing Science, Simon Fraser University, Burnaby, BC, Canada.

Bingyun Sun (B)

Department of Chemistry, Simon Fraser University, Burnaby, BC, Canada. bingyun_sun@sfu.ca.
Simon Fraser University, Burnaby, BC, V5A 1S6, Canada. bingyun_sun@sfu.ca.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH