Biophysical characterization of high-confidence, small human proteins.

Amino Acids; Computational Biology; Genome, Human; Peptides; Proteins

Journal

Biophysical reports
ISSN: 2667-0747
Abbreviated title: Biophys Rep (N Y)
Country: United States
NLM ID: 9918266001106676

Publication information

Publication date:
21 Jun 2024
History:
received: 29 Jan 2024
revised: 09 Apr 2024
accepted: 20 Jun 2024
medline: 24 Jun 2024
pubmed: 24 Jun 2024
entrez: 23 Jun 2024
Status: aheadofprint

Abstract

Significant efforts have been made to characterize the biophysical properties of proteins. Small proteins have received less attention because their annotation has historically been less reliable. However, recent improvements in sequencing, proteomics, and bioinformatics techniques have led to the high-confidence annotation of small open reading frames (smORFs) that encode functional proteins, known as smORF-encoded proteins (SEPs). SEPs have been found to perform critical functions in several species, including humans. While significant efforts have been made to annotate SEPs, less attention has been given to their biophysical properties. We characterized the distributions of predicted and curated biophysical properties, including sequence composition, structure, localization, function, and disease association, for a conservative list of previously identified human SEPs. We found significant differences between SEPs and both larger proteins and control sets. Additionally, we provide an example of how our characterization of biophysical properties can help distinguish protein-coding smORFs from non-coding ones in otherwise ambiguous cases.
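
As an illustration of what a sequence-composition property can look like in practice, the following minimal Python sketch computes per-residue fractions and a hydrophobic fraction for a single sequence. It is not the authors' pipeline: the function names, the residue set treated as hydrophobic, and the example sequence are all assumptions chosen for illustration.

```python
from collections import Counter

# The 20 standard amino acids, one-letter codes.
AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"

# Illustrative hydrophobic set; which residues belong here is an assumption,
# and different hydrophobicity scales would give different sets.
HYDROPHOBIC = set("AVILMFWC")


def composition(seq):
    """Return the fraction of each standard amino acid in seq.

    Characters outside the 20 standard residues are ignored.
    """
    counts = Counter(seq.upper())
    total = sum(counts[aa] for aa in AMINO_ACIDS)
    if total == 0:
        raise ValueError("sequence contains no standard amino acids")
    return {aa: counts[aa] / total for aa in AMINO_ACIDS}


def hydrophobic_fraction(seq):
    """Return the summed fraction of residues in the HYDROPHOBIC set."""
    comp = composition(seq)
    return sum(comp[aa] for aa in HYDROPHOBIC)


if __name__ == "__main__":
    # Made-up 10-residue sequence, not a real SEP.
    example = "MKLLVAAGDE"
    print(composition(example)["L"])
    print(hydrophobic_fraction(example))
```

Computing such values for every sequence in two sets (for example, small proteins and larger proteins) yields distributions that can then be compared statistically.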

Identifiers

pubmed: 38909903
pii: S2667-0747(24)00026-0
doi: 10.1016/j.bpr.2024.100167

Publication types

Journal Article

Languages

eng

Citation subsets

IM

Pagination

100167

Copyright information

Copyright © 2024. Published by Elsevier Inc.

Authors

A M Whited (AM)

BioFrontiers Institute, University of Colorado, Boulder, CO, USA.

Irwin Jungreis (I)

Broad Institute of MIT and Harvard, Cambridge, MA, USA; MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, USA.

Jeffre Allen (J)

BioFrontiers Institute, University of Colorado, Boulder, CO, USA; Department of Biochemistry, University of Colorado Boulder, CO, USA.

Christina L Cleveland (CL)

BioFrontiers Institute, University of Colorado, Boulder, CO, USA.

Jonathan M Mudge (JM)

European Molecular Biology Laboratory, European Bioinformatics Institute, Wellcome Genome Campus, Hinxton, Cambridge, UK.

Manolis Kellis (M)

Broad Institute of MIT and Harvard, Cambridge, MA, USA; MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA, USA.

John L Rinn (JL)

BioFrontiers Institute, University of Colorado, Boulder, CO, USA; Department of Biochemistry, University of Colorado Boulder, CO, USA.

Loren E Hough (LE)

BioFrontiers Institute, University of Colorado, Boulder, CO, USA; Department of Physics, University of Colorado Boulder, CO, USA.

MeSH classifications