MehNet: a vigesimal-based model by amino acid melting points generates unique ID numbers for protein sequences.
MehNet
melting points
protein ID numbers
vigesimal digits
Journal
Journal of biomolecular structure & dynamics
ISSN: 1538-0254
Titre abrégé: J Biomol Struct Dyn
Pays: England
ID NLM: 8404176
Informations de publication
Date de publication:
17 Jan 2024
17 Jan 2024
Historique:
medline:
17
1
2024
pubmed:
17
1
2024
entrez:
17
1
2024
Statut:
aheadofprint
Résumé
The amino acid encoding plays a pivotal role in machine learning-based methods for predicting protein structure and function, as well as in protein mapping techniques. Additionally, the classification of protein sequences presents its own challenges. The current study aims to assign a constant value to each amino acid, thereby creating distinctions among protein sequences. The datasets used in this study were obtained from the UniProt Knowledgebase. Subsequently, these datasets underwent preprocessing steps, and identical sequences were categorized under the same headings. Each amino acid was ranked based on its respective melting point and was assigned a vigesimal digit. These generated vigesimal digits were subsequently converted to decimal values. The centerpiece of this methodology was the
Identifiants
pubmed: 38230442
doi: 10.1080/07391102.2024.2302937
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM