Convex hull principle for classification and phylogeny of eukaryotic proteins.
Classification
Convex hull principle
Human proteins
Natural vector
Phylogenetic analysis
Protein kinases
Journal
Genomics
ISSN: 1089-8646
Titre abrégé: Genomics
Pays: United States
ID NLM: 8800135
Informations de publication
Date de publication:
12 2019
12 2019
Historique:
received:
02
09
2018
revised:
25
11
2018
accepted:
30
11
2018
pubmed:
12
12
2018
medline:
22
4
2020
entrez:
12
12
2018
Statut:
ppublish
Résumé
This study quantitatively validates the principle that the biological properties associated with a given genotype are determined by the distribution of amino acids. In order to visualize this central law of molecular biology, each protein was represented by a point in 250-dimensional space based on its amino acid distribution. Proteins from the same family are found to cluster together, leading to the principle that the convex hull surrounding protein points from the same family do not intersect with the convex hulls of other protein families. This principle was verified computationally for all available and reliable protein kinases and human proteins. In addition, we generated 2,328,761 figures to show that the convex hulls of different families were disjoint from each other. The classification performs well with high and robust accuracy (95.75% and 97.5%) together with reasonable phylogenetic trees validate our methods further.
Identifiants
pubmed: 30529533
pii: S0888-7543(18)30518-4
doi: 10.1016/j.ygeno.2018.11.033
pii:
doi:
Substances chimiques
Protein Kinases
EC 2.7.-
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
1777-1784Informations de copyright
Copyright © 2018 Elsevier Inc. All rights reserved.