Convex hull principle for classification and phylogeny of eukaryotic proteins.

Classification Convex hull principle Human proteins Natural vector Phylogenetic analysis Protein kinases

Journal

Genomics
ISSN: 1089-8646
Titre abrégé: Genomics
Pays: United States
ID NLM: 8800135

Informations de publication

Date de publication:
12 2019
Historique:
received: 02 09 2018
revised: 25 11 2018
accepted: 30 11 2018
pubmed: 12 12 2018
medline: 22 4 2020
entrez: 12 12 2018
Statut: ppublish

Résumé

This study quantitatively validates the principle that the biological properties associated with a given genotype are determined by the distribution of amino acids. In order to visualize this central law of molecular biology, each protein was represented by a point in 250-dimensional space based on its amino acid distribution. Proteins from the same family are found to cluster together, leading to the principle that the convex hull surrounding protein points from the same family do not intersect with the convex hulls of other protein families. This principle was verified computationally for all available and reliable protein kinases and human proteins. In addition, we generated 2,328,761 figures to show that the convex hulls of different families were disjoint from each other. The classification performs well with high and robust accuracy (95.75% and 97.5%) together with reasonable phylogenetic trees validate our methods further.

Identifiants

pubmed: 30529533
pii: S0888-7543(18)30518-4
doi: 10.1016/j.ygeno.2018.11.033
pii:
doi:

Substances chimiques

Protein Kinases EC 2.7.-

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

1777-1784

Informations de copyright

Copyright © 2018 Elsevier Inc. All rights reserved.

Auteurs

Xin Zhao (X)

Department of Mathematical Sciences, Tsinghua University, Beijing 100084, PR China.

Kun Tian (K)

Department of Mathematical Sciences, Tsinghua University, Beijing 100084, PR China.

Rong L He (RL)

Department of Biological Sciences, Chicago State University, Chicago, IL 60628, USA.

Stephen S-T Yau (SS)

Department of Mathematical Sciences, Tsinghua University, Beijing 100084, PR China. Electronic address: yau@uic.edu.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH