iACP-GE: accurate identification of anticancer peptides by using gradient boosting decision tree and extra tree.


Journal

SAR and QSAR in environmental research
ISSN: 1029-046X
Titre abrégé: SAR QSAR Environ Res
Pays: England
ID NLM: 9440156

Informations de publication

Date de publication:
Jan 2023
Historique:
pubmed: 24 12 2022
medline: 16 2 2023
entrez: 23 12 2022
Statut: ppublish

Résumé

Cancer is one of the main diseases threatening human life, accounting for millions of deaths around the world each year. Traditional physical and chemical methods for cancer treatment are extremely time-consuming, lab-intensive, expensive, inefficient and difficult to be applied in a high-throughput way. Hence, it is an urgent task to develop automated computational methods to enable fast and accurate identification of anticancer peptides (ACPs). In this paper, we develop a novel model named iACP-GE to identify ACPs. Multi-features are extracted by using binary encoding, enhanced grouped amino acid composition and BLOSUM62 encoding based on the N5C5 sequence, as well as detrended forward moving-average auto-cross correlation analysis based on physicochemical properties of 20 natural amino acids. Thus, 835 features are obtained for each sample, in order to avoid information redundancy, gradient boosting decision tree was adopted as the feature selection strategy. Then, the optimal feature subset is input to the extra tree classifier. The accuracies of ACP740 and ACP240 datasets with the 5-fold cross-validation were 90.54% and 91.25%, respectively. Experimental results indicate that iACP-GE significantly outperforms several existing models on ACP740 and ACP240 datasets and can be used as an effective tool for the identification of ACPs. The datasets and source codes for iACP-GE are available at https://github.com/yunyunliang88/iACP-GE.

Identifiants

pubmed: 36562289
doi: 10.1080/1062936X.2022.2160011
doi:

Substances chimiques

Peptides 0
Amino Acids 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

1-19

Auteurs

Y Liang (Y)

School of Science, Xi'an Polytechnic University, Xi'an, P. R. China.

X Ma (X)

School of Science, Xi'an Polytechnic University, Xi'an, P. R. China.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH