Graph Classification of Molecules Using Force Field Atom and Bond Types.


Journal

Molecular informatics
ISSN: 1868-1751
Titre abrégé: Mol Inform
Pays: Germany
ID NLM: 101529315

Informations de publication

Date de publication:
01 2020
Historique:
received: 19 11 2018
accepted: 23 09 2019
pubmed: 8 10 2019
medline: 22 12 2020
entrez: 8 10 2019
Statut: ppublish

Résumé

Classification of the biological activities of chemical substances is important for developing new medicines efficiently. Various machine learning methods are often employed to screen large libraries of compounds and predict the activities of new substances by training the molecular structure-activity relationships. One such method is graph classification, in which a molecular structure can be represented in terms of a labeled graph with nodes that correspond to atoms and edges that correspond to the bonds between these atoms. In a conventional graph definition, atomic symbols and bond orders are employed as node and edge labels, respectively. In this study, we developed new graph definitions using the assignment of atom and bond types in the force fields of molecular dynamics methods as node and edge labels, respectively. We found that these graph definitions improved the accuracies of activity classifications for chemical substances using graph kernels with support vector machines and deep neural networks. The higher accuracies obtained using our proposed definitions can enhance the development of the materials informatics using graph-based machine learning methods.

Identifiants

pubmed: 31589809
doi: 10.1002/minf.201800155
doi:

Substances chimiques

Azepines 0
Benzenesulfonates 0

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e1800155

Informations de copyright

© 2019 Wiley-VCH Verlag GmbH & Co. KGaA, Weinheim.

Références

A. Cherkasov, E. N. Muratov, D. Fourches, A. Varnek, I. I. Baskin, M. Cronin, J. Dearden, P. Gramatica, Y. C. Martin, R. Todeschini, J. Med. Chem. 2014, 57, 4977-5010.
A. Tropsha, Mol. Inf. 2010, 29, 476-488.
R. D. Cramer. III, J. D. Bunce, D. E. Patterson, I. E. Frank, Mol. Inf. 1998, 7, 18-25.
M. Karelson, V. S. Lobanov, A. R. Katritzky, Chem. Rev. 1996, 96, 1027-1044.
V. Svetnik, A. Liaw, C. Tong, J. C. Culberson, R. P. Sheridan, B. P. Feuston, J. Chem. Inf. Comput. Sci. 2003, 43, 1947-1958.
T. Puzyn, B. Rasulev, A. Gajewicz, X. Hu, T. P. Dasari, A. Michalkova, H. Hwang, A. Toropov, D. Leszczynska, Nat. Nanotechnol. 2011, 6, 175-178.
W. Muster, A. Breidenbach, H. Fischer, S. Kirchner, Drug Discovery Today 2008, 13, 303-310.
A. Vedani, M. Dobler, M. A. Lill, Basic Clin. Pharmacol. Toxicol. 2006, 99, 195-208.
D. Fourches, E. Muratov, A. Tropsha, J. Chem. Inf. Model. 2010, 50, 1189-1204.
B. Chen, D. Wild, R. Guha, J. Chem. Inf. Model. 2009, 49, 2044-2055.
D. Rogers, M. Hahn, J. Chem. Inf. Model. 2010, 50, 742-754.
C. Steinbeck, C. Hoppe, S. Kuhn, M. Floris, R. Guha, E. Willighagen, Curr. Pharm. Des. 2006, 12, 2111-2120.
J. L. Durant, B. A. Leland, D. R. Henry, J. G. Nourse, J. Chem. Inf. Comput. Sci. 2002, 42, 1273-1280.
Z. Q. Chen, S. Yamamoto, M. Maekawa, a. Kawasuso, X. L. Yuan, T. Sekiguchi, J. Appl. Phys. 2003, 94, 4807.
L. H. Hall, J. Chem. Inf. Comput. Sci. 1995, 35, 1039-1045.
N. Shervashidze, P. Schweitzer, E. Jan van Leeuwen, K. Mehlhorn, K. M. Borgwardt, J. Mach. Learn. Res. 2011, 12, 2539-2561.
K. M. Borgwardt, H. P. Kriegel, in Proc. - IEEE Int. Conf. Data Min., 2005, pp. 74-81.
P. Mahe, N. Ueda, T. Akutsu, J.-L. Perret, J.-P. Vert, J. Chem. Inf. Model. 2005, 45, 939-951.
D. Kimura, H. Kashima, in Proc. 29th Int. Conference Mach. Learn. 2012, pp. 393-400.
H. Yamashita, T. Higuchi, R. Yoshida, J. Chem. Inf. Model. 2014, 54, 1289-1300.
N. Shervashidze, S. V. N. Vishwanathan, T. H. Petri, K. Mehlhorn, K. M. Borgwardt, in Proc. Twelfth Int. Conf. Artif. Intell. Stat., 2009, pp. 488-495.
T. Horváth, T. Gärtner, S. Wrobel, Proc. 2004 ACM SIGKDD Int. Conf. Knowl. Discov. data Min. - KDD'04 2004, 158.
A. Kashima, H. Tsuda, K. Inokuchi, Proc. 20th Int. Conf. Mach. Learn. 2003, 321-328.
T. Gärtner, P. Flach, S. Wrobel, Learn. Theory Kernel Mach. 2003, 129-143.
S. Hido, H. Kashima, Proc. - IEEE Int. Conf. Data Min. 2009, 179-188.
L. Ralaivola, S. J. Swamidass, H. Saigo, P. Baldi, Neural Networks 2005, 18, 1093-1110.
P. Mahé, J. P. Vert, Mach. Learn. 2009, 75, 3-35.
P. Mahé, N. Ueda, T. Akutsu, J.-L. Perret, J.-P. Vert, in Proc. Twenty-First Int. Conf. Mach. Learn. 2004, pp. 552-559.
M. Niepert, M. Ahmed, K. Kutzkov, Proc. 33rd Int. Conf. Mach. Learn. 2016, 2014-2023.
P. Yanardag, S. V. N. Vishwanathan, in Proc. 21th ACM SIGKDD Int. Conf. Knowl. Discov. Data Min. - KDD'15, 2015, pp. 1365-1374.
J. Zhang, J. Huan, Int. J. Comput. Biosci. 2010, 1, 13-21.
K. Maruhashi, M. Todoriki, T. Ohwa, K. Goto, Y. Hasegawa, H. Inakoshi, H. Anai, Proc. 32nd AAAI Conf. Artif. Intell. 2018, 3770-3777.
D. H. Smith, R. E. Carhart, R. Venkataraghavan, J. Chem. Inf. Comput. Sci. 1985, 25, 64-73.
J. J. Huuskonen, D. J. Livingstone, I. V. Tetko, J. Chem. Inf. Comput. Sci. 2000, 40, 947-955.
Y. Duan, C. Wu, S. Chowdhury, M. C. Lee, G. Xiong, W. Zhang, R. Yang, P. Cieplak, R. Luo, T. Lee, J. Comput. Chem. 2003, 24, 1999-2012.
J. Wang, R. M. Wolf, J. W. Caldwell, P. A. Kollman, D. A. Case, J. Comput. Chem. 2004, 25, 1157-74.
W. D. Cornell, P. Cieplak, C. I. Bayly, I. R. Gould, K. M. Merz, D. M. Ferguson, D. C. Spellmeyer, T. Fox, J. W. Caldwell, P. A. Kollman, J. Am. Chem. Soc. 1995, 117, 5179-5197.
The PubChem Project.; https://pubchem.ncbi.nlm.nih.gov/.
P. B. Jayaraj, M. K. Ajay, M. Nufail, G. Gopakumar, U. C. A. Jaleel, J. Cheminform. 2016, 8, 1-10.
Q. Li, Y. Wang, S. H. Bryant, Bioinformatics 2009, 25, 3310-3316.
A. Jakalian, D. B. Jack, C. I. Bayly, J. Comput. Chem. 2002, 23, 1623-1641.
The Amber Home Page, http://ambermd.org/.
F. P. Such, S. Member, S. Sah, S. Member, M. A. Dominguez, S. Member, S. Pillai, C. Zhang, A. Michael, N. D. Cahill, IEEE J. Sel. Top. Signal Process. 2017, 11, 884-896.
X. Glorot, A. Bordes, Y. Bengio, AISTATS'11 Proc. 14th Int. Conf. Artif. Intell. Stat. 2011, 15, 315-323.
N. Srivastava, G. Hinton, A. Krizhevsky, I. Sutskever, R. Salakhutdinov, J. Mach. Learn. Res. 2014, 15, 1929-1958.
G. Sergey Ioffe, G. Christian Szegedy, Icml 2015, 22, 137-141.

Auteurs

Hideyuki Jippo (H)

Digital Annealer Unit, Fujitsu Laboratories Ltd., 10-1 Morinosato-Wakamiya, Atsugi, Kanagawa, 243-0197, Japan.

Tatsuru Matsuo (T)

Artificial Intelligence Laboratory, Fujitsu Laboratories Ltd., 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki, Kanagawa, 211-8588, Japan.

Ryota Kikuchi (R)

Artificial Intelligence Laboratory, Fujitsu Laboratories Ltd., 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki, Kanagawa, 211-8588, Japan.

Daisuke Fukuda (D)

Artificial Intelligence Laboratory, Fujitsu Laboratories Ltd., 4-1-1 Kamikodanaka, Nakahara-ku, Kawasaki, Kanagawa, 211-8588, Japan.

Azuma Matsuura (A)

Digital Annealer Unit, Fujitsu Laboratories Ltd., 10-1 Morinosato-Wakamiya, Atsugi, Kanagawa, 243-0197, Japan.

Mari Ohfuchi (M)

Digital Annealer Unit, Fujitsu Laboratories Ltd., 10-1 Morinosato-Wakamiya, Atsugi, Kanagawa, 243-0197, Japan.

Articles similaires

Photosynthesis Ribulose-Bisphosphate Carboxylase Carbon Dioxide Molecular Dynamics Simulation Cyanobacteria

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Understanding the role of machine learning in predicting progression of osteoarthritis.

Simone Castagno, Benjamin Gompels, Estelle Strangmark et al.
1.00
Humans Disease Progression Machine Learning Osteoarthritis
Alzheimer Disease Humans Regression Analysis Quantitative Structure-Activity Relationship Drug Design

Classifications MeSH