Benford's Law and distributions for better drug design.

AI Distribution ML bias drug design machine learning

Journal

Expert opinion on drug discovery
ISSN: 1746-045X
Titre abrégé: Expert Opin Drug Discov
Pays: England
ID NLM: 101295755

Informations de publication

Date de publication:
03 Nov 2023
Historique:
medline: 3 11 2023
pubmed: 3 11 2023
entrez: 3 11 2023
Statut: aheadofprint

Résumé

Modern drug discovery incorporates various tools and data, heralding the beginning of the data-driven drug design (DD) era. The distributions of chemical and physical data used for Artificial Intelligence (AI)/Machine Learning (ML) and to drive DD have thus become highly important to be understood and used effectively. The authors perform a comprehensive exploration of the statistical distributions driving the data-intensive era of drug discovery, including Benford's Law in AI/ML-based DD. As the relevance of data-driven discovery escalates, we anticipate meticulous scrutiny of datasets utilizing principles like Benford's Law to enhance data integrity and guide efficient resource allocation and experimental planning. In this data-driven era of the pharmaceutical and medical industries, addressing critical aspects such as bias mitigation, algorithm effectiveness, data stewardship, effects, and fraud prevention are essential. Harnessing Benford's Law and other distributions and statistical tests in DD provides a potent strategy to detect data anomalies, fill data gaps, and enhance dataset quality. Benford's Law is a fast method for data integrity and quality of datasets, the backbone of AI/ML and other modeling approaches, proving very useful in the design process.

Identifiants

pubmed: 37921672
doi: 10.1080/17460441.2023.2277342
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

1-7

Auteurs

Alfonso T García-Sosa (AT)

Chair of Molecular Technology, Institute of Chemistry, University of Tartu, Tartu, Estonia.

Classifications MeSH