Benford's Law and distributions for better drug design.
AI
Distribution
ML
bias
drug design
machine learning
Journal
Expert opinion on drug discovery
ISSN: 1746-045X
Titre abrégé: Expert Opin Drug Discov
Pays: England
ID NLM: 101295755
Informations de publication
Date de publication:
03 Nov 2023
03 Nov 2023
Historique:
medline:
3
11
2023
pubmed:
3
11
2023
entrez:
3
11
2023
Statut:
aheadofprint
Résumé
Modern drug discovery incorporates various tools and data, heralding the beginning of the data-driven drug design (DD) era. The distributions of chemical and physical data used for Artificial Intelligence (AI)/Machine Learning (ML) and to drive DD have thus become highly important to be understood and used effectively. The authors perform a comprehensive exploration of the statistical distributions driving the data-intensive era of drug discovery, including Benford's Law in AI/ML-based DD. As the relevance of data-driven discovery escalates, we anticipate meticulous scrutiny of datasets utilizing principles like Benford's Law to enhance data integrity and guide efficient resource allocation and experimental planning. In this data-driven era of the pharmaceutical and medical industries, addressing critical aspects such as bias mitigation, algorithm effectiveness, data stewardship, effects, and fraud prevention are essential. Harnessing Benford's Law and other distributions and statistical tests in DD provides a potent strategy to detect data anomalies, fill data gaps, and enhance dataset quality. Benford's Law is a fast method for data integrity and quality of datasets, the backbone of AI/ML and other modeling approaches, proving very useful in the design process.
Identifiants
pubmed: 37921672
doi: 10.1080/17460441.2023.2277342
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM