A novel method to guide biomarker combinations to optimize the sensitivity.


Journal

bioRxiv : the preprint server for biology
Titre abrégé: bioRxiv
Pays: United States
ID NLM: 101680187

Informations de publication

Date de publication:
15 Apr 2024
Historique:
medline: 25 4 2024
pubmed: 25 4 2024
entrez: 25 4 2024
Statut: epublish

Résumé

Logistic regression has demonstrated its utility in classifying binary labeled datasets through the maximum likelihood approach. However, in numerous biological and clinical contexts, the aim is often to determine coefficients that yield the highest sensitivity at the pre-specified specificity or vice versa. Therefore, the application of logistic regression is limited in such settings. To this end, we have developed an improved regression framework, SMAGS, for binary classification that, for a given specificity, finds the linear decision rule that yields the maximum sensitivity. Furthermore, we employed the method for feature selection to find the features that are satisfying the sensitivity maximization goal. We compared our method with normal logistic regression by applying it to real clinical data as well as synthetic data. In the real application data (colorectal cancer dataset), we found 14% improvement of sensitivity at 98.5% specificity. Software is made available in Python ( https://github.com/smahmoodghasemi/SMAGS ).

Identifiants

pubmed: 38659773
doi: 10.1101/2024.04.12.589302
pmc: PMC11042214
pii:
doi:

Types de publication

Preprint

Langues

eng

Auteurs

Classifications MeSH