Improving the robustness and accuracy of biomedical language models through adversarial training.

Language Natural Language Processing Neural Networks, Computer Reproducibility of Results

Adversarial attack Adversarial training Biomedical natural language processing Biomedical text Deep learning Robustness

Journal

Journal of biomedical informatics

ISSN: 1532-0480

Titre abrégé: J Biomed Inform

Pays: United States

ID NLM: 100970413

Informations de publication

Date de publication:
08 2022

Historique:

received: 03 11 2021

revised: 08 04 2022

accepted: 05 06 2022

pubmed: 19 6 2022

medline: 16 8 2022

entrez: 18 6 2022

Statut: ppublish

Résumé

Deep transformer neural network models have improved the predictive accuracy of intelligent text processing systems in the biomedical domain. They have obtained state-of-the-art performance scores on a wide variety of biomedical and clinical Natural Language Processing (NLP) benchmarks. However, the robustness and reliability of these models has been less explored so far. Neural NLP models can be easily fooled by adversarial samples, i.e. minor changes to input that preserve the meaning and understandability of the text but force the NLP system to make erroneous decisions. This raises serious concerns about the security and trust-worthiness of biomedical NLP systems, especially when they are intended to be deployed in real-world use cases. We investigated the robustness of several transformer neural language models, i.e. BioBERT, SciBERT, BioMed-RoBERTa, and Bio-ClinicalBERT, on a wide range of biomedical and clinical text processing tasks. We implemented various adversarial attack methods to test the NLP systems in different attack scenarios. Experimental results showed that the biomedical NLP models are sensitive to adversarial samples; their performance dropped in average by 21 and 18.9 absolute percent on character-level and word-level adversarial noise, respectively, on Micro-F1, Pearson Correlation, and Accuracy measures. Conducting extensive adversarial training experiments, we fine-tuned the NLP models on a mixture of clean samples and adversarial inputs. Results showed that adversarial training is an effective defense mechanism against adversarial noise; the models' robustness improved in average by 11.3 absolute percent. In addition, the models' performance on clean data increased in average by 2.4 absolute percent, demonstrating that adversarial training can boost generalization abilities of biomedical NLP systems. This study takes an important step towards revealing vulnerabilities of deep neural language models in biomedical NLP applications. It also provides practical and effective strategies to develop secure, trust-worthy, and accurate intelligent text processing systems in the biomedical domain.

Identifiants

DOI: 10.1016/j.jbi.2022.104114 PMID: 35717011

pubmed: 35717011

pii: S1532-0464(22)00130-7

doi: 10.1016/j.jbi.2022.104114

pii:

doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

Pagination

104114

Improving the robustness and accuracy of biomedical language models through adversarial training.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Pagination

Informations de copyright

Auteurs

Milad Moradi (M)

Matthias Samwald (M)

Articles similaires

Relative victimization scale: initial development and retrospective reports of the impact on mental health.

Cultural adaptation and validation of the Sinhala version of the spiritual needs assessment for patients (S-SNAP) questionnaire.

The Importance of Language and Messaging in Psychological Treatment for Functional Neurological Disorder in Children and Adolescents.

Development of the two-factor modified Kids Eating Disorder Survey (M-KEDS): a validation study with hispanic adolescents.

Classifications MeSH