Global User-Level Perception of COVID-19 Contact Tracing Applications: Data-Driven Approach Using Natural Language Processing.

BERT COVID-19 NLP RoBerta contact tracing applications fastText sentiment analysis text classification transformers

Journal

JMIR formative research
ISSN: 2561-326X
Titre abrégé: JMIR Form Res
Pays: Canada
ID NLM: 101726394

Informations de publication

Date de publication:
11 May 2022
Historique:
received: 06 01 2022
accepted: 16 03 2022
revised: 06 02 2022
pubmed: 8 4 2022
medline: 8 4 2022
entrez: 7 4 2022
Statut: epublish

Résumé

Contact tracing has been globally adopted in the fight to control the infection rate of COVID-19. To this aim, several mobile apps have been developed. However, there are ever-growing concerns over the working mechanism and performance of these applications. The literature already provides some interesting exploratory studies on the community's response to the applications by analyzing information from different sources, such as news and users' reviews of the applications. However, to the best of our knowledge, there is no existing solution that automatically analyzes users' reviews and extracts the evoked sentiments. We believe such solutions combined with a user-friendly interface can be used as a rapid surveillance tool to monitor how effective an application is and to make immediate changes without going through an intense participatory design method. In this paper, we aim to analyze the efficacy of AI and NLP techniques for automatically extracting and classifying the polarity of users' sentiments by proposing a sentiment analysis framework to automatically analyze users' reviews on COVID-19 contact tracing mobile apps. We also aim to provide a large-scale annotated benchmark data set to facilitate future research in the domain. As a proof of concept, we also developed a web application based on the proposed solutions, which is expected to help the community quickly analyze the potential of an application in the domain. We propose a pipeline starting from manual annotation via a crowd-sourcing study and concluding with the development and training of artificial intelligence (AI) models for automatic sentiment analysis of users' reviews. In detail, we collected and annotated a large-scale data set of user reviews on COVID-19 contact tracing applications. We used both classical and deep learning methods for classification experiments. We used 8 different methods on 3 different tasks, achieving up to an average F1 score of 94.8%, indicating the feasibility of the proposed solution. The crowd-sourcing activity resulted in a large-scale benchmark data set composed of 34,534 manually annotated reviews. The existing literature mostly relies on the manual or exploratory analysis of users' reviews on applications, which is tedious and time-consuming. In existing studies, generally, data from fewer applications are analyzed. In this work, we showed that AI and natural language processing techniques provide good results for analyzing and classifying users' sentiments' polarity and that automatic sentiment analysis can help to analyze users' responses more accurately and quickly. We also provided a large-scale benchmark data set. We believe the presented analysis, data set, and proposed solutions combined with a user-friendly interface can be used as a rapid surveillance tool to analyze and monitor mobile apps deployed in emergency situations leading to rapid changes in the applications without going through an intense participatory design method.

Sections du résumé

BACKGROUND BACKGROUND
Contact tracing has been globally adopted in the fight to control the infection rate of COVID-19. To this aim, several mobile apps have been developed. However, there are ever-growing concerns over the working mechanism and performance of these applications. The literature already provides some interesting exploratory studies on the community's response to the applications by analyzing information from different sources, such as news and users' reviews of the applications. However, to the best of our knowledge, there is no existing solution that automatically analyzes users' reviews and extracts the evoked sentiments. We believe such solutions combined with a user-friendly interface can be used as a rapid surveillance tool to monitor how effective an application is and to make immediate changes without going through an intense participatory design method.
OBJECTIVE OBJECTIVE
In this paper, we aim to analyze the efficacy of AI and NLP techniques for automatically extracting and classifying the polarity of users' sentiments by proposing a sentiment analysis framework to automatically analyze users' reviews on COVID-19 contact tracing mobile apps. We also aim to provide a large-scale annotated benchmark data set to facilitate future research in the domain. As a proof of concept, we also developed a web application based on the proposed solutions, which is expected to help the community quickly analyze the potential of an application in the domain.
METHODS METHODS
We propose a pipeline starting from manual annotation via a crowd-sourcing study and concluding with the development and training of artificial intelligence (AI) models for automatic sentiment analysis of users' reviews. In detail, we collected and annotated a large-scale data set of user reviews on COVID-19 contact tracing applications. We used both classical and deep learning methods for classification experiments.
RESULTS RESULTS
We used 8 different methods on 3 different tasks, achieving up to an average F1 score of 94.8%, indicating the feasibility of the proposed solution. The crowd-sourcing activity resulted in a large-scale benchmark data set composed of 34,534 manually annotated reviews.
CONCLUSIONS CONCLUSIONS
The existing literature mostly relies on the manual or exploratory analysis of users' reviews on applications, which is tedious and time-consuming. In existing studies, generally, data from fewer applications are analyzed. In this work, we showed that AI and natural language processing techniques provide good results for analyzing and classifying users' sentiments' polarity and that automatic sentiment analysis can help to analyze users' responses more accurately and quickly. We also provided a large-scale benchmark data set. We believe the presented analysis, data set, and proposed solutions combined with a user-friendly interface can be used as a rapid surveillance tool to analyze and monitor mobile apps deployed in emergency situations leading to rapid changes in the applications without going through an intense participatory design method.

Identifiants

pubmed: 35389357
pii: v6i5e36238
doi: 10.2196/36238
pmc: PMC9097863
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e36238

Informations de copyright

©Kashif Ahmad, Firoj Alam, Junaid Qadir, Basheer Qolomany, Imran Khan, Talhat Khan, Muhammad Suleman, Naina Said, Syed Zohaib Hassan, Asma Gul, Mowafa Househ, Ala Al-Fuqaha. Originally published in JMIR Formative Research (https://formative.jmir.org), 11.05.2022.

Références

JMIR Mhealth Uhealth. 2020 Apr 20;8(4):e19359
pubmed: 32294052
J Syst Softw. 2022 Feb;184:111136
pubmed: 34751198
Diabetes Metab Syndr. 2020 Nov-Dec;14(6):1631-1636
pubmed: 32892060
Ir J Med Sci. 2021 Aug;190(3):863-887
pubmed: 33063226
Chaos Solitons Fractals. 2020 Oct;139:110059
pubmed: 32834612
JMIR Mhealth Uhealth. 2020 Aug 28;8(8):e19857
pubmed: 32759102
Ir J Med Sci. 2022 Feb;191(1):103-112
pubmed: 33604836
Lancet Infect Dis. 2020 Oct;20(10):1151-1160
pubmed: 32559451
J Am Med Inform Assoc. 2021 Jan 15;28(1):193-195
pubmed: 32584990
EClinicalMedicine. 2020 Jun 15;24:100412
pubmed: 32766536
MMWR Morb Mortal Wkly Rep. 2020 Sep 25;69(38):1360-1363
pubmed: 32970654

Auteurs

Kashif Ahmad (K)

Information and Computing Technology Division, College of Science and Engineering, Hamad Bin Khalifa University, Doha, Qatar.

Firoj Alam (F)

Qatar Computing Research Institute, Hamad Bin Khalifa University, Doha, Qatar.

Junaid Qadir (J)

Department of Computer Science and Engineering, Faculty of Engineering, Qatar University, Doha, Qatar.

Basheer Qolomany (B)

Department of Cyber Systems, University of Nebraska, Kearney, NE, United States.

Imran Khan (I)

Department of Computer Systems Engineering, University of Engineering and Technology, Peshawar, Pakistan.

Talhat Khan (T)

Department of Computer Systems Engineering, University of Engineering and Technology, Peshawar, Pakistan.

Muhammad Suleman (M)

Department of Computer Systems Engineering, University of Engineering and Technology, Peshawar, Pakistan.

Naina Said (N)

Department of Computer Systems Engineering, University of Engineering and Technology, Peshawar, Pakistan.

Syed Zohaib Hassan (SZ)

Department of Holistic Systems, SimulaMet, Oslo, Norway.

Asma Gul (A)

Department of Statistics, Shaheed Benazir Bhutto Women University, Peshawar, Pakistan.

Mowafa Househ (M)

Information and Computing Technology Division, College of Science and Engineering, Hamad Bin Khalifa University, Doha, Qatar.

Ala Al-Fuqaha (A)

Information and Computing Technology Division, College of Science and Engineering, Hamad Bin Khalifa University, Doha, Qatar.

Classifications MeSH