Weakly Supervised Learning for Categorization of Medical Inquiries for Customer Service Effectiveness.
customer inquiry
deep learning
medical information
natural language processing
weakly supervised learning
Journal
Frontiers in research metrics and analytics
ISSN: 2504-0537
Titre abrégé: Front Res Metr Anal
Pays: Switzerland
ID NLM: 101718019
Informations de publication
Date de publication:
2021
2021
Historique:
received:
20
03
2021
accepted:
28
06
2021
entrez:
19
8
2021
pubmed:
20
8
2021
medline:
20
8
2021
Statut:
epublish
Résumé
With the growing unstructured data in healthcare and pharmaceutical, there has been a drastic adoption of natural language processing for generating actionable insights from text data sources. One of the key areas of our exploration is the Medical Information function within our organization. We receive a significant amount of medical information inquires in the form of unstructured text. An enterprise-level solution must deal with medical information interactions via multiple communication channels which are always nuanced with a variety of keywords and emotions that are unique to the pharmaceutical industry. There is a strong need for an effective solution to leverage the contextual knowledge of the medical information business along with digital tenants of natural language processing (NLP) and machine learning to build an automated and scalable process that generates real-time insights on conversation categories. The traditional supervised learning methods rely on a huge set of manually labeled training data and this dataset is difficult to attain due to high labeling costs. Thus, the solution is incomplete without its ability to self-learn and improve. This necessitates techniques to automatically build relevant training data using a weakly supervised approach from textual inquiries across consumers, healthcare professionals, sales, and service providers. The solution has two fundamental layers of NLP and machine learning. The first layer leverages heuristics and knowledgebase to identify the potential categories and build an annotated training data. The second layer, based on machine learning and deep learning, utilizes the training data generated using the heuristic approach for identifying categories and sub-categories associated with verbatim. Here, we present a novel approach harnessing the power of weakly supervised learning combined with multi-class classification for improved categorization of medical information inquiries.
Identifiants
pubmed: 34409245
doi: 10.3389/frma.2021.683400
pii: 683400
pmc: PMC8366288
doi:
Types de publication
Journal Article
Langues
eng
Pagination
683400Informations de copyright
Copyright © 2021 Singhal, Hegde, Karmalkar, Muhith and Gurulingappa.
Déclaration de conflit d'intérêts
The authors declare that the research was conducted in the absence of any commercial or financial relationships that could be construed as a potential conflict of interest.
Références
EGEMS (Wash DC). 2016 Aug 10;4(3):1222
pubmed: 27683664
BMC Bioinformatics. 2016 Jan 11;17 Suppl 1:1
pubmed: 26817711
Bioinformatics. 2020 Feb 15;36(4):1234-1240
pubmed: 31501885
J Am Med Inform Assoc. 2011 Sep-Oct;18(5):580-7
pubmed: 21709161
Sci Rep. 2020 Jun 9;10(1):9297
pubmed: 32518413