Medical Needs Extraction for Breast Cancer Patients from Question and Answer Services: Natural Language Processing-Based Approach.

internet use natural language processing neoplasms patient generated health data

Journal

JMIR cancer
ISSN: 2369-1999
Titre abrégé: JMIR Cancer
Pays: Canada
ID NLM: 101666844

Informations de publication

Date de publication:
28 Oct 2021
Historique:
received: 14 07 2021
accepted: 04 10 2021
revised: 25 09 2021
entrez: 28 10 2021
pubmed: 29 10 2021
medline: 29 10 2021
Statut: epublish

Résumé

A large number of patient narratives are available on various web services. As for web question and answer services, patient questions often relate to medical needs, and we expect these questions to provide clues for a better understanding of patients' medical needs. This study aimed to extract patients' needs and classify them into thematic categories. Clarifying patient needs is the first step in solving social issues that patients with cancer encounter. For this study, we used patient question texts containing the key phrase "breast cancer," available at the Yahoo! Japan question and answer service, Yahoo! Chiebukuro, which contains over 60,000 questions on cancer. First, we converted the question text into a vector representation. Next, the relevance between patient needs and existing cancer needs categories was calculated based on cosine similarity. The proportion of correct classifications in our proposed method was approximately 70%. Considering the results of classifying questions, we found the variation and the number of needs. We created 3 corpora to classify the problems of patients with cancer. The proposed method was able to classify the problems considering the question text. Moreover, as an application example, the question text that included the side effect signaling of drugs and the unmet needs of cancer patients could be extracted. Revealing these needs is important to fulfill the medical needs of patients with cancer.

Sections du résumé

BACKGROUND BACKGROUND
A large number of patient narratives are available on various web services. As for web question and answer services, patient questions often relate to medical needs, and we expect these questions to provide clues for a better understanding of patients' medical needs.
OBJECTIVE OBJECTIVE
This study aimed to extract patients' needs and classify them into thematic categories. Clarifying patient needs is the first step in solving social issues that patients with cancer encounter.
METHODS METHODS
For this study, we used patient question texts containing the key phrase "breast cancer," available at the Yahoo! Japan question and answer service, Yahoo! Chiebukuro, which contains over 60,000 questions on cancer. First, we converted the question text into a vector representation. Next, the relevance between patient needs and existing cancer needs categories was calculated based on cosine similarity.
RESULTS RESULTS
The proportion of correct classifications in our proposed method was approximately 70%. Considering the results of classifying questions, we found the variation and the number of needs.
CONCLUSIONS CONCLUSIONS
We created 3 corpora to classify the problems of patients with cancer. The proposed method was able to classify the problems considering the question text. Moreover, as an application example, the question text that included the side effect signaling of drugs and the unmet needs of cancer patients could be extracted. Revealing these needs is important to fulfill the medical needs of patients with cancer.

Identifiants

pubmed: 34709187
pii: v7i4e32005
doi: 10.2196/32005
pmc: PMC8587180
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e32005

Informations de copyright

©Masaru Kamba, Masae Manabe, Shoko Wakamiya, Shuntaro Yada, Eiji Aramaki, Satomi Odani, Isao Miyashiro. Originally published in JMIR Cancer (https://cancer.jmir.org), 28.10.2021.

Références

J Med Internet Res. 2013 Jun 18;15(6):e124
pubmed: 23778053
J Med Internet Res. 2014 May 27;16(5):e137
pubmed: 24867458
Qual Life Res. 2010 Oct;19(8):1087-96
pubmed: 20512662
J Med Internet Res. 2017 Apr 21;19(4):e126
pubmed: 28432038
BMJ. 2013 Jan 28;346:f167
pubmed: 23358487
J Med Internet Res. 2020 Apr 16;22(4):e13071
pubmed: 32297872
JMIR Med Inform. 2016 Nov 24;4(4):e41
pubmed: 27884812

Auteurs

Masaru Kamba (M)

Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, Nara, Japan.

Masae Manabe (M)

Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, Nara, Japan.

Shoko Wakamiya (S)

Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, Nara, Japan.

Shuntaro Yada (S)

Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, Nara, Japan.

Eiji Aramaki (E)

Division of Information Science, Graduate School of Science and Technology, Nara Institute of Science and Technology, Nara, Japan.

Satomi Odani (S)

Cancer Control Center, Osaka International Cancer Institute, Osaka, Japan.

Isao Miyashiro (I)

Cancer Control Center, Osaka International Cancer Institute, Osaka, Japan.

Classifications MeSH