Exploring Natural Language Processing through an Exemplar Using YouTube.
Natural Language Processing
methodology
unstructured data analysis
Journal
International journal of environmental research and public health
ISSN: 1660-4601
Titre abrégé: Int J Environ Res Public Health
Pays: Switzerland
ID NLM: 101238455
Informations de publication
Date de publication:
15 Oct 2024
15 Oct 2024
Historique:
received:
14
08
2024
revised:
08
10
2024
accepted:
09
10
2024
medline:
26
10
2024
pubmed:
26
10
2024
entrez:
26
10
2024
Statut:
epublish
Résumé
There has been a growing emphasis on data across various health-related fields, not just in nursing research, due to the increasing volume of unstructured data in electronic health records (EHRs). Natural Language Processing (NLP) provides a solution by transforming this unstructured data into structured formats, thereby facilitating valuable insights. This methodology paper explores the application of NLP in nursing, using an exemplar case study that analyzes YouTube data to investigate social phenomena among adults living alone. The methodology involves five steps: accessing data through YouTube's API, data cleaning, preprocessing (tokenization, sentence segmentation, linguistic normalization), sentiment analysis using Python, and topic modeling. This study serves as a comprehensive guide for integrating NLP into nursing research, supplemented with digital content demonstrating each step. For successful implementation, nursing researchers must grasp the fundamental concepts and processes of NLP. The potential of NLP in nursing is significant, particularly in utilizing unstructured textual data from nursing documentation and social media. Its benefits include streamlining nursing documentation, enhancing patient communication, and improving data analysis.
Identifiants
pubmed: 39457330
pii: ijerph21101357
doi: 10.3390/ijerph21101357
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : the National Research Foundation of Korea (NRF) grant
ID : NRF-2022R1F1A1068236