What Patients Say: Large-Scale Analyses of Replies to the Parkinson's Disease Patient Report of Problems (PD-PROP).


Journal

Journal of Parkinson's disease
ISSN: 1877-718X
Titre abrégé: J Parkinsons Dis
Pays: Netherlands
ID NLM: 101567362

Informations de publication

Date de publication:
2023
Historique:
medline: 1 8 2023
pubmed: 19 6 2023
entrez: 19 6 2023
Statut: ppublish

Résumé

Free-text, verbatim replies in the words of people with Parkinson's disease (PD) have the potential to provide unvarnished information about their feelings and experiences. Challenges of processing such data on a large scale are a barrier to analyzing verbatim data collection in large cohorts. To develop a method for curating responses from the Parkinson's Disease Patient Report of Problems (PD-PROP), open-ended questions that asks people with PD to report their most bothersome problems and associated functional consequences. Human curation, natural language processing, and machine learning were used to develop an algorithm to convert verbatim responses to classified symptoms. Nine curators including clinicians, people with PD, and a non-clinician PD expert classified a sample of responses as reporting each symptom or not. Responses to the PD-PROP were collected within the Fox Insight cohort study. Approximately 3,500 PD-PROP responses were curated by a human team. Subsequently, approximately 1,500 responses were used in the validation phase; median age of respondents was 67 years, 55% were men and median years since PD diagnosis was 3 years. 168,260 verbatim responses were classified by machine. Accuracy of machine classification was 95% on a held-out test set. 65 symptoms were grouped into 14 domains. The most frequently reported symptoms at first report were tremor (by 46% of respondents), gait and balance problems (>39%), and pain/discomfort (33%). A human-in-the-loop method of curation provides both accuracy and efficiency, permitting a clinically useful analysis of large datasets of verbatim reports about the problems that bother PD patients.

Sections du résumé

BACKGROUND
Free-text, verbatim replies in the words of people with Parkinson's disease (PD) have the potential to provide unvarnished information about their feelings and experiences. Challenges of processing such data on a large scale are a barrier to analyzing verbatim data collection in large cohorts.
OBJECTIVE
To develop a method for curating responses from the Parkinson's Disease Patient Report of Problems (PD-PROP), open-ended questions that asks people with PD to report their most bothersome problems and associated functional consequences.
METHODS
Human curation, natural language processing, and machine learning were used to develop an algorithm to convert verbatim responses to classified symptoms. Nine curators including clinicians, people with PD, and a non-clinician PD expert classified a sample of responses as reporting each symptom or not. Responses to the PD-PROP were collected within the Fox Insight cohort study.
RESULTS
Approximately 3,500 PD-PROP responses were curated by a human team. Subsequently, approximately 1,500 responses were used in the validation phase; median age of respondents was 67 years, 55% were men and median years since PD diagnosis was 3 years. 168,260 verbatim responses were classified by machine. Accuracy of machine classification was 95% on a held-out test set. 65 symptoms were grouped into 14 domains. The most frequently reported symptoms at first report were tremor (by 46% of respondents), gait and balance problems (>39%), and pain/discomfort (33%).
CONCLUSION
A human-in-the-loop method of curation provides both accuracy and efficiency, permitting a clinically useful analysis of large datasets of verbatim reports about the problems that bother PD patients.

Identifiants

pubmed: 37334615
pii: JPD225083
doi: 10.3233/JPD-225083
pmc: PMC10473108
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

757-767

Références

IEEE Comput Graph Appl. 2010 Nov-Dec;30(6):42-53
pubmed: 24807897
Nucleic Acids Res. 2004 Jan 1;32(Database issue):D267-70
pubmed: 14681409
Appl Clin Inform. 2020 Mar;11(2):242-252
pubmed: 32236917
J Patient Rep Outcomes. 2022 Apr 20;6(1):38
pubmed: 35441987
Proc IEEE Int Symp Bioinformatics Bioeng. 2017 Oct;2017:163-170
pubmed: 29399672
J Am Med Inform Assoc. 2019 Apr 1;26(4):276-285
pubmed: 30840079
Sci Data. 2020 Feb 24;7(1):67
pubmed: 32094335
J Parkinsons Dis. 2021;11(2):715-724
pubmed: 33459664
J Parkinsons Dis. 2022;12(6):1969-1978
pubmed: 35694935
J Biomed Semantics. 2022 Jun 11;13(1):17
pubmed: 35690873
J Parkinsons Dis. 2021;11(4):1957-1964
pubmed: 34250951
Health Qual Life Outcomes. 2020 Jun 10;18(1):177
pubmed: 32522194
J Patient Exp. 2020 Aug;7(4):541-548
pubmed: 33062876

Auteurs

Connie Marras (C)

Edmond J Safra Program in Parkinson's Disease, University Health Network, University of Toronto, Toronto, Canada.

Lakshmi Arbatti (L)

Grey Matter Technologies, a Wholly Owned Subsidiary of Modality.ai, San Francisco, CA, USA.

Abhishek Hosamath (A)

Grey Matter Technologies, a Wholly Owned Subsidiary of Modality.ai, San Francisco, CA, USA.

Amy Amara (A)

Department of Neurology, University of Colorado Anschutz Medical Campus, Aurora, CO, USA.

Karen E Anderson (KE)

Departments of Psychiatry and Neurology, Georgetown University, Washington DC, USA.

Lana M Chahine (LM)

Department of Neurology, University of Pittsburgh, Pittsburgh, PA, USA.

Shirley Eberly (S)

Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY, USA.

Dan Kinel (D)

Department of Neurology, University of Rochester, Rochester NY, USA.

Sneha Mantri (S)

Department of Neurology, Duke University, Durham, NC, USA.

Soania Mathur (S)

PD Avengers, Toronto, Canada.

David Oakes (D)

Department of Biostatistics and Computational Biology, University of Rochester, Rochester, NY, USA.

Jennifer L Purks (JL)

Department of Neurology, University of Rochester, Rochester NY, USA.

David G Standaert (DG)

PD Avengers, Toronto, Canada.

Caroline M Tanner (CM)

Department of Neurology, Weill Institute for Neurosciences, University of California - San Francisco, San Francisco, CA, USA.

Daniel Weintraub (D)

Departments of Psychiatry and Neurology, Perelman School of Medicine at the University of Pennsylvania, Philadelphia, PA, USA.

Ira Shoulson (I)

Grey Matter Technologies, a Wholly Owned Subsidiary of Modality.ai, San Francisco, CA, USA.
Department of Neurology, University of Rochester, Rochester NY, USA.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH