Validity evidence for Quality Improvement Knowledge Application Tool Revised (QIKAT-R) scores: consequences of rater number and type using neurology cases.

Adult Clinical Competence Education, Medical, Graduate / methods Educational Measurement / methods Female Florida Humans Internship and Residency / methods Knowledge Male Minnesota Neurology / education Quality Improvement Reproducibility of Results

graduate medical education medical education quality improvement

Journal

BMJ quality & safety

ISSN: 2044-5423

Titre abrégé: BMJ Qual Saf

Pays: England

ID NLM: 101546984

Informations de publication

Date de publication:
11 2019

Historique:

received: 07 08 2018

revised: 05 02 2019

accepted: 31 03 2019

pubmed: 19 4 2019

medline: 10 6 2020

entrez: 19 4 2019

Statut: ppublish

Résumé

To develop neurology scenarios for use with the Quality Improvement Knowledge Application Tool Revised (QIKAT-R), gather and evaluate validity evidence, and project the impact of scenario number, rater number and rater type on score reliability. Six neurological case scenarios were developed. Residents were randomly assigned three scenarios before and after a quality improvement (QI) course in 2015 and 2016. For each scenario, residents crafted an aim statement, selected a measure and proposed a change to address a quality gap. Responses were scored by six faculty raters (two with and four without QI expertise) using the QIKAT-R. Validity evidence from content, response process, internal structure, relations to other variables and consequences was collected. A generalisability (G) study examined sources of score variability, and decision analyses estimated projected reliability for different numbers of raters and scenarios and raters with and without QI expertise. Raters scored 163 responses from 28 residents. The mean QIKAT-R score was 5.69 (SD 1.06). G-coefficient and Phi-coefficient were 0.65 and 0.60, respectively. Interrater reliability was fair for raters without QI expertise (intraclass correlation = 0.53, 95% CI 0.30 to 0.72) and acceptable for raters with QI expertise (intraclass correlation = 0.66, 95% CI 0.02 to 0.88). Postcourse scores were significantly higher than precourse scores (6.05, SD 1.48 vs 5.22, SD 1.5; p < 0.001). Sufficient reliability for formative assessment (G-coefficient > 0.60) could be achieved by three raters scoring six scenarios or two raters scoring eight scenarios, regardless of rater QI expertise. Validity evidence was sufficient to support the use of the QIKAT-R with multiple scenarios and raters to assess resident QI knowledge application for formative or low-stakes summative purposes. The results provide practical information for educators to guide implementation decisions.

Identifiants

DOI: 10.1136/bmjqs-2018-008689 PMID: 30996038

pubmed: 30996038

pii: bmjqs-2018-008689

doi: 10.1136/bmjqs-2018-008689

doi:

Types de publication

Comparative Study Journal Article Validation Study

Langues

eng

Pagination

925-933

Informations de copyright

Déclaration de conflit d'intérêts

Competing interests: None declared.

Validity evidence for Quality Improvement Knowledge Application Tool Revised (QIKAT-R) scores: consequences of rater number and type using neurology cases.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Pagination

Informations de copyright

Déclaration de conflit d'intérêts

Auteurs

Charles Kassardjian (C)

Yoon Soo Park (YS)

Sherri Braksick (S)

Jeremy Cutsforth-Gregory (J)

Carrie Robertson (C)

Nathan Young (N)

Andrea Leep Hunderfund (A)

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Smoking Cessation and Incident Cardiovascular Disease.

Evaluation of Low-Value Services Across Major Medicare Advantage Insurers and Traditional Medicare.

Effectiveness of Virtual Yoga for Chronic Low Back Pain: A Randomized Clinical Trial.

Classifications MeSH