Two-stage sampling in the estimation of growth parameters and percentile norms: sample weights versus auxiliary variable estimation.

Auxiliary variable Missing data Percentile norms Population norms Two-stage design Weights

Journal

BMC medical research methodology
ISSN: 1471-2288
Titre abrégé: BMC Med Res Methodol
Pays: England
ID NLM: 100968545

Informations de publication

Date de publication:
17 08 2021
Historique:
received: 18 11 2020
accepted: 20 07 2021
entrez: 18 8 2021
pubmed: 19 8 2021
medline: 28 8 2021
Statut: epublish

Résumé

The use of auxiliary variables with maximum likelihood parameter estimation for surveys that miss data by design is not a widespread approach, despite its documented improved efficiency over traditional approaches that deploy sampling weights. Although efficiency gains from the use of Normally distributed auxiliary variables in a model have been recorded in the literature, little is known about the effects of non-Normal auxiliary variables in the parameter estimation. We simulate growth data to mimic SCALES, a two-stage survey of language development with a screening phase (stage one) for which data are observed for the whole sample and an intensive assessments phase (stage two), for which data are observed for a sub-sample, selected using stratified random sampling. In the simulation, we allow a fully observed Poisson distributed stratification criterion to be correlated with the partially observed model responses and develop five generalised structural equation growth models that host the auxiliary information from this criterion. We compare these models with each other and with a weighted growth model in terms of bias, efficiency, and coverage. We finally apply our best performing model to SCALES data and show how to obtain growth parameters and population norms. Parameter estimation from a model that incorporates a non-Normal auxiliary variable is unbiased and more efficient than its weighted counterpart. The auxiliary variable method is capable of producing efficient population percentile norms and velocities. The deployment of a fully observed variable that dominates the selection of the sample and correlates strongly with the incomplete variable of interest appears beneficial for the estimation process.

Sections du résumé

BACKGROUND
The use of auxiliary variables with maximum likelihood parameter estimation for surveys that miss data by design is not a widespread approach, despite its documented improved efficiency over traditional approaches that deploy sampling weights. Although efficiency gains from the use of Normally distributed auxiliary variables in a model have been recorded in the literature, little is known about the effects of non-Normal auxiliary variables in the parameter estimation.
METHODS
We simulate growth data to mimic SCALES, a two-stage survey of language development with a screening phase (stage one) for which data are observed for the whole sample and an intensive assessments phase (stage two), for which data are observed for a sub-sample, selected using stratified random sampling. In the simulation, we allow a fully observed Poisson distributed stratification criterion to be correlated with the partially observed model responses and develop five generalised structural equation growth models that host the auxiliary information from this criterion. We compare these models with each other and with a weighted growth model in terms of bias, efficiency, and coverage. We finally apply our best performing model to SCALES data and show how to obtain growth parameters and population norms.
RESULTS
Parameter estimation from a model that incorporates a non-Normal auxiliary variable is unbiased and more efficient than its weighted counterpart. The auxiliary variable method is capable of producing efficient population percentile norms and velocities.
CONCLUSIONS
The deployment of a fully observed variable that dominates the selection of the sample and correlates strongly with the incomplete variable of interest appears beneficial for the estimation process.

Identifiants

pubmed: 34404347
doi: 10.1186/s12874-021-01353-3
pii: 10.1186/s12874-021-01353-3
pmc: PMC8369688
doi:

Types de publication

Journal Article Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

173

Subventions

Organisme : Wellcome
ID : WT094836AIA
Organisme : Economic and Social Research Council
ID : ES/R003041
Organisme : Economic and Social Research Council
ID : ES/R003041

Informations de copyright

© 2021. The Author(s).

Références

Stat Med. 1992 Jul;11(10):1305-19
pubmed: 1518992
Stat Med. 2011 Feb 20;30(4):377-99
pubmed: 21225900
Arch Dis Child. 1952 Feb;27(131):10-33
pubmed: 14904111
J Child Psychol Psychiatry. 2016 Nov;57(11):1247-1257
pubmed: 27184709
JAMA. 1985 Jan 25;253(4):530-4
pubmed: 3968785
J Speech Lang Hear Res. 1997 Dec;40(6):1245-60
pubmed: 9430746
Stat Med. 1998 Feb 28;17(4):407-29
pubmed: 9496720
Clin Chem. 2001 Oct;47(10):1804-10
pubmed: 11568090
Stat Methods Med Res. 1996 Sep;5(3):239-61
pubmed: 8931195
Clin Exp Immunol. 1993 Feb;91(2):337-41
pubmed: 8428400
Stat Med. 2019 May 20;38(11):2074-2102
pubmed: 30652356
PLoS One. 2019 Mar 7;14(3):e0213492
pubmed: 30845265
J Speech Hear Disord. 1986 May;51(2):98-110
pubmed: 3702369
J Child Psychol Psychiatry. 2017 Oct;58(10):1092-1105
pubmed: 28921543
Stat Methods Med Res. 1995 Mar;4(1):73-89
pubmed: 7613639
Psychol Methods. 2001 Dec;6(4):330-51
pubmed: 11778676
Vital Health Stat 11. 2002 May;(246):1-190
pubmed: 12043359

Auteurs

George Vamvakas (G)

Department of Biostatistics and Health Informatics, Institute of Psychology, Psychiatry and Neuroscience, Kings College London, London, UK. george.vamvakas@kcl.ac.uk.

Courtenay Norbury (C)

Psychology and Language Sciences, University College London, London, UK.

Andrew Pickles (A)

Department of Biostatistics and Health Informatics, Institute of Psychology, Psychiatry and Neuroscience, Kings College London, London, UK.

Articles similaires

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C
1.00
Humans Yoga Low Back Pain Female Male

Classifications MeSH