Two-stage sampling in the estimation of growth parameters and percentile norms: sample weights versus auxiliary variable estimation.
Auxiliary variable
Missing data
Percentile norms
Population norms
Two-stage design
Weights
Journal
BMC medical research methodology
ISSN: 1471-2288
Titre abrégé: BMC Med Res Methodol
Pays: England
ID NLM: 100968545
Informations de publication
Date de publication:
17 08 2021
17 08 2021
Historique:
received:
18
11
2020
accepted:
20
07
2021
entrez:
18
8
2021
pubmed:
19
8
2021
medline:
28
8
2021
Statut:
epublish
Résumé
The use of auxiliary variables with maximum likelihood parameter estimation for surveys that miss data by design is not a widespread approach, despite its documented improved efficiency over traditional approaches that deploy sampling weights. Although efficiency gains from the use of Normally distributed auxiliary variables in a model have been recorded in the literature, little is known about the effects of non-Normal auxiliary variables in the parameter estimation. We simulate growth data to mimic SCALES, a two-stage survey of language development with a screening phase (stage one) for which data are observed for the whole sample and an intensive assessments phase (stage two), for which data are observed for a sub-sample, selected using stratified random sampling. In the simulation, we allow a fully observed Poisson distributed stratification criterion to be correlated with the partially observed model responses and develop five generalised structural equation growth models that host the auxiliary information from this criterion. We compare these models with each other and with a weighted growth model in terms of bias, efficiency, and coverage. We finally apply our best performing model to SCALES data and show how to obtain growth parameters and population norms. Parameter estimation from a model that incorporates a non-Normal auxiliary variable is unbiased and more efficient than its weighted counterpart. The auxiliary variable method is capable of producing efficient population percentile norms and velocities. The deployment of a fully observed variable that dominates the selection of the sample and correlates strongly with the incomplete variable of interest appears beneficial for the estimation process.
Sections du résumé
BACKGROUND
The use of auxiliary variables with maximum likelihood parameter estimation for surveys that miss data by design is not a widespread approach, despite its documented improved efficiency over traditional approaches that deploy sampling weights. Although efficiency gains from the use of Normally distributed auxiliary variables in a model have been recorded in the literature, little is known about the effects of non-Normal auxiliary variables in the parameter estimation.
METHODS
We simulate growth data to mimic SCALES, a two-stage survey of language development with a screening phase (stage one) for which data are observed for the whole sample and an intensive assessments phase (stage two), for which data are observed for a sub-sample, selected using stratified random sampling. In the simulation, we allow a fully observed Poisson distributed stratification criterion to be correlated with the partially observed model responses and develop five generalised structural equation growth models that host the auxiliary information from this criterion. We compare these models with each other and with a weighted growth model in terms of bias, efficiency, and coverage. We finally apply our best performing model to SCALES data and show how to obtain growth parameters and population norms.
RESULTS
Parameter estimation from a model that incorporates a non-Normal auxiliary variable is unbiased and more efficient than its weighted counterpart. The auxiliary variable method is capable of producing efficient population percentile norms and velocities.
CONCLUSIONS
The deployment of a fully observed variable that dominates the selection of the sample and correlates strongly with the incomplete variable of interest appears beneficial for the estimation process.
Identifiants
pubmed: 34404347
doi: 10.1186/s12874-021-01353-3
pii: 10.1186/s12874-021-01353-3
pmc: PMC8369688
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
173Subventions
Organisme : Wellcome
ID : WT094836AIA
Organisme : Economic and Social Research Council
ID : ES/R003041
Organisme : Economic and Social Research Council
ID : ES/R003041
Informations de copyright
© 2021. The Author(s).
Références
Stat Med. 1992 Jul;11(10):1305-19
pubmed: 1518992
Stat Med. 2011 Feb 20;30(4):377-99
pubmed: 21225900
Arch Dis Child. 1952 Feb;27(131):10-33
pubmed: 14904111
J Child Psychol Psychiatry. 2016 Nov;57(11):1247-1257
pubmed: 27184709
JAMA. 1985 Jan 25;253(4):530-4
pubmed: 3968785
J Speech Lang Hear Res. 1997 Dec;40(6):1245-60
pubmed: 9430746
Stat Med. 1998 Feb 28;17(4):407-29
pubmed: 9496720
Clin Chem. 2001 Oct;47(10):1804-10
pubmed: 11568090
Stat Methods Med Res. 1996 Sep;5(3):239-61
pubmed: 8931195
Clin Exp Immunol. 1993 Feb;91(2):337-41
pubmed: 8428400
Stat Med. 2019 May 20;38(11):2074-2102
pubmed: 30652356
PLoS One. 2019 Mar 7;14(3):e0213492
pubmed: 30845265
J Speech Hear Disord. 1986 May;51(2):98-110
pubmed: 3702369
J Child Psychol Psychiatry. 2017 Oct;58(10):1092-1105
pubmed: 28921543
Stat Methods Med Res. 1995 Mar;4(1):73-89
pubmed: 7613639
Psychol Methods. 2001 Dec;6(4):330-51
pubmed: 11778676
Vital Health Stat 11. 2002 May;(246):1-190
pubmed: 12043359