Matrix completion under complex survey sampling.
Asymptotic upper bound
Augmented inverse probability weighting estimator
Low-rank structure
Missingness at random
Journal
Annals of the Institute of Statistical Mathematics
ISSN: 0020-3157
Titre abrégé: Ann Inst Stat Math
Pays: Japan
ID NLM: 101509177
Informations de publication
Date de publication:
Jun 2023
Jun 2023
Historique:
medline:
30
8
2023
pubmed:
30
8
2023
entrez:
30
8
2023
Statut:
ppublish
Résumé
Multivariate nonresponse is often encountered in complex survey sampling, and simply ignoring it leads to erroneous inference. In this paper, we propose a new matrix completion method for complex survey sampling. Different from existing works either conducting row-wise or column-wise imputation, the data matrix is treated as a whole which allows for exploiting both row and column patterns simultaneously. A column-space-decomposition model is adopted incorporating a low-rank structured matrix for the finite population with easy-to-obtain demographic information as covariates. Besides, we propose a computationally efficient projection strategy to identify the model parameters under complex survey sampling. Then, an augmented inverse probability weighting estimator is used to estimate the parameter of interest, and the corresponding asymptotic upper bound of the estimation error is derived. Simulation studies show that the proposed estimator has a smaller mean squared error than other competitors, and the corresponding variance estimator performs well. The proposed method is applied to assess the health status of the U.S. population.
Identifiants
pubmed: 37645434
doi: 10.1007/s10463-022-00851-5
pmc: PMC10465119
mid: NIHMS1875523
doi:
Types de publication
Journal Article
Langues
eng
Pagination
463-492Subventions
Organisme : NIA NIH HHS
ID : R01 AG066883
Pays : United States
Organisme : NIEHS NIH HHS
ID : R01 ES031651
Pays : United States
Références
Proc Natl Acad Sci U S A. 2019 Nov 12;116(46):22931-22937
pubmed: 31666329
Vital Health Stat 2. 2020 Apr;(184):1-35
pubmed: 33663649
Biostatistics. 2020 Apr 1;21(2):236-252
pubmed: 30203058
J Mach Learn Res. 2010 Mar 1;11:2287-2322
pubmed: 21552465
Int Stat Rev. 2010 Apr;78(1):40-64
pubmed: 21743766
IEEE Trans Neural Netw Learn Syst. 2012 May;23(5):737-48
pubmed: 24806123
Biometrika. 2018 Jun;105(2):479-486
pubmed: 30799873