Matrix completion under complex survey sampling.

Asymptotic upper bound Augmented inverse probability weighting estimator Low-rank structure Missingness at random

Journal

Annals of the Institute of Statistical Mathematics
ISSN: 0020-3157
Titre abrégé: Ann Inst Stat Math
Pays: Japan
ID NLM: 101509177

Informations de publication

Date de publication:
Jun 2023
Historique:
medline: 30 8 2023
pubmed: 30 8 2023
entrez: 30 8 2023
Statut: ppublish

Résumé

Multivariate nonresponse is often encountered in complex survey sampling, and simply ignoring it leads to erroneous inference. In this paper, we propose a new matrix completion method for complex survey sampling. Different from existing works either conducting row-wise or column-wise imputation, the data matrix is treated as a whole which allows for exploiting both row and column patterns simultaneously. A column-space-decomposition model is adopted incorporating a low-rank structured matrix for the finite population with easy-to-obtain demographic information as covariates. Besides, we propose a computationally efficient projection strategy to identify the model parameters under complex survey sampling. Then, an augmented inverse probability weighting estimator is used to estimate the parameter of interest, and the corresponding asymptotic upper bound of the estimation error is derived. Simulation studies show that the proposed estimator has a smaller mean squared error than other competitors, and the corresponding variance estimator performs well. The proposed method is applied to assess the health status of the U.S. population.

Identifiants

pubmed: 37645434
doi: 10.1007/s10463-022-00851-5
pmc: PMC10465119
mid: NIHMS1875523
doi:

Types de publication

Journal Article

Langues

eng

Pagination

463-492

Subventions

Organisme : NIA NIH HHS
ID : R01 AG066883
Pays : United States
Organisme : NIEHS NIH HHS
ID : R01 ES031651
Pays : United States

Références

Proc Natl Acad Sci U S A. 2019 Nov 12;116(46):22931-22937
pubmed: 31666329
Vital Health Stat 2. 2020 Apr;(184):1-35
pubmed: 33663649
Biostatistics. 2020 Apr 1;21(2):236-252
pubmed: 30203058
J Mach Learn Res. 2010 Mar 1;11:2287-2322
pubmed: 21552465
Int Stat Rev. 2010 Apr;78(1):40-64
pubmed: 21743766
IEEE Trans Neural Netw Learn Syst. 2012 May;23(5):737-48
pubmed: 24806123
Biometrika. 2018 Jun;105(2):479-486
pubmed: 30799873

Auteurs

Xiaojun Mao (X)

School of Mathematical Sciences, Ministry of Education Key Laboratory of Scientific and Engineering Computing, Shanghai Jiao Tong University, Shanghai 200240, People's Republic of China.

Zhonglei Wang (Z)

Wang Yanan Institute for Studies in Economics and School of Economics, Xiamen University, Xiamen 361005, Fujian, People's Republic of China.

Shu Yang (S)

Department of Statistics, North Carolina State University, Raleigh, NC 27695, USA.

Classifications MeSH