A Case Study Competition Among Methods for Analyzing Large Spatial Data.
Big data
Gaussian process
Low-rank approximation
Parallel computing
Journal
Journal of agricultural, biological, and environmental statistics
ISSN: 1085-7117
Titre abrégé: J Agric Biol Environ Stat
Pays: Switzerland
ID NLM: 9890703
Informations de publication
Date de publication:
2019
2019
Historique:
received:
30
11
2018
accepted:
05
12
2018
entrez:
10
9
2019
pubmed:
10
9
2019
medline:
10
9
2019
Statut:
ppublish
Résumé
The Gaussian process is an indispensable tool for spatial data analysts. The onset of the "big data" era, however, has lead to the traditional Gaussian process being computationally infeasible for modern spatial data. As such, various alternatives to the full Gaussian process that are more amenable to handling big spatial data have been proposed. These modern methods often exploit low-rank structures and/or multi-core and multi-threaded computing environments to facilitate computation. This study provides, first, an introductory overview of several methods for analyzing large spatial data. Second, this study describes the results of a predictive competition among the described methods as implemented by different groups with strong expertise in the methodology. Specifically, each research group was provided with two training datasets (one simulated and one observed) along with a set of prediction locations. Each group then wrote their own implementation of their method to produce predictions at the given location and each was subsequently run on a common computing environment. The methods were then compared in terms of various predictive diagnostics. Supplementary materials regarding implementation details of the methods and code are available for this article online. Supplementary materials for this article are available at 10.1007/s13253-018-00348-w.
Identifiants
pubmed: 31496633
doi: 10.1007/s13253-018-00348-w
pii: 348
pmc: PMC6709111
doi:
Types de publication
Journal Article
Langues
eng
Pagination
398-425Références
IEEE Trans Neural Netw Learn Syst. 2020 Nov;31(11):4405-4423
pubmed: 31944966
Biometrika. 2019 Jun;106(2):267-286
pubmed: 31097832
Spat Stat. 2018 Dec;28:59-78
pubmed: 31008043
J R Stat Soc Ser C Appl Stat. 2014 Nov;63(5):737-761
pubmed: 26401059
J Am Stat Assoc. 2016;111(514):800-812
pubmed: 29720777
ISPRS J Photogramm Remote Sens. 2014 Dec;98:106-118
pubmed: 25642100
Biostatistics. 2014 Jul;15(3):457-69
pubmed: 24622038
J Comput Graph Stat. 2019;28(2):401-414
pubmed: 31543693
Technometrics. 2018;60(4):430-444
pubmed: 31007296
Wiley Interdiscip Rev Comput Stat. 2016 Sep-Oct;8(5):162-171
pubmed: 29657666
J Am Stat Assoc. 2007 Mar;102(477):321-331
pubmed: 19079638
Ann Appl Stat. 2016 Sep;10(3):1286-1316
pubmed: 29657659
J R Stat Soc Series B Stat Methodol. 2008 Sep 1;70(4):825-848
pubmed: 19750209
Comput Stat Data Anal. 2009 Jun 15;53(8):2873-2884
pubmed: 20016667
Stat Sin. 2019;29:1155-1180
pubmed: 33311955
Biometrics. 2000 Mar;56(1):13-21
pubmed: 10783772