Spatial multi-attention conditional neural processes.

Attention mechanism Conditional neural processes Gaussian processes Spatial prediction

Journal

Neural networks : the official journal of the International Neural Network Society
ISSN: 1879-2782
Titre abrégé: Neural Netw
Pays: United States
ID NLM: 8805018

Informations de publication

Date de publication:
28 Feb 2024
Historique:
received: 16 07 2023
revised: 03 01 2024
accepted: 20 02 2024
medline: 7 3 2024
pubmed: 7 3 2024
entrez: 6 3 2024
Statut: aheadofprint

Résumé

Spatial prediction tasks are challenging when observed samples are sparse and prediction samples are abundant. Gaussian processes (GPs) are commonly used in spatial prediction tasks and have the advantage of measuring the uncertainty of the interpolation result. However, as the sample size increases, GPs suffer from significant overhead. Standard neural networks (NNs) provide a powerful and scalable solution for modeling spatial data, but they often overfit small sample data. Based on conditional neural processes (CNPs), which combine the advantages of GPs and NNs, we propose a new framework called Spatial Multi-Attention Conditional Neural Processes (SMACNPs) for spatial small sample prediction tasks. SMACNPs are a modular model that can predict targets by employing different attention mechanisms to extract relevant information from different forms of sample data. The task representation is inferred by measuring the spatial correlation contained in different sample points and the relationship contained in attribute variables, respectively. The distribution of the target variable is predicted by GPs parameterized by NNs. SMACNPs allow us to obtain accurate predictions of the target value while quantifying the prediction uncertainty. Experiments on spatial prediction tasks on simulated and real-world datasets demonstrate that this framework flexibly incorporates spatial context and correlation into the model, achieving state-of-the-art results in spatial small sample prediction tasks in terms of both predictive performance and reliability. For example, on the California housing dataset, our method reduces MAE by 8% and MSE by 7% compared to the second-best method. In addition, a spatiotemporal prediction task to forecast traffic speed further confirms the effectiveness and generality of our method.

Identifiants

pubmed: 38447305
pii: S0893-6080(24)00125-4
doi: 10.1016/j.neunet.2024.106201
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

106201

Informations de copyright

Copyright © 2024 Elsevier Ltd. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Li-Li Bao (LL)

School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an Shaanxi, 710049, China.

Jiang-She Zhang (JS)

School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an Shaanxi, 710049, China. Electronic address: jszhang@mail.xjtu.edu.cn.

Chun-Xia Zhang (CX)

School of Mathematics and Statistics, Xi'an Jiaotong University, Xi'an Shaanxi, 710049, China.

Classifications MeSH