VRT: A Video Restoration Transformer.

Journal

IEEE transactions on image processing : a publication of the IEEE Signal Processing Society

ISSN: 1941-0042

Titre abrégé: IEEE Trans Image Process

Pays: United States

ID NLM: 9886191

Informations de publication

Date de publication:
07 Mar 2024

Historique:

pubmed: 7 3 2024

medline: 7 3 2024

entrez: 7 3 2024

Statut: aheadofprint

Résumé

Video restoration aims to restore high-quality frames from low-quality frames. Different from single image restoration, video restoration generally requires to utilize temporal information from multiple adjacent but usually misaligned video frames. Existing deep methods generally tackle with this by exploiting a sliding window strategy or a recurrent architecture, which are restricted by frame-by-frame restoration. In this paper, we propose a Video Restoration Transformer (VRT) with parallel frame prediction ability. More specifically, VRT is composed of multiple scales, each of which consists of two kinds of modules: temporal reciprocal self attention (TRSA) and parallel warping. TRSA divides the video into small clips, on which reciprocal attention is applied for joint motion estimation, feature alignment and feature fusion, while self attention is used for feature extraction. To enable cross-clip interactions, the video sequence is shifted for every other layer. Besides, parallel warping is used to further fuse information from neighboring frames by parallel feature warping. Experimental results on five tasks, including video super-resolution, video deblurring, video denoising, video frame interpolation and space-time video super-resolution, demonstrate that VRT outperforms the state-of-the-art methods by large margins (up to 2.16dB) on fourteen benchmark datasets. The codes are available at https://github.com/JingyunLiang/VRT.

Identifiants

DOI: 10.1109/TIP.2024.3372454 PMID: 38451763

pubmed: 38451763

doi: 10.1109/TIP.2024.3372454

doi:

Types de publication

Journal Article

Langues

eng

VRT: A Video Restoration Transformer.

Journal

Informations de publication

Résumé

Identifiants

Types de publication

Langues

Sous-ensembles de citation

Auteurs

Jingyun Liang (J)

Jiezhang Cao (J)

Yuchen Fan (Y)

Kai Zhang (K)

Rakesh Ranjan (R)

Yawei Li (Y)

Radu Timofte (R)

Luc Van Gool (L)

Classifications MeSH