Implicit and Explicit Regularization for Optical Flow Estimation.

coordconv motion consistency optical flow regularization semantic segmentation

Journal

Sensors (Basel, Switzerland)
ISSN: 1424-8220
Titre abrégé: Sensors (Basel)
Pays: Switzerland
ID NLM: 101204366

Informations de publication

Date de publication:
10 Jul 2020
Historique:
received: 02 06 2020
revised: 02 07 2020
accepted: 07 07 2020
entrez: 16 7 2020
pubmed: 16 7 2020
medline: 16 7 2020
Statut: epublish

Résumé

In this paper, two novel and practical regularizing methods are proposed to improve existing neural network architectures for monocular optical flow estimation. The proposed methods aim to alleviate deficiencies of current methods, such as flow leakage across objects and motion consistency within rigid objects, by exploiting contextual information. More specifically, the first regularization method utilizes semantic information during the training process to explicitly regularize the produced optical flow field. The novelty of this method lies in the use of semantic segmentation masks to teach the network to implicitly identify the semantic edges of an object and better reason on the local motion flow. A novel loss function is introduced that takes into account the objects' boundaries as derived from the semantic segmentation mask to selectively penalize motion inconsistency within an object. The method is architecture agnostic and can be integrated into any neural network without modifying or adding complexity at inference. The second regularization method adds spatial awareness to the input data of the network in order to improve training stability and efficiency. The coordinates of each pixel are used as an additional feature, breaking the invariance properties of the neural network architecture. The additional features are shown to implicitly regularize the optical flow estimation enforcing a consistent flow, while improving both the performance and the convergence time. Finally, the combination of both regularization methods further improves the performance of existing cutting edge architectures in a complementary way, both quantitatively and qualitatively, on popular flow estimation benchmark datasets.

Identifiants

pubmed: 32664442
pii: s20143855
doi: 10.3390/s20143855
pmc: PMC7412516
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : Horizon 2020
ID : H2020-787061

Références

IEEE Trans Pattern Anal Mach Intell. 2020 Jun;42(6):1408-1423
pubmed: 30676944
J Cogn Neurosci. 2020 May;32(5):847-861
pubmed: 31933430
Exp Brain Res. 2017 May;235(5):1297-1308
pubmed: 28255843
Sensors (Basel). 2019 May 29;19(11):
pubmed: 31146404
IEEE Trans Pattern Anal Mach Intell. 1986 May;8(5):565-93
pubmed: 21869357
Sensors (Basel). 2020 Mar 04;20(5):
pubmed: 32143297

Auteurs

Konstantinos Karageorgos (K)

The Visual Computing Lab, Information Technologies Institute, Centre for Research and Technology Hellas, 57001 Thessaloniki, Greece.

Anastasios Dimou (A)

The Visual Computing Lab, Information Technologies Institute, Centre for Research and Technology Hellas, 57001 Thessaloniki, Greece.
Universidad Politécnica de Madrid, 28040 Madrid, Spain.

Federico Alvarez (F)

Universidad Politécnica de Madrid, 28040 Madrid, Spain.

Petros Daras (P)

The Visual Computing Lab, Information Technologies Institute, Centre for Research and Technology Hellas, 57001 Thessaloniki, Greece.

Classifications MeSH