Unsupervised Learning for Depth, Ego-Motion, and Optical Flow Estimation Using Coupled Consistency Conditions.

camera ego-motion coupled consistency conditions depth estimation optical flow unsupervised learning

Journal

Sensors (Basel, Switzerland)
ISSN: 1424-8220
Titre abrégé: Sensors (Basel)
Pays: Switzerland
ID NLM: 101204366

Informations de publication

Date de publication:
29 May 2019
Historique:
received: 03 05 2019
revised: 23 05 2019
accepted: 27 05 2019
entrez: 1 6 2019
pubmed: 31 5 2019
medline: 31 5 2019
Statut: epublish

Résumé

Herein, we propose an unsupervised learning architecture under coupled consistency conditions to estimate the depth, ego-motion, and optical flow. Previously invented learning techniques in computer vision adopted a large amount of the ground truth dataset for network training. A ground truth dataset, including depth and optical flow collected from the real world, requires tremendous effort in pre-processing due to the exposure to noise artifacts. In this paper, we propose a framework that trains networks while using a different type of data with combined losses that are derived from a coupled consistency structure. The core concept is composed of two parts. First, we compare the optical flows, which are estimated from both the depth plus ego-motion and flow estimation network. Subsequently, to prevent the effects of the artifacts of the occluded regions in the estimated optical flow, we compute flow local consistency along the forward-backward directions. Second, synthesis consistency enables the exploration of the geometric correlation between the spatial and temporal domains in a stereo video. We perform extensive experiments on the depth, ego-motion, and optical flow estimation on the Karlsruhe Institute of Technology and Toyota Technological Institute (KITTI) dataset. We verify that the flow local consistency loss improves the optical flow accuracy in terms of the occluded regions. Furthermore, we also show that the view-synthesis-based photometric loss enhances the depth and ego-motion accuracy via scene projection. The experimental results exhibit the competitive performance of the estimated depth and the optical flow; moreover, the induced ego-motion is comparable to that obtained from other unsupervised methods.

Identifiants

pubmed: 31146404
pii: s19112459
doi: 10.3390/s19112459
pmc: PMC6603746
pii:
doi:

Types de publication

Journal Article

Langues

eng

Subventions

Organisme : MOTIE Research Grant; ICT R&D program of MSIT/IITP
ID : 10067764; 2016-0-00098

Références

IEEE Trans Image Process. 2004 Apr;13(4):600-12
pubmed: 15376593
IEEE Trans Pattern Anal Mach Intell. 2008 Feb;30(2):328-41
pubmed: 18084062

Auteurs

Ji-Hun Mun (JH)

School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Korea. jhm@gist.ac.kr.

Moongu Jeon (M)

School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Korea. mgjeon@gist.ac.kr.

Byung-Geun Lee (BG)

School of Electrical Engineering and Computer Science, Gwangju Institute of Science and Technology, Gwangju 61005, Korea. bglee@gist.ac.kr.

Classifications MeSH