Novel Method of Semantic Segmentation Applicable to Augmented Reality.

atrous pyramid pooling module augmented reality backpropagation convolutional neural network fully convolutional network modified dilated residual network semantic segmentation

Journal

Sensors (Basel, Switzerland)
ISSN: 1424-8220
Titre abrégé: Sensors (Basel)
Pays: Switzerland
ID NLM: 101204366

Informations de publication

Date de publication:
20 Mar 2020
Historique:
received: 25 02 2020
revised: 17 03 2020
accepted: 19 03 2020
entrez: 5 4 2020
pubmed: 5 4 2020
medline: 5 4 2020
Statut: epublish

Résumé

This paper proposes a novel method of semantic segmentation, consisting of modified dilated residual network, atrous pyramid pooling module, and backpropagation, that is applicable to augmented reality (AR). In the proposed method, the modified dilated residual network extracts a feature map from the original images and maintains spatial information. The atrous pyramid pooling module places convolutions in parallel and layers feature maps in a pyramid shape to extract objects occupying small areas in the image; these are converted into one channel using a 1 × 1 convolution. Backpropagation compares the semantic segmentation obtained through convolution from the final feature map with the ground truth provided by a database. Losses can be reduced by applying backpropagation to the modified dilated residual network to change the weighting. The proposed method was compared with other methods on the Cityscapes and PASCAL VOC 2012 databases. The proposed method achieved accuracies of 82.8 and 89.8 mean intersection over union (mIOU) and frame rates of 61 and 64.3 frames per second (fps) for the Cityscapes and PASCAL VOC 2012 databases, respectively. These results prove the applicability of the proposed method for implementing natural AR applications at actual speeds because the frame rate is greater than 60 fps.

Identifiants

pubmed: 32245002
pii: s20061737
doi: 10.3390/s20061737
pmc: PMC7146136
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Références

IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495
pubmed: 28060704
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848
pubmed: 28463186
IEEE Trans Image Process. 2019 Jan 25;:
pubmed: 30703024

Auteurs

Tae-Young Ko (TY)

Department of Electronic Engineering, Hanbat National University, Daejeon 34158, Korea.

Seung-Ho Lee (SH)

Department of Electronics & Control Engineering, Hanbat National University, Daejeon 34158, Korea.

Classifications MeSH