Wall segmentation in 2D images using convolutional neural networks.

ADE20K Encoder-decoder PSPNet Semantic segmentation Wall segmentation

Journal

PeerJ. Computer science
ISSN: 2376-5992
Titre abrégé: PeerJ Comput Sci
Pays: United States
ID NLM: 101660598

Informations de publication

Date de publication:
2023
Historique:
received: 14 12 2022
accepted: 13 08 2023
medline: 9 10 2023
pubmed: 9 10 2023
entrez: 9 10 2023
Statut: epublish

Résumé

Wall segmentation is a special case of semantic segmentation, and the task is to classify each pixel into one of two classes: wall and no-wall. The segmentation model returns a mask showing where objects like windows and furniture are located, as well as walls. This article proposes the module's structure for semantic segmentation of walls in 2D images, which can effectively address the problem of wall segmentation. The proposed model achieved higher accuracy and faster execution than other solutions. An encoder-decoder architecture of the segmentation module was used. Dilated ResNet50/101 network was used as an encoder, representing ResNet50/101 network in which dilated convolutional layers replaced the last convolutional layers. The ADE20K dataset subset containing only interior images, was used for model training, while only its subset was used for model evaluation. Three different approaches to model training were analyzed in the research. On the validation dataset, the best approach based on the proposed structure with the ResNet101 network resulted in an average accuracy at the pixel level of 92.13% and an intersection over union (IoU) of 72.58%. Moreover, all proposed approaches can be applied to recognize other objects in the image to solve specific tasks.

Identifiants

pubmed: 37810356
doi: 10.7717/peerj-cs.1565
pii: cs-1565
pmc: PMC10557507
doi:

Types de publication

Journal Article

Langues

eng

Pagination

e1565

Informations de copyright

© 2023 Bjekic et al.

Déclaration de conflit d'intérêts

Mihailo Bjekic is employed by Everseen, Serbia. Ana Lazovic is employed by Daon, Serbia. The rest of the authors declare that they have no competing interests.

Références

IEEE Trans Image Process. 2019 Sep;28(9):4219-4232
pubmed: 30932837
PeerJ Comput Sci. 2021 Aug 10;7:e629
pubmed: 34458570
PeerJ Comput Sci. 2022 Jan 20;8:e847
pubmed: 35174267
PeerJ Comput Sci. 2022 May 13;8:e962
pubmed: 35634107
IEEE Trans Pattern Anal Mach Intell. 2022 Jul;44(7):3523-3542
pubmed: 33596172
IEEE Trans Pattern Anal Mach Intell. 2017 Dec;39(12):2481-2495
pubmed: 28060704
IEEE Trans Pattern Anal Mach Intell. 2018 Apr;40(4):834-848
pubmed: 28463186
PeerJ Comput Sci. 2021 Sep 17;7:e719
pubmed: 34616895
PeerJ Comput Sci. 2023 Mar 10;9:e1262
pubmed: 37346717

Auteurs

Mihailo Bjekic (M)

Everseen, Belgrade, Serbia.

Ana Lazovic (A)

Daon, Belgrade, Serbia.

Venkatachalam K (V)

Department of Applied Cybernetics, University of Hradec Králové, Faculty of Science, Hradec Králové, Czech Republic.

Nebojsa Bacanin (N)

Department of Informatics and Computing, Singidunum University, Belgrade, Serbia.

Miodrag Zivkovic (M)

Department of Informatics and Computing, Singidunum University, Belgrade, Serbia.

Goran Kvascev (G)

University of Belgrade, School of Electrical Engineering, Belgrade, Serbia.

Bosko Nikolic (B)

University of Belgrade, School of Electrical Engineering, Belgrade, Serbia.

Classifications MeSH