CVCC Model: Learning-Based Computer Vision Color Constancy with RiR-DSN Architecture.
RiR-DSN architecture
computer vision color constancy
illumination estimation
scene illuminant color
Journal
Sensors (Basel, Switzerland)
ISSN: 1424-8220
Titre abrégé: Sensors (Basel)
Pays: Switzerland
ID NLM: 101204366
Informations de publication
Date de publication:
05 Jun 2023
05 Jun 2023
Historique:
received:
05
04
2023
revised:
26
05
2023
accepted:
01
06
2023
medline:
12
6
2023
pubmed:
10
6
2023
entrez:
10
6
2023
Statut:
epublish
Résumé
To achieve computer vision color constancy (CVCC), it is vital but challenging to estimate scene illumination from a digital image, which distorts the true color of an object. Estimating illumination as accurately as possible is fundamental to improving the quality of the image processing pipeline. CVCC has a long history of research and has significantly advanced, but it has yet to overcome some limitations such as algorithm failure or accuracy decreasing under unusual circumstances. To cope with some of the bottlenecks, this article presents a novel CVCC approach that introduces a residual-in-residual dense selective kernel network (RiR-DSN). As its name implies, it has a residual network in a residual network (RiR) and the RiR houses a dense selective kernel network (DSN). A DSN is composed of selective kernel convolutional blocks (SKCBs). The SKCBs, or neurons herein, are interconnected in a feed-forward fashion. Every neuron receives input from all its preceding neurons and feeds the feature maps into all its subsequent neurons, which is how information flows in the proposed architecture. In addition, the architecture has incorporated a dynamic selection mechanism into each neuron to ensure that the neuron can modulate filter kernel sizes depending on varying intensities of stimuli. In a nutshell, the proposed RiR-DSN architecture features neurons called SKCBs and a residual block in a residual block, which brings several benefits such as alleviation of the vanishing gradients, enhancement of feature propagation, promotion of the reuse of features, modulation of receptive filter sizes depending on varying intensities of stimuli, and a dramatic drop in the number of parameters. Experimental results highlight that the RiR-DSN architecture performs well above its state-of-the-art counterparts, as well as proving to be camera- and illuminant-invariant.
Identifiants
pubmed: 37300068
pii: s23115341
doi: 10.3390/s23115341
pmc: PMC10256021
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : NRF
ID : NRF-2019R1I1A3A01061844
Références
IEEE Trans Pattern Anal Mach Intell. 2012 Aug;34(8):1509-19
pubmed: 22745000
IEEE Trans Pattern Anal Mach Intell. 2015 Oct;37(10):1973-85
pubmed: 26353182
IEEE Trans Pattern Anal Mach Intell. 2014 May;36(5):860-73
pubmed: 26353222
Sci Am. 1977 Dec;237(6):108-28
pubmed: 929159
IEEE Trans Pattern Anal Mach Intell. 2012 May;34(5):918-29
pubmed: 22442121
IEEE Trans Image Process. 2000;9(10):1774-83
pubmed: 18262915
IEEE Trans Pattern Anal Mach Intell. 2018 Sep;40(9):2081-2094
pubmed: 28922115
IEEE Trans Pattern Anal Mach Intell. 2011 Apr;33(4):687-98
pubmed: 20421672
IEEE Trans Image Process. 2007 Sep;16(9):2207-14
pubmed: 17784594
J Opt Soc Am A Opt Image Sci Vis. 2004 Mar;21(3):321-34
pubmed: 15005396
IEEE Trans Image Process. 2020 Apr 13;:
pubmed: 32286983
IEEE Trans Image Process. 2016 Mar;25(3):1219-32
pubmed: 26766375