Task-Decoupled Knowledge Transfer for Cross-Modality Object Detection.

cross-modality knowledge transfer task-decoupled pre-training task-relevant hyperparameter evolution

Journal

Entropy (Basel, Switzerland)
ISSN: 1099-4300
Titre abrégé: Entropy (Basel)
Pays: Switzerland
ID NLM: 101243874

Informations de publication

Date de publication:
04 Aug 2023
Historique:
received: 19 04 2023
revised: 27 05 2023
accepted: 02 08 2023
medline: 26 8 2023
pubmed: 26 8 2023
entrez: 26 8 2023
Statut: epublish

Résumé

In harsh weather conditions, the infrared modality can supplement or even replace the visible modality. However, the lack of a large-scale dataset for infrared features hinders the generation of a robust pre-training model. Most existing infrared object-detection algorithms rely on pre-training models from the visible modality, which can accelerate network convergence but also limit performance due to modality differences. In order to provide more reliable feature representation for cross-modality object detection and enhance its performance, this paper investigates the impact of various task-relevant features on cross-modality object detection and proposes a knowledge transfer algorithm based on classification and localization decoupling analysis. A task-decoupled pre-training method is introduced to adjust the attributes of various tasks learned by the pre-training model. For the training phase, a task-relevant hyperparameter evolution method is proposed to increase the network's adaptability to attribute changes in pre-training weights. Our proposed method improves the accuracy of multiple modalities in multiple datasets, with experimental results on the FLIR ADAS dataset reaching a state-of-the-art level and surpassing most multi-spectral object-detection methods.

Identifiants

pubmed: 37628196
pii: e25081166
doi: 10.3390/e25081166
pmc: PMC10453456
pii:
doi:

Types de publication

Journal Article

Langues

eng

Subventions

Organisme : National Natural Science Foundation of China
ID : 62101256
Organisme : China Postdoctoral Science Foundation
ID : 2021M691591
Organisme : Jiangsu Provincial Key Research and Development Program
ID : BE2022391

Références

IEEE Trans Pattern Anal Mach Intell. 2017 Jun;39(6):1137-1149
pubmed: 27295650
Sensors (Basel). 2022 May 11;22(10):
pubmed: 35632059

Auteurs

Chiheng Wei (C)

The School of Electronic and Optical Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.

Lianfa Bai (L)

The School of Electronic and Optical Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.

Xiaoyu Chen (X)

The School of Electronic and Optical Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.

Jing Han (J)

The School of Electronic and Optical Engineering, Nanjing University of Science and Technology, Nanjing 210094, China.

Classifications MeSH