AM-MSFF: A Pest Recognition Network Based on Attention Mechanism and Multi-Scale Feature Fusion.

attention mechanism cross-entropy loss multi-scale feature fusion pest recognition

Journal

Entropy (Basel, Switzerland)
ISSN: 1099-4300
Titre abrégé: Entropy (Basel)
Pays: Switzerland
ID NLM: 101243874

Informations de publication

Date de publication:
20 May 2024
Historique:
received: 18 03 2024
revised: 12 05 2024
accepted: 16 05 2024
medline: 24 5 2024
pubmed: 24 5 2024
entrez: 24 5 2024
Statut: epublish

Résumé

Traditional methods for pest recognition have certain limitations in addressing the challenges posed by diverse pest species, varying sizes, diverse morphologies, and complex field backgrounds, resulting in a lower recognition accuracy. To overcome these limitations, this paper proposes a novel pest recognition method based on attention mechanism and multi-scale feature fusion (AM-MSFF). By combining the advantages of attention mechanism and multi-scale feature fusion, this method significantly improves the accuracy of pest recognition. Firstly, we introduce the relation-aware global attention (RGA) module to adaptively adjust the feature weights of each position, thereby focusing more on the regions relevant to pests and reducing the background interference. Then, we propose the multi-scale feature fusion (MSFF) module to fuse feature maps from different scales, which better captures the subtle differences and the overall shape features in pest images. Moreover, we introduce generalized-mean pooling (GeMP) to more accurately extract feature information from pest images and better distinguish different pest categories. In terms of the loss function, this study proposes an improved focal loss (FL), known as balanced focal loss (BFL), as a replacement for cross-entropy loss. This improvement aims to address the common issue of class imbalance in pest datasets, thereby enhancing the recognition accuracy of pest identification models. To evaluate the performance of the AM-MSFF model, we conduct experiments on two publicly available pest datasets (IP102 and D0). Extensive experiments demonstrate that our proposed AM-MSFF outperforms most state-of-the-art methods. On the IP102 dataset, the accuracy reaches 72.64%, while on the D0 dataset, it reaches 99.05%.

Identifiants

pubmed: 38785680
pii: e26050431
doi: 10.3390/e26050431
pii:
doi:

Types de publication

Journal Article

Langues

eng

Subventions

Organisme : National Key R&D Program of China
ID : 2022ZD0115802
Organisme : Key Research and Development Program of the Autonomous Region
ID : 2022B01008
Organisme : National Natural Science Foundation of China
ID : 62262065
Organisme : Tianshan Elite Science and Technology Innovation Leading Talents Program of the Autonomous Region
ID : 2022TSYCLJ0037

Auteurs

Meng Zhang (M)

School of Computer Science and Technology, Xinjiang University, Urumqi 830017, China.

Wenzhong Yang (W)

School of Computer Science and Technology, Xinjiang University, Urumqi 830017, China.
Xinjiang Key Laboratory of Multilingual Information Technology, Xinjiang University, Urumqi 830017, China.

Danny Chen (D)

School of Computer Science and Technology, Xinjiang University, Urumqi 830017, China.

Chenghao Fu (C)

School of Computer Science and Technology, Xinjiang University, Urumqi 830017, China.

Fuyuan Wei (F)

School of Computer Science and Technology, Xinjiang University, Urumqi 830017, China.

Classifications MeSH