Deep Reinforcement Learning Based Decision Making for Complex Jamming Waveforms.

Wolpertinger architecture cognitive radio deep reinforcement learning intelligent jamming soft actor-critic

Journal

Entropy (Basel, Switzerland)
ISSN: 1099-4300
Titre abrégé: Entropy (Basel)
Pays: Switzerland
ID NLM: 101243874

Informations de publication

Date de publication:
10 Oct 2022
Historique:
received: 17 09 2022
revised: 04 10 2022
accepted: 07 10 2022
medline: 8 7 2023
pubmed: 8 7 2023
entrez: 8 7 2023
Statut: epublish

Résumé

With the development of artificial intelligence, intelligent communication jamming decision making is an important research direction of cognitive electronic warfare. In this paper, we consider a complex intelligent jamming decision scenario in which both communication parties choose to adjust physical layer parameters to avoid jamming in a non-cooperative scenario and the jammer achieves accurate jamming by interacting with the environment. However, when the situation becomes complex and large in number, traditional reinforcement learning suffers from the problems of failure to converge and a high number of interactions, which are fatal and unrealistic in a real warfare environment. To solve this problem, we propose a deep reinforcement learning based and maximum-entropy-based soft actor-critic (SAC) algorithm. In the proposed algorithm, we add an improved Wolpertinger architecture to the original SAC algorithm in order to reduce the number of interactions and improve the accuracy of the algorithm. The results show that the proposed algorithm shows excellent performance in various scenarios of jamming and achieves accurate, fast, and continuous jamming for both sides of the communication.

Identifiants

pubmed: 37420461
pii: e24101441
doi: 10.3390/e24101441
pmc: PMC9601320
pii:
doi:

Types de publication

Journal Article

Langues

eng

Déclaration de conflit d'intérêts

The authors declare no conflict of interest.

Références

Nature. 2015 Feb 26;518(7540):529-33
pubmed: 25719670
Entropy (Basel). 2020 May 27;22(6):
pubmed: 33286368

Auteurs

Yuting Xu (Y)

Key Laboratory of RF Circuits and Systems, Ministry of Education, Hangzhou Dianzi University, Hangzhou 310018, China.

Chao Wang (C)

Key Laboratory of RF Circuits and Systems, Ministry of Education, Hangzhou Dianzi University, Hangzhou 310018, China.

Jiakai Liang (J)

Key Laboratory of RF Circuits and Systems, Ministry of Education, Hangzhou Dianzi University, Hangzhou 310018, China.

Keqiang Yue (K)

Key Laboratory of RF Circuits and Systems, Ministry of Education, Hangzhou Dianzi University, Hangzhou 310018, China.
Science and Technology on Communication Information Security Control Laboratory, The No. 011 Research Center, Jiaxing 314033, China.

Wenjun Li (W)

Key Laboratory of RF Circuits and Systems, Ministry of Education, Hangzhou Dianzi University, Hangzhou 310018, China.

Shilian Zheng (S)

Science and Technology on Communication Information Security Control Laboratory, The No. 011 Research Center, Jiaxing 314033, China.

Zhijin Zhao (Z)

The School of Communication Engineering, Hangzhou Dianzi University, Hangzhou 310018, China.

Classifications MeSH