D2D Mobile Relaying Meets NOMA-Part II: A Reinforcement Learning Perspective.
5G/B5G/6G
D2D relaying
NOMA/OMA
Nash equilibrium
biform game
distributed reinforcement learning
self-organized devices
Journal
Sensors (Basel, Switzerland)
ISSN: 1424-8220
Titre abrégé: Sensors (Basel)
Pays: Switzerland
ID NLM: 101204366
Informations de publication
Date de publication:
04 Mar 2021
04 Mar 2021
Historique:
received:
13
01
2021
revised:
28
01
2021
accepted:
15
02
2021
entrez:
3
4
2021
pubmed:
4
4
2021
medline:
4
4
2021
Statut:
epublish
Résumé
Structureless communications such as Device-to-Device (D2D) relaying are undeniably of paramount importance to improving the performance of today's mobile networks. Such a communication paradigm requires a certain level of intelligence at the device level, thereby allowing it to interact with the environment and make proper decisions. However, decentralizing decision-making may induce paradoxical outcomes, resulting in a drop in performance, which sustains the design of self-organizing yet efficient systems. We propose that each device decides either to directly connect to the eNodeB or get access via another device through a D2D link. In the first part of this article, we describe a biform game framework to analyze the proposed self-organized system's performance, under pure and mixed strategies. We use two reinforcement learning (RL) algorithms, enabling devices to self-organize and learn their pure/mixed equilibrium strategies in a fully distributed fashion. Decentralized RL algorithms are shown to play an important role in allowing devices to be self-organized and reach satisfactory performance with incomplete information or even under uncertainties. We point out through a simulation the importance of D2D relaying and assess how our learning schemes perform under slow/fast channel fading.
Identifiants
pubmed: 33806302
pii: s21051755
doi: 10.3390/s21051755
pmc: PMC7961575
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Références
Sensors (Basel). 2020 Apr 21;20(8):
pubmed: 32326250
Sensors (Basel). 2020 Oct 13;20(20):
pubmed: 33066295
Sensors (Basel). 2021 Jan 20;21(3):
pubmed: 33498586