D2D Mobile Relaying Meets NOMA-Part II: A Reinforcement Learning Perspective.

5G/B5G/6G D2D relaying NOMA/OMA Nash equilibrium biform game distributed reinforcement learning self-organized devices

Journal

Sensors (Basel, Switzerland)
ISSN: 1424-8220
Titre abrégé: Sensors (Basel)
Pays: Switzerland
ID NLM: 101204366

Informations de publication

Date de publication:
04 Mar 2021
Historique:
received: 13 01 2021
revised: 28 01 2021
accepted: 15 02 2021
entrez: 3 4 2021
pubmed: 4 4 2021
medline: 4 4 2021
Statut: epublish

Résumé

Structureless communications such as Device-to-Device (D2D) relaying are undeniably of paramount importance to improving the performance of today's mobile networks. Such a communication paradigm requires a certain level of intelligence at the device level, thereby allowing it to interact with the environment and make proper decisions. However, decentralizing decision-making may induce paradoxical outcomes, resulting in a drop in performance, which sustains the design of self-organizing yet efficient systems. We propose that each device decides either to directly connect to the eNodeB or get access via another device through a D2D link. In the first part of this article, we describe a biform game framework to analyze the proposed self-organized system's performance, under pure and mixed strategies. We use two reinforcement learning (RL) algorithms, enabling devices to self-organize and learn their pure/mixed equilibrium strategies in a fully distributed fashion. Decentralized RL algorithms are shown to play an important role in allowing devices to be self-organized and reach satisfactory performance with incomplete information or even under uncertainties. We point out through a simulation the importance of D2D relaying and assess how our learning schemes perform under slow/fast channel fading.

Identifiants

pubmed: 33806302
pii: s21051755
doi: 10.3390/s21051755
pmc: PMC7961575
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Références

Sensors (Basel). 2020 Apr 21;20(8):
pubmed: 32326250
Sensors (Basel). 2020 Oct 13;20(20):
pubmed: 33066295
Sensors (Basel). 2021 Jan 20;21(3):
pubmed: 33498586

Auteurs

Safaa Driouech (S)

NEST Research Group, LRI Lab., ENSEM, Hassan II University of Casablanca, Casablanca 20000, Morocco.
Laoratoire de Reacherche en Informatique, Sorbonne Université, CNRS, LIP6, F-75005 Paris, France.

Essaid Sabir (E)

NEST Research Group, LRI Lab., ENSEM, Hassan II University of Casablanca, Casablanca 20000, Morocco.
Department of Computer Science, University of Quebec at Montreal, Montreal, QC H2L 2C4, Canada.

Mounir Ghogho (M)

TICLab, International University of Rabat, Rabat 11100, Morocco.

El-Mehdi Amhoud (EM)

School of Computer Science, Mohammed VI Polytechnic University, Ben Guerir 43150, Morocco.

Classifications MeSH