Critic Learning-Based Safe Optimal Control for Nonlinear Systems with Asymmetric Input Constraints and Unmatched Disturbances.

adaptive dynamic programming asymmetric input constraints critic neural network nonlinear systems safety unmatched disturbances

Journal

Entropy (Basel, Switzerland)
ISSN: 1099-4300
Titre abrégé: Entropy (Basel)
Pays: Switzerland
ID NLM: 101243874

Informations de publication

Date de publication:
24 Jul 2023
Historique:
received: 29 05 2023
revised: 01 07 2023
accepted: 07 07 2023
medline: 29 7 2023
pubmed: 29 7 2023
entrez: 29 7 2023
Statut: epublish

Résumé

In this paper, the safe optimal control method for continuous-time (CT) nonlinear safety-critical systems with asymmetric input constraints and unmatched disturbances based on the adaptive dynamic programming (ADP) is investigated. Initially, a new non-quadratic form function is implemented to effectively handle the asymmetric input constraints. Subsequently, the safe optimal control problem is transformed into a two-player zero-sum game (ZSG) problem to suppress the influence of unmatched disturbances, and a new Hamilton-Jacobi-Isaacs (HJI) equation is introduced by integrating the control barrier function (CBF) with the cost function to penalize unsafe behavior. Moreover, a damping factor is embedded in the CBF to balance safety and optimality. To obtain a safe optimal controller, only one critic neural network (CNN) is utilized to tackle the complex HJI equation, leading to a decreased computational load in contrast to the utilization of the conventional actor-critic network. Then, the system state and the parameters of the CNN are uniformly ultimately bounded (UUB) through the application of the Lyapunov stability method. Lastly, two examples are presented to confirm the efficacy of the presented approach.

Identifiants

pubmed: 37510048
pii: e25071101
doi: 10.3390/e25071101
pmc: PMC10378920
pii:
doi:

Types de publication

Journal Article

Langues

eng

Subventions

Organisme : cience and technology research project of the Henan province
ID : 222102240014

Références

Entropy (Basel). 2022 Jun 11;24(6):
pubmed: 35741537
IEEE Trans Neural Netw Learn Syst. 2020 Dec;31(12):5441-5455
pubmed: 32054590
IEEE Trans Neural Netw Learn Syst. 2016 Jan;27(1):165-77
pubmed: 26357411
IEEE Trans Neural Netw Learn Syst. 2018 Apr;29(4):957-969
pubmed: 28141530
IEEE Trans Neural Netw Learn Syst. 2018 Jun;29(6):2042-2062
pubmed: 29771662
IEEE Trans Cybern. 2021 Oct;51(10):4860-4872
pubmed: 32112694
ISA Trans. 2022 Oct;129(Pt B):361-369
pubmed: 35190194
Neural Netw. 2021 Dec;144:101-112
pubmed: 34478940

Auteurs

Chunbin Qin (C)

School of Artificial Intelligence, Henan University, Zhengzhou 450000, China.

Kaijun Jiang (K)

School of Artificial Intelligence, Henan University, Zhengzhou 450000, China.

Jishi Zhang (J)

School of Software, Henan University, Kaifeng 475000, China.

Tianzeng Zhu (T)

School of Artificial Intelligence, Henan University, Zhengzhou 450000, China.

Classifications MeSH