Error Bound Analysis of Q-Function for Discounted Optimal Control Problems With Policy Iteration----中国科学院自动化研究所

科研成果

论文编号:
第一作者所在部门:
中文论文题目:	Error Bound Analysis of Q-Function for Discounted Optimal Control Problems With Policy Iteration
英文论文题目:	Error Bound Analysis of Q-Function for Discounted Optimal Control Problems With Policy Iteration
论文题目英文:
作者:	Yan, Pengfei
论文出处:
刊物名称:	IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
年:	2017
卷:	47
期:	7
页:	1207-1216
联系作者:
收录类别:
影响因子:
摘要:	In this paper, we present error bound analysis of the Q-function for the action-dependent adaptive dynamic programming for solving discounted optimal control problems of unknown discrete-time nonlinear systems. The convergence of Q-functions derived by a policy iteration algorithm under ideal conditions is given. Considering the approximated errors of the Q-function and control policy in the policy evaluation step and policy improvement step, we establish error bounds of approximate Q-functions in each iteration. With the given boundedness conditions, the approximate Q-function will converge to a finite neighborhood of the optimal Q-function. To implement the presented algorithm, two three-layer neural networks are employed to approximate the Q-function and the control policy, respectively. Finally, a simulation example is utilized to verify the validity of the presented algorithm.
英文摘要:
外单位作者单位:
备注:
关闭窗口