>
    当前位置:首页>科研成果
论文编号:
第一作者所在部门:
中文论文题目: Error Bound Analysis of Q-Function for Discounted Optimal Control Problems With Policy Iteration
英文论文题目: Error Bound Analysis of Q-Function for Discounted Optimal Control Problems With Policy Iteration
论文题目英文:
作者: Yan, Pengfei
论文出处:
刊物名称: IEEE TRANSACTIONS ON SYSTEMS MAN CYBERNETICS-SYSTEMS
年: 2017
卷: 47
期: 7
页: 1207-1216
联系作者:
收录类别:
影响因子:
摘要: In this paper, we present error bound analysis of the Q-function for the action-dependent adaptive dynamic programming for solving discounted optimal control problems of unknown discrete-time nonlinear systems. The convergence of Q-functions derived by a policy iteration algorithm under ideal conditions is given. Considering the approximated errors of the Q-function and control policy in the policy evaluation step and policy improvement step, we establish error bounds of approximate Q-functions in each iteration. With the given boundedness conditions, the approximate Q-function will converge to a finite neighborhood of the optimal Q-function. To implement the presented algorithm, two three-layer neural networks are employed to approximate the Q-function and the control policy, respectively. Finally, a simulation example is utilized to verify the validity of the presented algorithm.
英文摘要:
外单位作者单位:
备注:

关闭窗口