您的位置: 专家智库 > >

国家自然科学基金(s61304079)

作品数:4 被引量:6H指数:1
发文基金:国家自然科学基金中国博士后科学基金北京市自然科学基金更多>>
相关领域:理学自动化与计算机技术更多>>

文献类型

  • 3篇中文期刊文章

领域

  • 3篇理学

主题

  • 2篇CHAOTI...
  • 2篇CHAOTI...
  • 2篇CONTIN...
  • 1篇OPTIMA...
  • 1篇REINFO...
  • 1篇APPROA...
  • 1篇CLASS
  • 1篇DYNAMI...
  • 1篇INTEGR...
  • 1篇METHOD
  • 1篇ZERO-S...
  • 1篇SYNCHR...
  • 1篇UNKNOW...
  • 1篇ONLINE

传媒

  • 3篇Chines...

年份

  • 1篇2017
  • 1篇2015
  • 1篇2014
4 条 记 录,以下是 1-3
排序方式:
Chaotic system optimal tracking using data-based synchronous method with unknown dynamics and disturbances
2017年
We develop an optimal tracking control method for chaotic system with unknown dynamics and disturbances. The method allows the optimal cost function and the corresponding tracking control to update synchronously. According to the tracking error and the reference dynamics, the augmented system is constructed. Then the optimal tracking control problem is defined. The policy iteration (PI) is introduced to solve the rain-max optimization problem. The off-policy adaptive dynamic programming (ADP) algorithm is then proposed to find the solution of the tracking Hamilton-Jacobi- Isaacs (HJI) equation online only using measured data and without any knowledge about the system dynamics. Critic neural network (CNN), action neural network (ANN), and disturbance neural network (DNN) are used to approximate the cost function, control, and disturbance. The weights of these networks compose the augmented weight matrix, and the uniformly ultimately bounded (UUB) of which is proven. The convergence of the tracking error system is also proven. Two examples are given to show the effectiveness of the proposed synchronous solution method for the chaotic system tracking problem.
宋睿卓魏庆来
关键词:ZERO-SUM
A new approach of optimal control for a class of continuous-time chaotic systems by an online ADP algorithm
2014年
We develop an online adaptive dynamic programming (ADP) based optimal control scheme for continuous-time chaotic systems. The idea is to use the ADP algorithm to obtain the optimal control input that makes the performance index function reach an optimum. The expression of the performance index function for the chaotic system is first presented. The online ADP algorithm is presented to achieve optimal control. In the ADP structure, neural networks are used to construct a critic network and an action network, which can obtain an approximate performance index function and the control input, respectively. It is proven that the critic parameter error dynamics and the closed-loop chaotic systems are uniformly ultimately bounded exponentially. Our simulation results illustrate the performance of the established optimal control method.
宋睿卓肖文栋魏庆来
Off-policy integral reinforcement learning optimal tracking control for continuous-time chaotic systems
2015年
This paper estimates an off-policy integral reinforcement learning(IRL) algorithm to obtain the optimal tracking control of unknown chaotic systems. Off-policy IRL can learn the solution of the HJB equation from the system data generated by an arbitrary control. Moreover, off-policy IRL can be regarded as a direct learning method, which avoids the identification of system dynamics. In this paper, the performance index function is first given based on the system tracking error and control error. For solving the Hamilton–Jacobi–Bellman(HJB) equation, an off-policy IRL algorithm is proposed.It is proven that the iterative control makes the tracking error system asymptotically stable, and the iterative performance index function is convergent. Simulation study demonstrates the effectiveness of the developed tracking control method.
魏庆来宋睿卓孙秋野肖文栋
共1页<1>
聚类工具0