| 论文 |
|
郝钢,叶秀芬,陈亭 |
|
秦伟伟,马建军,郑志强,刘刚 |
|
李绍勇,王安荣 |
|
马广富,梅杰 |
|
韩德强,邓勇,韩崇昭,杨艺,蒋雯,侯志强 |
|
李仁兵,李艾华,白向峰,蔡艳平,王德生 |
|
杨斌虎,杨卫东,曲蕾 |
|
蒲明,吴庆宪,姜长生,程路 |
|
李辉,阳春华,邓文浪 |
|
邓亮,陈抱雪,隋国荣,张建彬,王关德 |
|
杨波,王哲 |
| 短文 |
|
李军军,甘世红,许波桅 |
|
邓勇,王栋,李齐,章雅娟 |
|
韩敏,梁志平 |
|
张新建,刘雄伟 |
|
高宪文,张立,王介生,赵娟平 |
|
李俊民,王元亮,李新民 |
|
裴文卉,张承慧,李珂,崔纳新 |
|
张霞,高岩,夏尊铨 |
|
王苏滨,张泽焕,汪红宇 |
|
黄曼磊,宋克明,魏志达 |
|
李鑫,陈薇,董学平,陈梅,蒋琳 |
| Special issue on approximate dynamic programming and reinforcement learning |
|
Silvia Ferrari,Jagannathan Sarangapani and Frank L. Lewis Editorial: Special issue on approximate dynamic programming and reinforcement learning J. Control Theory and Applications, 2011,9(3):309~309 Abstract | PDF |
|
Dimitri P. BERTSEKAS Approximate policy iteration: a survey and some new methods J. Control Theory and Applications, 2011,9(3):310~335 Abstract | PDF |
|
Warren B. POWELL and Jun MA A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications J. Control Theory and Applications, 2011,9(3):336~352 Abstract | PDF |
|
Draguna VRABIE and Frank LEWIS Adaptive dynamic programming for online solution of a zero-sum differential game J. Control Theory and Applications, 2011,9(3):353~360 Abstract | PDF |
|
Jie DING and S. N. BALAKRISHNAN Approximate dynamic programming solutions with a single network adaptive critic for a class of nonlinear systems J. Control Theory and Applications, 2011,9(3):370~380 Abstract | PDF |
|
Qinglai WEI and Derong LIU Finite horizon optimal control of discrete-time nonlinear systems with unfixed initial state using adaptive dynamic programming J. Control Theory and Applications, 2011,9(3):381~390 Abstract | PDF |
|
Greg FODERARO,Vikram RAJU and Silvia FERRARI A model-based approximate λ-policy iteration approach to online evasive path planning and the video game Ms. Pac-Man J. Control Theory and Applications, 2011,9(3):391~399 Abstract | PDF |
|
Shubhendu BHASIN,Nitin SHARMA,Parag PATRE and Warren DIXON Asymptotic tracking by a reinforcement learning-based adaptive critic controller J. Control Theory and Applications, 2011,9(3):400~409 Abstract | PDF |
|
James Nate KNIGHT and Charles ANDERSON Stable reinforcement learning with recurrent neural networks J. Control Theory and Applications, 2011,9(3):410~420 Abstract | PDF |
|
Ketaki KULKARNI,Abhijit GOSAVI,Susan MURRAY and Katie GRANTHAM Semi-Markov adaptive critic heuristics with application to airline revenue management J. Control Theory and Applications, 2011,9(3):421~430 Abstract | PDF |
|
Amanda LAMPTON,John VALASEK and Mrinal KUMAR Multiresolution state-space discretization for Q-Learning with pseudorandomized discretization J. Control Theory and Applications, 2011,9(3):431~439 Abstract | PDF |
|
Xueqing SUN,Tao MAO,Laura RAY,Dongqing SHI and Jerald KRALIK Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning J. Control Theory and Applications, 2011,9(3):440~450 Abstract | PDF |
|
Mingyuan ZHONG and Emanuel TODOROV Moving least-squares approximations for linearly-solvable stochastic optimal control problems J. Control Theory and Applications, 2011,9(3):451~463 Abstract | PDF |
|
Travis DIERKS and Sarangapani JAGANNATHAN Online optimal control of nonlinear discrete-time systems using approximate dynamic programming J. Control Theory and Applications, 2011,9(3):361~369 Abstract | PDF |
| 长论文 |
| 柴天佑 张亚军 基于未建模动态补偿的非线性自适应切换控制方法 自动化学报, 2011 37 (7): 773-786 Abstract | PDF |
| 傅向华 李坚强 王志强 杜文峰 基于Nyström 低阶近似的半监督流形排序图像检索 自动化学报, 2011 37 (7): 787-793 Abstract | PDF |
| 汤影 李建平 吴淮 一种分离低维信号的ICA快速算法 自动化学报, 2011 37 (7): 794-799 Abstract | PDF |
| 论文与报告 |
|
鞠明 李成 高山 穆举国 毕笃彦 基于向心自动波交叉皮质模型的非均匀光照图像增强 自动化学报, 2011 37 (7): 800-810 Abstract | PDF |
|
程光权 张继东 成礼智 黄金才 刘忠 基于几何结构失真模型的图像质量评价研究 自动化学报, 2011 37 (7): 811-819 Abstract | PDF |
|
何楚 刘明 冯倩 邓新萍 基于多尺度压缩感知金字塔的极化干涉SAR图像分类 自动化学报, 2011 37 (7): 820-827 Abstract | PDF |
|
刘胜蓝 闫德勤 一种新的全局嵌入降维算法 自动化学报, 2011 37 (7): 828-835 Abstract | PDF |
|
付世昌 董一鸿 唐燕琳 基于事件的位置不确定移动对象连续概率Skyline查询 自动化学报, 2011 37 (7): 836-848 Abstract | PDF |
|
何亮 史永哲 刘加 联合因子分析中的本征信道空间拼接方法 自动化学报, 2011 37 (7): 849-856 Abstract | PDF |
|
杨芳 王朝立 基于视觉伺服反馈的不确定非完整动态移动机器人的自适应镇定 自动化学报, 2011 37 (7): 857-864 Abstract | PDF |
|
苏兆品 蒋建国 梁昌勇 张国富 一种基于P学习的分布式并行多任务分配算法 自动化学报, 2011 37 (7): 865-872 Abstract | PDF |
|
金弟 刘杰 杨博 何东晓 刘大有 局部搜索与遗传算法结合的大规模复杂网络社区探测 自动化学报, 2011 37 (7): 873-882 Abstract | PDF |
|
焦慧敏 王党校 张玉茹 方磊 基于书写摩擦力的签名识别方法 自动化学报, 2011 37 (7): 883-890 Abstract | PDF |
|
张亮 张亮 黄曙光 石昭祥 一种基于拒识的高可靠性CAPTCHA识别算法 自动化学报, 2011 37 (7): 891-900 Abstract | PDF |