论文 |
郝钢,叶秀芬,陈亭 |
秦伟伟,马建军,郑志强,刘刚 |
李绍勇,王安荣 |
马广富,梅杰 |
韩德强,邓勇,韩崇昭,杨艺,蒋雯,侯志强 |
李仁兵,李艾华,白向峰,蔡艳平,王德生 |
杨斌虎,杨卫东,曲蕾 |
蒲明,吴庆宪,姜长生,程路 |
李辉,阳春华,邓文浪 |
邓亮,陈抱雪,隋国荣,张建彬,王关德 |
杨波,王哲 |
短文 |
李军军,甘世红,许波桅 |
邓勇,王栋,李齐,章雅娟 |
韩敏,梁志平 |
张新建,刘雄伟 |
高宪文,张立,王介生,赵娟平 |
李俊民,王元亮,李新民 |
裴文卉,张承慧,李珂,崔纳新 |
张霞,高岩,夏尊铨 |
王苏滨,张泽焕,汪红宇 |
黄曼磊,宋克明,魏志达 |
李鑫,陈薇,董学平,陈梅,蒋琳 |
Special issue on approximate dynamic programming and reinforcement learning |
Silvia Ferrari,Jagannathan Sarangapani and Frank L. Lewis Editorial: Special issue on approximate dynamic programming and reinforcement learning J. Control Theory and Applications, 2011,9(3):309~309 Abstract | PDF |
Dimitri P. BERTSEKAS Approximate policy iteration: a survey and some new methods J. Control Theory and Applications, 2011,9(3):310~335 Abstract | PDF |
Warren B. POWELL and Jun MA A review of stochastic algorithms with continuous value function approximation and some new approximate policy iteration algorithms for multidimensional continuous applications J. Control Theory and Applications, 2011,9(3):336~352 Abstract | PDF |
Draguna VRABIE and Frank LEWIS Adaptive dynamic programming for online solution of a zero-sum differential game J. Control Theory and Applications, 2011,9(3):353~360 Abstract | PDF |
Jie DING and S. N. BALAKRISHNAN Approximate dynamic programming solutions with a single network adaptive critic for a class of nonlinear systems J. Control Theory and Applications, 2011,9(3):370~380 Abstract | PDF |
Qinglai WEI and Derong LIU Finite horizon optimal control of discrete-time nonlinear systems with unfixed initial state using adaptive dynamic programming J. Control Theory and Applications, 2011,9(3):381~390 Abstract | PDF |
Greg FODERARO,Vikram RAJU and Silvia FERRARI A model-based approximate λ-policy iteration approach to online evasive path planning and the video game Ms. Pac-Man J. Control Theory and Applications, 2011,9(3):391~399 Abstract | PDF |
Shubhendu BHASIN,Nitin SHARMA,Parag PATRE and Warren DIXON Asymptotic tracking by a reinforcement learning-based adaptive critic controller J. Control Theory and Applications, 2011,9(3):400~409 Abstract | PDF |
James Nate KNIGHT and Charles ANDERSON Stable reinforcement learning with recurrent neural networks J. Control Theory and Applications, 2011,9(3):410~420 Abstract | PDF |
Ketaki KULKARNI,Abhijit GOSAVI,Susan MURRAY and Katie GRANTHAM Semi-Markov adaptive critic heuristics with application to airline revenue management J. Control Theory and Applications, 2011,9(3):421~430 Abstract | PDF |
Amanda LAMPTON,John VALASEK and Mrinal KUMAR Multiresolution state-space discretization for Q-Learning with pseudorandomized discretization J. Control Theory and Applications, 2011,9(3):431~439 Abstract | PDF |
Xueqing SUN,Tao MAO,Laura RAY,Dongqing SHI and Jerald KRALIK Hierarchical state-abstracted and socially augmented Q-Learning for reducing complexity in agent-based learning J. Control Theory and Applications, 2011,9(3):440~450 Abstract | PDF |
Mingyuan ZHONG and Emanuel TODOROV Moving least-squares approximations for linearly-solvable stochastic optimal control problems J. Control Theory and Applications, 2011,9(3):451~463 Abstract | PDF |
Travis DIERKS and Sarangapani JAGANNATHAN Online optimal control of nonlinear discrete-time systems using approximate dynamic programming J. Control Theory and Applications, 2011,9(3):361~369 Abstract | PDF |
长论文 |
柴天佑 张亚军 基于未建模动态补偿的非线性自适应切换控制方法 自动化学报, 2011 37 (7): 773-786 Abstract | PDF |
傅向华 李坚强 王志强 杜文峰 基于Nyström 低阶近似的半监督流形排序图像检索 自动化学报, 2011 37 (7): 787-793 Abstract | PDF |
汤影 李建平 吴淮 一种分离低维信号的ICA快速算法 自动化学报, 2011 37 (7): 794-799 Abstract | PDF |
论文与报告 |
鞠明 李成 高山 穆举国 毕笃彦 基于向心自动波交叉皮质模型的非均匀光照图像增强 自动化学报, 2011 37 (7): 800-810 Abstract | PDF |
程光权 张继东 成礼智 黄金才 刘忠 基于几何结构失真模型的图像质量评价研究 自动化学报, 2011 37 (7): 811-819 Abstract | PDF |
何楚 刘明 冯倩 邓新萍 基于多尺度压缩感知金字塔的极化干涉SAR图像分类 自动化学报, 2011 37 (7): 820-827 Abstract | PDF |
刘胜蓝 闫德勤 一种新的全局嵌入降维算法 自动化学报, 2011 37 (7): 828-835 Abstract | PDF |
付世昌 董一鸿 唐燕琳 基于事件的位置不确定移动对象连续概率Skyline查询 自动化学报, 2011 37 (7): 836-848 Abstract | PDF |
何亮 史永哲 刘加 联合因子分析中的本征信道空间拼接方法 自动化学报, 2011 37 (7): 849-856 Abstract | PDF |
杨芳 王朝立 基于视觉伺服反馈的不确定非完整动态移动机器人的自适应镇定 自动化学报, 2011 37 (7): 857-864 Abstract | PDF |
苏兆品 蒋建国 梁昌勇 张国富 一种基于P学习的分布式并行多任务分配算法 自动化学报, 2011 37 (7): 865-872 Abstract | PDF |
金弟 刘杰 杨博 何东晓 刘大有 局部搜索与遗传算法结合的大规模复杂网络社区探测 自动化学报, 2011 37 (7): 873-882 Abstract | PDF |
焦慧敏 王党校 张玉茹 方磊 基于书写摩擦力的签名识别方法 自动化学报, 2011 37 (7): 883-890 Abstract | PDF |
张亮 张亮 黄曙光 石昭祥 一种基于拒识的高可靠性CAPTCHA识别算法 自动化学报, 2011 37 (7): 891-900 Abstract | PDF |