学术报告《强化学习概述及其应用范例》

活动时间：活动地点：

报告时间：2017年10月30日（周一）10:00

报告地点：东校区信息馆401学术报告厅

报告题目：《强化学习概述及其应用范例》

报告人简介：黄国胜，美国西北大学计算机工程博士，国立中山大学电机工程学系教授，台湾系统学会理事、台湾机器人学会理事、台湾自动控制学会理事、IEEE Trans. on Cybernetics, Associate Editor及IEEE Trans. on Mechatronics, Technical Editor。研究领域：智能机器人及智能系统。在IEEE Transactions FS, IE, II, SMC-A, B ,NNLT等国际期刊发表论文70余篇。

内容简介：强化学习是机器学习中的一个领域，强调如何基于环境而行动，以取得最大化的预期利益。其灵感来源于心理学中的行为主义理论，即有机体如何在环境给予的奖励或惩罚的刺激下，逐步形成对刺激的预期，产生能获得最大利益的习惯性行为。这个方法具有普适性，因此在其他许多领域都有研究，例如博弈论、控制论、运筹学、信息论、仿真优化、多主体系统学习、群体智能、统计学以及遗传算法。强化学习和标准的监督式学习之间的区别在于，它并不需要出现正确的输入/输出对，也不需要精确校正次优化的行为。强化学习更加专注于在线规划，需要在探索（在未知的领域）和遵从（现有知识）之间找到平衡。本次报告将简介强化学习的基本概念以及其在报告人的研究团队所应用范例。希望在听讲后能对强化学习有初步理论概念，激发在此研究兴趣。

Reinforcement learning (RL) is an area of machine learning inspired by behaviorist psychology, concerned with how software agents ought to take actions in an environment so as to maximize some notion of cumulative reward. The problem, due to its generality, is studied in many other disciplines, such as game theory, control theory, operations research, information theory, Reinforcement learning differs from standard supervised learning in that correct input/output pairs are never presented, nor sub-optimal actions explicitly corrected. Instead the focus is on on-line performance, which involves finding a balance between exploration (of uncharted territory) and exploitation (of current knowledge).The theme of this speech will deliver a preliminary concept of RL and introduction to its applications implemented by the speaker’s laboratory. It is expected that the audiences can acquire rapidly the insights into the researches on RL and the inspiration as well.

信息科学与工程学院

2017年10月29日

学术活动

学术报告《强化学习概述及其应用范例》

最近更新

“红色旋律”读书会（第89期）：纪念抗...

“周末专家河北行”暨“燕山大讲堂”：...

“红色旋律”讲坛（第192期）：大模型与...

“红色旋律”影苑（第98期）：“捷径”...

高效开展创新性科研——基于Web of Scie...

学术活动

学术报告 《强化学习概述及其应用范例》

最近更新

“红色旋律”读书会（第89期）：纪念抗...

“周末专家河北行”暨“燕山大讲堂”：...

“红色旋律”讲坛（第192期）：大模型与...

“红色旋律”影苑（第98期）：“捷径”...

高效开展创新性科研——基于Web of Scie...

学术报告《强化学习概述及其应用范例》