博士学术论坛系列活动

　发布时间：2024年04月24日 12:22　阅读量：

活动时间：2024年4月25日下午1：45

活动地点：学11-306

活动主题：Nature 论文分享

论文题目：Mastering the game of Go with deep neural networks and tree search

论文刊物：Nature

出版时间：2016

报告人：陈哲艺

主要内容：The game of Go has long been viewed as the most challenging of classic games for artificial intelligence owing to its enormous search space and the difficulty of evaluating board positions and moves. Here we introduce a new approach to computer Go that uses ‘value networks’ to evaluate board positions and ‘policy networks’ to select moves. These deep neural networks are trained by a novel combination of supervised learning from human expert games, and reinforcement learning from games of self-play. Without any lookahead search, the neural networks play Go at the level of state-of-the-art Monte Carlo tree search programs that simulate thousands of random games of self-play. We also introduce a new search algorithm that combines Monte Carlo simulation with value and policy networks. Using this search algorithm, our program AlphaGo achieved a 99.8% winning rate against other Go programs and defeated the human European Go champion by 5 games to 0. This is the first time that a computer program has defeated a human professional player in the full-sized game of Go, a feat previously thought to be at least a decade away.

活动介绍该文章的研究背景，分析文章的优点、缺点，并就文章对我们写高水平文章有何帮助提出看法。

欢迎广大师生积极参加！

上一条：安全教育主题讲座——《规范使用电动车，共建平安校园》

下一条：零壹讲坛：第九期案例研究:理论与实践