<P>The complexity of computing Markov perfect equilibrium in general-sum stochastic games<BR>【2023.5.11 10:00am, N204】</P>----中国科学院国家数学与交叉科学中心 (NCMIS)

　　 2023-5-9　

　　Colloquia & Seminars　

Speaker	邓小铁教授，北京大学
Title	The complexity of computing Markov perfect equilibrium in general-sum stochastic games
Time	5月11日10:00
Venue	N204
Abstract	Similar to the role of Markov decision processes in reinforcement learning, Markov games (also known as stochastic games) form the basis for the study of multi-agent reinforcement learning and sequence-agent interaction. We introduce an approximate Markov perfect equilibrium as a computational problem for solving finite-state stochastic games under infinite time discounting, and prove its PPAD completeness. This solution concept preserves the Markovian-perfect property, opening the possibility to extend successful multi-agent reinforcement learning algorithms to multi-agent dynamic games, thus extending the range of PPAD complete classes.
Affiliation	邓小铁教授，北京大学计算机学院前沿计算研究中心讲席教授，于1982年在清华大学获得学士学位，于1984年在中国科学院获得硕士学位，于1989年在斯坦福大学获得博士学位。2017年12月他入职北京大学，任计算机学院前沿计算研究中心讲席教授。他曾任教于上海交通大学、利物浦大学、香港城市大学和约克大学。在此之前，他还是西蒙菲莎大学的 NSERC 国际研究员。邓小铁教授的主要科研方向为算法博弈论、区块链、互联网经济、在线算法及并行计算。2008年，他因在算法博弈论领域的贡献当选 ACM Fellow；2019年，因在不完全信息计算和交互环境计算领域的贡献当选 IEEE fellow；2020年当选欧洲科学院外籍院士；2021年当选中国工业与应用数学学会会士（CSIAM Fellow）；2021年被任命为博弈论学会（GTS）理事；2021年被聘为中国运筹学会博弈论分会荣誉理事；2021年获得 CCF 人工智能学会多智能体与多智能体系统研究成就奖；2022年获得 ACM 计算经济学的“时间检验奖”（Test of Time Award）。