Robust Generalization Requires Exponentially Large Models [2023.5.12 3:00pm, N109]
National Center for Mathematics and Interdisciplinary Sciences (NCMIS), Chinese Academy of Sciences

2023-5-4

Colloquia & Seminars

Speaker	Prof. Liwei Wang, School of Intelligence Science and Technology, Peking University
Title	Robust Generalization Requires Exponentially Large Models
Time	May 12, 2023, 15:00
Venue	N109
Abstract	It is well known that modern neural networks are vulnerable to adversarial examples. To mitigate this problem, a series of robust learning algorithms have been proposed. However, although some methods can drive the robust training error to near zero, all existing algorithms lead to a high robust generalization error. In this talk, I will provide a theoretical understanding of this puzzling phenomenon from the perspective of the expressive power of deep neural networks. Specifically, for binary classification problems with well-separated data, we show that, for ReLU networks, while mild over-parameterization is sufficient for high robust training accuracy, there exists a constant robust generalization gap unless the size of the neural network is exponential in the data dimension d. This result holds even if the data is linearly separable.
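Affiliation	Liwei Wang is a professor at the School of Intelligence Science and Technology, Peking University. He has long worked on machine learning research and has obtained a series of results in machine learning theory. He has published more than 200 papers in leading international machine learning journals and conferences, and serves on the editorial board of TPAMI, a leading journal in artificial intelligence. He received the ICLR 2023 Outstanding Paper Award. He was named to AI's 10 to Watch, the first Asian scholar to receive this honor since the award was established.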