Talk by Prof. Zhouchen Lin (PKU)
Title of the talk: Advances on the Training Algorithms for Large Models
Speaker: Zhouchen Lin (Peking University), https://zhouchenlin.github.io/
Date and time: 2025/Aug 6th, 10:30-11:30 (JST)
Venue: Hybrid (Zoom / RIKEN AIP Nihonbashi Office Meeting Room C)
*RIKEN AIP Nihonbashi office is only for AIP members
Abstract: Artificial intelligence has entered the era of big models, and training costs have surged. Finding more efficient training algorithms is of great significance for industrial applications. This report will report on two new developments in my research on large model training algorithms, including Adan algorithm and gradient memory reuse technology.
Bio: Zhouchen Lin received the Ph.D. degree in applied mathematics from Peking University in 2000. He is currently a Professor with the State Key Laboratory of General Artificial Intelligence, School of Intelligence Science and Technology, Peking University. His research interests include machine learning and numerical optimization. He has published over 340 technical papers and 5 monographs, receiving over 40,000 Google Scholar citations. He has been Area Chairs and Senior Area Chairs of ACML, ACCV, CVPR, ICCV, NIPS/NeurIPS, AAAI, IJCAI, ICLR, and ICML for many times. He is currently a Board Director of ICML. He was an Associate Editor of the IEEE Transactions on Pattern Analysis and Machine Intelligence and currently is an associate editor of the International Journal of Computer Vision and Optimization Methods and Software. He is a Fellow of the IAPR, the IEEE, the AAIA and the CSIG.
Language: English
Registration required: yes