Hi! I’m Kaixuan Ji, a third-year Ph.D. student in Computer Science at UCLA, fortunately advised by Professor Quanquan Gu. Before coming to UCLA, I completed my undergraduate studies in Department of Computer Science and Technology at Tsinghua University, where I was fortunate to work with Professors Jie Tang and Juanzi Li. My current research explores reinforcement learning theory and its role in training large language models.
Near-Optimal Regret for KL-Regularized Multi-Armed Bandits
Kaixuan Ji*, Qingyue Zhao*, Heyang Zhao*, Qiwei Di, Quanquan Gu, ICML 2026