Hi! I’m Kaixuan Ji, a third-year Ph.D. student in Computer Science at UCLA, fortunately advised by Professor Quanquan Gu. Before coming to UCLA, I completed my undergraduate studies in Department of Computer Science and Technology at Tsinghua University, where I was fortunate to work with Professors Jie Tang and Juanzi Li. My current research explores reinforcement learning theory and its role in training large language models. Here is my latest CV.
Near-Optimal Regret for KL-Regularized Multi-Armed Bandits
Kaixuan Ji*, Qingyue Zhao*, Heyang Zhao*, Qiwei Di, Quanquan Gu, ICML 2026