I am a final-year Ph.D. candidate at Shanghai Jiao Tong University (SJTU), where I am fortunately co-advised by Prof. Jiangchao Yao, Prof. Ya Zhang and Prof. Yanfeng Wang. I received my bachelor's degree in information engineering from SJTU in 2021.
My research is driven by the goal of making efficient and reliable AI models, including Large Language Models, Diffusion Models, and Multi-modal Models. This technical vision is deeply informed by my practical experience as a research intern at Microsoft Research Asia (MSRA), A*STAR Centre for Frontier AI Research (CFAR), and Meituan Longcat team.
For a complete list of my research, please visit Google Scholar.
(* Equal contribution, † Corresponding author)