About Me

I am currently a machine learning researcher at the model alignment team at NVIDIA. Prior to NVIDIA, I worked on recommendation systems at Amazon Advertising team.

I obtained my PhD degree at Department of Computer Science, University of Toronto, supervised by Roger Grosse. Previously, I did my undergraduate at Department of Electronic Engineering, Tsinghua University.

I am generally interested in all approaches to building strong AI models that acts intelligently, solves problems, and aids human. I am a believer of Andrew Ng’s vision that “AI is the new electricity”. I conduct research to realize this goal and enable AI for widerange positive impacts.

Specifically, my research at NVIDIA focuses on improving large language model performances in the post-training stage. My research involves model alignment algorithms, synthetic data generation, and long-horizon reason processes.

NEWS

  • 2024.12, my research regarding “Knowledge Distillation in Model Alignment” goes out in NeMo-Aligner and NVIDIA Technical Blog.
  • 2024.06, our model alignment toolcase, NeMo-Aligner, is accepted in COLM 2024.
  • 2024.04, our model Nemotron-4-340B-Instruct (me as the leading author) is realeased (blog, HF). We also open-sourced our synthetic data generation pipeline to traing the model.