About me
I am a fourth year Ph.D. student in Machine Learning at Georgia Institute of Technology. My advisor is Prof. Tuo Zhao. My C.V. can be found here.
My current research is focused on training large language models with reinforcement learning. Most recently I have been working on using RL to improve language model’s capabilities in mathematical reasoning, code generation, and instruction following. Earlier in my PhD I worked on parameter efficient LLM training, machine learning force fields, and robust multi-agent reinforcement learning.