About me

I am a research scientist at NVIDIA, where I am working on reinforcement learning and language models. Previously, I recieved a Ph.D. in Machine Learning from Georgia Institute of Technology working with Prof. Tuo Zhao on reward design and robustness in reinforcement learning. Currently I am focused on aligning language models with human values and scaling RL training. For my most recent work, see my google scholar page.

Alex Bukharin