Publications

You can also find my articles on my Google Scholar profile.

HelpSteer2-Preference: Complementing Ratings with Preferences
Zhilin Wang, Alexander Bukharin, Olivier Delalleau, Daniel Egert, Gerald Shen, Jiaqi Zeng, Oleksii Kuchaiev, Yi Dong
Submitted

Deep Reinforcement Learning from Hierarchical Weak Preference Feedback
Alexander Bukharin, Yixiao Li, Pengcheng He, Weizhu Chen, Tuo Zhao
Submitted

Robust Reinforcement Learning from Corrupted Human Feedback
Alexander Bukharin, Ilgee Hong, Haoming Jiang, Zichong Li, Qingru Zhang, Zixuan Zhang, Tuo Zhao
NeurIPS 2024

Data Diversity Matters for Robust Instruction Tuning
Alexander Bukharin, Tuo Zhao
EMNLP 2024

Adaptive Preference Scaling for Reinforcement Learning with Human Feedback
Ilgee Hong, Zichong Li, Alexander Bukharin, Yixiao Li, Haoming Jiang, Tianbao Yang, Tuo Zhao
NeurIPS 2024

RNR: Teaching Large Language Models to Follow Roles and Rules
Alexander Bukharin*, Kuan Wang*, Haoming Jiang, Qingyu Yin, Zhengyang Wang, Tuo Zhao, Jingbo Shang, Chao Zhang, Bing Yin, Xian Li, Jianshu Chen, Shiyang Li
ICML 2024 Workshop on Foundation Models in the Wild

Robust Multi-Agent Reinforcement Learning via Adversarial Regularization: Theoretical Foundation and Stable Algorithms
Alexander Bukharin, Yue Yu, Qingru Zhang, Zhehui Chen, Simiao Zuo, Yan Li, Chao Zhang, Tuo Zhao
NeurIPS 2023

Machine Learning Force Fields with Data Cost Aware Training
Alexander Bukharin, Tianyi Liu, Shengjie Wang, Simiao Zuo, Weihao Gao, Wen Yan, Tuo Zhao
International Conference on Machine Learning, 2023

Ambient Noise-based Weakly Supervised Manhole Localization Methods over Deployed Fiber Networks
Alexander Bukharin, Shaobo Han, Yuheng Chen, Ming-Fang Huang, Yue-Kai Huang, Yao Xie, Ting Wang
Optics Express, March 2023

Adaptive Budget Allocation for Parameter-Efficient Fine-Tuning
Qingru Zhang, Minshuo Chen, Alexander Bukharin, Pengcheng He, Yu Cheng, Weizhu Chen, Tuo Zhao
International Conference on Learning Representations, 2023

PLATON: Pruning Large Transformer Models with Upper Confidence Bound of Weight Importance
Qingru Zhang, Simiao Zuo, Chen Liang, Alexander Bukharin, Pengcheng He, Weizhu Chen, Tuo Zhao
International Conference on Machine Learning, 2022

Early Detection of COVID-19 Hotspots Using Spatio-Temporal Data
Shixiang Zhu, Alexander Bukharin, Liyan Xie, Khurram Yamin, Shihao Yang, Pinar Keskinocak, and Yao Xie
IEEE Journal of Selected Topics in Signal Processing, 2022

High-resolution Spatio-temporal Model for County-level COVID-19 Activity in the US
Shixiang Zhu, Alexander Bukharin, Liyan Xie, Mauricio Santillana, Shihao Yang, and Yao Xie
ACM Transactions on Management Information Systems (TMIS), 2021

Data-Driven Optimization for Police Beat Design in South Fulton, Georgia
Shixiang Zhu, Alexander Bukharin, Le Lu, He Wang, and Yao Xie
KDD Workshop on Data Science for Social Good, 2021

Five-Year Project-Level Statewide Pavement Performance Forecasting Using a Two-Stage Machine Learning Approach Based on Long Short-Term Memory
Alexander Bukharin, Zhongyu Yang, and Yichang (James) Tsai
Transportation Research Record, 2021