About Me

I am a second-year Ph.D. student in Prof. Mohit Bansal's group (MURGe Lab) at UNC Chapel Hill. Previously, I was a Research Resident under the supervision of Prof. Viet Anh Nguyen at VinAI Research, Vietnam. I received a bachelor's degree in Computer Science from Hanoi University of Science and Technology in 2022.

My research focuses on mechanistic interpretability and inference-time interventions for interpreting and monitoring the behaviors of (multimodal) LLMs. Additionally, I am interested in post-training methods for LLMs, including Reinforcement Learning from Human Feedback (RLHF) and Reinforcement Learning with Verifiable Rewards (RLVR).

🔥 News


๐Ÿ“ Publications

* denotes equal contribution

LASeR: Learning to Adaptively Select Reward Models with Multi-Armed Bandits
Duy Nguyen*, Archiki Prasad*, Elias Stengel-Eskin, Mohit Bansal
NeurIPS 2025 | Conference on Neural Information Processing Systems
Multi-Attribute Steering of Language Models via Targeted Intervention
Duy Nguyen, Archiki Prasad, Elias Stengel-Eskin, Mohit Bansal
ACL 2025 | Association for Computational Linguistics
Distributional Surgery for Language Model Activations
Bao Nguyen, Binh Nguyen, Duy Nguyen, Viet Anh Nguyen
EMNLP 2025 Findings | Conference on Empirical Methods in Natural Language Processing
Cold-start Recommendation by Personalized Embedding Region Elicitation
Hieu Nguyen, Duy Nguyen, Khoa Doan, Viet Anh Nguyen
UAI 2024 | Conference on Uncertainty in Artificial Intelligence
Coverage-Validity-Aware Algorithmic Recourse
Ngoc Bui, Duy Nguyen, Man-Chung Yue, Viet Anh Nguyen
Distributionally Robust Recourse Action
Duy Nguyen, Ngoc Bui, Viet Anh Nguyen
ICLR 2023 | International Conference on Learning Representations
Feasible Recourse Plan via Diverse Interpolation
Duy Nguyen, Ngoc Bui, Viet Anh Nguyen
AISTATS 2023 | International Conference on Artificial Intelligence and Statistics
Robust Bayesian Recourse
Tuan-Duy H. Nguyen, Ngoc Bui, Duy Nguyen, Man-Chung Yue, Viet Anh Nguyen
UAI 2022 | Conference on Uncertainty in Artificial Intelligence
Counterfactual Plans under Distributional Ambiguity
Ngoc Bui, Duy Nguyen, Viet Anh Nguyen
ICLR 2022 | International Conference on Learning Representations

🎖 Honors and Awards

October 2022
Honorable Mention
INFORMS Undergraduate Operations Research Prize
October 2022
Best Thesis Presentation Award
Hanoi University of Science and Technology
September 2019
Excellence Scholarship
Hanoi University of Science and Technology

💻 Experience

  • May 2025 – August 2025
    Applied Scientist Intern
    Amazon Science · USA
  • August 2022 – August 2024
    Research Resident
    VinAI Research · Vietnam