Research Interests
I have a broad interest in trustworthy AI. Specifically, I am passionate about exploring the vulnerabilities of foundation models, ensuring their reliability and robustness, and evaluating their real-world abilities.
Recently, I have also become interested in reinforcement learning with verifiable rewards and in reasoning.
Selected Publications & Manuscripts (* denotes equal contribution)
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
Junyu Zhang*, Runpei Dong*, Han Wang, Xuying Ning, Haoran Geng, Peihao Li, Xialin He, Yutong Bai, Jitendra Malik, Saurabh Gupta, Huan Zhang
The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination
Yifan Sun*, Han Wang*, Dongbai Li*, Gang Wang, Huan Zhang
Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks
Han Wang, Gang Wang, Huan Zhang
ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation
Jingnan Zheng*, Han Wang*, An Zhang, Tai D. Nguyen, Jun Sun, Tat-Seng Chua
Education
University of Illinois Urbana-Champaign, IL
Ph.D. • Aug. 2024 to Present
Zhejiang University, China
B.Eng. • Aug. 2020 to June 2024
Services
Conference Reviewer: NeurIPS 2025, ACM CCS AISec Workshop 2025
Journal Reviewer: IEEE TNNLS 2025
Website source from Jon Barron.
Last Updated: June 8th, 2025.