Research Interests
I have a broad interest in AI safety. Specifically, I am passionate about exploring the vulnerability of foundation models and ensuring their reliability and robustness.
Selected Publications & Manuscripts (* denotes the equal contribution)
|
The Emperor's New Clothes in Benchmarking? A Rigorous Examination of Mitigation Strategies for LLM Benchmark Data Contamination
Yifan Sun*, Han Wang*, Dongbai Li*, Gang Wang, Huan Zhang
|
|
Steering Away from Harm: An Adaptive Approach to Defending Vision Language Model Against Jailbreaks
Han Wang, Gang Wang, Huan Zhang
|
|
ALI-Agent: Assessing LLMs' Alignment with Human Values via Agent-based Evaluation
Jingnan Zheng*, Han Wang*, An Zhang, Tai D. Nguyen, Jun Sun, Tat-Seng Chua
|
Education
University of Illinois Urbana-Champaign, IL
Ph.D. • Aug. 2024 to Present
|
|
|
Zhejiang University, China
B.Eng. • Aug. 2020 to June 2024
|
|
|
Services
Conference Reviewer: NeurIPS 2025
Journay Reviewer: IEEE TNNLS
Website source from Jon Barron.
Last Updated: May 11th, 2025.
|
|