Research
Problems I keep returning to
My work sits between Trustworthy AI,
decentralized financial systems, and
human-centered evaluation. The topic blocks below jump to the publication cards in each area.
01
Trustworthy AI
Jailbreak robustness, AI-powered search safety, privacy risk, covert content, and reliable evaluation for frontier AI systems.
02
DeFi & Blockchain
Secure and efficient decentralized protocols, smart contract analysis, gas optimization, payment channels, and cross-chain financial primitives.
03
Human & Society-Centered AI
LLM evaluation, policy simulation, cognitive assessment, visual assistance, human-agent applications, and applied machine learning.
* denotes equal contribution. A full list is also available on Google Scholar .
Topic 01
Trustworthy AI
Jailbreak robustness, AI-powered search safety, privacy risk, covert content, and reliable evaluation for frontier AI systems.
2026
ICLR 2026
JALMBench: Benchmarking Jailbreak Vulnerabilities in Audio Language Models
Zifan Peng , Yule Liu, Zhen Sun, Mingchen Li, Zeren Luo, Jingyi Zheng, Wenhan Dong, Xinlei He, Xuechao Wang, Yingjie Xue, Shengmin Xu, Xinyi Huang
International Conference on Learning Representations
2025
arXiv 2025
Source Coverage and Citation Bias in LLM-based vs. Traditional Search Engines
Peixian Zhang* , Qiming Ye* , Zifan Peng , Kiran Garimella, Gareth Tyson
arXiv preprint arXiv:2512.09483
2025
arXiv 2025
GRPO Privacy Is at Risk: A Membership Inference Attack Against Reinforcement Learning With Verifiable Rewards
Yule Liu, Heyi Zhang, Jinyi Zheng, Zhen Sun, Zifan Peng , Tianshuo Cong, Yilong Yang, Xinlei He, Zhuo Ma
arXiv preprint arXiv:2511.14045
2025
NeurIPS 2025
CHASM: Unveiling Covert Advertisements on Chinese Social Media
Jingyi Zheng* , Tianyi Hu* , Yule Liu, Zhen Sun, Zongmin Zhang, Wenhan Dong, Zifan Peng , Xinlei He
Conference on Neural Information Processing Systems
2025
USENIX Security 2025
Unsafe LLM-Based Search: Quantitative Analysis and Mitigation of Safety Risks in AI Web Search
Zeren Luo* , Zifan Peng * , Yule Liu, Zhen Sun, Mingchen Li, Jingyi Zheng, Xinlei He
USENIX Security Symposium
2025
arXiv 2025
Thought Manipulation: External Thought Can Be Efficient for Large Reasoning Models
Yule Liu, Jingyi Zheng, Zhen Sun, Zifan Peng , Wenhan Dong, Zeyang Sha, Shiwen Cui, Weiqiang Wang, Xinlei He
arXiv preprint arXiv:2504.13626
Topic 02
DeFi & Blockchain
Secure and efficient decentralized protocols, smart contract analysis, gas optimization, payment channels, and cross-chain financial primitives.
2026
arXiv 2026
TxSum: User-Centered Ethereum Transaction Understanding with Micro-Level Semantic Grounding
Zifan Peng , Jingyi Zheng, Yule Liu, Huaiyu Jia, Qiming Ye, Jingyu Liu, Xufeng Yang, Mingchen Li, Qingyuan Gong, Xuechao Wang, Xinlei He
arXiv preprint arXiv:2512.06933
2025
arXiv 2025
GasAgent: A Multi-Agent Framework for Automated Gas Optimization in Smart Contracts
Jingyi Zheng* , Zifan Peng * , Yule Liu, Junfeng Wang, Yifan Liao, Wenhan Dong, Xinlei He
arXiv preprint arXiv:2507.15761
2025
ICWS 2025
Cross-Chain Options: A Bridgeless, Universal, and Efficient Approach
Zifan Peng , Yingjie Xue, Jingyu Liu
IEEE International Conference on Web Services
2025
TDSC 2025
FairRelay: Fair and Cost-Efficient Peer-to-Peer Content Delivery through Payment Channel Networks
Jingyu Liu, Yingjie Xue, Zifan Peng , Chao Lin, Xinyi Huang
IEEE Transactions on Dependable and Secure Computing
Topic 03
Human & Society-Centered AI
LLM evaluation, policy simulation, cognitive assessment, visual assistance, human-agent applications, and applied machine learning.
2026
JASSS 2026
GPLab: A Generative Agent-Based Framework for Policy Simulation and Evaluation
Shuhan Zhang, Zifan Peng , Yinwang Ren
Journal of Artificial Societies & Social Simulation, 29(1), 1
2026
ICML 2026
NL2Repo-Bench: Towards Long-Horizon Repository Generation Evaluation of Coding Agents
Zifan Peng , etc.
International Conference on Machine Learning
2025
NeurIPS 2025
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines
M-A-P Team et al., including Zifan Peng
Conference on Neural Information Processing Systems
2025
arXiv 2025
ZPD-SCA: Unveiling the Blind Spots of LLMs in Assessing Students' Cognitive Abilities
Wenhan Dong, Zhen Sun, Yuemeng Zhao, Zifan Peng , Jun Wu, Jingyi Zheng, Yule Liu, Xinlei He, Yu Wang, Ruiming Wang, Xinyi Huang, Lei Mo
arXiv preprint arXiv:2508.14377
2025
arXiv 2025
"I Can See Forever!": Evaluating Real-time VideoLLMs for Assisting Individuals with Visual Impairments
Ziyi Zhang, Zhen Sun, Zongmin Zhang, Zifan Peng , Yuemeng Zhao, Zichun Wang, Zeren Luo, Ruiting Zuo, Xinlei He
arXiv preprint arXiv:2505.04488
2025
arXiv 2025
Humanizing LLMs: A Survey of Psychological Measurements with Tools, Datasets, and Human-Agent Applications
Wenhan Dong, Yuemeng Zhao, Zhen Sun, Yule Liu, Zifan Peng , Jingyi Zheng, Zongmin Zhang, Ziyi Zhang, Jun Wu, Ruiming Wang, Shengmin Xu, Xinyi Huang, Xinlei He
arXiv preprint arXiv:2505.00049
2025
arXiv 2025
Automatic Pruning via Structured Lasso with Class-wise Information
Xiang Liu, Mingchen Li, Xia Li, Leigang Qu, Zifan Peng , Yijun Song, Zemin Liu, Linshan Jiang, Jialin Li
arXiv preprint arXiv:2502.09125
2025
MLJ 2025
Prompt-based Contrastive Learning to Combat the COVID-19 Infodemic
Zifan Peng , Mingchen Li, Yue Wang, George T.S. Ho
Machine Learning, 114(1), 6
2024
ESWA 2024
Combating the COVID-19 Infodemic Using Prompt-Based Curriculum Learning
Zifan Peng , Mingchen Li, Yue Wang, George T.S. Ho
Expert Systems with Applications, 229, 120501