RubricBench: Aligning Model-Generated Rubrics with Human Standards
Qiyuan Zhang, Junyi Zhou,Yufei Wang, Fuyuan Lyu, Yidong Ming, Can Xu, Qingfeng Sun, Kai Zheng, Peng Kang, Xue Liu, Chen Ma. · ACL 2026 Main
Ph.D. Student
City University of Hong Kong
I am currently a candidate Ph.D. student advised by Prof. Chen Ma. Previously, I completed my B.Sc. and M.Sc. in Computer Science at the University of Electronic Science and Technology of China and spent time at Singapore Management University working with Jing Jiang. Soon, I will join Prof. Xue Liu’s group at MBZUAI as a visiting student.
My research interests lie in auto‑evaluation, reward modeling, preference modeling, and improved scaling strategies such as test‑time scaling for large language models. I am always excited about new collaborations—if you share these interests or see potential synergies, feel free to reach out via email!
Now, I have interned with Noah Lab@Huawei (Hong Kong) and Hunyuan team@Tencent, where I am focusing my efforts on advancing reward modeling. I am also seeking visiting or research‑intern opportunities to further explore frontier research topics.
In addition, I regularly post self-reflections on Medium—feel free to take a look if you’re interested!
My selected publications represent my research style and interests.
RubricBench: Aligning Model-Generated Rubrics with Human Standards
Qiyuan Zhang, Junyi Zhou,Yufei Wang, Fuyuan Lyu, Yidong Ming, Can Xu, Qingfeng Sun, Kai Zheng, Peng Kang, Xue Liu, Chen Ma. · ACL 2026 Main
Beyond Length Scaling: Synergizing Breadth and Depth for Generative Reward Models
Qiyuan Zhang, Yufei Wang, Tianhe Wu, Can Xu, Qingfeng Sun, Kai Zheng, Xue Liu, Chen Ma. · ACL 2026 Finding
From Verifiable Dot to Reward Chain: Harnessing Verifiable Reference-based Rewards for Reinforcement Learning of Open-ended Generation
Yuxin Jiang, Yufei Wang, Qiyuan Zhang, Xingshan Zeng, Liangyou Li, Jierun Chen, Chaofan Tao, Haoli Bai, Lifeng Shang. · ICLR 2026 Poster
What, How, Where, and How Well? A Survey on Test-Time Scaling in Large Language Models
Qiyuan Zhang, Fuyuan Lyu, Zexu Sun, Lei Wang, Weixu Zhang, Wenyue Hua, Haolun Wu, Zhihan Guo, Yufei Wang, Niklas Muennighoff, Irwin King, Xue Liu, Chen Ma. · Preprint
Crowd Comparative Reasoning: Unlocking Comprehensive Evaluations for LLM-as-a-Judge
Qiyuan Zhang, Yufei Wang, Yuxin Jiang, Liangyou Li, Chuhan Wu, Yasheng Wang, Xin Jiang, Lifeng Shang, Ruiming Tang, Fuyuan Lyu, Chen Ma. · ACL 2025
RevisEval: Improving LLM-as-a-Judge via Response-Adapted References
Qiyuan Zhang, Yufei Wang, Tiezheng YU, Yuxin Jiang, Chuhan Wu, Liangyou Li, Yasheng Wang, Xin Jiang, Lifeng Shang, Ruiming Tang, Fuyuan Lyu, Chen Ma. · ICLR 2024
Collaborative Performance Prediction for Large Language Models
Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma. · EMNLP 2024
NOAHQA: Numerical Reasoning with Interpretable Graph QA Dataset
Qiyuan Zhang, Lei Wang, Sicheng Yu, Shuohang Wang, Yang Wang, Jing Jiang, Ee-Peng Lim. · EMNLP 2021 Findings
MWPToolkit: An Open-Source Framework for DL-Based Math Word Problem Solvers
Yihuai Lan, Lei Wang, Qiyuan Zhang , Yunshi Lan, Bing Tian Dai, Yan Wang, Dongxiang Zhang, Ee-Peng Lim. · AAAI 2021 Workshop