Biography

I am a first-year Ph.D. student at the University of Hong Kong (HKU), supervised by Prof. Ping Luo at MMLAB@HKU. I previously received my B.Eng. with Outstanding Graduate Honors from Nanjing University (NJU).

My research interests lie in agent-centric post-training of large vision–language models and their practical applications, with a current emphasis on LLM-based agents in GUI and trading domains.

News

  • [06/2025] GUIOdyssey was accepted to ICCV’25, with 100+ GitHub stars and 500K+ Hugging Face downloads!
  • [04/2025] PhyGenBench was accepted to ICML’25.
  • [03/2025] MM-EUREKA received 700+ GitHub stars.
  • [01/2025] MMIU was accepted to ICLR’25.
  • [05/2024] ChartAssistant was accepted to ACL’24.
  • [05/2024] MMT-Bench was accepted to ICML’24.
  • [03/2024] OmniMedVQA was accepted to CVPR’24.
  • [10/2022] Awarded the National Scholarship.

Selected Publications

  • TVWorld: Foundations for Remote-Control TV Agents,
    Zhantao Ma*, Quanfeng Lu*, Shuai Zhong, Dahai Yu, Ping Luo, Michael K. Ng
    arXiv, 2026. Paper, Code, Hugging Face
  • SWIRL: A Staged Workflow for Interleaved Reinforcement Learning in Mobile GUI Control,
    Quanfeng Lu, Zhantao Ma, Shuai Zhong, Jin Wang, Dahai Yu, Michael K Ng, Ping Luo
    arXiv, 2025. Paper, Code
  • UniFork: Exploring Modality Alignment for Unified Multimodal Understanding and Generation,
    Teng Li, Quanfeng Lu, Lirui Zhao, Hao Li, Xizhou Zhu, Yu Qiao, Jun Zhang, Wenqi Shao
    Paper, Code
  • GUIOdyssey: A Comprehensive Dataset for Cross-App GUI Navigation on Mobile Devices,
    Quanfeng Lu, Wenqi Shao, Zitao Liu, Lingxiao Du, …, Ping Luo
    ICCV, 2025. Paper, Code, Hugging Face
  • MM-Eureka: Exploring the Frontiers of Multimodal Reasoning with Rule-based Reinforcement Learning,
    Fanqing Meng, Lingxiao Du, Zongkai Liu, Zhixiang Zhou, Quanfeng Lu, …, Wenqi Shao
    arXiv, 2025. Paper, Code
  • OmniMedVQA: A New Large-Scale Comprehensive Evaluation Benchmark for Medical LVLM,
    Yutao Hu*, Tianbin Li*, Quanfeng Lu*, Wenqi Shao, Junjun He, Yu Qiao, Ping Luo
    CVPR, 2024. Paper, Code
  • ChartAssisstant: A Universal Chart Multimodal Language Model via Chart-to-Table Pre-training and Multitask Instruction Tuning,
    Fanqing Meng, Wenqi Shao, Quanfeng Lu, Peng Gao, Kaipeng Zhang, Yu Qiao, Ping Luo
    ACL, 2024. Paper

Education

  • Sept. 2025 - June. 2029 (expected) Ph.D., The University of Hong Kong.
  • Sept. 2020 - June. 2024 B.Eng., Nanjing University.
  • Sept. 2017 - July. 2020 The Affiliated High School of South China Normal University.

Selected Honors

  • [2024] Outstanding Graduate of Nanjing University
  • [2022] National Scholarship

Services

Conference Reviewers:

  • CVPR: 2025, 2026
  • ICCV: 2025
  • ECCV: 2024, 2026
  • ICLR: 2026
  • ICML: 2026
  • ACL: 2026
  • ACM MM: 2025