About me

I serve as an algorithm expert at Alibaba’s Tongyi Laboratory. I received my Ph.D. from Fudan University in 2023, under the supervision of Professor Jianqing Fan and Professor Zhongyu Wei. Before that, I received my Bachelor’s degree in Mathematics and Applied Mathematics from Fudan University. During my Ph.D., I also spent time with MReaL (Machine Reasoning and Learning) at Nanyang Technological University, advised by Hanwang Zhang.

My primary research interests lie in Vision-Language Reasearch including large vision-language model and multi-modal agents. Currently, I focus on the Qwen-VL.

We are hiring self-motivated research interns about Video LLM, Multi-Modal Agent and Embodied AI. Feel free to contant me.

Professional Service

  • Area Chair: ACL 2023/2024, EMNLP 2024
  • Reviewer: NeurIPS, ICML, ICLR, ACL, EMNLP, IJCAI, AAAI.

Honors and Awards

  • Alibaba Star (Top 1%), Alibaba Group, 2023
  • ByteDance Scholarship (10 in China), 2021
  • National Scholarship for Doctoral Students (Top 1%) in Fudan University, Ministry of Education, 2019

Preprint

Selected Publications

(Full list see Google Scholar)

Experience

  • Alibaba. Tongyi Lab. Sep 2023 - Present
    • Algorithm Expert.
    • Focus:
      • Large Vision-Language Model
      • Multimodal Agents
  • Fudan University. DISC Lab. Oct 2016 - Sep 2023
    • Graduate Research Assistant. Advisor: Zhongyu Wei
    • Focus:
      • Vision-Language Generation and Retrieval
      • Vision-Language Pre-training
  • Nanyang Technological University. MReal Lab. Oct 2022 - Apr 2023
  • Microsoft Research Asia. NLC Group. Sep 2019 - Mar. 2020
    • Research Intern. Advisor: Yeyun Gong
    • Focus: Transformer Architecture