Yi Zhang

Ph.D. Candidate, Beihang University · Visiting Student, Tsinghua University

Room 3-523, FIT Building

Tsinghua University, Beijing

yi.zhang.4096 [at] gmail.com

I am currently a Ph.D. candidate at the State Key Laboratory of Virtual Reality Technology and Systems, School of Computer Science and Engineering (SCSE), Beihang University, advised by Prof. Shi-Min Hu. I am also a visiting student at the Graphics and Geometric Computing Group, Tsinghua University, where I am a core developer of the Jittor framework. I have served as a reviewer for top-tier conferences and journals, including ICLR, ICML, IEEE Transactions on Image Processing, and Computational Visual Media Journal.

Research Interests

  • Multimodal Large Language Models
  • Embodied AI
  • Computer Vision

My current research focuses on multimodal large language models and embodied AI.

news

Jan 22, 2026 One paper was accepted by ICLR 2026.
Nov 01, 2025 Received the National Scholarship for Ph.D. Students.
Sep 20, 2025 One paper was accepted by NeurIPS 2025.
Jul 01, 2025 Supported by the CIE-Tencent Doctoral Research Incentive Project (Hunyuan Large Model Special Program).
Apr 15, 2025 One paper was accepted by ICML 2025.

selected publications

  1. Bee: A High-Quality Corpus and Full-Stack Suite to Unlock Advanced Fully Open MLLMs
    Yi Zhang, Bolin Ni, Xin-Sheng Chen, and 7 more authors
    In International Conference on Learning Representations (ICLR), 2026
  2. Adaptive Parameter Selection for Tuning Vision-Language Models
    Yi Zhang, Yi-Xuan Deng, Meng-Hao Guo, and 1 more author
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2025
  3. Exploring Regional Clues in CLIP for Zero-Shot Semantic Segmentation
    Yi Zhang, Meng-Hao Guo, Miao Wang, and 1 more author
    In IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2024