Haoyu Zhang

Hi, welcome to my webpage! zhang.hy.2019@gmail.com


About Me

I’m Haoyu Zhang (张昊宇), and I am currently a PhD Student at the School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), advised by Prof. Liqiang Nie, Prof. Yaowei Wang, and Prof. Meng Liu. Before that, I received my M.S. degree in the School of Computer Science and Technology from Shandong University, advised by Prof. Liqiang Nie and Prof. Meng Liu. My research interests include Egocentric Vision Understanding, Multimodal Dialog, and Person Re-identification.

Publications

Haoyu Zhang, Meng Liu, Zaijing Li, Haokun Wen, Weili Guan, Yaowei Wang, Liqiang Nie. “Spatial Understanding from Videos: Structured Prompts Meet Simulation Data”. arXiv preprint, arXiv:2506.03642, 2025.
[Paper] [Code] [BibTex]

Yisen Feng, Haoyu Zhang, Qiaohui Chu, Meng Liu, Weili Guan, Yaowei Wang, Liqiang Nie. “OSGNet @ Ego4D Episodic Memory Challenge 2025”. IEEE/CVF Conference on Computer Vision and Pattern Recognition EgoVis Workshop (CVPR Workshop), Oral, 2025.
[Paper] [Code] [BibTex]

Qiaohui Chu, Haoyu Zhang, Yisen Feng, Meng Liu, Weili Guan, Yaowei Wang, Liqiang Nie. “Technical Report for Ego4D Long-Term Action Anticipation Challenge 2025”. IEEE/CVF Conference on Computer Vision and Pattern Recognition EgoVis Workshop (CVPR Workshop), 2025.
[Paper] [Code] [BibTex]

Haoyu Zhang, Yisen Feng, Qiaohui Chu, Meng Liu, Weili Guan, Yaowei Wang, Liqiang Nie. “HCQA-1.5 @ Ego4D EgoSchema Challenge 2025”. IEEE/CVF Conference on Computer Vision and Pattern Recognition EgoVis Workshop (CVPR Workshop), 2025.
[Paper] [Code] [BibTex]

Haoyu Zhang, Qiaohui Chu, Meng Liu, Yunxiao Wang, Bin Wen, Fan Yang, Tingting Gao, Di Zhang, Yaowei Wang, Liqiang Nie. “Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding”. arXiv preprint, arXiv:2503.09143, 2025.
[Paper] [Code] [BibTex]

Yunxiao Wang, Meng Liu, Rui Shao, Haoyu Zhang, Bin Wen, Fan Yang, Tingting Gao, Di Zhang, Liqiang Nie. “TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs”. arXiv preprint, arXiv:2503.09994, 2025.
[Paper] [Code] [BibTex]

Yisen Feng, Haoyu Zhang, Meng Liu, Weili Guan, Liqiang Nie. “Object-Shot Enhanced Grounding Network for Egocentric Video”. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), CCF-A, Full Paper, 2025.
[Paper] [Code] [BibTex]

Haoyu Zhang, Yuquan Xie, Yisen Feng, Zaijing Li, Meng Liu, Liqiang Nie. “HCQA @ Ego4D EgoSchema Challenge 2024”. IEEE/CVF Conference on Computer Vision and Pattern Recognition EgoVis Workshop (CVPR Workshop), 2024.
[Paper] [Code] [BibTex]

Yisen Feng, Haoyu Zhang, Yuquan Xie, Zaijing Li, Meng Liu, Liqiang Nie. “ObjectNLQ @ Ego4D Episodic Memory Challenge 2024”. IEEE/CVF Conference on Computer Vision and Pattern Recognition EgoVis Workshop (CVPR Workshop), 2024.
[Paper] [Code] [BibTex]

Haoyu Zhang, Meng Liu, Zixin Liu, Xuemeng Song, Yaowei Wang, Liqiang Nie. “Multi-Factor Adaptive Vision Selection for Egocentric Video Question Answering”. International Conference on Machine Learning (ICML), CCF-A, Full Paper, 2024.
[Paper] [Code] [BibTex]

Haoyu Zhang, Meng Liu, Yisen Feng, Yaowei Wang, Weili Guan, Liqiang Nie. “Uncovering Hidden Connections: Iterative Search and Reasoning for Video-grounded Dialog”. arXiv preprint, arXiv:2310.07259, 2023.
[Paper] [Code] [BibTex]

Haoyu Zhang, Meng Liu, Yuhong Li, Ming Yan, Zan Gao, Xiaojun Chang, Liqiang Nie. “Attribute-guided Collaborative Learning for Partial Person Re-identification”. IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), CCF-A, SCI-1, IF 24.3, 2023.
[Paper] [Code] [BibTex]

Weili Guan, Xuemeng Song, Haoyu Zhang, Meng Liu, Chung-Hsing Yeh, Xiaojun Chang. “Bi-directional Heterogeneous Graph Hashing towards Efficient Outfit Recommendation”. ACM International Conference on Multimedia (ACM MM), CCF-A, Oral, 2022.
[Paper] [Code] [BibTex]

Haoyu Zhang, Meng Liu, Zan Gao, Xiaoqiang Lei, Yinglong Wang, Liqiang Nie. “Multimodal Dialog System: Relational Graph-based Context-aware Question Understanding”. ACM International Conference on Multimedia (ACM MM), CCF-A, Full Paper, 2021.
[Paper] [Code] [BibTex]

Education

Harbin Institute of Technology, Shenzhen, China
  • School of Computer Science and Technology, Sep 2023 - Now
  • Doctor, Advisor: Prof. Liqiang Nie and Prof. Meng Liu
Shandong University, Qingdao, China
  • School of Computer Science and Technology, Sep 2020 - Jun 2023
  • Master, Advisor: Prof. Liqiang Nie and Prof. Meng Liu
Shandong University of Science and Technology, Qingdao, China            
  • School of Computer Science and Engineering, Sep 2016 - Jun 2020
  • Bachelor, Graduated with Excellent Thesis Award

Experience

Kuaishou Technology, Beijing, China (Jul 2024 - Jun 2025)            
  • Academic Intern, in Community Science
  • Project: Multimodal Large Language Model
Alibaba Group, Hangzhou, China (Wed 2022 - Jul 2022)            
  • Academic Intern, in Alibaba Cloud Intelligence
  • Project: Partial Person Re-identification

Academic Services

Conference Reviewer: CVPR 2025, ICCV 2025, NeurIPS 2022-2025, ICML 2023-2025, ICLR 2024-2025, SIGIR 2025, ACM MM 2022-2024, AAAI 2025, IJCAI 2024, ICMR 2025, PRCV 2023-2024, ISCAS 2024, ECAI 2024.

Journal Reviewer: Information Sciences, IEEE TCSVT, IEEE TMM, IEEE TKDE.

Volunteer: ICML 2024.

Awards

INSUN Education Scholarship, 2024

ICML Travel Award, 2024

Second and Third Prize at the Xinchuang and Digital Economy Postdoctoral Haihe Academic Event, 2023

Outstanding Graduates of Shandong University, 2023

National Scholarship, 2021

ACM MM Student Travel Award, 2021

Outstanding Graduates of Shandong Province, 2020

Top 10 Outstanding Students in the School of Computer Science and Engineering, 2019

Competitions

CVPR EgoVis Workshop, Ego4D Challenge-Long-Term Action Anticipation Track, Champion, 2025

CVPR EgoVis Workshop, Ego4D Challenge-Moment Queries Track, Champion, 2025

CVPR EgoVis Workshop, Ego4D Challenge-Natural Language Query Track, Champion, 2025

CVPR EgoVis Workshop, Ego4D Challenge-Goal Step Track, Champion, 2025

CVPR EgoVis Workshop, Ego4D Challenge-EgoSchema Track, Third Place, 2025

Kaggle EgoSchema Competition, Champion, 2024

CVPR EgoVis Workshop, Ego4D Challenge-EgoSchema Track, Champion, 2024

CVPR EgoVis Workshop, Ego4D Challenge-Natural Language Query Track, Runner-up, 2024

CVPR EgoVis Workshop, Ego4D Challenge-Goal Step Track, Third Place, 2024