Hi, welcome to my webpage! zhang.hy.2019@gmail.com
I’m Haoyu Zhang (张昊宇), and I am currently a PhD Student at the School of Computer Science and Technology, Harbin Institute of Technology (Shenzhen), advised by Prof. Liqiang Nie, Prof. Yaowei Wang, and Prof. Meng Liu. Before that, I received my M.S. degree in the School of Computer Science and Technology from Shandong University, advised by Prof. Liqiang Nie and Prof. Meng Liu. My research interests include Egocentric Vision Understanding, Multimodal Dialog, and Person Re-identification.
Haoyu Zhang, Meng Liu, Zaijing Li, Haokun Wen, Weili Guan, Yaowei Wang, Liqiang Nie. “Spatial Understanding from Videos: Structured Prompts Meet Simulation Data”. arXiv preprint, arXiv:2506.03642, 2025.
[Paper] [Code] [BibTex]
Yisen Feng, Haoyu Zhang, Qiaohui Chu, Meng Liu, Weili Guan, Yaowei Wang, Liqiang Nie. “OSGNet @ Ego4D Episodic Memory Challenge 2025”. IEEE/CVF Conference on Computer Vision and Pattern Recognition EgoVis Workshop (CVPR Workshop), Oral, 2025.
[Paper] [Code] [BibTex]
Qiaohui Chu, Haoyu Zhang, Yisen Feng, Meng Liu, Weili Guan, Yaowei Wang, Liqiang Nie. “Technical Report for Ego4D Long-Term Action Anticipation Challenge 2025”. IEEE/CVF Conference on Computer Vision and Pattern Recognition EgoVis Workshop (CVPR Workshop), 2025.
[Paper] [Code] [BibTex]
Haoyu Zhang, Yisen Feng, Qiaohui Chu, Meng Liu, Weili Guan, Yaowei Wang, Liqiang Nie. “HCQA-1.5 @ Ego4D EgoSchema Challenge 2025”. IEEE/CVF Conference on Computer Vision and Pattern Recognition EgoVis Workshop (CVPR Workshop), 2025.
[Paper] [Code] [BibTex]
Haoyu Zhang, Qiaohui Chu, Meng Liu, Yunxiao Wang, Bin Wen, Fan Yang, Tingting Gao, Di Zhang, Yaowei Wang, Liqiang Nie. “Exo2Ego: Exocentric Knowledge Guided MLLM for Egocentric Video Understanding”. arXiv preprint, arXiv:2503.09143, 2025.
[Paper] [Code] [BibTex]
Yunxiao Wang, Meng Liu, Rui Shao, Haoyu Zhang, Bin Wen, Fan Yang, Tingting Gao, Di Zhang, Liqiang Nie. “TIME: Temporal-sensitive Multi-dimensional Instruction Tuning and Benchmarking for Video-LLMs”. arXiv preprint, arXiv:2503.09994, 2025.
[Paper] [Code] [BibTex]
Yisen Feng, Haoyu Zhang, Meng Liu, Weili Guan, Liqiang Nie. “Object-Shot Enhanced Grounding Network for Egocentric Video”. IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), CCF-A, Full Paper, 2025.
[Paper] [Code] [BibTex]
Haoyu Zhang, Yuquan Xie, Yisen Feng, Zaijing Li, Meng Liu, Liqiang Nie. “HCQA @ Ego4D EgoSchema Challenge 2024”. IEEE/CVF Conference on Computer Vision and Pattern Recognition EgoVis Workshop (CVPR Workshop), 2024.
[Paper] [Code] [BibTex]
Yisen Feng, Haoyu Zhang, Yuquan Xie, Zaijing Li, Meng Liu, Liqiang Nie. “ObjectNLQ @ Ego4D Episodic Memory Challenge 2024”. IEEE/CVF Conference on Computer Vision and Pattern Recognition EgoVis Workshop (CVPR Workshop), 2024.
[Paper] [Code] [BibTex]
Haoyu Zhang, Meng Liu, Zixin Liu, Xuemeng Song, Yaowei Wang, Liqiang Nie. “Multi-Factor Adaptive Vision Selection for Egocentric Video Question Answering”. International Conference on Machine Learning (ICML), CCF-A, Full Paper, 2024.
[Paper] [Code] [BibTex]
Haoyu Zhang, Meng Liu, Yisen Feng, Yaowei Wang, Weili Guan, Liqiang Nie. “Uncovering Hidden Connections: Iterative Search and Reasoning for Video-grounded Dialog”. arXiv preprint, arXiv:2310.07259, 2023.
[Paper] [Code] [BibTex]
Haoyu Zhang, Meng Liu, Yuhong Li, Ming Yan, Zan Gao, Xiaojun Chang, Liqiang Nie. “Attribute-guided Collaborative Learning for Partial Person Re-identification”. IEEE Transactions on Pattern Analysis and Machine Intelligence (IEEE TPAMI), CCF-A, SCI-1, IF 24.3, 2023.
[Paper] [Code] [BibTex]
Weili Guan, Xuemeng Song, Haoyu Zhang, Meng Liu, Chung-Hsing Yeh, Xiaojun Chang. “Bi-directional Heterogeneous Graph Hashing towards Efficient Outfit Recommendation”. ACM International Conference on Multimedia (ACM MM), CCF-A, Oral, 2022.
[Paper] [Code] [BibTex]
Haoyu Zhang, Meng Liu, Zan Gao, Xiaoqiang Lei, Yinglong Wang, Liqiang Nie. “Multimodal Dialog System: Relational Graph-based Context-aware Question Understanding”. ACM International Conference on Multimedia (ACM MM), CCF-A, Full Paper, 2021.
[Paper] [Code] [BibTex]
Conference Reviewer: CVPR 2025, ICCV 2025, NeurIPS 2022-2025, ICML 2023-2025, ICLR 2024-2025, SIGIR 2025, ACM MM 2022-2024, AAAI 2025, IJCAI 2024, ICMR 2025, PRCV 2023-2024, ISCAS 2024, ECAI 2024.
Journal Reviewer: Information Sciences, IEEE TCSVT, IEEE TMM, IEEE TKDE.
Volunteer: ICML 2024.
INSUN Education Scholarship, 2024
ICML Travel Award, 2024
Second and Third Prize at the Xinchuang and Digital Economy Postdoctoral Haihe Academic Event, 2023
Outstanding Graduates of Shandong University, 2023
National Scholarship, 2021
ACM MM Student Travel Award, 2021
Outstanding Graduates of Shandong Province, 2020
Top 10 Outstanding Students in the School of Computer Science and Engineering, 2019
CVPR EgoVis Workshop, Ego4D Challenge-Long-Term Action Anticipation Track, Champion, 2025
CVPR EgoVis Workshop, Ego4D Challenge-Moment Queries Track, Champion, 2025
CVPR EgoVis Workshop, Ego4D Challenge-Natural Language Query Track, Champion, 2025
CVPR EgoVis Workshop, Ego4D Challenge-Goal Step Track, Champion, 2025
CVPR EgoVis Workshop, Ego4D Challenge-EgoSchema Track, Third Place, 2025
Kaggle EgoSchema Competition, Champion, 2024
CVPR EgoVis Workshop, Ego4D Challenge-EgoSchema Track, Champion, 2024
CVPR EgoVis Workshop, Ego4D Challenge-Natural Language Query Track, Runner-up, 2024
CVPR EgoVis Workshop, Ego4D Challenge-Goal Step Track, Third Place, 2024