Hao Zhou   周 浩
Researcher, Ph.D. Monetization GenAI Bytedance Inc Email: zhouh156 (at) mail.ustc.edu.cn |
I am currently a researcher in Bytedance and focus on developing generative AI in the ads tech and creative industry. I obtained my Ph.D. degree in University of Science and Technology of China (USTC) in 2022. My supervisors are Prof. Wengang Zhou and Prof. Houqiang Li. Prior to that, I received my B.S. degree from Xidian University (XDU) in 2017.
My research interests are in computer vision, and I am currently working on video understanding, generation and editing for creative ads.
*Equal contribition, †Corresponding author
![]() |
Change3D: Revisiting Change Detection and Captioning from A Video Modeling Perspective
Duowang Zhu, Xiaohu Huang, Haiyan Huang, Hao Zhou, Zhenfeng Shao IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), Highlight, 2025 [pdf] [code] |
![]() |
Skim then Focus: Integrating Contextual and Fine-grained Views for Repetitive Action Counting
Zhengqi Zhao*, Xiaohu Huang*, Hao Zhou†, Kun Yao, Errui Ding, Jingdong Wang, Xinggang Wang, Wenyu Liu, Bin Feng† International Journal of Computer Vision (IJCV), 2025 [pdf] |
![]() |
Semi-Supervised Spoken Language Glossification
Huijie Yao, Wengang Zhou, Hao Zhou, Houqiang Li The Annual Meeting of the Association for Computational Linguistics (ACL), 2024 [pdf] [code] |
![]() |
StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond
Pengyuan Lyu*, Yulin Li*, Hao Zhou, Weihong Ma, Xingyu Wan, Qunyi Xie, Liang Wu, Chengquan Zhang, Kun Yao, Errui Ding, Jingdong Wang arXiv preprint, 2024 [pdf] |
![]() |
FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition
Xiaohu Huang, Hao Zhou, Kun Yao, Kai Han International Conference on Learning Representations (ICLR), 2024 [pdf] [code] |
![]() |
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
Junkun Yuan*, Xinyu Zhang*, Hao Zhou, Jian Wang, Zhongwei Qiu, Zhiyin Shao, Shaofeng Zhang, Sifan Long, Kun Kuang, Kun Yao, Junyu Han, Errui Ding, Lanfen Lin, Fei Wu and Jingdong Wang Conference on Neural Information Processing Systems (NeurIPS), 2023 [pdf] [code] |
![]() |
Sign Language Translation with Iterative Prototype
Huijie Yao, Wengang Zhou, Hao Feng, Hezhen Hu, Hao Zhou, Houqiang Li International Conference on Computer Vision (ICCV), 2023 [pdf] |
![]() |
Graph Contrastive Learning for Skeleton-based Action Recognition
Xiaohu Huang, Hao Zhou†, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang, Xinggang Wang, Wenyu Liu, Bin Feng International Conference on Learning Representations (ICLR), 2023 [pdf] [code] |
![]() |
Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition and Translation
Hao Zhou, Wengang Zhou, Yun Zhou, Houqiang Li IEEE Transactions on Multimedia, 2021 [pdf] |
![]() |
Improving Sign Language Translation with Monolingual Data by Sign Back-Translation
Hao Zhou, Wengang Zhou, Weizhen Qi, Junfu Pu, Houqiang Li IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021 [pdf] |
![]() |
Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition
Hao Zhou, Wengang Zhou, Yun Zhou, Houqiang Li AAAI Conference on Artificial Intelligence (AAAI), Oral, 2020 [pdf] |
![]() |
Dynamic Pseudo Label Decoding for Continuous Sign Language Recognition
Hao Zhou, Wengang Zhou, Houqiang Li IEEE International Conference on Multimedia and Expo (ICME), 2019 [pdf] |
IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI) |
IEEE Transactions on Circuits and Systems for Video Technology (TCSVT) |
IEEE Transactions on Multimedia (TMM) |
IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024, 2025 |
International Conference on Computer Vision (ICCV), 2025 |
The Annual Meeting of the Association for Computational Linguistics (ACL), 2025 |
International Conference on Machine Learning (ICML), 2025 |
AAAI Conference on Artificial Intelligence (AAAI), 2023, 2024, 2025 |
January. 2025 - Now Researcher, Monetization GenAI, Bytedance |
July. 2022 - Dec. 2024 Researcher, Department of Computer Vision Technology (VIS), Baidu |
May. 2019 - July. 2019 Research Intern, BigData Group, Baidu |
Spring 2020 INY5205.02, Digital Image Analysis |
Spring 2019 210708.01, Digital Image Processing B |
2021 Runner-up (2/132), ChaLearn LAP Large Scale Signer Independent Isolated SLR Challenge, CVPR |
2017 Outstanding Graduate Student, XDU |
2014 National Scholarship, Ministry of Education, PRC |