Hao Zhou   周 浩

Researcher, Ph.D.

Department of Computer Vision Technology (VIS)

Baidu Inc

Email: zhouh156 (at) mail.ustc.edu.cn

Google Scholar / GitHub

About Me

I am currently a researcher at Baidu. I obtained my Ph.D. degree from the University of Science and Technology of China (USTC) in 2022, where my supervisors were Prof. Wengang Zhou and Prof. Houqiang Li. Prior to that, I received my B.S. degree from Xidian University (XDU) in 2017.

My research interests are in computer vision, and I am currently working on cross-modal understanding (image, video and text) with large language models.

Publications

*Equal contribution, †Corresponding author

Semi-Supervised Spoken Language Glossification
Huijie Yao, Wengang Zhou, Hao Zhou, Houqiang Li
The Annual Meeting of the Association for Computational Linguistics (ACL), 2024
[pdf] [code]
StrucTexTv3: An Efficient Vision-Language Model for Text-rich Image Perception, Comprehension, and Beyond
Pengyuan Lyu*, Yulin Li*, Hao Zhou, Weihong Ma, Xingyu Wan, Qunyi Xie, Liang Wu, Chengquan Zhang, Kun Yao, Errui Ding, Jingdong Wang
arXiv preprint, 2024
[pdf]
FROSTER: Frozen CLIP Is A Strong Teacher for Open-Vocabulary Action Recognition
Xiaohu Huang, Hao Zhou, Kun Yao, Kai Han
International Conference on Learning Representations (ICLR), 2024
[pdf] [code]
HAP: Structure-Aware Masked Image Modeling for Human-Centric Perception
Junkun Yuan*, Xinyu Zhang*, Hao Zhou, Jian Wang, Zhongwei Qiu, Zhiyin Shao, Shaofeng Zhang, Sifan Long, Kun Kuang, Kun Yao, Junyu Han, Errui Ding, Lanfen Lin, Fei Wu, Jingdong Wang
Conference on Neural Information Processing Systems (NeurIPS), 2023
[pdf] [code]
Sign Language Translation with Iterative Prototype
Huijie Yao, Wengang Zhou, Hao Feng, Hezhen Hu, Hao Zhou, Houqiang Li
International Conference on Computer Vision (ICCV), 2023
[pdf]
Graph Contrastive Learning for Skeleton-based Action Recognition
Xiaohu Huang, Hao Zhou†, Jian Wang, Haocheng Feng, Junyu Han, Errui Ding, Jingdong Wang, Xinggang Wang, Wenyu Liu, Bin Feng
International Conference on Learning Representations (ICLR), 2023
[pdf] [code]
Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition and Translation
Hao Zhou, Wengang Zhou, Yun Zhou, Houqiang Li
IEEE Transactions on Multimedia, 2021
[pdf]
Improving Sign Language Translation with Monolingual Data by Sign Back-Translation
Hao Zhou, Wengang Zhou, Weizhen Qi, Junfu Pu, Houqiang Li
IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2021
[pdf]
Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition
Hao Zhou, Wengang Zhou, Yun Zhou, Houqiang Li
AAAI Conference on Artificial Intelligence (AAAI), Oral, 2020
[pdf]
Dynamic Pseudo Label Decoding for Continuous Sign Language Recognition
Hao Zhou, Wengang Zhou, Houqiang Li
IEEE International Conference on Multimedia and Expo (ICME), 2019
[pdf]

Academic Services

Invited Reviewer for Journals and Conferences:
  IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI)
  IEEE Transactions on Circuits and Systems for Video Technology (TCSVT)
  IEEE Transactions on Multimedia (TMM)
  IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024
  AAAI Conference on Artificial Intelligence (AAAI), 2023, 2024, 2025

Working Experience

July 2022 - Present            Researcher, Department of Computer Vision Technology (VIS), Baidu
May 2019 - July 2019         Research Intern, BigData Group, Baidu

Teaching Experience

Spring 2020            INY5205.02, Digital Image Analysis
Spring 2019            210708.01, Digital Image Processing B

Awards

2021       Runner-up (2/132), ChaLearn LAP Large Scale Signer Independent Isolated SLR Challenge, CVPR
2017       Outstanding Graduate Student, XDU
2014       National Scholarship, Ministry of Education, PRC