Sheng Zhou (ε‘¨ζ™Ÿ)

I obtained my Ph.D. degree in Computer Science at Hefei University of Technology (HFUT) through a direct M.S.–Ph.D. program from Sep. 2020 to Dec. 2025, advised by Prof. Dan Guo and Prof. Meng Wang. From Aug. 2023 to Aug. 2024, I studied at University of Science and Technology of China (USTC) for one year under the guidance of Prof. Xun Yang. From Sep. 2024 to Aug. 2025, I studied at National University of Singapore (NUS) as a visiting student under the guidance of Dr. Junbin Xiao, Prof. Angela Yao, and Prof. Tat-Seng Chua.

I am actively seeking research discussions and collaboration opportunities, so feel free to contact me!

My group at KAUST is actively recruiting visiting/remote students, with several openings available until July 2026. Students interested in MLLMs for Healthcare are welcome to contact me!

[Email] [Github] [Google Scholar]

Research

My research focuses on visual-language understanding and reasoning, primarily on scene-text visual question answering and visual grounding. I am currently expanding my research scope to egocentric video understanding and multimodal large language models for human assistance.

πŸ”₯News

  • 2025.12: I will soon join King Abdullah University of Science and Technology (KAUST) as a Postdoctoral Fellow.
  • 2025.11: Successfully defended my Ph.D. πŸŽ“ Thesis: Research on Scene Text-Driven Visual Question Answering.
  • 2025.05: Our work EgoTextVQA will be presented at the Egocentric Vision (EgoVis) Workshop and Vision-based Assistants in the Real-World (VAR) Workshop @ CVPR 2025! πŸ˜„
  • 2025.05: One paper is accepted by IEEE TMM. πŸŽ‰
  • 2025.02: One paper is accepted by CVPR. πŸŽ‰
  • 2025.02: I was honored with the Tat-Seng Chua Scholarship.
  • 2024.09: I will be a visiting student at NUS for one year, collaborating with Dr. Junbin Xiao.
  • 2024.01: One paper is accepted by ACM TOMM. πŸŽ‰
  • 2023.09: One paper is accepted by IEEE TIP. πŸŽ‰
  • 2023.07: Started my study at USTC, supervised by Prof. Xun Yang.
  • Publications and Preprints

EgoTextVQA: Towards Egocentric Scene-Text Aware Video Question Answering
Sheng Zhou, Junbin Xiao, Qingyun Li, Yicong Li, Xun Yang, Dan Guo, Meng Wang, Tat-Seng Chua, Angela Yao.
CVPR'25 [arXiv] [Project Page] [Code] [Dataset]

Scene-Text Grounding for Text-Based Video Question Answering
Sheng Zhou, Junbin Xiao, Xun Yang, Peipei Song, Dan Guo, Angela Yao, Meng Wang, Tat-Seng Chua.
TMM'25 [arXiv] [Code] [Dataset]

Graph Pooling Inference Network for Text-based VQA
Sheng Zhou, Dan Guo, Xun Yang, Jianfeng Dong, Meng Wang.
TOMM'24 [Paper] [Code]

Exploring Sparse Spatial Relation in Graph Inference for Text-Based VQA
Sheng Zhou, Dan Guo, Jia Li, Xun Yang, Meng Wang.
TIP'23 [Paper] [Code]

Selected Honors and Awards

  • [2025.02] Tat-Seng Chua Scholarship
  • [2022 - 2025] First Class Academic Scholarship (three times)
  • [2020 - 2022] Second Class Academic Scholarship (two times)
  • [2020] Outstanding Graduate of Innovation and Entrepreneurship in Hunan Province
  • [2016 - 2019] National Encouragement Scholarship (three times)