I am currently an independent researcher. My research interests lie in AI and machine learning. Specifically, I am interested in how to make AI models generalize better so that they can be applied more readily in real-world settings.
I was a Ph.D. student in the School of Computing and Information Systems (SCIS) at Singapore Management University (SMU), under the supervision of Dr. Thivya Kandappu and Dr. Ma Dong. I also worked closely with Dr. Guohao Lan. My research areas were Human-Computer Interaction (HCI) and Pervasive Sensing. However, I chose to withdraw from the Ph.D. program for personal reasons.
Prior to this, I earned my Master's degree from the University of Electronic Science and Technology of China (UESTC). During my studies, I focused on Computer Vision and Vision-Language Processing in the Intelligent Vision Information Processing (IVIP) Lab, under the guidance of Prof. Hongliang Li.
Publications
Spatial-Semantic Attention for Grounded Image Captioning. Wenzhe Hu, Lanxiao Wang, Linfeng Xu. ICIP 2022.
What Happens in Crowd Scenes: A New Dataset About Crowd Scenes for Image Captioning. Lanxiao Wang, Hongliang Li, Wenzhe Hu, Xiaoliang Zhang, Heqian Qiu, Fanman Meng, Qingbo Wu. TMM 2022.
A Survey of Vision and Language Related Multi-Modal Task. Lanxiao Wang, Wenzhe Hu, Heqian Qiu, Chao Shang, Taijin Zhao, Benliu Qiu, King Ngi Ngan, Hongliang Li. CAAI Artificial Intelligence Research, 2022.
Projects
Gaze-Aided Low-Vision Assistance
Jan 2024 - Nov 2024
- Worked as the first author and project leader.
- Initially aimed to design a low-vision assistive system that provides assistance based on the user's gaze movements; the project later shifted to implementing an early detection system for visual impairment.
- Conducted a comprehensive literature survey covering ophthalmology, accessibility, HCI, and mobile computing.
- Conducted pilot studies and a formal data collection study; implemented a web-based interface for the experiments; collected eye movement data from participants.
- Extracted eye movement features; applied statistical methods to analyze the eye movement data; designed machine learning and deep learning models to classify gaze features.
Education
- Jan 2024 - Nov 2024, Ph.D. student at Singapore Management University (Withdrawn). Main supervisor: Thivya Kandappu. Co-supervisor: Ma Dong. Area: Pervasive Sensing and Systems.
- Sep 2020 - Jun 2023, Master of Engineering in Information and Communication Engineering, University of Electronic Science and Technology of China. Supervisor: Prof. Hongliang Li. Thesis: Image Captioning Theories and Methods.
- Sep 2016 - Jun 2020, Bachelor of Engineering in Electronic Information Engineering, University of Electronic Science and Technology of China.
Experience
- Jan 2023 - Jun 2023, Research Assistant, DeepSE Lab, The Hong Kong University of Science and Technology, Guangzhou. Participated in an OCR competition (Hierarchical Text: Challenge on Unified OCR and Layout Analysis) and achieved 2nd place.