I am currently pursuing my PhD at the Institute for Intelligent Collaborative Computing, University of Electronic Science and Technology of China (UESTC).

I have been advised by Tao He and have worked closely with Dongyang Zhang and Guiduo Duan at the Ubiquitous Intelligence and Trusted Services Key Laboratory of Sichuan Province. I am currently interning at the VIVO BlueImage Laboratory under the supervision of Qingnan Fan.

My research interests include multimodal perception, MLLM post-training, and affective computing. My future research will focus on Unified Multimodal Models (UMM) and omni-modal models. I have published 15+ papers.

🔥 News

  • 2026.04: 🧱 Began my research internship at VIVO.
  • 2026.02: 🎉 Two papers are accepted by CVPR 2026!
  • 2026.01: 🎉 One paper is accepted by ACL 2026!
  • 2026.01: 🎉 Two papers are accepted by ICASSP 2026!
  • 2025.11: 🎉 Two papers are accepted by AAAI 2026!
  • 2025.03: 🎉 One paper is accepted by CVPR 2025!
  • 2024.02: 🎉 One paper is accepted by COLING 2024!
  • 2023.05: 🎉 One paper is accepted by ICANN 2023!

📝 Publications

† indicates the corresponding author

🧡 Affective Computing

AAAI 2026

TiCAL: Typicality-Based Consistency-Aware Learning for Multimodal Emotion Recognition
Wen Yin, Siyu Zhan, Cencen Liu, Xin Hu, Guiduo Duan, Xiurui Xie, Yuan-Fang Li, Tao He

  • We propose TiCAL, a novel framework that performs dynamic multi-stage fusion by leveraging inter-modal consistency and unimodal typicality, mimicking human-like emotion perception.
  • TiCAL assesses unimodal emotional pseudo-labels and their typicality, then defines a modal consistency quantification matrix, enabling robust multimodal emotion recognition (MER).

CVPR 2025

Knowledge-Aligned Counterfactual-Enhancement Diffusion Perception for Unsupervised Cross-Domain Visual Emotion Recognition
Wen Yin, Yong Wang, Guiduo Duan, Dongyang Zhang, Xin Hu, Yuan-Fang Li, Tao He

  • We introduce the task of Unsupervised Cross-Domain Visual Emotion Recognition (UCDVER).
  • We propose a Knowledge-aligned Counterfactual-enhancement Diffusion Perception (KCDP) framework to learn domain-agnostic knowledge across diverse emotion domains.

✨ Multimodal Perception and MLLM

Others

📖 Education

  • 2017.09 - 2021.06, Undergraduate, Henan University of Economics and Law, Zhengzhou.
  • 2021.09 - 2024.06, Master's, University of Electronic Science and Technology of China, Chengdu.
  • 2024.09 - now, PhD, University of Electronic Science and Technology of China, Chengdu.

💼 Internships

BlueImage Lab, Vivo
Research Intern, focusing on Unified Multimodal Model (UMM)
Supervisor: Qingnan Fan
2026.04 - Present

Personality Perception Department, Megvii Technology
Research Intern, focusing on MLLM and Post-training.
2025.09 - 2026.03

🎖 Honors and Awards

  • 2025.11 Yanbao Special Scholarship
  • 2025.10 Academic Scholarship
  • 2024.01 Silver Star Scholarship
  • 2023.10 Academic Scholarship
  • 2022.12 Second Prize, National University Computer Ability Challenge
  • 2021.05 Huang Tingfang Scholarship