About Me

I am currently a Research Fellow at the NExT++ Center, National University of Singapore (NUS), working with Prof. Chua Tat-Seng. I received my Ph.D. in Computer Science and Technology from Nanjing University of Science and Technology, supervised by Prof. Jinhui Tang, and conducted research at Tencent AI Lab in collaboration with Prof. Xiao Han and Dr. Jun Zhang. To date, I have achieved 50+ top-three finishes in algorithm competitions, including five CCF A workshop championships, and maintain deep collaborations with Tencent AI Lab, Huawei, AIUNI, GuijiAI, and Mobvoi. I serves as a reviewer for TPAMI, NeurIPS, ICML, ICLR, CVPR, ECCV, and ICCV, among others.

PS: I am actively seeking motivated Ph.D., M.S., and undergraduate students interested in these areas. If you are interested, please send your CV to shenfei29@nus.edu.sg.

📚 Research Interests

My research focuses on Large Model Safety and Controllable Generation, organized into two themes:

  • Large Model Safety
    • Multilingual safety
    • Multiagent safety
    • Multimodal safety
  • Image/Video Generation and Editing
    • Human/Story image generation
    • Clothing/Fashion image generation
    • Controllable editing
    • Controllable adapter

🔥 News

  • 2025.06:  🔥 We released IMAGHarmony for image editing.
  • 2025.05:  🎉 Two papers were accepted by ICML 2025.
  • 2025.04:  🔥 We released IMAGGarment for fine-grained garment generation.
  • 2025.03:  🔥 We released HeyGem.ai, a long-term digital human tool for local and API use.
  • 2025.01:  🎉 Two papers were accepted by ICLR 2025.
  • 2024.12:  🎉 Five papers were accepted by AAAI 2025.
  • 2024.11:  🔥 We released IMAGPose, a unified conditional framework for person generation.
  • 2024.09:  🎉 One paper was accepted by NeurIPS 2024.
  • 2024.05:  🔥 We released IMAGDressing-v1 for customizable virtual dressing.
  • 2024.05:  🔥 We released V-Express for portrait video generation.
  • 2024.01:  🎉 One paper was accepted by ICLR 2024.
  • 2023.07:  🎉 One paper was accepted by ACM Multimedia 2023.


📝 Selected Publications

IMAGDressing-v1
IMAGDressing-v1: Customizable Virtual Dressing
F. Shen, X. Jiang, X. He, H. Ye, C. Wang, X. Du, Z. Li, J. Tang
AAAI Conference on Artificial Intelligence (AAAI), 2025.
[PDF]    [Code]
MCDM
Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model
F. Shen, C. Wang, J. Gao, Q. Guo, J. Dang, J. Tang, T.-S. Chua
International Conference on Machine Learning (ICML), 2025.
[PDF]    [Code]
RCDMs
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
F. Shen, H. Ye, S. Liu, J. Zhang, C. Wang, X. Han, W. Yang
AAAI Conference on Artificial Intelligence (AAAI), 2025.
[PDF]    [Code]
IMAGPose
IMAGPose: A Unified Conditional Framework for Pose-Guided Person Generation
F. Shen, J. Tang
Neural Information Processing Systems (NeurIPS), 2024.
[PDF]    [Code]
PCDMs
Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
F. Shen, H. Ye, J. Zhang, C. Wang, X. Han, W. Yang
International Conference on Learning Representations (ICLR), 2024.
[PDF]    [Code]
GiT: Graph Interactive Transformer for Vehicle Re-identification
GiT: Graph Interactive Transformer for Vehicle Re-identification
F. Shen, Y. Xie, J. Zhu, X. Zhu, H. Zeng
IEEE Transactions on Image Processing (IEEE TIP), 2023
[PDF]    [Code]
PBSL: Pedestrian-specific Bipartite-aware Similarity Learning for Text-based Person Retrieval
Pedestrian-specific Bipartite-aware Similarity Learning for Text-based Person Retrieval
F. Shen, X. Shu, X. Du, J. Tang
ACM International Conference on Multimedia (ACM MM), 2023.
[PDF]    [Code]