About Me

I am a Research Fellow at the NExT++ Center, National University of Singapore (NUS), working with Prof. Tat-Seng Chua. Previously, I received my Ph.D. from Nanjing University of Science and Technology under Prof. Jinhui Tang, and conducted research at Tencent AI Lab with Prof. Xiao Han and Dr. Jun Zhang. I was very lucky to participate in several interesting projects, including Duix.Avatar Duix.Avatar stars, V-Express V-Express stars, and IMAGDressing IMAGDressing stars. To date, I have achieved 50+ top-three finishes in algorithm competitions, including five CCF-A workshop championships, and maintain close collaborations with Tencent AI Lab, Huawei, AIUNI, GuijiAI, and Mobvoi. I serve as a reviewer for TPAMI, NeurIPS, ICML, ICLR, CVPR, ECCV, and ICCV. PS: We are actively seeking motivated Ph.D., M.S., and undergraduate students to collaborate with Prof. Chua and myself through CSC, RA, visiting positions, or remote internships in the following research areas. If you are interested, please send your CV to shenfei29@nus.edu.sg.

📚 Research Interests

My research focuses on Large Model Safety and Controllable Generation, organized into two themes:

  • Large Model Safety
    • Multilingual safety
    • Multiagent safety
    • Multimodal safety
  • Image/Video Generation and Editing
    • Human/Story image generation
    • Clothing/Fashion image generation
    • Controllable editing
    • Controllable adapter

🔥 News

  • 2025.09:  🎉 Three papers were accepted by NeurIPS 2025.
  • 2025.06:  🔥 We released IMAGHarmony for image editing.
  • 2025.05:  🎉 Two papers were accepted by ICML 2025.
  • 2025.04:  🔥 We released IMAGGarment for fine-grained garment generation.
  • 2025.03:  🔥 We released HeyGem.ai, a long-term digital human tool for local and API use.
  • 2025.01:  🎉 Two papers were accepted by ICLR 2025.
  • 2024.12:  🎉 Five papers were accepted by AAAI 2025.
  • 2024.11:  🔥 We released IMAGPose, a unified conditional framework for person generation.
  • 2024.09:  🎉 One paper was accepted by NeurIPS 2024.
  • 2024.05:  🔥 We released IMAGDressing-v1 for customizable virtual dressing.
  • 2024.05:  🔥 We released V-Express for portrait video generation.
  • 2024.01:  🎉 One paper was accepted by ICLR 2024.
  • 2023.07:  🎉 One paper was accepted by ACM Multimedia 2023.


📝 Selected Publications

imagedit
IMAGEdit : Let Any Subject Transform
F. Shen, W. Xu, R. Yan, D. Zhang, X. Shu, J. Tang
Under Review.
[PDF]    [Code]
imagharmony
IMAGHarmony: Controllable image editing with consistent object quantity and layout
F. Shen, X. Du, Y. Gao, J. Yu, Y. Cao, X. Lei, J. Tang
Under Review.
[PDF]    [Code]
imaggarment
IMAGGarment: Fine-grained garment generation for controllable fashion design
F. Shen, J. Yu, C. Wang, X. Jiang, X. Du, J. Tang
Under Review.
[PDF]    [Code]
IMAGDressing-v1
IMAGDressing-v1: Customizable Virtual Dressing
F. Shen, X. Jiang, X. He, H. Ye, C. Wang, X. Du, Z. Li, J. Tang
AAAI Conference on Artificial Intelligence (AAAI), 2025.
[PDF]    [Code]
MCDM
Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model
F. Shen, C. Wang, J. Gao, Q. Guo, J. Dang, J. Tang, T.-S. Chua
International Conference on Machine Learning (ICML), 2025.
[PDF]    [Code]
RCDMs
Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models
F. Shen, H. Ye, S. Liu, J. Zhang, C. Wang, X. Han, W. Yang
AAAI Conference on Artificial Intelligence (AAAI), 2025.
[PDF]    [Code]
IMAGPose
IMAGPose: A Unified Conditional Framework for Pose-Guided Person Generation
F. Shen, J. Tang
Neural Information Processing Systems (NeurIPS), 2024.
[PDF]    [Code]
PCDMs
Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
F. Shen, H. Ye, J. Zhang, C. Wang, X. Han, W. Yang
International Conference on Learning Representations (ICLR), 2024.
[PDF]    [Code]
GiT: Graph Interactive Transformer for Vehicle Re-identification
GiT: Graph Interactive Transformer for Vehicle Re-identification
F. Shen, Y. Xie, J. Zhu, X. Zhu, H. Zeng
IEEE Transactions on Image Processing (IEEE TIP), 2023
[PDF]    [Code]
PBSL: Pedestrian-specific Bipartite-aware Similarity Learning for Text-based Person Retrieval
Pedestrian-specific Bipartite-aware Similarity Learning for Text-based Person Retrieval
F. Shen, X. Shu, X. Du, J. Tang
ACM International Conference on Multimedia (ACM MM), 2023.
[PDF]    [Code]