Publications
Preprint Papers, Accept Articles
(*) beside authors' names indicates equal contributions.
(✉) beside authors' names indicates the corresponding author.
Preprint Papers
StyleTailor: Towards Personalized Fashion Styling via Hierarchical Negative Feedback.
H. Ma, F. Shen, H. Xu, X. Wang, G. Xu, J. Zheng, L. Qu, M. Li
Under Review.CharacterShot: Controllable and Consistent 4D Character Animation.
J. Gao, J. Li, W. Liu, Y. Zeng, F. Shen, K. Chen, Y. Sun, C. Zhao
Under Review.Advanced Sign Language Video Generation with Compressed and Quantized Multi-Condition Tokenization.
C. Wang, Z. Deng, Z. Jiang, F. Shen, Y. Yin, S. Gan, Z. Cheng, S. Ge, Q. Gu
Under Review.Fine-grained Image Retrieval via Dual-Vision Adaptation.
X. Jiang, M. Cao, H. Tang, F. Shen, Z. Li
Under Review.IMAGHarmony: Controllable Image Editing with Consistent Object Quantity and Layout.
F. Shen, X. Du, Y. Gao, J. Yu, Y. Cao, X. Lei, J. Tang
Under Review.IMAGGarment: Fine-Grained Garment Generation for Controllable Fashion Design.
F. Shen, J. Yu, C. Wang, X. Jiang, X. Du, J. Tang
Under Review.Bridging Supervised and Temporal Difference Learning with Q-Conditioned Maximization.
X. Lei, Z. Zhuang, S. Yang, S. Xu, Y. Luo, F. Shen, X. Zhang, D. Wang
Under Review.R-Genie: Reasoning-Guided Generative Image Editing.
D. Zhang, L. He, R. Yan, F. Shen, J. Tang
Under Review.V-Express: Conditional Dropout for Progressive Training of Portrait Video Generation.
C. Wang, K. Tian, J. Zhang, Y. Guan, F. Luo, F. Shen, Z. Jiang, Q. Gu, X. Han, W. Yang
Under Review.
(This code has received 2k stars.)
Accept Articles
Long-Term TalkingFace Generation via Motion-Prior Conditional Diffusion Model.
F. Shen, C. Wang, J. Gao, Q. Guo, J. Dang, J. Tang, T.-S. Chua
In International Conference on Machine Learning (ICML 2025).UniMC: Taming Diffusion Transformer for Unified Keypoint-Guided Multi-Class Image Generation.
Q. Guo, A. Zeng, D. Yue, C. Yang, Y. Cao, H. Guo, F. Shen, W. Liu, X. Liu, D. Xu
In International Conference on Machine Learning (ICML 2025).Visual Content Generation in the Era of Large Foundation Models.
L. Qu, F. Shen, Z. Zhou, J. Lyu, W. Wang, L. Jiang
In International Conference on Multimedia Retrieval (ICMR 2025).IMAGDressing-v1: Customizable Virtual Dressing.
F. Shen, X. Jiang, X. He, H. Ye, C. Wang, X. Du, Z. Li, J. Tang
In AAAI Conference on Artificial Intelligence (AAAI 2025).
(This code has received 1k stars.)Boosting Consistency in Story Visualization with Rich-Contextual Conditional Diffusion Models.
F. Shen, H. Ye, S. Liu, J. Zhang, C. Wang, X. Han, Y. Wei
In AAAI Conference on Artificial Intelligence (AAAI 2025).MSP-MVS: Multi-granularity Segmentation Prior Guided Multi-View Stereo.
Z. Yuan, C. Liu, F. Shen, Z. Li, J. Luo, T. Mao, Z. Wang
In AAAI Conference on Artificial Intelligence (AAAI 2025).DVP-MVS: Synergize Depth-Edge and Visibility Prior for Multi-View Stereo.
Z. Yuan, J. Luo, F. Shen, Z. Li, C. Liu, T. Mao, Z. Wang
In AAAI Conference on Artificial Intelligence (AAAI 2025).Exploring the Potential of Large Vision-Language Models for Unsupervised Text-Based Person Retrieval.
Z. Li, J. Li, Y. Shi, J. Chen, S. Huang, L. Tu, F. Shen, H. Ling
In AAAI Conference on Artificial Intelligence (AAAI 2025).FaceShot: Bring Any Character into Life.
J. Gao, Y. Sun, F. Shen, X. Jiang, Z. Xing, K. Chen, C. Zhao
In International Conference on Learning Representations (ICLR 2025).Ensembling Diffusion Models via Adaptive Feature Aggregation.
C. Wang, K. Tian, Y. Guan, F. Shen, Z. Jiang, Q. Gu, J. Zhang
In International Conference on Learning Representations (ICLR 2025).AS-Memory: Adaptive Sparse Memory Meeting Video-Language Models.
B. Wang, H. Song, J. Dang, F. Shen, H. Zhang, L. Wang, M. Xie, J. Liu, J. Weng
In IEEE International Conference on Multimedia & Expo (ICME 2025).SPEAK: Speech-Driven Pose and Emotion-Adjustable Talking Head Generation.
C. Cai, G. Guo, J. Li, J. Su, F. Shen, C. He, J. Xiao, Y. Chen, L. Dai, F. Zhu
In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2025).Towards Maximizing Semantic Coverage for Image-Text Retrieval.
J. Xu, Z. Liu, Z. Chen, F. Shen
In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2025).PrimeK-Net: Multi-scale Spectral Learning via Group Prime-Kernel Convolutional Neural Networks for Single Channel Speech Enhancement.
Z. Lin, J. Wang, R. Li, F. Shen, X. Xuan
In IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2025).IMAGPose: A unified conditional framework for pose-guided person generation.
F. Shen, J. Tang
In Neural Information Processing Systems (NeurIPS 2024).Triplet contrastive representation learning for unsupervised vehicle re-identification.
F. Shen, X. Du, L. Zhang, X. Shu, J. Tang
In ACM Transactions on Multimedia Computing, Communications and Applications (ACM TOMM 2024).Enhancing Aerial Object Detection with Selective Frequency Interaction Network.
W. Weng, M. Wei, J. Ren, F. Shen✉
In IEEE Transactions on Artificial Intelligence (IEEE TAI 2024).Exploring LR-FPN: Enhancing Remote Sensing Object Detection with Location Refined Feature Pyramid Network.
H. Li, R. Zhang, Y. Pan, J. Ren, F. Shen✉
In International Joint Conference on Neural Networks (IJCNN 2024).Exploring warping-guided features via adaptive latent diffusion model for virtual try-on.
B. Gao, J. Ren, F. Shen✉, M. Wei, Z. Huang
In IEEE International Conference on Multimedia & Expo (ICME 2024).Advancing pose-guided image synthesis with progressive conditional diffusion models.
F. Shen, H. Ye, J. Zhang, C. Wang, X. Han, W. Yang
In International Conference on Learning Representations (ICLR 2024).Pedestrian-specific bipartite-aware similarity learning for text-based person retrieval.
F. Shen, X. Shu, X. Du, J. Tang
In ACM International Conference on Multimedia (ACM MM 2023).GiT: Graph interactive transformer for vehicle re-identification.
F. Shen, Y. Xie, J. Zhu, X. Zhu, H. Zeng
In IEEE Transactions on Image Processing (IEEE TIP 2023).A rubust method for roof extraction and height estimation.
J. Hu, Z. Huang, F. Shen✉, D. He, Q. Xian
In International Geoscience and Remote Sensing Symposium (IGRSS 2023).A bag of tricks for fine-grained roof extraction.
J. Hu, Z. Huang, F. Shen✉, D. He, Q. Xian
In International Geoscience and Remote Sensing Symposium (IGRSS 2023).A sample‐proxy dual triplet loss function for object re‐identification.
H. Wu, F. Shen, J. Zhu, H. Zeng, X. Zhu, Z. Lei
In IET Image Processing (IET IP 2022).A novel multi-frequency coordinated module for sar ship detection.
C. Qiao, F. Shen, X. Wang, R. Wang, F. Cao, S. Zhao, C. Li
In International Conference on Tools with Artificial Intelligence (ICTAI 2022).A Competitive Method for Dog Nose-print Re-identification.
F. Shen, Z. Wang, Z. Huang, X. Fu, J. Chen, X. Du, J. Tang
In IEEE Computer Vision and Pattern Recognition Workshop (CVPRW 2022).An efficient multiresolution network for vehicle reidentification.
F. Shen, J. Zhu, X. Zhu, J. Huang, H. Zeng, Z. Lei, C. Cai
In IEEE Internet of Things Journal (IOTJ 2022).Enhancing part features via contrastive attention module for vehicle re-identification.
M. Li, M. Wei, X. He, F. Shen✉
In International Conference on Image Processing (ICIP 2022).Hsgm: A hierarchical similarity graph module for object re-identification.
F. Shen, X. Peng, L. Wang, X. Hao, M. Shu, Y. Wang
In IEEE International Conference on Multimedia & Expo (ICME 2022).Object Re-identification Using Teacher-Like and Light Students.
Y. Xie, H. Wu, F. Shen, J. Zhu, H. Zeng
In British Machine Vision Conference (BMVC 2022).A Competitive Method to VIPriors Object Detection Challenge.
F. Shen, X. He, M. Wei, Y. Xie
In European Conference on Computer Vision (ECCVW 2022).Exploring spatial significance via hybrid pyramidal graph network for vehicle re-identification.
F. Shen, J. Zhu, X. Zhu, Y. Xie, J. Huang
In IEEE Transactions on Intelligent Transportation Systems (IEEE TITS 2022).