Penghang Yu, Zhiyi Tan, Guanming Lu, and Bing-Kun Bao, LD4MRec: Simplifying and Powering Diffusion Model for Multimedia Recommendation, arXiv preprint arXiv:2309.15363, 2023.
Jiayi Zou, Chaofan Chen, Bing-Kun Bao, Changsheng Xu. DMC3: Dual-Modal Counterfactual Contrastive Construction for Egocentric Video Question Answering. ACM International Conference on Multimedia, 2025. (ACM MM'25)
Jie Fu, Bing-Kun Bao. Retaining Temporal Semantics and Relation Topologies for Continual Weakly-Supervised Audio-Visual Video Parsing. ACM International Conference on Multimedia, 2025. (ACM MM'25)
Yefei Sheng, Jie Wang, Ming Tao, Bing-Kun Bao. Gaussian: Dynamic Control with Discretized 3D View Modeling for Text-Driven 3D Gaussian Splatting Editing. ACM International Conference on Multimedia, 2025. (ACM MM'25)
Mengling Xu, Ming Tao, Bing-Kun Bao. Chain-of-Cooking: Cooking Process Visualization via Bidirectional Chain-of-Thought Guidance. ACM International Conference on Multimedia, 2025. (ACM MM'25)
Yiming Li, Xiaoshan Yang, Bing-Kun Bao* and Changsheng Xu. "Graph Prompts: Adapting Video Graph for Video Question Answering." Proceedings of the 34th International Joint Conference on Artificial Intelligence, 2025. (IJCAI'25)
Bowen Yuan, Sisi You, Bing-Kun Bao*. "DToMA: Training-free Dynamic Token MAnipulation for Long Video Understanding." Proceedings of the 34th International Joint Conference on Artificial Intelligence, 2025. (IJCAI'25)
Sisi You, Bowen Yuan, Bing-Kun Bao*. "SCVBench: A Benchmark with Multi-turn Dialogues for Story-Centric Video Understanding." Proceedings of the 34th International Joint Conference on Artificial Intelligence, 2025. (IJCAI'25)
Tianshan Liu, Kin-Man Lam, and Bing-Kun Bao. "A Memory-Assisted Knowledge Transferring Framework with Curriculum Anticipation for Weakly Supervised Online Activity Detection." International Journal of Computer Vision 133.4 (2025): 1940-1963.
Jiayi Zou, Gengyun Jia, Bing-Kun Bao. "Causal Debiasing for Visual Commonsense Reasoning." 2025 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Yiming Li, Miao Ji, Sisi You, Bing-Kun Bao. "Spatial-Temporal Prior Knowledge Guidance for Long-term Action Anticipation." IEEE International Conference on Multimedia & Expo (ICME) 2025.
Xuancheng Xu, Ming Tao, Bing-Kun Bao. "CLGC: Continuous Layout Guidance for Consistent Text-to-Video Editing." IEEE International Conference on Multimedia & Expo (ICME) 2025.
Sisi You, Jiachang Li, and Bing-Kun Bao. "Pro-MA: Progressively Margin-based Attribution in Pre-trained Vision-Language Models." IEEE MultiMedia (2025).
Penghang Yu, Zhiyi Tan, Guanming Lu, and Bing-Kun Bao, "Mind Individual Information! Principal Graph Learning for Multimedia Recommendation," AAAI Conference on Artificial Intelligence (AAAI) 2025 (Oral).
Ruizhi Pu, Gezheng Xu, Ruiyi Fang, Bing-Kun Bao, Charles Ling, Boyu Wang, "Leveraging Group Classification with Descending Soft Labeling for Deep Imbalanced Regression," AAAI Conference on Artificial Intelligence (AAAI) 2025 (Oral).
Qile Fan, Penghang Yu, Zhiyi Tan, Bing-Kun Bao, and Guanming Lu, "BeFA: A General Behavior-driven Feature Adapter for Multimedia Recommendation," AAAI Conference on Artificial Intelligence (AAAI) 2025.
MingCai Chen, Baoming Zhang, Zongbo Han, Yuntao Du, Wenyu Jiang, Yanmeng Wang, Shuai Feng, Bing-Kun Bao, "Test-Time Selective Adaptation for Uni-Modal Distribution Shift in Multi-Modal Data," International Conference on Machine Learning (ICML) 2025.
Yuyang Chang, Yifan Jiao, and Bing-Kun Bao. "SVSRD: Spatial Visual and Statistical Relation Distillation for Class-Incremental Semantic Segmentation." IEEE Transactions on Multimedia 2025.
Jian Zhong, Yifan Jiao, and Bing-Kun Bao. "Replay-Based Incremental Object Detection With Local Response Exploration." IEEE Transactions on Multimedia 2025.
Chaochao Niu, Ming Tao, and Bing-Kun Bao. SEMACOL: Semantic-enhanced Multi-scale Approach for Text-guided Grayscale Image Colorization. Pattern Recognition 160 (2025): 111203.
Mingjie Qiu, Zhiyi Tan, and Bing-Kun Bao, "MSGNN: Multi-scale Spatio-temporal Graph Neural Network for Epidemic Forecasting," Data Mining and Knowledge Discovery, vol. 38, no. 4, pp. 2348-2376, 2024.
Jie Fu, Junyu Gao, Bing-Kun Bao, and Changsheng Xu, "Multimodal Imbalance-aware Gradient Modulation for Weakly-Supervised Audio-visual Video Parsing," IEEE Transactions on Circuits and Systems for Video Technology, vol. 34, no. 6, pp. 4843-4856, 2024.
Yifan Jiao, Hantao Yao, Bing-Kun Bao, and Changsheng Xu, "Source-guided Target Feature Reconstruction for Cross-domain Classification and Detection," IEEE Transactions on Image Processing, vol. 33, pp. 2808-2822, 2024.
Bowen Yuan, Yefei Sheng, Bing-Kun Bao, Yi-Ping Phoebe Chen, and Changsheng Xu, "Semantic Distance Adversarial Learning for Text-to-image Synthesis," IEEE Transactions on Multimedia, vol. 26, pp. 1255-1266, 2024.
Mengqi Yuan, Gengyun Jia, and Bing-Kun Bao, "GPT-based Knowledge Guiding Network for Commonsense Video Captioning," IEEE Transactions on Multimedia, vol. 26, pp. 5147-5158, 2024.
Sisi You, Hantao Yao, Bing-Kun Bao, and Changsheng Xu, "Multi-object Tracking with Spatial-temporal Tracklet Association," ACM Transactions on Multimedia Computing, Communications, and Applications, vol. 20, no. 5, p. 129:1-129:21, 2024.
Yefei Sheng, Ming Tao, Jie Wang, and Bing-Kun Bao, "ISF-GAN: Imagine, Select, and Fuse with GPT-based Text Enrichment for Text-to-image Synthesis," ACM Transactions on Multimedia Computing, Communications, and Applications, vol. 20, no. 7, p. 222:1-222:17, 2024.
Shengrong Ling, Sisi You, and Bing-Kun Bao, "Two-stage Reasoning Network with Modality Decomposition for Text VQA," in Proceedings of the Multimedia Modeling Conference (MMM), 2024, pp. 127-140.
Ming Tao, Bing-Kun Bao, Hao Tang, Yaowei Wang, and Changsheng Xu, "StoryImager: A Unified and Efficient Framework for Coherent Story Visualization and Completion," in Proceedings of the European Conference on Computer Vision (ECCV), 2024.
Tianshan Liu, Kin-Man Lam, and Bing-Kun Bao. "Label Text-aided Hierarchical Semantics Mining for Panoramic Activity Recognition," in Proceedings of the 32nd ACM International Conference on Multimedia (ACM MM). 2024: 8139-8148.
Ming Tao, Bing-Kun Bao, Hao Tang, Yaowei Wang, and Changsheng Xu, "CoIn: A Lightweight and Effective Framework for Story Visualization and Continuation," in Proceedings of the 32nd ACM International Conference on Multimedia (ACM MM). 2024: 10659-10668.
Zhiyi Tan and Bing-Kun Bao. "A Novel Hybrid Epidemic Prediction Model based on Cross-Modal Information." IEEE MultiMedia (2024).
Tianshan Liu, Kin-Man Lam, and Bing-Kun Bao. "Injecting Text Clues for Improving Anomalous Event Detection From Weakly Labeled Videos." IEEE Transactions on Image Processing 2024.
Mengqi Yuan, Gengyun Jia, and Bing-Kun Bao. "Relation Inference Enhancement Network for Visual Commonsense Reasoning." IEEE Transactions on Multimedia 2024.
Nimbeshaho Thierry, Bing-Kun Bao, Zafar Ali, Zhiyi Tan, Ingabire Batamira Christ Chatelain, and Pavlos Kefalas, "PRM-KGED: Paper Recommender Model Using Knowledge Graph Embedding and Deep Neural Network," Applied Intelligence, vol. 53, no. 24, pp. 30482-30496, 2023.
Pengju Li, Zhiyi Tan, and Bing-Kun Bao, "Multiview Language Bias Reduction for Visual Question Answering," IEEE MultiMedia, vol. 30, no. 1, pp. 91-99, 2023.
Yingyuan Zhao, Zhiyi Tan, Bing-Kun Bao, and Zhengzheng Tu, "Centralized Sub-critic based Hierarchical-structured Reinforcement Learning for Temporal Sentence Grounding," Multimedia Systems, vol. 29, no. 4, pp. 2181-2191, 2023.
Nimbeshaho Thierry, Bing-Kun Bao, and Zafar Ali, "RAR-SB: Research Article Recommendation Using SciBERT with BiGRU," Scientometrics, vol. 128, no. 12, pp. 6427-6448, 2023.
Mengqi Yuan, Bing-Kun Bao, Zhiyi Tan, and Changsheng Xu, "Adaptive Text Denoising Network for Image Caption Editing," ACM Transactions on Multimedia Computing, Communications, and Applications, vol. 19, no. 1s, p. 41:1-41:18, 2023.
Ming Tao, Bing-Kun Bao, Hao Tang, Fei Wu, Longhui Wei, and Qi Tian, "DE-net: Dynamic Text-guided Image Editing Adversarial Networks," in Proceedings of the AAAI Conference on Artificial Intelligence (AAAI), 2023, pp. 9971-9979.
Ming Tao, Bing-Kun Bao, Hao Tang, and Changsheng Xu, "GALIP: Generative Adversarial CLIPs for Text-to-image Synthesis," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 14214-14223.
Sisi You, Hantao Yao, Bing-Kun Bao, and Changsheng Xu, "UTM: A Unified Multiple Object Tracking Model with Identity-aware Feature Enhancement," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2023, pp. 21876-21886.
Tiankui Fu, Bing-Kun Bao, and Xi Shao, "Multi-level Semantic Extraction Using Graph Pooling Network for Text Representation," in Proceedings of the International Conference on Image and Graphics (ICIG), 2023, pp. 72-83.
Huaize Dong, Yifan Jiao, and Bing-Kun Bao, "TPM: Two-stage Prediction Mechanism for Universal Adversarial Patch Defense," in Proceedings of the International Conference on Image and Graphics (ICIG), 2023, pp. 256-267.
Junjie Ye, Bing-Kun Bao, and Zhiyi Tan, "Multi-modal Context-aware Network for Scene Graph Generation," in Proceedings of the International Conference on Image and Graphics (ICIG), 2023, pp. 335-347.
Yuling Jiang, Yingyuan Zhao, and Bing-Kun Bao, "Recombination Samples Training for Robust Natural Language Visual Reasoning," in Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), 2023, pp. 564-569.
Bowen Yuan, Sisi You, and Bing-Kun Bao, "Self-PT: Adaptive Self-prompt Tuning for Low-resource Visual Question Answering," in Proceedings of the ACM Multimedia Conference (ACM MM), 2023, pp. 5089-5098.
Penghang Yu, Zhiyi Tan, Guanming Lu, and Bing-Kun Bao, "Multi-view Graph Convolutional Network for Multimedia Recommendation," in Proceedings of the ACM Multimedia Conference (ACM MM), 2023, pp. 6576-6585.
Bowen Yuan, Bairu Chen, Zhiyi Tan, Xi Shao, and Bing-Kun Bao, "Unbiased Feature Enhancement Framework for Cross-Modality Person Re-identification," Multimedia Systems, vol. 28, no. 3, pp. 749-759, 2022.
Zhenzhen Yang, Yongpeng Yang, Lu Fan, and Bing-Kun Bao, "Truncated γ Norm-based Low-rank and Sparse Decomposition," Multimedia Tools and Applications, vol. 81, no. 27, pp. 38279-38295, 2022.
Bin Han and Bing-Kun Bao, "River Channel Extraction in SAR Images Using Level Sets Driven by Symmetric Kullback-Leibler Distance," IEEE Transactions on Geoscience and Remote Sensing, vol. 60, pp. 1-16, 2022.
Jianyu Wang, Bing-Kun Bao, and Changsheng Xu, "DualVGR: A Dual-visual Graph Reasoning Unit for Video Question Answering," IEEE Transactions on Multimedia, vol. 24, pp. 3369-3380, 2022.
Ming Tao, Hao Tang, Fei Wu, Xiao-Yuan Jing, Bing-Kun Bao, and Changsheng Xu, "DF-GAN: A Simple and Effective Baseline for Text-to-Image Synthesis," in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), 2022, pp. 16494-16504. (Oral)
Yifan Jiao, Hantao Yao, and Changsheng Xu. "Dual Instance-consistent Network for Cross-domain Object Detection." IEEE Transactions on Pattern Analysis and Machine Intelligence 45.6 (2022): 7338-7352.
Hu Zhu, Zhongyang Wang, Taiyu Yan, Yu-Feng Yu, Lizhen Deng and Bing-Kun Bao, "A Parallel Multi-block Alternating Direction Method of Multipliers for Tensor Completion," IET Image Processing, vol. 15, no. 13, pp. 3053-3062, 2021.
Lizhen Deng, Guoxia Xu, Hu Zhu, and Bing-Kun Bao, "RoDeRain: Rotational Video Derain via Nonconvex and Nonsmooth Optimization," Mobile Networks and Applications (MONET), vol. 26, no. 1, pp. 57-66, 2021.
Zhenzhen Yang, Pengfei Xu, Yongpeng Yang, Bing-Kun Bao, "A Densely Connected Network Based on U-Net for Medical Image Segmentation," ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 17, no. 3, p. 89:1-89:14, 2021.
Xiangjun Shen, Jinghui Zhou, Zhongchen Ma, Bing-Kun Bao and Zhengjun Zha, "Cross-domain Object Representation via Robust Low-rank Correlation Analysis," ACM Transactions on Multimedia Computing, Communications, and Applications (TOMM), vol. 17, no. 4, p. 126:1-126:20, 2021.
Bairu Chen, Yibo Gan, and Bing-Kun Bao, "Multi-pose Facial Expression Recognition Based on Unpaired Images," in Proceedings of the International Conference on Image and Graphics (ICIG), 2021, pp. 374-385.
Zhaoquan Yuan, Xiao Peng, Xiao Wu, Bing-Kun Bao, and Changsheng Xu, "Meta-learning Causal Feature Selection for Stable Prediction," in Proceedings of the IEEE International Conference on Multimedia and Expo (ICME), 2021, pp. 1-6.
Lizhen Deng, Zhetao Zhou, Guoxia Xu, Hu Zhu and Bing-Kun Bao, "TV2++: a novel spatial-temporal total variation for super resolution with exponential-type norm," EURASIP Journal on Wireless Communications and Networking, vol. 2020, no. 1, p. 223, 2020.
Fudong Nian, Teng Li, Bing-Kun Bao and Changsheng Xu, "Relative coordinates constraint for face alignment," Neurocomputing, vol. 395, pp. 119-127, 2020.
Xiang-Jun Shen, Yuxuan Wang, Liangjun Wang, Sumet Mehta, Bing-Kun Bao and Jianping Fan, "Robust low rank representation via feature and sample scaling," Neurocomputing, vol. 409, pp. 431-442, 2020.
Wen-Ze Shao, Yun-Zhi Lin, Yuan-Yuan Liu, Li-Qian Wang, Qi Ge, Bing-Kun Bao and Hai-Bo Li, "Gradient-based discriminative modeling for blind image deblurring," Neurocomputing, vol. 413, pp. 305-327, 2020.
Xiang-Jun Shen, Si-Xing Liu, Bing-Kun Bao, Chun-Hong Pan, Zheng-Jun Zha and Jianping Fan, "A generalized least-squares approach regularized with graph embedding for dimensionality reduction," Pattern Recognition, vol. 98, 2020.
Wen-Ze Shao, Yuan-Yuan Liu, Lu-Yue Ye, Li-Qian Wang, Qi Ge, Bing-Kun Bao and Hai-Bo Li, "DeblurGAN+: Revisiting blind motion deblurring using conditional adversarial networks," Signal Processing, vol. 168, 2020.
Timothy Apasiba Abeo, Xiang-Jun Shen, Ernest Domanaanmwi Ganaa, Qian Zhu, Bing-Kun Bao and Zheng-Jun Zha, "Manifold alignment via global and local structures preserving PCA framework," IEEE Access, vol. 7, pp. 38123-38134, 2019.
Xi Shao, Jin Zhang, Bing-Kun Bao and Yang Xia, "Automatic scene recognition based on constructed knowledge space learning," IEEE Access, vol. 7, pp. 102902-102910, 2019.
Xi Shao, Guijin Tang, and Bing-Kun Bao, "Personalized travel recommendation based on sentiment-aware multimodal topic model," IEEE Access, vol. 7, pp. 113043-113052, 2019.
Timothy Apasiba Abeo, Xiang-Jun Shen, Jian-Ping Gou, Qi-Rong Mao, Bing-Kun Bao and Shuying Li, "Dictionary-induced least squares framework for multi-view dimensionality reduction with multi-manifold embeddings," IET Computer Vision, vol. 13, no. 2, pp. 97-108, 2019.
Wen-Ze Shao, Jing-Jing Xu, Long Chen, Qi Ge, Li-Qian Wang, Bing-Kun Bao, and Haibo Li, "On potentials of regularized Wasserstein generative adversarial networks for realistic hallucination of tiny faces," Neurocomputing, vol. 364, pp. 1-15, 2019.
Wen-Ze Shao, Bing-Kun Bao, and Haibo Li, "Enhancing blurred low-resolution images via exploring the potentials of learning-based super-resolution," International Journal of Pattern Recognition and Artificial Intelligence, vol. 33, no. 7, p. 1940007:1-1940007:21, 2019.
Timothy Apasiba Abeo, Xiang-Jun Shen, Bing-Kun Bao, Zheng-Jun Zha, and Jianping Fan, "A generalized multi-dictionary least squares framework regularized with multi-graph embeddings," Pattern Recognition, vol. 90, pp. 1-11, 2019.
Xuan Ma, Bing-Kun Bao, Lingling Yao and Changsheng Xu, "Multimodal latent factor model with language constraint for predicate detection," in Proceedings of the International Conference on Image Processing (ICIP), 2019, pp. 4454-4458.
Yuan-Yuan Liu, Lu-Yue Ye, Wen-Ze Shao, Qi Ge, Li-Qian Wang, Bing-Kun Bao and Haibo Li, "Adversarial representation learning for dynamic scene deblurring: A simple, fast and robust approach," in Proceedings of the International Conference on Image Processing (ICIP), 2019, pp. 4644-4648.
Peng Zhang, Li Su, Liang Li, Bing-Kun Bao, Pamela Cosman, GuoRong Li and Qingming Huang, "Training efficient saliency prediction models with knowledge distillation," in Proceedings of the ACM Multimedia Conference, 2019, pp. 512-520.
Junyi Wang, Bing-Kun Bao, and Changsheng Xu, "Sentiment-aware multi-modal recommendation on tourist attractions," in Proceedings of the Multimedia Modeling Conference (MMM), 2019, pp. 3-1
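Address: Computer Science Building, Nanjing University of Posts and Telecommunications (Xianlin Campus), No. 9 Wenyuan Road, Xianlin University Town, Qixia District, Nanjing, Jiangsu Province
Phone: 13813992640 (contact: Jia)
Email: bingkunbao@njupt.edu.cn (contact: Bao)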