Journal Paper
Jinchen Yao, Zhenbo Lu, Yunyao Mao, Wengang Zhou, and Houqiang Li, “Diffusion with Reinforcement Learning for Pedestrian Trajectory Prediction,” Accepted to IEEE Transactions on Intelligent Transportation Systems (TITS), September 2025.
Hanyue Tu, Siqi Wu, Li Li, Wengang Zhou, Yonghui Wang, Hao Feng, and Houqiang Li, “Multi-Scale Invertible Neural Network for Wide-Range Variable-Rate Learned Image Compression,” Accepted to IEEE Transactions on Multimedia (TMM), September 2025.
Zhiyang Guo, Wengang Zhou, Min Wang, Li Li, and Houqiang Li, “HandNeRF++: Modeling Animatable Interacting Hands with Neural Radiance Fields,” Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), May 2025.
Min Wang, Wengang Zhou, and Houqiang Li, “Revisit Weakly Supervised Hashing with Deep Multi-modal Foundation Models,” Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), May 2025.
Hao Feng, Wengang Zhou, Jiajun Deng, Qi Tian, and Houqiang Li, “DocScanner: Robust Document Image Rectification with Progressive Learning,” Accepted to International Journal of Computer Vision (IJCV), March 2025.
Keyi Zhou, Li Li, Wengang Zhou, Yonghui Wang, Hao Feng, and Houqiang Li, “LaneTCA: Enhancing Video Lane Detection with Temporal Context Aggregation,” Accepted to IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), March 2025.
Yunyao Mao, Jiajun Deng, Wengang Zhou, Zhenbo Lu, Wanli Ouyang, and Houqiang Li, “I2MD: 3D Action Representation Learning with Inter- and Intra-modal Mutual Distillation,” Accepted to International Journal of Computer Vision (IJCV), February 2025.
Xiaohan Lei, Min Wang, Wengang Zhou, and Houqiang Li, “GaussNav: Gaussian Splatting for Visual Navigation,” Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), January 2025.
Weilun Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Dong Chen, Lu Yuan, and Houqiang Li, “SinDiffusion: Learning a Diffusion Model from a Single Natural Image,” Accepted to IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), January 2025.
Min Wang, Wengang Zhou, Xin Yao, and Houqiang Li, “Adaptive Bit Selection for Scalable Deep Hashing,” Accepted to IEEE Transactions on Image Processing (TIP), January 2025.
Weilun Wang, Hezhen Hu, Wengang Zhou, Li Li, and Houqiang Li, “Pose-Guided Interacting Hand Image Generation,” Accepted to ACM Transactions on Multimedia Computing Communications and Applications (TOMM), December 2024.
Xuanqing Cao, Wengang Zhou, Qi Sun, Weilun Wang, Li Li, and Houqiang Li, “DISA: Disentangled Dual-Branch Framework for Affordance-Aware Human Insertion,” Accepted to ACM Transactions on Multimedia Computing Communications and Applications (TOMM), December 2024.
Mingxiao Feng, Yaodong Yang, Wengang Zhou, and Houqiang Li, “TIMAR: Transition-Informed Representation for Sample-Efficient Multi-Agent Reinforcement Learning,” Accepted to Neural Networks (NN), December 2024.
Zhiyang Guo, Wengang Zhou, Li Li, Min Wang, and Houqiang Li, “Motion-aware 3D Gaussian Splatting for Efficient Dynamic Scene Reconstruction,” Accepted to IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), November 2024.
Hao Feng, Qi Liu, Hao Liu, Jingqun Tang, Wengang Zhou, Houqiang Li, and Can Huang, “DocPedia: Unleashing the Power of Large Multimodal Model in the Frequency Domain for Versatile Document Understanding,” Accepted to SCIENCE CHINA Information Sciences, November 2024.
Youpeng Zhao, Yudong Lu, Jian Zhao, Wengang Zhou, and Houqiang Li, “DanZero+: Dominating the GuanDan Game through Reinforcement Learning,” IEEE Transactions on Games (ToG), 16(4): 914-926, December 2024.
Hanyue Tu, Li Li, Wengang Zhou, and Houqiang Li, “Toward On-Demand Transmission: Joint Feature and Image Coding With Reversible Neural Networks,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 34(10): 9620-9632, October 2024.
Yi Jiang, Mingxiao Feng, Wengang Zhou, Lin Liu, and Houqiang Li, “Recovering Permuted Sequential Features for Effective Reinforcement Learning,” Accepted to Neural Networks (NN), October 2024.
Jian Zhao, Mingyu Yang, Youpeng Zhao, Xunhan Hu, Wengang Zhou, and Houqiang Li, “MCMARL: Parameterizing Value Function via Mixture of Categorical Distributions for Multi-Agent Reinforcement Learning,” IEEE Transactions on Games (ToG), 16(3): 556-565, September 2024.
Hao Feng, Keyi Zhou, Wengang Zhou, Yufei Yin, Jiajun Deng, Qi Sun, and Houqiang Li, “Recurrent Generic Contour-based Instance Segmentation with Progressive Learning,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 34(9): 7947-7961, September 2024.
Youpeng Zhao, Jian Zhao, Xunhan Hu, Wengang Zhou, and Houqiang Li, “Full DouZero+: Improving DouDizhu AI by Opponent Modeling, Coach-guided Training and Bidding Learning,” IEEE Transactions on Games (ToG), 16(3): 518-529, September 2024.
Wengang Zhou, Jiajun Deng, Niculae Sebe, Qi Tian, Alan L. Yuille, Concetto Spampinato, and Zakia Hammal, “Guest Editorial Introduction to the Issue on Pre-Trained Models for Multi-Modality Understanding,” IEEE Transactions on Multimedia (TMM), 26: 8291-8296, August 2024.
Yonghui Wang, Shaokai Liu, Li Li, Wengang Zhou, and Houqiang Li, “SwinShadow: Shifted Window for Ambiguous Adjacent Shadow Detection,” Accepted to ACM Transactions on Multimedia Computing Communications and Applications (TOMM), August 2024.
Hao Feng, Wendi Wang, Shaokai Liu, Jiajun Deng, Wengang Zhou, and Houqiang Li, “DeepEraser: Deep Iterative Context Mining for Generic Text Eraser,” Accepted to IEEE Transactions on Multimedia (TMM), August 2024.
Yixuan Wang, Wengang Zhou, Jianmin Bao, Weilun Wang, Li Li, and Houqiang Li, “CLIP2GAN: Towards Bridging Text with the Latent Space of GANs,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 34(8): 6847-6859, August 2024.
Zhikai Chen, Fuchen Long, Zhaofan Qiu, Ting Yao, Wengang Zhou, Jiebo Luo, and Tao Mei, “Learning 3D Shape Latent for Point Cloud Completion,” IEEE Transactions on Multimedia (TMM), 26: 8717-8729, 2024.
Weichao Zhao, Wengang Zhou, Hezhen Hu, Min Wang, and Houqiang Li, “Self-Supervised Representation Learning with Spatial-Temporal Consistency for Sign Language Recognition,” IEEE Transactions on Image Processing (TIP), 33: 4188-4201, July 2024.
Shaokai Liu, Hao Feng, and Wengang Zhou, “Rethinking Supervision in Document Unwarping: A Self-consistent Flow-free Approach,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 34(6): 4817-4828, June 2024.
Yonghui Wang, Wengang Zhou, Yunyao Mao, and Houqiang Li, “Detect Any Shadow: Segment Anything for Video Shadow Detection,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 34(5): 3782-3794, May 2024.
Weichao Zhao, Hezhen Hu, Wengang Zhou, Yunyao Mao, Min Wang, and Houqiang Li, “MASA: Motion-aware Masked Autoencoder with Semantic Alignment for Sign Language Recognition,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 34(11): 10793-10804, November 2024.
Zhaokang Liao, Wengang Zhou, and Houqiang Li, “DaFIR: Distortion-aware Representation Learning for Fisheye Image Rectification,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 34(5): 3606-3618, May 2024.
Lin Liu, Junfeng An, Shanxin Yuan, Wengang Zhou, Houqiang Li, Yanfeng Wang, and Qi Tian, “Video Demoiréing With Deep Temporal Color Embedding and Video-Image Invertible Consistency,” IEEE Transactions on Multimedia (TMM), 26: 7386-7397, 2024.
Hao Feng, Shaokai Liu, Jiajun Deng, Wengang Zhou, and Houqiang Li, “Deep Unrestricted Document Image Rectification,” IEEE Transactions on Multimedia (TMM), 26: 6142-6154, 2024.
Jian Zhao, Xunhan Hu, Mingyu Yang, Wengang Zhou, Jiangcheng Zhu, and Houqiang Li, “CTDS: Centralized Teacher with Decentralized Student for Multi-Agent Reinforcement Learning,” IEEE Transactions on Games (ToG), 16(1): 140-150, March 2024.
Hui Wu, Min Wang, Wengang Zhou, and Houqiang Li, “Structure Similarity Preservation Learning for Asymmetric Image Retrieval,” IEEE Transactions on Multimedia (TMM), 26: 4693-4705, 2024.
Xinyue Huo, Lingxi Xie, Hengtong Hu, Wengang Zhou, Houqiang Li, and Qi Tian, “Domain-Agnostic Priors for Semantic Segmentation Under Unsupervised Domain Adaptation and Domain Generalization,” Accepted to International Journal of Computer Vision (IJCV), Feb. 2024.
Zheng Chen, Jian Zhao, Mingyu Yang, Wengang Zhou, and Houqiang Li, “Optimizing Camera Motion with MCTS and Target Motion Modeling in Multi-Target Active Object Tracking,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), 20(7): 1-19, July 2024.
Liping Bao, Longhui Wei, Wengang Zhou, Lin Liu, Lingxi Xie, Houqiang Li, and Qi Tian, “Multi-granularity Matching Transformer for Text-based Person Search,” IEEE Transactions on Multimedia (TMM), 26: 4281-4293, 2024.
Yongchao Du, Min Wang, Wengang Zhou, and Houqiang Li, “Progressive Similarity Preservation Learning for Deep Scalable Product Quantization,” IEEE Transactions on Multimedia (TMM), 26: 3034-3045, 2024.
Hezhen Hu, Junfu Pu, Wengang Zhou, Hang Fang, and Houqiang Li, “Prior-aware Cross Modality Augmentation Learning for Continuous Sign Language Recognition,” IEEE Transactions on Multimedia (TMM), 26: 593-606, 2024.
Min Wang, Wengang Zhou, Xin Yao, Qi Tian, and Houqiang Li, “Towards Codebook-free Deep Probabilistic Quantization for Image Retrieval,” IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 46(2): 626-640, 2024.
Weichao Zhao, Hezhen Hu, Wengang Zhou, Li Li, and Houqiang Li, “Exploiting Spatial-Temporal Context for Interacting Hand Reconstruction on Monocular RGB Video,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), 20(6): 1-18, June 2024.
Yonghui Wang, Wengang Zhou, Hao Feng, Li Li, and Houqiang Li, “Progressive Recurrent Network for Shadow Removal,” Computer Vision and Image Understanding (CVIU), 238: article 103861, January 2024.
Bingyi Feng, Mingxiao Feng, Minrui Wang, Wengang Zhou, and Houqiang Li, “Multi-Agent Hierarchical Graph Attention Reinforcement Learning for Grid-Aware Energy Management,” ZTE Communications, 21(3): 11–21, 2023.
Wendi Wang, Hao Feng, Wengang Zhou, Zhaokang Liao, and Houqiang Li, “Model-aware Pre-training for Radial Distortion Rectification,” IEEE Transactions on Image Processing (TIP), 32: 5764-5778, 2023.
Jiajun Deng, Zhengyuan Yang, Daqing Liu, Tianlang Chen, Wengang Zhou, Yanyong Zhang, Houqiang Li, and Wanli Ouyang, “TransVG++: End-to-End Visual Grounding with Language Conditioned Vision Transformer,” IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 45(11): 13636-13652, 2023.
Hezhen Hu, Weichao Zhao, Wengang Zhou, and Houqiang Li, “SignBERT+: Hand-model-aware Self-supervised Pre-training for Sign Language Understanding,” IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 45(9): 11221-11239, 2023.
Yongchao Du, Min Wang, Zhenbo Lu, Wengang Zhou, and Houqiang Li, “Weakly Supervised Hashing with Reconstructive Cross-modal Attention,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), 19(6): 1-19, 2023.
Longhui Wei, Lingxi Xie, Wengang Zhou, Houqiang Li, and Qi Tian, “Exploring the diversity and invariance in yourself for visual pre-training task,” Pattern Recognition (PR), 139: 109437, February 2023.
Weilun Wang, Wengang Zhou, Jianmin Bao, and Houqiang Li, “Coherent Image Animation using Spatial-Temporal Correspondence,” IEEE Transactions on Multimedia (TMM), 25: 3397-3408, 2023.
Min Wang, Wengang Zhou, Qi Tian, and Houqiang Li, “Deep Graph Convolutional Quantization Networks for Image Retrieval,” IEEE Transactions on Multimedia (TMM), 25: 2164-2175, 2023.
Jian Zhao, Weide Shu, Youpeng Zhao, Wengang Zhou, and Houqiang Li, “Improving Deep Reinforcement Learning with Mirror Loss,” IEEE Transactions on Games (ToG), 15(3): 337-347, 2023.
Yufei Yin, Jiajun Deng, Wengang Zhou, Li Li, and Houqiang Li, “FI-WSOD: Foreground Information Guided Weakly Supervised Object Detection,” IEEE Transactions on Multimedia (TMM), 25: 1890-1902, 2023.
Hezhen Hu, Junfu Pu, Wengang Zhou, and Houqiang Li, “Collaborative Multilingual Continuous Sign Language Recognition: A Unified Framework,” IEEE Transactions on Multimedia (TMM), 25: 7559-7570, 2023.
Xin Yao, Min Wang, Wengang Zhou, and Houqiang Li, “Hash Bit Selection with Reinforcement Learning for Image Retrieval,” IEEE Transactions on Multimedia (TMM), 25: 6678-6687, 2023.
Qiaokang Xie, Zhenbo Lu, Wengang Zhou, and Houqiang Li, “Improving Person Re-identification with Multi-cue Similarity Embedding and Propagation,” IEEE Transactions on Multimedia (TMM), 25: 6384-6396, 2023.
Yiheng Liu, Wengang Zhou, Qiaokang Xie, and Houqiang Li, “Unsupervised Person Re-Identification with Wireless Positioning under Weak Scene Labeling,” IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 45(4): 5282-5295, 2023.
Jinhua Zhu, Yingce Xia, Lijun Wu, Jiajun Deng, Wengang Zhou, Tao Qin, Tie-Yan Liu, and Houqiang Li, “Masked Contrastive Representation Learning for Reinforcement Learning,” IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), 45(3): 3421-3433, 2023.
Jinhua Zhu, Yingce Xia, Chang Liu, Lijun Wu, Shufang Xie, Yusong Wang, Tao Qin, Wengang Zhou, Houqiang Li, Haiguang Liu, and Tie-Yan Liu, “Direct Molecular Conformation Generation,” Transactions on Machine Learning Research (TMLR), October 2022.
Jian Zhao, Youpeng Zhao, Weixun Wang, Mingyu Yang, Xunhan Hu, Wengang Zhou, Jianye Hao, and Houqiang Li, “Coach-assisted Multi-Agent Reinforcement Learning Framework for Unexpected Crashed Agents,” Frontiers of Information Technology & Electronic Engineering, 23(7): 1032-1042, 2022.
Mao Xi, Yun Zhou, Zheng Chen, Wengang Zhou, and Houqiang Li, “Anti-distractor Active Object Tracking in 3D Environments,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 32(6): 3697-3707, 2022.
Yiheng Liu, Wengang Zhou, Mao Xi, Sanjing Shen, and Houqiang Li, “Multi-Modal Context Propagation for Person Re-Identification with Wireless Positioning,” IEEE Transactions on Multimedia (TMM), 24: 3060-3073, 2022.
Mao Xi, Wengang Zhou, Ning Wang, and Houqiang Li, “Learning Temporal-Correlated and Channel-Decorrelated Siamese Networks for Visual Tracking,” IEEE Transactions on Multimedia (TMM), 22: 2791-2803, 2022.
Min Wang, Wengang Zhou, Qi Tian, and Houqiang Li, “Deep Enhanced Weakly-Supervised Hashing with Iterative Tag Refinement,” IEEE Transactions on Multimedia (TMM), 24: 2779-2790, 2022.
Jian Zhao, Weizhen Qi, Wengang Zhou, Nan Duan, Ming Zhou, and Houqiang Li, “Conditional Sentence Generation and Cross-modal Reranking for Sign Language Translation,” IEEE Transactions on Multimedia (TMM), 24: 2662-2672, 2022.
Hao Zhou, Wengang Zhou, Yun Zhou, and Houqiang Li, “Spatial-Temporal Multi-Cue Network for Sign Language Recognition and Translation,” IEEE Transactions on Multimedia (TMM), 24: 768-779, 2022.
Xinyue Huo, Lingxi Xie, Longhui Wei, Xiaopeng Zhang, Xin Chen, Hao Li, Zijie Yang, Wengang Zhou, Houqiang Li, and Qi Tian, “Heterogeneous Contrastive Learning: Encoding Spatial Information for Compact Visual Representations,” IEEE Transactions on Multimedia (TMM), 24: 4224-4235, 2022.
Yuechen Wang, Jiajun Deng, Wengang Zhou, and Houqiang Li, “Weakly Supervised Temporal Adjacent Network for Language Grounding,” IEEE Transactions on Multimedia (TMM), 24: 3276-3286, 2022.
Jiajun Deng, Wengang Zhou, Yanyong Zhang, and Houqiang Li, “From Multi-View to Hollow-3D: Hallucinated Hollow-3D R-CNN for 3D Object Detection,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 31(12): 4722-4734, 2021.
Jiajun Deng, Yingwei Pan, Ting Yao, Wengang Zhou, Houqiang Li, and Tao Mei, “MINet: Meta-Learning Instance Identifiers for Video Object Detection,” IEEE Transactions on Image Processing (TIP), 30: 6879-6891, 2021.
Tianyu Zhao, Jian Zhao, Wengang Zhou, Yun Zhou and Houqiang Li, “State Representation Learning with Adjacent State Consistency Loss for Deep Reinforcement Learning,” IEEE Multimedia, 28(3): 117-127, 2021.
Yiheng Liu, Wengang Zhou, Jianzhuang Liu, Guojun Qi, Qi Tian, and Houqiang Li, “An End-to-End Foreground-Aware Network for Person Re-Identification,” IEEE Transactions on Image Processing (TIP), 30: 2060-2071, 2021.
Ning Wang, Wengang Zhou, and Houqiang Li, “Learning Diverse Models for End-to-end Ensemble Tracking,” IEEE Transactions on Image Processing (TIP), 30: 2220-2231, 2021.
Xiaodong Yang, Wengang Zhou, and Houqiang Li, “MCFD: A Hardware-efficient Noniterative Multicue Fusion Demosaicing Algorithm,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 31(9): 3575-3589, 2021.
Feng Lin, Wengang Zhou, Jiajun Deng, Bin Li, Yan Lu, and Houqiang Li, “Residual Refinement Network with Attribute Guidance for Precise Saliency Detection,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), 17(3): 81:1-81:19, 2021.
Zhandong Liu, Wengang Zhou, and Houqiang Li, “MFECN: Multi-level Feature Enhanced Cumulative Network for Scene Text Detection,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), 17(3): 78:1-78:22, 2021.
Hezhen Hu, Wengang Zhou, Junfu Pu, and Houqiang Li, “Global-local Enhancement Network for NMFs-aware Sign Language Recognition,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), 17(3): 80:1-80:19, 2021.
Yifan Zhang, Wengang Zhou, Min Wang, Qi Tian, and Houqiang Li, “Deep Relation Embedding for Cross-Modal Retrieval,” IEEE Transactions on Image Processing (TIP), 30: 617-627, 2021.
Jianbo Ouyang, Wengang Zhou, Min Wang, Qi Tian, and Houqiang Li, “Collaborative Image Relevance Learning for Visual Re-ranking,” IEEE Transactions on Multimedia (TMM), 23: 3646-3656, 2021.
Ning Wang, Wengang Zhou, Qi Tian, and Houqiang Li, “Cascaded Regression Tracking: Towards Online Hard Distractor Discrimination,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 31(4): 1580-1592, 2021.
Yiding Liu, Siyu Yang, Bin Li, Wengang Zhou, Jizheng Xu, Houqiang Li, and Yan Lu, “Affinity Derivation for Accurate Instance Segmentation,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), 17(1): 1-20, 2021.
Ning Wang, Wengang Zhou, Yibing Song, Chao Ma, Wei Liu, and Houqiang Li, “Unsupervised Deep Representation Learning for Real-Time Tracking,” International Journal of Computer Vision (IJCV), 129(2): 400-418, 2021.
Chengcheng Wei, Jian Zhao, Wengang Zhou, and Houqiang Li, “Semantic Boundary Detection with Reinforcement Learning for Continuous Sign Language Recognition,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), 31(3): 1138-1149, 2021.
Zhengguang Zhou, Wengang Zhou, Xutao Lv, Xuan Huang, Xiaoyu Wang, and Houqiang Li, “Progressive Learning of Low-Precision Networks for Image Classification,” IEEE Transactions on Multimedia (TMM), 23: 871-882, 2021.
Jiajun Deng, Yingwei Pan, Ting Yao, Wengang Zhou, Houqiang Li, and Tao Mei, “Single Shot Video Object Detector,” IEEE Transactions on Multimedia (TMM), 23: 846-858, 2021.
Qiaokang Xie, Wengang Zhou, Guojun Qi, Qi Tian, and Houqiang Li, “Progressive Unsupervised Person Re-identification by Tracklet Association with Spatio-Temporal Regularization,” IEEE Transactions on Multimedia (TMM), 23: 597-610, 2021.
Hezhen Hu, Wengang Zhou, Xingze Li, Ning Yan, and Houqiang Li, “MV2Flow: Learning Motion Representation for Fast Compressed Video Action Recognition,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), 16(3s): 1-19, 2021.
Ning Wang, Wengang Zhou, Yibing Song, Chao Ma, and Houqiang Li, “Real-Time Correlation Tracking via Joint Model Compression and Transfer,” IEEE Transactions on Image Processing (TIP), 29: 6123-6135, 2020.
Feng Lin, Bin Li, Wengang Zhou, Houqiang Li, and Yan Lv, “Single Stage Instance Segmentation,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), 16(3): 86:1-86:19, 2020.
Zhandong Liu, Wengang Zhou, and Houqiang Li, “AB-LSTM: Attention-Based Bidirectional LSTM Model for Scene Text Detection,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), 15(4): 1-23, 2020.
Min Wang, Wengang Zhou, Qi Tian, and Houqiang Li, “Neighborhood Pyramid Preserving Hashing,” IEEE Transactions on Multimedia (TMM), 22(6): 1507-1518, 2020.
Dan Guo, Wengang Zhou, Anyang Li, Houqiang Li, and Meng Wang, “Hierarchical Recurrent DeepFusion using Adaptive Clip Summarization for Sign Language Translation,” IEEE Transactions on Image Processing (TIP), 29: 1575-1590, 2019.
Min Wang, Wengang Zhou, Qi Tian, and Houqiang Li, “Deep Scalable Supervised Quantization by Self-Organizing Map,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), vol. 15, no. 3, pp. 1-18, 2019.
Jianglei Huang, Wengang Zhou, and Houqiang Li, “Exploiting weak mask representation with convolutional neural networks for accurate object tracking,” Multimedia Tools and Applications (MTA), vol. 78, no. 15, pp. 20961-20985, 2019.
Zhandong Liu, Wengang Zhou, and Houqiang Li, “Scene Text Detection with Fully Convolutional Neural Networks,” Multimedia Tools and Applications (MTA), vol. 78, no. 13, pp. 18205-18227, 2019.
Kai Zhang, Wengang Zhou, Shaoyan Sun, and Bin Li, “Multiple Complementary Inverted Indexing Based on Multiple Metrics,” Multimedia Tools and Applications (MTA), vol. 78, no. 6, pp. 7727-7747, 2019.
Chao Xie, Ning Wang, Wengang Zhou, Weiping Li, and Houqiang Li, “Multi-Tracker Fusion via Adaptive Outlier Detection,” Multimedia Tools and Applications (MTA), vol. 78, no. 2, pp. 2227-2250, 2019.
Ning Wang, Wengang Zhou, and Houqiang Li, “Reliable Re-detection for Long-Term Tracking,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 29, no. 3, pp. 730-743, 2019.
Jie Huang, Wengang Zhou, Houqiang Li, and Weiping Li, “Attention based 3D-CNNs for Large-Vocabulary Sign Language Recognition,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 29, no. 9, pp. 2822-2832, 2018.
Shaoyan Sun, Wengang Zhou, Qi Tian, Ming Yang, and Houqiang Li, “Assessing Image Retrieval Quality at the First Glance,” IEEE Transactions on Image Processing (TIP), vol. 27, no. 12, pp. 6124-6134, 2018.
Yue Lv, Wengang Zhou, Qi Tian, Shaoyan Sun, and Houqiang Li, “Retrieval Oriented Deep Feature Learning with Complementary Supervision Mining,” IEEE Transactions on Image Processing (TIP), vol. 27, no. 10, pp. 4945-4957, 2018.
Min Wang, Wengang Zhou, Qi Tian, and Houqiang Li, “A General Framework for Linear Distance Preserving Hashing,” IEEE Transactions on Image Processing (TIP), vol. 27, no. 2, pp. 907-922, 2018.
Wengang Zhou, Houqiang Li, Jian Sun, and Qi Tian, “Collaborative Index Embedding for Image Retrieval,” IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 40, no. 5, pp. 1154-1166, 2018.
Dan Guo, Wengang Zhou, Houqiang Li, and Meng Wang, “Online Early-Late Fusion Based on Adaptive HMM for Sign Language Recognition,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), vol. 14, no. 1, Article 8, Dec. 2017.
Shaoyan Sun, Ying Li, Wengang Zhou, Qi Tian, and Houqiang Li, “Local Residual Similarity for Image Re-ranking,” Information Sciences, vol. 417, pp. 143-153, 2017.
Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, Weiyao Lin, and Qi Tian, “Picking Neural Activations for Fine-grained Localization and Description,” IEEE Transactions on Multimedia (TMM), vol. 19, no. 12, pp. 2736-2750, 2017.
Qingbo Lu, Wengang Zhou, and Houqiang Li, “Similar Reference Image Quality Assessment: A New Database and A Trial with Local Feature Matching,” Sensing and Imaging, vol. 17, no. 1, pp. 1-20, November 2016.
Qingbo Lu, Wengang Zhou, and Houqiang Li, “A No-Reference Image Sharpness Metric Based on Structural Information Using Sparse Representation,” Information Sciences, vol. 369, pp. 334-346, 2016.
Zhanning Gao, Jianru Xue, Wengang Zhou, Shanmin Pang, and Qi Tian, “Democratic Diffusion Aggregation for Image Retrieval,” IEEE Transactions on Multimedia (TMM), vol. 18, no. 8, pp. 1661-1674, Aug. 2016.
Qingbo Lu, Wengang Zhou, Lu Fang, and Houqiang Li, “Robust Blur Kernel Estimation for License Plate Images from Fast Moving Vehicles,” IEEE Transactions on Image Processing (TIP), vol. 25, no. 5, pp. 2311-2323, Feb. 2016.
Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, and Qi Tian, “Fused One-vs-All Features with Semantic Alignments for Fine-Grained Visual Categorization,” IEEE Transactions on Image Processing (TIP), vol. 25, no. 2, pp. 878-892, 2016.
Xingyang Cai, Wengang Zhou, Lei Wu, Jiebo Luo, and Houqiang Li, “Effective Active Skeleton Representation for Low Latency Human Action Recognition,” IEEE Transactions on Multimedia (TMM), vol. 18, no. 2, pp. 141-154, 2016.
Wengang Zhou, Ming Yang, Xiaoyu Wang, Houqiang Li, Yuanqing Lin, and Qi Tian, “Scalable Feature Matching by Dual Cascaded Scalar Quantization for Image Retrieval,” IEEE Transactions on Pattern Analysis and Machine Intelligence (TPAMI), vol. 38, no. 1, pp. 159-171, 2016.
Zhen Liu, Houqiang Li, Wengang Zhou, and Qi Tian, “Uniforming residual vector distribution for distinctive image representation,” IEEE Transactions on Circuits and Systems for Video Technology (TCSVT), vol. 26, no. 2, pp. 375-384, 2016.
Shaoyan Sun, Wengang Zhou, Houqiang Li, and Qi Tian, “Scalable Object Retrieval with Compact Image Representation from Generic Object Regions,” ACM Transactions on Multimedia Computing Communications and Applications (TOMM), vol. 12, no. 29, 2016.
Lingxi Xie, Qi Tian, Wengang Zhou, and Bo Zhang, “Heterogeneous Graph Propagation for Large-Scale Web Image Search,” IEEE Transactions on Image Processing (TIP), vol. 24, no. 1, pp. 4287-4298, 2015.
Wengang Zhou, Houqiang Li, Richang Hong, Yijuan Lu, and Qi Tian, “BSIFT: towards Data-Independent Codebook for Large Scale Image Search,” IEEE Transactions on Image Processing (TIP), vol. 24, no. 3, pp. 967-979, 2015.
Zhen Liu, Houqiang Li, Wengang Zhou, and Qi Tian, “Uniting Keypoints: Local Visual Information Fusion for Large Scale Image Search,” IEEE Transactions on Multimedia (TMM), vol. 17, no. 4, pp. 538-548, 2015.
Wengang Zhou, Houqiang Li, Yijuan Lu, and Qi Tian, “Visual Word Expansion and BSIFT Verification for Large Scale Image Search,” Multimedia System Journal (MMSJ), vol. 21, no. 3, pp. 245-254, 2015.
Wengang Zhou, Houqiang Li, and Qi Tian, “Multimedia content-based visual retrieval,” Pages 383-416, Chapter 12, Volume 5, Academic Press Library in Signal Processing: Image and Video Compression and Multimedia, Elsevier, June, 2014. (Book Chapter)
Zhen Liu, Houqiang Li, Liyan Zhang, Wengang Zhou, and Qi Tian, “Cross-Indexing of Binary SIFT Codes for Large-Scale Image Search,” IEEE Transactions on Image Processing (TIP), vol. 23, no. 5, pp. 2047-2057, 2014.
Wengang Zhou, Ming Yang, Houqiang Li, Xiaoyu Wang, Yuanqing Lin, and Qi Tian, “Towards Codebook-free: Scalable Cascaded Hashing for Mobile Image Search,” IEEE Transactions on Multimedia (TMM), vol. 16, no. 3, pp. 601-611, 2014.
Zhen Liu, Houqiang Li, Wengang Zhou, Ruizhen Zhao, and Qi Tian, “Contextual Hashing for Large-scale Image Search,” IEEE Transactions on Image Processing (TIP), vol. 23, no. 4, pp. 1606-1614, 2014.
Wengang Zhou, Houqiang Li, Yijuan Lu, and Qi Tian, “Encoding Spatial Context for Large Scale Partial-Duplicate Web Image Retrieval,” Journal of Computer Science and Technology (JCST), vol. 29, no. 5, pp. 837-848, 2014.
Lingxi Xie, Qi Tian, Wengang Zhou, and Bo Zhang, “Fast and Accurate Near-duplicate Image Search with Affinity Propagation on the ImageWeb,” Journal of Computer Vision and Image Understanding (CVIU), vol. 124, pp. 31-41, 2014.
Wengang Zhou, Houqiang Li, Yijuan Lu, and Qi Tian, “SIFT Match Verification by Geometric Coding for Large-scale Partial-Duplicate Web Image Search,” ACM Transactions on Multimedia Computing, Communications and Applications (TOMCCAP), vol. 9, no. 1, article 4, February, 2013.
Wengang Zhou, Houqiang Li, Yijuan Lu, and Qi Tian, “Principal Visual Word Discovery for Automatic License Plate Detection,” IEEE Transactions on Image Processing (TIP), vol. 21, no. 6, pp. 4269-4279, 2012.
Qi Tian, Shiliang Zhang, Wengang Zhou, Rongrong Ji, et al., “Building Descriptive and Discriminative Visual Codebook for Large-scale Image Applications,” International Journal of Multimedia Tools and Applications, vol. 51, no. 2, pp. 441-477, 2011.
Shiliang Zhang, Qi Tian, Gang Hua, Wengang Zhou, Qingming Huang, Houqiang Li, and Wen Gao, “Modeling spatial and semantic cues for large-scale near-duplicated image retrieval,” Journal of Computer Vision and Image Understanding, vol. 115, no. 3, pp. 403-414, 2011.
Wengang Zhou, Qi Tian, Yijuan Lu, and Houqiang Li, “Latent visual context learning for web image applications,” Pattern Recognition, vol. 44, no. 10-11, pp. 2263-2273, 2011.
Kaihua Zhang, Lei Zhang, Huihui Song, and Wengang Zhou, “Active contours with selective local or global segmentation: A new formulation and level set method,” Image and Vision Computing, vol. 28, no. 4, pp. 668-676, 2010.
Kaihua Zhang, S. Xu, Wengang Zhou, and Bo Liu, “Active contours based on image Laplacian fitting energy,” Chinese Journal of Electronics (English version), vol. 18, no. 2, pp. 281-284, 2009.
Wengang Zhou, Houqiang Li, and Xiaobo Zhou, “3D neuron dendritic spine detection and dendrite reconstruction,” International Journal of Computer Aided Engineering and Technology, vol. 1, no. 4, pp. 516-531, 2009.
Kaihua Zhang, Wengang Zhou, Zhen Zhang, and Xiaojuan Zheng, “Improved CV active contour model,” Opto-Electronic Engineering, vol. 35, no. 12, 2008.
Conference Paper
Jianfeng Cai, Jiale Hong, Zongmeng Zhang, Wengang Zhou, zhannianji, Houqiang Li, “Mitigating Hallucination in VideoLLMs via Temporal-Aware Activation Engineering,” Annual Conference on Neural Information Processing Systems (NeurIPS), Dec. 2025.
Jiahao Wang, Weiye Xu, Aijun Yang, Wengang Zhou, Lewei Lu, Houqiang Li, Xiaohua Wang, Jinguo Zhu, “Enhancing the Outcome Reward-based RL Training of MLLMs with Self-Consistency Sampling,” Annual Conference on Neural Information Processing Systems (NeurIPS), Dec. 2025.
Dongnan Gui, Xun Guo, Wengang Zhou, Yan Lu, “Image as a World: Generating Interactive World from Single Image via Panoramic Video Generation,” Annual Conference on Neural Information Processing Systems (NeurIPS), Dec. 2025.
Xiao Cui, Yulei Qin, Wengang Zhou, Hongsheng Li, Houqiang Li, “Optimizing Distributional Geometry Alignment with Optimal Transport for Generative Dataset Distillation,” Annual Conference on Neural Information Processing Systems (NeurIPS), Dec. 2025.
Liang Qin, Min Wang, Peiwei Li, Wengang Zhou, and Houqiang Li, “Aligning Global Semantics and Local Textures in Generative Video Enhancement,” International Conference on Computer Vision (ICCV), October 2025.
Zhikai Chen, Fuchen Long, Zhaofan Qiu, Ting Yao, Wengang Zhou, Jiebo Luo, and Tao Mei, “Aligning Global Semantics and Local Textures in Generative Video Enhancement,” International Conference on Computer Vision (ICCV), October 2025.
Weiqi Wang, Wengang Zhou, Zongmeng Zhang, Jie Zhao, Houqiang Li, “Controllable Style Arithmetic with Language Models,” Accepted to Annual Meeting of the Association for Computational Linguistics (ACL), May 2025.
Zongmeng Zhang, Wengang Zhou, Jie Zhao, and Houqiang Li, “Robust Multimodal Large Language Models Against Modality Conflicts,” Accepted to International Conference on Machine Learning (ICML), May 2025.
Yufei Yin, Lechao Cheng, Wengang Zhou, Jiajun Deng, Zhou Yu, and Houqiang Li, “Self-Classification Enhancement and Correction for Weakly Supervised Object Detection,” Accepted to International Joint Conference on Artificial Intelligence (IJCAI), April 2025.
Zhiyang Guo, Jinxu Xiang, Kai Ma, Wengang Zhou, Houqiang Li, and Ran Zhang, “Make-It-Animatable: An Efficient Framework for Authoring Animation-Ready 3D Characters,” Accepted to IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
Xiao Cui, Yulei Qin, Wengang Zhou, Hongsheng Li, and Houqiang Li, “OPTICAL: Leveraging Optimal Transport for Contribution Allocation in Dataset Distillation,” Accepted to IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
Dongnan Gui, Xun Guo, Wengang Zhou, and Yan Lu, “I2VGuard: Safeguarding Images against Misuse in Diffusion-based Image-to-Video Models,” Accepted to IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
Longtao Jiang, Zhendong Wang, Jianmin Bao, Wengang Zhou, Dongdong Chen, Lei Shi, Dong Chen, and Houqiang Li, “SmartEraser: Remove Anything from Images using Masked-Region Guidance,” Accepted to IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025.
Zhendong Wang, Jianmin Bao, Shuyang Gu, Dong Chen, Wengang Zhou, and Houqiang Li, “DesignDiffusion: High-Quality Text-to-Design Image Generation with Diffusion Models,” Accepted to IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2025. (Oral Paper)
Zecheng Li, Wengang Zhou, Weichao Zhao, Kepeng Wu, Hezhen Hu, and Houqiang Li, “Uni-Sign: Toward Unified Sign Language Understanding at Scale,” Accepted to International Conference on Learning Representations (ICLR), January 2025.
Xiao Cui, Mo Zhu, Yulei Qin, Liang Xie, Wengang Zhou, and Houqiang Li, “Multi-Level Optimal Transport for Universal Cross-Tokenizer Knowledge Distillation on Language Models,” Accepted to AAAI Conference on Artificial Intelligence (AAAI), 2025. (Oral Paper)
Weichao Zhao, Hao Feng, Qi Liu, Jingqun Tang, Shu Wei, Binghong Wu, Lei Liao, Yongjie Ye, Hao Liu, Wengang Zhou, Houqiang Li, and Can Huang, “TabPedia: Towards Comprehensive Visual Table Understanding with Concept Synergy,” Accepted to Advances in Neural Information Processing Systems (NeurIPS), September 2024.
Zongmeng Zhang, Jinhua Zhu, Wengang Zhou, Xiang Qi, Peng Zhang, and Houqiang Li, “BoolQuestions: Does Dense Retrieval Understand Boolean Logic in Language?” Accepted to Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), September 2024.
Longtao Jiang, Min Wang, Zecheng Li, Yao Fang, Wengang Zhou, and Houqiang Li, “SEDS: Semantically Enhanced Dual-Stream Encoder for Sign Language Retrieval,” Accepted to ACM International Conference on Multimedia (ACM MM), July 2024.
Weiye Xu, Min Wang, Wengang Zhou, and Houqiang Li, “P-RAG: Progressive Retrieval Augmented Generation for Planning on Embodied Everyday Task,” Accepted to ACM International Conference on Multimedia (ACM MM), July 2024.
Yong Wang, Mingxiao Feng, Haolin Song, Wengang Zhou, and Houqiang Li, “Temporal State Prediction and Sequence Recovery for Multi-Agent Reinforcement Learning,” Accepted to International Conference on Neural Information Processing (ICONIP), July 2024.
Huijie Yao, Wengang Zhou, Hao Zhou, and Houqiang Li, “Semi-Supervised Spoken Language Glossification,” Accepted to Annual Meeting of the Association for Computational Linguistics (ACL), May 2024.
Zongmeng Zhang, Yufeng Shi, Jinhua Zhu, Wengang Zhou, Xiang Qi, Peng Zhang, and Houqiang Li, “Trustworthy Alignment of Retrieval-Augmented Large Language Models via Reinforcement Learning,” Accepted to International Conference on Machine Learning (ICML), 2024.
Xiaoyu Qiu, Yuechen Wang, Jiaxin Shi, Wengang Zhou, and Houqiang Li, “Cross-Lingual Transfer for Natural Language Inference via Multilingual Prompt Translator,” Accepted to IEEE International Conference on Multimedia and Expo (ICME), 2024.
Zhikai Chen, Fuchen Long, Zhaofan Qiu, Ting Yao, Wengang Zhou, Jiebo Luo, and Tao Mei, “Learning Spatial Adaptation and Temporal Coherence in Diffusion Models for Video Super-Resolution,” Accepted to IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
Xiaohan Lei, Min Wang, Wengang Zhou, Li Li, and Houqiang Li, “Instance-aware Exploration-Verification-Exploitation for Instance ImageGoal Navigation,” Accepted to IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2024.
Yue Zhang, Yaodong Yang, Zhenbo Lu, Wengang Zhou, and Houqiang Li, “Remember the Past for Better Future: Memory-Augmented Offline RL,” Accepted to International Joint Conference on Neural Networks (IJCNN), 2024.
Yongchao Du, Min Wang, Wengang Zhou, Shuping Hui, and Houqiang Li, “Image2Sentence based Asymmetrical Zero-shot Composed Image Retrieval,” Accepted to International Conference on Learning Representations (ICLR), Feb. 2024. (Spotlight Paper)
Jiaheng Feng, Mingxiao Feng, Haolin Song, Wengang Zhou, and Houqiang Li, “SUF: Stabilized Unconstrained Fine-tuning for Offline-to-Online Reinforcement Learning,” AAAI Conference on Artificial Intelligence (AAAI), Feb. 20-27, 2024.
Yufei Yin, Hao Chen, Wengang Zhou, Jiajun Deng, Haiming Xu, and Houqiang Li, “Revisiting Open-Set Panoptic Segmentation,” AAAI Conference on Artificial Intelligence (AAAI), Feb. 20-27, 2024.
Mingyu Yang, Yaodong Yang, Zhenbo Lu, Wengang Zhou, and Houqiang Li, “Hierarchical Multi-Agent Skill Discovery,” Advances in Neural Information Processing Systems (NeurIPS), Dec. 10-16, 2023.
Youpeng Zhao, Yaodong Yang, Zhenbo Lu, Wengang Zhou, and Houqiang Li, “Multi-Agent First Order Constrained Optimization in Policy Space,” Advances in Neural Information Processing Systems (NeurIPS), Dec. 10-16, 2023.
Xunhan Hu, Jian Zhao, Wengang Zhou, Ruili Feng, and Houqiang Li, “DIFFER: Decomposing Individual Reward for Fair Experience Replay in Multi-Agent Reinforcement Learning,” Advances in Neural Information Processing Systems (NeurIPS), Dec. 10-16, 2023.
Yunyao Mao, Jiajun Deng, Wengang Zhou, Li Li, Yao Fang, and Houqiang Li, “CLIP4HOI: Towards Adapting CLIP for Practical Zero-Shot HOI Detection,” Advances in Neural Information Processing Systems (NeurIPS), Dec. 10-16, 2023.
Mingxuan Ye, Yufei Kuang, Jie Wang, Rui Yang, Wengang Zhou, Houqiang Li, and Feng Wu, “State Sequences Prediction via Fourier Transform for Representation Learning,” Advances in Neural Information Processing Systems (NeurIPS), Dec. 10-16, 2023.
Yuechen Wang, Wengang Zhou, Zhenbo Lu, and Houqiang Li, “Text-Only Training for Visual Storytelling,” ACM International Conference on Multimedia (MM), Oct. 28-Nov. 3, 2023.
Yunyao Mao, Jiajun Deng, Wengang Zhou, Yao Fang, Wanli Ouyang, and Houqiang Li, “Masked Motion Predictors are Strong 3D Action Representation Learners,” International Conference on Computer Vision (ICCV), Oct. 2-6, 2023.
Zhendong Wang, Jianmin Bao, Wengang Zhou, Weilun Wang, Hezhen Hu, Hong Chen, and Houqiang Li, “DIRE for Diffusion-Generated Image Detection,” International Conference on Computer Vision (ICCV), Oct. 2-6, 2023.
Xinyue Huo, Lingxi Xie, Wengang Zhou, Houqiang Li, and Qi Tian, “Focus on Your Target: A Dual Teacher-Student Framework for Domain-adaptive Semantic Segmentation,” International Conference on Computer Vision (ICCV), Oct. 2-6, 2023.
Hao Feng, Wendi Wang, Jiajun Deng, Wengang Zhou, Li Li, Houqiang Li, and Qi Tian, “SimFIR: A Simple Framework for Fisheye Image Rectification with Self-supervised Representation Learning,” International Conference on Computer Vision (ICCV), Oct. 2-6, 2023.
Huijie Yao, Wengang Zhou, Hao Feng, Hezhen Hu, Hao Zhou, and Houqiang Li, “Sign Language Translation with Iterative Prototype,” International Conference on Computer Vision (ICCV), Oct. 2-6, 2023.
Yufei Yin, Jiajun Deng, Wengang Zhou, Li Li, and Houqiang Li, “Cyclic-Bootstrap Labeling for Weakly Supervised Object Detection,” International Conference on Computer Vision (ICCV), Oct. 2-6, 2023.
Haolin Song, Mingxiao Feng, Wengang Zhou, and Houqiang Li, “MA2CL: Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning,” International Joint Conference on Artificial Intelligence (IJCAI), Aug. 19-25, 2023.
Jinhua Zhu, Yingce Xia, Lijun Wu, Shufang Xie, Wengang Zhou, Tao Qin, Houqiang Li, and Tie-Yan Liu, “Dual-view Molecular Pre-training,” SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), Aug. 6-10, 2023.
Zongmeng Zhang, Wengang Zhou, Jiaxin Shi, and Houqiang Li, “Hybrid and Collaborative Passage Reranking,” Findings of the Association for Computational Linguistics (ACL Findings), July 9-14, 2023.
Xunhan Hu, Jian Zhao, Youpeng Zhao, Wengang Zhou, and Houqiang Li, “Q-SAT: Value Factorization with Self-Attention for Deep Multi-Agent Reinforcement Learning,” International Joint Conference on Neural Networks (IJCNN), June 18-23, 2023.
Zeyu Fang, Jian Zhao, Wengang Zhou, and Houqiang Li, “Implementing First-Person Shooter Game AI in WILD-SCAV with Rule-Enhanced Deep Reinforcement Learning,” IEEE Conference on Games (COG), Aug. 21-24, 2023.
Jiale Han, Mingxiao Feng, Wengang Zhou, and Houqiang Li, “Sample Efficient Reinforcement Learning with Double Importance Sampling Weight Clipping,” IEEE Conference on Games (COG), Aug. 21-24, 2023.
Yudong Lu, Jian Zhao, Youpeng Zhao, Wengang Zhou, and Houqiang Li, “DanZero: Mastering GuanDan Game with Reinforcement Learning,” IEEE Conference on Games (COG), Aug. 21-24, 2023.
Junjie Lin, Yuhao Gong, Jian Zhao, Wengang Zhou, and Houqiang Li, “Mastering Curling with RL-revised Decision Tree,” IEEE Conference on Games (COG), Aug. 21-24, 2023.
Dong Xi, Wengang Zhou, and Houqiang Li, “Robust Person Re-Identification with Wireless Signals,” IEEE International Conference on Multimedia and Expo (ICME), July 10-14, 2023.
Shaokai Liu, Hao Feng, Wengang Zhou, Houqiang Li, Cong Liu, and Feng Wu, “DocMAE: Document Image Rectification via Self-supervised Representation Learning,” IEEE International Conference on Multimedia and Expo (ICME), July 10-14, 2023.
Hui Wu, Min Wang, Wengang Zhou, Zhenbo Lu, and Houqiang Li, “Asymmetric Feature Fusion for Image Retrieval,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 18-22, 2023.
Zhiyang Guo, Wengang Zhou, Min Wang, Li Li, and Houqiang Li, “HandNeRF: Neural Radiance Fields for Animatable Interacting Hands,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 18-22, 2023.
Zhendong Wang, Jianmin Bao, Wengang Zhou, Weilun Wang, and Houqiang Li, “AltFreezing for More General Video Face Forgery Detection,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 18-22, 2023.
Zhikai Chen, Fuchen Long, Zhaofan Qiu, Ting Yao, Wengang Zhou, Jiebo Luo, and Tao Mei, “AnchorFormer: Point Cloud Completion from Discriminative Nodes,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 18-22, 2023.
Hui Wu, Min Wang, Wengang Zhou, and Houqiang Li, “Rank Preserving Framework for Asymmetric Image Retrieval,” International Conference on Learning Representations (ICLR), May 1-5, 2023.
Jinhua Zhu, Kehan Wu, Bohan Wang, Yingce Xia, Shufang Xie, Qi Meng, Lijun Wu, Tao Qin, Wengang Zhou, Houqiang Li, and Tie-Yan Liu, “O-GNN: incorporating ring priors into molecular modeling,” International Conference on Learning Representations (ICLR), May 1-5, 2023.
Jinhua Zhu, Yue Wang, Lijun Wu, Tao Qin, Wengang Zhou, Tie-Yan Liu, and Houqiang Li, “Making Better Decision by Directly Planning in Continuous Control,” International Conference on Learning Representations (ICLR), May 1-5, 2023.
Yufeng Shi, Mingxiao Feng, Wengang Zhou, and Houqiang Li, “Multi-Agent Reinforcement Learning with Safety Layer for Voltage Control,” International Conference on Autonomous Agents and Multiagent Systems (AAMAS), May 29-June 2, 2023.
Weichao Zhao, Hezhen Hu, Wengang Zhou, Jiaxin Shi, and Houqiang Li, “BEST: BERT Pre-Training for Sign Language Recognition with Coupling Tokenization,” AAAI Conference on Artificial Intelligence (AAAI), Feb. 7-14, 2023.
Mingyu Yang, Jian Zhao, Xunhan Hu, Wengang Zhou, Jiangcheng Zhu, and Houqiang Li, “LDSA: Learning Dynamic Subtask Assignment in Cooperative Multi-Agent Reinforcement Learning,” Advances in Neural Information Processing Systems (NeurIPS), Nov. 28-Dec. 9, 2022.
Hezhen Hu, Weilun Wang, Wengang Zhou, and Houqiang Li, “Hand-Object Interaction Image Generation,” Advances in Neural Information Processing Systems (NeurIPS), Nov. 28-Dec. 9, 2022.
Hao Feng, Wengang Zhou, Jiajun Deng, Yuechen Wang, and Houqiang Li, “Geometric Representation Learning for Document Image Rectification,” European Conference on Computer Vision (ECCV), Oct. 23-27, 2022.
Yunyao Mao, Wengang Zhou, Zhenbo Lu, Jiajun Deng, and Houqiang Li, “CMD: Self-supervised 3D Action Representation Learning with Cross-modal Mutual Distillation,” European Conference on Computer Vision (ECCV), Oct. 23-27, 2022.
Zhiyang Guo, Yunyao Mao, Wengang Zhou, and Houqiang Li, “CMT: Context-Matching-Guided Transformer for 3D Tracking in Point Clouds,” European Conference on Computer Vision (ECCV), Oct. 23-27, 2022.
Longhui Wei, Lingxi Xie, Wengang Zhou, Houqiang Li, and Qi Tian, “MVP: Multimodality-guided Visual Pre-training,” European Conference on Computer Vision (ECCV), Oct. 23-27, 2022.
Lin Liu, Lingxi Xie, Xiaopeng Zhang, Shanxin Yuan, Xiangyu Chen, Wengang Zhou, Houqiang Li, and Qi Tian, “TAPE: Task-Agnostic Prior Embedding for Image Restoration,” European Conference on Computer Vision (ECCV), Oct. 23-27, 2022.
Yonghui Wang, Wengang Zhou, Zhenbo Lu, Houqiang Li, “UDoc-GAN: Unpaired Document Illumination Correction with Background Light Prior,” ACM International Conference on Multimedia (MM), Oct. 10-14, 2022.
Minrui Wang, Mingxiao Feng, Wengang Zhou, and Houqiang Li, “Stabilizing Voltage in Power Distribution Networks via Multi-Agent Reinforcement Learning with Transformer,” ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), Aug. 14-18, 2022.
Jinhua Zhu, Yingce Xia, Lijun Wu, Shufang Xie, Tao Qin, Wengang Zhou, Houqiang Li, and Tie-Yan Liu, “Unified 2D and 3D Pre-Training of Molecular Representations,” ACM SIGKDD Conference on Knowledge Discovery and Data Mining (KDD), Aug. 14-18, 2022.
Jitao Wang, Dongyun Xue, Jian Zhao, Wengang Zhou, and Houqiang Li, “Mastering the Game of 3v3 Snakes with Rule-Enhanced Multi-Agent Reinforcement Learning,” IEEE Conference on Games (COG), Aug. 21-24, 2022.
Youpeng Zhao, Jian Zhao, Xunhan Hu, Wengang Zhou, and Houqiang Li, “DouZero+: Improving DouDizhu AI by Opponent Modeling and Coach-guided Learning,” IEEE Conference on Games (COG), Aug. 21-24, 2022.
Hui Wu, Min Wang, Wengang Zhou, Qi Tian, and Houqiang Li, “Contextual Similarity Distillation for Asymmetric Image Retrieval,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2022.
Xinyue Huo, Lingxi Xie, Hengtong Hu, Wengang Zhou, Houqiang Li, and Qi Tian, “Domain-Agnostic Prior for Transfer Semantic Segmentation,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 19-24, 2022.
Zhendong Wang, Xiaodong Cun, Jianmin Bao, Wengang Zhou, Jianzhuang Liu, and Houqiang Li, “Uformer: A General U-Shaped Transformer for Image Restoration,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), June 19-24, 2022.
Hui Wu, Min Wang, Wengang Zhou, Yang Hu, and Houqiang Li, “Learning Token-based Representation for Image Retrieval,” AAAI Conference on Artificial Intelligence (AAAI), Feb. 22-March 1, 2022.
Yuechen Wang, Wengang Zhou, and Houqiang Li, “Fine-grained Semantic Alignment Network for Weakly Supervised Temporal Language Grounding,” Findings of the Conference on Empirical Methods in Natural Language Processing (EMNLP), pp. 89-99, 2021.
Jianbo Ouyang, Hui Wu, Min Wang, Wengang Zhou, and Houqiang Li, “Contextual Similarity Aggregation with Self-attention for Visual Re-ranking,” Accepted to Advances in Neural Information Processing Systems (NeurIPS), 2021.
Hui Wu, Min Wang, Wengang Zhou, and Houqiang Li, “Learning Deep Local Features with Multiple Dynamic Attentions for Large-Scale Image Retrieval,” International Conference on Computer Vision (ICCV), pp. 11416-11425, 2021.
Yunyao Mao, Ning Wang, Wengang Zhou, and Houqiang Li, “Joint Inductive and Transductive Learning for Video Object Segmentation,” International Conference on Computer Vision (ICCV), pp. 9670-9679, 2021.
Weilun Wang, Wengang Zhou, Jianmin Bao, Dong Chen, and Houqiang Li, “Instance-wise Hard Negative Example Generation for Contrastive Learning in Unpaired Image-to-Image Translation,” International Conference on Computer Vision (ICCV), pp. 14020-14029, 2021.
Jiajun Deng, Zhengyuan Yang, Tianlang Chen, Wengang Zhou, and Houqiang Li, “TransVG: End-to-End Visual Grounding with Transformers,” International Conference on Computer Vision (ICCV), pp. 1769-1779, 2021.
Hezhen Hu, Weichao Zhao, Wengang Zhou, Yuechen Wang, and Houqiang Li, “SignBERT: Pre-Training of Hand-Model-Aware Representation for Sign Language Recognition,” International Conference on Computer Vision (ICCV), pp. 11087-11096, 2021.
Hao Feng, Yuechen Wang, Wengang Zhou, Jiajun Deng, and Houqiang Li, “DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction,” ACM International Conference on Multimedia (ACM MM), pp. 273-281, 2021.
Yuchen Yang, Min Wang, Wengang Zhou, and Houqiang Li, “Cross-modal Joint Prediction and Alignment for Composed Query Image Retrieval,” ACM International Conference on Multimedia (ACM MM), pp. 3303-3311, 2021.
Hanyue Tu, Li Li, Wengang Zhou, and Houqiang Li, “Semantic Scalable Image Compression with Cross-Layer Priors,” ACM International Conference on Multimedia (ACM MM), pp. 4044-4052, 2021.
Qing Li, Wengang Zhou, Yun Zhou, and Houqiang Li, “Attentive Update of Multi-Critic for Deep Reinforcement Learning,” IEEE International Conference on Multimedia and Expo (ICME), Oral Paper, 2021.
Hao Zhou, Wengang Zhou, Weizhen Qi, Junfu Pu, and Houqiang Li, “Improving Sign Language Translation with Monolingual Data by Sign Back-Translation,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1316-1325, 2021.
Ning Wang, Wengang Zhou, Jie Wang, and Houqiang Li, “Transformer Meets Tracker: Exploiting Temporal Context for Robust Visual Tracking,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1571-1580, Oral Paper, 2021.
Xinyue Huo, Lingxi Xie, Jianzhong He, Zijie Yang, Wengang Zhou, Houqiang Li, and Qi Tian, “ATSO: Asynchronous Teacher-Student Optimization for Semi-Supervised Image Segmentation,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1235-1244, 2021.
Hezhen Hu, Weilun Wang, Wengang Zhou, Weichao Zhao, and Houqiang Li, “Model-Aware Gesture-to-Gesture Translation,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 16428-16437, 2021.
Jinhua Zhu, Lijun Wu, Yingce Xia, Shufang Xie, Tao Qin, Wengang Zhou, Houqiang Li, Tie-Yan Liu, “IOT: Instance-wise Layer Reordering for Transformer Structures,” International Conference on Learning Representations (ICLR), January 2021.
Hezhen Hu, Wengang Zhou, and Houqiang Li, “Hand-Model-Aware Sign Language Recognition,” AAAI Conference on Artificial Intelligence (AAAI), pp. 1558-1566, 2021.
Ning Wang, Wengang Zhou, and Houqiang Li, “Contrastive Transformation for Self-supervised Correspondence Learning,” AAAI Conference on Artificial Intelligence (AAAI), pp. 10174-10182, 2021.
Yufei Yin, Jiajun Deng, Wengang Zhou, and Houqiang Li, “Instance Mining with Class Feature Banks for Weakly Supervised Object Detection,” AAAI Conference on Artificial Intelligence (AAAI), pp. 3190-3198, 2021.
Jiajun Deng, Shaoshuai Shi, Peiwei Li, Wengang Zhou, Yanyong Zhang, and Houqiang Li, “Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection,” AAAI Conference on Artificial Intelligence (AAAI), pp. 1201-1209, 2021.
Junfu Pu, Wengang Zhou, Hezhen Hu, and Houqiang Li, “Boosting Continuous Sign Language Recognition via Cross Modality Augmentation,” ACM International Conference on Multimedia (MM), 2020.
Yiheng Liu, Wengang Zhou, Mao Xi, Sanjing Shen, and Houqiang Li, “Vision Meets Wireless Positioning: Effective Person Re-identification with Recurrent Context Propagation,” ACM International Conference on Multimedia (MM), 2020.
Lin Liu, Jianzhuang Liu, Shanxin Yuan, Gregory Slabaugh, Ales Leonardis, Wengang Zhou, Qi Tian, “Wavelet-Based Dual-Branch Network for Image Demoireing,” European Conference on Computer Vision (ECCV), 2020.
Jian Zhao, Wengang Zhou, Tianyu Zhao, Yun Zhou, and Houqiang Li, “State Representation Learning for Effective Deep Reinforcement Learning,” IEEE International Conference on Multimedia and Expo (ICME), Oral Paper, 2020.
Hantao Zhang, Wengang Zhou, and Houqiang Li, “Contextual Adversarial Attacks for Object Detection,” IEEE International Conference on Multimedia and Expo (ICME), 2020.
Jiayu Wang, Wengang Zhou, Guojun Qi, Qi Tian, and Houqiang Li, “Transformation GAN for Unsupervised Image Synthesis and Representation Learning,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2020.
Jinhua Zhu, Yingce Xia, Lijun Wu, Di He, Tao Qin, Wengang Zhou, Houqiang Li, and Tie-Yan Liu, “Incorporating BERT into Neural Machine Translation,” International Conference on Learning Representations (ICLR), 2020.
Hao Zhou, Wengang Zhou, Yun Zhou, and Houqiang Li, “Spatial-Temporal Multi-Cue Network for Continuous Sign Language Recognition,” AAAI Conference on Artificial Intelligence (AAAI), Oral Paper, 2020.
Ning Wang, Wengang Zhou, Guo-Jun Qi, and Houqiang Li, “POST: POlicy-based Switch Tracking,” AAAI Conference on Artificial Intelligence (AAAI), 2020.
Xingze Li, Wengang Zhou, Yun Zhou, and Houqiang Li, “Relation-Guided Spatial Attention and Temporal Refinement for Video-based Person Re-Identification,” AAAI Conference on Artificial Intelligence (AAAI), 2020.
Peiquan Sun, Wengang Zhou, and Houqiang Li, “Attentive Experience Replay,” AAAI Conference on Artificial Intelligence (AAAI), 2020.
Chengcheng Wei, Wengang Zhou, Junfu Pu, and Houqiang Li, “Deep Grammatical Multi-classifier for Continuous Sign Language Recognition,” IEEE International Conference on Multimedia Big Data (BigMM), 2019.
Jiajun Deng, Yingwei Pan, Wengang Zhou, Ting Yao, Houqiang Li, and Tao Mei, “Relation Distillation Networks for Video Object Detection,” IEEE International Conference on Computer Vision (ICCV), 2019.
Zhihao Zhang, Junfu Pu, Liansheng Zhuang, Wengang Zhou, Houqiang Li, “Continuous Sign Language Recognition via Reinforcement Learning,” IEEE International Conference on Image Processing (ICIP), 2019. (Best Student Paper Finalists)
Zhihao Zhang, Liansheng Zhuang, Wengang Zhou, and Houqiang Li, “Dynamic Cascaded Regression Network with Reinforcement Learning for Robust Face Alignment,” IEEE International Conference on Multimedia and Expo (ICME), Oral Paper, 2019.
Hao Zhou, Wengang Zhou, and Houqiang Li, “Dynamic Pseudo Label Decoding for Continuous Sign Language Recognition,” IEEE International Conference on Multimedia and Expo (ICME), 2019.
Qianqian Wang, Liansheng Zhuang, Ning Wang, Wengang Zhou, and Houqiang Li, “Learning Motion-aware Policies for Robust Visual Tracking,” IEEE International Conference on Multimedia and Expo (ICME), 2019.
Lei Jiang, Wengang Zhou, and Houqiang Li, “Knowledge Distillation with Category-aware Attention and Discriminant Logit Losses,” IEEE International Conference on Multimedia and Expo (ICME), 2019.
Junfu Pu, Wengang Zhou, and Houqiang Li, “Iterative Alignment Network for Continuous Sign Language Recognition,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
Ning Wang, Yibing Song, Chao Ma, Wengang Zhou, Wei Liu, and Houqiang Li, “Unsupervised Deep Tracking,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2019.
Jianglei Huang and Wengang Zhou, “Re2EMA: Regularized and Reinitialized Exponential Moving Average for Target Model Update in Object Tracking,” AAAI Conference on Artificial Intelligence (AAAI), 2019.
Yiheng Liu, Zhenxun Yuan, Wengang Zhou, and Houqiang Li, “Spatial and Temporal Mutual Promotion for Video-based Person Re-identification,” AAAI Conference on Artificial Intelligence (AAAI), 2019.
Yunfeng Wang, Wengang Zhou, Qilin Zhang, and Houqiang Li, “Convolutional Neural Networks with Generalized Attentional Pooling for Action Recognition,” IEEE International Conference on Visual Communications and Image Processing (VCIP), Oral Paper, August 2018.
Yiheng Liu, Chao Xie, Wengang Zhou, and Houqiang Li, “Effective Similarity Measurement for Video-based Person Re-identification,” IEEE International Conference on Visual Communications and Image Processing (VCIP), August 2018.
Chao Xie, Ning Wang, Wengang Zhou, Weiping Li, and Houqiang Li, “Residual Compression Network for Faster Correlation Tracking,” Pacific-Rim Conference on Multimedia (PCM), July 2018.
Yifan Zhang, Wengang Zhou, and Houqiang Li, “Retrieval across Optical and SAR Images with Deep Neural Network,” Pacific-Rim Conference on Multimedia (PCM), July 2018.
Yiding Liu, Siyu Yang, Bin Li, Wengang Zhou, Ji-Zeng Xu, Houqiang Li, and Yan Lu, “Affinity Derivation and Graph Merge for Instance Segmentation,” European Conference on Computer Vision (ECCV), July 2018.
Yuanqiang Fang, Wengang Zhou, Yijuan Lu, Jinhui Tang, Qi Tian, and Houqiang Li, “Cascaded Feature Augmentation with Diffusion for Image Retrieval,” ACM International Conference on Multimedia (MM), Oral Paper, July 2018.
Jiayu Wang, Wengang Zhou, Jinhui Tang, Zhongqian Fu, Qi Tian, and Houqiang Li, “Unregularized Auto-Encoder with Generative Adversarial Networks for Image Generation,” ACM International Conference on Multimedia (MM), Poster Paper, July 2018.
Xusong Chen, Dong Liu, Zheng-Jun Zha, Wengang Zhou, and Zhiwei Xiong, “Temporal Hierarchical Attention at Category- and Item-Level for Micro-Video Click-Through Prediction,” ACM International Conference on Multimedia (MM), Poster Paper, July 2018.
Shuo Wang, Dan Guo, Wengang Zhou, Zheng-Jun Zha, and Meng Wang, “Connectionist Temporal Fusion for Sign Language Translation,” ACM International Conference on Multimedia (MM), Poster Paper, July 2018.
Zhihua Huang, Wengang Zhou, and Houqiang Li, “Cascaded Deep Convolutional Neural Network for Robust Face Alignment,” IEEE International Conference on Image Processing (ICIP), May 2018.
Zhengguang Zhou, Wengang Zhou, Richang Hong, and Houqiang Li, “Online Filter Clustering and Pruning for Efficient ConvNets,” IEEE International Conference on Image Processing (ICIP), May 2018.
Feng Lin, Wengang Zhou, Richang Hong, and Houqiang Li, “Facial Expression Recognition with Data Augmentation and Compact Feature Learning,” IEEE International Conference on Image Processing (ICIP), May 2018.
Xiaotian Zhu, Wengang Zhou, and Houqiang Li, “Improving Deep Neural Network Sparsity through Decorrelation Regularization,” International Joint Conference on Artificial Intelligence (IJCAI), April 2018.
Junfu Pu, Wengang Zhou, and Houqiang Li, “Dilated Convolutional Network with Iterative Optimization for Continuous Sign Language Recognition,” International Joint Conference on Artificial Intelligence (IJCAI), April 2018.
Yunfeng Wang, Wengang Zhou, Qilin Zhang, and Houqiang Li, “Weighted Multi-Region Convolutional Neural Network for Action Recognition with Low-Latency Online Prediction,” the Emerging Multimedia Systems and Applications Workshop at IEEE International Conference on Multimedia and Expo (ICMEW), March 2018.
Yunfeng Wang, Wengang Zhou, Qilin Zhang, and Houqiang Li, “Enhanced Action Recognition with Visual Attribute-augmented 3D Convolutional Neural Network,” IEEE International Conference on Multimedia and Expo Workshop (ICMEW), March 2018.
Ning Wang, Wengang Zhou, and Houqiang Li, “Robust Object Tracking via Part-Based Correlation Particle Filter,” IEEE International Conference on Multimedia and Expo (ICME), Oral Paper, March 2018.
Xiaotian Zhu, Wengang Zhou, and Houqiang Li, “Adaptive Layerwise Quantization for Deep Neural Network Compression,” IEEE International Conference on Multimedia and Expo (ICME), Oral Paper, March 2018.
Zhengguang Zhou, Wengang Zhou, Richang Hong, and Houqiang Li, “Online Filter Weakening and Pruning for Efficient ConvNets,” IEEE International Conference on Multimedia and Expo (ICME), Oral Paper, March 2018.
Yilin He, Wengang Zhou, and Houqiang Li, “Major-Subordinate-Task Learning for Image Orientation Estimation,” IEEE International Conference on Multimedia and Expo (ICME), March 2018.
Ning Wang, Wengang Zhou, Qi Tian, Richang Hong, Meng Wang, and Houqiang Li, “Multi-Cue Correlation Filters for Robust Visual Tracking,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), Feb. 2018.
Jie Huang, Wengang Zhou, Qilin Zhang, Houqiang Li and Weiping Li, “Video-based Sign Language Recognition without Temporal Segmentation,” AAAI Conference on Artificial Intelligence (AAAI), 2018.
Dan Guo, Wengang Zhou, Meng Wang, and Houqiang Li, “Hierarchical LSTM for Sign Language Translation,” AAAI Conference on Artificial Intelligence (AAAI), 2018.
Yue Lv, Wengang Zhou, Qi Tian, and Houqiang Li, “Scalable Bag of Selected Deep Features for Visual Instance Retrieval,” International Conference on Multimedia Modeling (MMM), 2018.
Jie Sun, Wengang Zhou, and Houqiang Li, “Orientation Estimation Network,” International Conference on Image and Graphics (ICIG), Oral Paper, July 2017.
Min Wang, Wengang Zhou, Qi Tian, Junfu Pu, and Houqiang Li, “Deep Supervised Quantization by Self-Organized Map,” ACM International Conference on Multimedia (MM), Long Paper, 2017.
Yiding Liu, Wengang Zhou, and Houqiang Li, “Quasi Rate Distortion Optimization for Binary Hashing,” International Conference on Image Processing (ICIP), 2017.
Xinchun Qian, Wengang Zhou, and Houqiang Li, “No-Reference Image Quality Assessment based on Internal Generative Mechanism,” International Conference on Multimedia Modelling (MMM), pp. 264-276, 2017.
Tianqi Zheng, Chao Xie, Wengang Zhou, and Houqiang Li, “Compressive Tracking with Adaptive Color Feature Selection and Foreground Modeling,” IEEE International Conference on Visual Communications and Image Processing (VCIP), pp. 1-4, 2016.
Xuya Wang, Wengang Zhou, Qi Tian, and Houqiang Li, “Adaptively Weighted Graph Fusion for Image Retrieval,” ACM International Conference on Internet Multimedia Computing and Service (ICIMCS), pp. 18-21, 2016.
Tianqi Zheng, Chao Xie, Wengang Zhou, and Houqiang Li, “Improve Visual Tracking by End-to-end Multi-Tracker Selection,” ACM International Conference on Internet Multimedia Computing and Service (ICIMCS), pp. 242-245, 2016.
Min Wang, Wengang Zhou, Qi Tian, and Houqiang Li, “Sparse Matrix based Hashing for Approximate Nearest Neighbor Search,” Pacific-Rim Conference on Multimedia (PCM), pp. 559-568, June 2016.
Junfu Pu, Wengang Zhou, and Houqiang Li, “Sign Language Recognition with Multi-modal Features,” Pacific-Rim Conference on Multimedia (PCM), pp. 252-261, June 2016.
Min Wang, Wengang Zhou, Qi Tian, Zheng-Jun Zha, and Houqiang Li, “Linear Distance Preserving Pseudo-Supervised and Unsupervised Hashing,” ACM International Conference on Multimedia (MM), Long Paper, pp. 1257-1266, 2016.
Tao Liu, Wengang Zhou, and Houqiang Li, “Sign Language Recognition with Long Short Term Memory,” IEEE International Conference on Image Processing (ICIP), pp. 2871-2875, 2016.
Dan Guo, Wengang Zhou, Meng Wang, and Houqiang Li, “Sign Language Recognition based on Adaptive HMMs with Data Augmentation,” IEEE International Conference on Image Processing (ICIP), pp. 2876-2880, 2016.
Jihai Zhang, Wengang Zhou, Chao Xie, Junfu Pu, and Houqiang Li, “Chinese Sign Language Recognition with Adaptive HMM,” IEEE International Conference on Multimedia and Expo (ICME), pp. 1-6, 2016.
Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, Weiyao Lin, and Qi Tian, “Picking Deep Filter Responses for Fine-grained Image Recognition,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), pp. 1134-1142, 2016.
Junfu Pu, Wengang Zhou, Jihai Zhang, and Houqiang Li, “Sign Language Recognition Based on Trajectory Modeling with HMMs,” International Conference on Multimedia Modelling (MMM), pp. 686-697, 2016.
Chao Xie, Wengang Zhou, and Houqiang Li, “Breath Motion State Estimation on 4D CT Rib Cage Images,” International Conference on Multimedia Modelling (MMM), pp. 818-828, 2016.
Xingyang Cai, Wengang Zhou, and Houqiang Li, “Attribute Mining for Scalable 3D Human Action Recognition,” ACM International Conference on Multimedia (MM), pp. 1075-1078, 2015.
Wengang Zhou, Houqiang Li, and Qi Tian, “Scalable local feature matching without visual codebook training,” ACM International Conference on Internet Multimedia Computing and Service (ICIMCS), 2015.
Xu Xie, Wengang Zhou, Houqiang Li, and Qi Tian, “Rank-aware Graph Fusion with Contextual Dissimilarity Measurement for Image Retrieval,” IEEE International Conference on Image Processing (ICIP), 2015.
Jie Huang, Wengang Zhou, Houqiang Li, and Weiping Li, “Sign Language Recognition using Real-Sense,” IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP), 2015.
Jihang Zhang, Wengang Zhou, and Houqiang Li, “A New System for Chinese Sign Language Recognition,” IEEE China Summit and International Conference on Signal and Information Processing (ChinaSIP), 2015.
Zhanning Gao, Jianru Xue, Wengang Zhou, Shanmin Pang, and Qi Tian, “Fast Democratic Aggregation and Query Fusion for Image Search,” IEEE International Conference on Multimedia Retrieval (ICMR), 2015.
Jie Huang, Wengang Zhou, Houqiang Li, and Weiping Li, “Sign language recognition using 3D convolutional neural networks,” IEEE International Conference on Multimedia and Expo (ICME), 2015.
Peng Zhang, Wengang Zhou, Lei Wu, and Houqiang Li, “SOM: Semantic Obviousness Metric for Image Quality Assessment,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2015.
Xingyang Cai, Wengang Zhou, and Houqiang Li, “An effective representation for action recognition with human skeleton joints,” SPIE/COS Photonics Asia. International Society for Optics and Photonics, 2014.
Xiaopeng Zhang, Hongkai Xiong, Wengang Zhou, and Qi Tian, “Fused one-vs-all mid-level features for fine-grained visual categorization,” In Proceedings of the ACM International Conference on Multimedia (MM), pp. 287-296, Full Paper, 2014.
Shaoyan Sun, Wengang Zhou, Houqiang Li, and Qi Tian, “Search by Detection: Object-Level Feature for Image Retrieval,” In Proceedings of International Conference on Internet Multimedia Computing and Service, pp. 46-49, 2014.
Jihai Zhang, Wengang Zhou, and Houqiang Li, “A Threshold-based HMM-DTW Approach for Continuous Sign Language Recognition,” In Proceedings of International Conference on Internet Multimedia Computing and Service, 2014.
Jing Wen, Wengang Zhou, Richang Hong, Meng Wang, and Qi Tian, “Evaluation on the Impact of Image Quality on Image Retrieval,” In Proceedings of International Conference on Internet Multimedia Computing and Service, 2014.
Liang Zheng, Shengjin Wang, Wengang Zhou, and Qi Tian, “Bayes Merging of Multiple Vocabularies for Scalable Image Retrieval,” IEEE Conference on Computer Vision and Pattern Recognition (CVPR), 2014.
Junhua Mao, Houqiang Li, Wengang Zhou, Shuicheng Yan, and Qi Tian, “Scale-based region growing for scene text detection,” The 21st ACM International Conference on Multimedia (MM'13), Full Paper, Acceptance Rate: 20%, Barcelona, Spain, October 21-25, 2013.
Wengang Zhou, Houqiang Li, M. Wang, Yijuan Lu and Qi Tian, “Binary SIFT: towards efficient feature matching verification for image search,” Best Paper Award, the 4th ACM International Conference on Internet Multimedia Computing and Service (ICIMCS 2012), Wuhan, China, September 9-11, pp. 1-6, 2012.
Wengang Zhou, Yijuan Lu, Houqiang Li, and Qi Tian, “Scalar quantization for large scale image search,” ACM International Conference on Multimedia (MM), Full Paper (acceptance rate 20.2%), 2012.
Zhen Liu, Houqiang Li, Wengang Zhou, and Qi Tian, “Embedding spatial context information into inverted file for large-scale image retrieval,” ACM International Conference on Multimedia (MM), Full Paper (acceptance rate 20.2%), 2012.
Xia Li, Wengang Zhou, J. Tang, and Qi Tian, “Query expansion enhancement by fast binary matching,” ACM International Conference on Multimedia (MM), Short Paper (acceptance rate 31.2%), 2012.
Junjie Cai, Zheng-jun Zha, Wengang Zhou, and Qi Tian, “Attribute-assisted reranking for Web image retrieval,” ACM International Conference on Multimedia (MM), Short Paper (acceptance rate 31.2%), 2012.
Jie Xiao, Wengang Zhou, Xia Li, Meng Wang, and Qi Tian, “Image tag re-ranking by coupled probability transition,” ACM International Conference on Multimedia (MM), Short Paper (acceptance rate 31.2%), 2012.
Jia Xiao, Wengang Zhou, and Qi Tian, “Exploring tag relevance for image tag re-ranking,” International ACM conference on Research and Development in Information Retrieval (SIGIR 2012), pp. 1069-1070, 2012.
Wengang Zhou, Houqiang Li, Yijuan Lu, and Qi Tian, “Large scale image retrieval with geometric coding,” ACM International Conference on Multimedia (MM), Short Paper (acceptance rate: 36.3%, 120 out of 331), Scottsdale, Arizona, Nov. 28-Dec. 1, pp. 1349-1352, 2011.
Wengang Zhou, Yijuan Lu, Houqiang Li, Y. Song, and Qi Tian, “Spatial coding for large scale partial-duplicate web image search,” ACM International Conference on Multimedia (MM), Full Paper (acceptance rate: 16%), Florence, Italy, October 25-29, pp. 131-140, 2010.
Wengang Zhou, Yijuan Lu, Houqiang Li, Y. Song, and Qi Tian, “Large scale partially duplicated web image retrieval,” Demo paper at ACM International Conference on Multimedia (MM), Florence, Italy, October 25-29, pp. 1523-1524, 2010.
Wengang Zhou, Qi Tian, L. Yang, and Houqiang Li, “Latent visual context analysis for image re-ranking,” ACM International Conference on Image and Video Retrieval (CIVR), Xi’an, China, July 5-7, pp. 205-212, 2010.
Wengang Zhou, Yijuan Lu, Houqiang Li, and Qi Tian, “Canonical image selection by visual context learning,” International Conference on Pattern Recognition (ICPR), August 23-26, Istanbul, Turkey, pp. 834-837, 2010.
Wengang Zhou, Houqiang Li, Yijuan Lu, and Qi Tian, “Large scale partial-duplicate image retrieval with bi-Space quantization and geometric consistency,” IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP), March 14-19, pp. 2394-2397, 2010.
Wengang Zhou, Qi Tian, and Houqiang Li, “Visual block link analysis for image re-ranking,” ACM International Conference on Internet Multimedia Computing and Services (ICIMCS), Kunming, China, November 23-25, pp. 14-20, 2009.
Wengang Zhou, Houqiang Li, and Xiaobo Zhou, “3D dendrite reconstruction and spine identification,” International Conference on Medical Image Computing and Computer Assisted Intervention (MICCAI), Sept. 6-10, pp. 18-26, 2008.
Wengang Zhou, Houqiang Li, Xiaobo Zhou, and Stephen Wong, “A new algorithm for 3D dendritic spine detection,” International Symposium on Computational Models for Life Sciences (CMLS), Dec. 17-19, pp. 137-146, 2007.