Jie Zhang @ University of Science and Technology of China

Welcome To Jie Zhang's Home Page

Journal publications:

Jie Zhang, Guanghui Zhang*, Li-Rong Dai, "Frequency-invariant sensor selection for MVDR beamforming in wireless acoustic sensor networks", IEEE Trans. Wireless Communications, DOI: 10.1109/TWC.2022.3185713, 2022. (to appear) (pdf)
Guanghui Zhang, Jie Zhang*, Ke Liu, Jing Guo, Jack Y. B. Lee, Haibo Hu, and Vaneet Aggarwal, "DUASVS: A Mobile Data Saving Strategy in Short-form Video Streaming", IEEE Trans. Services Computing, DOI: 10.1109/TSC.2022.3150012, 2022. (to appear) (pdf)
Guanghui Zhang, Jie Zhang*, Yan Liu, Haibo Hu, Jack Lee, Vaneet Aggarwal, "Adaptive Video Streaming with Automatic Quality-of-Experience Optimization", IEEE Trans. Mobile Computing, DOI: 10.1109/TMC.2022.3161351, 2022. (to appear) (pdf)
Jie Zhang, Guanghui Zhang*, "A parametric unconstrained beamformer based binaural noise reduction for assistive hearing", IEEE/ACM Trans. Audio, Speech, Lang. Process., DOI: 10.1109/TASLP.2021.3138675, 30:292-304, 2022. (pdf)
Jie Zhang, Changheng Li*, "Quantization-aware binaural MWF based noise reduction incorporating external wireless devices", IEEE/ACM Trans. Audio, Speech, Lang. Process., DOI: 10.1109/TASLP.2021.3120639, 29:3118-3131, 2021. (pdf)
Jian Tang, Jie Zhang*, Yan Song, Ian McLoughlin, Li-Rong Dai, "Multi-granularity sequence alignment mapping for encoder-decoder based end-to-end ASR", IEEE/ACM Trans. Audio, Speech, Lang. Process., 29:2816-2828, 2021. (pdf, Python code)
Jie Zhang*, Jun Du, Li-Rong Dai, "Sensor selection for relative acoustic transfer function steered linearly-constrained beamformers", IEEE/ACM Trans. Audio, Speech, Lang. Process., 29:1220-1232, 2021. (pdf)
Jie Zhang*, Huawei Chen, Li-Rong Dai, Richard C. Hendriks, "A study on reference microphone selection for multi-microphone speech enhancement", IEEE/ACM Trans. Audio, Speech, Lang. Process., 29:1220-1232, 2021. (pdf, matlab code)
Jie Zhang*, "Power optimized and power constrained randomized gossip approaches for wireless sensor networks", IEEE Wireless Communications Letters, 10(2):241-245, 2020. (pdf)
Jie Zhang, Pingping Wu*, "Joint sampling synchronization and source localization for wireless acoustic sensor networks", IEEE Communications Letters, 24(5):1020-1023, 2020. (pdf)
Jie Zhang*, Richard Heusdens, Richard C. Hendriks, "Relative acoustic transfer function estimation in wireless acoustic sensor networks", IEEE/ACM Trans. Audio, Speech, Lang. Process., 27(10):1507–1579, 2019. (Featured article) (pdf, matlab code)
Jie Zhang*, Andreas I. Koutrouvelis, Richard Heusdens, Richard C. Hendriks, "Distributed rate-constrained LCMV beamforming", IEEE Signal Process. Letters, 26(5):675–679, 2019. (pdf, matlab examples)
Jie Zhang*, Richard Heusdens, Richard C. Hendriks, "Rate-distributed spatial filtering based noise reduction in wireless acoustic sensor network", IEEE/ACM Trans. Audio, Speech, Lang. Process., 26(11):2015–2026, 2018. (pdf, matlab examples)
Jie Zhang*, Sundeep Prabhakar Chepuri, Richard Heusdens, Richard C. Hendriks, "Microphone subset selection for MVDR beamformer based noise reduction", IEEE/ACM Trans. Audio, Speech, Lang. Process., 26(3):550–563, 2018. (pdf, matlab examples)
Cheng Pang, Hong Liu*, Jie Zhang*, Xiaofei Li, "Binaural sound localization based on reverberation weighting and generalized parametric mapping", IEEE/ACM Trans. Audio, Speech, Lang. Process., 25(8):1618–1632, 2017. (pdf)
Jie Zhang, Hong Liu*, "Robust acoustic localization via time-delay compensation and interaural matching filter", IEEE Trans. Signal Process., 63(18):4771–4783, 2015. (pdf)
Hong Liu, Mengdi Yue, Jie Zhang, "Bi-direction interaural matching filter and decision weighting fusion for sound source localization in noisy environments", IEICE Trans. Information and Systems, 99(12):3192-3196, 2016.
Cheng Pang, Xiuling Wang, Jie Zhang, Hong Liu*, "Mandarin accent identification based on GMM with multi-feature fusion", Journal of Huazhong University of Science and Technology: Science edition, (S1):381-384, 2015.

Conference publications:

Xiao-Ying Zhao, Qiu-Shi Zhu, Jie Zhang*, "Speech Enhancement Using Self-Supervised Pre-Trained Model and Vector Quantization", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), Chiang Mai, Thailand, Dec. 2022. (accepted) (pdf)
Guolong Zhong, Hongyu Song, Ruoyu Wang, Lei Sun, Diyuan Liu, Jia Pan, Xin Fang, Jun Du, Jie Zhang*, Li-Rong Dai, "External Text Based Data Augmentation for Low-Resource Speech Recognition in the Constrained Condition of OpenASR21 Challenge", ISCA Interspeech, Incheon, South Korea, Sept. 2022. (accepted) (pdf)
Yeqian Du, Jie Zhang*, Qiu-Shi Zhu, Li-Rong Dai, Minghui Wu, Xin Fang, and Zhouwang Yang, "A Complementary Joint Training Approach Using Unpaired Speech and Text for Low-Resource Automatic Speech Recognition", ISCA Interspeech, Incheon, South Korea, Sept. 2022. (accepted) (pdf)
Hai-Tao Xu, Jie Zhang*, Li-Rong Dai, "Differential Time-frequency Log-mel Spectrogram Features for Vision Transformer Based Infant Cry Recognition", ISCA Interspeech, Incheon, South Korea, Sept. 2022. (accepted) (pdf)
Zi-Qiang Zhang, Jie Zhang*, Jian-Shu Zhang, Ming-Hui Wu, Xin Fang, Li-Rong Dai, "Learning contextually fused audio-visual representations for audio-visual speech recognition", IEEE Int. Conf. on Image Process. (ICIP), Bordeaux, France, Oct. 2022. (accepted) (pdf)
Ao-Ran Gan, Jie Zhang*, Ming-Hui Wu, Xin Fang, Li-Rong Dai, "An experimental comparison between low-resource semi-supervised and high-resource supervised automatic speech recognition models", IEEE Int. Conf. on Multimedia & Expo (ICME), Taipei, Taiwan, July, 2022. (accepted) (pdf)
Qiu-Shi Zhu, Jie Zhang*, Zi-Qiang Zhang, Minghui Wu, Xin Fang, and Li-Rong Dai, "A noise-robust self-supervised pre-training model based speech representation learning for automatic speech recognition", IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), pp. 3174-3178, Singapore, May 2022. (pdf)
Xing-yu Chen, Qiu-Shi Zhu, Jie Zhang*, and Li-Rong Dai, "Supervised and self-supervised pretraining based COVID-19 detection using acoustic breathing/cough/speech signals", IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), pp. 561-565, Singapore, May 2022. (The Second DiCOVA Competition Winner) (pdf)
Xing-yu Chen, Jie Zhang*, and Li-Rong Dai, "Reference microphone selection and low-rank approximation based multichannel Wiener filter with application to speech recognition", IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), pp. 4963-4967, Singapore, May 2022. (pdf, matlab code)
Jie Zhang*, "A parametric unconstrained binaural beamformer based noise reduction and spatial cue preservation for hearing-assistive devices", IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), pp. 791-795, Toronto, Canada, 2021. (pdf)
Qiu-Shi Zhu, Jie Zhang*, Minghui Wu, Xin Fang, and Li-Rong Dai, "An improved wav2vec 2.0 pre-training approach using enhanced local dependency modeling for speech recognition", ISCA Interspeech, pp. 4334-4338, Brno, Czechia, Sept. 2021. (pdf)
Muhammad Muzamil Aslam, Jie Zhang*, Bushra Qureshi, Zahoor Ahmed, "Beyond6G-Consensus Traffic Management in CRN, Applications, Architecture and key Challenges", IEEE 11th Int. Conf. on Electronics Information and Emergency Communication (ICEIEC), pp. 182-185, June, 2021. (pdf)
Liangfa Wei, Jie Zhang*, Junfeng Hou, and Li-Rong Dai, "Attentive fusion enhanced audio-visual encoding for transformer based robust speech recognition", Asia-Pacific Signal and Information Processing Association Annual Summit and Conference (APSIPA-ASC), pp. 638-643, Auckland, New Zealand, Dec. 2020. (pdf)
Jie Zhang*, Richard Heusdens, and Richard C. Hendriks, "Sensor selection and rate distribution based beamforming for wireless acoustic sensor networks", EURASIP Europ. Signal Proces. Conf. (EUSIPCO), pp. 1-5, A Coruna, Spain, Sept. 2019. (pdf)
Jie Zhang*, Richard Heusdens, and Richard C. Hendriks, "Rate-distributed binaural LCMV beamforming for assistive hearing in wireless acoustic sensor networks", IEEE Workshop on Sensor Array and Multichannel Signal Process. (SAM), pp. 460-464, Sheffield, U.K., Jul. 2018. (Best Paper Award) (pdf)
Jie Zhang*, Richard C. Hendriks, and Richard Heusdens, "Structured total least squares based internal delay estimation for distributed microphone auto-localization", Int. Workshop Acoustic Signal Enhancement (IWAENC), pp. 1-5, Xi'an, China, Sept. 2016. (Best Paper Finalist) (pdf)
Jie Zhang*, Richard C. Hendriks, and Richard Heusdens, "Greedy gossip algorithm with synchronous communication for wireless sensor networks", In 6th Joint WIC/IEEE Symposium on Information Theory and Signal Processing in the Benelux, pp. 228-235, Belgium, April, 2016. (pdf)
Hong Liu, Mengdi Yue, Jie Zhang, "Probabilistic binaural multiple sources localization based on time-delay compensation estimator and clustering analysis", IEEE/RSJ Int. Conf. on Intelligent Robots and Systems (IROS), pp. 4537-4544, Daejeon, South Korea, Oct. 2016. (pdf)
Ling Chen, Jie Zhang, Guodong Chen, Meng Zhang, and Hong Liu, "Binaural cues estimates based on interaural matching filter for sound source localization", IEEE Int. Conf. on Robotics and Biomimetics (ROBIO), pp. 863-868, Shenzhen, China, 2016. (pdf)
Jie Zhang, Hong Liu, "A dual-channel beamformer based on time-delay compensation estimator and shifted PCA for speech enhancement", IEEE 23rd Int. Conf. on Software, Telecommunications and Computer Networks (SoftCOM), pp. 180-184, Split, Croatia, 2015. (pdf)
Hong Liu, Cheng Pang, Jie Zhang, "Binaural sound source localization based on generalized parametric model and two-layer matching strategy in complex environments", IEEE Int. Conf. on Robotics and Automation (ICRA), pp. 4496-4503, Seattle, WA, USA, May 2015. (pdf)
Cheng Pang, Jie Zhang and Hong Liu, "Direction of arrival estimation based on reverberation weighting and noise error estimator", ISCA Interspeech, pp. 3436-3440, Dresden, Germany, Sept. 2015. (pdf)
Hong Liu, Jie Zhang, "A binaural sound source localization model based on time-delay compensation and interaural coherence", IEEE Int. Conf. Acoust., Speech, Signal Process. (ICASSP), pp. 1424-1428, Florence, Italy, May, 2014. (pdf)
Hong Liu, Jie Zhang, Zhuo Fu, "A new hierarchical binaural sound source localization method based on interaural matching filter", IEEE Int. Conf. on Robotics and Automation (ICRA), pp. 1598-1605, Hong Kong, China, June, 2014. (pdf)
Baolong Zhao, Zhuo Fu, Jie Zhang, Hong Liu, "Binaural Sound Source Localization Based on Time-delay Compensation and Spatial Grid Matching", IEEE Int. Conf. on Cloud Computing and Intelligence Systems (CCIS), pp. 287-291, Shenzhen, China, 2014.
Mengdi Yue, Ling Chen, Jie Zhang, Hong Liu, "Speaker Age Recognition Based on Isolated Words by Using SVM", IEEE Int. Conf. on Cloud Computing and Intelligence Systems (CCIS), pp. 287-291, Shenzhen, China, 2014.

Patents:

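Jie Zhang, Li-Rong Dai, "Binaural Wiener filtering method based on rate optimization of a single external wireless acoustic sensor", Patent Application No. 202210547834.9, filed May 18, 2022.
Jie Zhang, Li-Rong Dai, "Binaural speech enhancement method and apparatus based on parametric unconstrained beamforming", Patent Application No. 202210150297.4, filed Feb. 18, 2022.
Jie Zhang, Hai-Tao Xu, Li-Rong Dai, "Sound recognition method, sound recognition apparatus, and electronic device", Patent Application No. 202111479297.0, filed Dec. 3, 2021.
Qiu-Shi Zhu, Jie Zhang, Xing-yu Chen, Li-Rong Dai, "Training method for a sound event analysis model, event analysis method, and apparatus thereof", Patent Application No. 202111495065.4, filed Dec. 8, 2021.
Jie Zhang, Xing-yu Chen, Li-Rong Dai, "A multichannel speech enhancement method based on reference microphone optimization", Patent Application No. 202110505085.9, filed May 10, 2021.
Hong Liu, Jie Zhang, Runwei Ding, "A binaural sound source localization method based on time-delay compensation and interaural coherence", Patent No. ZL 2014 1 014277.1, filed Apr. 10, 2014, granted Aug. 17, 2016.
Hong Liu, Jie Zhang, Runwei Ding, "A binaural sound source localization method based on interaural matching filter", Patent No. ZL 2014 1 0143474.1, filed Apr. 10, 2014, granted Aug. 17, 2016.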