Chao Wang's Homepage-USTC

Selected Publications:

Most of the papers are copyrighted by ACM or IEEE. These publications are posted here for personal use, to ensure timely dissemination of research work with no commercial purpose. Permission to reprint/republish this material for advertising or promotional purposes or for creating new collective works for resale or redistribution to servers or lists, or to reuse any copyrighted component of this work in other works must be obtained from the ACM or IEEE.

For papers listed below that are not in the ACM or IEEE digital libraries, feel free to email me for a PDF copy.

Books:

[1]. High Performance Computing for Big Data: Methodologies and Applications, published by Chapman and Hall/CRC.

[2]. Reconfigurable and Adaptive Computing: Theory and Applications, published by Chapman and Hall/CRC.

2024:

[3]. [TPDS] Changlong Li, Yu Liang, Liang Shi, Chao Wang, Chun Jason Xue, Xuehai Zhou: Flexible and Efficient Memory Swapping Across Mobile Devices With LegoSwap. IEEE Trans. Parallel Distributed Syst. 35(1): 140-153 (2024)

2023:

[4]. [TC] Lei Gong, Chao Wang, Haojun Xia, Xianglan Chen, Xi Li, Xuehai Zhou: Enabling Fast and Memory-Efficient Acceleration for Pattern Matching Workloads: The Lightweight Automata Processing Engine. IEEE Trans. Computers 72(4): 1011-1025 (2023)

[5]. [TCAD] Yingxue Gao, Lei Gong, Chao Wang, Teng Wang, Xi Li, Xuehai Zhou: Algorithm/Hardware Co-Optimization for Sparsity-Aware SpMM Acceleration of GNNs. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 42(12): 4763-4776 (2023)

[6]. [DATE 2023] Yingxue Gao, Teng Wang, Lei Gong, Chao Wang, Xi Li, Xuehai Zhou: FastRW: A Dataflow-Efficient and Memory-Aware Accelerator for Graph Random Walk on FPGAs. DATE 2023: 1-6

[7]. [DATE 2023] Wenqi Lou, Jiaming Qian, Lei Gong, Xuan Wang, Chao Wang, Xuehai Zhou: NAF: Deeper Network/Accelerator Co-Exploration for Customizing CNNs on FPGA. DATE 2023: 1-6

[8]. [FPGA 2023] Xuan Wang, Lei Gong, Jing Cao, Wenqi Lou, Weiya Wang, Chao Wang, Xuehai Zhou: hAP: A Spatial-von Neumann Heterogeneous Automata Processor with Optimized Resource and IO Overhead on FPGA. FPGA 2023: 185-196

2022:

[9]. [TC] Wenqi Lou, Lei Gong, Chao Wang, Zidong Du, Xuehai Zhou: OctCNN: A High Throughput FPGA Accelerator for CNNs Using Octave Convolution Algorithm. IEEE Trans. Computers 71(8): 1847-1859 (2022)

[10]. [TC] Yuanbo Wen, Qi Guo, Zidong Du, Jianxing Xu, Zhenxing Zhang, Xing Hu, Wei Li, Rui Zhang, Chao Wang, Xuehai Zhou, Tianshi Chen: Enabling One-Size-Fits-All Compilation Optimization for Inference Across Machine Learning Computers. IEEE Trans. Computers 71(9): 2313-2326 (2022)

[11]. [TCAD] Teng Wang, Lei Gong, Chao Wang, Yang Yang, Yingxue Gao, Xuehai Zhou, Huaping Chen: ViA: A Novel Vision-Transformer Accelerator Based on FPGA. IEEE Trans. Comput. Aided Des. Integr. Circuits Syst. 41(11): 4088-4099 (2022)

[12]. [NEUCOM] Yang Yang, Chao Wang, Lei Gong, Min Wu, Xuehai Zhou: Conv-inheritance: A hardware-efficient method to compress convolutional neural networks for edge applications. Neurocomputing 487: 172-180 (2022)

[13]. [ICML] Yuanbo Wen, Qi Guo, Qiang Fu, Xiaqing Li, Jianxing Xu, Yanlin Tang, Yongwei Zhao, Xing Hu, Zidong Du, Ling Li, Chao Wang, Xuehai Zhou, Yunji Chen: BabelTower: Learning to Auto-parallelized Program Translation. ICML 2022: 23685-23700

2021:

[14]. [TC] Chao Wang, Lei Gong, Fahui Jia, Xuehai Zhou, An FPGA based Accelerator for Ubiquitous Clustering Applications with Custom Instructions, IEEE Trans. Computers

[15]. [TCAD] Chao Wang, Lihui Jin, Lei Gong, Chongchong Xu, Yahui Hu, Luchao Tan, Xuehai Zhou, Tinker: A Middleware for Deploying Multiple NN-based Applications on a Single Machine, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems

[16]. [TSC] Chao Wang, Lei Gong, Xi Li, Qi Yu, Aili Wang, Patrick Hung, and Xuehai Zhou, SOLAR: Services-oriented Deep Learning Architectures, IEEE Transactions on Services Computing.

[17]. [TCBB] Chao Wang, Lei Gong, Shiming Lei, Haijie Fang, Aili Wang, Xi Li, and Xuehai Zhou, GenSeq+: A Scalable High-Performance Accelerator for Genome Sequencing, IEEE/ACM Transactions on Computational Biology and Bioinformatics 18(4): 1512-1523 (2021)

[18]. [TSC] Changlong Li, Hang Zhuang, Qingfeng Wang, Chao Wang, Xuehai Zhou, LKSM: Light Weight Key-Value Store for Efficient Application Services on Local Distributed Mobile Devices, IEEE Transactions on Services Computing 14(4): 1026-1039 (2021)

[19]. [TPDS] Lei Gong, Chao Wang, Xi Li, Xuehai Zhou: Improving HW/SW Adaptability for Accelerating CNNs on FPGAs Through A Dynamic/Static Co-Reconfiguration Approach. IEEE Trans. Parallel Distributed Syst. 32(7): 1854-1865 (2021)

[20]. [TC] Wenqi Lou, Lei Gong, Chao Wang, Zidong Du, Xuehai Zhou, OctCNN: A High Throughput FPGA Accelerator for CNNs using Octave Convolution Algorithm, IEEE Trans. Computers 71(8): 1847-1859 (2022)

[21]. [TIE] Qing Xu, Zhenghua Chen, Keyu Wu, Chao Wang, Min Wu, Xiaoli Li, KDnet-RUL: A Knowledge Distillation Framework to Compress Deep Neural Networks for Machine Remaining Useful Life Prediction, IEEE Transactions on Industrial Electronics

[22]. [DATE] Haojun Xia, Lei Gong, Chao Wang, Xianglan Chen, Xuehai Zhou: LAP: A Lightweight Automata Processor for Pattern Matching Tasks. DATE 2021: 844-849

[23]. [ICCD] Xuan Wang, Lei Gong, Chao Wang, Xi Li and Xuehai Zhou, UH-JLS: A Parallel Ultra-High Throughput JPEG-LS Encoding Architecture for Lossless Image Compression, International Conference on Computer Design 2021: 335-343

[24]. [CCF THPC] Haoyu Cai, Chao Wang, Xuehai Zhou: Deployment and verification of machine learning tool-chain based on kubernetes distributed clusters. CCF Trans. High Perform. Comput. 3(2): 157-170 (2021)

2020:

[25]. [TC] Chao Wang, Lei Gong, Xiang Ma, Xi Li, Xuehai Zhou, WooKong: A Ubiquitous Accelerator for Recommendation Algorithms with Custom Instruction Sets on FPGA, IEEE Trans. Computers. 69(7): 1071-1082 (2020)

[26]. [TC] Xi Zeng, Tian Zhi, Xuda Zhou, Zidong Du, Qi Guo, Shaoli Liu, Chao Wang, Ling Li, Xuehai Zhou, Tianshi Chen, Yunji Chen, Addressing Irregularity in Sparse Neural Networks through a Cooperative Software/Hardware Approach, IEEE Trans. Computers. 69(7): 968-985 (2020)

[27]. [TPDS] Chao Wang, Lei Gong, Xiang Ma, Xi Li, Xuehai Zhou, A Ubiquitous Machine Learning Accelerator with Automatic Parallelization on FPGA, IEEE Transactions on Parallel and Distributed Systems. 31(10): 2346-2359 (2020)

[28]. [TCAD] Xuan Wang, Chao Wang, Jing Cao, Lei Gong, Xuehai Zhou: WinoNN: Optimising FPGA-based Neural Network Accelerators using Sparse Winograd Algorithm. IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems (CODES+ISSS 2020 Journal Track).

[29]. [Cluster] Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou: OctCNN: An Energy-Efficient FPGA Accelerator for CNNs using Octave Convolution Algorithm, IEEE Cluster 2020.

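[30]. [Chinese Journal of Computers] Chao Wang, Teng Wang, Xiang Ma, Xuehai Zhou, Research Progress on FPGA-based Machine Learning Hardware Acceleration (in Chinese), Chinese Journal of Computers (计算机学报), vol. 43, no. 6, 2020.

[31]. [Journal of Software] Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou, A Neural Network Instruction Set Extension and Code Mapping Mechanism (in Chinese), Journal of Software (软件学报), vol. 31, no. 10, 2020.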

2019:

[32]. [TODAES] Bo Wan, Xi Li, Bo Zhang, Caixu Zhao, Xianglan Chen, Chao Wang, and Xuehai Zhou, DCW: A Reactive and Predictable Programming Framework for LET-based Distributed Real-time Systems, ACM Transactions on Design Automation of Electronic Systems.

[33]. [APPT] Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou: RV-CNN: Flexible and Efficient Instruction Set for CNNs Based on RISC-V Processors. APPT 2019: 3-14

[34]. [BIBM] Qingfeng Wang, Jun Huang, Zhiqin Liu, Jie-Zhi Cheng, Ying Zhou, Qiyu Liu, Yaobin Wang, Xuehai Zhou, Chao Wang: Higher-order Transfer Learning for Pulmonary Nodule Attribute Prediction in Chest CT Images. BIBM 2019: 741-745

[35]. [Cluster] Teng Wang, Lei Gong, Chao Wang, Xuehai Zhou, Huaping Chen: Design Exploration of Multi-FPGAs for Accelerating Deep Learning. CLUSTER 2019: 1-2

[36]. [CODES+ISSS] Yang Yang, Chao Wang, Xuehai Zhou: Drama: A high efficient neural network accelerator on FPGA using dynamic reconfiguration: work-in-progress. CODES+ISSS 2019: 13:1-13:2

[37]. [FPT] Yang Yang, Chao Wang, Lei Gong, Xuehai Zhou: FPNet: Customized Convolutional Neural Network for FPGA Platforms. FPT 2019: 399-402

2018:

[38]. [TCAD] Lei Gong, Chao Wang, Xi Li, Huaping Chen, Xuehai Zhou, MALOC: A Fully Pipelined FPGA Accelerator for Convolutional Neural Networks with All Layers Mapped On Chip, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems.

[39]. [TVLSI] Maohua Zhu, Youwei Zhuo, Chao Wang, Wenguang Chen, and Yuan Xie: Performance Evaluation and Optimization of HBM-Enabled GPU for Data-intensive Applications, IEEE Transactions on VLSI Systems.

[40]. [BMC] Bo Xu, Changlong Li, Hang Zhuang, Jiali Wang, Qingfeng Wang, Chao Wang, Xuehai Zhou, Distributed Gene Clinical Decision Support System Based on Cloud Computing, BMC Medical Genomics.

[41]. [IJPP] Yuntao Lu, Chao Wang, Lei Gong, Xuehai Zhou: SparseNN: A Performance-Efficient Accelerator for Large-Scale Sparse Neural Networks. International Journal of Parallel Programming 46(4): 648-659 (2018)

[42]. [IJPP] Fan Sun, Chao Wang, Lei Gong, Yiwei Zhang, Chongchong Xu, Yuntao Lu, Xi Li, Xuehai Zhou: UniCNN: A Pipelined Accelerator Towards Uniformed Computing for CNNs. International Journal of Parallel Programming 46(4): 776-787 (2018)

[43]. [FPGA] Chongchong Xu, Chao Wang, Yiwei Zhang, Lei Gong, Xi Li and Xuehai Zhou, Domino: An Asynchronous and Energy-efficient Accelerator for Graph Processing, in 26th ACM/SIGDA International Symposium on Field-Programmable Gate Arrays.

[44]. [ICWS] Chongchong Xu, Chao Wang, Lei Gong, Lihui Jin, Xi Li, Xuehai Zhou: Domino: Graph Processing Services on Energy-Efficient Hardware Accelerator. ICWS 2018: 274-281

[45]. [GLSVLSI] Yuming Cheng, Chao Wang, Yangyang Zhao, Xianglan Chen, Xuehai Zhou, Xi Li: MuDBN: An Energy-Efficient and High-Performance Multi-FPGA Accelerator for Deep Belief Networks. ACM Great Lakes Symposium on VLSI 2018: 435-438

[46]. [MICRO] Xuda Zhou, Zidong Du, Qi Guo, Chengsi Liu, Chao Wang, Xuehai Zhou, Ling Li, Tianshi Chen, Yunji Chen, "Cambricon-S: Addressing Irregularity in Sparse Neural Networks through a Cooperative Software/Hardware Approach", in Proceedings of the 51st IEEE/ACM International Symposium on Microarchitecture (MICRO'18), 2018.

[47]. [CODES+ISSS] Lei Gong, Chao Wang, Xi Li, Huaping Chen, Xuehai Zhou, MALOC: A Fully Pipelined FPGA Accelerator for Convolutional Neural Networks with All Layers Mapped On Chip, International Conference on Hardware/Software Codesign and System Synthesis (Best Paper Candidate).

2017:

[48]. [TPDS] Chao Wang, Xi Li, Yunji Chen, Youhui Zhang, Oliver Diessel, Xuehai Zhou: Service-oriented Architecture on FPGA-based MPSoC, IEEE Transactions on Parallel and Distributed Systems.

[49]. [TCBB] Chao Wang, Xi Li, Dong Dai, Aili Wang, and Xuehai Zhou, Accelerating Computation of Large Biological Datasets using MapReduce Framework, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[50]. [TCAD] Chao Wang, Lei Gong, Qi Yu, Xi Li, Yuan Xie, Xuehai Zhou, DLAU: A Scalable Deep Learning Accelerator Unit on FPGA, IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems. 36(3): 513-517 (2017)

[51]. [TSC] Chao Wang, Xi Li, Aili Wang and Xuehai Zhou, A Classroom Scheduling Service for Smart Classes, IEEE Transactions on Services Computing.

[52]. [JSS] Chao Wang, Xi Li, Huizhen Zhang, Aili Wang, and Xuehai Zhou, HOTISE: Hot Spots Profiling and Dataflow Analysis in Custom Dataflow Computing SoftProcessors, Journal of Systems and Software.

[53]. [DATE] Maohua Zhu, Youwei Zhuo, Chao Wang, Wenguang Chen and Yuan Xie, Performance Evaluation and Optimization of HBM-Enabled GPU for Data-intensive Applications, Design, Automation and Test in Europe, 2017.

[54]. [ICWS] Chao Wang, Jinhong Zhou, Lei Gong, Xi Li, Aili Wang, and Xuehai Zhou, xFilter: A Temporal Locality Accelerator for Intrusion Detection System Services, 24th IEEE International Conference on Web Services (Research Track).

[55]. [ICWS] Chao Wang, Haijie Fang, Shiming Lei, Lei Gong, Aili Wang, Xi Li, and Xuehai Zhou, GenServ: Genome Sequencing Services on Scalable Energy Efficient Accelerators, 24th IEEE International Conference on Web Services.

[56]. [ICWS] Changlong Li, Hang Zhuang, Bo Xu, Jiali Wang, Chao Wang, and Xuehai Zhou, Light Weight Key-Value Store for Efficient Services on Local Distributed Mobile Devices, 24th IEEE International Conference on Web Services (Research Track).

[57]. [ICWS] Hang Zhuang, Chao Wang, Changlong Li, Qingfeng Wang, and Xuehai Zhou, Natural Language Processing Service Based on Stroke-level Convolutional Networks for Chinese Text Classification, 24th IEEE International Conference on Web Services.

[58]. [ICWS] Chongchong Xu, Jinhong Zhou, Yuntao Lu, Fan Sun, Lei Gong, Chao Wang, Xi Li, and Xuehai Zhou, Evaluation and Trade-offs of Graph Processing for Cloud Services, 24th IEEE International Conference on Web Services.

[59]. [Cluster] Chongchong Xu, Chao Wang, Lei Gong, Yuntao Lu, Fan Sun, Yiwei Zhang, Xi Li and Xuehai Zhou, OmniGraph: A Scalable Hardware Accelerator For Graph Processing, in IEEE Cluster Conference.

[60]. [Cluster] Yiwei Zhang, Chao Wang, Lei Gong, Yuntao Lu, Fan Sun, Chongchong Xu, Xi Li and Xuehai Zhou, A Power-Efficiency Accelerator Based on FPGAs for LSTM Network, in IEEE Cluster Conference.

[61]. [Cluster] Fan Sun, Chao Wang, Lei Gong, Chongchong Xu, Yiwei Zhang, Yuntao Lu, Xi Li and Xuehai Zhou, A Pipeline Power-efficient Accelerator for Convolutional Neural Networks, in IEEE Cluster Conference.

[62]. [RTSS] Bo Wan, Xi Li, Haizhao Luo, Chao Wang, Xianglan Chen, Xuehai Zhou, Work in Progress: TTI: A Timing ISA for LET Model in Safety-critical Systems. In Proceedings of IEEE Real-Time Systems Symposium (WiP).

[63]. [CyPhy] Chao Wang, Yuming Cheng, Lei Gong, Bo Wan, Aili Wang, Xi Li and Xuehai Zhou. FPGA based Big Data Accelerator Design in Teaching Computer Architecture and Organization, in 7th Workshop on Design, Modeling and Evaluation of Cyber Physical Systems (CyPhy'17), with ESWEEK’17.

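[64]. [CCCF] Chao Wang, Fan Sun, Xi Li, Xuehai Zhou, Neural Network Research Hotspots in the EDA Field (in Chinese), Communications of the CCF (中国计算机学会通讯), vol. 13, no. 7, pp. 65-70.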

2016:

[65]. [TPDS] Chao Wang, Xi Li, Junneng Zhang, Aili Wang, Xuehai Zhou: Hardware Implementation on FPGA for Task-level Parallel Dataflow Execution Engine, IEEE Transactions on Parallel and Distributed Systems.

[66]. [TVLSI] Qi Guo, Xi Li, Chao Wang, Xuehai Zhou: Evaluation and Tradeoffs for Out-of-Order Execution on Reconfigurable Heterogeneous MPSoC. IEEE Trans. VLSI Syst. 24(1): 79-91

[67]. [ASOC] Chao Wang, Xi Li, Xuehai Zhou, Aili Wang, Nadia Nedjah: Soft computing in big data intelligent transportation systems. Appl. Soft Computing. 38: 1099-1108.

[68]. [JSA] Beilei Sun, Xi Li, Bo Wan, Chao Wang, Xuehai Zhou, Xianglan Chen, Definitions of Predictability for Cyber Physical Systems, Journal of Systems Architecture.

[69]. [GLSVLSI] Jiachen Song, Xi Li, Beilei Sun, Zhinan Cheng, Chao Wang and Xuehai Zhou: FCM: Towards Fine-Grained GPU Power Management for Closed Source Mobile Games, in GLSVLSI 2016.

[70]. [ASAP] Zhinan Cheng, Xi Li, Jiachen Song, Beilei Sun, Xuehai Zhou and Chao Wang: Display Power Reduction for Mobile Closed-Source Games, 27th Annual IEEE International Conference on Application-specific Systems, Architectures and Processors.

[71]. [ICWS] Chao Wang, Xi Li, Qi Yu, Aili Wang, Patrick Hung, Xuehai Zhou: SOLAR: Services-oriented Learning Architectures, IEEE International Conference on Web Services.

[72]. [ICWS] Chao Wang, Xi Li, Jinhong Zhou, Aili Wang, Xuehai Zhou: FairPlay: Services Migration with Lock-free Mechanisms for Load Balancing in Cloud Architectures, IEEE International Conference on Web Services (Research Track).

[73]. [SPAA] Chao Wang, Xi Li, Aili Wang and Xuehai Zhou: Brief Announcement: MIC++: Accelerating Maximal Information Coefficient Calculation with GPUs and FPGAs, 28th ACM Symposium on Parallelism in Algorithms and Architectures.

[74]. [ICPADS] Yangyang Zhao, Qi Yu, Xuda Zhou, Xuehai Zhou, Chao Wang and Xi Li, PIE: A Pipeline Energy-efficient Accelerator for Inference Process in Deep Neural Networks, The 22nd IEEE International Conference on Parallel and Distributed Systems.

2015:

[75]. [TC] Chao Wang, Xi Li, Junneng Zhang, Peng Chen, Yunji Chen, Xuehai Zhou, Ray C.C. Cheung: Architecture Support for Task Out-of-order Execution in MPSoCs, IEEE Transactions on Computers.

[76]. [TCBB] Chao Wang, Xi Li, Peng Chen, Xuehai Zhou, Aili Wang and Hong Yu, “Heterogeneous Cloud Framework for Big Data Genome Sequencing”, IEEE/ACM Transactions on Computational Biology and Bioinformatics. Featured Spotlight Paper.

[77]. [TPDS] Shaoli Liu, Tianshi Chen, Ling Li, Xi Li, Mingzhe Zhang, Chao Wang, Haibo Meng, Xuehai Zhou, and Yunji Chen, "FreeRider: Non-local Adaptive Network-on-Chip Routing with Packet-Carried Propagation of Congestion Information", IEEE Transactions on Parallel and Distributed Systems.

[78]. [JPDC] Chao Wang, Xi Li, Peng Chen, and Xuehai Zhou, “A Case Study of Parallel JPEG Encoding on an FPGA,” Journal of Parallel and Distributed Computing.

[79]. [IJE] Chao Wang, Xi Li, Xuehai Zhou, Nadia Nedjah, Aili Wang, “Codem: A Software/Hardware Codesign Flow for Embedded Multicore Systems Supporting Hardware Services”, International Journal of Electronics.

[80]. [JCST] Chao Wang, Xi Li and Xuehai Zhou, “CRAIS: A Crossbar based Interconnection Scheme on FPGA for Big Data”, Journal of Computer Science and Technology.

[81]. [DATE] Chao Wang, Xi Li, Xuehai Zhou, SODA: Software Defined FPGA based Accelerators for Big Data, Design, Automation and Test in Europe, 2015. Best IP Paper Nomination.

[82]. [FPGA] Chao Wang, Xi Li, Qi Guo, Peng Chen and Xuehai Zhou, “RapidPath: Accelerating Constrained Shortest Path Finding in Graphs on FPGA”, FPGA 2015.

[83]. [CCGRID] Qi Yu, Chao Wang, Xiang Ma, Xi Li and Xuehai Zhou, “A Deep Learning accelerator based FPGA”, 15th IEEE/ACM International Symposium on Cluster, Cloud and Grid Computing.

[84]. [Cluster] Xiang Ma, Chao Wang, Qi Yu, Xi Li, Xuehai Zhou: An FPGA-Based Accelerator for Neighborhood-Based Collaborative Filtering Recommendation Algorithms. CLUSTER 2015: 494-495.

[85]. [ICA3PP] Fahui Jia, Chao Wang, Xi Li, Xuehai Zhou: SAKMA: Specialized FPGA-Based Accelerator Architecture for Data-Intensive K-Means Algorithms. ICA3PP (2) 2015: 106-119

2014:

[86]. [JSA] Chao Wang, Xi Li, Xiaojing Feng, Peng Chen, Xuehai Zhou, “Colored Petri Net Model with Automatic Parallelization on Real-Time Multicore Architectures”, Journal of Systems Architecture.

[87]. [TCBB] Peng Chen, Chao Wang, Xi Li, Xuehai Zhou, “Accelerating the Next Generation long read mapping with the FPGA-based system”, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[88]. [ARC] Chao Wang, Xi Li, Huizhen Zhang, Liang Shi, and Xuehai Zhou, “Instruction Extension and Generation for Adaptive Processors”, ARC 2014.

[89]. [FPGA] Chao Wang, Xi Li, Xuehai Zhou, Yunji Chen, Ray C.C. Cheung, “Big Data Genome Sequencing on Zynq based Clusters”, FPGA 2014.

[90]. [FPGA] Chao Wang, Xi Li, Xuehai Zhou, Yunji Chen, Koen Bertels, “Co-processing with Dynamic Reconfiguration on Heterogeneous MPSoC: Practices and Design Tradeoffs”, FPGA 2014.

[91]. [ISIC] Peng Chen, Chao Wang, Xi Li, Xuehai Zhou and Ray C.C. Cheung, Trade-offs between the Sensitivity and the Speed of the FPGA-based Sequence Aligner, ISIC 2014.

2013:

[92]. [TACO] Chao Wang, Xi Li, Junneng Zhang, Xuehai Zhou, Xiaoning Nie. “MP-Tomasulo: a Dependency-aware Automatic Parallel Execution Engine for Sequential Programs”. ACM Transactions on Architecture and Code Optimization.

[93]. [SIGBED Review] Chao Wang, Xi Li, Xuehai Zhou, “Heterothread: Hybrid Thread Level Parallelism on Heterogeneous Multicore Architectures”. ACM SIGBED Review.

[94]. [ISCAS] Junneng Zhang, Chao Wang, Xi Li, Xuehai Zhou. “FPGA Implementation of a Scheduler Supporting Parallel Dataflow Execution”, ISCAS 2013.

[95]. [FPGA] Chao Wang, Xi Li, Xuehai Zhou, Jim Martin and Ray Cheung. “Genome Sequencing Using MapReduce on FPGA with Multiple Hardware Accelerators”. FPGA 2013.

[96]. [FPGA] Chao Wang, Xi Li, Huizhen Zhang, Jinsong Ji and Xuehai Zhou. “Custom Instruction Generation and Mapping for Reconfigurable Instruction Set Processors”. FPGA 2013.

[97]. [FPT] Peng Chen, Chao Wang, Xi Li, Xuehai Zhou: Hardware acceleration for the banded Smith-Waterman algorithm with the cycled systolic array. FPT 2013: 480-481

[98]. [ICA3PP] Gangyong Jia, Xi Li, Jian Wan, Chao Wang, Dong Dai, Congfeng Jiang: Coordinate Task and Memory Management for Improving Power Efficiency. ICA3PP 2013: 267-278

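[99]. [CCCF] Xi Li, Xianglan Chen, Chao Wang, Xuehai Zhou, Heterogeneous Computing Needs New Operating System Abstractions (in Chinese), Communications of the CCF (中国计算机学会通讯), vol. 9, no. 11, 2013.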

2012:

[100]. [TJS] Chao Wang, Xi Li, Junneng Zhang, Xuehai Zhou, Aili Wang, “A Star Network Approach in Heterogeneous Multi Processors System on Chip”, Journal of Supercomputing.

[101]. [FPT] Chao Wang, Xi Li, Xuehai Zhou and Yajun Ha. “Parallel Dataflow Execution for Sequential Programs on Reconfigurable Hybrid MPSoCs”. FPT 2012, pp.53-56.

[102]. [FPT] Junneng Zhang, Chao Wang, Xi Li, Xuehai Zhou, “A Task-Level OoO Framework for Heterogeneous Systems”, FPT 2012, pp.33-36.

[103]. [FPL] Chao Wang, Xi Li, Peng Chen, Xuehai Zhou. “CaaS: Core as a Service Bring SOA to Reconfigurable MPSoC for High level Parallelization”, FPL 2012, pp.495-498.

[104]. [RAW] Chao Wang, Peng Chen, Xi Li, Xiaojing Feng, Xuehai Zhou. “Detecting Data Hazards in Multi-Processor System-on-Chips on FPGA”, RAW 2012, pp.282-287.

[105]. [RAW] Chao Wang, Peng Chen, Xi Li, Xiaojing Feng, Xuehai Zhou. “FPM: A Flexible Programming Model for MPSoCs”, RAW 2012, pp.4770-484.

[106]. [ARC] Chao Wang, Xi Li, Xiaojing Feng and Xuehai Zhou. “An Approach of Reconfigurable Network of Heterogeneous MPSoC”. ARC 2012, pp.379-384.

[107]. [ICPADS] Gangyong Jia, Xi Li and Chao Wang, “Behavior Aware Data Locality for Caches”, ICPADS 2012, pp.514-521.

[108]. [Cluster] Chao Wang, Xi Li, Dong Dai, Gangyong Jia, Xuehai Zhou, “Phase Detection for Loop-based Programs on Multicore Architectures”, IEEE Cluster 2012, pp.584-587.

[109]. [ICA3PP] Chunsheng Li, Xuehai Zhou, Fangling Zeng, Chao Wang, “A Dependency Aware Task Partitioning and Scheduling Algorithm for HW/SW Codesign on MPSoCs”. ICA3PP 2012, pp.332-346.

[110]. [MASCOTS] Chao Wang, Xi Li, Peng Chen, Xiaojing Feng, Xuehai Zhou. “Analyzing and Extending Amdahl’s Law in Heterogeneous on-chip Clusters”, MASCOTS 2012, pp.489-491.

[111]. [MASCOTS] Gangyong Jia, Xi Li, Chao Wang, Xuehai Zhou. “Frequency Affinity: Analyzing and Maximizing Power Efficiency in Multi-core Systems”, MASCOTS 2012, pp.495-497.

[112]. [SCC] Chao Wang, Xi Li, Xuehai Zhou. “Regarding Processors and Reconfigurable IP Cores as Services”. IEEE SCC 2012, pp.668-669.

2011:

[113]. [SCC] Chao Wang, Xuehai Zhou, Junneng Zhang, Xiaojing Feng and Xiaoning Nie. “SOMP: Services-Oriented Multi Processors”, IEEE SCC 2011, pp.709-716.

[114]. [ISPA] Chao Wang, Junneng Zhang, Xuehai Zhou, Xiaojing Feng and Aili Wang, “A Flexible High Speed Star Network Based on Peer to Peer Links on FPGA”, ISPA 2011, pp.107-112.

[115]. [ISPA] Chao Wang, Huizhen Zhang, Xuehai Zhou, Jinsong Ji, "Tool Chain Support with Dynamic Profiling for RISP", ISPA 2011, pp.155-160.