Selected
Publications:
For other papers listed below but
not included in ACM or IEEE digital library, welcome to email me for a copy
of PDF file. Books: [1]. High
Performance Computing for Big Data: Methodologies and Applications,
published by Chapman and Hall/CRC. [2]. Reconfigurable
and Adaptive Computing: Theory and Applications, published by Chapman and Hall/CRC. [3]. [TPDS] Changlong Li,
Yu Liang, Liang Shi, Chao Wang, Chun Jason Xue,
Xuehai Zhou: Flexible and Efficient Memory Swapping Across Mobile
Devices With LegoSwap. IEEE Trans.
Parallel Distributed Syst. 35(1): 140-153 (2024) 2023: [4]. [TC] Lei Gong, Chao Wang, Haojun Xia, Xianglan Chen, Xi
Li, Xuehai Zhou: Enabling Fast and Memory-Efficient Acceleration
for Pattern Matching Workloads: The Lightweight Automata Processing Engine.
IEEE Trans. Computers 72(4): 1011-1025 (2023) [5]. [TCAD] Yingxue Gao, Lei Gong, Chao
Wang, Teng Wang, Xi Li, Xuehai Zhou: Algorithm/Hardware
Co-Optimization for Sparsity-Aware SpMM
Acceleration of GNNs. IEEE Trans. Comput.
Aided Des. Integr. Circuits Syst. 42(12): 4763-4776
(2023) [6]. [DATE 2023] Yingxue Gao, Teng Wang, Lei Gong, Chao Wang, Xi Li, Xuehai
Zhou: FastRW: A Dataflow-Efficient and Memory-Aware
Accelerator for Graph Random Walk on FPGAs. DATE 2023: 1-6 [7]. [DATE 2023] Wenqi Lou, Jiaming Qian, Lei Gong, Xuan Wang, Chao Wang, Xuehai
Zhou: NAF:
Deeper Network/Accelerator Co-Exploration for Customizing CNNs on FPGA. DATE
2023: 1-6 [8]. [FPGA 2023] Xuan Wang, Lei Gong, Jing Cao, Wenqi Lou, Weiya Wang, Chao
Wang, Xuehai Zhou: hAP: A Spatial-von
Neumann Heterogeneous Automata Processor with Optimized Resource and IO
Overhead on FPGA. FPGA 2023: 185-196 2022: [9]. [TC]Wenqi Lou, Lei Gong, Chao Wang, Zidong
Du, Xuehai Zhou: OctCNN: A High Throughput FPGA
Accelerator for CNNs Using Octave Convolution Algorithm. IEEE
Trans. Computers 71(8): 1847-1859 (2022)
[10].
[TC]Yuanbo Wen, Qi Guo, Zidong Du, Jianxing Xu, Zhenxing Zhang, Xing Hu, Wei Li, Rui Zhang, Chao Wang, Xuehai
Zhou, Tianshi Chen: Enabling One-Size-Fits-All Compilation
Optimization for Inference Across Machine Learning Computers. IEEE
Trans. Computers 71(9): 2313-2326 (2022)
[11].
[TCAD] Teng Wang, Lei Gong,
Chao Wang, Yang Yang, Yingxue Gao, Xuehai Zhou, Huaping Chen: ViA: A Novel
Vision-Transformer Accelerator Based on FPGA. IEEE Trans. Comput. Aided Des. Integr.
Circuits Syst. 41(11): 4088-4099 (2022)
[12].
[NEUCOM]Yang Yang, Chao Wang,
Lei Gong, Min Wu, Xuehai Zhou: Conv-inheritance: A hardware-efficient
method to compress convolutional neural networks for edge applications.
Neurocomputing 487: 172-180 (2022) [13].
[ICML] Yuanbo
Wen, Qi Guo, Qiang Fu, Xiaqing
Li, Jianxing Xu, Yanlin
Tang, Yongwei Zhao, Xing Hu, Zidong
Du, Ling Li, Chao Wang, Xuehai Zhou, Yunji Chen: BabelTower: Learning to
Auto-parallelized Program Translation. ICML 2022: 23685-23700 2021: [14]. [TC]
Chao Wang,
Lei Gong, Fahui Jia, Xuehai
Zhou, An
FPGA based Accelerator for Ubiquitous Clustering Applications with Custom
Instructions, IEEE Trans. Computers [15]. [TCAD]
Chao Wang, Lihui Jin, Lei Gong, Chongchong Xu, Yahui Hu, Luchao Tan, Xuehai Zhou,Tinker: A Middleware for Deploying Multiple
NN-based Applications on a Single Machine,IEEE
Transactions on Computer-Aided Design of Integrated Circuits and Systems [16]. [TSC]
Chao Wang,
Lei Gong, Xi Li, Qi Yu, Aili Wang, Patrick Hung,
and Xuehai Zhou, SOLAR: Services-oriented Deep Learning
Architectures, IEEE Transactions on Services Computing. [17]. [TCBB] Chao
Wang, Lei Gong, Shiming Lei, Haijie Fang, Aili Wang, Xi Li,
and Xuehai Zhou, GenSeq+: A Scalable
High-Performance Accelerator for Genome Sequencing, IEEE/ACM
Transactions on Computational Biology and Bioinformatics 18(4): 1512-1523
(2021) [18]. [TSC] Changlong Li ; Hang
Zhuang ; Qingfeng Wang ; Chao Wang ; Zhou Xuehai, LKSM: Light Weight Key-Value Store for Efficient
Application Services on Local Distributed Mobile Devices, IEEE
Transactions on Services Computing 14(4): 1026-1039 (2021) [19]. [TPDS]Lei Gong, Chao Wang, Xi Li, Xuehai Zhou: Improving
HW/SW Adaptability for Accelerating CNNs on FPGAs Through A Dynamic/Static
Co-Reconfiguration Approach. IEEE Trans. Parallel Distributed
Syst. 32(7): 1854-1865 (2021) [20]. [TC]Wenqi Lou, Lei Gong, Chao Wang, Zidong
Du, Xuehai Zhou, OctCNN: A High
Throughput FPGA Accelerator for CNNs using Octave Convolution Algorithm, IEEE
Trans. Computers71(8): 1847-1859 (2022) [21]. [TIE]Qing Xu, Zhenghua
Chen, Keyu Wu, Chao
Wang, Min Wu, Xiaoli Li, KDnet-RUL: A
Knowledge Distillation Framework to Compress Deep Neural Networks for Machine
Remaining Useful Life Prediction, IEEE Transactions on Industrial
Electronics [22]. [DATE]Haojun Xia, Lei Gong, Chao Wang, Xianglan
Chen, Xuehai Zhou: LAP: A Lightweight Automata Processor
for Pattern Matching Tasks. DATE 2021: 844-849 [23]. [ICCD]Xuan Wang, Lei Gong, Chao Wang, Xi Li and Xuehai Zhou, UH-JLS: A Parallel Ultra-High Throughput JPEG-LS
Encoding Architecture for Lossless Image Compression, International
Conference on Computer Design 2021: 335-343 [24]. [CCF
THPC]Haoyu Cai, Chao Wang, Xuehai Zhou: Deployment and
verification of machine learning tool-chain based on kubernetes
distributed clusters. CCF Trans. High Perform. Comput.
3(2): 157-170 (2021) 2020: [25]. [TC]Chao
Wang, Lei
Gong, Xiang Ma, Xi Li, Xuehai Zhou, WooKong: A Ubiquitous Accelerator for Recommendation
Algorithms with Custom Instruction Sets on FPGA, IEEE Trans.
Computers. 69(7): 1071-1082 (2020) [26]. [TC] Xi Zeng, Tian Zhi, Xuda Zhou, Zidong Du, Qi Guo, Shaoli Liu, Chao Wang, Ling Li, Xuehai Zhou, Tianshi Chen, Yunji Chen, Addressing Irregularity in Sparse Neural Networks
through a Cooperative Software/Hardware Approach, IEEE Trans.
Computers. 69(7): 968-985 (2020) [27]. [TPDS]Chao
Wang, Lei
Gong, Xiang Ma, Xi Li, Xuehai Zhou, A Ubiquitous
Machine Learning Accelerator with Automatic Parallelization on FPGA, IEEE
Transactions on Parallel and Distributed Systems. 31(10): 2346-2359 (2020) [28]. [TCAD]
Xuan Wang, Chao Wang, Jing Cao, Lei Gong, Xuehai Zhou: WinoNN: Optimising
FPGA-based Neural Network Accelerators using Sparse Winograd Algorithm.
IEEE Transactions on Computer-Aided Design of Integrated Circuits and Systems
(CODES+ISSS 2020 Journal Track). [29]. [Cluster] Wenqi
Lou, Chao Wang, Lei Gong, Xuehai Zhou: OctCNN: An Energy-Efficient
FPGA Accelerator for CNNs using Octave Convolution Algorithm, IEEE
Cluster 2020. [30]. [计算机学报] 王超,王腾,马翔,周学海,基于FPGA的机器学习硬件加速研究进展,Chinese
Journal of Computers (计算机学报),第43卷,第6期,2020. [31]. [软件学报] 娄文启,王超,宫磊,周学海,一种神经网络指令集扩展与代码映射机制,Journal of Software (软件学报),第31卷,第10期,2020. 2019: [32]. [TODAES]Bo Wan, Xi Li, Bo Zhang, Caixu Zhao, Xianglan Chen, Chao Wang, and Xuehai Zhou, DCW: A Reactive and Predictable Programming Framework for LET-based Distributed Real-time Systems, ACM Transactions on Design Automation of Electronic Systems. [33]. [APPT]Wenqi Lou, Chao Wang, Lei Gong, Xuehai Zhou: RV-CNN:
Flexible and Efficient Instruction Set for CNNs Based on RISC-V Processors.
APPT 2019: 3-14 [34]. [BIBM]
Qingfeng Wang, Jun Huang, Zhiqin Liu, Jie-Zhi Cheng, Ying
Zhou, Qiyu Liu, Yaobin
Wang, Xuehai Zhou, Chao Wang: Higher-order Transfer Learning for Pulmonary Nodule
Attribute Prediction in Chest CT Images. BIBM 2019: 741-745 [35]. [Cluster]
Teng Wang,
Lei Gong, Chao Wang, Xuehai Zhou, Huaping Chen: Design
Exploration of Multi-FPGAs for Accelerating Deep Learning. CLUSTER
2019: 1-2 [36]. [CODES+ISSS]
Yang Yang, Chao Wang,
Xuehai Zhou: Drama: A high efficient neural network accelerator on
FPGA using dynamic reconfiguration: work-in-progress. CODES+ISSS
2019: 13:1-13:2 [37]. [FPT]
Yang Yang, Chao Wang,
Lei Gong, Xuehai Zhou: FPNet: Customized
Convolutional Neural Network for FPGA Platforms. FPT 2019: 399-402 2018: [38]. [TCAD] Lei Gong ; Chao Wang ; Xi
Li ; Huaping Chen ; Xuehai Zhou MALOC: A Fully Pipelined FPGA Accelerator for
Convolutional Neural Networks with All Layers Mapped On Chip, IEEE
Transactions on Computer-Aided Design of Integrated Circuits and Systems. [39]. [TVLSI]
Maohua Zhu, Youwei
Zhuo, Chao
Wang, Wenguang Chen, and Yuan Xie: Performance Evaluation and Optimization of HBM-Enabled
GPU for Data-intensive Applications, IEEE
Transactions on VLSI Systems. [40]. [BMC]Bo Xu, Changlong Li, Hang
Zhuang, Jiali Wang, Qingfeng Wang, Chao
Wang, Xuehai Zhou, Distributed
Gene Clinical Decision Support System Based on Cloud Computing,
BMC Medical Genomics. [41]. [IJPP]Yuntao Lu, Chao Wang, Lei Gong, Xuehai Zhou: SparseNN: A Performance-Efficient Accelerator for Large-Scale Sparse Neural Networks. International Journal of Parallel Programming 46(4): 648-659 (2018) [42]. [IJPP]Fan Sun, Chao Wang, Lei Gong, Yiwei Zhang, Chongchong
Xu, Yuntao Lu, Xi Li, Xuehai
Zhou: UniCNN: A Pipelined Accelerator Towards
Uniformed Computing for CNNs. International Journal of Parallel
Programming 46(4): 776-787 (2018) [43]. [FPGA]
Chongchong Xu, Chao
Wang, Yiwei Zhang, Lei Gong, Xi Li and Xuehai Zhou, Domino: An Asynchronous and Energy-efficient
Accelerator for Graph Processing, in 26th ACM/SIGDA International
Symposium on Field-Programmable Gate Arrays . [44]. [ICWS]Chongchong Xu, Chao Wang, Lei Gong, Lihui Jin, Xi Li, Xuehai Zhou: Domino: Graph Processing Services on Energy-Efficient Hardware Accelerator. ICWS 2018: 274-281 [45]. [GLSVLSI]Yuming Cheng, Chao Wang, Yangyang Zhao, Xianglan Chen, Xuehai Zhou, Xi Li: MuDBN: An Energy-Efficient and High-Performance Multi-FPGA Accelerator for Deep Belief Networks. ACM Great Lakes Symposium on VLSI 2018: 435-438 [46]. [MICRO] Xuda
Zhou, Zidong Du, Qi Guo, Chengsi
Liu, Chao Wang, Xuehai Zhou, Ling Li, Tianshi
Chen, Yunji Chen, "Cambricon-S:
Addressing Irregularity in Sparse Neural Networks through a Cooperative
Software/Hardware Approach", in Proceedings of the 51st IEEE/ACM International Symposium on Microarchitecture
(MICRO'18), 2018. [47]. [CODES+ISSS]Lei Gong ; Chao Wang ; Xi
Li ; Huaping Chen ; Xuehai Zhou MALOC: A Fully Pipelined FPGA Accelerator for
Convolutional Neural Networks with All Layers Mapped On Chip,
International Conference on Hardware/Software Codesign and System Synthesis {Best Paper Candidate}. 2017: [48].
[TPDS] Chao Wang, Xi Li, Yunji
Chen, Youhui Zhang, Oliver Diessel,
Xuehai Zhou: Service-oriented
Architecture on FPGA-based MPSoC, IEEE Transactions
on Parallel and Distributed Systems. [49].
[TCBB]Chao Wang,
Xi Li, Dong
Dai, Aili Wang, and Xuehai
Zhou, Accelerating
Computation of Large Biological Datasets using MapReduce Framework
IEEE/ACM Transactions on Computational Biology and Bioinformatics. [50].
[TCAD]Chao Wang, Lei Gong, Qi Yu, Xi Li, Yuan Xie, Xuehai Zhou, DLAU: A Scalable Deep Learning
Accelerator Unit on FPGA, IEEE Transactions on Computer-Aided Design of Integrated
Circuits and Systems. [51].
[TSC] Chao Wang, Xi Li, Aili Wang and Xuehai Zhou, A
Classroom Scheduling Service for Smart Classes, IEEE
Transactions on Services Computing. [52].
[JSS]Chao Wang, Xi Li, Huizhen
Zhang, Aili Wang, and Xuehai
Zhou, HOTISE: Hot Spots Profiling
and Dataflow Analysis in Custom Dataflow Computing SoftProcessors, Journal
of Systems and Software. [53].
[DATE]Maohua Zhu, Youwei
Zhuo, Chao
Wang, Wenguang Chen and Yuan Xie, Performance Evaluation and Optimization of HBM-Enabled
GPU for Data-intensive Applications, Design, Automation and Test
in Europe, 2017. [54].
[ICWS]Chao Wang, Jinhong
Zhou, Lei Gong, Xi Li, Aili Wang, and Xuehai Zhou, xFilter: A Temporal
Locality Accelerator for Intrusion Detection System Services, 24th
IEEE International Conference on Web Services (Research Track). [55].
[ICWS]Chao Wang, Haijie
Fang, Shiming Lei, Lei Gong, Aili
Wang, Xi Li, and Xuehai Zhou, GenServ: Genome
Sequencing Services on Scalable Energy Efficient Accelerators, 24th
IEEE International Conference on Web Services. [56].
[ICWS]Changlong Li, Hang Zhuang, Bo
Xu, Jiali Wang, Chao Wang, and Xuehai Zhou, Light Weight
Key-Value Store for Efficient Services on Local Distributed Mobile Devices, 24th
IEEE International Conference on Web Services (Research Track). [57].
[ICWS] Hang
Zhuang, Chao Wang, Changlong Li, Qingfeng Wang, and Xuehai Zhou,
Natural
Language Processing Service Based on Stroke-level Convolutional Networks for
Chinese Text Classification, 24th IEEE International Conference on
Web Services. [58]. [ICWS]Chongchong Xu, Jinhong
Zhou, Yuntao Lu, Fan Sun, Lei Gong, Chao Wang, Xi Li, and Xuehai Zhou, Evaluation and Trade-offs of Graph Processing for Cloud
Services, 24th IEEE International Conference on Web Services. [59]. [Cluster] Chongchong
Xu, Chao Wang, Lei Gong, Yuntao Lu, Fan Sun, Yiwei
Zhang, Xi Li and Xuehai Zhou, OmniGraph: A Scalable
Hardware Accelerator For Graph Processing, in IEEE Cluster
Conference. [60]. [Cluster] Yiwei
Zhang, Chao Wang, Lei Gong, Yuntao Lu, Fan Sun, Chongchong
Xu, Xi Li and Xuehai Zhou, A Power-Efficiency Accelerator Based on
FPGAs for LSTM Network, in IEEE Cluster Conference. [61]. [Cluster] Fan Sun, Chao Wang, Lei Gong, Chongchong Xu, Yiwei Zhang, Yuntao Lu, Xi Li
and Xuehai Zhou, A Pipeline Power-efficient Accelerator
for Convolutional Neural Networks, in IEEE Cluster Conference. [62]. [RTSS]
Bo Wan, Xi
Li, Haizhao Luo, Chao Wang, Xianglan Chen, Xuehai Zhou, Working In Progress: TTI: A Timing ISA for LET Model in
Safety-critical Systems. In Proceedings of IEEE Real-Time Systems
Symposium{WiP}. [63]. [CyPhy] Chao Wang, Yuming Cheng,
Lei Gong, Bo Wan, Aili Wang, Xi Li and Xuehai Zhou. FPGA
based Big Data Accelerator Design in Teaching Computer Architecture and
Organization, in 7th Workshop on Design, Modeling and Evaluation
of Cyber Physical Systems (CyPhy'17), with ESWEEK’17. [64]. [CCCF]
王超,孙凡,李曦,周学海,EDA领域的神经网络研究热点,中国计算机学会通讯,第13卷,第7期,65-70. 2016: [65].
[TPDS] Chao Wang, Xi Li, Junneng Zhang, Aili Wang, Xuehai Zhou: Hardware Implementation
on FPGA for Task-level Parallel Dataflow Execution Engine,
IEEE Transactions on Parallel and Distributed Systems. [66].
[TVLSI] Qi Guo, Xi Li, Chao Wang, Xuehai
Zhou: Evaluation
and Tradeoffs for Out-of-Order Execution on Reconfigurable Heterogeneous MPSoC. IEEE Trans. VLSI Syst. 24(1): 79-91 [67].
[ASOC] Chao Wang, Xi Li, Xuehai
Zhou, Aili Wang, Nadia Nedjah:
Soft
computing in big data intelligent transportation systems. Appl.
Soft Computing. 38: 1099-1108. [68].
[JSA] Beilei
Sun, Xi Li, Bo Wan, Chao Wang, Xuehai Zhou, Xianglan Chen, Definitions
of Predictability for Cyber Physical Systems,
Journal of Systems Architecture. [69].
[GLSVLSI] Jiachen Song, Xi Li, Beilei Sun, Zhinan Cheng, Chao Wang and Xuehai
Zhou: FCM:
Towards Fine-Grained GPU Power Management for Closed Source Mobile Games,
in GLSVLSI 2016. [70].
[ASAP] Zhinan
Cheng, Xi Li, Jiachen Song, Beilei
Sun, Xuehai Zhou and Chao Wang: Display Power Reduction for Mobile Closed-Source Games,
27th Annual IEEE International Conference on Application-specific
Systems, Architectures and Processors. [71].
[ICWS] Chao Wang, Xi Li, Qi Yu, Aili Wang, Patrick Hung, Xuehai
Zhou: SOLAR:
Services-oriented Learning Architectures, IEEE
International Conference on Web Services. [72].
[ICWS] Chao Wang, Xi Li, Jinhong
Zhou, Aili Wang, Xuehai
Zhou: FairPlay: Services Migration with Lock-free Mechanisms for Load
Balancing in Cloud Architectures, IEEE
International Conference on Web Services (Research Track). [73].
[SPAA] Chao Wang, Xi Li, Aili
Wang and Xuehai Zhou: Brief
Announcement: MIC++: Accelerating Maximal Information Coefficient Calculation
with GPUs and FPGAs, 28th ACM Symposium on
Parallelism in Algorithms and Architectures. [74].
[ICPADS]Yangyang Zhao, Qi Yu, Xuda Zhou, Xuehai Zhou, Chao Wang and Xi Li, PIE: A
Pipeline Energy-efficient Accelerator for Inference Process in Deep Neural
Networks, The 22nd IEEE International Conference on Parallel and
Distributed Systems 2015: [75]. [TC]
Chao Wang, Xi
Li, Junneng Zhang, Peng Chen, Yunji
Chen, Xuehai Zhou, Ray C.C. Cheung: Architecture Support for
Task Out-of-order Execution in MPSoCs, IEEE Transactions on
Computers. [76]. [TCBB]
Chao Wang, Xi
Li, Peng Chen, Xuehai Zhou, Aili Wang and Hong
Yu, “Heterogeneous Cloud
Framework for Big Data Genome Sequencing”, IEEE/ACM
Transactions on Computational Biology and Bioinformatics. Featured
Spotlight Paper. [77]. [TPDS] Shaoli
Liu, Tianshi Chen, Ling Li, Xi Li, Mingzhe Zhang, Chao
Wang, Haibo Meng, Xuehai
Zhou, and Yunji Chen, "FreeRider:
Non-local Adaptive Network-on-Chip Routing with Packet-Carried Propagation of
Congestion Information",
IEEE Transactions on Parallel and Distributed Systems. [78]. [JPDC] Chao Wang, Xi Li, Peng Chen, and Xuehai
Zhou, “A
Case Study of Parallel JPEG Encoding on an FPGA,” Journal of
Parallel and Distributed Computing. [79]. [IJE] Chao Wang, Xi Li, Xuehai Zhou, Nadia Nedjah, Aili Wang, “Codem: A Software/Hardware
Codesign Flow for Embedded Multicore Systems Supporting Hardware Services”,
International Journal of Electronics. [80].
[JCST] Chao Wang, Xi Li and Xuehai Zhou, “CRAIS: A
Crossbar based Interconnection Scheme on FPGA for Big Data”,
Journal of Computer Science and Technology. [81].
[DATE] Chao Wang, Xi Li, Xuehai Zhou, SODA: Software Defined
FPGA based Accelerators for Big Data, Design, Automation and
Test in Europe, 2015. Best IP Paper Nomination.
[82].
[FPGA] Chao Wang, Xi Li, Qi Guo, Peng Chen and Xuehai
Zhou, “RapidPath: Accelerating Constrained Shortest Path
Finding in Graphs on FPGA”, FPGA 2015 . [83].
[CCGRID] Qi Yu, Chao Wang, Xiang Ma, Xi Li and Xuehai
Zhou, “A Deep
Learning accelerator based FPGA” 15th IEEE/ACM International
Symposium on Cluster, Cloud and Grid Computing. [84].
[Cluster] Xiang Ma, Chao Wang, Qi Yu, Xi Li, Xuehai Zhou: An
FPGA-Based Accelerator for Neighborhood-Based Collaborative Filtering
Recommendation Algorithms. CLUSTER 2015: 494-495. [85].
[ICA3PP] Fahui Jia, Chao Wang, Xi Li, Xuehai Zhou: SAKMA:
Specialized FPGA-Based Accelerator Architecture for Data-Intensive K-Means
Algorithms. ICA3PP (2) 2015: 106-119 2014: [86].
[JSA] Chao Wang, Xi Li, Xiaojing Feng, Peng Chen, Xuehai
Zhou,“Colored Petri Net Model with Automatic Parallelization on Real-Time Multicore Architectures”,Journal
of Systems Architecture. [87].
[TCBB]
Peng Chen, Chao Wang, Xi
Li, Xuehai Zhou, “Accelerating the Next Generation long read
mapping with the FPGA-based system”, IEEE/ACM Transactions on
Computational Biology and Bioinformatics. [88].
[ARC] Chao Wang, Xi Li, Huizhen Zhang,
Liang Shi, and Xuehai Zhou, “Instruction Extension and Generation
for Adaptive Processors”, ARC 2014. [89].
[FPGA] Chao Wang, Xi Li, Xuehai Zhou, Yunji Chen, Ray C.C. Cheung, “Big Data Genome Sequencing on Zynq based Clusters”, FPGA 2014 . [90].
[FPGA]
Chao Wang, Xi Li,
Xuehai Zhou, Yunji Chen,
Koen Bertels, “Co-processing with Dynamic Reconfiguration on
Heterogeneous MPSoC: Practices and Design
Tradeoffs”, FPGA 2014 . [91].
[ISIC] Peng Chen, Chao
Wang, Xi Li, Xuehai Zhou and Ray C.C. Cheung, Trade-offs between the Sensitivity and the Speed of the
FPGA-based Sequence Aligner, ISIC 2014. 2013: [92].
[TACO]
Chao Wang, Xi
Li, Junneng Zhang, Xuehai
Zhou, Xiaoning Nie. “MP-Tomasulo: a Dependency-aware
Automatic Parallel Execution Engine for Sequential Programs”.
ACM Transactions on Architecture and Code Optimization. [93].
[SIGBED
Review] Chao Wang,
Xi Li, Xuehai Zhou, “Heterothread: Hybrid Thread Level Parallelism on Heterogeneous
Multicore Architectures”. ACM SIGBED Review. [94]. [ISCAS] Junneng Zhang, Chao Wang, Xi Li, Xuehai Zhou. “FPGA Implementation of a Scheduler Supporting Parallel Dataflow Execution”, ISCAS 2013. [95]. [FPGA] Chao Wang, Xi Li, Xuehai Zhou, Jim Martin and Ray Cheung. “Genome Sequencing Using MapReduce on FPGA with Multiple Hardware Accelerators”. FPGA 2013. [96]. [FPGA] Chao Wang, Xi Li, Huizhen Zhang, Jinsong Ji and Xuehai Zhou. “Custom Instruction Generation and Mapping for Reconfigurable Instruction Set Processors”. FPGA 2013. [97].
[FPT]Peng Chen, Chao
Wang, Xi Li, Xuehai Zhou: Hardware acceleration
for the banded Smith-Waterman algorithm with the cycled systolic array. FPT
2013: 480-481 [98].
[ICA3PP] Gangyong Jia, Xi Li, Jian Wan, Chao Wang, Dong Dai, Congfeng Jiang: Coordinate Task and Memory Management
for Improving Power Efficiency. ICA3PP 2013: 267-278 [99]. [CCCF] 李曦,陈香兰,王超,周学海,异构计算需要新的操作系统抽象,中国计算机学会通讯,第九卷,第11期,2013. 2012: [100].
[TJS]
Chao Wang, Xi Li, Junneng Zhang,
Xuehai Zhou, Aili Wang, “ A Star Network
Approach in Heterogeneous Multi Processors System on Chip”, Journal
of Supercomputing. [101].
[FPT] Chao
Wang, Xi Li, Xuehai Zhou and Yajun Ha. “Parallel
Dataflow Execution for Sequential Programs on Reconfigurable Hybrid MPSoCs”. FPT 2012.53-56.
[102]. [FPT] Junneng Zhang, Chao Wang, Xi Li, Xuehai Zhou,”A Task-Level OoO Framework for Heterogeneous Systems”, FPT 2012,pp.33-36. [103]. [FPL] Chao Wang, Xi Li, Peng Chen, Xuehai Zhou. “CaaS: Core as a Service Bring SOA to Reconfigurable MPSoC for High level Parallelization,” FPL 2012.pp.495-498. [104].
[RAW] Chao Wang, Peng Chen, Xi Li, Xiaojing Feng, Xuehai Zhou. “Detecting Data
Hazards in Multi-Processor System-on-Chips on FPGA “, RAW 2012,
pp.282-287. [105].
[RAW] Chao Wang, Peng Chen, Xi Li, Xiaojing Feng, Xuehai Zhou. “FPM: A Flexible
Programming Model for MPSoCs “, RAW 2012
pp. 4770-484. [106].
[ARC] Chao Wang, Xi Li, Xiaojing
Feng and Xuehai Zhou. “An Approach of Reconfigurable Network of
Heterogeneous MPSoC”. ARC 2012,
pp.379-384. [107].
[ICPADS] Gangyong Jia, Xi Li and
Chao Wang,” Behavior Aware Data
Locality for Caches”, ICPADS 2012,pp.514-521. [108].
[Cluster] Chao Wang, Xi Li, Dong Dai, Gangyong Jia, Xuehai Zhou, “Phase Detection for Loop-based Programs on Multicore
Architectures”, IEEE Cluster 2012.pp.584-587. [109].
[ICA3PP] Chunsheng
Li, Xuehai Zhou, Fangling
Zeng, Chao Wang, “A Dependency Aware Task Partitioning and Scheduling Algorithm
for HW/SW Codesign on MPSoCs”. ICA3PP 2012.pp.332-346. [110]. [MASCOTS] Chao Wang, Xi Li, Peng Chen, Xiaojing Feng, Xuehai Zhou. “Analyzing and Extending Amdahl’s Law in Heterogeneous on-chip Clusters”, MASCOTS 2012.pp.489-491. [111].
[MASCOTS] Gangyong Jia, Xi Li, Chao Wang, Xuehai
Zhou. “Frequency Affinity: Analyzing
and Maximizing Power Efficiency in Multi-core Systems”, MASCOTS
2012.pp.495-497. [112].
[SCC]
Chao Wang, Xi
Li, Xuehai Zhou. “Regarding Processors and Reconfigurable IP Cores as
Services”. IEEE SCC 2012. pp.668-669. 2011: [113]. [SCC]
Chao Wang,
[114]. [ISPA] Chao Wang, [115]. [ISPA]
Chao Wang, |