CSpace

浏览/检索结果: 共8条,第1-8条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
An Application-oblivious Memory Scheduling System for DNN Accelerators 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2022, 卷号: 19, 期号: 4, 页码: 26
作者:  Li, Jiansong;  Wang, Xueying;  Chen, Xiaobing;  Li, Guangli;  Dong, Xiao;  Zhao, Peng;  Yu, Xianzhi;  Yang, Yongxin;  Cao, Wei;  Liu, Lei;  Feng, Xiaobing
收藏  |  浏览/下载:14/0  |  提交时间:2023/07/12
Deep learning  memory scheduling  runtime system  DNN accelerators  
Scaling Poisson Solvers on Many Cores via MMEwald 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 8, 页码: 1888-1901
作者:  Wu, Mingchuan;  Wu, Yangjun;  Shang, Honghui;  Liu, Ying;  Cui, Huimin;  Li, Fang;  Duan, Xiaohui;  Zhang, Yunquan;  Feng, Xiaobing
收藏  |  浏览/下载:36/0  |  提交时间:2022/06/21
Optimization  Bandwidth  Supercomputers  Electric potential  Boundary conditions  Electrostatics  Silicon  Poisson solver  architecture-specific optimizations  many-core processor  
Optimizing deep neural networks on intelligent edge accelerators via flexible-rate filter pruning 期刊论文
JOURNAL OF SYSTEMS ARCHITECTURE, 2022, 卷号: 124, 页码: 11
作者:  Li, Guangli;  Ma, Xiu;  Wang, Xueying;  Yue, Hengshan;  Li, Jiansong;  Liu, Lei;  Feng, Xiaobing;  Xue, Jingling
收藏  |  浏览/下载:25/0  |  提交时间:2022/12/07
Edge intelligence  Deep learning  Neural network compression  
Compiler-assisted Operator Template Library for DNN Accelerators 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2021, 页码: 18
作者:  Li, Jiansong;  Cao, Wei;  Dong, Xiao;  Li, Guangli;  Wang, Xueying;  Zhao, Peng;  Liu, Lei;  Feng, Xiaobing
收藏  |  浏览/下载:36/0  |  提交时间:2021/12/01
DNN Accelerators  Template Library  Address Space Management  
Fusion-Catalyzed Pruning for Optimizing Deep Learning on Intelligent Edge Devices 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 卷号: 39, 期号: 11, 页码: 3614-3626
作者:  Li, Guangli;  Ma, Xiu;  Wang, Xueying;  Liu, Lei;  Xue, Jingling;  Feng, Xiaobing
收藏  |  浏览/下载:47/0  |  提交时间:2021/12/01
Deep learning system  edge intelligence  model compression and acceleration  neural networks  
ElasticActor: An Actor System with Automatic Granularity Adjustment 期刊论文
INTERNATIONAL JOURNAL OF PARALLEL PROGRAMMING, 2019, 卷号: 47, 期号: 3, 页码: 520-534
作者:  Zhao, Peng;  Liu, Lei;  Cao, Wei;  Dong, Xiao;  Li, Jiansong;  Feng, Xiaobing
收藏  |  浏览/下载:260/0  |  提交时间:2019/08/16
Actor model  Concurrency granularity  Cloud computing  Performance optimization  
Articulation Points Guided Redundancy Elimination for Betweenness Centrality 期刊论文
ACM SIGPLAN NOTICES, 2016, 卷号: 51, 期号: 8, 页码: 73-86
作者:  Wang, Lei;  Yang, Fan;  Zhuang, Liangji;  Cui, Huimin;  Lv, Fang;  Feng, Xiaobing
收藏  |  浏览/下载:55/0  |  提交时间:2019/12/12
Algorithms  Performance  Partial Redundancy Elimination  Parallelism  Betweenness Centrality  
Practical Iterative Optimization for the Data Center 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2015, 卷号: 12, 期号: 2, 页码: 26
作者:  Fang, Shuangde;  Xu, Wenwen;  Chen, Yang;  Eeckhout, Lieven;  Temam, Olivier;  Chen, Yunji;  Wu, Chengyong;  Feng, Xiaobing
收藏  |  浏览/下载:49/0  |  提交时间:2019/12/13
Design  Performance  Iterative optimization  compiler  MapReduce  server  data center  co-run