CSpace

浏览/检索结果: 共101条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Improving Utilization of Dataflow Unit for Multi-Batch Processing 期刊论文
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2024, 卷号: 21, 期号: 1, 页码: 26
作者:  Fan, Zhihua;  Li, Wenming;  Wang, Zhen;  Yang, Yu;  Ye, Xiaochun;  Fan, Dongrui;  Sun, Ninghui;  An, Xuejun
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Utilization  network-on-chip  decoupled architecture  batch processing  
Frequency-Domain Inference Acceleration for Convolutional Neural Networks Using ReRAMs 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2023, 卷号: 34, 期号: 12, 页码: 3133-3146
作者:  Liu, Bosheng;  Jiang, Zhuoshen;  Wu, Yalan;  Wu, Jigang;  Chen, Xiaoming;  Liu, Peng;  Zhou, Qingguo;  Han, Yinhe
收藏  |  浏览/下载:8/0  |  提交时间:2023/12/04
Frequency-domain accelerator  energy efficiency  resistive random access memory  frequency-domain convolutions  
Dadu-SV: Accelerate Stereo Vision Processing on NPU 期刊论文
IEEE EMBEDDED SYSTEMS LETTERS, 2022, 卷号: 14, 期号: 4, 页码: 191-194
作者:  Min, Feng;  Wang, Ying;  Xu, Haobo;  Huang, Junpei;  Wang, Yujie;  Zou, Xingqi;  Lu, Meixuan;  Han, Yinhe
收藏  |  浏览/下载:14/0  |  提交时间:2023/07/12
Hardware acceleration  neural computing  neural processing unit (NPU)  semiglobal matching (SGM)  stereo vision  
Accelerating Data Transfer in Dataflow Architectures Through a Look-Ahead Acknowledgment Mechanism 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2022, 卷号: 37, 期号: 4, 页码: 942-959
作者:  Feng, Yu-Jing;  Li, De-Jian;  Tan, Xu;  Ye, Xiao-Chun;  Fan, Dong-Rui;  Li, Wen-Ming;  Wang, Da;  Zhang, Hao;  Tang, Zhi-Min
收藏  |  浏览/下载:25/0  |  提交时间:2022/12/07
dataflow model  control-ow model  high-performance computing application  data transfer  power efficiency  
Search-Free Inference Acceleration for Sparse Convolutional Neural Networks 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 卷号: 41, 期号: 7, 页码: 2156-2169
作者:  Liu, Bosheng;  Chen, Xiaoming;  Han, Yinhe;  Wu, Jigang;  Chang, Liang;  Liu, Peng;  Xu, Haobo
收藏  |  浏览/下载:24/0  |  提交时间:2022/12/07
Internal interconnection  memory bandwidth  sparse accelerators  sparse convolution neural networks (CNNs)  
ShuntFlowPlus: An Efficient and Scalable Dataflow Accelerator Architecture for Stream Applications 期刊论文
ACM JOURNAL ON EMERGING TECHNOLOGIES IN COMPUTING SYSTEMS, 2021, 卷号: 17, 期号: 4, 页码: 24
作者:  Gong, Shijun;  Li, Jiajun;  Lu, Wenyan;  Yan, Guihai;  Li, Xiaowei
收藏  |  浏览/下载:21/0  |  提交时间:2022/12/07
Streaming processing  sliding-window aggregations  dataflow  buffer sharing  
EnGN: A High-Throughput and Energy-Efficient Accelerator for Large Graph Neural Networks 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2021, 卷号: 70, 期号: 9, 页码: 1511-1525
作者:  Liang, Shengwen;  Wang, Ying;  Liu, Cheng;  He, Lei;  Li, Huawei;  Xu, Dawen;  Li, Xiaowei
收藏  |  浏览/下载:37/0  |  提交时间:2021/12/01
Neural networks  Hardware  System-on-chip  Task analysis  Feature extraction  Memory management  Graph neural network  accelerator architecture  hardware acceleration  
An efficient scheduling algorithm for dataflow architecture using loop-pipelining 期刊论文
INFORMATION SCIENCES, 2021, 卷号: 547, 页码: 1136-1153
作者:  Li, Yi;  Wu, Meng;  Ye, Xiaochun;  Li, Wenming;  Xue, Rui;  Wang, Da;  Zhang, Hao;  Fan, Dongrui
收藏  |  浏览/下载:38/0  |  提交时间:2021/12/01
Dataflow architecture  Instruction scheduling  Multicast  Sharing path  Loop optimization  
Swallow: A Versatile Accelerator for Sparse Neural Networks 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2020, 卷号: 39, 期号: 12, 页码: 4881-4893
作者:  Liu, Bosheng;  Chen, Xiaoming;  Han, Yinhe;  Xu, Haobo
收藏  |  浏览/下载:28/0  |  提交时间:2021/12/01
Accelerator  convolutional (Conv) layers  fully connected (FC) layers  sparse neural networks (SNNs)  
An efficient dataflow accelerator for scientific applications 期刊论文
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, 卷号: 112, 页码: 580-588
作者:  Ye, Xiaochun;  Tan, Xu;  Wu, Meng;  Feng, Yujing;  Wang, Da;  Zhang, Hao;  Pei, Songwen;  Fan, Dongrui
收藏  |  浏览/下载:218/0  |  提交时间:2020/12/10
Dataflow architecture  Scientific computing  Instruction level parallelism