CSpace

浏览/检索结果: 共4条,第1-4条 帮助

已选(0)清除 条数/页:   排序方式:
Efficient and Fast High-Performance Library Generation for Deep Learning Accelerators 期刊论文
IEEE TRANSACTIONS ON COMPUTERS, 2025, 卷号: 74, 期号: 1, 页码: 155-169
作者:  Bi, Jun;  Wen, Yuanbo;  Li, Xiaqing;  Zhao, Yongwei;  Guo, Yuxuan;  Zhou, Enshuai;  Hu, Xing;  Du, Zidong;  Li, Ling;  Chen, Huaping;  Chen, Tianshi;  Guo, Qi
收藏  |  浏览/下载:3/0  |  提交时间:2025/06/25
Optimization  Space exploration  Schedules  Libraries  Biological cells  Deep learning  Costs  Computers  Search problems  Tensors  Code generation  compiler optimization  tensor computation  
IrGEMM: An Input-Aware Tuning Framework for Irregular GEMM on ARM and X86 CPUs 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 卷号: 35, 期号: 9, 页码: 1672-1689
作者:  Wei, Cunyang;  Jia, Haipeng;  Zhang, Yunquan;  Yao, Jianyu;  Li, Chendi;  Cao, Wenxuan
收藏  |  浏览/下载:16/0  |  提交时间:2024/12/06
Kernel  Libraries  Computer architecture  Tuning  Layout  Optimization  Codes  Batch GEMM  code generation  compact GEMM  dynamic programming  TSMM  
An Accurate and Efficient Large-Scale Regression Method Through Best Friend Clustering 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 11, 页码: 3129-3140
作者:  Li, Kun;  Yuan, Liang;  Zhang, Yunquan;  Chen, Gongwei
收藏  |  浏览/下载:61/0  |  提交时间:2022/12/07
Clustering algorithms  Training  Mathematical models  Computational modeling  Libraries  Kernel  Support vector machines  Distributed machine learning  scalable algorithm  large-scale clustering  parallel regression  
A Pattern-Based SpGEMM Library for Multi-Core and Many-Core Architectures 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 1, 页码: 159-175
作者:  Xie, Zhen;  Tan, Guangming;  Liu, Weifeng;  Sun, Ninghui
收藏  |  浏览/下载:74/0  |  提交时间:2021/12/01
Libraries  Sparse matrices  Prediction algorithms  Neural networks  Predictive models  Memory management  Tuners  SpGEMM  spare BLAS  sparse format  auto-tuning  neural network