CSpace

浏览/检索结果: 共196条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Toward Network-Aware Query Execution Systems in Large Datacenters 期刊论文
IEEE TRANSACTIONS ON NETWORK AND SERVICE MANAGEMENT, 2023, 卷号: 20, 期号: 4, 页码: 4494-4504
作者:  Cheng, Long;  Wang, Ying;  Jhaveri, Rutvij H.;  Wang, Qingle;  Mao, Ying
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Query data operator  coflow scheduling  network communication  performance optimizations  datacenters  
CoAxNN: Optimizing on-device deep learning with conditional approximate neural networks 期刊论文
JOURNAL OF SYSTEMS ARCHITECTURE, 2023, 卷号: 143, 页码: 14
作者:  Li, Guangli;  Ma, Xiu;  Yu, Qiuchu;  Liu, Lei;  Liu, Huaxiao;  Wang, Xueying
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
On-device deep learning  Efficient neural networks  Model approximation and optimization  
Predicting for I/O stack optimizations on cyber-physical systems 期刊论文
MICROPROCESSORS AND MICROSYSTEMS, 2023, 卷号: 101, 页码: 10
作者:  Zhang, Yangmei;  Shen, Fanfan;  Li, Mengquan;  Wu, Chao
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
I  O stack  Cyber-physical system  O idle time  Prediction  
ESA: An efficient sequence alignment algorithm for biological database search on Sunway TaihuLight 期刊论文
PARALLEL COMPUTING, 2023, 卷号: 117, 页码: 11
作者:  Zhang, Hao;  Huang, Zhiyi;  Chen, Yawen;  Liang, Jianguo;  Gao, Xiran
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Hybrid sequence alignment  Biological database search  Sunway TaihuLight  SW26010  Heterogeneous architecture  
VTensor: Using Virtual Tensors to Build a Layout-Oblivious AI Programming Framework 期刊论文
JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2023, 卷号: 38, 期号: 5, 页码: 1074-1097
作者:  Yu, Feng;  Zhao, Jia-Cheng;  Cui, Hui-Min;  Feng, Xiao-Bing;  Xue, Jingling
收藏  |  浏览/下载:3/0  |  提交时间:2024/05/20
artificial intelligence (AI) programming  layout-oblivious  tensor processing  
Characterizing and Understanding Defense Methods for GNNs on GPUs 期刊论文
IEEE COMPUTER ARCHITECTURE LETTERS, 2023, 卷号: 22, 期号: 2, 页码: 137-140
作者:  Wu, Meng;  Yan, Mingyu;  Yang, Xiaocheng;  Li, Wenming;  Zhang, Zhimin;  Ye, Xiaochun;  Fan, Dongrui
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Kernel  Purification  Estimation  Graphics processing units  Perturbation methods  Electric breakdown  Training  Graph neural networks  defense  execution semantic  execution pattern  overhead  
Enabling In-Network Floating-Point Arithmetic for Efficient Computation Offloading 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 12, 页码: 4918-4934
作者:  Cui, Penglai;  Pan, Heng;  Li, Zhenyu;  Zhang, Penghao;  Miao, Tianhao;  Zhou, Jianer;  Guan, Hongtao;  Xie, Gaogang
收藏  |  浏览/下载:14/0  |  提交时间:2023/07/12
Open area test sites  Arithmetic  Memory management  Task analysis  Training  Standards  Servers  In-network computation  computation offloading  floating-point operation  
An Efficient Deep Learning Accelerator Architecture for Compressed Video Analysis 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 卷号: 41, 期号: 9, 页码: 2808-2820
作者:  Wang, Yongchen;  Wang, Ying;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:30/0  |  提交时间:2022/12/07
Streaming media  Neural networks  Image coding  Decoding  Metadata  Deep learning  Hardware  Neural network acceleration  specialized accelerator  video analysis  
Scaling Poisson Solvers on Many Cores via MMEwald 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, 卷号: 33, 期号: 8, 页码: 1888-1901
作者:  Wu, Mingchuan;  Wu, Yangjun;  Shang, Honghui;  Liu, Ying;  Cui, Huimin;  Li, Fang;  Duan, Xiaohui;  Zhang, Yunquan;  Feng, Xiaobing
收藏  |  浏览/下载:36/0  |  提交时间:2022/06/21
Optimization  Bandwidth  Supercomputers  Electric potential  Boundary conditions  Electrostatics  Silicon  Poisson solver  architecture-specific optimizations  many-core processor  
Taming Process Variations in CNFET for Efficient Last-Level Cache Design 期刊论文
IEEE TRANSACTIONS ON VERY LARGE SCALE INTEGRATION (VLSI) SYSTEMS, 2022, 卷号: 30, 期号: 4, 页码: 418-431
作者:  Xu, Dawen;  Feng, Zhuangyu;  Liu, Cheng;  Li, Li;  Wang, Ying;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:25/0  |  提交时间:2022/12/07
CNTFETs  Delays  Transistors  Layout  Very large scale integration  Radio frequency  Energy consumption  nanotube field-effect transistor (CNFET)  last-level cache (LLC)  process variation (PV)  variation-aware cache