CSpace

Browse/search results: 25 items in total, showing items 1-10

OptiFX: Automatic Optimization for Convolutional Neural Networks with Aggressive Operator Fusion on GPUs (journal article)
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, Volume: 22, Issue: 2, Pages: 27
Authors:  Wang, Xueying;  Li, Shigang;  Qian, Hao;  Luo, Fan;  Hao, Zhaoyang;  Wu, Tong;  Xu, Ruiyuan;  Cui, Huimin;  Feng, Xiaobing;  Li, Guangli
Views/Downloads: 4/0  |  Submitted: 2025/12/03
Deep learning systems  convolutional neural networks  operator fusion  
SRSparse: Generating Codes for High-Performance Sparse Matrix-Vector Semiring Computations (journal article)
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2025, Volume: 22, Issue: 2, Pages: 26
Authors:  Du, Zhen;  Li, Ying;  Sun, Ninghui;  Cui, Huimin;  Feng, Xiaobing;  Li, Jiajia
Views/Downloads: 4/0  |  Submitted: 2025/12/03
High performance computing  sparse matrix computation  auto-tuning  code generator  semiring computation  
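Compiler Technology for Emerging Application Paradigms and Advanced Computer Architectures (面向新型应用范式与新型体系结构的编译技术) (journal article)
Acta Aeronautica et Astronautica Sinica (航空学报), 2024, Volume: 45, Issue: 20
Authors:  Li, Guangli;  Du, Zhen;  Zhao, Jiacheng;  Liu, Ying;  Yu, Feng;  Li, Yijin;  Zhang, Zhongcheng;  Cui, Huimin
Views/Downloads: 10/0  |  Submitted: 2025/12/04
emerging application paradigm  advanced computer architecture  system software  programming model  compiler technology  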
OpenCL-accelerated first-principles calculations of all-electron quantum perturbations on HPC resources (vol 11, 1156891, 2023) (journal article)
FRONTIERS IN CHEMISTRY, 2023, Volume: 11, Pages: 1
Authors:  Wu, Zhikun;  Shang, Honghui;  Wu, Yangjun;  Zhang, Zhongcheng;  Liu, Ying;  Zhang, Yuyang;  Ouyang, Yucheng;  Cui, Huimin;  Feng, Xiaobing
Views/Downloads: 47/0  |  Submitted: 2023/12/04
OpenCL  DFPT  GPU  optimization  heterogeneous  
OpenCL-accelerated first-principles calculations of all-electron quantum perturbations on HPC resources (journal article)
FRONTIERS IN CHEMISTRY, 2023, Volume: 11, Pages: 15
Authors:  Wu, Zhikun;  Shang, Honghui;  Wu, Yangjun;  Zhang, Zhongcheng;  Liu, Ying;  Zhang, Yuyang;  Ouyang, Yucheng;  Cui, Huimin;  Feng, Xiaobing
Views/Downloads: 43/0  |  Submitted: 2023/12/04
OpenCL  DFPT  GPU  optimization  heterogeneous  
Scaling Poisson Solvers on Many Cores via MMEwald (journal article)
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2022, Volume: 33, Issue: 8, Pages: 1888-1901
Authors:  Wu, Mingchuan;  Wu, Yangjun;  Shang, Honghui;  Liu, Ying;  Cui, Huimin;  Li, Fang;  Duan, Xiaohui;  Zhang, Yunquan;  Feng, Xiaobing
Views/Downloads: 73/0  |  Submitted: 2022/06/21
Optimization  Bandwidth  Supercomputers  Electric potential  Boundary conditions  Electrostatics  Silicon  Poisson solver  architecture-specific optimizations  many-core processor  
DNNTune: Automatic Benchmarking DNN Models for Mobile-cloud Computing (journal article)
ACM TRANSACTIONS ON ARCHITECTURE AND CODE OPTIMIZATION, 2019, Volume: 16, Issue: 4, Pages: 26
Authors:  Xia, Chunwei;  Zhao, Jiacheng;  Cui, Huimin;  Feng, Xiaobing;  Xue, Jingling
Views/Downloads: 123/0  |  Submitted: 2020/12/10
DNN  mobile-cloud computing  heterogeneous computing  
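A Spark-Based Heterogeneous Memory Programming Framework for Big Data Processing (面向大数据处理的基于Spark的异质内存编程框架) (journal article)
Journal of Computer Research and Development (计算机研究与发展), 2018, Volume: 55, Issue: 2, Pages: 246
Authors:  Wang, Chenxi;  Lv, Fang;  Cui, Huimin;  Cao, Ting;  Zigman, John;  Zhuang, Liangji;  Feng, Xiaobing
Views/Downloads: 45/0  |  Submitted: 2023/12/04
in-memory computing  Spark  heterogeneous memory  non-volatile memory  programming framework  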
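Design of a Model for the Impact of DVFS on Program Performance in Data Centers (数据中心中DVFS对程序性能影响模型的设计) (journal article)
Journal of Software (软件学报), 2017, Volume: 28, Issue: 4, Pages: 845
Authors:  Li, Denghui;  Zhao, Jiacheng;  Cui, Huimin;  Feng, Xiaobing
Views/Downloads: 40/0  |  Submitted: 2023/12/04
DVFS  data center  energy consumption  frequency  performance prediction model  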
Articulation Points Guided Redundancy Elimination for Betweenness Centrality (journal article)
ACM SIGPLAN NOTICES, 2016, Volume: 51, Issue: 8, Pages: 73-86
Authors:  Wang, Lei;  Yang, Fan;  Zhuang, Liangji;  Cui, Huimin;  Lv, Fang;  Feng, Xiaobing
Views/Downloads: 101/0  |  Submitted: 2019/12/12
Algorithms  Performance  Partial Redundancy Elimination  Parallelism  Betweenness Centrality