CSpace

浏览/检索结果: 共71条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Computational Burst Buffers: Accelerating HPC I/O via In-Storage Compression Offloading 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2026, 卷号: 37, 期号: 2, 页码: 518-532
作者:  Chen, Xiang;  Lu, Bing;  Long, Haoquan;  Luo, Huizhang;  Ma, Yili;  Tan, Guangming;  Tao, Dingwen;  Wu, Fei;  Lu, Tao
收藏  |  浏览/下载:1/0  |  提交时间:2026/05/25
Hardware  Computer architecture  File systems  Nonvolatile memory  Bandwidth  Engines  Prototypes  Data compression  Software  Flash memories  high performance computing  solid state drives  
CGA: Accelerating BFS Through an Sparsity-Aware Adaptive Framework on Heterogeneous Platforms 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2026, 卷号: 37, 期号: 1, 页码: 45-59
作者:  Xu, Lei;  Jia, Haipeng;  Zhang, Yunquan
收藏  |  浏览/下载:2/0  |  提交时间:2026/05/25
Graphics processing units  Kernel  Vectors  Sparse matrices  Optimization  Machine learning algorithms  Throughput  Adaptation models  Multicore processing  Data transfer  Sparse matrix-sparse vector multiplication (SpMSpV)  sparse matrix-dense vector multiplication (SpMV)  multi-core CPU  GPU  adaptive performance optimization  machine learning  
RHINO: An Efficient Serverless Container System for Small-Scale HPC Applications 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2025, 卷号: 36, 期号: 8, 页码: 1560-1573
作者:  Zhu, He;  Li, Mingyu;  You, Haihang
收藏  |  浏览/下载:24/0  |  提交时间:2025/12/03
Runtime  Codes  Computational modeling  Costs  Containers  Adaptation models  Parallel processing  Training  Schedules  Scalability  Serverless computing  HPC system  runtime optimization  execution model  
Fast and Scalable Neural Network Quantum States Method for Molecular Potential Energy Surfaces 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2025, 卷号: 36, 期号: 7, 页码: 1431-1443
作者:  Wu, Yangjun;  Cao, Wanlu;  Zhao, Jiacheng;  Shang, Honghui
收藏  |  浏览/下载:22/0  |  提交时间:2025/12/03
Artificial neural networks  Computational efficiency  Training  Wave functions  Quantum state  Computational modeling  Optimization  Electrons  Convergence  Potential energy  Quantum computational chemistry  many-body Schr & ouml  neural network quantum state  transformer based architecture  autoregressive sampling  potential energy surfaces  dinger equation  
DFU-E: A Dataflow Architecture for Edge DSP and AI Applications 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2025, 卷号: 36, 期号: 6, 页码: 1100-1114
作者:  Li, Wenming;  Fan, Zhihua;  Liu, Tianyu;  Wang, Zhen;  Wu, Haibin;  Wu, Meng;  Zhang, Kunming;  Liu, Yanhuan;  Sun, Ninghui;  Ye, Xiaochun;  Fan, Dongrui
收藏  |  浏览/下载:67/0  |  提交时间:2025/06/25
Artificial intelligence  Hardware  Edge computing  Computer architecture  Computational modeling  Single instruction multiple data  Energy efficiency  Target recognition  Radar polarimetry  Real-time systems  Dataflow architecture  edge computing  digital signal processing  AI  multi-layer dataflow mechanism  
Mitosis: A Scalable Sharding System Featuring Multiple Dynamic Relay Chains 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 卷号: 35, 期号: 12, 页码: 2497-2512
作者:  Wang, Keyuan;  Jia, Linpeng;  Song, Zhaoxiong;  Sun, Yi
收藏  |  浏览/下载:58/0  |  提交时间:2024/12/06
Blockchain  sharding  relay chain  relay chain  scalability  scalability  scalability  
BIRD plus : Design of a Lightweight Communication Compressor for Resource-Constrained Distribution Learning Platforms 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 卷号: 35, 期号: 11, 页码: 2193-2207
作者:  Wu, Donglei;  Yang, Weihao;  Zou, Xiangyu;  Feng, Hao;  Tao, Dingwen;  Li, Shiyi;  Xia, Wen;  Fang, Binxing
收藏  |  浏览/下载:88/0  |  提交时间:2024/12/06
Indexes  Costs  Computational modeling  Distance learning  Computer aided instruction  Training  Tensors  Distributed learning  communication compression  random sampling  neural network  
ElasticBatch: A Learning-Augmented Elastic Scheduling System for Batch Inference on MIG 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 卷号: 35, 期号: 10, 页码: 1708-1720
作者:  Qi, Jiaxing;  Xiao, Wencong;  Li, Mingzhen;  Yang, Chaojie;  Li, Yong;  Lin, Wei;  Yang, Hailong;  Luan, Zhongzhi;  Qian, Depei
收藏  |  浏览/下载:52/0  |  提交时间:2024/12/06
Graphics processing units  Dynamic scheduling  Throughput  Processor scheduling  Pipelines  Costs  Quality of service  MIG  batch inference  scheduling system  machine learning  
IrGEMM: An Input-Aware Tuning Framework for Irregular GEMM on ARM and X86 CPUs 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 卷号: 35, 期号: 9, 页码: 1672-1689
作者:  Wei, Cunyang;  Jia, Haipeng;  Zhang, Yunquan;  Yao, Jianyu;  Li, Chendi;  Cao, Wenxuan
收藏  |  浏览/下载:54/0  |  提交时间:2024/12/06
Kernel  Libraries  Computer architecture  Tuning  Layout  Optimization  Codes  Batch GEMM  code generation  compact GEMM  dynamic programming  TSMM  
FastTuning: Enabling Fast and Efficient Hyper-Parameter Tuning With Partitioning and Parallelism of Search Space 期刊论文
IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS, 2024, 卷号: 35, 期号: 7, 页码: 1174-1188
作者:  Li, Xiaqing;  Guo, Qi;  Zhang, Guangyan;  Ye, Siwei;  He, Guanhua;  Yao, Yiheng;  Zhang, Rui;  Hao, Yifan;  Du, Zidong;  Zheng, Weimin
收藏  |  浏览/下载:80/0  |  提交时间:2024/12/06
Deep learning  distributed hyper-parameter tuning (HPT) system  parallel computing