Institute of Computing Technology, Chinese Academy IR
| An Accurate and Efficient Large-Scale Regression Method Through Best Friend Clustering | |
| Li, Kun1,2; Yuan, Liang1; Zhang, Yunquan1; Chen, Gongwei2,3 | |
| 2022-11-01 | |
| 发表期刊 | IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS
![]() |
| ISSN | 1045-9219 |
| 卷号 | 33期号:11页码:3129-3140 |
| 摘要 | As the data size in Machine Learning fields grows exponentially, it is inevitable to accelerate the computation by utilizing the ever-growing large number of available cores provided by high-performance computing hardware. However, existing parallel methods for clustering or regression often suffer from problems of low accuracy, slow convergence, and complex hyperparameter-tuning. Furthermore, the parallel efficiency is usually difficult to improve while striking a balance between preserving model properties and partitioning computing workloads on distributed systems. In this article, we propose a novel and simple data structure capturing the most important information among data samples. It has several advantageous properties supporting a hierarchical clustering strategy that contains well-defined metrics for determining optimal hierarchy, balanced partition for maintaining the clustering property, and efficient parallelization for accelerating computation phases. Then we combine the clustering with regression techniques as a parallel library and utilize a hybrid structure of data and model parallelism to make predictions. Experiments illustrate that our library obtains remarkable performance on convergence, accuracy, and scalability. |
| 关键词 | Clustering algorithms Training Mathematical models Computational modeling Libraries Kernel Support vector machines Distributed machine learning scalable algorithm large-scale clustering parallel regression |
| DOI | 10.1109/TPDS.2021.3134336 |
| 收录类别 | SCI |
| 语种 | 英语 |
| 资助项目 | National Natural Science Foundation of China[61972376] ; National Natural Science Foundation of China[62072431] ; National Natural Science Foundation of China[62032023] ; Science Foundation of Beijing[L182053] |
| WOS研究方向 | Computer Science ; Engineering |
| WOS类目 | Computer Science, Theory & Methods ; Engineering, Electrical & Electronic |
| WOS记录号 | WOS:000800198000038 |
| 出版者 | IEEE COMPUTER SOC |
| 引用统计 | |
| 文献类型 | 期刊论文 |
| 条目标识符 | http://119.78.100.204/handle/2XEOYT63/19583 |
| 专题 | 中国科学院计算技术研究所期刊论文_英文 |
| 通讯作者 | Yuan, Liang |
| 作者单位 | 1.Chinese Acad Sci, Inst Comp Technol, State Key Lab Comp Architecture, Beijing 100190, Peoples R China 2.Univ Chinese Acad Sci UCAS, Sch Comp Sci & Technol, Beijing 100190, Peoples R China 3.Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China |
| 推荐引用方式 GB/T 7714 | Li, Kun,Yuan, Liang,Zhang, Yunquan,et al. An Accurate and Efficient Large-Scale Regression Method Through Best Friend Clustering[J]. IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS,2022,33(11):3129-3140. |
| APA | Li, Kun,Yuan, Liang,Zhang, Yunquan,&Chen, Gongwei.(2022).An Accurate and Efficient Large-Scale Regression Method Through Best Friend Clustering.IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS,33(11),3129-3140. |
| MLA | Li, Kun,et al."An Accurate and Efficient Large-Scale Regression Method Through Best Friend Clustering".IEEE TRANSACTIONS ON PARALLEL AND DISTRIBUTED SYSTEMS 33.11(2022):3129-3140. |
| 条目包含的文件 | 条目无相关文件。 | |||||
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论