Institute of Computing Technology, Chinese Academy IR
Adapting centroid classifier for document categorization | |
Tan, Songbo1; Wang, Yuefen2; Wu, Gaowei1 | |
2011-08-01 | |
发表期刊 | EXPERT SYSTEMS WITH APPLICATIONS |
ISSN | 0957-4174 |
卷号 | 38期号:8页码:10264-10273 |
摘要 | In the community of information retrieval, Centroid Classifier has been showed to be a simple and yet effective method for text categorization. However, it is often plagued with model misfit (or inductive bias) incurred by its assumption. Various methods have been proposed to address this issue, such as Weight Adjustment, Voting, Refinement and DragPushing. However, existing methods employ only one criterion, i.e., training-set error. Researches in machine learning indicate that training-set error based method cannot guarantee the generalization capability of base classifiers for unseen examples. To overcome this problem, we propose a novel Model Adjustment algorithm, which makes use of training-set errors as well as training-set margins. Furthermore, we prove that for a linearly separable problem, proposed method converges to the optimal solution after finite updates using any learning parameter eta(eta > 0). The empirical assessment conducted on four benchmark collections indicates that proposed method performs slightly better than SVM classifier in prediction accuracy, as well as beats it in running time. (C) 2011 Elsevier Ltd. All rights reserved. |
关键词 | Centroid classifier Text categorization Information retrieval Data mining |
DOI | 10.1016/j.eswa.2011.02.114 |
收录类别 | SCI |
语种 | 英语 |
资助项目 | [60933005] ; [60803085] |
WOS研究方向 | Computer Science ; Engineering ; Operations Research & Management Science |
WOS类目 | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic ; Operations Research & Management Science |
WOS记录号 | WOS:000290237500138 |
出版者 | PERGAMON-ELSEVIER SCIENCE LTD |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://119.78.100.204/handle/2XEOYT63/12616 |
专题 | 中国科学院计算技术研究所期刊论文_英文 |
通讯作者 | Tan, Songbo |
作者单位 | 1.Chinese Acad Sci, Inst Comp Technol, Key Lab Network, Beijing 100190, Peoples R China 2.Chinese Acad Geol Sci, Informat Ctr, Beijing, Peoples R China |
推荐引用方式 GB/T 7714 | Tan, Songbo,Wang, Yuefen,Wu, Gaowei. Adapting centroid classifier for document categorization[J]. EXPERT SYSTEMS WITH APPLICATIONS,2011,38(8):10264-10273. |
APA | Tan, Songbo,Wang, Yuefen,&Wu, Gaowei.(2011).Adapting centroid classifier for document categorization.EXPERT SYSTEMS WITH APPLICATIONS,38(8),10264-10273. |
MLA | Tan, Songbo,et al."Adapting centroid classifier for document categorization".EXPERT SYSTEMS WITH APPLICATIONS 38.8(2011):10264-10273. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论