Institute of Computing Technology, Chinese Academy IR
Using DragPushing to refine concept index for text categorization
Cheng, Xueqi; Tan, Songbo; Tang, Lilian
2006-07-01
Journal | JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY |
ISSN | 1000-9000 |
Volume | 21, Issue: 4, Pages: 592-596 |
Abstract | Concept index (CI) is a very fast and efficient feature extraction (FE) algorithm for text classification. The key idea of the CI scheme is to express each document as a function of the various concepts (centroids) present in the collection. However, the ability of the centroids to represent the categories of a corpus is often impaired by so-called model misfit, which can arise from a number of factors in the FE process, ranging from feature selection to the similarity measure. To address this issue, this work employs the "DragPushing" strategy to refine the centroids used for the concept index. We present an extensive experimental evaluation of the refined concept index (RCI) on two English collections and one Chinese corpus using a state-of-the-art Support Vector Machine (SVM) classifier. The results indicate that, in each case, the RCI-based SVM yields much better performance than the normal CI-based SVM and incurs lower computation cost during the training and classification phases. |
Keywords | text classification; information retrieval; machine learning |
Indexed by | SCI |
Language | English |
WOS Research Area | Computer Science |
WOS Category | Computer Science, Hardware & Architecture; Computer Science, Software Engineering |
WOS Accession Number | WOS:000239255200017 |
Publisher | SCIENCE CHINA PRESS |
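Document Type | Journal article |
Item Identifier | http://119.78.100.204/handle/2XEOYT63/10445 |
Collection | Institute of Computing Technology, Chinese Academy of Sciences: Journal Articles (English) |
Corresponding Author | Cheng, Xueqi |
Author Affiliations | 1. Chinese Acad Sci, Inst Comp Technol, Div Intelligent Software Syst, Beijing 100080, Peoples R China 2. Univ Surrey, Dept Comp, Surrey, England |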
Recommended Citation GB/T 7714 | Cheng, Xueqi, Tan, Songbo, Tang, Lilian. Using DragPushing to refine concept index for text categorization[J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 2006, 21(4): 592-596. |
APA | Cheng, Xueqi, Tan, Songbo, & Tang, Lilian. (2006). Using DragPushing to refine concept index for text categorization. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY, 21(4), 592-596. |
MLA | Cheng, Xueqi, et al. "Using DragPushing to refine concept index for text categorization". JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY 21.4 (2006): 592-596. |
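The abstract describes two steps: refining class centroids by dragging and pushing them, then expressing each document through its similarity to those centroids. The record gives no implementation details, so the following is only a rough sketch under stated assumptions: documents are dense term-weight vectors, similarity is cosine, and the refinement is a simple online update in which a misclassified training document is added to its own class centroid and subtracted from the wrongly predicted one. The function names and the `rate` and `rounds` parameters are hypothetical and are not taken from the paper.

```python
import math

def dot(u, v):
    return sum(a * b for a, b in zip(u, v))

def normalize(v):
    norm = math.sqrt(dot(v, v))
    return [x / norm for x in v] if norm > 0 else list(v)

def class_centroids(docs, labels):
    """Mean vector of each class; these play the role of the 'concepts'."""
    sums, counts = {}, {}
    for d, y in zip(docs, labels):
        acc = sums.setdefault(y, [0.0] * len(d))
        for i, x in enumerate(d):
            acc[i] += x
        counts[y] = counts.get(y, 0) + 1
    return {y: [x / counts[y] for x in acc] for y, acc in sums.items()}

def nearest(doc, centroids):
    """Label of the centroid with the highest cosine similarity to doc."""
    d = normalize(doc)
    return max(sorted(centroids), key=lambda y: dot(d, normalize(centroids[y])))

def drag_push(docs, labels, centroids, rate=0.5, rounds=5):
    """Assumed online variant: on each training error, drag the true-class
    centroid toward the document and push the predicted one away from it."""
    cents = {y: list(c) for y, c in centroids.items()}
    for _ in range(rounds):
        for d, y in zip(docs, labels):
            pred = nearest(d, cents)
            if pred != y:
                for i, x in enumerate(d):
                    cents[y][i] += rate * x     # drag
                    cents[pred][i] -= rate * x  # push
    return cents

def concept_index(doc, centroids):
    """Represent doc by its cosine similarity to each centroid (sorted by label)."""
    d = normalize(doc)
    return [dot(d, normalize(centroids[y])) for y in sorted(centroids)]

# Toy data: the fourth document belongs to class "a" but lies nearer to "b".
docs = [[1, 0], [1, 0], [1, 0], [0.45, 0.55], [0, 1]]
labels = ["a", "a", "a", "a", "b"]
initial = class_centroids(docs, labels)
refined = drag_push(docs, labels, initial)
features = concept_index([0.45, 0.55], refined)
```

In this sketch the reduced vector returned by `concept_index` has one entry per class, and it is this low-dimensional representation that would then be passed to a classifier such as an SVM.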