Institute of Computing Technology, Chinese Academy IR
Computation on sentence semantic distance for novelty detection | |
Zhang, HP; Sun, J; Wang, B; Bai, S | |
2005-05-01 | |
发表期刊 | JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY |
ISSN | 1000-9000 |
卷号 | 20期号:3页码:331-337 |
摘要 | Novelty detection is to retrieve new information and filter redundancy from given sentences that are relevant to a specific topic. In TREC2003, the authors tried an approach to novelty detection with semantic distance computation. The motivation is to expand a sentence by introducing semantic information. Computation on semantic distance between sentences incorporates WordNet with statistical information. The novelty detection is treated as a binary classification problem: new sentence or not. The feature vector, used in the vector space model for classification, consists of various factors, including the semantic distance from the sentence to the topic and the distance from the sentence to the previous relevant context occurring before it. Now sentences are then detected with Winnow and support vector machine classifiers, respectively. Several experiments are conducted to survey the relationship between different factors and performance. It is proved that semantic computation is promising in novelty detection. The ratio of new sentence size to relevant size is further studied given different relevant document sizes. It is found that the ratio reduced with a certain speed (about 0.86). Then another group of experiments is performed supervised with the ratio. It is demonstrated that the ratio is helpful to improve the novelty detection performance. |
关键词 | novelty detection sentence semantic distance categorization |
收录类别 | SCI |
语种 | 英语 |
WOS研究方向 | Computer Science |
WOS类目 | Computer Science, Hardware & Architecture ; Computer Science, Software Engineering |
WOS记录号 | WOS:000229292300005 |
出版者 | SCIENCE PRESS |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://119.78.100.204/handle/2XEOYT63/9949 |
专题 | 中国科学院计算技术研究所期刊论文_英文 |
通讯作者 | Zhang, HP |
作者单位 | 1.Chinese Acad Sci, Inst Comp Technol, Beijing 100080, Peoples R China 2.Chinese Acad Sci, Grad Sch, Beijing 100039, Peoples R China |
推荐引用方式 GB/T 7714 | Zhang, HP,Sun, J,Wang, B,et al. Computation on sentence semantic distance for novelty detection[J]. JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY,2005,20(3):331-337. |
APA | Zhang, HP,Sun, J,Wang, B,&Bai, S.(2005).Computation on sentence semantic distance for novelty detection.JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY,20(3),331-337. |
MLA | Zhang, HP,et al."Computation on sentence semantic distance for novelty detection".JOURNAL OF COMPUTER SCIENCE AND TECHNOLOGY 20.3(2005):331-337. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论