Institute of Computing Technology, Chinese Academy of Sciences IR
A Hybrid Framework for Semantic Relation Extraction over Enterprise Data | |
Shen, Wei1; Wang, Jianyong2,3; Luo, Ping4; Wang, Min5 | |
2015-07-01 | |
Journal | INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS |
ISSN | 1552-6283 |
Volume | 11, Issue: 3, Pages: 1-24 |
Abstract | Relation extraction from Web data has attracted considerable attention in recent years. However, little work has addressed relation extraction from enterprise data, despite the urgent need for it in real applications (e.g., E-discovery). One distinct characteristic of enterprise data, in comparison with Web data, is its low redundancy. Previous work on relation extraction from Web data relies largely on the data's high redundancy and thus cannot be applied effectively to enterprise data. This paper proposes an unsupervised hybrid framework called REACTOR. REACTOR combines a statistical method, classification, and clustering to automatically identify various types of relations among entities appearing in enterprise data. Furthermore, the authors explore applying pronominal anaphora resolution to extract more relations expressed across multiple sentences. They evaluate REACTOR over a real-world enterprise data set from HP that contains over three million pages, and the experimental results show the effectiveness of REACTOR. |
Keywords | Anaphora Resolution; Enterprise Data; Information Extraction; Relation Extraction; Relation Tagging |
DOI | 10.4018/IJSWIS.2015070101 |
Indexed By | SCI |
Language | English |
Funding | National Basic Research Program of China (973 Program)[2014CB340505] ; National Natural Science Foundation of China[61532010] ; National Natural Science Foundation of China[61272088] ; National Natural Science Foundation of China[61502253] ; National Natural Science Foundation of China[61473274] ; Tsinghua University Initiative Scientific Research Program ; National High-tech R&D Program of China (863 Program)[2014AA015105] |
WOS Research Area | Computer Science |
WOS Category | Computer Science, Artificial Intelligence ; Computer Science, Information Systems |
WOS Accession Number | WOS:000370264200001 |
Publisher | IGI PUBL |
Document Type | Journal article |
Item Identifier | http://119.78.100.204/handle/2XEOYT63/8765 |
Collection | Institute of Computing Technology, Chinese Academy of Sciences: Journal Articles (English) |
Corresponding Author | Shen, Wei |
Author Affiliations | 1. Nankai Univ, CCCE & CS, Tianjin 300071, Peoples R China; 2. Tsinghua Univ, Dept Comp Sci & Technol, Beijing 100084, Peoples R China; 3. Jiangsu Normal Univ, Jiangsu Collaborat Innovat Ctr Language Abil, Xuzhou, Peoples R China; 4. Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China; 5. Visa Inc, Foster City, CA USA |
Recommended Citation (GB/T 7714) | Shen, Wei,Wang, Jianyong,Luo, Ping,et al. A Hybrid Framework for Semantic Relation Extraction over Enterprise Data[J]. INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS,2015,11(3):1-24. |
APA | Shen, Wei,Wang, Jianyong,Luo, Ping,&Wang, Min.(2015).A Hybrid Framework for Semantic Relation Extraction over Enterprise Data.INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS,11(3),1-24. |
MLA | Shen, Wei,et al."A Hybrid Framework for Semantic Relation Extraction over Enterprise Data".INTERNATIONAL JOURNAL ON SEMANTIC WEB AND INFORMATION SYSTEMS 11.3(2015):1-24. |
Files in This Item | There are no files associated with this item. |
Unless otherwise stated, all content in this system is protected by copyright, with all rights reserved.