CSpace  > 中国科学院计算技术研究所期刊论文  > 英文
Building descriptive and discriminative visual codebook for large-scale image applications
Tian, Qi1; Zhang, Shiliang2; Zhou, Wengang3; Ji, Rongrong4; Ni, Bingbing5; Sebe, Nicu6
2011
发表期刊MULTIMEDIA TOOLS AND APPLICATIONS
ISSN1380-7501
卷号51期号:2页码:441-477
摘要Inspired by the success of textual words in large-scale textual information processing, researchers are trying to extract visual words from images which function similar as textual words. Visual words are commonly generated by clustering a large amount of image local features and the cluster centers are taken as visual words. This approach is simple and scalable, but results in noisy visual words. Lots of works are reported trying to improve the descriptive and discriminative ability of visual words. This paper gives a comprehensive survey on visual vocabulary and details several state-of-the-art algorithms. A comprehensive review and summarization of the related works on visual vocabulary is first presented. Then, we introduce our recent algorithms on descriptive and discriminative visual word generation, i.e., latent visual context analysis for descriptive visual word identification [74], descriptive visual words and visual phrases generation [68], contextual visual vocabulary which combines both semantic contexts and spatial contexts [69], and visual vocabulary hierarchy optimization [18]. Additionally, we introduce two interesting post processing strategies to further improve the performance of visual vocabulary, i.e., spatial coding [73] is proposed to efficiently remove the mismatched visual words between images for more reasonable image similarity computation; user preference based visual word weighting [44] is developed to make the image similarity computed based on visual words more consistent with users' preferences or habits.
关键词Visual vocabulary Large-scale image retrieval Image search re-ranking Feature space quantization
DOI10.1007/s11042-010-0636-6
收录类别SCI
语种英语
资助项目NSF[IIS 1052851] ; Akiira Media Systems, Inc. ; FIRB SPATTERN project ; FP7 IP GLOCAL European project
WOS研究方向Computer Science ; Engineering
WOS类目Computer Science, Information Systems ; Computer Science, Software Engineering ; Computer Science, Theory & Methods ; Engineering, Electrical & Electronic
WOS记录号WOS:000286472300003
出版者SPRINGER
引用统计
被引频次:13[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://119.78.100.204/handle/2XEOYT63/12540
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Tian, Qi
作者单位1.Univ Texas San Antonio, Dept Comp Sci, San Antonio, TX 78249 USA
2.Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
3.Univ Sci & Technol China, EEIS Dept, Hefei 230027, Peoples R China
4.Harbin Inst Technol, Harbin 150001, Heilongjiang, Peoples R China
5.Natl Univ Singapore, Singapore 117576, Singapore
6.Univ Trent, Dept Informat Engn & Comp Sci, I-38100 Trento, Italy
推荐引用方式
GB/T 7714
Tian, Qi,Zhang, Shiliang,Zhou, Wengang,et al. Building descriptive and discriminative visual codebook for large-scale image applications[J]. MULTIMEDIA TOOLS AND APPLICATIONS,2011,51(2):441-477.
APA Tian, Qi,Zhang, Shiliang,Zhou, Wengang,Ji, Rongrong,Ni, Bingbing,&Sebe, Nicu.(2011).Building descriptive and discriminative visual codebook for large-scale image applications.MULTIMEDIA TOOLS AND APPLICATIONS,51(2),441-477.
MLA Tian, Qi,et al."Building descriptive and discriminative visual codebook for large-scale image applications".MULTIMEDIA TOOLS AND APPLICATIONS 51.2(2011):441-477.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Tian, Qi]的文章
[Zhang, Shiliang]的文章
[Zhou, Wengang]的文章
百度学术
百度学术中相似的文章
[Tian, Qi]的文章
[Zhang, Shiliang]的文章
[Zhou, Wengang]的文章
必应学术
必应学术中相似的文章
[Tian, Qi]的文章
[Zhang, Shiliang]的文章
[Zhou, Wengang]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。