Cross-modal semantic correlation learning by Bi-CNN network
Wang, Chaoyi [1]; Li, Liang [2]; Yan, Chenggang [1]; Wang, Zhan [3]; Sun, Yaoqi [1]; Zhang, Jiyong [1]
2021-03-18
Journal: IET IMAGE PROCESSING
ISSN: 1751-9659
Pages: 11
Abstract: Cross-modal retrieval retrieves images from a text query and vice versa, and it has attracted extensive attention in recent years. Most existing cross-modal retrieval methods aim to find a common subspace and maximize the correlation between modalities. To generate representations tailored to cross-modal tasks, this paper proposes a novel cross-modal retrieval framework that integrates feature learning and latent space embedding. Specifically, we propose a deep CNN and a shallow CNN to extract sample features. The deep CNN extracts image representations, and the shallow CNN uses a multi-dimensional kernel to extract multi-level semantic representations of text. Meanwhile, we enhance the semantic manifold by constructing cross-modal ranking and within-modal discriminant losses, which improve the separation of semantic representations. Moreover, an online sampling strategy selects the most representative samples, so that the approach can be applied to large-scale data. The approach not only increases the discriminative ability among different categories but also maximizes the correlation between different modalities. Experiments on three real-world datasets show that the proposed method outperforms popular methods.
DOI: 10.1049/ipr2.12176
Indexed by: SCI
Language: English
WOS research areas: Computer Science; Engineering; Imaging Science & Photographic Technology
WOS categories: Computer Science, Artificial Intelligence; Engineering, Electrical & Electronic; Imaging Science & Photographic Technology
WOS accession number: WOS:000630032600001
Publisher: WILEY
Times cited: 4 [WOS]
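Document type: Journal article
Identifier: http://119.78.100.204/handle/2XEOYT63/16808
Collection: 中国科学院计算技术研究所期刊论文_英文 (Institute of Computing Technology, Chinese Academy of Sciences, journal articles, English)
Corresponding author: Zhang, Jiyong
Affiliations:
1. Hangzhou Dianzi Univ, Hangzhou, Peoples R China
2. Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China
3. RTInvent Technol Co Ltd, Beijing, Peoples R China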
Recommended citation
GB/T 7714
Wang, Chaoyi, Li, Liang, Yan, Chenggang, et al. Cross-modal semantic correlation learning by Bi-CNN network[J]. IET IMAGE PROCESSING, 2021: 11.
APA Wang, Chaoyi, Li, Liang, Yan, Chenggang, Wang, Zhan, Sun, Yaoqi, & Zhang, Jiyong. (2021). Cross-modal semantic correlation learning by Bi-CNN network. IET IMAGE PROCESSING, 11.
MLA Wang, Chaoyi, et al. "Cross-modal semantic correlation learning by Bi-CNN network". IET IMAGE PROCESSING (2021): 11.