Institute of Computing Technology, Chinese Academy IR
Cross-Modal Knowledge Adaptation for Language-Based Person Search | |
Chen, Yucheng1,2,3; Huang, Rui4; Chang, Hong1,2,3; Tan, Chuanqi5; Xue, Tao5; Ma, Bingpeng3 | |
2021 | |
发表期刊 | IEEE TRANSACTIONS ON IMAGE PROCESSING |
ISSN | 1057-7149 |
卷号 | 30页码:4057-4069 |
摘要 | In this paper, we present a method named Cross-Modal Knowledge Adaptation (CMKA) for language-based person search. We argue that the image and text information are not equally important in determining a person's identity. In other words, image carries image-specific information such as lighting condition and background, while text contains more modal agnostic information that is more beneficial to cross-modal matching. Based on this consideration, we propose CMKA to adapt the knowledge of image to the knowledge of text. Specially, text-to-image guidance is obtained at different levels: individuals, lists, and classes. By combining these levels of knowledge adaptation, the image-specific information is suppressed, and the common space of image and text is better constructed. We conduct experiments on the CUHK-PEDES dataset. The experimental results show that the proposed CMKA outperforms the state-of-the-art methods. |
关键词 | Feature extraction Task analysis Lighting Learning systems Logic gates Knowledge engineering Training Language-based person search cross-modal knowledge adaptation image-specific information |
DOI | 10.1109/TIP.2021.3068825 |
收录类别 | SCI |
语种 | 英语 |
资助项目 | Natural Science Foundation of China (NSFC)[61876171] ; Natural Science Foundation of China (NSFC)[61976203] ; Open Project Fund from Shenzhen Institute of Artificial Intelligence and Robotics for Society[AC01202005015] ; Open Project Fund from Shenzhen Institute of Artificial Intelligence and Robotics for Society[2019-INT006] |
WOS研究方向 | Computer Science ; Engineering |
WOS类目 | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS记录号 | WOS:000638400000007 |
出版者 | IEEE-INST ELECTRICAL ELECTRONICS ENGINEERS INC |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://119.78.100.204/handle/2XEOYT63/16636 |
专题 | 中国科学院计算技术研究所期刊论文_英文 |
通讯作者 | Ma, Bingpeng |
作者单位 | 1.Chinese Acad Sci, Key Lab Intelligent Informat Proc, Beijing, Peoples R China 2.Chinese Acad Sci, Inst Comp Technol, Beijing 100190, Peoples R China 3.Univ Chinese Acad Sci, Sch Comp Sci & Technol, Beijing 100049, Peoples R China 4.Chinese Univ Hong Kong, Shenzhen Inst Artificial Intelligence & Robot Soc, Shenzhen 518172, Peoples R China 5.Tencent, Beijing 100193, Peoples R China |
推荐引用方式 GB/T 7714 | Chen, Yucheng,Huang, Rui,Chang, Hong,et al. Cross-Modal Knowledge Adaptation for Language-Based Person Search[J]. IEEE TRANSACTIONS ON IMAGE PROCESSING,2021,30:4057-4069. |
APA | Chen, Yucheng,Huang, Rui,Chang, Hong,Tan, Chuanqi,Xue, Tao,&Ma, Bingpeng.(2021).Cross-Modal Knowledge Adaptation for Language-Based Person Search.IEEE TRANSACTIONS ON IMAGE PROCESSING,30,4057-4069. |
MLA | Chen, Yucheng,et al."Cross-Modal Knowledge Adaptation for Language-Based Person Search".IEEE TRANSACTIONS ON IMAGE PROCESSING 30(2021):4057-4069. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论