Institute of Computing Technology, Chinese Academy IR
Hierarchical compositional representations for few-shot action recognition | |
Li, Changzhen1,2,3; Zhang, Jie1,2; Wu, Shuzhe4; Jin, Xin4; Shan, Shiguang1,2,3 | |
2024-03-01 | |
发表期刊 | COMPUTER VISION AND IMAGE UNDERSTANDING |
ISSN | 1077-3142 |
卷号 | 240页码:11 |
摘要 | Recently action recognition has received more and more attention for its comprehensive and practical applications in intelligent surveillance and human-computer interaction. However, few-shot action recognition has not been well explored and remains challenging because of data scarcity. In this paper, we propose a novel hierarchical compositional representations (HCR) learning approach for few-shot action recognition. Specifically, we divide a complicated action into several sub-actions by carefully designed hierarchical clustering and further decompose the sub-actions into more fine-grained spatially attentional sub-actions (SASactions). Although there exist large differences between base classes and novel classes, they can share similar patterns in sub-actions or SAS-actions. Furthermore, we adopt the Earth Mover's Distance in the transportation problem to measure the similarity between video samples in terms of sub-action representations. It computes the optimal matching flows between sub-actions as distance metric, which is favorable for comparing finegrained patterns. Extensive experiments show our method achieves the state-of-the-art results on HMDB51, UCF101 and Kinetics datasets. |
关键词 | Action recognition Few-shot learning Hierarchical compositional representations Body parts EMD distance |
DOI | 10.1016/j.cviu.2023.103911 |
收录类别 | SCI |
语种 | 英语 |
资助项目 | National Key R&D Program of China[2021YFC3310100] ; National Natural Science Foundation of China[62176251] ; Youth Innovation Promotion Association CAS |
WOS研究方向 | Computer Science ; Engineering |
WOS类目 | Computer Science, Artificial Intelligence ; Engineering, Electrical & Electronic |
WOS记录号 | WOS:001166488900001 |
出版者 | ACADEMIC PRESS INC ELSEVIER SCIENCE |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://119.78.100.204/handle/2XEOYT63/38843 |
专题 | 中国科学院计算技术研究所期刊论文_英文 |
通讯作者 | Zhang, Jie |
作者单位 | 1.Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China 2.Univ Chinese Acad Sci, Beijing 100049, Peoples R China 3.UCAS, Hangzhou Inst Adv Study, Sch Intelligent Sci & Technol, Hangzhou, Peoples R China 4.Beijing Huawei Cloud Comp Technol Co Ltd, 3 Xinxi Rd, Beijing 100095, Peoples R China |
推荐引用方式 GB/T 7714 | Li, Changzhen,Zhang, Jie,Wu, Shuzhe,et al. Hierarchical compositional representations for few-shot action recognition[J]. COMPUTER VISION AND IMAGE UNDERSTANDING,2024,240:11. |
APA | Li, Changzhen,Zhang, Jie,Wu, Shuzhe,Jin, Xin,&Shan, Shiguang.(2024).Hierarchical compositional representations for few-shot action recognition.COMPUTER VISION AND IMAGE UNDERSTANDING,240,11. |
MLA | Li, Changzhen,et al."Hierarchical compositional representations for few-shot action recognition".COMPUTER VISION AND IMAGE UNDERSTANDING 240(2024):11. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论