CSpace

浏览/检索结果: 共3条,第1-3条 帮助

限定条件        
已选(0)清除 条数/页:   排序方式:
Semantic and Relation Modulation for Audio-Visual Event Localization 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 6, 页码: 7711-7725
作者:  Wang, Hao;  Zha, Zheng-Jun;  Li, Liang;  Chen, Xuejin;  Luo, Jiebo
收藏  |  浏览/下载:8/0  |  提交时间:2023/12/04
Visualization  Location awareness  Correlation  Proposals  Semantics  Task analysis  Modulation  Audio-visual learning  event localization  normalization  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 3565-3577
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Gao, Shengxiang;  Yan, Chenggang;  Zha, Zheng-Jun;  Yu, Zhengtao;  Huang, Qingming
收藏  |  浏览/下载:20/0  |  提交时间:2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer  
Long Short-Term Relation Transformer With Global Gating for Video Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 2726-2738
作者:  Li, Liang;  Gao, Xingyu;  Deng, Jincan;  Tu, Yunbin;  Zha, Zheng-Jun;  Huang, Qingming
收藏  |  浏览/下载:23/0  |  提交时间:2022/12/07
Transformers  Cognition  Visualization  Feature extraction  Decoding  Task analysis  Semantics  Video captioning  relational reasoning  long short-term graph  transformer