CSpace

Browse/Search results: 9 items in total, showing items 1-9

Learning Hierarchical Modular Networks for Video Captioning (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, Volume: 46, Issue: 2, Pages: 1049-1064
Authors: Li, Guorong; Ye, Hanhua; Qi, Yuankai; Wang, Shuhui; Qing, Laiyun; Huang, Qingming; Yang, Ming-Hsuan
Views/Downloads: 10/0 | Submitted: 2024/05/20
Video captioning  hierarchical modular network  scene-graph reward  reinforcement learning  
CLIP4Clip: An empirical study of CLIP for end to end video clip retrieval and captioning (Journal article)
NEUROCOMPUTING, 2022, Volume: 508, Pages: 293-304
Authors: Luo, Huaishao; Ji, Lei; Zhong, Ming; Chen, Yang; Lei, Wen; Duan, Nan; Li, Tianrui
Views/Downloads: 47/0 | Submitted: 2022/12/07
Video retrieval  Video captioning  CLIP  
Multimodal graph neural network for video procedural captioning (Journal article)
NEUROCOMPUTING, 2022, Volume: 488, Pages: 88-96
Authors: Ji, Lei; Tu, Rongcheng; Lin, Kevin; Wang, Lijuan; Duan, Nan
Views/Downloads: 24/0 | Submitted: 2022/12/07
Multimodal video captioning  Graph neural network  
Syntax-Guided Hierarchical Attention Network for Video Captioning (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, Volume: 32, Issue: 2, Pages: 880-892
Authors: Deng, Jincan; Li, Liang; Zhang, Beichen; Wang, Shuhui; Zha, Zhengjun; Huang, Qingming
Views/Downloads: 27/0 | Submitted: 2022/12/07
Syntactics  Feature extraction  Visualization  Generators  Semantics  Two dimensional displays  Three-dimensional displays  Video captioning  syntax attention  content attention  global sentence-context  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 3565-3577
Authors: Tu, Yunbin; Li, Liang; Su, Li; Gao, Shengxiang; Yan, Chenggang; Zha, Zheng-Jun; Yu, Zhengtao; Huang, Qingming
Views/Downloads: 28/0 | Submitted: 2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer  
Long Short-Term Relation Transformer With Global Gating for Video Captioning (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 2726-2738
Authors: Li, Liang; Gao, Xingyu; Deng, Jincan; Tu, Yunbin; Zha, Zheng-Jun; Huang, Qingming
Views/Downloads: 32/0 | Submitted: 2022/12/07
Transformers  Cognition  Visualization  Feature extraction  Decoding  Task analysis  Semantics  Video captioning  relational reasoning  long short-term graph  transformer  
Adaptive Spatial Location With Balanced Loss for Video Captioning (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, Volume: 32, Issue: 1, Pages: 17-30
Authors: Li, Linghui; Zhang, Yongdong; Tang, Sheng; Xie, Lingxi; Li, Xiaoyong; Tian, Qi
Views/Downloads: 24/0 | Submitted: 2022/12/07
Task analysis  Redundancy  Feature extraction  Visualization  Detectors  Computer vision  Training  Convolutional neural network  recurrent neural network  video captioning  adaptive spatial location  balanced loss  
Refocused Attention: Long Short-Term Rewards Guided Video Captioning (Journal article)
NEURAL PROCESSING LETTERS, 2020, Volume: 52, Issue: 2, Pages: 935-948
Authors: Dong, Jiarong; Gao, Ke; Chen, Xiaokai; Cao, Juan
Views/Downloads: 74/0 | Submitted: 2020/12/10
Video captioning  Hierarchical attention  Reinforcement learning  Reward  
Dual-Stream Recurrent Neural Network for Video Captioning (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, Volume: 29, Issue: 8, Pages: 2482-2493
Authors: Xu, Ning; Liu, An-An; Wong, Yongkang; Zhang, Yongdong; Nie, Weizhi; Su, Yuting; Kankanhalli, Mohan
Views/Downloads: 88/0 | Submitted: 2019/12/10
Video captioning  hidden state fusion  dual stream  recurrent neural network  attention module