CSpace

浏览/检索结果: 共18条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
MemBridge: Video-Language Pre-Training With Memory-Augmented Inter-Modality Bridge 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4073-4087
作者:  Yang, Jiahao;  Li, Xiangyang;  Zheng, Mao;  Wang, Zihan;  Zhu, Yongqing;  Guo, Xiaoqian;  Yuan, Yuchen;  Chai, Zifeng;  Jiang, Shuqiang
收藏  |  浏览/下载:8/0  |  提交时间:2023/12/04
Video-language pre-training  inter-modality bridge  memory module  
CLIP4Clip: An empirical study of CLIP for end to end video clip retrieval and captioning 期刊论文
NEUROCOMPUTING, 2022, 卷号: 508, 页码: 293-304
作者:  Luo, Huaishao;  Ji, Lei;  Zhong, Ming;  Chen, Yang;  Lei, Wen;  Duan, Nan;  Li, Tianrui
收藏  |  浏览/下载:27/0  |  提交时间:2022/12/07
Video retrieval  Video captioning  CLIP  
Multimodal graph neural network for video procedural captioning 期刊论文
NEUROCOMPUTING, 2022, 卷号: 488, 页码: 88-96
作者:  Ji, Lei;  Tu, Rongcheng;  Lin, Kevin;  Wang, Lijuan;  Duan, Nan
收藏  |  浏览/下载:16/0  |  提交时间:2022/12/07
Multimodal video captioning  Graph neural network  
Syntax-Guided Hierarchical Attention Network for Video Captioning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 2, 页码: 880-892
作者:  Deng, Jincan;  Li, Liang;  Zhang, Beichen;  Wang, Shuhui;  Zha, Zhengjun;  Huang, Qingming
收藏  |  浏览/下载:16/0  |  提交时间:2022/12/07
Syntactics  Feature extraction  Visualization  Generators  Semantics  Two dimensional displays  Three-dimensional displays  Video captioning  syntax attention  content attention  global sentence-context  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 3565-3577
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Gao, Shengxiang;  Yan, Chenggang;  Zha, Zheng-Jun;  Yu, Zhengtao;  Huang, Qingming
收藏  |  浏览/下载:16/0  |  提交时间:2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer  
Task-Adaptive Attention for Image Captioning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 1, 页码: 43-51
作者:  Yan, Chenggang;  Hao, Yiming;  Li, Liang;  Yin, Jian;  Liu, Anan;  Mao, Zhendong;  Chen, Zhenyu;  Gao, Xingyu
收藏  |  浏览/下载:16/0  |  提交时间:2022/12/07
Task analysis  Visualization  Feature extraction  Decoding  Computational modeling  Adaptation models  Feeds  Image captioning  attention mechanism  transformer  
Long Short-Term Relation Transformer With Global Gating for Video Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 2726-2738
作者:  Li, Liang;  Gao, Xingyu;  Deng, Jincan;  Tu, Yunbin;  Zha, Zheng-Jun;  Huang, Qingming
收藏  |  浏览/下载:19/0  |  提交时间:2022/12/07
Transformers  Cognition  Visualization  Feature extraction  Decoding  Task analysis  Semantics  Video captioning  relational reasoning  long short-term graph  transformer  
Adaptive Spatial Location With Balanced Loss for Video Captioning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 1, 页码: 17-30
作者:  Li, Linghui;  Zhang, Yongdong;  Tang, Sheng;  Xie, Lingxi;  Li, Xiaoyong;  Tian, Qi
收藏  |  浏览/下载:16/0  |  提交时间:2022/12/07
Task analysis  Redundancy  Feature extraction  Visualization  Detectors  Computer vision  Training  Convolutional neural network  recurrent neural network  video captioning  adaptive spatial location  balanced loss  
Refocused Attention: Long Short-Term Rewards Guided Video Captioning 期刊论文
NEURAL PROCESSING LETTERS, 2020, 卷号: 52, 期号: 2, 页码: 935-948
作者:  Dong, Jiarong;  Gao, Ke;  Chen, Xiaokai;  Cao, Juan
收藏  |  浏览/下载:63/0  |  提交时间:2020/12/10
Video captioning  Hierarchical attention  Reinforcement learning  Reward  
Dual-Stream Recurrent Neural Network for Video Captioning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2019, 卷号: 29, 期号: 8, 页码: 2482-2493
作者:  Xu, Ning;  Liu, An-An;  Wong, Yongkang;  Zhang, Yongdong;  Nie, Weizhi;  Su, Yuting;  Kankanhalli, Mohan
收藏  |  浏览/下载:75/0  |  提交时间:2019/12/10
Video captioning  hidden state fusion  dual stream  recurrent neural network  attention module