CSpace

浏览/检索结果: 共20条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 3, 页码: 1322-1338
作者:  Yu, Ting;  Lin, Xiaojun;  Wang, Shuhui;  Sheng, Weiguo;  Huang, Qingming;  Yu, Jun
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Three-dimensional displays  Task analysis  Visualization  Point cloud compression  Grounding  Surveys  Solid modeling  3D dense captioning  vision-language bridging  visual captioning  3D point cloud  
Learning Hierarchical Modular Networks for Video Captioning 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 2, 页码: 1049-1064
作者:  Li, Guorong;  Ye, Hanhua;  Qi, Yuankai;  Wang, Shuhui;  Qing, Laiyun;  Huang, Qingming;  Yang, Ming-Hsuan
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Video captioning  hierarchical modular network  scene-graph reward  reinforcement learning  
MemBridge: Video-Language Pre-Training With Memory-Augmented Inter-Modality Bridge 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4073-4087
作者:  Yang, Jiahao;  Li, Xiangyang;  Zheng, Mao;  Wang, Zihan;  Zhu, Yongqing;  Guo, Xiaoqian;  Yuan, Yuchen;  Chai, Zifeng;  Jiang, Shuqiang
收藏  |  浏览/下载:8/0  |  提交时间:2023/12/04
Video-language pre-training  inter-modality bridge  memory module  
CLIP4Clip: An empirical study of CLIP for end to end video clip retrieval and captioning 期刊论文
NEUROCOMPUTING, 2022, 卷号: 508, 页码: 293-304
作者:  Luo, Huaishao;  Ji, Lei;  Zhong, Ming;  Chen, Yang;  Lei, Wen;  Duan, Nan;  Li, Tianrui
收藏  |  浏览/下载:27/0  |  提交时间:2022/12/07
Video retrieval  Video captioning  CLIP  
Multimodal graph neural network for video procedural captioning 期刊论文
NEUROCOMPUTING, 2022, 卷号: 488, 页码: 88-96
作者:  Ji, Lei;  Tu, Rongcheng;  Lin, Kevin;  Wang, Lijuan;  Duan, Nan
收藏  |  浏览/下载:16/0  |  提交时间:2022/12/07
Multimodal video captioning  Graph neural network  
Syntax-Guided Hierarchical Attention Network for Video Captioning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 2, 页码: 880-892
作者:  Deng, Jincan;  Li, Liang;  Zhang, Beichen;  Wang, Shuhui;  Zha, Zhengjun;  Huang, Qingming
收藏  |  浏览/下载:16/0  |  提交时间:2022/12/07
Syntactics  Feature extraction  Visualization  Generators  Semantics  Two dimensional displays  Three-dimensional displays  Video captioning  syntax attention  content attention  global sentence-context  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 3565-3577
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Gao, Shengxiang;  Yan, Chenggang;  Zha, Zheng-Jun;  Yu, Zhengtao;  Huang, Qingming
收藏  |  浏览/下载:16/0  |  提交时间:2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer  
Task-Adaptive Attention for Image Captioning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 1, 页码: 43-51
作者:  Yan, Chenggang;  Hao, Yiming;  Li, Liang;  Yin, Jian;  Liu, Anan;  Mao, Zhendong;  Chen, Zhenyu;  Gao, Xingyu
收藏  |  浏览/下载:16/0  |  提交时间:2022/12/07
Task analysis  Visualization  Feature extraction  Decoding  Computational modeling  Adaptation models  Feeds  Image captioning  attention mechanism  transformer  
Long Short-Term Relation Transformer With Global Gating for Video Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 2726-2738
作者:  Li, Liang;  Gao, Xingyu;  Deng, Jincan;  Tu, Yunbin;  Zha, Zheng-Jun;  Huang, Qingming
收藏  |  浏览/下载:19/0  |  提交时间:2022/12/07
Transformers  Cognition  Visualization  Feature extraction  Decoding  Task analysis  Semantics  Video captioning  relational reasoning  long short-term graph  transformer  
Adaptive Spatial Location With Balanced Loss for Video Captioning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 1, 页码: 17-30
作者:  Li, Linghui;  Zhang, Yongdong;  Tang, Sheng;  Xie, Lingxi;  Li, Xiaoyong;  Tian, Qi
收藏  |  浏览/下载:16/0  |  提交时间:2022/12/07
Task analysis  Redundancy  Feature extraction  Visualization  Detectors  Computer vision  Training  Convolutional neural network  recurrent neural network  video captioning  adaptive spatial location  balanced loss