CSpace

Browse/Search results: 5 items in total, showing items 1-5

Multimodal graph neural network for video procedural captioning (Journal article)
NEUROCOMPUTING, 2022, Volume: 488, Pages: 88-96
Authors:  Ji, Lei;  Tu, Rongcheng;  Lin, Kevin;  Wang, Lijuan;  Duan, Nan
Views/Downloads: 18/0  |  Submitted: 2022/12/07
Multimodal video captioning  Graph neural network  
Self-Supervised Enhancement for Named Entity Disambiguation via Multimodal Graph Convolution (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 15
Authors:  Zhou, Pengfei;  Ying, Kaining;  Wang, Zhenhua;  Guo, Dongyan;  Bai, Cong
Views/Downloads: 29/0  |  Submitted: 2022/12/07
Task analysis  Convolution  Semantics  Internet  Bit error rate  Visualization  Pipelines  Graph convolutional network (GCN)  multimodal data  named entity disambiguation (NED)  self-supervised learning (SSL)  
Syntax-Guided Hierarchical Attention Network for Video Captioning (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, Volume: 32, Issue: 2, Pages: 880-892
Authors:  Deng, Jincan;  Li, Liang;  Zhang, Beichen;  Wang, Shuhui;  Zha, Zhengjun;  Huang, Qingming
Views/Downloads: 19/0  |  Submitted: 2022/12/07
Syntactics  Feature extraction  Visualization  Generators  Semantics  Two dimensional displays  Three-dimensional displays  Video captioning  syntax attention  content attention  global sentence-context  
Mixed Dish Recognition With Contextual Relation and Domain Alignment (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2022, Volume: 24, Pages: 2034-2045
Authors:  Deng, Lixi;  Chen, Jingjing;  Ngo, Chong-Wah;  Sun, Qianru;  Tang, Sheng;  Zhang, Yongdong;  Chua, Tat-Seng
Views/Downloads: 18/0  |  Submitted: 2022/12/07
Visualization  Semantics  Feature extraction  Image recognition  Training  Testing  Context modeling  Mixed dish recognition  Contextual relation  Domain alignment  
Long Short-Term Relation Transformer With Global Gating for Video Captioning (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 2726-2738
Authors:  Li, Liang;  Gao, Xingyu;  Deng, Jincan;  Tu, Yunbin;  Zha, Zheng-Jun;  Huang, Qingming
Views/Downloads: 23/0  |  Submitted: 2022/12/07
Transformers  Cognition  Visualization  Feature extraction  Decoding  Task analysis  Semantics  Video captioning  relational reasoning  long short-term graph  transformer