CSpace

Browse/Search Results: 97 items in total, showing items 1-10

Learning Hierarchical Modular Networks for Video Captioning (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, Volume: 46, Issue: 2, Pages: 1049-1064
Authors: Li, Guorong; Ye, Hanhua; Qi, Yuankai; Wang, Shuhui; Qing, Laiyun; Huang, Qingming; Yang, Ming-Hsuan
Views/Downloads: 2/0 | Submitted: 2024/05/20
Video captioning  hierarchical modular network  scene-graph reward  reinforcement learning  
Event Graph Guided Compositional Spatial-Temporal Reasoning for Video Question Answering (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1109-1121
Authors: Bai, Ziyi; Wang, Ruiping; Gao, Difei; Chen, Xilin
Views/Downloads: 2/0 | Submitted: 2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Semantic-Context Graph Network for Point-Based 3D Object Detection (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 11, Pages: 6474-6486
Authors: Dong, Shuwei; Kong, Xiaoyu; Pan, Xingjia; Tang, Fan; Li, Wei; Chang, Yi; Dong, Weiming
Views/Downloads: 2/0 | Submitted: 2024/05/20
3D object detection  graph neural networks  information entanglement  
Self-Regulated Learning for Egocentric Video Activity Anticipation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 6, Pages: 6715-6730
Authors: Qi, Zhaobo; Wang, Shuhui; Su, Chi; Su, Li; Huang, Qingming; Tian, Qi
Views/Downloads: 7/0 | Submitted: 2023/12/04
Predictive models  Dairy products  Semantics  Feature extraction  Visualization  Activity recognition  Task analysis  Egocentric video activity anticipation  third-person video activity anticipation  contrastive learning  multi-task learning  self-regulated learning  
Semantic and Relation Modulation for Audio-Visual Event Localization (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 6, Pages: 7711-7725
Authors: Wang, Hao; Zha, Zheng-Jun; Li, Liang; Chen, Xuejin; Luo, Jiebo
Views/Downloads: 7/0 | Submitted: 2023/12/04
Visualization  Location awareness  Correlation  Proposals  Semantics  Task analysis  Modulation  Audio-visual learning  event localization  normalization  
Focus and Align: Learning Tube Tokens for Video-Language Pre-Training (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 8036-8050
Authors: Zhu, Yongqing; Li, Xiangyang; Zheng, Mao; Yang, Jiahao; Wang, Zihan; Guo, Xiaoqian; Chai, Zifeng; Yuan, Yuchen; Jiang, Shuqiang
Views/Downloads: 2/0 | Submitted: 2024/05/20
Electron tubes  Semantics  Visualization  Feature extraction  Task analysis  Transformers  Detectors  Local alignment mechanism  semantic centers  tube tokens  video-language pre-training  
Syntax-Guided Hierarchical Attention Network for Video Captioning (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, Volume: 32, Issue: 2, Pages: 880-892
Authors: Deng, Jincan; Li, Liang; Zhang, Beichen; Wang, Shuhui; Zha, Zhengjun; Huang, Qingming
Views/Downloads: 19/0 | Submitted: 2022/12/07
Syntactics  Feature extraction  Visualization  Generators  Semantics  Two dimensional displays  Three-dimensional displays  Video captioning  syntax attention  content attention  global sentence-context  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 3565-3577
Authors: Tu, Yunbin; Li, Liang; Su, Li; Gao, Shengxiang; Yan, Chenggang; Zha, Zheng-Jun; Yu, Zhengtao; Huang, Qingming
Views/Downloads: 20/0 | Submitted: 2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer  
Long Short-Term Relation Transformer With Global Gating for Video Captioning (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 2726-2738
Authors: Li, Liang; Gao, Xingyu; Deng, Jincan; Tu, Yunbin; Zha, Zheng-Jun; Huang, Qingming
Views/Downloads: 23/0 | Submitted: 2022/12/07
Transformers  Cognition  Visualization  Feature extraction  Decoding  Task analysis  Semantics  Video captioning  relational reasoning  long short-term graph  transformer  
IAUnet: Global Context-Aware Feature Learning for Person Reidentification (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2021, Volume: 32, Issue: 10, Pages: 4460-4474
Authors: Hou, Ruibing; Ma, Bingpeng; Chang, Hong; Gu, Xinqian; Shan, Shiguang; Chen, Xilin
Views/Downloads: 35/0 | Submitted: 2021/12/01
Context modeling  Feature extraction  Computational modeling  Semantics  Aggregates  Visualization  Task analysis  Feature enhancing  interaction-aggregation  person reidentification (reID)  spatial-temporal context modeling