CSpace

Browse/Search Results: 289 items in total, showing items 1-10

A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes (journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 3, Pages: 1322-1338
Authors: Yu, Ting; Lin, Xiaojun; Wang, Shuhui; Sheng, Weiguo; Huang, Qingming; Yu, Jun
Views/Downloads: 2/0 | Submitted: 2024/05/20
Three-dimensional displays  Task analysis  Visualization  Point cloud compression  Grounding  Surveys  Solid modeling  3D dense captioning  vision-language bridging  visual captioning  3D point cloud  
Learning Hierarchical Modular Networks for Video Captioning (journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, Volume: 46, Issue: 2, Pages: 1049-1064
Authors: Li, Guorong; Ye, Hanhua; Qi, Yuankai; Wang, Shuhui; Qing, Laiyun; Huang, Qingming; Yang, Ming-Hsuan
Views/Downloads: 2/0 | Submitted: 2024/05/20
Video captioning  hierarchical modular network  scene-graph reward  reinforcement learning  
Cross Modal Compression With Variable Rate Prompt (journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 3444-3456
Authors: Gao, Junlong; Li, Jiguo; Jia, Chuanmin; Wang, Shanshe; Ma, Siwei; Gao, Wen
Views/Downloads: 2/0 | Submitted: 2024/05/20
Cross modal compression  semantic fidelity  variable rate prompt  
Event Graph Guided Compositional Spatial-Temporal Reasoning for Video Question Answering (journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1109-1121
Authors: Bai, Ziyi; Wang, Ruiping; Gao, Difei; Chen, Xilin
Views/Downloads: 2/0 | Submitted: 2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Semantic-Context Graph Network for Point-Based 3D Object Detection (journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 11, Pages: 6474-6486
Authors: Dong, Shuwei; Kong, Xiaoyu; Pan, Xingjia; Tang, Fan; Li, Wei; Chang, Yi; Dong, Weiming
Views/Downloads: 2/0 | Submitted: 2024/05/20
3D object detection  graph neural networks  information entanglement  
Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer (journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 8, Pages: 4441-4445
Authors: Zhang, Pingping; Wang, Shiqi; Wang, Meng; Li, Jiguo; Wang, Xu; Kwong, Sam
Views/Downloads: 7/0 | Submitted: 2023/12/04
Semantic image compression  cross-modality  scalable coding  
Context-Aware Proposal-Boundary Network With Structural Consistency for Audiovisual Event Localization (journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 11
Authors: Wang, Hao; Zha, Zheng-Jun; Li, Liang; Chen, Xuejin; Luo, Jiebo
Views/Downloads: 8/0 | Submitted: 2023/12/04
Audiovisual learning  context learning  event localization  
MIFNet: Multiple instances focused temporal action proposal generation (journal article)
NEUROCOMPUTING, 2023, Volume: 538, Pages: 13
Authors: Wang, Lining; Yao, Hongxun; Yang, Haosen; Wang, Sibo; Jin, Sheng
Views/Downloads: 7/0 | Submitted: 2023/12/04
Video understanding  Temporal action proposal  Temporal action detection  Contrastive learning  Multiple instances  
Self-Regulated Learning for Egocentric Video Activity Anticipation (journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 6, Pages: 6715-6730
Authors: Qi, Zhaobo; Wang, Shuhui; Su, Chi; Su, Li; Huang, Qingming; Tian, Qi
Views/Downloads: 7/0 | Submitted: 2023/12/04
Predictive models  Dairy products  Semantics  Feature extraction  Visualization  Activity recognition  Task analysis  Egocentric video activity anticipation  third-person video activity anticipation  contrastive learning  multi-task learning  self-regulated learning
Robust Pose Transfer With Dynamic Details Using Neural Video Rendering (journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 2, Pages: 2660-2666
Authors: Sun, Yang-Tian; Huang, Hao-Zhi; Wang, Xuan; Lai, Yu-Kun; Liu, Wei; Gao, Lin
Views/Downloads: 16/0 | Submitted: 2023/07/12
Deep generative model  dynamic details generation  human video synthesis  neural rendering  pose transfer