CSpace

Browse/Search results: 30 items in total, items 1-10

A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes  |  Journal article
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 3, Pages: 1322-1338
Authors:  Yu, Ting;  Lin, Xiaojun;  Wang, Shuhui;  Sheng, Weiguo;  Huang, Qingming;  Yu, Jun
Views/Downloads: 2/0  |  Submitted: 2024/05/20
Three-dimensional displays  Task analysis  Visualization  Point cloud compression  Grounding  Surveys  Solid modeling  3D dense captioning  vision-language bridging  visual captioning  3D point cloud  
Learning Hierarchical Modular Networks for Video Captioning  |  Journal article
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, Volume: 46, Issue: 2, Pages: 1049-1064
Authors:  Li, Guorong;  Ye, Hanhua;  Qi, Yuankai;  Wang, Shuhui;  Qing, Laiyun;  Huang, Qingming;  Yang, Ming-Hsuan
Views/Downloads: 2/0  |  Submitted: 2024/05/20
Video captioning  hierarchical modular network  scene-graph reward  reinforcement learning  
Temporal Dynamic Concept Modeling Network for Explainable Video Event Recognition  |  Journal article
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2023, Volume: 19, Issue: 6, Pages: 22
Authors:  Zhang, Weigang;  Qi, Zhaobo;  Wang, Shuhui;  Su, Chi;  Su, Li;  Huang, Qingming
Views/Downloads: 8/0  |  Submitted: 2023/12/04
Event recognition  temporal concept receptive field  dynamic convolution  
Self-Regulated Learning for Egocentric Video Activity Anticipation  |  Journal article
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 6, Pages: 6715-6730
Authors:  Qi, Zhaobo;  Wang, Shuhui;  Su, Chi;  Su, Li;  Huang, Qingming;  Tian, Qi
Views/Downloads: 7/0  |  Submitted: 2023/12/04
Predictive models  Dairy products  Semantics  Feature extraction  Visualization  Activity recognition  Task analysis  Egocentric video activity anticipation  third-person video activity anticipation  contrastive learning  multi-task learning  self-regulated learning  
Spatial-Temporal Graph Network for Video Crowd Counting  |  Journal article
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 1, Pages: 228-241
Authors:  Wu, Zhe;  Zhang, Xinfeng;  Tian, Geng;  Wang, Yaowei;  Huang, Qingming
Views/Downloads: 13/0  |  Submitted: 2023/07/12
Computational modeling  Predictive models  Analytical models  Long short term memory  Optical flow  Integrated circuit modeling  Head  Video-based crowd counting  spatiotemporal graph attention  multi-scale module  
Weakly Supervised Anomaly Detection in Videos Considering the Openness of Events  |  Journal article
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2022, Pages: 13
Authors:  Zhang, Chen;  Li, Guorong;  Xu, Qianqian;  Zhang, Xinfeng;  Su, Li;  Huang, Qingming
Views/Downloads: 23/0  |  Submitted: 2022/12/07
Anomaly detection  Videos  Open data  Data models  Training  Feature extraction  Predictive models  Anomaly detection  surveillance videos  openness  meta-learning  
Syntax-Guided Hierarchical Attention Network for Video Captioning  |  Journal article
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, Volume: 32, Issue: 2, Pages: 880-892
Authors:  Deng, Jincan;  Li, Liang;  Zhang, Beichen;  Wang, Shuhui;  Zha, Zhengjun;  Huang, Qingming
Views/Downloads: 19/0  |  Submitted: 2022/12/07
Syntactics  Feature extraction  Visualization  Generators  Semantics  Two dimensional displays  Three-dimensional displays  Video captioning  syntax attention  content attention  global sentence-context  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning  |  Journal article
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 3565-3577
Authors:  Tu, Yunbin;  Li, Liang;  Su, Li;  Gao, Shengxiang;  Yan, Chenggang;  Zha, Zheng-Jun;  Yu, Zhengtao;  Huang, Qingming
Views/Downloads: 20/0  |  Submitted: 2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer  
Long Short-Term Relation Transformer With Global Gating for Video Captioning  |  Journal article
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 2726-2738
Authors:  Li, Liang;  Gao, Xingyu;  Deng, Jincan;  Tu, Yunbin;  Zha, Zheng-Jun;  Huang, Qingming
Views/Downloads: 23/0  |  Submitted: 2022/12/07
Transformers  Cognition  Visualization  Feature extraction  Decoding  Task analysis  Semantics  Video captioning  relational reasoning  long short-term graph  transformer  
Stereoscopic Image Retargeting Based on Deep Convolutional Neural Network  |  Journal article
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2021, Volume: 31, Issue: 12, Pages: 4759-4770
Authors:  Fan, Xiaoting;  Lei, Jianjun;  Liang, Jie;  Fang, Yuming;  Ling, Nam;  Huang, Qingming
Views/Downloads: 29/0  |  Submitted: 2022/06/21
Stereo image processing  Three-dimensional displays  Two dimensional displays  Feature extraction  Distortion  Visualization  Shape  Stereoscopic image  image retargeting  cross-attention  disparity consistency