CSpace

Browse/Search Results: 904 items in total, showing items 1-10

Toward Egocentric Compositional Action Anticipation with Adaptive Semantic Debiasing (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, Volume: 20, Issue: 5, Pages: 21
Authors: Zhang, Tianyu; Min, Weiqing; Liu, Tao; Jiang, Shuqiang; Rui, Yong
Views/Downloads: 2/0 | Submitted: 2024/05/20
Egocentric video understanding  compositional action anticipation  semantic bias  adaptive counterfactual analysis  
Deep Learning for Logo Detection: A Survey (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, Volume: 20, Issue: 3, Pages: 23
Authors: Hou, Sujuan; Li, Jiacheng; Min, Weiqing; Hou, Qiang; Zhao, Yanna; Zheng, Yuanjie; Jiang, Shuqiang
Views/Downloads: 2/0 | Submitted: 2024/05/20
Logo detection  computer vision  deep learning  datasets  
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 3, Pages: 1322-1338
Authors: Yu, Ting; Lin, Xiaojun; Wang, Shuhui; Sheng, Weiguo; Huang, Qingming; Yu, Jun
Views/Downloads: 2/0 | Submitted: 2024/05/20
Three-dimensional displays  Task analysis  Visualization  Point cloud compression  Grounding  Surveys  Solid modeling  3D dense captioning  vision-language bridging  visual captioning  3D point cloud  
Hierarchical compositional representations for few-shot action recognition (Journal article)
COMPUTER VISION AND IMAGE UNDERSTANDING, 2024, Volume: 240, Pages: 11
Authors: Li, Changzhen; Zhang, Jie; Wu, Shuzhe; Jin, Xin; Shan, Shiguang
Views/Downloads: 4/0 | Submitted: 2024/05/20
Action recognition  Few-shot learning  Hierarchical compositional representations  Body parts  EMD distance  
Mortar-FP8: Morphing the Existing FP32 Infrastructure for High-Performance Deep Learning Acceleration (Journal article)
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, Volume: 43, Issue: 3, Pages: 878-891
Authors: Li, Hongyan; Lu, Hang; Li, Xiaowei
Views/Downloads: 2/0 | Submitted: 2024/05/20
Deep learning accelerator  deep neural network (DNN)  fp8 format  
Learning Hierarchical Modular Networks for Video Captioning (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, Volume: 46, Issue: 2, Pages: 1049-1064
Authors: Li, Guorong; Ye, Hanhua; Qi, Yuankai; Wang, Shuhui; Qing, Laiyun; Huang, Qingming; Yang, Ming-Hsuan
Views/Downloads: 2/0 | Submitted: 2024/05/20
Video captioning  hierarchical modular network  scene-graph reward  reinforcement learning  
Real-Time Robust Video Object Detection System Against Physical-World Adversarial Attacks (Journal article)
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2024, Volume: 43, Issue: 1, Pages: 366-379
Authors: Han, Husheng; Hu, Xing; Hao, Yifan; Xu, Kaidi; Dang, Pucheng; Wang, Ying; Zhao, Yongwei; Du, Zidong; Guo, Qi; Wang, Yanzhi; Zhang, Xishan; Chen, Tianshi
Views/Downloads: 3/0 | Submitted: 2024/05/20
Object detection  Streaming media  Optical flow  Feature extraction  Real-time systems  Task analysis  Detectors  Adversarial patch attack  deep learning security  domain-specific accelerator  hardware/software co-design  real time  
Cross Modal Compression With Variable Rate Prompt (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, Volume: 26, Pages: 3444-3456
Authors: Gao, Junlong; Li, Jiguo; Jia, Chuanmin; Wang, Shanshe; Ma, Siwei; Gao, Wen
Views/Downloads: 2/0 | Submitted: 2024/05/20
Cross modal compression  semantic fidelity  variable rate prompt  
Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal Retrieval (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1522-1533
Authors: Wang, Yabing; Wang, Shuhui; Luo, Hao; Dong, Jianfeng; Wang, Fan; Han, Meng; Wang, Xun; Wang, Meng
Views/Downloads: 2/0 | Submitted: 2024/05/20
Visualization  Noise measurement  Estimation  Costs  Transportation  Training  Task analysis  Cross-modal retrieval  noise correspondence learning  cross-lingual transfer  optimal transport  machine translation  
Event Graph Guided Compositional Spatial-Temporal Reasoning for Video Question Answering (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1109-1121
Authors: Bai, Ziyi; Wang, Ruiping; Gao, Difei; Chen, Xilin
Views/Downloads: 2/0 | Submitted: 2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning