CSpace

浏览/检索结果: 共364条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 3, 页码: 1322-1338
作者:  Yu, Ting;  Lin, Xiaojun;  Wang, Shuhui;  Sheng, Weiguo;  Huang, Qingming;  Yu, Jun
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Three-dimensional displays  Task analysis  Visualization  Point cloud compression  Grounding  Surveys  Solid modeling  3D dense captioning  vision-language bridging  visual captioning  3D point cloud  
Learning Hierarchical Modular Networks for Video Captioning 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 2, 页码: 1049-1064
作者:  Li, Guorong;  Ye, Hanhua;  Qi, Yuankai;  Wang, Shuhui;  Qing, Laiyun;  Huang, Qingming;  Yang, Ming-Hsuan
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Video captioning  hierarchical modular network  scene-graph reward  reinforcement learning  
Cross Modal Compression With Variable Rate Prompt 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 3444-3456
作者:  Gao, Junlong;  Li, Jiguo;  Jia, Chuanmin;  Wang, Shanshe;  Ma, Siwei;  Gao, Wen
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Cross modal compression  semantic fidelity  variable rate prompt  
Event Graph Guided Compositional Spatial--Temporal Reasoning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1109-1121
作者:  Bai, Ziyi;  Wang, Ruiping;  Gao, Difei;  Chen, Xilin
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Multi-state Ingredient Recognition via Adaptive Multi-centric Network 期刊论文
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, 页码: 10
作者:  Wen, Min;  Song, Jiajun;  Min, Weiqing;  Xiao, Weimin;  Han, Lin;  Jiang, Shuqiang
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Ingredient recognition  intelligent cooking device  
ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models 期刊论文
ACM TRANSACTIONS ON GRAPHICS, 2023, 卷号: 42, 期号: 6, 页码: 14
作者:  Zhang, Yuxin;  Dong, Weiming;  Tang, Fan;  Huang, Nisha;  Huang, Haibin;  Ma, Chongyang;  Lee, Tong-Yee;  Deussen, Oliver;  Xu, Changsheng
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Image generation  Diffusion models  Attribute-aware editing  Model personalization  
DyTSCL: Dynamic graph representation via tempo-structural contrastive learning 期刊论文
NEUROCOMPUTING, 2023, 卷号: 556, 页码: 8
作者:  Li, Jianian;  Bao, Peng;  Yan, Rong;  Shen, Huawei
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Graph representation learning  Contrastive learning  Dynamic graph  Tempo-structural information  
Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 8, 页码: 4441-4445
作者:  Zhang, Pingping;  Wang, Shiqi;  Wang, Meng;  Li, Jiguo;  Wang, Xu;  Kwong, Sam
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Semantic image compression  cross-modality  scalable coding  
Large Scale Visual Food Recognition 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 8, 页码: 9932-9949
作者:  Min, Weiqing;  Wang, Zhiling;  Liu, Yuxin;  Luo, Mengjiang;  Kang, Liping;  Wei, Xiaoming;  Wei, Xiaolin;  Jiang, Shuqiang
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Image recognition  Visualization  Task analysis  Benchmark testing  Representation learning  Training  Semantics  Food dataset  food recognition  large-scale datasets  fine-grained recognition  
Context-Aware Proposal-Boundary Network With Structural Consistency for Audiovisual Event Localization 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 11
作者:  Wang, Hao;  Zha, Zheng-Jun;  Li, Liang;  Chen, Xuejin;  Luo, Jiebo
收藏  |  浏览/下载:8/0  |  提交时间:2023/12/04
Audiovisual learning  context learning  event localization