CSpace

Browse/Search results: 30 items in total, showing items 1-10

Prompting Video-Language Foundation Models With Domain-Specific Fine-Grained Heuristics for Video Question Answering (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, Volume: 35, Issue: 2, Pages: 1615-1630
Authors: Yu, Ting; Fu, Kunhao; Wang, Shuhui; Huang, Qingming; Yu, Jun
Views/Downloads: 0/0  |  Submitted: 2025/06/25
Cognition  Computational modeling  Visualization  Context modeling  Data models  Adaptation models  Accuracy  Question answering (information retrieval)  Transformers  Feature extraction  Video question answering  discriminative unimodal comprehension  cross-modal interaction  domain-specific heuristics  video-language foundation models  entity-action relationships  context-aware reasoning  
Consistency Conditioned Memory Augmented Dynamic Diagnosis Model for Medical Visual Question Answering (Journal article)
IEEE JOURNAL OF BIOMEDICAL AND HEALTH INFORMATICS, 2025, Volume: 29, Issue: 2, Pages: 1357-1370
Authors: Yu, Ting; Ge, Binhui; Wang, Shuhui; Yang, Yan; Huang, Qingming; Yu, Jun
Views/Downloads: 0/0  |  Submitted: 2025/06/25
Medical diagnostic imaging  Visualization  Question answering (information retrieval)  Feature extraction  Semantics  Engines  Cognition  Accuracy  Predictive models  Electronic mail  Clinical decisions  consistency  dynamic memory diagnosis  dynamic reasoning  medical assistance  medical visual question answering  
Screen Content-Aware Video Coding Through Non-Local Model Embedded With Intra-Inter In-Loop Filtering (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, Volume: 35, Issue: 2, Pages: 1870-1883
Authors: Li, Mingxuan; Ji, Wen
Views/Downloads: 0/0  |  Submitted: 2025/06/25
Encoding  Feature extraction  Computational modeling  Adaptation models  Visualization  Nonlinear distortion  Information filters  Deep learning  Standards  Image color analysis  Video coding  in-loop filtering  screen content  deep learning  high-efficiency video coding (HEVC)  
Boost Tracking by Natural Language With Prompt-Guided Grounding (Journal article)
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, Pages: 13
Authors: Li, Hengyou; Liu, Xinyan; Li, Guorong; Wang, Shuhui; Qing, Laiyun; Huang, Qingming
Views/Downloads: 0/0  |  Submitted: 2025/06/25
Target tracking  Grounding  Switches  Visualization  Feature extraction  Computational modeling  Adaptation models  Location awareness  Linguistics  Memory management  Vision-language tracking  prompt learning  inverse tracking  
Context-Aware Proposal-Boundary Network With Structural Consistency for Audiovisual Event Localization (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2024, Volume: 35, Issue: 11, Pages: 15872-15882
Authors: Wang, Hao; Zha, Zheng-Jun; Li, Liang; Chen, Xuejin; Luo, Jiebo
Views/Downloads: 1/0  |  Submitted: 2025/06/25
Proposals  Visualization  Location awareness  Encoding  Task analysis  Feature extraction  Aggregates  Audiovisual learning  context learning  event localization  
Context Disentangling and Prototype Inheriting for Robust Visual Grounding (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, Volume: 46, Issue: 5, Pages: 3213-3229
Authors: Tang, Wei; Li, Liang; Liu, Xuejing; Jin, Lu; Tang, Jinhui; Li, Zechao
Views/Downloads: 29/0  |  Submitted: 2024/05/20
Visualization  Grounding  Prototypes  Transformers  Task analysis  Linguistics  Feature extraction  Context disentangling  open-vocabulary scene  prototype discovering  robust grounding  visual grounding (VG)  
Event Graph Guided Compositional Spatial-Temporal Reasoning for Video Question Answering (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1109-1121
Authors: Bai, Ziyi; Wang, Ruiping; Gao, Difei; Chen, Xilin
Views/Downloads: 27/0  |  Submitted: 2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Convolution-Enhanced Bi-Branch Adaptive Transformer With Cross-Task Interaction for Food Category and Ingredient Recognition (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 2572-2586
Authors: Liu, Yuxin; Min, Weiqing; Jiang, Shuqiang; Rui, Yong
Views/Downloads: 26/0  |  Submitted: 2024/05/20
Semantics  Visualization  Transformers  Task analysis  Feature extraction  Image recognition  Fish  Food recognition  ingredient recognition  food computing  fine-grained recognition  multi-label recognition  
Synthesizing Knowledge-Enhanced Features for Real-World Zero-Shot Food Detection (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1285-1298
Authors: Zhou, Pengfei; Min, Weiqing; Song, Jiajun; Zhang, Yang; Jiang, Shuqiang
Views/Downloads: 27/0  |  Submitted: 2024/05/20
Semantics  Feature extraction  Visualization  Annotations  Correlation  Training  Task analysis  Food detection  zero-shot detection  food computing  object detection  zero-shot learning  
Self-Regulated Learning for Egocentric Video Activity Anticipation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 6, Pages: 6715-6730
Authors: Qi, Zhaobo; Wang, Shuhui; Su, Chi; Su, Li; Huang, Qingming; Tian, Qi
Views/Downloads: 32/0  |  Submitted: 2023/12/04
Predictive models  Dairy products  Semantics  Feature extraction  Visualization  Activity recognition  Task analysis  Egocentric video activity anticipation  third-person video activity anticipation  contrastive learning  multi-task learning  self-regulated learning