CSpace

浏览/检索结果: 共1条,第1-1条 帮助

已选(0)清除 条数/页:   排序方式:
Prompting Video-Language Foundation Models With Domain-Specific Fine-Grained Heuristics for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 卷号: 35, 期号: 2, 页码: 1615-1630
作者:  Yu, Ting;  Fu, Kunhao;  Wang, Shuhui;  Huang, Qingming;  Yu, Jun
收藏  |  浏览/下载:1/0  |  提交时间:2025/06/25
Cognition  Computational modeling  Visualization  Context modeling  Data models  Adaptation models  Accuracy  Question answering (information retrieval)  Transformers  Feature extraction  Video question answering  discriminative unimodal comprehension  cross-modal interaction  domain-specific heuristics  video-language foundation models  entity-action relationships  context-aware reasoning