CSpace

浏览/检索结果: 共3条,第1-3条 帮助

已选(0)清除 条数/页:   排序方式:
Prompting Video-Language Foundation Models With Domain-Specific Fine-Grained Heuristics for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 卷号: 35, 期号: 2, 页码: 1615-1630
作者:  Yu, Ting;  Fu, Kunhao;  Wang, Shuhui;  Huang, Qingming;  Yu, Jun
收藏  |  浏览/下载:53/0  |  提交时间:2025/06/25
Cognition  Computational modeling  Visualization  Context modeling  Data models  Adaptation models  Accuracy  Question answering (information retrieval)  Transformers  Feature extraction  Video question answering  discriminative unimodal comprehension  cross-modal interaction  domain-specific heuristics  video-language foundation models  entity-action relationships  context-aware reasoning  
OTRec: Cross-Modal Learning for Multimodal Recommendation via Optimal Transport 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 卷号: 27, 页码: 8603-8617
作者:  Cao, Zongsheng;  Xu, Qianqian;  Yang, Zhiyong;  He, Yuan;  Cao, Xiaochun;  Huang, Qingming
收藏  |  浏览/下载:1/0  |  提交时间:2026/05/25
Semantics  Recommender systems  Contrastive learning  Electronic mail  Lattices  Data models  Data mining  Artificial intelligence  Accuracy  Visualization  Multimodal recommendation  optimal transport  modal-invariant  modal-specific  
Cross-Modal Knowledge Adaptation for Language-Based Person Search 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 卷号: 30, 页码: 4057-4069
作者:  Chen, Yucheng;  Huang, Rui;  Chang, Hong;  Tan, Chuanqi;  Xue, Tao;  Ma, Bingpeng
收藏  |  浏览/下载:88/0  |  提交时间:2021/12/01
Feature extraction  Task analysis  Lighting  Learning systems  Logic gates  Knowledge engineering  Training  Language-based person search  cross-modal knowledge adaptation  image-specific information