CSpace

浏览/检索结果: 共11条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1522-1533
作者:  Wang, Yabing;  Wang, Shuhui;  Luo, Hao;  Dong, Jianfeng;  Wang, Fan;  Han, Meng;  Wang, Xun;  Wang, Meng
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Visualization  Noise measurement  Estimation  Costs  Transportation  Training  Task analysis  Cross-modal retrieval  noise correspondence learning  cross-lingual transfer  optimal transport  machine translation  
Event Graph Guided Compositional Spatial--Temporal Reasoning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1109-1121
作者:  Bai, Ziyi;  Wang, Ruiping;  Gao, Difei;  Chen, Xilin
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Convolution-Enhanced Bi-Branch Adaptive Transformer With Cross-Task Interaction for Food Category and Ingredient Recognition 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 2572-2586
作者:  Liu, Yuxin;  Min, Weiqing;  Jiang, Shuqiang;  Rui, Yong
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Semantics  Visualization  Transformers  Task analysis  Feature extraction  Image recognition  Fish  Food recognition  ingredient recognition  food computing  fine-grained recognition  multi-label recognition  
MemBridge: Video-Language Pre-Training With Memory-Augmented Inter-Modality Bridge 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4073-4087
作者:  Yang, Jiahao;  Li, Xiangyang;  Zheng, Mao;  Wang, Zihan;  Zhu, Yongqing;  Guo, Xiaoqian;  Yuan, Yuchen;  Chai, Zifeng;  Jiang, Shuqiang
收藏  |  浏览/下载:9/0  |  提交时间:2023/12/04
Video-language pre-training  inter-modality bridge  memory module  
Learned Image Compression Using Cross-Component Attention Mechanism 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 5478-5493
作者:  Duan, Wenhong;  Chang, Zheng;  Jia, Chuanmin;  Wang, Shanshe;  Ma, Siwei;  Song, Li;  Gao, Wen
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Image coding  Context modeling  Transforms  Decoding  Standards  Image reconstruction  Transform coding  Image compression  cross-component  information-guided unit  attention mechanism  information-preserving  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 3565-3577
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Gao, Shengxiang;  Yan, Chenggang;  Zha, Zheng-Jun;  Yu, Zhengtao;  Huang, Qingming
收藏  |  浏览/下载:20/0  |  提交时间:2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer  
CIR-Net: Cross-Modality Interaction and Refinement for RGB-D Salient Object Detection 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 6800-6815
作者:  Cong, Runmin;  Lin, Qinwei;  Zhang, Chen;  Li, Chongyi;  Cao, Xiaochun;  Huang, Qingming;  Zhao, Yao
收藏  |  浏览/下载:14/0  |  提交时间:2023/07/12
Decoding  Task analysis  Periodic structures  Middleware  Logic gates  Electronic mail  Object detection  Salient object detection  RGB-D images  cross-modality attention  cross-modality interaction  
DPANet: Depth Potentiality-Aware Gated Attention Network for RGB-D Salient Object Detection 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2021, 卷号: 30, 页码: 7012-7024
作者:  Chen, Zuyao;  Cong, Runmin;  Xu, Qianqian;  Huang, Qingming
收藏  |  浏览/下载:38/0  |  提交时间:2021/12/01
Logic gates  Object detection  Contamination  Task analysis  Saliency detection  Computer science  Image color analysis  Salient object detection  RGB-D images  depth potentiality perception  gated multi-modality attention  
Textual-Visual Reference-Aware Attention Network for Visual Dialog 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 卷号: 29, 页码: 6655-6666
作者:  Guo, Dan;  Wang, Hui;  Wang, Shuhui;  Wang, Meng
收藏  |  浏览/下载:44/0  |  提交时间:2020/12/10
Visual dialog  attention network  textual reference  visual reference  multimodal semantic interaction  
Online Asymmetric Metric Learning With Multi-Layer Similarity Aggregation for Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2019, 卷号: 28, 期号: 9, 页码: 4299-4312
作者:  Wu, Yiling;  Wang, Shuhui;  Song, Guoli;  Huang, Qingming
收藏  |  浏览/下载:253/0  |  提交时间:2019/08/16
Cross-modal retrieval  asymmetric metric  online learning  multi-layer aggregation