CSpace

浏览/检索结果: 共16条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1522-1533
作者:  Wang, Yabing;  Wang, Shuhui;  Luo, Hao;  Dong, Jianfeng;  Wang, Fan;  Han, Meng;  Wang, Xun;  Wang, Meng
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Visualization  Noise measurement  Estimation  Costs  Transportation  Training  Task analysis  Cross-modal retrieval  noise correspondence learning  cross-lingual transfer  optimal transport  machine translation  
Event Graph Guided Compositional Spatial--Temporal Reasoning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1109-1121
作者:  Bai, Ziyi;  Wang, Ruiping;  Gao, Difei;  Chen, Xilin
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
MemBridge: Video-Language Pre-Training With Memory-Augmented Inter-Modality Bridge 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4073-4087
作者:  Yang, Jiahao;  Li, Xiangyang;  Zheng, Mao;  Wang, Zihan;  Zhu, Yongqing;  Guo, Xiaoqian;  Yuan, Yuchen;  Chai, Zifeng;  Jiang, Shuqiang
收藏  |  浏览/下载:9/0  |  提交时间:2023/12/04
Video-language pre-training  inter-modality bridge  memory module  
Learned Image Compression Using Cross-Component Attention Mechanism 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 5478-5493
作者:  Duan, Wenhong;  Chang, Zheng;  Jia, Chuanmin;  Wang, Shanshe;  Ma, Siwei;  Song, Li;  Gao, Wen
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Image coding  Context modeling  Transforms  Decoding  Standards  Image reconstruction  Transform coding  Image compression  cross-component  information-guided unit  attention mechanism  information-preserving  
Contrastive Learning of Person-Independent Representations for Facial Action Unit Detection 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3212-3225
作者:  Li, Yong;  Shan, Shiguang
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Gold  Videos  Training  Image reconstruction  Feature extraction  Faces  Task analysis  Facial action unit detection  contrastive Learning  self-supervised learning  person-independent action unit detection  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 3565-3577
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Gao, Shengxiang;  Yan, Chenggang;  Zha, Zheng-Jun;  Yu, Zhengtao;  Huang, Qingming
收藏  |  浏览/下载:20/0  |  提交时间:2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer  
Motion Feature Aggregation for Video-Based Person Re-Identification 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 3908-3919
作者:  Gu, Xinqian;  Chang, Hong;  Ma, Bingpeng;  Shan, Shiguang
收藏  |  浏览/下载:20/0  |  提交时间:2022/12/07
Feature extraction  Optical imaging  Computational modeling  Spatiotemporal phenomena  Data mining  Training  Tracking  Video-based person re-identification  temporal information modeling  motion feature extraction  
Long Short-Term Relation Transformer With Global Gating for Video Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 2726-2738
作者:  Li, Liang;  Gao, Xingyu;  Deng, Jincan;  Tu, Yunbin;  Zha, Zheng-Jun;  Huang, Qingming
收藏  |  浏览/下载:23/0  |  提交时间:2022/12/07
Transformers  Cognition  Visualization  Feature extraction  Decoding  Task analysis  Semantics  Video captioning  relational reasoning  long short-term graph  transformer  
RhythmNet: End-to-End Heart Rate Estimation From Face via Spatial-Temporal Representation 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 卷号: 29, 页码: 2409-2423
作者:  Niu, Xuesong;  Shan, Shiguang;  Han, Hu;  Chen, Xilin
收藏  |  浏览/下载:51/0  |  提交时间:2020/12/10
Heart rate  Estimation  Webcams  Databases  Skin  Image color analysis  Head  Remote heart rate estimation  rPPG  spatial-temporal representation  end-to-end learning  
Deep Heterogeneous Hashing for Face Video Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 卷号: 29, 页码: 1299-1312
作者:  Qiao, Shishi;  Wang, Ruiping;  Shan, Shiguang;  Chen, Xilin
收藏  |  浏览/下载:59/0  |  提交时间:2020/12/10
Face  Covariance matrices  Task analysis  Binary codes  Kernel  Manifolds  Feature extraction  Face video retrieval  deep heterogeneous hashing  Riemannian kernel mapping  structured matrix backpropagation