CSpace

浏览/检索结果: 共7条,第1-7条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Event Graph Guided Compositional Spatial--Temporal Reasoning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1109-1121
作者:  Bai, Ziyi;  Wang, Ruiping;  Gao, Difei;  Chen, Xilin
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
MemBridge: Video-Language Pre-Training With Memory-Augmented Inter-Modality Bridge 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4073-4087
作者:  Yang, Jiahao;  Li, Xiangyang;  Zheng, Mao;  Wang, Zihan;  Zhu, Yongqing;  Guo, Xiaoqian;  Yuan, Yuchen;  Chai, Zifeng;  Jiang, Shuqiang
收藏  |  浏览/下载:9/0  |  提交时间:2023/12/04
Video-language pre-training  inter-modality bridge  memory module  
Contrastive Learning of Person-Independent Representations for Facial Action Unit Detection 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3212-3225
作者:  Li, Yong;  Shan, Shiguang
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Gold  Videos  Training  Image reconstruction  Feature extraction  Faces  Task analysis  Facial action unit detection  contrastive Learning  self-supervised learning  person-independent action unit detection  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 3565-3577
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Gao, Shengxiang;  Yan, Chenggang;  Zha, Zheng-Jun;  Yu, Zhengtao;  Huang, Qingming
收藏  |  浏览/下载:20/0  |  提交时间:2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer  
RhythmNet: End-to-End Heart Rate Estimation From Face via Spatial-Temporal Representation 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, 卷号: 29, 页码: 2409-2423
作者:  Niu, Xuesong;  Shan, Shiguang;  Han, Hu;  Chen, Xilin
收藏  |  浏览/下载:51/0  |  提交时间:2020/12/10
Heart rate  Estimation  Webcams  Databases  Skin  Image color analysis  Head  Remote heart rate estimation  rPPG  spatial-temporal representation  end-to-end learning  
Spatial Pyramid Covariance-Based Compact Video Code for Robust Face Retrieval in TV-Series 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 卷号: 25, 期号: 12, 页码: 5905-5919
作者:  Li, Yan;  Wang, Ruiping;  Cui, Zhen;  Shan, Shiguang;  Chen, Xilin
收藏  |  浏览/下载:40/0  |  提交时间:2019/12/13
Face video retrieval  covariance matrix  spatial pyramid covariance  compact video code  binary code learning  
Learning Expressionlets via Universal Manifold Model for Dynamic Facial Expression Recognition 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2016, 卷号: 25, 期号: 12, 页码: 5920-5932
作者:  Liu, Mengyi;  Shan, Shiguang;  Wang, Ruiping;  Chen, Xilin
收藏  |  浏览/下载:44/0  |  提交时间:2019/12/13
Facial expression recognition  universal manifold model  Riemannian manifold  discriminant Learning  expressionlets