CSpace

浏览/检索结果: 共51条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 8, 页码: 4441-4445
作者:  Zhang, Pingping;  Wang, Shiqi;  Wang, Meng;  Li, Jiguo;  Wang, Xu;  Kwong, Sam
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Semantic image compression  cross-modality  scalable coding  
Context-Aware Proposal-Boundary Network With Structural Consistency for Audiovisual Event Localization 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 11
作者:  Wang, Hao;  Zha, Zheng-Jun;  Li, Liang;  Chen, Xuejin;  Luo, Jiebo
收藏  |  浏览/下载:8/0  |  提交时间:2023/12/04
Audiovisual learning  context learning  event localization  
MIFNet: Multiple instances focused temporal action proposal generation 期刊论文
NEUROCOMPUTING, 2023, 卷号: 538, 页码: 13
作者:  Wang, Lining;  Yao, Hongxun;  Yang, Haosen;  Wang, Sibo;  Jin, Sheng
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Video understanding  Temporal action proposal  Temporal action detection  Contrastive learning  Multiple instances  
Self-Regulated Learning for Egocentric Video Activity Anticipation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 6, 页码: 6715-6730
作者:  Qi, Zhaobo;  Wang, Shuhui;  Su, Chi;  Su, Li;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Predictive models  Dairy products  Semantics  Feature extraction  Visualization  Activity recognition  Task analysis  Egocentric video activity anticipaiton  third-person video activity anticipaiton  contrastive learning  multi-task learning  self-regulated learning  
Robust Pose Transfer With Dynamic Details Using Neural Video Rendering 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 2, 页码: 2660-2666
作者:  Sun, Yang-Tian;  Huang, Hao-Zhi;  Wang, Xuan;  Lai, Yu-Kun;  Liu, Wei;  Gao, Lin
收藏  |  浏览/下载:17/0  |  提交时间:2023/07/12
Deep generative model  dynamic details generation  human video synthesis  neural rendering  pose transfer  
MemBridge: Video-Language Pre-Training With Memory-Augmented Inter-Modality Bridge 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4073-4087
作者:  Yang, Jiahao;  Li, Xiangyang;  Zheng, Mao;  Wang, Zihan;  Zhu, Yongqing;  Guo, Xiaoqian;  Yuan, Yuchen;  Chai, Zifeng;  Jiang, Shuqiang
收藏  |  浏览/下载:9/0  |  提交时间:2023/12/04
Video-language pre-training  inter-modality bridge  memory module  
STAM: A SpatioTemporal Attention Based Memory for Video Prediction 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 2354-2367
作者:  Chang, Zheng;  Zhang, Xinfeng;  Wang, Shanshe;  Ma, Siwei;  Gao, Wen
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Global spatiotemporal information  spatio temporal receptive field  3D convolutional neural network  spatiotemporal attention  sequence learning  video prediction  
Contrastive Learning of Person-Independent Representations for Facial Action Unit Detection 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 3212-3225
作者:  Li, Yong;  Shan, Shiguang
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Gold  Videos  Training  Image reconstruction  Feature extraction  Faces  Task analysis  Facial action unit detection  contrastive Learning  self-supervised learning  person-independent action unit detection  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 3565-3577
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Gao, Shengxiang;  Yan, Chenggang;  Zha, Zheng-Jun;  Yu, Zhengtao;  Huang, Qingming
收藏  |  浏览/下载:20/0  |  提交时间:2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer  
Learning Representations for Facial Actions From Unlabeled Videos 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2022, 卷号: 44, 期号: 1, 页码: 302-317
作者:  Li, Yong;  Zeng, Jiabei;  Shan, Shiguang
收藏  |  浏览/下载:22/0  |  提交时间:2022/06/21
Facial action unit detection  self-supervised learning  representation learning  feature disentanglement  encoder-decoder structure