CSpace

Browse/Search results: 20 items in total, showing items 1-10

Toward Egocentric Compositional Action Anticipation with Adaptive Semantic Debiasing (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, Volume: 20, Issue: 5, Pages: 21
Authors: Zhang, Tianyu; Min, Weiqing; Liu, Tao; Jiang, Shuqiang; Rui, Yong
Views/Downloads: 2/0 | Submitted: 2024/05/20
Egocentric video understanding  compositional action anticipation  semantic bias  adaptive counterfactual analysis  
Event Graph Guided Compositional Spatial-Temporal Reasoning for Video Question Answering (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1109-1121
Authors: Bai, Ziyi; Wang, Ruiping; Gao, Difei; Chen, Xilin
Views/Downloads: 2/0 | Submitted: 2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Self-Regulated Learning for Egocentric Video Activity Anticipation (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 6, Pages: 6715-6730
Authors: Qi, Zhaobo; Wang, Shuhui; Su, Chi; Su, Li; Huang, Qingming; Tian, Qi
Views/Downloads: 7/0 | Submitted: 2023/12/04
Predictive models  Dairy products  Semantics  Feature extraction  Visualization  Activity recognition  Task analysis  Egocentric video activity anticipation  third-person video activity anticipation  contrastive learning  multi-task learning  self-regulated learning
Focus and Align: Learning Tube Tokens for Video-Language Pre-Training (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 8036-8050
Authors: Zhu, Yongqing; Li, Xiangyang; Zheng, Mao; Yang, Jiahao; Wang, Zihan; Guo, Xiaoqian; Chai, Zifeng; Yuan, Yuchen; Jiang, Shuqiang
Views/Downloads: 2/0 | Submitted: 2024/05/20
Electron tubes  Semantics  Visualization  Feature extraction  Task analysis  Transformers  Detectors  Local alignment mechanism  semantic centers  tube tokens  video-language pre-training  
An Efficient Deep Learning Accelerator Architecture for Compressed Video Analysis (Journal article)
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, Volume: 41, Issue: 9, Pages: 2808-2820
Authors: Wang, Yongchen; Wang, Ying; Li, Huawei; Li, Xiaowei
Views/Downloads: 31/0 | Submitted: 2022/12/07
Streaming media  Neural networks  Image coding  Decoding  Metadata  Deep learning  Hardware  Neural network acceleration  specialized accelerator  video analysis  
SANet: Statistic Attention Network for Video-Based Person Re-Identification (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, Volume: 32, Issue: 6, Pages: 3866-3879
Authors: Bai, Shutao; Ma, Bingpeng; Chang, Hong; Huang, Rui; Shan, Shiguang; Chen, Xilin
Views/Downloads: 25/0 | Submitted: 2022/12/07
Feature extraction  Task analysis  Computational modeling  Visualization  Video sequences  Fuses  Computer science  Person re-identification  self-attention  long-range dependencies  high-order statistics  
Astute Video Transmission for Geographically Dispersed Devices in Visual IoT Systems (Journal article)
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, Volume: 21, Issue: 2, Pages: 448-464
Authors: Ji, Wen; Duan, Lingyu; Huang, Xi; Chai, Yueting
Views/Downloads: 18/0 | Submitted: 2022/12/07
Task analysis  Streaming media  Performance evaluation  Delays  Visualization  DH-HEMTs  Fractals  Video transmission  device hypergraph  submodular function  dispersed devices  optimization  
I²Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 3565-3577
Authors: Tu, Yunbin; Li, Liang; Su, Li; Gao, Shengxiang; Yan, Chenggang; Zha, Zheng-Jun; Yu, Zhengtao; Huang, Qingming
Views/Downloads: 20/0 | Submitted: 2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer  
Long Short-Term Relation Transformer With Global Gating for Video Captioning (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 2726-2738
Authors: Li, Liang; Gao, Xingyu; Deng, Jincan; Tu, Yunbin; Zha, Zheng-Jun; Huang, Qingming
Views/Downloads: 23/0 | Submitted: 2022/12/07
Transformers  Cognition  Visualization  Feature extraction  Decoding  Task analysis  Semantics  Video captioning  relational reasoning  long short-term graph  transformer  
Adaptive Spatial Location With Balanced Loss for Video Captioning (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, Volume: 32, Issue: 1, Pages: 17-30
Authors: Li, Linghui; Zhang, Yongdong; Tang, Sheng; Xie, Lingxi; Li, Xiaoyong; Tian, Qi
Views/Downloads: 17/0 | Submitted: 2022/12/07
Task analysis  Redundancy  Feature extraction  Visualization  Detectors  Computer vision  Training  Convolutional neural network  recurrent neural network  video captioning  adaptive spatial location  balanced loss