CSpace

浏览/检索结果: 共17条,第1-10条 帮助

限定条件                
已选(0)清除 条数/页:   排序方式:
Self-Regulated Learning for Egocentric Video Activity Anticipation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 6, 页码: 6715-6730
作者:  Qi, Zhaobo;  Wang, Shuhui;  Su, Chi;  Su, Li;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:11/0  |  提交时间:2023/12/04
Predictive models  Dairy products  Semantics  Feature extraction  Visualization  Activity recognition  Task analysis  Egocentric video activity anticipaiton  third-person video activity anticipaiton  contrastive learning  multi-task learning  self-regulated learning  
An Efficient Deep Learning Accelerator Architecture for Compressed Video Analysis 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2022, 卷号: 41, 期号: 9, 页码: 2808-2820
作者:  Wang, Yongchen;  Wang, Ying;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:37/0  |  提交时间:2022/12/07
Streaming media  Neural networks  Image coding  Decoding  Metadata  Deep learning  Hardware  Neural network acceleration  specialized accelerator  video analysis  
SANet: Statistic Attention Network for Video-Based Person Re-Identification 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 6, 页码: 3866-3879
作者:  Bai, Shutao;  Ma, Bingpeng;  Chang, Hong;  Huang, Rui;  Shan, Shiguang;  Chen, Xilin
收藏  |  浏览/下载:29/0  |  提交时间:2022/12/07
Feature extraction  Task analysis  Computational modeling  Visualization  Video sequences  Fuses  Computer science  Person re-identification  self-attention  long-range dependencies  high-order statistics  
Astute Video Transmission for Geographically Dispersed Devices in Visual IoT Systems 期刊论文
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2022, 卷号: 21, 期号: 2, 页码: 448-464
作者:  Ji, Wen;  Duan, Lingyu;  Huang, Xi;  Chai, Yueting
收藏  |  浏览/下载:22/0  |  提交时间:2022/12/07
Task analysis  Streaming media  Performance evaluation  Delays  Visualization  DH-HEMTs  Fractals  Video transmission  device hypergraph  submodular function  dispersed devices  optimization  
I(2)Transformer: Intra- and Inter-Relation Embedding Transformer for TV Show Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 3565-3577
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Gao, Shengxiang;  Yan, Chenggang;  Zha, Zheng-Jun;  Yu, Zhengtao;  Huang, Qingming
收藏  |  浏览/下载:25/0  |  提交时间:2022/12/07
Transformers  Semantics  Task analysis  Visualization  TV  Electronic mail  Graph neural networks  TV Show captioning  video and subtitle  intra-relation embedding  inter-relation embedding  transformer  
Long Short-Term Relation Transformer With Global Gating for Video Captioning 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, 卷号: 31, 页码: 2726-2738
作者:  Li, Liang;  Gao, Xingyu;  Deng, Jincan;  Tu, Yunbin;  Zha, Zheng-Jun;  Huang, Qingming
收藏  |  浏览/下载:29/0  |  提交时间:2022/12/07
Transformers  Cognition  Visualization  Feature extraction  Decoding  Task analysis  Semantics  Video captioning  relational reasoning  long short-term graph  transformer  
Adaptive Spatial Location With Balanced Loss for Video Captioning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 1, 页码: 17-30
作者:  Li, Linghui;  Zhang, Yongdong;  Tang, Sheng;  Xie, Lingxi;  Li, Xiaoyong;  Tian, Qi
收藏  |  浏览/下载:21/0  |  提交时间:2022/12/07
Task analysis  Redundancy  Feature extraction  Visualization  Detectors  Computer vision  Training  Convolutional neural network  recurrent neural network  video captioning  adaptive spatial location  balanced loss  
A Decomposable Winograd Method for N-D Convolution Acceleration in Video Analysis 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2021, 页码: 21
作者:  Huang, Di;  Zhang, Rui;  Zhang, Xishan;  Wu, Fan;  Wang, Xianzhuo;  Jin, Pengwei;  Liu, Shaoli;  Li, Ling;  Chen, Yunji
收藏  |  浏览/下载:42/0  |  提交时间:2021/12/01
Convolution neural networks  Model acceleration  Winograd algorithm  Video analysis  
An Edge 3D CNN Accelerator for Low-Power Activity Recognition 期刊论文
IEEE TRANSACTIONS ON COMPUTER-AIDED DESIGN OF INTEGRATED CIRCUITS AND SYSTEMS, 2021, 卷号: 40, 期号: 5, 页码: 918-930
作者:  Wang, Ying;  Wang, Yongchen;  Shi, Cong;  Cheng, Long;  Li, Huawei;  Li, Xiaowei
收藏  |  浏览/下载:42/0  |  提交时间:2021/12/01
Three-dimensional displays  Two dimensional displays  Arrays  Feature extraction  System-on-chip  Redundancy  3D CNN  activity analysis  CNN accelerator  network-on-chip  video  
Bridging Text and Video: A Universal Multimodal Transformer for Audio-Visual Scene-Aware Dialog 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, 卷号: 29, 页码: 2476-2483
作者:  Li, Zekang;  Li, Zongjia;  Zhang, Jinchao;  Feng, Yang;  Zhou, Jie
收藏  |  浏览/下载:46/0  |  提交时间:2021/12/01
Task analysis  Feature extraction  Visualization  Speech processing  History  Social networking (online)  Pattern recognition  Dialogue System  Multimodal  Natural Language Processing  Video Understanding