CSpace

Browse/Search Results: 11 items in total, showing items 1-10

Prompting Video-Language Foundation Models With Domain-Specific Fine-Grained Heuristics for Video Question Answering (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, Volume: 35, Issue: 2, Pages: 1615-1630
Authors: Yu, Ting; Fu, Kunhao; Wang, Shuhui; Huang, Qingming; Yu, Jun
Views/Downloads: 12/0 | Submitted: 2025/06/25
Cognition  Computational modeling  Visualization  Context modeling  Data models  Adaptation models  Accuracy  Question answering (information retrieval)  Transformers  Feature extraction  Video question answering  discriminative unimodal comprehension  cross-modal interaction  domain-specific heuristics  video-language foundation models  entity-action relationships  context-aware reasoning  
Screen Content-Aware Video Coding Through Non-Local Model Embedded With Intra-Inter In-Loop Filtering (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, Volume: 35, Issue: 2, Pages: 1870-1883
Authors: Li, Mingxuan; Ji, Wen
Views/Downloads: 4/0 | Submitted: 2025/06/25
Encoding  Feature extraction  Computational modeling  Adaptation models  Visualization  Nonlinear distortion  Information filters  Deep learning  Standards  Image color analysis  Video coding  in-loop filtering  screen content  deep learning  high-efficiency video coding (HEVC)  
ParaLoupe: Real-Time Video Analytics on Edge Cluster via Mini Model Parallelization (Journal article)
IEEE TRANSACTIONS ON MOBILE COMPUTING, 2024, Volume: 23, Issue: 12, Pages: 13945-13962
Authors: Wang, Hanling; Li, Qing; Kang, Haidong; Hu, Dieli; Ma, Lianbo; Tyson, Gareth; Yuan, Zhenhui; Jiang, Yong
Views/Downloads: 14/0 | Submitted: 2025/06/25
Accuracy  Task analysis  Computational modeling  Image edge detection  Visual analytics  Streaming media  Real-time systems  Distributed computing  edge computing  real-time video analytics  
Linguistic Hallucination for Text-Based Video Retrieval (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 10, Pages: 9692-9705
Authors: Fang, Sheng; Dang, Tiantian; Wang, Shuhui; Huang, Qingming
Views/Downloads: 24/0 | Submitted: 2024/12/06
Linguistics  Training  Testing  Encoding  Context modeling  Feature extraction  Task analysis  Text-video retrieval  partially relevant video retrieval  linguistic hallucination  curriculum learning  
Event Graph Guided Compositional Spatial-Temporal Reasoning for Video Question Answering (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1109-1121
Authors: Bai, Ziyi; Wang, Ruiping; Gao, Difei; Chen, Xilin
Views/Downloads: 32/0 | Submitted: 2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Spatial-Temporal Graph Network for Video Crowd Counting (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, Volume: 33, Issue: 1, Pages: 228-241
Authors: Wu, Zhe; Zhang, Xinfeng; Tian, Geng; Wang, Yaowei; Huang, Qingming
Views/Downloads: 43/0 | Submitted: 2023/07/12
Computational modeling  Predictive models  Analytical models  Long short term memory  Optical flow  Integrated circuit modeling  Head  Video-based crowd counting  spatiotemporal graph attention  multi-scale module  
SANet: Statistic Attention Network for Video-Based Person Re-Identification (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, Volume: 32, Issue: 6, Pages: 3866-3879
Authors: Bai, Shutao; Ma, Bingpeng; Chang, Hong; Huang, Rui; Shan, Shiguang; Chen, Xilin
Views/Downloads: 52/0 | Submitted: 2022/12/07
Feature extraction  Task analysis  Computational modeling  Visualization  Video sequences  Fuses  Computer science  Person re-identification  self-attention  long-range dependencies  high-order statistics  
Motion Feature Aggregation for Video-Based Person Re-Identification (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 3908-3919
Authors: Gu, Xinqian; Chang, Hong; Ma, Bingpeng; Shan, Shiguang
Views/Downloads: 43/0 | Submitted: 2022/12/07
Feature extraction  Optical imaging  Computational modeling  Spatiotemporal phenomena  Data mining  Training  Tracking  Video-based person re-identification  temporal information modeling  motion feature extraction  
Hybrid-Attention Enhanced Two-Stream Fusion Network for Video Venue Prediction (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2021, Volume: 23, Pages: 2917-2929
Authors: Zhang, Yanchao; Min, Weiqing; Nie, Liqiang; Jiang, Shuqiang
Views/Downloads: 66/0 | Submitted: 2021/12/01
Visualization  Feature extraction  Convolution  Streaming media  Object oriented modeling  Three-dimensional displays  Neural networks  Feature extraction  knowledge representation  supervised learning  video signal processing  
Video modeling and learning on Riemannian manifold for emotion recognition in the wild (Journal article)
JOURNAL ON MULTIMODAL USER INTERFACES, 2016, Volume: 10, Issue: 2, Pages: 113-124
Authors: Liu, Mengyi; Wang, Ruiping; Li, Shaoxin; Huang, Zhiwu; Shan, Shiguang; Chen, Xilin
Views/Downloads: 57/0 | Submitted: 2019/12/13
Emotion recognition  Video modeling  Riemannian manifold  EmotiW challenge