CSpace

浏览/检索结果: 共58条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Modality-Consistent Prompt Tuning With Optimal Transport 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 卷号: 35, 期号: 3, 页码: 2499-2512
作者:  Ren, Hairui;  Tang, Fan;  Zheng, Huangjie;  Zhao, He;  Guo, Dandan;  Chang, Yi
收藏  |  浏览/下载:3/0  |  提交时间:2025/06/25
Prompt tuning  modality-consistent  optimal transport  distribution matching  Prompt tuning  modality-consistent  optimal transport  distribution matching  
Accelerate Point Cloud Structuring for Deep Neural Networks via Fast Spatial-Searching Tree 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 卷号: 35, 期号: 3, 页码: 2570-2585
作者:  Zhan, Jinyu;  Zou, Shiyu;  Jiang, Wei;  Zhang, Youyuan;  Peng, Suidi;  Wang, Ying
收藏  |  浏览/下载:3/0  |  提交时间:2025/06/25
Deep neural networks  point cloud structuring  fast spatial-searching tree  sampling  neighbor query  acceleration  Deep neural networks  point cloud structuring  fast spatial-searching tree  sampling  neighbor query  acceleration  
Prompting Video-Language Foundation Models With Domain-Specific Fine-Grained Heuristics for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 卷号: 35, 期号: 2, 页码: 1615-1630
作者:  Yu, Ting;  Fu, Kunhao;  Wang, Shuhui;  Huang, Qingming;  Yu, Jun
收藏  |  浏览/下载:4/0  |  提交时间:2025/06/25
Cognition  Computational modeling  Visualization  Context modeling  Data models  Adaptation models  Accuracy  Question answering (information retrieval)  Transformers  Feature extraction  Video question answering  discriminative unimodal comprehension  cross-modal interaction  domain-specific heuristics  video-language foundation models  entity-action relationships  context-aware reasoning  
Screen Content-Aware Video Coding Through Non-Local Model Embedded With Intra-Inter In-Loop Filtering 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, 卷号: 35, 期号: 2, 页码: 1870-1883
作者:  Li, Mingxuan;  Ji, Wen
收藏  |  浏览/下载:3/0  |  提交时间:2025/06/25
Encoding  Feature extraction  Computational modeling  Adaptation models  Visualization  Nonlinear distortion  Information filters  Deep learning  Standards  Image color analysis  Video coding  in-loop filtering  screen content  deep learning  high-efficiency video coding (HEVC)  
COMICS: End-to-End Bi-Grained Contrastive Learning for Multi-Face Forgery Detection 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 10, 页码: 10223-10236
作者:  Zhang, Cong;  Qi, Honggang;  Wang, Shuhui;  Li, Yuezun;  Lyu, Siwei
收藏  |  浏览/下载:15/0  |  提交时间:2024/12/06
Face recognition  Forgery  Feature extraction  Proposals  Object detection  Faces  Generators  DeepFake  multi-face forgery detection  contrastive learning  fine-grained feature learning  
Linguistic Hallucination for Text-Based Video Retrieval 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 10, 页码: 9692-9705
作者:  Fang, Sheng;  Dang, Tiantian;  Wang, Shuhui;  Huang, Qingming
收藏  |  浏览/下载:17/0  |  提交时间:2024/12/06
Linguistics  Training  Testing  Encoding  Context modeling  Feature extraction  Task analysis  Text-video retrieval  partially relevant video retrieval  linguistic hallucination  curriculum learning  
Mind the Gap: Open Set Domain Adaptation via Mutual-to-Separate Framework 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 6, 页码: 4159-4174
作者:  Chang, Dongliang;  Sain, Aneeshan;  Ma, Zhanyu;  Song, Yi-Zhe;  Wang, Ruiping;  Guo, Jun
收藏  |  浏览/下载:17/0  |  提交时间:2024/12/06
Picture archiving and communication systems  Training  Task analysis  Labeling  Adaptation models  Visualization  Information exchange  Domain adaptation  open set  mutual learning  transfer learning  
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 3, 页码: 1322-1338
作者:  Yu, Ting;  Lin, Xiaojun;  Wang, Shuhui;  Sheng, Weiguo;  Huang, Qingming;  Yu, Jun
收藏  |  浏览/下载:25/0  |  提交时间:2024/05/20
Three-dimensional displays  Task analysis  Visualization  Point cloud compression  Grounding  Surveys  Solid modeling  3D dense captioning  vision-language bridging  visual captioning  3D point cloud  
Semantic-Context Graph Network for Point-Based 3D Object Detection 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 11, 页码: 6474-6486
作者:  Dong, Shuwei;  Kong, Xiaoyu;  Pan, Xingjia;  Tang, Fan;  Li, Wei;  Chang, Yi;  Dong, Weiming
收藏  |  浏览/下载:24/0  |  提交时间:2024/05/20
3D object detection  graph neural networks  information entanglement  
Lightweight Multiattention Recursive Residual CNN-Based In-Loop Filter Driven by Neuron Diversity 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 11, 页码: 6996-7008
作者:  Li, Mingxuan;  Ji, Wen
收藏  |  浏览/下载:24/0  |  提交时间:2024/05/20
Video coding  in-loop filtering  convolutional neural network  deep learning  high-efficiency video coding (HEVC)