CSpace

浏览/检索结果: 共25条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Context Disentangling and Prototype Inheriting for Robust Visual Grounding 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 5, 页码: 3213-3229
作者:  Tang, Wei;  Li, Liang;  Liu, Xuejing;  Jin, Lu;  Tang, Jinhui;  Li, Zechao
收藏  |  浏览/下载:3/0  |  提交时间:2024/05/20
Visualization  Grounding  Prototypes  Transformers  Task analysis  Linguistics  Feature extraction  Context disentangling  open-vocabulary scene  prototype discovering  robust grounding  visual grounding (VG)  
Synthesizing Knowledge-Enhanced Features for Real-World Zero-Shot Food Detection 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1285-1298
作者:  Zhou, Pengfei;  Min, Weiqing;  Song, Jiajun;  Zhang, Yang;  Jiang, Shuqiang
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Semantics  Feature extraction  Visualization  Annotations  Correlation  Training  Task analysis  Food detection  zero-shot detection  food computing  object detection  zero-shot learning  
Event Graph Guided Compositional Spatial--Temporal Reasoning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1109-1121
作者:  Bai, Ziyi;  Wang, Ruiping;  Gao, Difei;  Chen, Xilin
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Convolution-Enhanced Bi-Branch Adaptive Transformer With Cross-Task Interaction for Food Category and Ingredient Recognition 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 2572-2586
作者:  Liu, Yuxin;  Min, Weiqing;  Jiang, Shuqiang;  Rui, Yong
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Semantics  Visualization  Transformers  Task analysis  Feature extraction  Image recognition  Fish  Food recognition  ingredient recognition  food computing  fine-grained recognition  multi-label recognition  
Self-Regulated Learning for Egocentric Video Activity Anticipation 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 6, 页码: 6715-6730
作者:  Qi, Zhaobo;  Wang, Shuhui;  Su, Chi;  Su, Li;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:7/0  |  提交时间:2023/12/04
Predictive models  Dairy products  Semantics  Feature extraction  Visualization  Activity recognition  Task analysis  Egocentric video activity anticipaiton  third-person video activity anticipaiton  contrastive learning  multi-task learning  self-regulated learning  
TransWeaver: Weave Image Pairs for Class Agnostic Common Object Detection 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 2947-2959
作者:  Guo, Xiaoqian;  Li, Xiangyang;  Wang, Yaowei;  Jiang, Shuqiang
收藏  |  浏览/下载:8/0  |  提交时间:2023/12/04
Proposals  Object detection  Task analysis  Feature extraction  Visualization  Training  Measurement  Common object detection  transweaver  transformer  
Focus and Align: Learning Tube Tokens for Video-Language Pre-Training 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, 卷号: 25, 页码: 8036-8050
作者:  Zhu, Yongqing;  Li, Xiangyang;  Zheng, Mao;  Yang, Jiahao;  Wang, Zihan;  Guo, Xiaoqian;  Chai, Zifeng;  Yuan, Yuchen;  Jiang, Shuqiang
收藏  |  浏览/下载:2/0  |  提交时间:2024/05/20
Electron tubes  Semantics  Visualization  Feature extraction  Task analysis  Transformers  Detectors  Local alignment mechanism  semantic centers  tube tokens  video-language pre-training  
A Comprehensive Framework for Long-Tailed Learning via Pretraining and Normalization 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, 页码: 13
作者:  Kang, Nan;  Chang, Hong;  Ma, Bingpeng;  Shan, Shiguang
收藏  |  浏览/下载:22/0  |  提交时间:2022/12/07
Training  Tail  Task analysis  Head  Visualization  Feature extraction  Data models  Classifier design  contrastive learning  long-tailed recognition  normalization  
SANet: Statistic Attention Network for Video-Based Person Re-Identification 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 6, 页码: 3866-3879
作者:  Bai, Shutao;  Ma, Bingpeng;  Chang, Hong;  Huang, Rui;  Shan, Shiguang;  Chen, Xilin
收藏  |  浏览/下载:25/0  |  提交时间:2022/12/07
Feature extraction  Task analysis  Computational modeling  Visualization  Video sequences  Fuses  Computer science  Person re-identification  self-attention  long-range dependencies  high-order statistics  
Syntax-Guided Hierarchical Attention Network for Video Captioning 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, 卷号: 32, 期号: 2, 页码: 880-892
作者:  Deng, Jincan;  Li, Liang;  Zhang, Beichen;  Wang, Shuhui;  Zha, Zhengjun;  Huang, Qingming
收藏  |  浏览/下载:19/0  |  提交时间:2022/12/07
Syntactics  Feature extraction  Visualization  Generators  Semantics  Two dimensional displays  Three-dimensional displays  Video captioning  syntax attention  content attention  global sentence-context