CSpace

Browse/Search results: 172 items in total, items 1-10

Context Disentangling and Prototype Inheriting for Robust Visual Grounding [Journal article]
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, Volume: 46, Issue: 5, Pages: 3213-3229
Authors: Tang, Wei; Li, Liang; Liu, Xuejing; Jin, Lu; Tang, Jinhui; Li, Zechao
Views/Downloads: 3/0  |  Submitted: 2024/05/20
Visualization  Grounding  Prototypes  Transformers  Task analysis  Linguistics  Feature extraction  Context disentangling  open-vocabulary scene  prototype discovering  robust grounding  visual grounding (VG)  
Synthesizing Knowledge-Enhanced Features for Real-World Zero-Shot Food Detection [Journal article]
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1285-1298
Authors: Zhou, Pengfei; Min, Weiqing; Song, Jiajun; Zhang, Yang; Jiang, Shuqiang
Views/Downloads: 2/0  |  Submitted: 2024/05/20
Semantics  Feature extraction  Visualization  Annotations  Correlation  Training  Task analysis  Food detection  zero-shot detection  food computing  object detection  zero-shot learning  
Event Graph Guided Compositional Spatial-Temporal Reasoning for Video Question Answering [Journal article]
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1109-1121
Authors: Bai, Ziyi; Wang, Ruiping; Gao, Difei; Chen, Xilin
Views/Downloads: 2/0  |  Submitted: 2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Convolution-Enhanced Bi-Branch Adaptive Transformer With Cross-Task Interaction for Food Category and Ingredient Recognition [Journal article]
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 2572-2586
Authors: Liu, Yuxin; Min, Weiqing; Jiang, Shuqiang; Rui, Yong
Views/Downloads: 2/0  |  Submitted: 2024/05/20
Semantics  Visualization  Transformers  Task analysis  Feature extraction  Image recognition  Fish  Food recognition  ingredient recognition  food computing  fine-grained recognition  multi-label recognition  
Semantic and Correlation Disentangled Graph Convolutions for Multilabel Image Recognition [Journal article]
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 13
Authors: Cai, Shaofei; Li, Liang; Han, Xinzhe; Huang, Shan; Tian, Qi; Huang, Qingming
Views/Downloads: 2/0  |  Submitted: 2024/05/20
Attention mechanism  feature disentangling  graph convolutional network (GCN)  multilabel recognition  
Large Scale Visual Food Recognition [Journal article]
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 8, Pages: 9932-9949
Authors: Min, Weiqing; Wang, Zhiling; Liu, Yuxin; Luo, Mengjiang; Kang, Liping; Wei, Xiaoming; Wei, Xiaolin; Jiang, Shuqiang
Views/Downloads: 7/0  |  Submitted: 2023/12/04
Image recognition  Visualization  Task analysis  Benchmark testing  Representation learning  Training  Semantics  Food dataset  food recognition  large-scale datasets  fine-grained recognition  
Reference-Based Deep Line Art Video Colorization [Journal article]
IEEE TRANSACTIONS ON VISUALIZATION AND COMPUTER GRAPHICS, 2023, Volume: 29, Issue: 6, Pages: 2965-2979
Authors: Shi, Min; Zhang, Jia-Qi; Chen, Shu-Yu; Gao, Lin; Lai, Yu-Kun; Zhang, Fang-Lue
Views/Downloads: 7/0  |  Submitted: 2023/12/04
Image color analysis  Art  Animation  Feature extraction  Three-dimensional displays  Transforms  Color  Line art colorization  color transform  temporal coherence  few shot learning  
Self-Regulated Learning for Egocentric Video Activity Anticipation [Journal article]
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 6, Pages: 6715-6730
Authors: Qi, Zhaobo; Wang, Shuhui; Su, Chi; Su, Li; Huang, Qingming; Tian, Qi
Views/Downloads: 7/0  |  Submitted: 2023/12/04
Predictive models  Dairy products  Semantics  Feature extraction  Visualization  Activity recognition  Task analysis  Egocentric video activity anticipation  third-person video activity anticipation  contrastive learning  multi-task learning  self-regulated learning  
TransWeaver: Weave Image Pairs for Class Agnostic Common Object Detection [Journal article]
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 2947-2959
Authors: Guo, Xiaoqian; Li, Xiangyang; Wang, Yaowei; Jiang, Shuqiang
Views/Downloads: 8/0  |  Submitted: 2023/12/04
Proposals  Object detection  Task analysis  Feature extraction  Visualization  Training  Measurement  Common object detection  transweaver  transformer  
Focus and Align: Learning Tube Tokens for Video-Language Pre-Training [Journal article]
IEEE TRANSACTIONS ON MULTIMEDIA, 2023, Volume: 25, Pages: 8036-8050
Authors: Zhu, Yongqing; Li, Xiangyang; Zheng, Mao; Yang, Jiahao; Wang, Zihan; Guo, Xiaoqian; Chai, Zifeng; Yuan, Yuchen; Jiang, Shuqiang
Views/Downloads: 2/0  |  Submitted: 2024/05/20
Electron tubes  Semantics  Visualization  Feature extraction  Task analysis  Transformers  Detectors  Local alignment mechanism  semantic centers  tube tokens  video-language pre-training