CSpace

浏览/检索结果: 共578条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Context Disentangling and Prototype Inheriting for Robust Visual Grounding 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 5, 页码: 3213-3229
作者:  Tang, Wei;  Li, Liang;  Liu, Xuejing;  Jin, Lu;  Tang, Jinhui;  Li, Zechao
收藏  |  浏览/下载:9/0  |  提交时间:2024/05/20
Visualization  Grounding  Prototypes  Transformers  Task analysis  Linguistics  Feature extraction  Context disentangling  open-vocabulary scene  prototype discovering  robust grounding  visual grounding (VG)  
Cross Modal Compression With Variable Rate Prompt 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 3444-3456
作者:  Gao, Junlong;  Li, Jiguo;  Jia, Chuanmin;  Wang, Shanshe;  Ma, Siwei;  Gao, Wen
收藏  |  浏览/下载:8/0  |  提交时间:2024/05/20
Cross modal compression  semantic fidelity  variable rate prompt  
Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1522-1533
作者:  Wang, Yabing;  Wang, Shuhui;  Luo, Hao;  Dong, Jianfeng;  Wang, Fan;  Han, Meng;  Wang, Xun;  Wang, Meng
收藏  |  浏览/下载:6/0  |  提交时间:2024/05/20
Visualization  Noise measurement  Estimation  Costs  Transportation  Training  Task analysis  Cross-modal retrieval  noise correspondence learning  cross-lingual transfer  optimal transport  machine translation  
Event Graph Guided Compositional Spatial--Temporal Reasoning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1109-1121
作者:  Bai, Ziyi;  Wang, Ruiping;  Gao, Difei;  Chen, Xilin
收藏  |  浏览/下载:7/0  |  提交时间:2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Convolution-Enhanced Bi-Branch Adaptive Transformer With Cross-Task Interaction for Food Category and Ingredient Recognition 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 2572-2586
作者:  Liu, Yuxin;  Min, Weiqing;  Jiang, Shuqiang;  Rui, Yong
收藏  |  浏览/下载:6/0  |  提交时间:2024/05/20
Semantics  Visualization  Transformers  Task analysis  Feature extraction  Image recognition  Fish  Food recognition  ingredient recognition  food computing  fine-grained recognition  multi-label recognition  
Improving metric-based few-shot learning with dynamically scaled softmax loss 期刊论文
IMAGE AND VISION COMPUTING, 2023, 卷号: 140, 页码: 15
作者:  Zhang, Yu;  Zuo, Xin;  Zheng, Xuxu;  Gao, Xiaoyong;  Wang, Bo;  Hu, Weiming
收藏  |  浏览/下载:6/0  |  提交时间:2024/05/20
Few-shot learning  Metric-based learning framework  Softmax loss improvement  
ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models 期刊论文
ACM TRANSACTIONS ON GRAPHICS, 2023, 卷号: 42, 期号: 6, 页码: 14
作者:  Zhang, Yuxin;  Dong, Weiming;  Tang, Fan;  Huang, Nisha;  Huang, Haibin;  Ma, Chongyang;  Lee, Tong-Yee;  Deussen, Oliver;  Xu, Changsheng
收藏  |  浏览/下载:6/0  |  提交时间:2024/05/20
Image generation  Diffusion models  Attribute-aware editing  Model personalization  
Semantic and Correlation Disentangled Graph Convolutions for Multilabel Image Recognition 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Cai, Shaofei;  Li, Liang;  Han, Xinzhe;  Huang, Shan;  Tian, Qi;  Huang, Qingming
收藏  |  浏览/下载:8/0  |  提交时间:2024/05/20
Attention mechanism  feature disentangling  graph convolutional network (GCN)  multilabel recognition  
An automated optical inspection (AOI) platform for three-dimensional (3D) defects detection on glass micro-optical components (GMOC) 期刊论文
OPTICS COMMUNICATIONS, 2023, 卷号: 545, 页码: 7
作者:  Du, Yinchao;  Chen, Jiangpeng;  Zhou, Han;  Yang, Xiaoling;  Wang, Zhongqi;  Zhang, Jie;  Shi, Yuechun;  Chen, Xiangfei;  Zheng, Xuezhe
收藏  |  浏览/下载:14/0  |  提交时间:2023/12/04
Automated optical inspection  Glass micro -optical components  Defects detection  3D video acquisition  Machine-learning algorithm  
Rethinking Semantic Image Compression: Scalable Representation With Cross-Modality Transfer 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2023, 卷号: 33, 期号: 8, 页码: 4441-4445
作者:  Zhang, Pingping;  Wang, Shiqi;  Wang, Meng;  Li, Jiguo;  Wang, Xu;  Kwong, Sam
收藏  |  浏览/下载:12/0  |  提交时间:2023/12/04
Semantic image compression  cross-modality  scalable coding