CSpace

浏览/检索结果: 共13条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
LLM-led vision-spectral fusion: A zero-shot approach to temporal fruit image classification 期刊论文
NEURAL NETWORKS, 2026, 卷号: 194, 页码: 10
作者:  Wu, Huyu;  Jia, Bowen;  Yuan, Xue-Ming
收藏  |  浏览/下载:6/0  |  提交时间:2025/12/03
Temporally relevant images  Multimodal classification  Large language models  Zero-shot segmentation  Vision-spectral fusion  
Sycophancy in vision-language models: A systematic analysis and an inference-time mitigation framework 期刊论文
NEUROCOMPUTING, 2026, 卷号: 659, 页码: 14
作者:  Zhao, Yunpu;  Zhang, Rui;  Xiao, Junbin;  Ke, Changxin;  Hou, Ruibo;  Hao, Yifan;  Li, Ling
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Vision-language models  Contrastive decoding  Model hallucinations  
Enhanced Dual-Pattern Matching With Vision-Language Representation for Out-of-Distribution Detection 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, 卷号: 47, 期号: 11, 页码: 9673-9687
作者:  Xiang, Xiang;  Xu, Zhuo;  Zhang, Zihan;  Zeng, Zhigang;  Chen, Xilin
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Visualization  Adaptation models  Training  Data models  Computational modeling  Feature extraction  Pattern matching  Tuning  Robustness  Data mining  OOD detection  vision-language models  
FullLoRA: Efficiently Boosting the Robustness of Pretrained Vision Transformers 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, 卷号: 34, 页码: 4580-4590
作者:  Yuan, Zheng;  Zhang, Jie;  Shan, Shiguang;  Chen, Xilin
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Training  Computational modeling  Robustness  Adaptation models  Computer vision  Transformers  Visualization  Natural language processing  Image classification  Head  Adversarial training  parameter-efficient  pretrained model  
Dual-Alignment CLIP: Task-Specific Alignment of Text and Visual Features for Few-Shot Remote Sensing Scene Classification 期刊论文
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, 卷号: 18, 页码: 19260-19272
作者:  Deng, Dongmei;  Yao, Ping
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Remote sensing  Scene classification  Visualization  Training  Manuals  Few shot learning  Feature extraction  Adaptation models  Training data  Streaming media  Contrastive vision-language pretraining (CLIP)  few-shot learning (FSL)  image classification  remote sensing  
Enhancing the Robustness of Vision-Language Foundation Models by Alignment Perturbation 期刊论文
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, 卷号: 20, 页码: 7091-7105
作者:  Zhang, Cong;  Wang, Shuhui;  Li, Xiaodan;  Zhu, Yao;  Qi, Honggang;  Huang, Qingming
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Multimedia forensics  adversarial perturbation  robust training  robust training  vision-language models  vision-language models  vision-language models  
DomainVerse: A Benchmark Towards Real-World Distribution Shifts for Training-Free Adaptive Domain Generalization 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, 卷号: 27, 页码: 6648-6660
作者:  Hou, Feng;  Yuan, Jin;  Yang, Ying;  Zhang, Yao;  Liu, Yang;  Zhang, Yang;  Zhong, Cheng;  Shi, Zhongchao;  Fan, Jianping;  He, Zhiqiang;  Rui, Yong
收藏  |  浏览/下载:4/0  |  提交时间:2025/12/03
Adaptation models  Training  Benchmark testing  Picture archiving and communication systems  Data models  Image color analysis  Computational modeling  Data mining  Training data  Painting  DomainVerse  training-free adaptive domain generalization  vision-language models  
Boost Tracking by Natural Language With Prompt-Guided Grounding 期刊论文
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, 页码: 13
作者:  Li, Hengyou;  Liu, Xinyan;  Li, Guorong;  Wang, Shuhui;  Qing, Laiyun;  Huang, Qingming
收藏  |  浏览/下载:19/0  |  提交时间:2025/06/25
Target tracking  Grounding  Switches  Visualization  Feature extraction  Computational modeling  Adaptation models  Location awareness  Linguistics  Memory management  Vision-language tracking  prompt learning  inverse tracking  
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 3, 页码: 1322-1338
作者:  Yu, Ting;  Lin, Xiaojun;  Wang, Shuhui;  Sheng, Weiguo;  Huang, Qingming;  Yu, Jun
收藏  |  浏览/下载:49/0  |  提交时间:2024/05/20
Three-dimensional displays  Task analysis  Visualization  Point cloud compression  Grounding  Surveys  Solid modeling  3D dense captioning  vision-language bridging  visual captioning  3D point cloud  
avtmNet:Adaptive Visual-Text Merging Network for Image Captioning 期刊论文
COMPUTERS & ELECTRICAL ENGINEERING, 2020, 卷号: 84, 页码: 12
作者:  Song, Heng;  Zhu, Junwu;  Jiang, Yi
收藏  |  浏览/下载:72/0  |  提交时间:2020/12/10
Image captioning  Computer Vision  Natural Language Processing  Attention Mechanism  Neural networks