CSpace

浏览/检索结果: 共2条,第1-2条 帮助

已选(0)清除 条数/页:   排序方式:
Patching the visual ability of large multimodal models by collaborating with small models 期刊论文
FRONTIERS OF COMPUTER SCIENCE, 2026, 卷号: 20, 期号: 9, 页码: 17
作者:  Liang, Hao;  Zhang, Xiaolong;  Kan, Meina;  Shan, Shiguang;  Chen, Xilin
收藏  |  浏览/下载:1/0  |  提交时间:2026/05/25
model collaboration  patching visual ability  large multimodal models  
LLM-led vision-spectral fusion: A zero-shot approach to temporal fruit image classification 期刊论文
NEURAL NETWORKS, 2026, 卷号: 194, 页码: 10
作者:  Wu, Huyu;  Jia, Bowen;  Yuan, Xue-Ming
收藏  |  浏览/下载:24/0  |  提交时间:2025/12/03
Temporally relevant images  Multimodal classification  Large language models  Zero-shot segmentation  Vision-spectral fusion