CSpace

Browse/search results: 29 records in total, showing records 1-10

Patching the visual ability of large multimodal models by collaborating with small models [Journal article]
FRONTIERS OF COMPUTER SCIENCE, 2026, Volume: 20, Issue: 9, Pages: 17
Authors: Liang, Hao; Zhang, Xiaolong; Kan, Meina; Shan, Shiguang; Chen, Xilin
Submitted: 2026/05/25
model collaboration  patching visual ability  large multimodal models  
LLM-led vision-spectral fusion: A zero-shot approach to temporal fruit image classification [Journal article]
NEURAL NETWORKS, 2026, Volume: 194, Pages: 10
Authors: Wu, Huyu; Jia, Bowen; Yuan, Xue-Ming
Submitted: 2025/12/03
Temporally relevant images  Multimodal classification  Large language models  Zero-shot segmentation  Vision-spectral fusion  
Entropy-regulated cross-modal generative fusion for multimodal network intrusion detection [Journal article]
INFORMATION FUSION, 2026, Volume: 126, Pages: 16
Authors: Wang, Xiangbin; Yuan, Qingjun; Yu, Wentao; Meng, Qianwei; Lu, Siqi; He, Wenqi; Gu, Chunxiang; Wang, Yongjuan
Submitted: 2025/12/03
Generative artificial intelligence  Intrusion detection system  Multimodal fusion  Diffusion model  Cross-modal representation  Differential entropy  
Pathway-Aware Multimodal Transformer (PAMT): Integrating Pathological Image and Gene Expression for Interpretable Cancer Survival Analysis [Journal article]
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2026, Volume: 48, Issue: 1, Pages: 896-913
Authors: Yan, Rui; Zhang, Xueyuan; Jiang, Zihang; Wang, Baizhi; Bian, Xiuwu; Ren, Fei; Zhou, S. Kevin
Submitted: 2026/05/25
Pathology  Cancer  Transformers  Feature extraction  Data models  Biological system modeling  Analytical models  Deep learning  Semantics  Multimodal transformer  model interpretability  survival analysis  pathological image analysis  gene expression
VPA: Multi-Modal Virtual Point Augmentation for 3D Object Detection [Journal article]
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2025, Volume: 35, Issue: 12, Pages: 12410-12425
Authors: Zhong, Jianping; Qi, Zhaobo; Duan, Kaiwen; Xu, Yuanrong; Zhang, Weigang; Huang, Qingming
Submitted: 2026/05/25
Three-dimensional displays  Point cloud compression  Object detection  Semantics  Laser radar  Feature extraction  Detectors  Accuracy  Bicycles  Solids  3D object detection  multimodal fusion  virtual point augmenting  
CL-DGCN: contrastive learning based deeper graph convolutional network for traffic flow data prediction [Journal article]
TRANSPORTATION RESEARCH PART E-LOGISTICS AND TRANSPORTATION REVIEW, 2025, Volume: 203, Pages: 18
Authors: Zhang, Enwei; Lv, Zhiqiang; Cheng, Zesheng; Ke, Jintao
Submitted: 2025/12/03
Multimodal transportation  Traffic flow prediction  Graph convolutional network  Hyperaggregation function  
Improving multimodal named entity recognition via text-image relevance prediction with large language models [Journal article]
NEUROCOMPUTING, 2025, Volume: 651, Pages: 10
Authors: Zeng, Qingyang; Yuan, Minghui; Su, Yueyang; Mi, Jia; Che, Qianzi; Wan, Jing
Submitted: 2025/12/03
Multimodal named entity recognition  Multimodal learning  Large language model  Contrastive learning  Social media  
Consistent multimodal pre-training for visual tokenization [Journal article]
SCIENCE CHINA-INFORMATION SCIENCES, 2025, Volume: 68, Issue: 10, Pages: 15
Authors: Pan, Ting; Tang, Lulu; Wang, Xinlong; Liu, Xin; Shan, Shiguang
Submitted: 2025/12/03
foundation model  multimodal  representation learning  visual tokenization  
Multimodal Food Learning [Journal article]
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2025, Volume: 21, Issue: 7, Pages: 28
Authors: Min, Weiqing; Hong, Xingjian; Liu, Yuxin; Huang, Mingyu; Jin, Ying; Zhou, Pengfei; Xu, Leyi; Wang, Yilin; Jiang, Shuqiang; Rui, Yong
Submitted: 2025/12/03
Multimodal Food Learning  Cross-modal Retrieval  Cross-modal Generation  Food Recognition  
Text-guided multimodal depression detection via cross-modal feature reconstruction and decomposition [Journal article]
INFORMATION FUSION, 2025, Volume: 117, Pages: 10
Authors: Chen, Ziqiang; Wang, Dandan; Lou, Liangliang; Zhang, Shiqing; Zhao, Xiaoming; Jiang, Shuqiang; Yu, Jun; Xiao, Jun
Submitted: 2025/06/25
Depression detection  Cross-modal feature reconstruction  Feature decomposition  Multimodal fusion