Institutional Repository of the Institute of Computing Technology, Chinese Academy of Sciences
Browse/search results: 13 in total, showing 1-10
LLM-led vision-spectral fusion: A zero-shot approach to temporal fruit image classification
Journal article
NEURAL NETWORKS, 2026, Volume: 194, Pages: 10
Authors: Wu, Huyu; Jia, Bowen; Yuan, Xue-Ming
Views/Downloads: 6/0 | Submitted: 2025/12/03
Keywords: Temporally relevant images; Multimodal classification; Large language models; Zero-shot segmentation; Vision-spectral fusion
Sycophancy in vision-language models: A systematic analysis and an inference-time mitigation framework
Journal article
NEUROCOMPUTING, 2026, Volume: 659, Pages: 14
Authors: Zhao, Yunpu; Zhang, Rui; Xiao, Junbin; Ke, Changxin; Hou, Ruibo; Hao, Yifan; Li, Ling
Views/Downloads: 4/0 | Submitted: 2025/12/03
Keywords: Vision-language models; Contrastive decoding; Model hallucinations
Enhanced Dual-Pattern Matching With Vision-Language Representation for Out-of-Distribution Detection
Journal article
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2025, Volume: 47, Issue: 11, Pages: 9673-9687
Authors: Xiang, Xiang; Xu, Zhuo; Zhang, Zihan; Zeng, Zhigang; Chen, Xilin
Views/Downloads: 4/0 | Submitted: 2025/12/03
Keywords: Visualization; Adaptation models; Training; Data models; Computational modeling; Feature extraction; Pattern matching; Tuning; Robustness; Data mining; OOD detection; vision-language models
FullLoRA: Efficiently Boosting the Robustness of Pretrained Vision Transformers
Journal article
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2025, Volume: 34, Pages: 4580-4590
Authors: Yuan, Zheng; Zhang, Jie; Shan, Shiguang; Chen, Xilin
Views/Downloads: 4/0 | Submitted: 2025/12/03
Keywords: Training; Computational modeling; Robustness; Adaptation models; Computer vision; Transformers; Visualization; Natural language processing; Image classification; Head; Adversarial training; parameter-efficient; pretrained model
Dual-Alignment CLIP: Task-Specific Alignment of Text and Visual Features for Few-Shot Remote Sensing Scene Classification
Journal article
IEEE JOURNAL OF SELECTED TOPICS IN APPLIED EARTH OBSERVATIONS AND REMOTE SENSING, 2025, Volume: 18, Pages: 19260-19272
Authors: Deng, Dongmei; Yao, Ping
Views/Downloads: 4/0 | Submitted: 2025/12/03
Keywords: Remote sensing; Scene classification; Visualization; Training; Manuals; Few shot learning; Feature extraction; Adaptation models; Training data; Streaming media; Contrastive vision-language pretraining (CLIP); few-shot learning (FSL); image classification; remote sensing
Enhancing the Robustness of Vision-Language Foundation Models by Alignment Perturbation
Journal article
IEEE TRANSACTIONS ON INFORMATION FORENSICS AND SECURITY, 2025, Volume: 20, Pages: 7091-7105
Authors: Zhang, Cong; Wang, Shuhui; Li, Xiaodan; Zhu, Yao; Qi, Honggang; Huang, Qingming
Views/Downloads: 4/0 | Submitted: 2025/12/03
Keywords: Multimedia forensics; adversarial perturbation; robust training; vision-language models
DomainVerse: A Benchmark Towards Real-World Distribution Shifts for Training-Free Adaptive Domain Generalization
Journal article
IEEE TRANSACTIONS ON MULTIMEDIA, 2025, Volume: 27, Pages: 6648-6660
Authors: Hou, Feng; Yuan, Jin; Yang, Ying; Zhang, Yao; Liu, Yang; Zhang, Yang; Zhong, Cheng; Shi, Zhongchao; Fan, Jianping; He, Zhiqiang; Rui, Yong
Views/Downloads: 4/0 | Submitted: 2025/12/03
Keywords: Adaptation models; Training; Benchmark testing; Picture archiving and communication systems; Data models; Image color analysis; Computational modeling; Data mining; Training data; Painting; DomainVerse; training-free adaptive domain generalization; vision-language models
Boost Tracking by Natural Language With Prompt-Guided Grounding
Journal article
IEEE TRANSACTIONS ON INTELLIGENT TRANSPORTATION SYSTEMS, 2024, Pages: 13
Authors: Li, Hengyou; Liu, Xinyan; Li, Guorong; Wang, Shuhui; Qing, Laiyun; Huang, Qingming
Views/Downloads: 19/0 | Submitted: 2025/06/25
Keywords: Target tracking; Grounding; Switches; Visualization; Feature extraction; Computational modeling; Adaptation models; Location awareness; Linguistics; Memory management; Vision-language tracking; prompt learning; inverse tracking
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes
Journal article
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, Volume: 34, Issue: 3, Pages: 1322-1338
Authors: Yu, Ting; Lin, Xiaojun; Wang, Shuhui; Sheng, Weiguo; Huang, Qingming; Yu, Jun
Views/Downloads: 49/0 | Submitted: 2024/05/20
Keywords: Three-dimensional displays; Task analysis; Visualization; Point cloud compression; Grounding; Surveys; Solid modeling; 3D dense captioning; vision-language bridging; visual captioning; 3D point cloud
avtmNet: Adaptive Visual-Text Merging Network for Image Captioning
Journal article
COMPUTERS & ELECTRICAL ENGINEERING, 2020, Volume: 84, Pages: 12
Authors: Song, Heng; Zhu, Junwu; Jiang, Yi
Views/Downloads: 72/0 | Submitted: 2020/12/10
Keywords: Image captioning; Computer Vision; Natural Language Processing; Attention Mechanism; Neural networks