CSpace

浏览/检索结果: 共248条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
Context Disentangling and Prototype Inheriting for Robust Visual Grounding 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 5, 页码: 3213-3229
作者:  Tang, Wei;  Li, Liang;  Liu, Xuejing;  Jin, Lu;  Tang, Jinhui;  Li, Zechao
收藏  |  浏览/下载:9/0  |  提交时间:2024/05/20
Visualization  Grounding  Prototypes  Transformers  Task analysis  Linguistics  Feature extraction  Context disentangling  open-vocabulary scene  prototype discovering  robust grounding  visual grounding (VG)  
Hierarchical image-to-image translation with nested distributions modeling 期刊论文
PATTERN RECOGNITION, 2024, 卷号: 146, 页码: 12
作者:  Qiao, Shishi;  Wang, Ruiping;  Shan, Shiguang;  Chen, Xilin
收藏  |  浏览/下载:6/0  |  提交时间:2024/05/20
Image-to-image translation  Distribution modeling  Information entropy  Generative adversarial network  
Panoptic Segmentation with Convex Object Representation 期刊论文
COMPUTER JOURNAL, 2023, 页码: 11
作者:  Yao, Zhicheng;  Wang, Sa;  Zhu, Jinbin;  Bao, Yungang
收藏  |  浏览/下载:6/0  |  提交时间:2024/05/20
deep learning  computer vision  image segmentation  panoptic segmentation  instance representation  
ProSpect: Prompt Spectrum for Attribute-Aware Personalization of Diffusion Models 期刊论文
ACM TRANSACTIONS ON GRAPHICS, 2023, 卷号: 42, 期号: 6, 页码: 14
作者:  Zhang, Yuxin;  Dong, Weiming;  Tang, Fan;  Huang, Nisha;  Huang, Haibin;  Ma, Chongyang;  Lee, Tong-Yee;  Deussen, Oliver;  Xu, Changsheng
收藏  |  浏览/下载:6/0  |  提交时间:2024/05/20
Image generation  Diffusion models  Attribute-aware editing  Model personalization  
A Unified Arbitrary Style Transfer Framework via Adaptive Contrastive Learning 期刊论文
ACM TRANSACTIONS ON GRAPHICS, 2023, 卷号: 42, 期号: 5, 页码: 16
作者:  Zhang, Yuxin;  Tang, Fan;  Dong, Weiming;  Huang, Haibin;  Ma, Chongyang;  Lee, Tong-Yee;  Xu, Changsheng
收藏  |  浏览/下载:11/0  |  提交时间:2023/12/04
Arbitrary style transfer  contrastive learning  style encoding  
General Greedy De-Bias Learning 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, 卷号: 45, 期号: 8, 页码: 9789-9805
作者:  Han, Xinzhe;  Wang, Shuhui;  Su, Chi;  Huang, Qingming;  Tian, Qi
收藏  |  浏览/下载:12/0  |  提交时间:2023/12/04
Task analysis  Correlation  Training  Data models  Question answering (information retrieval)  Visualization  Image classification  Curriculum learning  dataset biases  greedy strategy  robust learning  
Importance First: Generating Scene Graph of Human Interest 期刊论文
INTERNATIONAL JOURNAL OF COMPUTER VISION, 2023, 页码: 27
作者:  Wang, Wenbin;  Wang, Ruiping;  Shan, Shiguang;  Chen, Xilin
收藏  |  浏览/下载:11/0  |  提交时间:2023/12/04
Key relationship  Hierarchical entity tree  Hierarchical contextual propagation  Relationship ranking  Spatial scale  Visual saliency  
Radiology report generation with a learned knowledge base and multi-modal alignment 期刊论文
MEDICAL IMAGE ANALYSIS, 2023, 卷号: 86, 页码: 10
作者:  Yang, Shuxin;  Wu, Xian;  Ge, Shen;  Zheng, Zhuozhao;  Zhou, S. Kevin;  Xiao, Li
收藏  |  浏览/下载:11/0  |  提交时间:2023/12/04
Radiology report generation  Knowledge base  Multi-modal alignment  
Free-Viewpoint Navigation of Indoor Scene with 360ffi Field of View 期刊论文
ELECTRONICS, 2023, 卷号: 12, 期号: 8, 页码: 16
作者:  Xu, Hang;  Zhao, Qiang;  Ma, Yike;  Wang, Shuai;  Yan, Chenggang;  Dai, Feng
收藏  |  浏览/下载:11/0  |  提交时间:2023/12/04
viewpoint navigation  360 degrees field of view  multi-view stereo  real-time rendering  
MemBridge: Video-Language Pre-Training With Memory-Augmented Inter-Modality Bridge 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, 卷号: 32, 页码: 4073-4087
作者:  Yang, Jiahao;  Li, Xiangyang;  Zheng, Mao;  Wang, Zihan;  Zhu, Yongqing;  Guo, Xiaoqian;  Yuan, Yuchen;  Chai, Zifeng;  Jiang, Shuqiang
收藏  |  浏览/下载:14/0  |  提交时间:2023/12/04
Video-language pre-training  inter-modality bridge  memory module