CSpace

浏览/检索结果: 共59条,第1-10条 帮助

已选(0)清除 条数/页:   排序方式:
SMART: Syntax-Calibrated Multi-Aspect Relation Transformer for Change Captioning 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 7, 页码: 4926-4943
作者:  Tu, Yunbin;  Li, Liang;  Su, Li;  Zha, Zheng-Jun;  Huang, Qingming
收藏  |  浏览/下载:2/0  |  提交时间:2024/12/06
Semantics  Visualization  Transformers  Decoding  Switches  Syntactics  Image representation  Change captioning  multi-aspect relation learning  part-of-speech  visual switch  transformer  
Mind the Gap: Open Set Domain Adaptation via Mutual-to-Separate Framework 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 6, 页码: 4159-4174
作者:  Chang, Dongliang;  Sain, Aneeshan;  Ma, Zhanyu;  Song, Yi-Zhe;  Wang, Ruiping;  Guo, Jun
收藏  |  浏览/下载:1/0  |  提交时间:2024/12/06
Picture archiving and communication systems  Training  Task analysis  Labeling  Adaptation models  Visualization  Information exchange  Domain adaptation  open set  mutual learning  transfer learning  
SLAM-CIM: A Visual SLAM Backend Processor With Dynamic-Range-Driven-Skipping Linear-Solving FP-CIM Macros 期刊论文
IEEE JOURNAL OF SOLID-STATE CIRCUITS, 2024, 页码: 13
作者:  Li, Mengjie;  Zhu, Haozhe;  He, Siqi;  Zhang, Hongyi;  Liao, Jie;  Zhai, Danfeng;  Chen, Chixiao;  Liu, Qi;  Zeng, Xiaoyang;  Sun, Ninghui;  Liu, Ming
收藏  |  浏览/下载:3/0  |  提交时间:2024/12/06
Simultaneous localization and mapping  Sorting  In-memory computing  Visualization  Energy efficiency  Optimization  Common Information Model (computing)  Compute in memory (CIM)  floating point (FP)  linear system solver  simultaneous localization and mapping (SLAM)  
Context Disentangling and Prototype Inheriting for Robust Visual Grounding 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 5, 页码: 3213-3229
作者:  Tang, Wei;  Li, Liang;  Liu, Xuejing;  Jin, Lu;  Tang, Jinhui;  Li, Zechao
收藏  |  浏览/下载:14/0  |  提交时间:2024/05/20
Visualization  Grounding  Prototypes  Transformers  Task analysis  Linguistics  Feature extraction  Context disentangling  open-vocabulary scene  prototype discovering  robust grounding  visual grounding (VG)  
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 3, 页码: 1322-1338
作者:  Yu, Ting;  Lin, Xiaojun;  Wang, Shuhui;  Sheng, Weiguo;  Huang, Qingming;  Yu, Jun
收藏  |  浏览/下载:10/0  |  提交时间:2024/05/20
Three-dimensional displays  Task analysis  Visualization  Point cloud compression  Grounding  Surveys  Solid modeling  3D dense captioning  vision-language bridging  visual captioning  3D point cloud  
A2Pt : Anti-Associative Prompt Tuning for Open Set Visual Recognition 期刊论文
IEEE TRANSACTIONS ON MULTIMEDIA, 2024, 卷号: 26, 页码: 8419-8431
作者:  Ren, Hairui;  Tang, Fan;  Pan, Xingjia;  Cao, Juan;  Dong, Weiming;  Lin, Zhiwen;  Yan, Ke;  Xu, Changsheng
收藏  |  浏览/下载:2/0  |  提交时间:2024/12/06
Tuning  Neck  Task analysis  Image recognition  Calibration  Visualization  Training  Multi-modality Pre-trained models (PTMs)  open set recognition (OSR)  class-aware representation  anti-associative prompt tuning (A(2)Pt)  
Overview of the Tenth Dialog System Technology Challenge: DSTC10 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 卷号: 32, 页码: 765-778
作者:  Yoshino, Koichiro;  Chen, Yun-Nung;  Crook, Paul;  Kottur, Satwik;  Li, Jinchao;  Hedayatnia, Behnam;  Moon, Seungwhan;  Fei, Zhengcong;  Li, Zekang;  Zhang, Jinchao;  Feng, Yang;  Zhou, Jie;  Kim, Seokhwan;  Liu, Yang;  Jin, Di;  Papangelis, Alexandros;  Gopalakrishnan, Karthik;  Hakkani-Tur, Dilek;  Damavandi, Babak;  Geramifard, Alborz;  Hori, Chiori;  Shah, Ankit;  Zhang, Chen;  Li, Haizhou;  Sedoc, Joao;  D'Haro, Luis F.;  Banchs, Rafael;  Rudnicky, Alexander
收藏  |  浏览/下载:37/0  |  提交时间:2024/05/20
Task analysis  Internet  History  Oral communication  Measurement  Context modeling  Visualization  Dialog systems  natural language processing  speech processing  multimodal sensors  
Event Graph Guided Compositional Spatial--Temporal Reasoning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1109-1121
作者:  Bai, Ziyi;  Wang, Ruiping;  Gao, Difei;  Chen, Xilin
收藏  |  浏览/下载:12/0  |  提交时间:2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Convolution-Enhanced Bi-Branch Adaptive Transformer With Cross-Task Interaction for Food Category and Ingredient Recognition 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 2572-2586
作者:  Liu, Yuxin;  Min, Weiqing;  Jiang, Shuqiang;  Rui, Yong
收藏  |  浏览/下载:10/0  |  提交时间:2024/05/20
Semantics  Visualization  Transformers  Task analysis  Feature extraction  Image recognition  Fish  Food recognition  ingredient recognition  food computing  fine-grained recognition  multi-label recognition  
Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1522-1533
作者:  Wang, Yabing;  Wang, Shuhui;  Luo, Hao;  Dong, Jianfeng;  Wang, Fan;  Han, Meng;  Wang, Xun;  Wang, Meng
收藏  |  浏览/下载:11/0  |  提交时间:2024/05/20
Visualization  Noise measurement  Estimation  Costs  Transportation  Training  Task analysis  Cross-modal retrieval  noise correspondence learning  cross-lingual transfer  optimal transport  machine translation