CSpace

浏览/检索结果: 共113条,第1-10条 帮助

限定条件    
已选(0)清除 条数/页:   排序方式:
Context Disentangling and Prototype Inheriting for Robust Visual Grounding 期刊论文
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, 卷号: 46, 期号: 5, 页码: 3213-3229
作者:  Tang, Wei;  Li, Liang;  Liu, Xuejing;  Jin, Lu;  Tang, Jinhui;  Li, Zechao
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Visualization  Grounding  Prototypes  Transformers  Task analysis  Linguistics  Feature extraction  Context disentangling  open-vocabulary scene  prototype discovering  robust grounding  visual grounding (VG)  
A Comprehensive Survey of 3D Dense Captioning: Localizing and Describing Objects in 3D Scenes 期刊论文
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2024, 卷号: 34, 期号: 3, 页码: 1322-1338
作者:  Yu, Ting;  Lin, Xiaojun;  Wang, Shuhui;  Sheng, Weiguo;  Huang, Qingming;  Yu, Jun
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Three-dimensional displays  Task analysis  Visualization  Point cloud compression  Grounding  Surveys  Solid modeling  3D dense captioning  vision-language bridging  visual captioning  3D point cloud  
GJFusion: A Channel-Level Correlation Construction Method for Multimodal Physiological Signal Fusion 期刊论文
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2024, 卷号: 20, 期号: 2, 页码: 23
作者:  Huang, Wuliang;  Chen, Yiqiang;  Jiang, Xinlong;  Zhang, Teng;  Chen, Qian
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Multimodal  physiological signal  graph neural network  emotion state recognition  ubiquitous computing  
Overview of the Tenth Dialog System Technology Challenge: DSTC10 期刊论文
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, 卷号: 32, 页码: 765-778
作者:  Yoshino, Koichiro;  Chen, Yun-Nung;  Crook, Paul;  Kottur, Satwik;  Li, Jinchao;  Hedayatnia, Behnam;  Moon, Seungwhan;  Fei, Zhengcong;  Li, Zekang;  Zhang, Jinchao;  Feng, Yang;  Zhou, Jie;  Kim, Seokhwan;  Liu, Yang;  Jin, Di;  Papangelis, Alexandros;  Gopalakrishnan, Karthik;  Hakkani-Tur, Dilek;  Damavandi, Babak;  Geramifard, Alborz;  Hori, Chiori;  Shah, Ankit;  Zhang, Chen;  Li, Haizhou;  Sedoc, Joao;  D'Haro, Luis F.;  Banchs, Rafael;  Rudnicky, Alexander
收藏  |  浏览/下载:1/0  |  提交时间:2024/05/20
Task analysis  Internet  History  Oral communication  Measurement  Context modeling  Visualization  Dialog systems  natural language processing  speech processing  multimodal sensors  
Dual-View Curricular Optimal Transport for Cross-Lingual Cross-Modal Retrieval 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1522-1533
作者:  Wang, Yabing;  Wang, Shuhui;  Luo, Hao;  Dong, Jianfeng;  Wang, Fan;  Han, Meng;  Wang, Xun;  Wang, Meng
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Visualization  Noise measurement  Estimation  Costs  Transportation  Training  Task analysis  Cross-modal retrieval  noise correspondence learning  cross-lingual transfer  optimal transport  machine translation  
Synthesizing Knowledge-Enhanced Features for Real-World Zero-Shot Food Detection 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1285-1298
作者:  Zhou, Pengfei;  Min, Weiqing;  Song, Jiajun;  Zhang, Yang;  Jiang, Shuqiang
收藏  |  浏览/下载:1/0  |  提交时间:2024/05/20
Semantics  Feature extraction  Visualization  Annotations  Correlation  Training  Task analysis  Food detection  zero-shot detection  food computing  object detection  zero-shot learning  
Event Graph Guided Compositional Spatial--Temporal Reasoning for Video Question Answering 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 1109-1121
作者:  Bai, Ziyi;  Wang, Ruiping;  Gao, Difei;  Chen, Xilin
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Convolution-Enhanced Bi-Branch Adaptive Transformer With Cross-Task Interaction for Food Category and Ingredient Recognition 期刊论文
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, 卷号: 33, 页码: 2572-2586
作者:  Liu, Yuxin;  Min, Weiqing;  Jiang, Shuqiang;  Rui, Yong
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Semantics  Visualization  Transformers  Task analysis  Feature extraction  Image recognition  Fish  Food recognition  ingredient recognition  food computing  fine-grained recognition  multi-label recognition  
Streamlining spatial omics data analysis with Pysodb 期刊论文
NATURE PROTOCOLS, 2023, 页码: 72
作者:  Lin, Senlin;  Zhao, Fangyuan;  Wu, Zihan;  Yao, Jianhua;  Zhao, Yi;  Yuan, Zhiyuan
收藏  |  浏览/下载:0/0  |  提交时间:2024/05/20
Semantic and Correlation Disentangled Graph Convolutions for Multilabel Image Recognition 期刊论文
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, 页码: 13
作者:  Cai, Shaofei;  Li, Liang;  Han, Xinzhe;  Huang, Shan;  Tian, Qi;  Huang, Qingming
收藏  |  浏览/下载:1/0  |  提交时间:2024/05/20
Attention mechanism  feature disentangling  graph convolutional network (GCN)  multilabel recognition