CSpace

Browse/Search results: 261 items in total, showing items 1-10

Context Disentangling and Prototype Inheriting for Robust Visual Grounding (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2024, Volume: 46, Issue: 5, Pages: 3213-3229
Authors: Tang, Wei; Li, Liang; Liu, Xuejing; Jin, Lu; Tang, Jinhui; Li, Zechao
Views/Downloads: 3/0 | Submitted: 2024/05/20
Visualization  Grounding  Prototypes  Transformers  Task analysis  Linguistics  Feature extraction  Context disentangling  open-vocabulary scene  prototype discovering  robust grounding  visual grounding (VG)  
Overview of the Tenth Dialog System Technology Challenge: DSTC10 (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2024, Volume: 32, Pages: 765-778
Authors: Yoshino, Koichiro; Chen, Yun-Nung; Crook, Paul; Kottur, Satwik; Li, Jinchao; Hedayatnia, Behnam; Moon, Seungwhan; Fei, Zhengcong; Li, Zekang; Zhang, Jinchao; Feng, Yang; Zhou, Jie; Kim, Seokhwan; Liu, Yang; Jin, Di; Papangelis, Alexandros; Gopalakrishnan, Karthik; Hakkani-Tur, Dilek; Damavandi, Babak; Geramifard, Alborz; Hori, Chiori; Shah, Ankit; Zhang, Chen; Li, Haizhou; Sedoc, Joao; D'Haro, Luis F.; Banchs, Rafael; Rudnicky, Alexander
Views/Downloads: 7/0 | Submitted: 2024/05/20
Task analysis  Internet  History  Oral communication  Measurement  Context modeling  Visualization  Dialog systems  natural language processing  speech processing  multimodal sensors  
Event Graph Guided Compositional Spatial-Temporal Reasoning for Video Question Answering (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2024, Volume: 33, Pages: 1109-1121
Authors: Bai, Ziyi; Wang, Ruiping; Gao, Difei; Chen, Xilin
Views/Downloads: 2/0 | Submitted: 2024/05/20
Visualization  Cognition  Transformers  Semantics  Feature extraction  Context modeling  Task analysis  VideoQA  video representation  transformer  spatial-temporal reasoning  compositional reasoning  
Multi-state Ingredient Recognition via Adaptive Multi-centric Network (Journal article)
IEEE TRANSACTIONS ON INDUSTRIAL INFORMATICS, 2023, Pages: 10
Authors: Wen, Min; Song, Jiajun; Min, Weiqing; Xiao, Weimin; Han, Lin; Jiang, Shuqiang
Views/Downloads: 2/0 | Submitted: 2024/05/20
Ingredient recognition  intelligent cooking device  
Large Scale Visual Food Recognition (Journal article)
IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2023, Volume: 45, Issue: 8, Pages: 9932-9949
Authors: Min, Weiqing; Wang, Zhiling; Liu, Yuxin; Luo, Mengjiang; Kang, Liping; Wei, Xiaoming; Wei, Xiaolin; Jiang, Shuqiang
Views/Downloads: 7/0 | Submitted: 2023/12/04
Image recognition  Visualization  Task analysis  Benchmark testing  Representation learning  Training  Semantics  Food dataset  food recognition  large-scale datasets  fine-grained recognition  
Context-Aware Proposal-Boundary Network With Structural Consistency for Audiovisual Event Localization (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2023, Pages: 11
Authors: Wang, Hao; Zha, Zheng-Jun; Li, Liang; Chen, Xuejin; Luo, Jiebo
Views/Downloads: 8/0 | Submitted: 2023/12/04
Audiovisual learning  context learning  event localization  
TransWeaver: Weave Image Pairs for Class Agnostic Common Object Detection (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2023, Volume: 32, Pages: 2947-2959
Authors: Guo, Xiaoqian; Li, Xiangyang; Wang, Yaowei; Jiang, Shuqiang
Views/Downloads: 8/0 | Submitted: 2023/12/04
Proposals  Object detection  Task analysis  Feature extraction  Visualization  Training  Measurement  Common object detection  transweaver  transformer  
Multimodal graph neural network for video procedural captioning (Journal article)
NEUROCOMPUTING, 2022, Volume: 488, Pages: 88-96
Authors: Ji, Lei; Tu, Rongcheng; Lin, Kevin; Wang, Lijuan; Duan, Nan
Views/Downloads: 17/0 | Submitted: 2022/12/07
Multimodal video captioning  Graph neural network  
Self-Supervised Enhancement for Named Entity Disambiguation via Multimodal Graph Convolution (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 15
Authors: Zhou, Pengfei; Ying, Kaining; Wang, Zhenhua; Guo, Dongyan; Bai, Cong
Views/Downloads: 29/0 | Submitted: 2022/12/07
Task analysis  Convolution  Semantics  Internet  Bit error rate  Visualization  Pipelines  Graph convolutional network (GCN)  multimodal data  named entity disambiguation (NED)  self-supervised learning (SSL)  
Syntax-Guided Hierarchical Attention Network for Video Captioning (Journal article)
IEEE TRANSACTIONS ON CIRCUITS AND SYSTEMS FOR VIDEO TECHNOLOGY, 2022, Volume: 32, Issue: 2, Pages: 880-892
Authors: Deng, Jincan; Li, Liang; Zhang, Beichen; Wang, Shuhui; Zha, Zhengjun; Huang, Qingming
Views/Downloads: 19/0 | Submitted: 2022/12/07
Syntactics  Feature extraction  Visualization  Generators  Semantics  Two dimensional displays  Three-dimensional displays  Video captioning  syntax attention  content attention  global sentence-context