CSpace

Browse/Search results: 14 items in total, showing items 1-10

Radiology report generation with a learned knowledge base and multi-modal alignment (Journal article)
MEDICAL IMAGE ANALYSIS, 2023, Volume: 86, Pages: 10
Authors: Yang, Shuxin; Wu, Xian; Ge, Shen; Zheng, Zhuozhao; Zhou, S. Kevin; Xiao, Li
Views/Downloads: 11/0  |  Submitted: 2023/12/04
Radiology report generation  Knowledge base  Multi-modal alignment  
A Pyramid Semi-Autoregressive Transformer with Rich Semantics for Sign Language Production (Journal article)
SENSORS, 2022, Volume: 22, Issue: 24, Pages: 15
Authors: Cui, Zhenchao; Chen, Ziang; Li, Zhaoxin; Wang, Zhaoqi
Views/Downloads: 17/0  |  Submitted: 2023/07/12
human pose generation  sign language production  semi-autoregressive transformer  deep learning  
Channel-Aware Decoupling Network for Multiturn Dialog Comprehension (Journal article)
IEEE TRANSACTIONS ON NEURAL NETWORKS AND LEARNING SYSTEMS, 2022, Pages: 12
Authors: Zhang, Zhuosheng; Zhao, Hai; Liu, Longxiang
Views/Downloads: 17/0  |  Submitted: 2023/07/12
Deep neural networks  dialog modeling  natural language generation  open domain conversation system  
A Review on Question Generation from Natural Language Text (Journal article)
ACM TRANSACTIONS ON INFORMATION SYSTEMS, 2022, Volume: 40, Issue: 1, Pages: 43
Authors: Zhang, Ruqing; Guo, Jiafeng; Chen, Lu; Fan, Yixing; Cheng, Xueqi
Views/Downloads: 21/0  |  Submitted: 2022/12/07
Question generation  natural language generation  survey  
Long Short-Term Relation Transformer With Global Gating for Video Captioning (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2022, Volume: 31, Pages: 2726-2738
Authors: Li, Liang; Gao, Xingyu; Deng, Jincan; Tu, Yunbin; Zha, Zheng-Jun; Huang, Qingming
Views/Downloads: 29/0  |  Submitted: 2022/12/07
Transformers  Cognition  Visualization  Feature extraction  Decoding  Task analysis  Semantics  Video captioning  relational reasoning  long short-term graph  transformer  
Integrating Scene Semantic Knowledge into Image Captioning (Journal article)
ACM TRANSACTIONS ON MULTIMEDIA COMPUTING COMMUNICATIONS AND APPLICATIONS, 2021, Volume: 17, Issue: 2, Pages: 22
Authors: Wei, Haiyang; Li, Zhixin; Huang, Feicheng; Zhang, Canlong; Ma, Huifang; Shi, Zhongzhi
Views/Downloads: 41/0  |  Submitted: 2021/12/01
Image captioning  attention mechanism  scene semantics  encoder-decoder framework  
Bridging Text and Video: A Universal Multimodal Transformer for Audio-Visual Scene-Aware Dialog (Journal article)
IEEE-ACM TRANSACTIONS ON AUDIO SPEECH AND LANGUAGE PROCESSING, 2021, Volume: 29, Pages: 2476-2483
Authors: Li, Zekang; Li, Zongjia; Zhang, Jinchao; Feng, Yang; Zhou, Jie
Views/Downloads: 46/0  |  Submitted: 2021/12/01
Task analysis  Feature extraction  Visualization  Speech processing  History  Social networking (online)  Pattern recognition  Dialogue System  Multimodal  Natural Language Processing  Video Understanding  
Grouping sentences as better language unit for extractive text summarization (Journal article)
FUTURE GENERATION COMPUTER SYSTEMS-THE INTERNATIONAL JOURNAL OF ESCIENCE, 2020, Volume: 109, Pages: 331-359
Authors: Cao, Mengyun; Zhuge, Hai
Views/Downloads: 35/0  |  Submitted: 2020/12/10
Text summarization  Semantic Link Network  Clustering  Natural language processing  
Spatio-Temporal Memory Attention for Image Captioning (Journal article)
IEEE TRANSACTIONS ON IMAGE PROCESSING, 2020, Volume: 29, Pages: 7615-7628
Authors: Ji, Junzhong; Xu, Cheng; Zhang, Xiaodan; Wang, Boyue; Song, Xinhang
Views/Downloads: 53/0  |  Submitted: 2020/12/10
Image captioning  spatio-temporal relationship  attention transmission  memory attention  LSTM  
Know More Say Less: Image Captioning Based on Scene Graphs (Journal article)
IEEE TRANSACTIONS ON MULTIMEDIA, 2019, Volume: 21, Issue: 8, Pages: 2117-2130
Authors: Li, Xiangyang; Jiang, Shuqiang
Views/Downloads: 81/0  |  Submitted: 2019/12/10
Image captioning  scene graph  relationship  long short-term network  attention mechanism  vision-language