CSpace  > 中国科学院计算技术研究所期刊论文  > 英文
Radiology report generation with a learned knowledge base and multi-modal alignment
Yang, Shuxin1,6; Wu, Xian3; Ge, Shen3; Zheng, Zhuozhao4,7; Zhou, S. Kevin1,5; Xiao, Li1,2,6
2023-05-01
发表期刊MEDICAL IMAGE ANALYSIS
ISSN1361-8415
卷号86页码:10
摘要In clinics, a radiology report is crucial for guiding a patient's treatment. However, writing radiology reports is a heavy burden for radiologists. To this end, we present an automatic, multi-modal approach for report generation from a chest x-ray. Our approach, motivated by the observation that the descriptions in radiology reports are highly correlated with specific information of the x-ray images, features two distinct modules: (i) Learned knowledge base: To absorb the knowledge embedded in the radiology reports, we build a knowledge base that can automatically distill and restore medical knowledge from textual embedding without manual labor; (ii) Multi-modal alignment: to promote the semantic alignment among reports, disease labels, and images, we explicitly utilize textual embedding to guide the learning of the visual feature space. We evaluate the performance of the proposed model using metrics from both natural language generation and clinic efficacy on the public IU-Xray and MIMIC-CXR datasets. Our ablation study shows that each module contributes to improving the quality of generated reports. Furthermore, the assistance of both modules, our approach outperforms state-of-the-art methods over almost all the metrics. Code is available at https://github.com/LX-doctorAI1/M2KT.
关键词Radiology report generation Knowledge base Multi-modal alignment
DOI10.1016/j.media.2023.102798
收录类别SCI
语种英语
资助项目Open Fund Project of Guangdong Academy of Medical Sciences, China[YKY-KF202206] ; [62271465]
WOS研究方向Computer Science ; Engineering ; Radiology, Nuclear Medicine & Medical Imaging
WOS类目Computer Science, Artificial Intelligence ; Computer Science, Interdisciplinary Applications ; Engineering, Biomedical ; Radiology, Nuclear Medicine & Medical Imaging
WOS记录号WOS:001054264400001
出版者ELSEVIER
引用统计
被引频次:26[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://119.78.100.204/handle/2XEOYT63/21382
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Xiao, Li
作者单位1.Chinese Acad Sci, Inst Comp Technol, Key Lab Intelligent Informat Proc, Beijing 100190, Peoples R China
2.Beijing Univ Posts & Telecommun, Sch Artificial Intelligence, Beijing 100876, Peoples R China
3.Tencent Med AI Lab, Beijing, Peoples R China
4.Beijing Tsinghua Changgung Hosp, Dept Radiol, Beijing 102218, Peoples R China
5.Univ Sci & Technol China, Sch Biomed Engn Suzhou Inst Adv Res, Ctr Med Imaging Robot & Analyt Comp & LEarning MIR, Suzhou 215123, Peoples R China
6.Univ Chinese Acad Sci, Beijing 100049, Peoples R China
7.Tsinghua Univ, Sch Med, Beijing 100084, Peoples R China
推荐引用方式
GB/T 7714
Yang, Shuxin,Wu, Xian,Ge, Shen,et al. Radiology report generation with a learned knowledge base and multi-modal alignment[J]. MEDICAL IMAGE ANALYSIS,2023,86:10.
APA Yang, Shuxin,Wu, Xian,Ge, Shen,Zheng, Zhuozhao,Zhou, S. Kevin,&Xiao, Li.(2023).Radiology report generation with a learned knowledge base and multi-modal alignment.MEDICAL IMAGE ANALYSIS,86,10.
MLA Yang, Shuxin,et al."Radiology report generation with a learned knowledge base and multi-modal alignment".MEDICAL IMAGE ANALYSIS 86(2023):10.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Yang, Shuxin]的文章
[Wu, Xian]的文章
[Ge, Shen]的文章
百度学术
百度学术中相似的文章
[Yang, Shuxin]的文章
[Wu, Xian]的文章
[Ge, Shen]的文章
必应学术
必应学术中相似的文章
[Yang, Shuxin]的文章
[Wu, Xian]的文章
[Ge, Shen]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。