Institute of Computing Technology, Chinese Academy IR
Named entity recognition in the perovskite field based on convolutional neural networks and MatBERT | |
Zhang, Jiaxin1,2; Zhang, Lingxue1,2; Sun, Yuxuan1,2; Li, Wei3; Quhe, Ruge1,2 | |
2024-05-01 | |
发表期刊 | COMPUTATIONAL MATERIALS SCIENCE |
ISSN | 0927-0256 |
卷号 | 240页码:7 |
摘要 | Due to the significant increase in publications in the field of materials science, there has been a bottleneck in organizing material science knowledge and discovering new materials. The number of literature in the emerging field of perovskite materials has grown to a massive scale. It is necessary to compile information on the structure, properties, synthesis methods, characterization techniques, and applications of perovskite materials. To address this issue, we employed named entity recognition, a natural language processing technique, to extract important entities from perovskite material texts. In this paper, we propose a method based on convolutional neural networks (CNN) and MatBERT. Firstly, we utilized MatBERT, which has been pre-trained on a large amount of material science text, to generate contextualized word embeddings. Next, we extracted feature information using a CNN model. Finally, a conditional random field (CRF) layer was used for decoding sequences in addition to calculating the training and validation loss. Experimental results demonstrated that the performance of our model on perovskite material dataset was improved by 1 %similar to 6% compared with BERT, SciBERT and MatBERT models. Through this model, we extracted the entities of 2389 abstracts to obtain knowledge of perovskite materials. |
关键词 | Named Entity Recognition BERT Convolutional Neural Network Conditional Random Field |
DOI | 10.1016/j.commatsci.2024.113014 |
收录类别 | SCI |
语种 | 英语 |
资助项目 | National Key R & D Program of China[2021ZD0110102] |
WOS研究方向 | Materials Science |
WOS类目 | Materials Science, Multidisciplinary |
WOS记录号 | WOS:001229879500001 |
出版者 | ELSEVIER |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://119.78.100.204/handle/2XEOYT63/40089 |
专题 | 中国科学院计算技术研究所期刊论文_英文 |
通讯作者 | Li, Wei; Quhe, Ruge |
作者单位 | 1.Beijing Univ Posts & Telecommun, State Key Lab Informat Photon & Opt Commun, Beijing 100876, Peoples R China 2.Beijing Univ Posts & Telecommun, Sch Sci, Beijing 100876, Peoples R China 3.Chinese Acad Sci, Inst Comp Technol, SKL Processors, Beijing, Peoples R China |
推荐引用方式 GB/T 7714 | Zhang, Jiaxin,Zhang, Lingxue,Sun, Yuxuan,et al. Named entity recognition in the perovskite field based on convolutional neural networks and MatBERT[J]. COMPUTATIONAL MATERIALS SCIENCE,2024,240:7. |
APA | Zhang, Jiaxin,Zhang, Lingxue,Sun, Yuxuan,Li, Wei,&Quhe, Ruge.(2024).Named entity recognition in the perovskite field based on convolutional neural networks and MatBERT.COMPUTATIONAL MATERIALS SCIENCE,240,7. |
MLA | Zhang, Jiaxin,et al."Named entity recognition in the perovskite field based on convolutional neural networks and MatBERT".COMPUTATIONAL MATERIALS SCIENCE 240(2024):7. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论