Institute of Computing Technology, Chinese Academy IR
Improving speech transcription by exploiting user feedback and word repetition | |
Wang, Xiangdong1,2; Yang, Ying3; Liu, Hong1,2; Qian, Yueliang1,2 | |
2017-10-01 | |
发表期刊 | MULTIMEDIA TOOLS AND APPLICATIONS |
ISSN | 1380-7501 |
卷号 | 76期号:19页码:20359-20376 |
摘要 | Speech Transcription is important for video/audio retrieval and many other applications. In automatic speech transcription, recognition errors are inevitable, which makes user feedback such as manual error correction necessary. In this paper, an approach is proposed to improve the accuracy of speech transcription by exploiting user feedback and word repetition. The method aims at learning from user feedback and recognition results of preceding utterances and then correcting errors when repeated words are falsely recognized in following utterances. An interaction scheme for user feedback is proposed, which facilitate error correction by candidate lists and provide a new kind of feedback referred to as word indication to extend error correction from repeated words to repeated phrases. For template extraction and matching, the representation of word template and recognition results based on syllable confusion network (SCN) is proposed. During the transcription, templates of multi-syllable words/phrases based on SCN are extracted from user feedback and the N-best lattice, and then matched in SCN corresponding to recognition results of subsequent utterances to yield a new candidate list when repeated words are detected. Experimental results show that considerate error reduction is achieved in the newly-generated candidate lists. |
关键词 | Speech transcription Error correction User feedback Repeated word |
DOI | 10.1007/s11042-017-4714-x |
收录类别 | SCI |
语种 | 英语 |
WOS研究方向 | Computer Science ; Engineering |
WOS类目 | Computer Science, Information Systems ; Computer Science, Software Engineering ; Computer Science, Theory & Methods ; Engineering, Electrical & Electronic |
WOS记录号 | WOS:000409180500058 |
出版者 | SPRINGER |
引用统计 | |
文献类型 | 期刊论文 |
条目标识符 | http://119.78.100.204/handle/2XEOYT63/6621 |
专题 | 中国科学院计算技术研究所期刊论文_英文 |
通讯作者 | Wang, Xiangdong |
作者单位 | 1.Chinese Acad Sci, Inst Comp Technol, Res Ctr Ubiquitous Comp Syst, Beijing 100190, Peoples R China 2.Chinese Acad Sci, Beijing Key Lab Mobile Comp & Pervas Device, Inst Comp Technol, Beijing 100190, Peoples R China 3.China Agr Univ, Beijing 100083, Peoples R China |
推荐引用方式 GB/T 7714 | Wang, Xiangdong,Yang, Ying,Liu, Hong,et al. Improving speech transcription by exploiting user feedback and word repetition[J]. MULTIMEDIA TOOLS AND APPLICATIONS,2017,76(19):20359-20376. |
APA | Wang, Xiangdong,Yang, Ying,Liu, Hong,&Qian, Yueliang.(2017).Improving speech transcription by exploiting user feedback and word repetition.MULTIMEDIA TOOLS AND APPLICATIONS,76(19),20359-20376. |
MLA | Wang, Xiangdong,et al."Improving speech transcription by exploiting user feedback and word repetition".MULTIMEDIA TOOLS AND APPLICATIONS 76.19(2017):20359-20376. |
条目包含的文件 | 条目无相关文件。 |
除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。
修改评论