CSpace  > 中国科学院计算技术研究所期刊论文  > 英文
Improving speech transcription by exploiting user feedback and word repetition
Wang, Xiangdong1,2; Yang, Ying3; Liu, Hong1,2; Qian, Yueliang1,2
2017-10-01
发表期刊MULTIMEDIA TOOLS AND APPLICATIONS
ISSN1380-7501
卷号76期号:19页码:20359-20376
摘要Speech Transcription is important for video/audio retrieval and many other applications. In automatic speech transcription, recognition errors are inevitable, which makes user feedback such as manual error correction necessary. In this paper, an approach is proposed to improve the accuracy of speech transcription by exploiting user feedback and word repetition. The method aims at learning from user feedback and recognition results of preceding utterances and then correcting errors when repeated words are falsely recognized in following utterances. An interaction scheme for user feedback is proposed, which facilitate error correction by candidate lists and provide a new kind of feedback referred to as word indication to extend error correction from repeated words to repeated phrases. For template extraction and matching, the representation of word template and recognition results based on syllable confusion network (SCN) is proposed. During the transcription, templates of multi-syllable words/phrases based on SCN are extracted from user feedback and the N-best lattice, and then matched in SCN corresponding to recognition results of subsequent utterances to yield a new candidate list when repeated words are detected. Experimental results show that considerate error reduction is achieved in the newly-generated candidate lists.
关键词Speech transcription Error correction User feedback Repeated word
DOI10.1007/s11042-017-4714-x
收录类别SCI
语种英语
WOS研究方向Computer Science ; Engineering
WOS类目Computer Science, Information Systems ; Computer Science, Software Engineering ; Computer Science, Theory & Methods ; Engineering, Electrical & Electronic
WOS记录号WOS:000409180500058
出版者SPRINGER
引用统计
被引频次:2[WOS]   [WOS记录]     [WOS相关记录]
文献类型期刊论文
条目标识符http://119.78.100.204/handle/2XEOYT63/6621
专题中国科学院计算技术研究所期刊论文_英文
通讯作者Wang, Xiangdong
作者单位1.Chinese Acad Sci, Inst Comp Technol, Res Ctr Ubiquitous Comp Syst, Beijing 100190, Peoples R China
2.Chinese Acad Sci, Beijing Key Lab Mobile Comp & Pervas Device, Inst Comp Technol, Beijing 100190, Peoples R China
3.China Agr Univ, Beijing 100083, Peoples R China
推荐引用方式
GB/T 7714
Wang, Xiangdong,Yang, Ying,Liu, Hong,et al. Improving speech transcription by exploiting user feedback and word repetition[J]. MULTIMEDIA TOOLS AND APPLICATIONS,2017,76(19):20359-20376.
APA Wang, Xiangdong,Yang, Ying,Liu, Hong,&Qian, Yueliang.(2017).Improving speech transcription by exploiting user feedback and word repetition.MULTIMEDIA TOOLS AND APPLICATIONS,76(19),20359-20376.
MLA Wang, Xiangdong,et al."Improving speech transcription by exploiting user feedback and word repetition".MULTIMEDIA TOOLS AND APPLICATIONS 76.19(2017):20359-20376.
条目包含的文件
条目无相关文件。
个性服务
推荐该条目
保存到收藏夹
查看访问统计
导出为Endnote文件
谷歌学术
谷歌学术中相似的文章
[Wang, Xiangdong]的文章
[Yang, Ying]的文章
[Liu, Hong]的文章
百度学术
百度学术中相似的文章
[Wang, Xiangdong]的文章
[Yang, Ying]的文章
[Liu, Hong]的文章
必应学术
必应学术中相似的文章
[Wang, Xiangdong]的文章
[Yang, Ying]的文章
[Liu, Hong]的文章
相关权益政策
暂无数据
收藏/分享
所有评论 (0)
暂无评论
 

除非特别说明,本系统中所有内容都受版权保护,并保留所有权利。