Institute of Computing Technology, Chinese Academy IR
Replay attack detection based on distortion by loudspeaker for voice authentication | |
Ren, Yanzhen1; Fang, Zhong2; Liu, Dengkai1; Chen, Changwen3 | |
2019-04-01 | |
Journal | MULTIMEDIA TOOLS AND APPLICATIONS |
ISSN | 1380-7501 |
Volume | 78, Issue: 7, Pages: 8383-8396 |
Abstract | Identity authentication based on Automatic Speaker Verification (ASV) has attracted extensive attention, and voice can substitute for a password in many applications. However, the security of current ASV systems is seriously challenged by malicious spoofing attacks. Among these, the replay attack is one of the biggest threats: an adversary uses a pre-recorded speech sample of the legitimate user to gain access to the ASV system. In this paper, we present a replay attack detection (RAD) scheme that distinguishes normal speech from replayed speech. We focus on the distortion caused by the loudspeaker, namely low-frequency attenuation and high-frequency harmonics, and present a suite of RAD features, DL-RAD, comprising Harmonic Energy Ratio (HER), Low Spectral Ratio (LSR), Low Spectral Variance (LSV), and Low Spectral Difference Variance (LSDV), to describe the differences between normal and replayed speech signals. An SVM is adopted as the classifier to evaluate the performance of these features. Experimental results show that the True Positive Rate (TPR) and True Negative Rate (TNR) of the proposed method are about 98.15% and 98.75% respectively, significantly better than those of the existing scheme. The proposed scheme can be applied to both text-dependent and text-independent ASV systems. |
Keywords | Automatic Speaker Verification (ASV) ; Replay Attack Detection (RAD) ; Loudspeaker ; Low-frequency attenuation ; Spoofing attack |
DOI | 10.1007/s11042-018-6834-3 |
Indexed By | SCI |
Language | English |
Funding Project | Natural Science Foundation of China (NSFC)[U1536114] ; Natural Science Foundation of China (NSFC)[61872275] ; Natural Science Foundation of China (NSFC)[U1536204] ; China Scholarship Council |
WOS Research Area | Computer Science ; Engineering |
WOS Subject | Computer Science, Information Systems ; Computer Science, Software Engineering ; Computer Science, Theory & Methods ; Engineering, Electrical & Electronic |
WOS ID | WOS:000466381800028 |
Publisher | SPRINGER |
Document Type | Journal article |
Identifier | http://119.78.100.204/handle/2XEOYT63/4264 |
Collection | Institute of Computing Technology, Chinese Academy of Sciences: Journal Papers (English) |
Corresponding Author | Ren, Yanzhen |
Affiliation | 1. Wuhan Univ, Sch Cyber Sci & Engn, Minist Educ, Key Lab Aerosp Informat Secur & Trusted Comp, Wuhan, Hubei, Peoples R China ; 2. Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China ; 3. SUNY Buffalo, Buffalo, NY 14260 USA |
Recommended Citation: GB/T 7714 | Ren, Yanzhen, Fang, Zhong, Liu, Dengkai, et al. Replay attack detection based on distortion by loudspeaker for voice authentication[J]. MULTIMEDIA TOOLS AND APPLICATIONS, 2019, 78(7): 8383-8396. |
APA | Ren, Yanzhen, Fang, Zhong, Liu, Dengkai, & Chen, Changwen. (2019). Replay attack detection based on distortion by loudspeaker for voice authentication. MULTIMEDIA TOOLS AND APPLICATIONS, 78(7), 8383-8396. |
MLA | Ren, Yanzhen, et al. "Replay attack detection based on distortion by loudspeaker for voice authentication". MULTIMEDIA TOOLS AND APPLICATIONS 78.7 (2019): 8383-8396. |
Files in This Item | There are no files associated with this item. |