Replay attack detection based on distortion by loudspeaker for voice authentication

doi:10.1007/s11042-018-6834-3

	Replay attack detection based on distortion by loudspeaker for voice authentication
	Ren, Yanzhen 1; Fang, Zhong 2; Liu, Dengkai 1; Chen, Changwen 3
	2019-04-01
发表期刊	MULTIMEDIA TOOLS AND APPLICATIONS
ISSN	1380-7501
卷号	78 期号:7 页码:8383-8396
摘要	Identity authentication based on Automatic Speaker Verification (ASV) has attracted extensive attention. Voice can be used as a substitute of password in many applications. However, the security of current ASV systems has been seriously challenged by many malicious spoofing attacks. Among all those attacks, replay attack is one of the biggest threats to the ASV System, where an adversary can use a pre-recorded speech sample of the legal user to access the ASV system. In this paper, we present a replay attack detection (RAD) scheme to distinguish normal speech and replayed speech. We focus on the distortion caused by loudspeaker: low-frequency attenuation and high-frequency harmonics, and present a suite of RAD features DL-RAD, including Harmonic Energy Ratio (HER), Low Spectral Ratio (LSR), Low Spectral Variance (LSV), and Low Spectral Difference Variance (LSDV), to describe the different characteristics between the normal speech signal and replay speech signal. SVM is adopted as a classifier to evaluate the performance of these features. Experiment results show that the True Positive Rate (TPR), True Negative Rate (TNR) of the proposed method are about 98.15% and 98.75% respectively, which are significantly better than the existing scheme. The proposed scheme can be applied to both text-dependent and text-independent ASV systems.
关键词	Automatic Speaker Verification (ASV) Replay Attack Detection (RAD) Loudspeaker Low-frequency attenuation Spoofing attack
DOI	10.1007/s11042-018-6834-3
收录类别	SCI
语种	英语
WOS研究方向	Computer Science ; Engineering
WOS类目	Computer Science, Information Systems ; Computer Science, Software Engineering ; Computer Science, Theory & Methods ; Engineering, Electrical & Electronic
WOS记录号	WOS:000466381800028
出版者	SPRINGER
引用统计	被引频次：13[WOS] [WOS记录] [WOS相关记录]
文献类型	期刊论文
条目标识符	http://119.78.100.204/handle/2XEOYT63/4264
专题	中国科学院计算技术研究所期刊论文_英文
通讯作者	Ren, Yanzhen
作者单位	1.Wuhan Univ, Sch Cyber Sci & Engn, Minist Educ, Key Lab Aerosp Informat Secur & Trusted Comp, Wuhan, Hubei, Peoples R China 2.Chinese Acad Sci, Inst Comp Technol, Beijing, Peoples R China 3.SUNY Buffalo, Buffalo, NY 14260 USA
推荐引用方式 GB/T 7714	Ren, Yanzhen,Fang, Zhong,Liu, Dengkai,et al. Replay attack detection based on distortion by loudspeaker for voice authentication[J]. MULTIMEDIA TOOLS AND APPLICATIONS,2019,78(7):8383-8396.
APA	Ren, Yanzhen,Fang, Zhong,Liu, Dengkai,&Chen, Changwen.(2019).Replay attack detection based on distortion by loudspeaker for voice authentication.MULTIMEDIA TOOLS AND APPLICATIONS,78(7),8383-8396.
MLA	Ren, Yanzhen,et al."Replay attack detection based on distortion by loudspeaker for voice authentication".MULTIMEDIA TOOLS AND APPLICATIONS 78.7(2019):8383-8396.