反蓄意模仿语音识别研究-控制理论与控制工程专业论文.docx

下载文档 降价啦

4
0
约5.03万字
约 117页
2018-10-28 发布于上海
举报
版权申诉
保障服务

反蓄意模仿语音识别研究-控制理论与控制工程专业论文.docx

1、原创力文档（book118）网站文档一经付费（服务费），不意味着购买了该文档的版权，仅供个人/单位学习、研究之用，不得用于商业用途，未经授权，严禁复制、发行、汇编、翻译或者网络传播等，侵权必究。。
2、本站所有内容均由合作方或网友上传，本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺！文档内容仅供研究参考，付费前请自行鉴别。如您付费，意味着您自己接受本站规则且自行承担风险，本站不退款、不进行额外附加服务；查看《如何避免下载的几个坑》。如果您已付费下载过本站文档，您可以点击这里二次下载。
3、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等，请点击“版权申诉”（推荐），也可以打举报电话：400-050-0827(电话支持时间：9:00-18:30)。

反蓄意模仿语音识别研究-控制理论与控制工程专业论文

摘要摘要模仿者蓄意模仿说话人的语音，当相似度达到一定程度时，身份鉴别系统就有可能被模仿者欺骗，并授予其相应的权限，使得系统被模仿者侵入，致使用户的个人信息面临被窃取、破坏的危险。因此，进行蓄意模仿语音的分析研究对信息安全、刑侦和国防等均具有重大意义。文章首先介绍蓄意模仿语音的研究现状以及反蓄意模仿的相关概念和含义，分析并举例说明了蓄意模仿语音对社会和信息安全的危害。简要介绍了蓄意模仿语音说话人识别系统的构成，分析语音端点检测的重要性，提出一种融合时域和频域的音频短时特征参数的时-频端点检测算法，将所提算法与双门限法、短时 TEO 能量法的对比实验，实验结果表明，时-频端点检测算法在带噪语音时依然具有较高的识别语音端点的性能。其次，对 MFCC 倒谱参数及其差分倒谱参数进行详细阐述，在 Mid-MFCC 和 IMFCC 两种改进的 MFCC 特征参数的基础上，根据增减分量法原理，提出 MFCC ???MFCC 和 MMI-MFCC 混合特征参数。最后，介绍蓄意模仿语音库的建立和语音相似度主观评价方法；研究 MFCC、 MFCC ???MFCC 和 MMI-MFCC 混合特征参数对蓄意模仿语音的分辨性能，以及这些参数对蓄意模仿语音相似程度的描述能力，实验结果证明，MMI-MFCC 混合特征参数性能最好。建立基于 SVM 的反蓄意模仿语音识别系统，并与基于 VQ 的说话人识别系统性能作对比，错误接受率从 VQ 系统的 10.07%低到了 SVM 系统的 6.86%，验证了本文建立的系统具有更加优良的性能。关键词：蓄意模仿；支持向量机；MFCC；混合特征参数；语音端点检测 I Abstract Abstract Authentication system is likely to be deceived and gives the corresponding privileges when imitators deliberately imitate the speakers voice reaches a certain degree of similarity. Then, system may be forcible entry and the users personal information may be at high risk of theft or damage. Therefore, it is in necessary to analyze and study the mimicked voice, which plays an important role in such aspects as information security, forensic and national defense. Firstly, the current status of deliberately imitation speech and related concepts of deliberate imitation are introduced. The harm to society and the information security brought by the deliberate imitation voice are illustrated and the composition of deliberately imitate speech speaker recognition system is introduced briefly in the paper. The importance of speech endpoint detection is analyzed. Time domain and frequency domain of voice short-term characteristic parameters are fused to design time-frequency endpoint detection algorithm. The proposed algorithm is compared with the double threshold method and short-term TEO energy method, which verifies the time-frequency endpoint detection algorithm have high recognition performance in the speech with noise. Secondly