语音激活检测技术算法分析及其在语音编码器中的应用-algorithm analysis of speech activation detection technology and its application in speech coder.docxVIP
- 8
- 0
- 约3.99万字
- 约 52页
- 2018-06-05 发布于上海
- 举报
语音激活检测技术算法分析及其在语音编码器中的应用-algorithm analysis of speech activation detection technology and its application in speech coder
摘要本文研究了几种常用的语音激活检测算法,进行了仿真和比较,并在传统算法的基础上提出改进算法,提高了VAD检测的综合性能,能够比较准确区分各种含噪语音的语音/静音帧。同时,将语音激活检测应用到高斯混合模型的低码率语音编码器系统中,大大降低了编码速率。语音激活检测技术性能的优劣很大程度影响了编码器最终的编码速率,因此准确的语音激活检测对合成语音的质量非常关键,本文提出了一些改进的激活检测算法。第一种算法是在传统的基于倒谱参数进行语音激活检测算法的基础上,综合利用短时能量和过零率,建立了综合参数的判决准则,从而提高了性能。第二种是在传统谱熵VAD检测基础上,综合利用了谱减法的降噪增强和自适应子带划分,通过这两方面的改进,使得语音激活检测的准确率进一步提高。仿真结果表明改进算法可以从不同的背景噪声中有效地检测出语音,优于传统检测算法,且运算量较之并没有明显的增加。考虑到语音激活检测技术可以对语音中声音和噪声部分进行有效的区分,为了进一步降低GMM编码器的编码速率,本文将语音激活检测技术应用到GMM语音编码器中。在语音编码之前,首先执行VAD算法检测出语音帧和静音帧,然后对两者采用不同的编码算法,语音帧采用基于高斯混合模型的编码器算法,由于GMM参数较少,可以使码率得到一定降低;另外,静音帧仅发送帧幅度均值。同样解码段,对语音/静音帧分别采用不同的解码算法。仿真结果表明:该编码器可以使全语音时的编码速率降低到2.35kb/s左右,且解码得到的语音有较理想的清晰度、可懂度和自然度,令人比较满意。关键词:语音激活检测,GMM,低码率,语音编码AbstractSomeofcommonVoiceActivityDetectionalgorithmsaresimulatedandcomparedinthispaper,andonthebasisoftraditionalalgorithms,improvedalgorithmisproposedwhichraisethecomprehensivefunctionthatisabletoaccuratelydistinguishspeech/muteframeofvariousnoisevoice.MeanwhiletheimprovedalgorithmisappliedtotheGaussianMixtureModellowbitratevoiceencoderssystemandreducethecodingspeed.ThequalityofVADaffectthefinalencodingspeedgreatly,soexactdetectioniscrucialforthequalityofsynthesizedspeech.Forthispurpose,someimprovementisproposed.Thefirstkindofalgorithmisbasedonthetraditionalcepstrumparametersofvoiceactivationdetectionalgorithm,andutilizetheparameterwhichintegratesenergyandshort-timezero-crossingrate,therebytheperformanceismuchbetter.ThesecondisbasedonthetraditionalVADusingspectralentropy,simultaneouslyspectralsubtractionofnoiseandadaptivesub-banddivisionareapplied.Throughthoseimprovingoftheaspects,theaccuracyisfurtherimproved.Simulationresultsshowthattheimprovedalgorithmcandetectspeechfromdifferentnoisebackgroundeffectivelyandit’ssuperiortothetraditionalalgorithmwithoutobviouscomputationincrease.ConsideringtheVADtechnologycaneffectivelydistinguishnoisepart,hereappliedittoGMMspeechencoderinordertofurtherreducetheencodingspeed.Beforethespeechcodingalgorithm,VADisexecutedtodeterminatethespeechframesandsoundlessvoice,thend
您可能关注的文档
- 渔港的游港化改造利用研究——以沈家门渔港升级改造规划为例-study on the transformation and utilization of fishing ports into ports - taking the upgrading and transformation plan of shenjiamen fishing port as an example.docx
- 榆林盐化工项目可行性分析实证分析-empirical analysis on feasibility analysis of yulin salt chemical project.docx
- 与“边缘人”交往的传播行为研究——以结构主义符号学为研究视域-research on communication behavior with.docx
- 与2×2谱问题相关的孤子族及其无限维bihamilton结构-soliton families and their infinite bi hamilton structures related to 2× 2 spectral problems.docx
- 与cvb3 3a相互作用的人心脏蛋白的筛选及功能初探-screening of human cardiac protein interacting with cv b3 3a and preliminary study on its function.docx
- 与w代数相关联几类无限维李代数结构和表示-structure and representations of several kind of infinite dimensional lie algebras associated with w algebras.docx
- 与被测材料无关新型电涡流位移传感器设计与实现-design and implementation of a new eddy current displacement sensor independent of that material to be measure.docx
- 与地域相关类ba模型构造及应用-construction and application of ba model related to region.docx
- 与rtx集成的科研成果管理系统的分析与实现-analysis and implementation of scientific research achievement management system integrated with rtx.docx
- 与客户成交的n个技巧-n tips for dealing with customers.docx
- 语义分析法在城市色彩规划领域中的应用探讨——以安康城市色彩规划设计为例-discussion on the application of semantic analysis method in the field of city color planning - taking ankang city color planning and design as an example.docx
- 语音情感识别中主动学习和半监督学习方法分析-analysis of active learning and semi-supervised learning methods in speech emotion recognition.docx
- 语义符号学视角下的地域建筑设计方法分析——以遵义新浦新区行政办公楼为例-analysis of regional architectural design methods from the perspective of semantic semiotics - taking the administrative office building of xinpu new district in zunyi as an example.docx
- 语音识别关键技术分析及系统实现-analysis and system implementation of key technologies in speech recognition.docx
- 语音传输与加密系统的分析与实现-analysis and implementation of voice transmission and encryption system.docx
- 语音识别技术在功能测试系统中的应用分析-application analysis of speech recognition technology in function test system.docx
- 语音合成算法分析及嵌入式语音合成系统的实现-analysis of speech synthesis algorithm and implementation of embedded speech synthesis system.docx
- 语音识别算法及应用技术分析-analysis of speech recognition algorithm and application technology.docx
- 语音学习中母语迁移现象分析及其在英语教学中的启示-analysis of mother tongue transfer in phonetics learning and its enlightenment in english teaching.docx
- 语音视频融合互通的方案分析-scheme analysis of voice and video fusion and intercommunication.docx
最近下载
- 2024年苏州工业职业技术学院单招职业适应性测试题库及答案解析.docx VIP
- 数独题目100题1(可打印).pdf VIP
- 《城市轨道交通供电系统的运行》课件——典型牵引降压混合所识图及运行方式分析.pdf VIP
- (毕业论文)某六层框架宿舍楼结构设计计算书.doc VIP
- 产品图纸版本控制规定.docx VIP
- 无人机测绘技术与应用课件41--无人机倾斜摄影数据处理,三维模型生产(瞰景Smart3D建模).ppt
- 一年级的下册数学练习(补墙砖)1.doc VIP
- 学生宿舍楼框架结构设计毕业设计论文.doc VIP
- 2025年领导干部个人民主生活会对照检视剖析材料之在“带头固本培元、增强党性方面”存在的问题24条.docx VIP
- 新人教版八年级上《变量与函数》ppt课件[教学].ppt VIP
原创力文档

文档评论(0)