基于GMM的低码率语音编码器-电路与系统专业论文.docxVIP

  • 3
  • 0
  • 约4.82万字
  • 约 140页
  • 2019-01-04 发布于上海
  • 举报

基于GMM的低码率语音编码器-电路与系统专业论文.docx

基于GMM的低码率语音编码器-电路与系统专业论文

摘要摘 摘要 摘 要 本文研究了一种新颖的基于高斯混合模型(Gaussian Mixture Model,∞心压) 的低码率语音编码系统。该编码器利用GMM对短时语音谱包络进行拟合后用 GMM参数来表示语音谱包络。由于GMM参数较少,从而可以使得编码速率很 低。 语音谱包络决定了合成语音的可懂度,文中研究了LPC法、LPC倒谱法和 SEEVOC法的谱包络估计,并进行了仿真实验。经过对比,本系统采用SEEVOC 法来获取短时语音谱包络。研究了GMM和EM算法,用6阶GMM参数(均值、 方差、混合权重)表示短时语音谱包络。 人耳对基音的变化比对其它任何参数的变化都要敏感,因此基音的检测对合 成语音质量很关键。文中基于变长平均幅度差函数(L、啪F)提出了一种改进 的基音周期检测算法(Modified LVAMDF,M.I:VAMDF),改进算法在LVAMDF 的基础上结合修正的阈值线和简化的自相关函数(ACF)。经仿真测试表明,此 方法能检测出汉语语音中基音变化较快的语音帧的平均周期,提高了汉语语音解 码质量。 本文建立了基于GMM的低码率语音编码器方案,对方案各模块进行了仿真 并最终实现了整个编解码系统。仿真结果表明:该编码器在传输码率降低到 2.35kb/s时,解码得到的语音有较理想的清晰度、可懂度和自然度,令人比较满 意。 关键词:语音编码;G/VIM;低码率;谱包络;基音检测 ABSTRACTA ABSTRACT A novel low bit-rate speech coder based on Gaussian Mixture Model(GMM),which is used to parameterize the short-time speech spectrum envelope,is researched in this paper.Since the segmented speech can be represented by very few parameters of GMM,the bit-rate of the coder is very low. The spectrum envelope carries very important information of the speech.Different methods aS LPC,LPC cepstmm and SEEVOC for obtaining the spectrum envelope are analyzed.By some comparisons,the method SEEVOC is utilized.Then the spectrum envelope call be represented by the means,covariances and mixture weights of GMM. The pitch affects the q谢ity of synthesized speech.A modified pitch detection algorithm based on Length Varied Average Magnitude Difference Function(LVAMDF)is presented.The new algorithm Call be used to extract the average pitch period of the mandarin speech,when compared、砘m the LVAMDF pitch detection algorithm.The result of the test experiments shows that the modified pitch detection algorithm brings good precision and better synthesized speech. With the above improvements and other speech feature extracting methods,the system of speech coding and speech decoding is realized.The result of the experiments shows that the proposed speech coder presents a good performance.The quality of the synthesized speech is still satisfying when the bit-rate of the coder is reduced to

您可能关注的文档

文档评论(0)

1亿VIP精品文档

相关文档