嵌入自联想神经网络高斯混合模型说话人辨认.pdfVIP

下载本文档

4
0
约2.84万字
约 5页
2017-07-05 发布于湖北
举报

嵌入自联想神经网络高斯混合模型说话人辨认.pdf

更多技术文章，论文请登录第 32 卷第 3 期电子与信息学报 Vol.32No.3 2010 年 3 月 Journal of Electronics Information Technology Mar.2010 嵌入自联想神经网络的高斯混合模型说话人辨认陈存宝赵力 (东南大学信息科学与工程学院南京 210096) 摘要：该文提出了一种嵌入自联想神经网络的高斯混合模型，它充分利用了神经网络和高斯混合模型各自的优点，以最大似然概率(ML)为准则，把它们作为一个整体来进行训练。训练过程中，高斯混合模型和神经网络的参数交替更新。由于神经网络起到了“数据整形”的作用，因而提高了类内数据的相似性。实验结果表明，采用该文提出的模型在各种信噪比情况下的识别率都比基线系统有所提高，最高能达到 19%。关键词：说话人识别；高斯混合模型(GMM)；自联想神经网络(AANN)；嵌入中图分类号：TP391.42 文献标识码： A 文章编号：1009-5896(2010)03-0528-05 DOI:10.3724/SP.J.1146.2008.00275 Speaker Identification Based on GMM with Embedded AANN Chen Cun-bao Zhao Li (School of Information Science and Engineering, Southeast University, Nanjing 210096, China) Abstract: In this paper, a modified Gaussian Mixed Model (GMM) with an embedded Auto-Associate Neural Network (AANN) is proposed. It integrates the merits of GMM and AANN. GMM and AANN as a whole are trained by means of Maximum Likelihood (ML). In the process of training, the parameters of GMM and AANN are updated alternately. AANN reshapes the distribution of the data and improves the similarity of the data in one class. Experiments show that the proposed system improves accuracy rate against baseline GMM at all SNR, maximum to 19%. Key words: Speaker identification; Gaussian Mixed Model (GMM); Auto-Associate Neural Network (AANN); Embedded 1 引言 GMM 超向量的支持向量机(SVM)和因子分析方自动

您可能关注的文档

文档评论（0）

1亿VIP精品文档

更多 >

嵌入自联想神经网络高斯混合模型说话人辨认.pdfVIP