基于GMM-UBM的快速说话人识别方法-计算机科学与技术专业论文.docxVIP

  • 27
  • 0
  • 约4.87万字
  • 约 69页
  • 2019-01-04 发布于上海
  • 举报

基于GMM-UBM的快速说话人识别方法-计算机科学与技术专业论文.docx

基于GMM-UBM的快速说话人识别方法-计算机科学与技术专业论文

Abstract Due to its flexibility and facilitation in application, text-independent speaker recognition system has become a hot topic in the field of speech recognition. Since NIST (National Institute of Standards and Technology) 1999 sets the Gaussian Mixture Model - Universal Background Model (GMM-UBM) as a reference system to obtain excellent recognition rate, it has functioned as a baseline system for this research area, and got better and better through improvement. Although the speaker recognition system has achieved a relatively satisfactory result, it requires much time for the calculation of the likelihood before matches, which makes the system recognition speed decline significantly. As a result, its practicality in application is not that promising. The main goal of this paper is to achieve better and quicker speaker recognition without affecting the accuracy of recognition rate. As to the large calculation, slow speed in speaker recognition process, this paper makes improvement based on tree structure of the selection algorithm, top-down searching UBM in the output test speech feature vector core of the likelihood distribution of the highest points . It saves effort when matching with the target speaker model since it is only necessary to calculate the core of the likelihood distribution. Reference system with the improved algorithm the core selected speed has increased to 14.7 times. Because the order of feature vector having no effect to the results, so after combined with the vector sequence rearrangements pruning algorithm, the system speed increased by 21.7 times, with recognition rate slightly lower. To improve the recognition rate, the paper introduces the kernel function in SVM (Support Vector Machine) to the speaker recognition, which basically shares the same recognition rate with the referen ce system. For the open set recognition problems, this paper proposes the concept of probability threshold to resolve in the open set female voice recognit

您可能关注的文档

文档评论(0)

1亿VIP精品文档

相关文档