基于wfst的中文语音识别解码器的分析-analysis of chinese speech recognition decoder based on wfst.docxVIP

下载本文档

146
0
约3.65万字
约 41页
2018-05-18 发布于上海
举报

基于wfst的中文语音识别解码器的分析-analysis of chinese speech recognition decoder based on wfst.docx

基于wfst的中文语音识别解码器的分析-analysis of chinese speech recognition decoder based on wfst

优秀毕业论文精品参考文献资料摘要语音识别技术，主要是通过计算机语音处理技术，实现一种人机界面，为人与人和人与计算机之间的顺畅交流提供一种便捷的方式。自语音识别技术发展以来，已经取得了一定的研究成果，国内外诸多大公司也加大了对大词汇量中文语音识别技术的开发和研究。在语音识别技术中，解码器是最为关键的部分。近年来，有限状态转换器被广泛应用于语音识别技术中。由于有限状态转换器不仅可以使用于模拟讯号模型，更可以进一步模拟自然语言中许多重要且繁复的文法结构与文法特性。因此，有限状态转换器成为语音研究有力的工具。本文主要讨论带权有限状态转换器在大词汇量中文语音识别系统中的应用。它的基本思想是，将声学模型、发音词典、语言模型分别用一个加权有限状态转换器来表示。然后通过组合演算法将其整合为一个完整的加权有限状态转换器模型，从而可以得到一个同一维度的语音识别搜索空间。本论文可分为四个部分：第一个部分是带权有限状态机相关的基本概念和理论推导；第二部分讨论如何将传统语音识别中所使用的声学模型、发音词典和语音模型分别建立成有限状态转换器形式，以及介绍合并演算法，用来减少各有限状态转换器的状态数和转移数；第三部分讨论如何以组合算法将各带权有限状态转换器整合成为一个搜索空间，以及优化问题；第四部分，设计并实现解码器，在给出测试语料的基础上进行试验。最后，将实验结果与传统的基于 HTK 工具的识别结果，分别在识别率和解码速度两个方面进行比较，得出结论。证明基于加权有限状态转换器的识别系统的正确性及优越性。关键词：语音识别；加权有限状态转换器；解码器。 II Abstract Speech recognition technology, mainly using the computer speech processing technology, to provide a human-machine interface for people, so we can be more convenient exchange between man and computer. Since the development of speech recognition technology has achieved a certain amount of research, many domestic and foreign companies also increase the development and research of large vocab- ulary chinese speech recognition technology. In speech recognition technology, the decoder is the most critical part. The final search of the speech recognition is com- pleted by the decoder. In recent years, the finite-state transducers are widely used in speech recognition technology. The finite-state transducer not only can be used in analog signal model, but also can simulate many important and complicated grammar structure and grammar of the natural language characteristics. so it become a powerful tool for speech research. This article will mainly discuss the application of the finite-state transducer in the large vocabulary chinese speech recognition system. The basic idea is using the finite-state transducer to represent the acoustic model, the lexicon and the language model respectively. Then, they will be combined by combining algorithm. And we will get a complete weighted f

您可能关注的文档

文档评论（0）

1亿VIP精品文档

更多 >

基于wfst的中文语音识别解码器的分析-analysis of chinese speech recognition decoder based on wfst.docxVIP