Chapter 15, Section 6: Overview of Speech Recognition and Probabilistic Inference

About 17,400 words, about 28 pages. Published 2026-01-30 (Sichuan).

Outline

♦ Speech as probabilistic inference

♦ Speech sounds

♦ Word pronunciation

♦ Word sequences

Speech as probabilistic inference

It's not easy to wreck a nice beach

Speech signals are noisy, variable, ambiguous

What is the most likely word sequence, given the speech signal?

I.e., choose Words to maximize P(Words | signal)

Use Bayes' rule:

P(Words | signal) = α P(signal | Words) P(Words)

I.e., decomposes into acoustic model P(signal | Words) + language model P(Words)

Words are the hidden state sequence, signal is the observation sequence
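The Bayes' rule decomposition can be made concrete with a small Python sketch. Everything numeric here is an assumption for illustration: the two candidate word sequences (the slide's sentence and the similar-sounding "recognize speech"), the score tables, and the function names are hypothetical and do not come from the slides.

```python
import math

# Hypothetical scores, invented for illustration only.
# log P(signal | Words): how well each candidate explains the audio (acoustic model).
ACOUSTIC_LOG_LIK = {
    "recognize speech": math.log(0.30),
    "wreck a nice beach": math.log(0.35),
}
# log P(Words): how plausible each candidate is as a sentence (language model).
LANGUAGE_LOG_PRIOR = {
    "recognize speech": math.log(0.010),
    "wreck a nice beach": math.log(0.0001),
}


def posterior(acoustic, prior):
    """Return P(Words | signal) over the candidates using Bayes' rule."""
    unnormalized = {w: math.exp(acoustic[w] + prior[w]) for w in acoustic}
    alpha = 1.0 / sum(unnormalized.values())  # the normalizing constant α
    return {w: alpha * p for w, p in unnormalized.items()}


def most_likely_words(acoustic, prior):
    """Pick the candidate maximizing P(signal | Words) P(Words)."""
    # α is the same for every candidate, so the argmax can ignore it.
    return max(acoustic, key=lambda w: acoustic[w] + prior[w])


best = most_likely_words(ACOUSTIC_LOG_LIK, LANGUAGE_LOG_PRIOR)
post = posterior(ACOUSTIC_LOG_LIK, LANGUAGE_LOG_PRIOR)
print(best, post)
```

With these made-up numbers the acoustic model slightly prefers "wreck a nice beach", but the language model prior outweighs it. A real recognizer cannot enumerate candidates like this and searches over hidden word sequences instead.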

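All human speech is composed of 40-50 phones, determined by the configuration of the articulators (lips, teeth, tongue, vocal cords, air flow)

Form an intermediate level of hidden states between words and signal ⇒ acoustic model = pronunciation model + phone model

ARPAbet is designed for American English

[iy] beat      [b]          [p] pet
[ih] bit       [ch] Chet    [r] rat
[ey]           [d] debt     [s] set
[ao] bought    [hh] hat     [th] thick
[ow] boat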
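One way to read "acoustic model = pronunciation model + phone model" is as a sum over the hidden phone sequences: P(signal | word) = Σ P(signal | phones) P(phones | word). The Python sketch below illustrates that reading under strong simplifying assumptions: each phone emits exactly one discrete acoustic symbol, and all tables, symbol names, probabilities, and the variant pronunciation of "bit" are hypothetical. The phone [t] is a standard ARPAbet symbol that does not appear in the table above.

```python
# Hypothetical pronunciation model P(phones | word); probabilities are made up.
PRONUNCIATION = {
    "beat": {("b", "iy", "t"): 1.0},
    "bit": {("b", "ih", "t"): 0.9, ("b", "iy", "t"): 0.1},
}

# Hypothetical phone model P(acoustic symbol | phone), one symbol per phone.
PHONE_MODEL = {
    "b": {"B": 0.9, "P": 0.1},
    "iy": {"I-long": 0.8, "I-short": 0.2},
    "ih": {"I-long": 0.3, "I-short": 0.7},
    "t": {"T": 1.0},
}


def phones_likelihood(signal, phones):
    """P(signal | phones), assuming one observed symbol per phone."""
    if len(signal) != len(phones):
        return 0.0
    prob = 1.0
    for observed, phone in zip(signal, phones):
        prob *= PHONE_MODEL[phone].get(observed, 0.0)
    return prob


def acoustic_likelihood(signal, word):
    """P(signal | word): sum over the word's possible phone sequences."""
    return sum(
        p_pron * phones_likelihood(signal, phones)
        for phones, p_pron in PRONUNCIATION[word].items()
    )


signal = ("B", "I-short", "T")
for word in PRONUNCIATION:
    print(word, acoustic_likelihood(signal, word))
```

In an actual recognizer each phone would typically span many signal frames, so the phone model is itself a sequence model and not the single-symbol lookup used here.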
