Basic Features of Audio Signals(音讯的基本特徵).ppt


Basic Features of Audio Signals (音訊的基本特徵)
  Jyh-Shing Roger Jang (張智星)
  /jang
  MIR Lab, CSIE Dept
  National Taiwan Univ., Taiwan

Audio Features
  Four commonly used audio features
    Volume, pitch, timbre, zero crossing rate
  Our goal
    These features can be perceived (more or less) subjectively.
    Our goal is to compute them quantitatively (and objectively) for further processing and recognition.

General Steps for Audio Analysis
  Frame blocking
    Frame duration of 20~40 ms or so
  Frame-based feature extraction
    Volume, zero-crossing rate, pitch, MFCC, etc
  Frame-based Analysis
    Pitch vector for QBSH comparison
    MFCC for HMM evaluation
    …

Frame Blocking (Quiz candidate!)
  Sample rate = 16 kHz
  Frame size = 512 samples
  Frame duration = 512/16000 = 0.032 s = 32 ms
  Overlap = 192 samples
  Hop size = frame size - overlap = 512-192 = 320 samples
  Frame rate = 16000/320 = 50 frames/sec
  [Figure: zoom-in on the waveform, labeled "Frame" and "Overlap"]

Audio Features in Time Domain (Quiz candidate!)
  3 of the most prominent time-domain audio features in a frame (also known as analysis window)
    Intensity
    Fundamental period
    Timbre: Waveform within an FP

Audio Features in Frequency Domain
  Frequency-domain audio features in a frame
    Energy: Sum of power spectrum
    Pitch: Distance between harmonics
    Timbre: Smoothed spectrum
  [Figure labels: Energy, Pitch freq, First formant F1, Second formant F2]

Frame-based Manipulation
  For simplicity, we usually pack frames into a matrix for easy manipulation in MATLAB:
    [y, fs] = audioread('file.wav');
    frameMat = enframe(y, frameSize, overlap);
  frameMat = [Frame 1, Frame 2, …, Frame n]

Introduction to Volume (Quiz candidate!)
  Loudness of audio signals
    Visual cue: Amplitude of vibration
    Also known as energy or intensity
  Two major ways of computing volume:
    Volume:
    Log energy (in decibel):

Volume: Perceived and Computed
  Perceived volume is influenced by
    Frequency (example shown later)
    Timbre (example shown later)
  Computed volume is influenced by
    Microphone types
    Microphone setups

Volume Computation
  To avoid D
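
The frame-blocking arithmetic and the frame matrix from the slides above can be tried out with the following minimal sketch in plain MATLAB/Octave, with no toolboxes. Several things here are assumptions rather than content from the slides: the synthetic 440 Hz tone stands in for a real recording, the explicit loop stands in for enframe (which is not a built-in MATLAB function), and the frame matrix is assumed to hold one frame per column. The two volume formulas did not survive in the text, so the definitions used here (sum of absolute sample values, and 10*log10 of the sum of squared samples) are common choices and may differ from the author's.

```matlab
% Sketch only: plain MATLAB/Octave, no toolboxes required.
fs        = 16000;                 % sample rate in Hz, as on the slide
frameSize = 512;                   % samples per frame
overlap   = 192;                   % samples shared by adjacent frames
hopSize   = frameSize - overlap;   % 512 - 192 = 320 samples
frameDur  = frameSize / fs;        % 0.032 s = 32 ms
frameRate = fs / hopSize;          % 16000 / 320 = 50 frames per second

% Assumed test signal: one second of a 440 Hz tone instead of audioread().
t = (0:fs-1)' / fs;
y = 0.5 * sin(2*pi*440*t);

% Manual frame blocking; keep only frames that fit entirely in the signal.
frameCount = floor((length(y) - overlap) / hopSize);
frameMat   = zeros(frameSize, frameCount);   % assumed layout: one frame per column
for k = 1:frameCount
    startIdx = (k-1)*hopSize + 1;
    frameMat(:, k) = y(startIdx : startIdx + frameSize - 1);
end

% Assumed volume definitions (the slide's own formulas are not in the text).
volAbs = sum(abs(frameMat), 1);              % sum of absolute sample values per frame
volDb  = 10 * log10(sum(frameMat.^2, 1));    % log energy per frame, in decibels
```

Because each frame is a column, per-frame features reduce to column-wise operations such as sum(..., 1), which is the practical benefit of packing frames into a matrix. Note that an all-zero frame would give -Inf for the log energy, so real recordings with digital silence may need a small floor value.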
