Automatic Segmentation of News Items Based on Video and Audio Features
Weiqiang Wang, Wen Gao
Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China, 100080
Email: {wqwang, wgao}@
Abstract. In this paper, we present an approach that exploits audio and video
features to segment news items automatically. Integrating audio and visual
analysis overcomes the weaknesses of approaches that rely on image analysis
alone, and makes our approach more adaptable to the varied forms in which news
items occur. The proposed approach identifies silence segments in the
accompanying audio and combines them with shot segmentation results and anchor
shot detection results to determine the boundaries between news items.
Experiments show that integrating audio and video features is effective for
the automatic segmentation of news items.
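As a rough illustration of the pipeline outlined in the abstract, the following Python sketch finds silence segments by short-time energy and then combines them with shot boundaries and anchor shot intervals. It is not the authors' implementation: the function names, the energy threshold, the frame length, the minimum silence duration, and the fusion rule (prefer a shot boundary near a silence, otherwise accept a silence that falls inside an anchor shot) are all assumptions made for this example.

```python
from typing import List, Sequence, Tuple

Span = Tuple[float, float]  # (start_sec, end_sec)


def find_silence_segments(samples: Sequence[float], sample_rate: int,
                          frame_ms: int = 20,
                          energy_threshold: float = 1e-4,
                          min_silence_ms: int = 300) -> List[Span]:
    """Return spans whose mean frame energy stays below the threshold.

    All parameter defaults are illustrative assumptions, not values
    taken from the paper.
    """
    frame_len = max(1, int(sample_rate * frame_ms / 1000))
    min_frames = max(1, min_silence_ms // frame_ms)
    n_frames = len(samples) // frame_len
    segments: List[Span] = []
    run_start = None  # index of the first frame of the current quiet run

    for i in range(n_frames + 1):
        quiet = False
        if i < n_frames:
            frame = samples[i * frame_len:(i + 1) * frame_len]
            energy = sum(x * x for x in frame) / frame_len
            quiet = energy < energy_threshold
        if quiet:
            if run_start is None:
                run_start = i
        else:
            # A quiet run just ended (or the signal ended); keep it if long enough.
            if run_start is not None and i - run_start >= min_frames:
                segments.append((run_start * frame_len / sample_rate,
                                 i * frame_len / sample_rate))
            run_start = None
    return segments


def candidate_item_boundaries(silences: Sequence[Span],
                              shot_boundaries: Sequence[float],
                              anchor_shots: Sequence[Span],
                              tolerance: float = 0.5) -> List[float]:
    """Hypothetical fusion rule yielding at most one boundary per silence."""
    boundaries: List[float] = []
    for start, end in silences:
        mid = (start + end) / 2
        near = [t for t in shot_boundaries
                if start - tolerance <= t <= end + tolerance]
        if near:
            # A shot change coincides with the pause: use the closest one.
            boundaries.append(min(near, key=lambda t: abs(t - mid)))
        elif any(a <= mid <= b for a, b in anchor_shots):
            # No shot change, but the pause lies inside an anchor shot.
            boundaries.append(mid)
    return boundaries
```

The second branch of the fusion rule reflects the case in which an anchorperson moves from one news item to the next without any visual change, so that the pause in speech is the only available cue.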
1. Introduction
Most early studies in video structure analysis are based on visual information. To
index and retrieve video documents effectively, they are segmented into scenes [1,2].
Furthermore, shot boundaries are determined to characterize content details [3,4], and
key frames are extracted to construct indexes [5,6].
Audio, as another time-dependent medium in video documents, can supplement
visual information and supply a unique cue for video content analysis. For instance,
in anchor shots of CCTV news, the visual content is almost unchanged, yet
anchorpersons may report multiple news items within the same shot. In a movie, a
group of consecutive shots may differ drastically in visual content, but the
accompanying music indicates that they belong to the same semantic clip. Recently,
more of the literature has proposed applying audio analysis techniques to
characterize video content. [7] exploited multiple audio features and a neural
network classifier to differentiate five classes of TV programs: advertisement,
basketball, football, news, and weather. [8] pr