计算机视觉new.ppt (Computer Vision)

Published 2015-12-26 (Jiangxi)

Computer Vision

Enabling computers to perceive three-dimensional environment information from two-dimensional images.
1. Perceive the geometric information of objects in the 3D environment: shape, position, pose, and motion.
2. Describe, store, recognize, and understand.

Marr's theory of computational vision
Three levels: computational theory, representation and algorithm, hardware implementation.
Three stages: primal sketch, 2.5D sketch, 3D model.

Multiobject Tracking as Maximum-Weight Independent Set

This paper addresses the problem of simultaneous tracking of multiple targets in a video. We first apply object detectors to every video frame. Pairs of detection responses from every two consecutive frames are then used to build a graph of tracklets. The graph helps transitively link the best matching tracklets that do not violate hard and soft contextual constraints between the resulting tracks. We prove that this data association problem can be formulated as finding the maximum-weight independent set (MWIS) of the graph. We present a new, polynomial-time MWIS algorithm, and prove that it converges to an optimum. Similarity and contextual constraints between object detections, used for data association, are learned online from object appearance and motion properties. Long-term occlusions are addressed by iteratively repeating MWIS to hierarchically merge smaller tracks into longer ones. Our results demonstrate advantages of simultaneously accounting for soft and hard contextual constraints in multitarget tracking. We outperform the state of the art on the benchmark datasets.

Overview of the Approach

Step 1: We apply detectors of a set of object classes to all video frames. Each detection is characterized by a descriptor that records the following properties of the corresponding bounding box: location, size, and the histograms of color, intensity gradients, and optical flow.

Step 2: The best matching detections are transitively linked across video into distinct tracks, whose total number is unknown a priori. This is done under the hard constraint that no two tracks may share the same detection, to prevent implausible video interpretations. In addition, the linking is informed by spatiotemporal relationships between the tracks, which provide for soft constrai
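
To make the MWIS formulation of data association more concrete, here is a minimal sketch on a toy example. It is not the paper's polynomial-time algorithm: it enumerates all subsets by brute force, which is only feasible for a handful of nodes. The detection names (`a1`, `b1`, ...), the similarity weights, and the function names are hypothetical and chosen for illustration. Each node is a candidate tracklet linking one detection in frame t to one in frame t+1, and two nodes conflict when they share a detection, which mirrors the hard constraint that no two tracks may share the same detection. Soft contextual constraints are not modeled here.

```python
from itertools import combinations


def conflicts(a, b):
    """Two tracklets conflict if they use the same detection in either frame."""
    return a[0] == b[0] or a[1] == b[1]


def max_weight_independent_set(weights):
    """Exhaustive MWIS over candidate tracklets (exponential; toy inputs only)."""
    nodes = list(weights)
    best_set, best_weight = (), 0.0
    for k in range(1, len(nodes) + 1):
        for subset in combinations(nodes, k):
            # Skip subsets containing any pair of conflicting tracklets.
            if any(conflicts(a, b) for a, b in combinations(subset, 2)):
                continue
            total = sum(weights[n] for n in subset)
            if total > best_weight:
                best_set, best_weight = subset, total
    return set(best_set), best_weight


# Hypothetical similarity scores:
# (detection in frame t, detection in frame t+1) -> weight
tracklets = {
    ("a1", "b1"): 0.9,
    ("a1", "b2"): 0.4,
    ("a2", "b1"): 0.5,
    ("a2", "b2"): 0.8,
}

selected, total = max_weight_independent_set(tracklets)
print(sorted(selected), round(total, 2))
```

In this toy graph the selected set pairs `a1` with `b1` and `a2` with `b2`, since that is the heaviest combination in which no detection is used twice.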