SMART

Overview

Moving Object Proposals (MOPs)
* Compute the optical flow
  Brox T, Malik J. Large displacement optical flow: descriptor matching in variational motion estimation[J]. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 2011, 33(3): 500-513.
* Compute optical flow boundaries
  Dollár P, Zitnick C L. Structured forests for fast edge detection[C]//Computer Vision (ICCV), 2013 IEEE International Conference on. IEEE, 2013: 1841-1848.
* Compute multiple figure-ground segmentations
  Krähenbühl P, Koltun V. Geodesic object proposals[M]//Computer Vision–ECCV 2014. Springer International Publishing, 2014: 725-739.

Moving Objectness Detector (MOD)
* CNN: a dual-pathway architecture
  Krizhevsky A, Sutskever I, Hinton G E. ImageNet classification with deep convolutional neural networks[C]//Advances in neural information processing systems. 2012: 1097-1105.
* CNN: initialization
  Girshick R, Donahue J, Darrell T, et al. Rich feature hierarchies for accurate object detection and semantic segmentation[C]//Computer Vision and Pattern Recognition (CVPR), 2014 IEEE Conference on. IEEE, 2014: 580-587.
* CNN: training using "Caffe"
  Jia Y, Shelhamer E, Donahue J, et al. Caffe: Convolutional architecture for fast feature embedding[C]//Proceedings of the ACM International Conference on Multimedia. ACM, 2014: 675-678.

Tube proposal generation
* Per-frame MOPs to spatio-temporal tubes
  Grady L. Random walks for image segmentation[J]. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 2006, 28(11): 1768-1783.

Experiments

Discussion: failure cases
* Moseg: inaccurate mapping of trajectory clusters to pixel tubes; the tubes often leak slightly into the background, especially for animals with thin limbs
* VSB100: large motion or full object occlusion; an additional linking step, where similarly looking tubes are linked a