Profiling tools courseware.ppt

VTune Summary
Pros: Lets you get the best possible performance out of Intel architecture.
Cons: Extreme tuning requires a deep understanding of processor and OS internals.

Valgrind
Multi-purpose Linux x86 profiling tool.

Call Graph: understanding each line

Call Graph: "parents" numbers
Current function: g()
Time that was propagated from the function's children into this parent.
Time that was propagated directly from the function into this parent.
Number of times this parent called the function / total number of times the function was called.

Call Graph: "children" numbers
Current function: g()
Amount of time that was propagated from the child's children to the function.
Amount of time that was propagated directly from the child into the function.
Number of times this function called the child / total number of times this child was called.

How gprof works
Instruments the program to count calls.
Watches the program as it runs, sampling the PC every 0.01 s.
Statistical inaccuracy: a fast function may receive 0 or 1 samples.
The run should be long compared with the sampling period.
Several gmon.out files can be combined into a single report.
The output from gprof gives no indication of parts of your program that are limited by I/O or swapping bandwidth. This is because samples of the program counter are taken at fixed intervals of run time.
Number-of-calls figures are derived by counting, not sampling. They are completely accurate and will not vary from run to run if your program is deterministic.
Profiling with inlining and other optimizations needs care.

VTune performance analyzer
To squeeze every bit of power out of Intel architecture!

VTune Modes/Features
Time- and Event-Based, System-Wide Sampling provides developers with the most accurate representation of their software's actual performance with negligible overhead.
Call Graph Profiling provides developers with a pictorial view of program flow to quickly identify critical functions and call sequences.
Coun
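The gprof notes above contrast sampled time (one PC sample every 0.01 s) with exact call counts. The sketch below is a toy simulation of that contrast, not gprof's actual implementation. The `simulate` function, the `fast`/`slow` workload, and the `offset_us` parameter are all hypothetical names introduced for illustration. Only the 0.01 s period comes from the slides.

```python
from collections import Counter

# 0.01 s sampling period from the slides, in microseconds (integers avoid float drift).
SAMPLE_PERIOD_US = 10_000


def simulate(calls, offset_us=0):
    """Toy model of a sampling profiler (illustrative only, not gprof internals).

    calls: list of (function_name, duration_us) executed back to back.
    offset_us: time of the first PC sample relative to program start.
    Returns (exact call counts, PC sample counts) as Counters.
    """
    call_counts = Counter()
    sample_counts = Counter()
    now = 0
    next_sample = offset_us
    for name, duration in calls:
        call_counts[name] += 1          # counting: exact and deterministic
        end = now + duration
        while next_sample < end:        # sampling: depends on timer alignment
            sample_counts[name] += 1
            next_sample += SAMPLE_PERIOD_US
        now = end
    return call_counts, sample_counts


# Hypothetical workload: a 2 ms function and a 48 ms function, alternating 3 times.
workload = [("fast", 2_000), ("slow", 48_000)] * 3

calls, samples = simulate(workload, offset_us=5_000)
for name in ("fast", "slow"):
    estimated_ms = samples[name] * SAMPLE_PERIOD_US // 1_000
    print(f"{name}: calls={calls[name]} samples={samples[name]} estimated={estimated_ms} ms")
```

In this toy run, `fast` really takes 6 ms in total. With the first sample at 5 ms it receives no samples, so 0 ms is attributed to it. With `offset_us=0` it receives three samples, so 30 ms is attributed. The call counts are 3 for both functions in either case, which is why the slides recommend runs that are long relative to the sampling period.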
