VTune Summary
- Pros: Allows developers to get the best possible performance out of Intel architecture
- Cons: Extreme tuning requires a deep understanding of processor and OS internals

Valgrind
- Multi-purpose Linux x86 profiling tool

Call Graph: understanding each line

Call Graph: parents' numbers
Current function: g()
- Time that was propagated from the function's children into this parent
- Time that was propagated directly from the function into this parent
- Number of times this parent called the function `/` total number of times the function was called

Call Graph: "children" numbers
Current function: g()
- Amount of time that was propagated from the child's children to the function
- Amount of time that was propagated directly from the child into the function
- Number of times this function called the child `/` total number of times this child was called

How gprof works
- Instruments the program to count calls
- Watches the program running and samples the PC every 0.01 sec
- Statistical inaccuracy: a fast function may take 0 or 1 samples
- The run should be long compared with the sampling period
- Several gmon.out files can be combined into a single report
- The output from gprof gives no indication of parts of your program that are limited by I/O or swapping bandwidth. This is because samples of the program counter are taken at fixed intervals of run time.
- Number-of-calls figures are derived by counting, not sampling. They are completely accurate and will not vary from run to run if your program is deterministic.
- Profiling with inlining and other optimizations needs care

VTune performance analyzer
- To squeeze every bit of power out of Intel architecture!

VTune Modes/Features
- Time- and Event-Based, System-Wide Sampling provides developers with the most accurate representation of their software's actual performance with negligible overhead
- Call Graph Profiling provides developers with a pictorial view of program flow to quickly identify critical functions and call sequences
- Coun
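
The sampling points listed under "How gprof works" (a fast function may take 0 or 1 samples, so the run should be long compared with the sampling period) can be illustrated with a toy model. The sketch below is not gprof's implementation. It only simulates a tick every 0.01 sec over a hand-written timeline. The function name `sample_timeline`, the `offset_us` parameter, and the `fast`/`slow` durations are all hypothetical and chosen for illustration.

```python
from collections import Counter

SAMPLE_PERIOD_US = 10_000  # 0.01 sec, expressed in microseconds to avoid float error


def sample_timeline(timeline, period_us=SAMPLE_PERIOD_US, offset_us=0):
    """Toy model of PC sampling (hypothetical helper, not part of gprof).

    timeline  : list of (function_name, duration_us) in execution order
    offset_us : time of the first sampling tick relative to program start
    Returns a Counter mapping function name -> number of ticks that hit it.
    """
    samples = Counter()
    next_tick = offset_us  # time of the next sampling tick
    start = 0              # start time of the current timeline segment
    for name, duration_us in timeline:
        end = start + duration_us
        # Every tick that falls in [start, end) is charged to this function.
        while next_tick < end:
            samples[name] += 1
            next_tick += period_us
        start = end
    return samples


if __name__ == "__main__":
    # Assumed workload: fast() runs for 3 ms, then slow() runs for 44 ms.
    one_pass = [("fast", 3_000), ("slow", 44_000)]

    # Short run: the result for fast() depends only on where the ticks fall.
    print(dict(sample_timeline(one_pass, offset_us=0)))
    print(dict(sample_timeline(one_pass, offset_us=5_000)))

    # Long run: the same workload repeated 1000 times.
    long_run = sample_timeline(one_pass * 1000)
    estimated_ms = long_run["fast"] * SAMPLE_PERIOD_US / 1000
    print("estimated fast() time (ms):", estimated_ms)
```

In the short run, `fast` receives one sample or none depending on the tick offset, so its estimated time is either 10 ms or 0 ms, although it really ran for 3 ms. Over the 1000 repetitions the sample count settles at the true share of run time. Call counts are unaffected by any of this, because they are counted rather than sampled.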