文件系统语义分析技术分析-analysis of file system semantic analysis technology.docx

下载文档 降价啦

18
0
约13.21万字
约 156页
2018-08-07 发布于上海
举报
版权申诉
保障服务

文件系统语义分析技术分析-analysis of file system semantic analysis technology.docx

1、原创力文档（book118）网站文档一经付费（服务费），不意味着购买了该文档的版权，仅供个人/单位学习、研究之用，不得用于商业用途，未经授权，严禁复制、发行、汇编、翻译或者网络传播等，侵权必究。。
2、本站所有内容均由合作方或网友上传，本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺！文档内容仅供研究参考，付费前请自行鉴别。如您付费，意味着您自己接受本站规则且自行承担风险，本站不退款、不进行额外附加服务；查看《如何避免下载的几个坑》。如果您已付费下载过本站文档，您可以点击这里二次下载。
3、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等，请点击“版权申诉”（推荐），也可以打举报电话：400-050-0827(电话支持时间：9:00-18:30)。

文件系统语义分析技术分析-analysis of file system semantic analysis technology

后者的模型准确度：实验表明最大能够提升达到20%左右。进一步提出了一种文件自相关性时间序列分析模型―TiMiner。该模型在文件系统语义挖掘过程中引入时间维度，运用时间序列分析方法研究文件系统现象随时间发生的变化。根据实际运行情况，总结了五条文件系统时间序列数据特征，分别是趋势性、周期性、异常观测值、条件异方差以及非线性特征，并针对这些特征分别采用不同时间序列分析方法进行分析。研究发现某一时刻的文件系统缓存命中率状态可以分解成为三个部分：之前时刻系统缓存状态的自相关部分、时间间隔内文件请求到达随机分布部分和相邻时刻状态的差分部分。实验结果表明，TiMiner文件系统缓存命中率预取模型能够比较好的匹配历史数据并有效地预测未来一段时间内的状态趋势。为了论证上述三种模型的有效性，设计和实现了一个实际的大规模分布式智能对象存储系统Cappella，该系统集成了一系列基于文件语义挖掘的服务优化模块来提高整体性能。本文同时讨论和展望了其他一些潜在的文件语义挖掘的应用，诸如：文件感知、可靠性和一致性等方面的问题，以及今后可能有更进一步研究潜力的方向和方法。此外，从若干典型的分布式文件系统的Trace中抽取出一些常用的文件变量要素并将这些要素集成到Cappella系统的实验测试环境中。实验结果表明，本文提出的一系列文件相关性分析模型能够有效的提升Cappella系统服务的性能和质量。关键词：文件系统语义，文件相关性，文件回归分析，文件系统时间序列分析AbstractFileSemanticisthestudyofmeaningthatcanbeusedtoinferfilesystemuserbehav-iors.Ithasbecomeanincreasinglyimportantpracticeinbothengineeringandresearchcom-munityoffilesystemdesignandimplementation.Comparingwithblocksemanticwhosemanifestationsareonlyintheformofdatablockscommonaccesslocality(temporalorspa-tial),filesystemcanprovidemoreusefulandinsightfulinformationaboutfilesemanticduetotheelaborateandrichI/Ointerfacesbetweenupperlayerapplicationsandfilesystems.Unfortunately,itischallengetoexploresemanticknowledgeinfilesystemseffectivelyandaccuratelybecauseavarietyoffactorscouldaffectthisknowledgeexplorationprocess.Ex-amplesofsemanticfactorsincludeuser/programbehavior,storageorganizationandfiledatatypes.Evenworse,thechallengesareexacerbatedduetotheintricateinterdependencybe-tweenthesefactors,makeitdifficulttofullyexploitthepotentiallyimportantcorrelationamongvarioussemanticknowledgethatinturnmayrevealmoreaccuratefilecorrelations.Thisarticleproposesaapproachtomeasureinter-filerelationships,calledFARMER.Inthisapproach,fileistreatedasamultivariatevectorspace,andeachitemwithinthevectorcorrespondsaseparatefactorofthegivenfile.Theselectionoffactordependsontheapplication,typicalfactorsarefilename,creatorandexecutingprogram.Ifoneparticularfactoroccursinbothfiles,itsvalueisnon-zero.Webelievethattheextentofinter-filerelationshipscanbemeasuredbasedonthelikenesso