- 19
- 0
- 约4.13万字
- 约 56页
- 2018-05-18 发布于上海
- 举报
基于抽样的deep web模式匹配分析-deep web pattern matching analysis based on sampling
基于抽样的DeepWeb模式匹配研究摘要随着电子商务的蓬勃发展,Web数据库数量激增,使得DeepWeb成为新的研究热点。网络中DeepWeb模式数量众多,传统的模式匹配方法通常在两个模式间进行,难以胜任DeepWeb模式匹配工作。多源模式匹配方法能够同时匹配多个模式,发掘所有属性间的匹配关系,对DeepWeb资源的高效利用具有重要意义。本文介绍了Deep Web的背景和研究现状,探讨了DeepWeb数据集成、模式匹配及其特点,重点研究了两种经典的多源Deep Web模式匹配框架。DCM框架是一种针对DeepWeb模式特性的匹配技术,能同时完成多个模式间的复杂匹配工作;但DCM框架在处理异常模式集时,查准率低下。针对DCM的缺陷,本文采用抽样的方法消除异常模式的影响,提出基于抽样的模式匹配框架。基于所提出的模式匹配框架,设计了一种DeepWeb模式匹配算法(SMBS),该算法挖掘出更加完整的模式信息,构建统一查询接口模式(GIS)的生成模型,与传统Deep Web模式匹配算法直接构建GIS不同,SMBS算法构建的模型能根据需求生成GIS,提高系统的普适性。以BAMM数据集作为实验数据,分别在正常模式集和异常模式集情况下,对SMBS进行测试,并与DCM、MGS进行比较,测试结果表明SMBS匹配查准率高于DCM。关键词:DeepWeb,模式匹配,相关性挖掘,抽样ResearchofDeepWebSchemaMatchingBased onSamplingAbstractWiththeboomofe-commerceandthesharpriseinthenumberofWebdata- bases,DeepWebhasbecomeanewresearchhotspot.Becauseofthelargenumber ofDeepWebschemas,traditionalschemamatchingmethodshavebeenincapable forthematchingwork.Multi-sourcematchingmethodswhichcanmatchschemas insame time have far-reaching significance inefficient using ofDeepWeb.ThisdissertationintroducesthebackgroundandthecurrentsituationofDeep Web,anddescribestheDeepWebdataintegration,schema matchingandthefea- turesofDeepWebschemamatching.Twoclassicschemamatchingframeworkfor DeepWebareanalyzed.DCM(dualcorrelationmining)frameworkisa schema matchingframeworkfittingforthecharacteristicsofDeepWebschemas,whichcangetthecomplexity matchsbetweenmultipleschemasinsametime.HowevertheDCMframework wouldhavealowprecisionwhensomespecialschemaswereintheset.Toraisethe precision,thisdissertationproposedanewschemamatchingframeworkwhich eliminates theimpact ofspecial schemas bysampling .Based on the proposed matching framework, a schema matching algo- rithm(SMBS)wasdesigned.SMBSdigsoutmorematchinginformation,andbuilds aunifiedqueryinterfaceschema(GIS)generationmodel.Sincetraditionalschema matchingalgorithmsbuildGISdirectly,SMBScangeneratGISondemandbythe generation model, which improves the systems universality.WithBAMMdatasetastheexperimentaldata,SMBSwastestedandcompared withDCMand MGSframework.Ther
您可能关注的文档
- 基于博弈模型的网络脆弱性评估的分析-analysis of network vulnerability assessment based on game model.docx
- 基于博弈论视角的融资型特许经营项目综合管理模式分析-analysis of comprehensive management model of financing franchise project based on game theory.docx
- 基于博弈视角的绿色供应链政府补贴政策研究-research on government subsidy policy of green supply chain from the perspective of game theory.docx
- 基于博弈视角的区域集群产业品牌培育分析-analysis of regional cluster industrial brand cultivation based on game theory.docx
- 基于卟啉的农药残留检测方法分析-analysis of pesticide residue detection method based on porphyrin.docx
- 基于卟啉及罗丹明内酰胺新型阳离子荧光分子探针设计与合成-design and synthesis of novel cationic fluorescent molecular probe based on porphyrin and rhodamine lactam.docx
- 基于补偿的时钟磁道闭环写入与伺服校验分析-analysis of clock track closed-loop writing and servo check based on compensation.docx
- 基于卟啉交互传感阵列检测农残方法分析-analysis of detection method of agricultural residue based on porphyrin interactive sensor array.docx
- 基于卟啉微阵列传感器系统的实现与分析-implementation and analysis of porphyrin - based microarray sensor system.docx
- 基于不等式方法的多目标遗传算法在排课问题中的应用分析-application analysis of multi-objective genetic algorithm based on inequality method in course scheduling problem.docx
- 小区绿化施工协议书.docx
- 墙面施工协议书.docx
- 1 古诗二首(课件)--2025-2026学年统编版语文二年级下册.pptx
- (2026春新版)部编版八年级道德与法治下册《3.1《公民基本权利》PPT课件.pptx
- (2026春新版)部编版八年级道德与法治下册《4.3《依法履行义务》PPT课件.pptx
- (2026春新版)部编版八年级道德与法治下册《6.2《按劳分配为主体、多种分配方式并存》PPT课件.pptx
- (2026春新版)部编版八年级道德与法治下册《6.1《公有制为主体、多种所有制经济共同发展》PPT课件.pptx
- 初三教学管理交流发言稿.docx
- 小学生课外阅读总结.docx
- 餐饮门店夜经济运营的社会责任报告(夜间贡献)撰写流程试题库及答案.doc
原创力文档

文档评论(0)