Chinese Keyword Top-N Query Processing Based on Multi-Table Databases
- About 49,000 characters
- About 53 pages
- Published 2018-05-18 in Shanghai
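摘要 (Abstract)

The theories and techniques of keyword query have been studied widely and in depth, and are broadly applied, in information retrieval and Web search engines. Traditional database management systems support only pattern matching and do not support free-form keyword queries. For this reason, keyword query processing over relational databases has become one of the most closely watched research frontiers in recent years. Traditional relational database systems operate on the database through Structured Query Language (SQL), which requires users to master both SQL and the database schema; this is difficult for ordinary users. In addition, traditional database systems can apply only simple ordering to the returned results, so it is hard for users to pick out the information that interests them most. Current research on keyword queries mainly targets English keywords; this paper therefore presents a Chinese keyword top-N query processing method for databases with multiple tables. The method creates an index table that stores the Chinese tuple characters extracted from the database together with their related information, builds an index on it to match query characters quickly, and draws on IR similarity formulas to construct a ranking strategy suited to Chinese keyword queries. For a Chinese keyword query, the index is used to match query characters against tuple characters and obtain the corresponding information. From this information, a candidate-tuple generation linked list and SQL query statements are created, which yield the candidate tuples and their similarity to the query; the top-N results are finally returned in order of similarity. The method supports character-by-character search and the processing of queries that use Chinese abbreviations. Finally, experiments on a real data set verify query response time and accuracy, and the experimental data show that the method is effective.

Keywords: relational database; Chinese keywords; index; ranking strategy

Abstract

The theories and techniques of keyword query have been extensively studied and applied in Information Retrieval and Web search engines. Traditional relational database management systems support pattern matching of tuples against query conditions; however, they do not support free-form keyword search. Thus, research on processing keyword queries over relational databases has intensified in recent years and has become an active research topic. Traditional relational database systems use SQL (Structured Query Language) to search the database and require users to know both the database schema and SQL. These requirements make such a search model difficult for ordinary users. Additionally, the ranking functions applied to query results are simple in traditional relational database systems; therefore, it is not easy for users to find their desired answers among too many results. At present, most research on keyword queries addresses English keyword search. In this paper, we provide a new method for processing Chinese keyword queries in a database system with multiple relations. This method creates an index table to store the Chinese tuple words and the related information coming from the database, and then constructs a procedure for calculating the similarity. Given a Chinese keyword query, using the index to match the query words and tuple words, we establish a linked list to generate identifier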
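The pipeline outlined in the abstract (index the Chinese characters of every tuple, match query characters against the index, score the candidate tuples, return the top N) can be illustrated with a minimal in-memory sketch. It makes several assumptions that do not come from the paper: the function names `build_char_index` and `top_n` are hypothetical, the sample tables are invented, Python dictionaries stand in for the index table and the generated SQL statements, and the score is a generic TF-IDF-style formula rather than the paper's own similarity formula.

```python
import heapq
import math
from collections import Counter, defaultdict


def is_cjk(ch):
    """True for characters in the CJK Unified Ideographs block."""
    return "\u4e00" <= ch <= "\u9fff"


def build_char_index(tables):
    """Map each Chinese character to {(table, row_id): frequency in that tuple}."""
    index = defaultdict(dict)
    for table_name, rows in tables.items():
        for row_id, text in rows.items():
            counts = Counter(c for c in text if is_cjk(c))
            for ch, freq in counts.items():
                index[ch][(table_name, row_id)] = freq
    return index


def top_n(index, total_tuples, query, n):
    """Score tuples sharing characters with the query; return the n best.

    The score is a generic TF-IDF-style sum (an assumption, not the
    paper's formula): rarer characters and repeated matches weigh more.
    """
    scores = defaultdict(float)
    for ch in {c for c in query if is_cjk(c)}:
        postings = index.get(ch, {})
        if not postings:
            continue
        idf = math.log(1 + total_tuples / len(postings))
        for tuple_id, freq in postings.items():
            scores[tuple_id] += (1 + math.log(freq)) * idf
    return heapq.nlargest(n, scores.items(), key=lambda kv: kv[1])


# Invented sample data: two tables, each mapping a row id to its text.
tables = {
    "university": {1: "北京大学", 2: "清华大学"},
    "paper": {1: "北京大学数据库研究", 2: "关键词查询处理"},
}
index = build_char_index(tables)
total = sum(len(rows) for rows in tables.values())

# "北大" is a common abbreviation of "北京大学"; character-level matching
# still reaches the tuples that contain the full name.
for tuple_id, score in top_n(index, total, "北大", 2):
    print(tuple_id, round(score, 3))
```

Matching on single characters instead of segmented words is what lets an abbreviated query reach tuples that hold the full name, which corresponds to the character-level search and abbreviation handling the abstract mentions. A fuller implementation would presumably persist the index in a database table and fetch the candidate tuples with SQL, as the abstract describes.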