- 2
- 0
- 约7千字
- 约 33页
- 2017-03-09 发布于上海
- 举报
The XTree An Index structure for High Dimensional Data采油树的高维数据索引结构
The X-TreeAn Index Structure for High Dimensional Data Outline Introduction Problems of R-tree based structures X-tree Structure X-tree Algorithms Overall-Minimal Split Performance Evaluation Introduction Objective - To index point and spatial data in high-dimensional space Dimensions - few tens to hundreds Hyper-rectangles Fields - CAD, Molecular biology Improves upon R*-tree Approach ‘Minimal Overlap Split’ Directory structure organization - ‘Supernodes’ Performance is better than R* tree and TV tree by 2 orders of magnitude Previous work (on High Dimensional Data) Reduce dimensionality - two basic approaches: Data is highly clustered and correlated Occupy only some space Algorithms to transform to lower dimension Index using traditional multi-dimensional index structures Eg: SS Tree Small number of dimensions contain most of the information Eg: TV Tree BUT…reduced dimensions may still be too high Problem with R* tree Why R*-Trees ? Handles both point and spatial data Spatial data is not transformed to point data Performance deteriorates rapidly with dimension. After detailed evaluations, found that Overlap in directory increases rapidly with growing dimensionality. Dimension=5, Overlap=90% Overlap ?Query Performance ? Defining Overlap Intuitively - Percentage of volume covered by more than one directory hyper-rectangle Overlap R-tree node contains n hyper-rectangles {R1, R2, … Rn} Overlap directly corresponds to query performance (only if query objects are uniformly distributed) Query distribution estimated by data distribution In high dimensional data queries and data are clustered Defining Overlap (contd) Weighted Overlap More accurate Percentage of data objects in overlapping space Defining Overlap (contd) Multi Overlap - How many Ri’s in the overlapping space ? Overlap in R* Tree Dimensionality ?, Overlap ? So multiple paths need to be searched for each query X-Tree - eXtended node tree Goal - Efficient query processing of high di
您可能关注的文档
- The SONY Corporation A Case Study in Transnational Media索尼公司在跨国媒体中的个案研究.ppt
- The Southern Colonies Plantations and Slavery南部殖民地的种植园和奴隶制度.ppt
- The Spanish Conquer Two Empires in the Americas西班牙在美洲的征服.ppt
- The Space Elevator The Budker Group太空电梯的计组.ppt
- The South Carolina Geodetic Survey南卡罗来纳州大地测量.ppt
- The Spanish Conquistadors Conquer the Aztecs Yola西班牙征服者在征服奇拉.ppt
- The Space Elevator building our future太空电梯建设我们的未来.ppt
- The Space Grant Internet Telescope Network The 太空资助互联网望远镜网络.ppt
- The Spark火花.ppt
- The Special Challenges of NeurologicalBased Behavior神经行为的特殊挑战.ppt
- 2025-2026学年天津市和平区高三(上)期末数学试卷(含解析).pdf
- 2025-2026学年云南省楚雄州高三(上)期末数学试卷(含答案).pdf
- 2025-2026学年甘肃省天水市张家川实验中学高三(上)期末数学试卷(含答案).docx
- 2025-2026学年福建省厦门市松柏中学高二(上)期末数学试卷(含答案).docx
- 2025-2026学年广西钦州市高一(上)期末物理试卷(含答案).docx
- 2025-2026学年河北省邯郸市临漳县九年级(上)期末化学试卷(含答案).docx
- 2025-2026学年河北省石家庄二十三中七年级(上)期末历史试卷(含答案).docx
- 2025-2026学年海南省五指山市九年级(上)期末化学试卷(含答案).docx
- 2025-2026学年河北省唐山市玉田县九年级(上)期末化学试卷(含答案).docx
- 2025-2026学年河北省邢台市市区九年级(上)期末化学试卷(含答案).docx
原创力文档

文档评论(0)