- 1
- 0
- 约1.12万字
- 约 42页
- 2017-01-12 发布于辽宁
- 举报
IndexingandRepresentationTheVectorSpaceModel.ppt
Slide courtesy Ray Larson Indexing and Representation:The Vector Space Model Document represented by a vector of terms Words (or word stems) Phrases (e.g. computer science) Removes words on “stop list” Documents aren’t about “the” Often assumed that terms are uncorrelated. Correlations between term vectors implies a similarity between documents. For efficiency, an inverted index of terms is often stored. Document RepresentationWhat values to use for terms Boolean (term present /absent) tf (term frequency) - Count of times term occurs in document. The more times a term t occurs in document
您可能关注的文档
- ConsumerDecisionMakingModel.ppt
- ControllableAtomisticGrapheneOxideModelandits.docx
- CreativeCommons(CC)ANewModelforCopyright.ppt
- CurrentModeloftheAtom.ppt
- DatabaseDesignandTheEntity-RelationshipModel.ppt
- DelawareCountyTransitionalCareModel.ppt
- DesigningandTestingaPedagogicalModelforSimulation-.ppt
- DevelopingaViableBusinessModel.ppt
- Developmentofneuralnetworkemulationsofmodelphysics.ppt
- DoestheBarro-GordonModelExplaintheBehaviorofInflation.ppt
- 第一节 电阻和变阻器(讲义)物理沪科版2024九年级全一册.docx
- 第3节 质量的测量 (讲义) 物理沪科版(五四学制)2024 八年级上册.docx
- 第14讲 圆周运动(复习讲义)高考物理一轮复习.docx
- 暑假预习专题15 指数函数(20题型)新高一数学讲义(沪教版2020).docx
- 第二节 发电机是怎样工作的(讲义)物理沪科版2024九年级全一册.docx
- 4.18 东晋南朝政治和江南地区开发 教学设计 部编版七年级上学期历史.docx
- 2.5实验:用单摆测量重力加速度(表格式教学设计)物理人教版2019选择性必修第一册.docx
- 第49讲 沉淀溶解平衡及图像分析(讲义)高考化学复习讲义(新教材新高考).docx
- 旅游景区行业分析报告:内外兼修,多元创新.pdf
- Unit 1~2 单元语法知识点梳理 高二下学期期中考点(上教版2020选择性必修第二册).pptx
原创力文档

文档评论(0)