Comparing machine learning and knowledge discovery in databases an application to knowledge.pdf
- 1、本文档共20页,可阅读全部内容。
- 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
Comparing machine learning and knowledge discovery in databases an application to knowledge
1Comparing Machine Learning and Knowledge Discovery in DataBases : An
Application to Knowledge Discovery in Texts
Yves Kodratoff
CNRS, LRI Bat. 490 Univ. Paris-Sud, F - 91405 Orsay Cedex
yk@lri.fr
Text associated to a course delivered at the ECCAI summer course, Crete July 1999.
To be published by Springer-Verlag
in the Lecture Notes on AI (LNAI) - Tutorials series, 2000.
SUMMARY :
This presentation has two goals.
The first goal is to compare ML and Knowledge Discovery in Data (KDD, also often
called Data Mining, DM) in order to insist on how much they actually differ In order to make
my ideas somewhat easier to understand, and as an illustration, I will include a description of
several research topics that I find relevant to KDD and to KDD only.
The second goal is to show that the definition I give of KDD can be almost directly
applied to text analysis, and that will lead us to a very restrictive definition of Knowledge
Discovery in Texts (KDT). I will provide a compelling example of a real-life set of rules
obtained by what I call KDT techniques.
1. INTRODUCTION
KDD is better known by the oversimplified name of Data Mining (DM). Actually, most
academics are rather interested by DM which develops methods for extracting knowledge
from a given set of data. Industrialists and experts should be more interested in KDD which
comprises the whole process of data selection, data cleaning, transfer to a DM technique,
applying the DM technique, validating the results of the DM technique, and finally interpreting
them for the user. In general, this process is a cycle that improves under the criticism of the
expert.
Machine Learning (ML) and KDD have in common a very strong link : they both
acknowledge the importance of induction as a normal way of thinking, while other scientific
fields are reluctant to accept it, to say the least. We shall first explore this common point. We
believe that this reluctance relies on a misuse of apparent contradictions inside the theory of
confi
您可能关注的文档
- C103 – General Checklist-Accreditation of Field Testing and.pdf
- C10-M53R说明书.pdf
- Cabin-fence.pdf
- CAD 绘图必须掌握的30个技巧.pdf
- CAD二次实验报告4.docx
- CAD绘图快捷键+实用命令.pdf
- CaII and NaI absorption signatures from extraplanar gas in the halo of the Milky Way.pdf
- Cadence Allegro OrCAD V16.3 安装步骤.pdf
- Calculating noise figure in op amps.pdf
- Calculating the interior permanent-magnet motor.pdf
文档评论(0)