开源数据挖掘工具比较
Something AboutWEKAKNIMEOpen-Source Data Mining Tools What is the Open-source?Free, Libr`e, Open,source software —— FLOSS. 【1】The common open sources licenses include the GPL, LGPL, BSD, NPL, and MPL, which give user privilege to do something.Open source software provides users with the freedom to run, copy, distribute, study, change and improve the software .[1]Question1What is Data Mining?Data Mining is the process of discovering interesting knowledge from large databases. Data mining is sometimes also referred as a part of knowledge discovery process (KDD).【3】Data Mining tools has different capabilities which provides researchers a platform to support their research activities.Question2Why we use Open Source Data Mining Tools?Open source ensures that staff can understand exactly how the algorithms work by examining the source code, if they so desire, and can also fine tune the algorithms to suit the specific purposes of the enterprise. [1]Open source software can be as robust, or even more robust, than commercial and closed source software. Using open source software generally also means saving on the software costs, and allowing an enterprise to instead invest in skilling its people[1]Question3What kind of things that these Tools can do?In general, a data mining process consists of the following seven steps[1]1. Identify the business problems.2. Identify and study data sources, and select data.3. Extract and preprocess data.4. Mine the data, e.g., discover association rules or build predictive models.5. Verify the mining results.6. Deploy models in the business process.7. Measure the return on investment (ROI).Question4What kind of operations that we can do on these Tools? Data UnderstandingAccess——Various Data Source Filtering, Cleaning, and Transformation Data PreprocessingAlgorithms(include Classification, Prediction, Clustering, Association rule, and Interactive exploration ) Data Modeling Evaluationconfusion matrix, lift chart, gain chart, cluster validati
您可能关注的文档
最近下载
- 药品处方集_模版.doc VIP
- 2025年大学大二(护理学)外科护理综合实训综合测试题及答案.doc VIP
- 第四章第五节服装流行色(课件)-《服装设计基础》同步教学(高教版.服装设计与工艺专业).pptx VIP
- 故事里的端午节.ppt VIP
- 标准图集-19DX101-1 建筑电气常用数据-下册.pdf VIP
- 2025年新版《小学生规范守则》和《日常行为规范准则》.docx VIP
- 树立和践行正确的政绩观研讨发言材料.docx VIP
- 2025年大学大二(海洋渔业科学与技术)渔业资源评估测试题及答案.doc VIP
- 市政工程预算编制.pptx VIP
- 张爱玲与艾米丽_勃朗特的爱情观及文学观比较_以_倾城之恋_呼啸山庄_为例.pdf VIP
原创力文档

文档评论(0)