- 1、本文档共58页,可阅读全部内容。
- 2、原创力文档(book118)网站文档一经付费(服务费),不意味着购买了该文档的版权,仅供个人/单位学习、研究之用,不得用于商业用途,未经授权,严禁复制、发行、汇编、翻译或者网络传播等,侵权必究。
- 3、本站所有内容均由合作方或网友上传,本站不对文档的完整性、权威性及其观点立场正确性做任何保证或承诺!文档内容仅供研究参考,付费前请自行鉴别。如您付费,意味着您自己接受本站规则且自行承担风险,本站不退款、不进行额外附加服务;查看《如何避免下载的几个坑》。如果您已付费下载过本站文档,您可以点击 这里二次下载。
- 4、如文档侵犯商业秘密、侵犯著作权、侵犯人身权等,请点击“版权申诉”(推荐),也可以打举报电话:400-050-0827(电话支持时间:9:00-18:30)。
查看更多
弹性图-图形与拓扑语
弹性图,图形与拓扑语法Elastic Maps, Graphs, and Topological Grammars Alexander Gorban, Leicester with Andrei Zinovyev, Paris and Neil Sumner, Leicester Plan of the talk INTRODUCTION Two paradigms for data analysis: statistics and modelling Clustering and K-means Self Organizing Maps PCA and local PCA Plan of the talk 1. Principal manifolds and elastic maps The notion of of principal manifold (PM) Constructing PMs: elastic maps Adaptation and grammars 2. Application technique Projection and regression Maps and visualization of functions 3. Implementation and examples Two basic paradigms for data analysis Statistical Analysis Existence of a Probability Distribution; Statistical Hypothesis about Data Generation; Verification/Falsification of Hypothesises about Hidden Properties of Data Distribution Data Modelling We should find the Best Model for Data description; We know the Universe of Models; We know the Fitting Criteria; Learning Errors and Generalization Errors analysis for the Model Verification Example: Simplest Clustering K-means algorithm “Centers” can be lines, manifolds,… with the same algorithm SOM - Self Organizing Maps Set of nodes is a finite metric space with distance d(N,M); 0) Map set of nodes into dataspace N→f0(N); 1) Select a datapoint X (random); 2) Find a nearest fi(N) (N=NX); 3) fi+1(N) = fi(N) +wi(d(N, NX))(X- fi(N)),where wi(d) (0wi(d)1) is a decreasing cutting function. The closest node to X is moved the most in the direction of X, while other nodes are moved by smaller amounts depending on their distance from the closest node in the initial geometry. PCA and Local PCA A top secret: the difference between two basic paradigms is not crucial (Almost) Back to Statistics: Quasi-statistics: 1) delete one point from the dataset, 2) fitting,3) analysis of the error for the deleted data; The overfitting problem and smoothed data points (it is very close to non-parametric statistics) Principal manifoldsElastic maps framework Mean point P
文档评论(0)