Big Data and Data Mining 1: Overview Summary.ppt, 62 pages

  • Uploaded by: bbnm58850 (recipient of upload earnings)
  • Published: 2017-05-12
  • File size: 3.87 MB
Data mining on relational databases: the traditional approach is facing challenges; common tuning techniques; the all-in-one appliance concept; big data storage and analysis.

The big data market is in a phase of explosive growth. National leaders attach great importance to big data, and big data industries are flourishing across the regions. Politburo member Wang Yang made big data a priority during his tenure in Guangdong, and internet industry leader Lei Jun has strongly promoted a national big data development plan. There are five success factors for putting big data into practice: infrastructure, industry chain, talent, technology, and legislation. Big data infrastructure is the foundation and holds enormous business opportunities.

User behavior analysis, anti-fraud, anti-money laundering.

Cluster filesystem: a distributed filesystem that is not a single server with a set of clients, but a cluster of servers that all work together to provide high-performance service to their clients. To the clients the cluster is transparent; it is just "the filesystem", and the filesystem software deals with distributing requests to elements of the storage cluster. Examples include the HP (DEC) Tru64 cluster; Spinnaker, a clustered NAS (NFS) service; and Panasas ActiveScale, a cluster filesystem.

Parallel filesystem: a filesystem with support for parallel applications, in which all nodes may be accessing the same files at the same time, concurrently reading and writing. Data for a single file is striped across multiple storage nodes to provide scalable performance to individual files. Examples include Panasas ActiveScale, Lustre, GPFS and Sistina. NFSv4.1 will feature an extension to the NFS standard that supports parallel IO. These definitions overlap.

Level 4: distributed file systems, such as Locus, Sun Network File System (NFS) and CMU Andrew, where multiple users who are physically dispersed in a network of autonomous computers share the use of a common file system. New issues: location transparency (dynamically mapping file names to storage sites), availability, fault tolerance, replication, and consistency.

Distributed storage is software for synchronizing files and directories locally and between many remote computers connected via LAN or the Internet (wiki). Consistency, availability and performance tend to be mutually contradictory goals in a distributed system.

An object is an ordered set of bytes (within the OSD) that is associated with a unique identifier.

… Old problems, new requirements. Cloud storage is a model of networked online storage where data is
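
The parallel filesystem note above says that data for a single file is striped across multiple storage nodes. A minimal sketch of that idea follows, assuming round-robin placement and a fixed stripe unit size. The function names `stripe` and `reassemble` and their parameters are hypothetical, chosen for illustration, and are not taken from Lustre, GPFS, Panasas or any other system named in the notes.

```python
from typing import List


def stripe(data: bytes, num_nodes: int, stripe_size: int) -> List[List[bytes]]:
    """Cut data into fixed-size stripe units and deal them round-robin to nodes.

    Assumption: plain in-memory lists stand in for storage nodes.
    """
    nodes: List[List[bytes]] = [[] for _ in range(num_nodes)]
    for offset in range(0, len(data), stripe_size):
        unit_index = offset // stripe_size
        # Unit k goes to node k mod num_nodes.
        nodes[unit_index % num_nodes].append(data[offset:offset + stripe_size])
    return nodes


def reassemble(nodes: List[List[bytes]]) -> bytes:
    """Rebuild the file by reading stripe units back in round-robin order."""
    out = bytearray()
    depth = max((len(units) for units in nodes), default=0)
    for row in range(depth):
        for units in nodes:
            if row < len(units):
                out += units[row]
    return bytes(out)


layout = stripe(b"abcdefghij", num_nodes=3, stripe_size=2)
restored = reassemble(layout)
```

Because consecutive stripe units sit on different nodes, a large sequential read can in principle be served by several nodes at once, which is the scalability argument the note makes. Real systems add metadata, redundancy and failure handling that this sketch leaves out.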

