JSS Journal of Statistical Software
MMMMMM YYYY, Volume VV, Issue II.

Rmixmod: The R Package of the Model-Based Unsupervised, Supervised and Semi-Supervised Classification Mixmod Library

Rémi Lebret (UTC, CNRS, Univ. Lille 1)
Serge Iovleff (Univ. Lille 1, CNRS, Inria)
Florent Langrognet (CNRS, Univ. F.-Comté)
Christophe Biernacki (Univ. Lille 1, CNRS, Inria)
Gilles Celeux (Inria, Univ. Paris-Sud)
Gérard Govaert (UTC, CNRS)

Abstract

Mixmod is a well-established software package for fitting a mixture model of multivariate Gaussian or multinomial components to a given data set with either a clustering, a density estimation or a discriminant analysis purpose. The Rmixmod S4 package provides a bridge between the C++ core library of Mixmod (mixmodLib) and the R statistical computing environment. In this article, we give an overview of the model-based clustering and classification methods, and we show how the R package Rmixmod can be used for clustering and discriminant analysis.

Keywords: model-based clustering, discriminant analysis, mixture models, visualization, R, Rmixmod.

1. Introduction

Clustering and discriminant analysis (or classification) methods are among the most important techniques in multivariate statistical learning. The goal of cluster analysis is to partition the observations into groups (“clusters”) so that the pairwise dissimilarities between those assigned to the same cluster tend to be smaller than those in different clusters. The goal of classification is to design a decision function from a learning data set to assign new data to groups a priori known. Mixture modeling supposes that the data are an i.i.d. sample from some population described by a probability de
