A simple model-based approach to variable selection in classification and clusteringReport as inadecuate




A simple model-based approach to variable selection in classification and clustering - Download this document for free, or read online. Document in PDF available to download.

Published in: Canadian Journal Of Statistics-Revue Canadienne De Statistique (ISSN: 0319-5724), vol. 43, num. 2, p. 157-175 Hoboken: Wiley-Blackwell, 2015

Clustering and classification of replicated data is often performed using classical techniques that inappropriately treat the data as unreplicated, or by complex modern ones that are computationally demanding. In this paper, we introduce a simple approach based on a spike-and-slab mixture model that is fast, automatic, allows classification, clustering and variable selection in a single framework, and can handle replicated or unreplicated data. Simulation shows that our approach compares well with other recently proposed methods. The ideas are illustrated by application to microarray and metabolomic data. The Canadian Journal of Statistics 43: 157-175; 2015 (c) 2015 Statistical Society of Canada

Keywords: Classification ; Clustering ; High-dimensional data ; Hierarchical partitioning ; Laplace distribution ; Mixture model ; Variable selection Reference EPFL-ARTICLE-212575doi:10.1002/cjs.11241View record in Web of Science





Author: Partovi Nia, Vahid; Davison, Anthony C.

Source: https://infoscience.epfl.ch/record/212575?ln=en







Related documents