A Proposition for Sequence Mining Using Pattern Structures

1 DM2L - Data Mining and Machine Learning, LIRIS - Laboratoire d'InfoRmatique en Image et Systèmes d'information
2 ORPAILLEUR - Knowledge representation, reasoning, Inria Nancy - Grand Est, LORIA - NLPKD - Department of Natural Language Processing & Knowledge Discovery

Abstract: In this article we present a novel approach to rare sequence mining using pattern structures. In particular, we are interested in mining closed sequences, a type of maximal sub-element that provides a succinct description of the patterns in a sequence database. We present and describe a sequence pattern structure model in which rare closed subsequences can be easily encoded. We also discuss and characterize the search space of closed sequences and, through the notion of sequence alignments, provide an intuitive implementation of a similarity operator for the sequence pattern structure based on directed acyclic graphs. Finally, we provide an experimental evaluation of our approach in comparison with state-of-the-art closed sequence mining algorithms, showing that our approach can substantially outperform them when dealing with large regions of the search space.
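
To make the notion of a closed subsequence concrete, the following is a minimal brute-force sketch and not the method of the article: it does not use pattern structures, sequence alignments, or directed acyclic graphs. It assumes that each sequence is a tuple of single items and that a pattern is closed when no proper super-sequence has the same support. The function names `is_subsequence`, `support`, `all_subsequences`, and `closed_subsequences` are hypothetical and chosen only for illustration.

```python
from itertools import combinations


def is_subsequence(pattern, sequence):
    """Return True if `pattern` can be obtained from `sequence` by deleting items."""
    it = iter(sequence)
    # `item in it` advances the iterator, so the order of items is respected.
    return all(item in it for item in pattern)


def support(pattern, database):
    """Count the sequences of the database that contain `pattern`."""
    return sum(is_subsequence(pattern, seq) for seq in database)


def all_subsequences(sequence):
    """Enumerate every non-empty subsequence (exponential, small inputs only)."""
    subs = set()
    for size in range(1, len(sequence) + 1):
        for positions in combinations(range(len(sequence)), size):
            subs.add(tuple(sequence[i] for i in positions))
    return subs


def closed_subsequences(database):
    """Map each closed subsequence to its support.

    Assumption: a pattern is closed if no proper super-sequence occurring in
    the database has the same support.
    """
    candidates = set()
    for seq in database:
        candidates |= all_subsequences(seq)
    supports = {p: support(p, database) for p in candidates}

    closed = {}
    for p, s in supports.items():
        has_equal_super = any(
            len(q) > len(p) and supports[q] == s and is_subsequence(p, q)
            for q in candidates
        )
        if not has_equal_super:
            closed[p] = s
    return closed
```

For the database `[("a", "b", "c"), ("a", "c"), ("b", "c")]`, this sketch keeps four closed patterns: `("c",)` with support 3, `("a", "c")` and `("b", "c")` with support 2, and `("a", "b", "c")` with support 1. Rare closed patterns could then be selected by keeping only those whose support falls below a chosen threshold, which is one common reading of "rare" and an assumption here.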

Authors: Victor Codocedo, Guillaume Bosc, Mehdi Kaytoue, Jean-François Boulicaut, Amedeo Napoli

Source: https://hal.archives-ouvertes.fr/

