Paper abstract

Adequate Condensed Representations of Patterns

Arnaud Soulet - Universite de Tours, France
Bruno Cremilleux - Universite de Caen, France

Session: Pattern Mining
Springer Link: http://dx.doi.org/10.1007/978-3-540-87479-9_18

Patterns are at the core of the discovery of a lot of knowledge from data but their uses are limited due to their huge number and their mining cost. During the last decade, many works addressed the concept of condensed representation w.r.t. frequency queries. Such representations are several orders of magnitude smaller than the size of the whole collections of patterns, and also enable us to regenerate the frequency information of any pattern. In this paper, we propose a framework for condensed representations w.r.t. a large set of new and various queries named condensable functions based on interestingness measures. Such condensed representations are achieved thanks to new closure operators automatically derived from each condensable function to get adequate condensed representations. We propose a generic algorithm MicMac to efficiently mine the adequate condensed representations. Exepriments show both the conciseness of the adequate condensed representations and the efficiency of our algorithm.