Título

Algorithm and hardware architecture for the discovery of frequent sequences

Autor

OSVALDO NAVARRO GUZMAN

Colaborador

RENE ARMANDO CUMPLIDO PARRA (Asesor de tesis)

LUIS VILLASEÑOR PINEDA (Asesor de tesis)

Nivel de Acceso

Acceso Abierto

Resumen o descripción

Sequential Pattern Mining is a widely addressed problem in data mining, with

applications such as analyzing Web usage, automatic text reuse detection, analyzing

purchase behavior, among others. Nevertheless, with the dramatic increase

in data volume, the current approaches result inefficient when dealing with large

input datasets, a large number of different symbols and low minimum supports.

We propose a new sequential pattern mining algorithm, which follows a pattern growth

scheme to discover frequent patterns, that is, by recursively growing an

already known frequent pattern p using frequent symbols from the projected

database with respect to p. Our algorithm only maintains in memory a structure

of the pseudo-projections and the symbols required for the algorithm in case

it has to go back and try to grow a pattern with another valid element. Also,

we propose a hardware architecture that implements the processes of generating

pseudo-projection databases and finding frequent elements from a projection

database, which comprehends the most costly operations of our algorithm, in

order to accelerate its running time. Experimental results showed that our algorithm

has a better performance and scalability, in comparison with the UDDAG

and PLWAP algorithms. Moreover, a performance estimate showed us that our

hardware architecture significantly reduces the running time of our proposed algorithm.

To our knowledge, this is the first hardware architecture that tackles

the problem of sequential pattern mining.

Editor

Instituto Nacional de Astrofísica, Óptica y Electrónica

Fecha de publicación

2012

Tipo de publicación

Tesis de maestría

Versión de la publicación

Versión aceptada

Formato

application/pdf

Idioma

Inglés

Audiencia

Estudiantes

Investigadores

Público en general

Sugerencia de citación

Navarro-Guzman O.

Repositorio Orígen

Repositorio Institucional del INAOE

Descargas

484

Comentarios



Necesitas iniciar sesión o registrarte para comentar.