Document Details

Query driven sequence pattern mining

Author(s): Azevedo, Paulo J.; Ferreira, Pedro Gabriel

Date: 2006

Persistent Identifier: http://hdl.handle.net/1822/6588

Source: RepositóriUM - Universidade do Minho

Subject(s): Bioinformatics; Databases


Description
The discovery of frequent patterns in biological sequences has many applications, ranging from classification and clustering to understanding sequence structure and function. This paper presents an algorithm that discovers the frequent sequence patterns (motifs) that occur in a query sequence with respect to a database of sequences. The query guides the mining process, so only patterns present in the query are reported. Two main types of patterns can be identified: flexible gap patterns and rigid gap patterns. The user can choose to report all patterns or only the maximal ones. Constraints and substitution sets are pushed directly into the mining process. Experimental evaluation shows the efficiency of the algorithm and the usefulness and relevance of the extracted patterns.
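The idea of letting the query guide the search can be illustrated with a minimal sketch. This is not the authors' algorithm: it is a simplified assumption that considers only contiguous substrings of the query (no gaps, constraints, or substitution sets), counts support as the number of database sequences containing the candidate, and optionally keeps only maximal patterns. The function name `query_driven_patterns` and its parameters are hypothetical.

```python
from typing import Dict, List


def query_driven_patterns(query: str, database: List[str], min_support: int,
                          maximal_only: bool = False) -> Dict[str, int]:
    """Map each frequent contiguous substring of `query` to its support.

    Support = number of database sequences that contain the substring.
    Simplified illustration only: gaps and substitution sets are not modelled.
    """
    frequent: Dict[str, int] = {}
    for start in range(len(query)):
        # Grow the candidate one symbol at a time from this start position.
        for end in range(start + 1, len(query) + 1):
            candidate = query[start:end]
            if candidate in frequent:
                continue  # already counted from an earlier start position
            support = sum(1 for seq in database if candidate in seq)
            if support < min_support:
                # Every longer extension contains this candidate, so it
                # cannot be frequent either; stop growing from this start.
                break
            frequent[candidate] = support
    if maximal_only:
        # Keep a pattern only if no other frequent pattern contains it.
        frequent = {p: s for p, s in frequent.items()
                    if not any(p != q and p in q for q in frequent)}
    return frequent


if __name__ == "__main__":
    db = ["MKVLT", "AMKVQ", "KVLAA"]  # toy sequences, made up for the example
    print(query_driven_patterns("MKVLA", db, min_support=2, maximal_only=True))
```

Because candidates are generated from the query alone, patterns that are frequent in the database but absent from the query are never enumerated, which is the restriction the abstract describes.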
Document Type: Article
Language: English
