Document details

Processing and extracting data from Dicionário Aberto

Author(s): Simões, Alberto cv logo 1 ; Almeida, J. J. cv logo 2 ; Farinha, Rita cv logo 3

Date: 2010

Persistent ID: http://hdl.handle.net/1822/16475

Origin: RepositóriUM - Universidade do Minho

Subject(s): Knowledge extraction; Text mining


Description
Synonyms dictionaries are useful resources for natural language processing. Unfortunately their availability in digital format is limited, as publishing companies do not release their dictionaries in open digital formats. Dicionário-Aberto is an open and free digital synonyms dictionary for the Portuguese language. It is under public domain which makes it usable for any task. Synonyms dictionaries are commonly used for the extraction of relations between words, constructing structures similar to WordNet, or just the extraction of lists of words of specific type. This article presents Dicionário-Aberto, discusses its characteristics and the type of information present on it. Then, we describe an API to help on processing Dicionário-Aberto without the need to tackle with the dictionary format. Finally, we analyze the results on some data extraction experiments.
Document Type Article
Language English
delicious logo  facebook logo  linkedin logo  twitter logo 
degois logo
mendeley logo

Related documents



    Financiadores do RCAAP

Fundação para a Ciência e a Tecnologia Universidade do Minho   Governo Português Ministério da Educação e Ciência Programa Operacional da Sociedade do Conhecimento EU