Detalhes do Documento

Testing extensive use of NER tools in article classification and a statistical ...

Autor(es): Lourenço, Anália cv logo 1 ; Conover, Michael cv logo 2 ; Wong, Andrew cv logo 3 ; Pan, Fengxia cv logo 4 ; Abi-Haidar, Alaa cv logo 5 ; Nematzadeh, Azadeh cv logo 6 ; Shatkay, Hagit cv logo 7 ; Rocha, L. M. cv logo 8

Data: 2010

Identificador Persistente: http://hdl.handle.net/1822/23803

Origem: RepositóriUM - Universidade do Minho


Descrição
We participated (as Team 81) in the Article Classification (ACT) and Interaction Method (IMT) subtasks of the Protein-Protein Interaction task of the Biocreative III Challenge. For the ACT we pursued an extensive testing of available Named Entity Recognition (NER) tools, and used the most promising ones to extend our the Variable Trigonometric Threshold (VTT) linear classifier we successfully used in BioCreative II and II.5. Our main goal was to exploit the power of available NER tools to aid in the document classification of documents relevant for Protein-Protein Interaction. We also used a Support Vector Machine Classifier on NER features for comparison purposes. For the IMT, we experimented with a primarily statistical approach, as opposed to a deeper natural language processing strategy; in a nutshell, we exploited classifiers, simple pattern matching, and ranking of candidate matches using statistical considerations. We will also report on our efforts to integrate our IMT method sentence classifier into our ACT pipeline.
Tipo de Documento Documento de conferência
Idioma Inglês
delicious logo  facebook logo  linkedin logo  twitter logo 
degois logo
mendeley logo

Documentos Relacionados



    Financiadores do RCAAP

Fundação para a Ciência e a Tecnologia Universidade do Minho   Governo Português Ministério da Educação e Ciência Programa Operacional da Sociedade do Conhecimento União Europeia