Document details

Probabilistic synSet based concept location

Author(s): Carvalho, Nuno; Almeida, José João; Pereira, Maria João; Henriques, Pedro

Date: 2012

Persistent ID: http://hdl.handle.net/10198/7062

Origin: Biblioteca Digital do IPB

Subject(s): Program comprehension; Concept location; Identifier analysis; Program and problem domains


Description
Concept location is a common task in program comprehension techniques, essential in many approaches used for software care and software evolution. An important goal of this process is to discover a mapping between source code and human-oriented concepts. Although programs are written in a strict, formal language, natural language terms and sentences, such as identifiers (variable or function names), constant strings, and comments, can still be found embedded in them. Using terminology concepts and natural language processing techniques, these terms can be exploited to discover clues about which real-world concepts the source code addresses. This work extends the symbol tables built by compilers with ontology-driven constructs, and extends the synonym sets defined in linguistics with Probabilistic SynSets created automatically from software-domain parallel corpora. Using a relational algebra, it then creates semantic bridges between program elements and human-oriented concepts to enhance concept location tasks.
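As a rough illustration of the idea described above, the following minimal sketch splits identifiers into words and scores them against a small probabilistic synonym set. It is not the authors' implementation: the `PSYNSETS` table, its probability values, and the helper names `split_identifier`, `concept_score`, and `locate` are all hypothetical, and the symbol-table extension and relational algebra from the paper are not modelled here.

```python
import re

# Hypothetical probabilistic synsets: each concept term maps to related
# terms with made-up probabilities (illustrative values, not from the paper).
PSYNSETS = {
    "delete": {"delete": 0.6, "remove": 0.3, "erase": 0.1},
    "file": {"file": 0.7, "document": 0.2, "archive": 0.1},
}


def split_identifier(identifier):
    """Split a snake_case or camelCase identifier into lowercase words."""
    words = []
    for part in re.split(r"[_\W]+", identifier):
        # Acronym runs, capitalized or lowercase words, and digit runs.
        words.extend(re.findall(r"[A-Z]+(?![a-z])|[A-Z]?[a-z]+|\d+", part))
    return [w.lower() for w in words]


def concept_score(identifier, concept):
    """Return the best synset probability among the identifier's words."""
    synset = PSYNSETS.get(concept, {})
    return max(
        (synset.get(word, 0.0) for word in split_identifier(identifier)),
        default=0.0,
    )


def locate(identifiers, concept, threshold=0.0):
    """Rank the identifiers whose score for the concept exceeds threshold."""
    scored = [(name, concept_score(name, concept)) for name in identifiers]
    hits = [(name, score) for name, score in scored if score > threshold]
    return sorted(hits, key=lambda pair: pair[1], reverse=True)


if __name__ == "__main__":
    names = ["removeDocument", "erase_file", "openSocket"]
    for concept in ("delete", "file"):
        print(concept, locate(names, concept))
```

Under these assumed values, `removeDocument` would rank above `erase_file` for the concept "delete", while `openSocket` would not be matched at all. Taking the maximum over an identifier's words is one simple design choice; a real system might combine the probabilities differently.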
Document Type: Article
Language: English
