Document details

Spam email filtering using network-level properties

Author(s): Cortez, Paulo, 1971- cv logo 1 ; Correia, André cv logo 2 ; Sousa, Pedro cv logo 3 ; Rocha, Miguel cv logo 4 ; Rio, Miguel cv logo 5

Date: 2010

Persistent ID: http://hdl.handle.net/1822/10829

Origin: RepositóriUM - Universidade do Minho

Subject(s): Anti-Spam filtering; Text Mining; Naive Bayes; Support Vector Machines


Description
Spam is serious problem that affects email users (e.g. phishing attacks, viruses and time spent reading unwanted messages). We propose a novel spam email filtering approach based on network-level attributes (e.g. the IP sender geographic coordinates) that are more persistent in time when compared to message content. This approach was tested using two classifiers, Naive Bayes (NB) and Support Vector Machines (SVM), and compared against bag-of-words models and eight blacklists. Several experiments were held with recent collected legitimate (ham) and non legitimate (spam) messages, in order to simulate distinct user profiles from two countries (USA and Portugal). Overall, the network-level based SVM model achieved the best discriminatory performance. Moreover, preliminary results suggests that such method is more robust to phishing attacks.
Document Type Conference Object
Language English
delicious logo  facebook logo  linkedin logo  twitter logo 
degois logo
mendeley logo

Related documents



    Financiadores do RCAAP

Fundação para a Ciência e a Tecnologia Universidade do Minho   Governo Português Ministério da Educação e Ciência Programa Operacional da Sociedade do Conhecimento EU