|
ABSTRACT
ISSN: 0975-4024
Title |
: |
Towards Spam Mail Detection using Robust Feature Evaluated with Feature Selection Techniques |
Authors |
: |
Josin Thomas, Vinod P, Nisha S Raj |
Keywords |
: |
Dimensionality Reduction, Feature Selection, Spam Filtering, Classifier |
Issue Date |
: |
Oct - Nov 2014 |
Abstract |
: |
Filtering of spam emails is a significant operation in email system. The efficiency of this process is determined by many factors such as number of features, representation of samples, classifier etc. This study covers all these factors and aims to find the optimal settings for email spam filtering. Twelve feature selection methods extensively used in text categorization are implemented to synthesize prominent attributes from different categories (i.e. header, subject and body of the mails). Optimal classification performances are obtained for Weighted Mutual Information and Log-TFIDF-Cosine(LTC) feature selection methods for header and body features of the mail with Random Forest and Support Vector Machine classifiers respectively. An overall F1-measure of 0.978 with 0.44s prediction time is achieved when 20% of the original feature length is considered. |
Page(s) |
: |
2144-2158 |
ISSN |
: |
0975-4024 |
Source |
: |
Vol. 6, No.5 |
|