|
ABSTRACT
Title |
: |
Comparing Neural Network Approach with N-Gram Approach for Text Categorization |
Authors |
: |
A. Suresh Babu, P.N.V.S.Pavan Kumar |
Keywords |
: |
N-Gram, Neural Network, Language Identification, Text Categorization |
Issue Date |
: |
Jan 2010 |
Abstract |
: |
This paper compares a neural network approach with an n-gram approach to text categorization and demonstrates that the neural network approach performs similarly to the n-gram approach while requiring much less judging time. Both methods demonstrated here are aimed at language identification. The presence of particular characters and words, together with statistical information on word lengths, is used as the feature vector. In an identification experiment with Asian languages, the neural network approach achieved a 98% correct classification rate with 500 bytes of text and was five times faster than the n-gram based approach.
|
Page(s) |
: |
80-83 |
ISSN |
: |
0975–3397 |
Source |
: |
Vol. 2, Issue.1 |
|
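The two ideas named in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the abstract does not say which n-gram scheme was used, so the rank-order profile comparison below is an assumed, commonly used variant, and every function name, marker character, marker word, and training sentence is hypothetical. The `feature_vector` function only shows the kind of input the abstract describes (character presence, word presence, word-length statistics); the neural network itself is omitted.

```python
from collections import Counter


def char_ngrams(text, n=3):
    """Return all overlapping character n-grams of the space-padded text."""
    padded = f" {text.lower()} "
    return [padded[i:i + n] for i in range(len(padded) - n + 1)]


def ngram_profile(text, n=3, top_k=300):
    """Map each of the top_k most frequent n-grams to its rank (0 = most frequent)."""
    counts = Counter(char_ngrams(text, n))
    # Sort by descending count, then alphabetically so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))[:top_k]
    return {gram: rank for rank, (gram, _) in enumerate(ranked)}


def out_of_place(doc_profile, lang_profile):
    """Sum of rank differences; n-grams missing from the language get a fixed penalty."""
    penalty = len(lang_profile)
    return sum(
        abs(rank - lang_profile[gram]) if gram in lang_profile else penalty
        for gram, rank in doc_profile.items()
    )


def identify(text, lang_profiles, n=3):
    """Pick the language whose profile is closest to the document profile."""
    doc = ngram_profile(text, n)
    return min(lang_profiles, key=lambda lang: out_of_place(doc, lang_profiles[lang]))


def feature_vector(text, marker_chars, marker_words):
    """Build a feature vector from character presence, word presence, and mean word length.

    marker_chars and marker_words are illustrative assumptions, not the
    features used in the paper.
    """
    words = text.lower().split()
    lengths = [len(w) for w in words]
    mean_len = sum(lengths) / len(lengths) if lengths else 0.0
    char_flags = [1.0 if c in text else 0.0 for c in marker_chars]
    word_set = set(words)
    word_flags = [1.0 if w in word_set else 0.0 for w in marker_words]
    return char_flags + word_flags + [mean_len]


if __name__ == "__main__":
    # Toy training sentences, far smaller than any realistic corpus.
    profiles = {
        "en": ngram_profile("the quick brown fox jumps over the lazy dog and the cat"),
        "es": ngram_profile("el rapido zorro marron salta sobre el perro perezoso y el gato"),
    }
    print(identify("the dog and the fox", profiles))
    print(feature_vector("el niño come", ["ñ", "ß"], ["the", "el"]))
```

The n-gram path compares a document against every language profile at judging time, while a feature vector of this kind is computed once and passed through a fixed-size network, which is one plausible reason for the speed difference the abstract reports.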