|
ABSTRACT
Title |
: |
A Novel Approach for Text Categorization of Unorganized data based with Information Extraction |
Authors |
: |
Suneetha Manne, Dr. S. sameen Fatima |
Keywords |
: |
Text Categorization; Text Mining; Information Extraction; Feature Term Extraction; Information Retrieval; Pyramidal Model; Term Frequency. |
Issue Date |
: |
July 2011 |
Abstract |
: |
Internet has made a profound change in the lives of many enthusiastic innovators and researchers. The information available on the web has knocked the doors of Knowledge Discovery leading to a new Information era. Unfortunately, most Search Engines provide web content which is irrelevant to the information intended to the browser. Many Text Categorization techniques for web content have been developed, to recognize the given document’s category but failed to make trust worthy results. This paper primarily focuses on web content categorization based on classic summarization technique by enabling the classification at word level. The web document is preprocessed first which involves filtering the content with classical techniques and then is converted into organized data. The organized data is then treated with predefined hierarchical categorical set to identify theexact category. |
Page(s) |
: |
2846-2854 |
ISSN |
: |
0975–3397 |
Source |
: |
Vol. 3, Issue.7 |
|