IJCSE Abstract

ABSTRACT

Title

Survey on Feature Selection in Document Clustering

Authors

MS. K.Mugunthadevi, MRS. S.C. Punitha, Dr..M. Punithavalli

Keywords

text mining, feature selection, information retrieval, ontology, document clustering

Issue Date

March 2011.

Abstract

Text mining is to research technologies to discover useful knowledge from enormous collections of documents, and to develop a system to provide knowledge and to support in decision making. Basically cluster means a group of similar data, document clustering means segregating the data into different groups of similar data. Clustering is a fundamental data analysis technique used for various applications such as biology, psychology, control and signal processing, information theory and mining technologies. Text mining is not a stand-alone task that human analysts typically engage in. The goal is to transform text composed of everyday language into a structured, database format. In this way, heterogeneous documents are summarized and presented in a uniform manner. Among others, the challenging problems of text clustering are big volume, high dimensionality and complex semantics.

Page(s)

1240-1244

ISSN

0975–3397

Source

Vol. 3, Issue.03

IJCSE

ARTICLES IN PRESS

ISSUES

CALL FOR PAPERS

TOPICS

AUTHORS

EDITORIAL BOARD

REVIEWERS

ABSTRACT