Abstract |
: |
Today’s real world databases are highly susceptible to
noisy, missing and inconsistent data due to their typically huge
size data and their origin from multiple, heterogeneous sources.
Hence, pre-processing of data is necessary to help improve the
quality of data and consequently the mining results. There are
number of data pre-processing techniques. In this paper, we
would like to discuss two different approaches for data preprocessing
one based on XML and other based on text file. But
the basic algorithm and steps involved in pre-processing are
considered same for both the approaches.
|