Normalization based K-means Data Analysis Algorithm

Navdeep Kaur, Bhavika Jagga, Dinesh Gupta

Abstract


Data mining is analysis step to knowledge discovery in the database process. It is the process of extraction knowledge from large databases. Clustering is a process of partitioning a set of data (or objects) into a set of meaningful sub-classes, called clusters. Used either as a stand-alone tool to get insight into datadistribution or as a preprocessing step for other algorithms. K-means is a good clustering technique. With the proposed algorithm, normalization of data prior to clustering. Then a efficient algorithm used for clustering which is better than simple k-means algorithm.

Full Text:

PDF

References


Clifton, C. and R. Steinheiser. 1998. "Data Mining on Text",Proceedings of the 22nd Annual IEEE International Computer Software and Applications Conference, COMPSAC98, pp. 630–635.

Piatetsky-Shapiro G., Frawley W. (Eds.): “Knowledge Discovery in Databases”, AAAI Press, Menlo Park, CA, 1991.

http://www.cs.waikato.ac.nz/~ml/weka/

Frigui H. and Krishnapuram R. “Competitive Fuzzy Clustering”, IEEE 1996, Page No. 225-228.

Ester Martin, Kriegel Hans-Peter introduced the Idea of “Clustering for Mining in Large Spatial Database”.

BerkhinPavel introduced the Idea of “Survey of Clustering Data Mining Techniques”.

Hasan A. Moh. ,Chaoji V. , Salem S. and Zaki J. Moh. “Robust Partitional Clustering by Outlier and Density Insensitive Seeding”, ACM 2000.




DOI: https://doi.org/10.23956/ijarcsse.v9i5.1016

Refbacks

  • There are currently no refbacks.




© International Journals of Advanced Research in Computer Science and Software Engineering (IJARCSSE)| All Rights Reserved | Powered by Advance Academic Publisher.