1.
Kavita Yadav
– Dept. Of Comp. Science & Engg., Rungta College Of Engg. And Tech., Bhilai, Chhattisgarh, India.
2.
Om Prakash Dewangan
– Dept. Of Comp. Science & Engg., Rungta College Of Engg. And Tech., Bhilai, Chhattisgarh, India.
Abstract
Data mining refers to extraction of information from huge chunks of the dataset. Its also called information mining. It is exercised in numerous fields like medicine, environment, education, market and business analysis, fraud detection, customer retention, crime, etc. In this research work data mining, text mining and web mining techniques used for data analysis and discovers patterns. This review paper covers clustering techniques (K-Means clustering technique, density based clustering and cosine similarity), web mining and text mining techniques. Clustering helps to put objects into the same group. Cosine similarity measure helps in finding similarity among different texts.
Keywords Clustering, Cosine similarity, Data mining, Density based clustering, K-means clustering.