Input data consists of 8580 text records(news records) in sparse format without labels.
This program -
• Implements the DBSCAN clustering algorithm.
• Deals with text datain document-term sparse matrix format.
• Designs a proximity function for text data.
• Uses dimensionality reduction to deal with Curse of Dimensionality.