Description: A MATLAB spectral clustering package to deal with large data sets. Our tool can handle large data sets (200,000 RCV1 data) on a 4GB memory general machine. Spectral clustering algorithm has been shown to be more effective in finding clusters than some traditional algorithms such as kmeans. To perform clustering on large data sets, we implement various ways of approximating the dense similarity matrix, including nearest neighbors and the Nystrom method.
File list (Check if you may need any files):
spectralclustering-1.0
......................\data
......................\....\corel_100_NN_sym_distance.mat
......................\....\corel_10_NN_sym_distance.mat
......................\....\corel_150_NN_sym_distance.mat
......................\....\corel_15_NN_sym_distance.mat
......................\....\corel_200_NN_sym_distance.mat
......................\....\corel_20_NN_sym_distance.mat
......................\....\corel_50_NN_sym_distance.mat
......................\....\corel_5_NN_sym_distance.mat
......................\....\corel_feature.mat
......................\....\corel_label.mat
......................\gen_nn_distance.m
......................\k_means.m
......................\nmi.m
......................\nystrom.m
......................\README
......................\sc.m
......................\script_nystrom.m
......................\script_sc.m