FRAMEWORK FOR FASTER IMPLEMENTATION OF UNSUPERVISED CLUSTERING ALGORITHMS

Bhushan, Anant

DSpace Home
→
ELECTRONICS AND COMMUNICATION ENGINEERING (FORMERLY ELECTRONICS & COMPUTER ENGINEERING)
→
MASTERS' THESES (E & C)
→
View Item

dc.contributor.author	Bhushan, Anant
dc.date.accessioned	2014-11-30T05:46:22Z
dc.date.available	2014-11-30T05:46:22Z
dc.date.issued	2010
dc.identifier	M.Tech	en_US
dc.identifier.uri	http://hdl.handle.net/123456789/12177
dc.guide	Singh, Kuldip
dc.guide	Mittal, Ankush
dc.description.abstract	Clustering, particularly unsupervised clustering is central to a large number of com-puting application which involve machine learning and information retrieval. Algo-rithmic methods of improving the execution times by using filtering algorithm and kd-trees have been successful but do not provide scope for further improvements. The emergence of multi-core procdssors and their easy availability and low cost has made it possible to have increased computing power. The need of the hour is to have algorithms that can harness the increased computing power available at our disposal. In this work a novel approach has been presented which can reduce the running time of the clustering algorithms by exploiting the parallel computing architectures available today. We utilize the MPI libraries for creating parallel execution threads on multicore processors.Our approach involves adding a pre-processing and post-processing step to the parallel implementation of clustering using filtering algorithm.The preprocessing step is for finding groups of dimensions which have similar characteristics and which can therefore yield better quality clusters. These sub-groups of similar dimensions are clubbed together for parallel clustering operations in the subsequent steps, based on a similarity metric. The sub-groups of dimensions are created with an overlapping dimension among adjacent groups to facilitate merging of cluster centers during the post-processing step. The parallel clustering step produces overlapping cluster centers for the sub-groups of dimensions. The post-processing step takes the clusters created by the sub-groups of dimensions and merges the cluster centers based on the overlapping dimensions. The feasibility of the framework has been demonstrated through an implementation on multi-spectral image clustering using the filtering algorithm a.nd significantly re-duced running times were obtained. The pre-processing step involved the calculation of the kurtosis of the image data for calculating the similarity metric and grouping into sub-groups. The overhead involved in execution of the pre and post-processing steps was less than one percent of the time taken for clustering the data in parallel.	en_US
dc.language.iso	en	en_US
dc.subject	ELECTRONICS AND COMPUTER ENGINEERING	en_US
dc.subject	FASTER IMPLEMENTATION	en_US
dc.subject	UNSUPERVISED CLUSTERING	en_US
dc.subject	FRAMEWORK	en_US
dc.title	FRAMEWORK FOR FASTER IMPLEMENTATION OF UNSUPERVISED CLUSTERING ALGORITHMS	en_US
dc.type	M.Tech Dessertation	en_US
dc.accession.number	G20083	en_US