DSpace Repository

FRAMEWORK FOR FASTER IMPLEMENTATION OF UNSUPERVISED CLUSTERING ALGORITHMS

Show simple item record

dc.contributor.author Bhushan, Anant
dc.date.accessioned 2014-11-30T05:46:22Z
dc.date.available 2014-11-30T05:46:22Z
dc.date.issued 2010
dc.identifier M.Tech en_US
dc.identifier.uri http://hdl.handle.net/123456789/12177
dc.guide Singh, Kuldip
dc.guide Mittal, Ankush
dc.description.abstract Clustering, particularly unsupervised clustering is central to a large number of com-puting application which involve machine learning and information retrieval. Algo-rithmic methods of improving the execution times by using filtering algorithm and kd-trees have been successful but do not provide scope for further improvements. The emergence of multi-core procdssors and their easy availability and low cost has made it possible to have increased computing power. The need of the hour is to have algorithms that can harness the increased computing power available at our disposal. In this work a novel approach has been presented which can reduce the running time of the clustering algorithms by exploiting the parallel computing architectures available today. We utilize the MPI libraries for creating parallel execution threads on multicore processors.Our approach involves adding a pre-processing and post-processing step to the parallel implementation of clustering using filtering algorithm.The preprocessing step is for finding groups of dimensions which have similar characteristics and which can therefore yield better quality clusters. These sub-groups of similar dimensions are clubbed together for parallel clustering operations in the subsequent steps, based on a similarity metric. The sub-groups of dimensions are created with an overlapping dimension among adjacent groups to facilitate merging of cluster centers during the post-processing step. The parallel clustering step produces overlapping cluster centers for the sub-groups of dimensions. The post-processing step takes the clusters created by the sub-groups of dimensions and merges the cluster centers based on the overlapping dimensions. The feasibility of the framework has been demonstrated through an implementation on multi-spectral image clustering using the filtering algorithm a.nd significantly re-duced running times were obtained. The pre-processing step involved the calculation of the kurtosis of the image data for calculating the similarity metric and grouping into sub-groups. The overhead involved in execution of the pre and post-processing steps was less than one percent of the time taken for clustering the data in parallel. en_US
dc.language.iso en en_US
dc.subject ELECTRONICS AND COMPUTER ENGINEERING en_US
dc.subject FASTER IMPLEMENTATION en_US
dc.subject UNSUPERVISED CLUSTERING en_US
dc.subject FRAMEWORK en_US
dc.title FRAMEWORK FOR FASTER IMPLEMENTATION OF UNSUPERVISED CLUSTERING ALGORITHMS en_US
dc.type M.Tech Dessertation en_US
dc.accession.number G20083 en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record