Please use this identifier to cite or link to this item: http://localhost:8081/xmlui/handle/123456789/2173
Title: DATA MINING TECHNIQUES FOR HIGH DIMENSIONAL DATASETS
Authors: Kumar, Gajawada Satish
Keywords: HIGH DIMENSIONAL DATASETS;DATA MINING;GENETIC ALGORITHM;ELECTRONICS AND COMPUTER ENGINEERING
Issue Date: 2012
Abstract: Real world datasets are complex due to high dimensionality. In addition to many dimensions these datasets may have other problems like missing values, limited labels etc. Hence there is scope to explore data mining techniques for such complex high dimensional datasets. Clustering methods which find clusters in full dimensional space are not appropriate to find clusters in complex data with many dimensions due to problems associated with high dimensional data. In this work we proposed projected clustering methods based on optimization methods like Genetic Algorithm (GA) to find subspace clusters present in high dimensional datasets. The optimization technique finds optimal centers of subspace clusters by optimizing a subspace cluster validation index. Full dimensional clustering methods were used in literature in the pre-processing step to classification stage to solve problems associated with dataset. There is scope to explore usage of clustering methods which find clusters in high dimensional datasets in the pre-processing step to aid classification methods. In this work we proposed several classification methods based on this idea. We have proposed a framework for classification using projected clustering where subspace clusters found are used for s.everal kinds of pre-processing steps depending on the problem associated with the dataset. The proposed methods were applied on benchmark real and synthetic datasets.
URI: http://hdl.handle.net/123456789/2173
Other Identifiers: M.Tech
Research Supervisor/ Guide: Toshniwal, Durga
metadata.dc.type: M.Tech Dessertation
Appears in Collections:MASTERS' THESES (E & C)

Files in This Item:
File Description SizeFormat 
ECDG21480.pdf1.7 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.