Please use this identifier to cite or link to this item: http://localhost:8081/xmlui/handle/123456789/2173
Full metadata record
DC FieldValueLanguage
dc.contributor.authorKumar, Gajawada Satish-
dc.date.accessioned2014-09-26T13:37:56Z-
dc.date.available2014-09-26T13:37:56Z-
dc.date.issued2012-
dc.identifierM.Techen_US
dc.identifier.urihttp://hdl.handle.net/123456789/2173-
dc.guideToshniwal, Durga-
dc.description.abstractReal world datasets are complex due to high dimensionality. In addition to many dimensions these datasets may have other problems like missing values, limited labels etc. Hence there is scope to explore data mining techniques for such complex high dimensional datasets. Clustering methods which find clusters in full dimensional space are not appropriate to find clusters in complex data with many dimensions due to problems associated with high dimensional data. In this work we proposed projected clustering methods based on optimization methods like Genetic Algorithm (GA) to find subspace clusters present in high dimensional datasets. The optimization technique finds optimal centers of subspace clusters by optimizing a subspace cluster validation index. Full dimensional clustering methods were used in literature in the pre-processing step to classification stage to solve problems associated with dataset. There is scope to explore usage of clustering methods which find clusters in high dimensional datasets in the pre-processing step to aid classification methods. In this work we proposed several classification methods based on this idea. We have proposed a framework for classification using projected clustering where subspace clusters found are used for s.everal kinds of pre-processing steps depending on the problem associated with the dataset. The proposed methods were applied on benchmark real and synthetic datasets.en_US
dc.language.isoenen_US
dc.subjectHIGH DIMENSIONAL DATASETSen_US
dc.subjectDATA MININGen_US
dc.subjectGENETIC ALGORITHMen_US
dc.subjectELECTRONICS AND COMPUTER ENGINEERINGen_US
dc.titleDATA MINING TECHNIQUES FOR HIGH DIMENSIONAL DATASETSen_US
dc.typeM.Tech Dessertationen_US
dc.accession.numberG21480en_US
Appears in Collections:MASTERS' THESES (E & C)

Files in This Item:
File Description SizeFormat 
ECDG21480.pdf1.7 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.