PARALLELISED HIDING OF SENSITIVE PATTERNS FOR PRIVACY PRESERVATION

Agrawal, Nishtha

Please use this identifier to cite or link to this item: http://localhost:8081/jspui/handle/123456789/15318

Full metadata record

DC Field	Value	Language
dc.contributor.author	Agrawal, Nishtha	-
dc.date.accessioned	2022-02-07T07:13:33Z	-
dc.date.available	2022-02-07T07:13:33Z	-
dc.date.issued	2019-05	-
dc.identifier.uri	http://localhost:8081/xmlui/handle/123456789/15318	-
dc.description.abstract	Frequent itemset mining is a field of data mining where frequent itemsets are extracted from the dataset. This may reveal some sensitive information which is not meant to be shared with third party. Privacy Preserving Data Mining approaches are used to hide that sensitive information from the dataset but along with that they also have some side effects on the datasets. Among the three types of Privacy Preserving Data Mining methods, Heuristic-based are better in terms of scalability and time efficiency as compared to the border-based and exact approaches. Heuristics-based Privacy Preserving Data Mining approaches are used to sanitize the dataset i.e., removal of sensitive patterns from the transactions, based on some heuristics. So far most of the existing techniques used for hiding sensitive patterns make use of candidate-based pattern generation methods for generating frequent patterns which takes a lot of time because a large candidate itemset space is generated. In this work, we have proposed FP-Tree based Sensitive Patterns Removal (FSR) approach. This proposed approach makes use of candidate-less pattern generation technique for hiding the sensitive patterns which reduces a lot of time as compared to previous techniques. Experiments have been performed on benchmark dataset where the proposed approach has resulted into the sanitized data with substantially better utility and better time efficiency as compared to the existing approaches. But these sequential approaches are not able to cope up with the big data. So, there is another proposed approach- Parallelised FP-Tree based Sensitive Patterns Removal (PFSR), which is the parallel implementation of Proposed FSR approach on spark parallel computing framework. This parallelised approach is scalable enough for handling large dataset. Experiments performed using benchmark datasets shows that Proposed PFSR approach scales better as compared to Proposed FSR approach, and other existing sequential approaches.	en_US
dc.description.sponsorship	INDIAN INSTITUTE OF TECHNOLOGY, ROORKEE	en_US
dc.language.iso	en	en_US
dc.publisher	I I T ROORKEE	en_US
dc.subject	Frequent Itemset Mining	en_US
dc.subject	Sensitive Patterns Removal	en_US
dc.subject	Parallelised FP-Tree based Sensitive Patterns Removal (PFSR	en_US
dc.subject	Preserving Data Mining Approaches	en_US
dc.title	PARALLELISED HIDING OF SENSITIVE PATTERNS FOR PRIVACY PRESERVATION	en_US
dc.type	Other	en_US
Appears in Collections:	MASTERS' THESES (CSE)

Files in This Item:

File	Description	Size	Format
G29152.pdf		510.19 kB	Adobe PDF	View/Open

Show simple item record