DSpace Repository

PARALLELISED HIDING OF SENSITIVE PATTERNS FOR PRIVACY PRESERVATION

Show simple item record

dc.contributor.author Agrawal, Nishtha
dc.date.accessioned 2022-02-07T07:13:33Z
dc.date.available 2022-02-07T07:13:33Z
dc.date.issued 2019-05
dc.identifier.uri http://localhost:8081/xmlui/handle/123456789/15318
dc.description.abstract Frequent itemset mining is a field of data mining where frequent itemsets are extracted from the dataset. This may reveal some sensitive information which is not meant to be shared with third party. Privacy Preserving Data Mining approaches are used to hide that sensitive information from the dataset but along with that they also have some side effects on the datasets. Among the three types of Privacy Preserving Data Mining methods, Heuristic-based are better in terms of scalability and time efficiency as compared to the border-based and exact approaches. Heuristics-based Privacy Preserving Data Mining approaches are used to sanitize the dataset i.e., removal of sensitive patterns from the transactions, based on some heuristics. So far most of the existing techniques used for hiding sensitive patterns make use of candidate-based pattern generation methods for generating frequent patterns which takes a lot of time because a large candidate itemset space is generated. In this work, we have proposed FP-Tree based Sensitive Patterns Removal (FSR) approach. This proposed approach makes use of candidate-less pattern generation technique for hiding the sensitive patterns which reduces a lot of time as compared to previous techniques. Experiments have been performed on benchmark dataset where the proposed approach has resulted into the sanitized data with substantially better utility and better time efficiency as compared to the existing approaches. But these sequential approaches are not able to cope up with the big data. So, there is another proposed approach- Parallelised FP-Tree based Sensitive Patterns Removal (PFSR), which is the parallel implementation of Proposed FSR approach on spark parallel computing framework. This parallelised approach is scalable enough for handling large dataset. Experiments performed using benchmark datasets shows that Proposed PFSR approach scales better as compared to Proposed FSR approach, and other existing sequential approaches. en_US
dc.description.sponsorship INDIAN INSTITUTE OF TECHNOLOGY, ROORKEE en_US
dc.language.iso en en_US
dc.publisher I I T ROORKEE en_US
dc.subject Frequent Itemset Mining en_US
dc.subject Sensitive Patterns Removal en_US
dc.subject Parallelised FP-Tree based Sensitive Patterns Removal (PFSR en_US
dc.subject Preserving Data Mining Approaches en_US
dc.title PARALLELISED HIDING OF SENSITIVE PATTERNS FOR PRIVACY PRESERVATION en_US
dc.type Other en_US


Files in this item

This item appears in the following Collection(s)

Show simple item record