Please use this identifier to cite or link to this item: http://localhost:8081/xmlui/handle/123456789/12408
Title: CONTENT SIMILARITY BASED ANTI-PHISHING
Authors: Swagat, Konchada
Keywords: ELECTRONICS AND COMPUTER ENGINEERING;ANTI-PHISHING;PHISHING;DUPLICATE WEBSITE
Issue Date: 2011
Abstract: Phishing is a critical problem traditionally involving a deceiving mail and a duplicate website. Phishers endeavor to lure users into submitting their crucial information and use the information for illegal purposes. The menace of phishing is rampant on the Internet. A recent report of Gartner puts the losses due to phishing at more than $3.2 billion annually [1], banking and other financial institutions and their users being the most affected. But in recent times phishing has spread into new frontiers like social networking, cyber warfare, etc., Innovative phishing strategies are being developed from time to time which go unnoticed. In the midst of all the chaos, it's the user who bears the brunt of phishing attacks. Numerous anti-phishing tools have been developed with various strategies. Attempts have been done to stop phishing when the phish is in the form of a mail using mail filters[2]. Organizations which deal with highly financial transactions have chosen two-way authentication to avoid risks of phishing, but such methods could not revolutionize the web because of the additional costs and effort involved. Some anti-phishing browser tools use web related heuristics like URL usability, ip-address usage, etc., but lead to unimpressive performance. We present here, a Content Similarity based Anti-Phishing system, a ubiquitous, robust and scalable method to detect phishing. Instead of depending on phishing related heuristics, we directly address the source of phishing, the content of pages and the similarity of pages involved. A sophisticated algorithm has been presented with proof of concepts wherever required, to detect similar pages and further establish the authority of those pages based on their URL, content and newly introduced factors like site-population. The performance achieved by our system has been impressive with a true-positive rate of more than 99% and better compared to previously developed systems which were discussed.
URI: http://hdl.handle.net/123456789/12408
Other Identifiers: M.Tech
Research Supervisor/ Guide: Misra, Manoj
metadata.dc.type: M.Tech Dessertation
Appears in Collections:MASTERS' THESES (E & C)

Files in This Item:
File Description SizeFormat 
ECDG20984.pdf2.89 MBAdobe PDFView/Open


Items in DSpace are protected by copyright, with all rights reserved, unless otherwise indicated.