首页 | 本学科首页   官方微博 | 高级检索  
     


Two-stage ELM for phishing Web pages detection using hybrid features
Authors:Wei?Zhang  Email author" target="_blank">Qingshan?JiangEmail author  Lifei?Chen  Chengming?Li
Affiliation:1.Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences,Shenzhen,China;2.Shenzhen College of Advanced Technology,University of Chinese Academy of Sciences,Shenzhen,China;3.Fujian Normal University,Fuzhou,China
Abstract:Increasing high volume phishing attacks are being encountered every day due to attackers’ high financial returns. Recently, there has been significant interest in applying machine learning for phishing Web pages detection. Different from literatures, this paper introduces predicted labels of textual contents to be part of the features and proposes a novel framework for phishing Web pages detection using hybrid features consisting of URL-based, Web-based, rule-based and textual content-based features. We achieve this framework by developing an efficient two-stage extreme learning machine (ELM). The first stage is to construct classification models on textual contents of Web pages using ELM. In particular, we take Optical Character Recognition (OCR) as an assistant tool to extract textual contents from image format Web pages in this stage. In the second stage, a classification model on hybrid features is developed by using a linear combination model-based ensemble ELMs (LC-ELMs), with the weights calculated by the generalized inverse. Experimental results indicate the proposed framework is promising for detecting phishing Web pages.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号