首页 | 本学科首页   官方微博 | 高级检索  
     


VILO: a rapid learning nearest-neighbor classifier for malware triage
Authors:Arun Lakhotia  Andrew Walenstein  Craig Miles  Anshuman Singh
Affiliation:1. Center for Advanced Computer Studies, University of Louisiana at Lafayette, Lafayette, LA, USA
2. School of Computing and Informatics, University of Louisiana at Lafayette, Lafayette, LA, USA
Abstract:VILO is a lazy learner system designed for malware classification and triage. It implements a nearest neighbor (NN) algorithm with similarities computed over Term Frequency $\times $ Inverse Document Frequency (TFIDF) weighted opcode mnemonic permutation features (N-perms). Being an NN-classifier, VILO makes minimal structural assumptions about class boundaries, and thus is well suited for the constantly changing malware population. This paper presents an extensive study of application of VILO in malware analysis. Our experiments demonstrate that (a) VILO is a rapid learner of malware families, i.e., VILO’s learning curve stabilizes at high accuracies quickly (training on less than 20 variants per family is sufficient); (b) similarity scores derived from TDIDF weighted features should primarily be treated as ordinal measurements; and (c) VILO with N-perm feature vectors outperforms traditional N-gram feature vectors when used to classify real-world malware into their respective families.
Keywords:
本文献已被 SpringerLink 等数据库收录!
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号