期刊界 All Journals 搜尽天下杂志传播学术成果专业期刊搜索期刊信息化学术搜索

1.

Structural entropy and metamorphic malware

Donabelle Baysa Richard M. Low Mark Stamp 《Journal in Computer Virology》2013,9(4):179-192

Metamorphic malware is capable of changing its internal structure without altering its functionality. A common signature is nonexistent in highly metamorphic malware and, consequently, such malware can remain undetected under standard signature scanning. In this paper, we apply previous work on structural entropy to the metamorphic detection problem. This technique relies on an analysis of variations in the complexity of data within a file. The process consists of two stages, namely, file segmentation and sequence comparison. In the segmentation stage, we use entropy measurements and wavelet analysis to segment files. The second stage measures the similarity of file pairs by computing an edit distance between the sequences of segments obtained in the first stage. We apply this similarity measure to the metamorphic detection problem and show that we obtain strong results in certain challenging cases. 相似文献

2.

An entropy-based measure of software complexity

Harrison W. 《IEEE transactions on pattern analysis and machine intelligence》1992,18(11):1025-1029

It is proposed that the complexity of a program is inversely proportional to the average information content of its operators. An empirical probability distribution of the operators occurring in a program is constructed, and the classical entropy calculation is applied. The performance of the resulting metric is assessed in the analysis of two commercial applications totaling well over 130000 lines of code. The results indicate that the new metric does a good job of associating modules with their error spans (averaging number of tokens between error occurrences) 相似文献

3.

An entropy-based uncertainty measure of process models 总被引：1，自引：0，他引：1

Jae-Yoon Jung Jorge Cardoso 《Information Processing Letters》2011,111(3):135-141

In managing business processes, the process uncertainty and variability are significant factors causing difficulties in prediction and decision making, which evokes and augments the importance and need of process measures for systematic analysis. We propose an entropy-based process measure to quantify the uncertainty of business process models. The proposed measure enables capturing the dynamic behavior of processes, in contrast to previous work which focused on providing measures for the static aspect of process models. 相似文献

4.

Nonnegative matrix factorization and metamorphic malware detection

Ling Yeong Tyng Sani Nor Fazlida Mohd Abdullah Mohd Taufik Hamid Nor Asilah Wati Abdul 《Journal in Computer Virology》2019,15(3):195-208

Metamorphic malware change their internal code structure by adopting code obfuscation technique while maintaining their malicious functionality during each infection. This causes change of their signature pattern across each infection and makes signature based detection particularly difficult. In this paper, through static analysis, we use similarity score from matrix factorization technique called Nonnegative Matrix Factorization for detecting challenging metamorphic malware. We apply this technique using structural compression ratio and entropy features and compare our results with previous eigenvector-based techniques. Experimental results from three malware datasets show this is a promising technique as the accuracy detection is more than 95%.

相似文献

5.

Hybrid emulation for bypassing anti-reversing techniques and analyzing malware

Choi Seokwoo Chang Taejoo Yoon Sung-woo Park Yongsu 《The Journal of supercomputing》2021,77(1):471-497

The Journal of Supercomputing - Malware uses a variety of anti-reverse engineering techniques, which makes its analysis difficult. Dynamic analysis tools, e.g., debuggers, DBI (Dynamic Binary... 相似文献

6.

Detection of metamorphic and virtualization-based malware using algebraic specification

Matt Webster Grant Malcolm 《Journal in Computer Virology》2009,5(3):221-245

We present an overview of the latest developments in the detection of metamorphic and virtualization-based malware using an algebraic specification of the Intel 64 assembly programming language. After giving an overview of related work, we describe the development of a specification of a subset of the Intel 64 instruction set in Maude, an advanced formal algebraic specification tool. We develop the technique of metamorphic malware detection based on equivalence-in-context so that it is applicable to imperative programming languages in general, and we give two detailed examples of how this might be used in a practical setting to detect metamorphic malware. We discuss the application of these techniques within anti-virus software, and give a proof-of-concept system for defeating detection counter-measures used by virtualization-based malware, which is based on our Maude specification of Intel 64. Finally, we compare formal and informal approaches to malware detection, and give some directions for future research. 相似文献

7.

On normalized compression distance and large malware

Rebecca Schuller Borbely 《Journal in Computer Virology》2016,12(4):235-242

Normalized Compression Distance (NCD) is a popular tool that uses compression algorithms to cluster and classify data in a wide range of applications. Existing discussions of NCD’s theoretical merit rely on certain theoretical properties of compression algorithms. However, we demonstrate that many popular compression algorithms do not seem to satisfy these theoretical properties. We explore the relationship between some of these properties and file size, demonstrate that this theoretical problem is actually a practical problem for classifying malware with large file sizes, and propose some variants of NCD that mitigate this problem. 相似文献

8.

Simple substitution distance and metamorphic detection

Gayathri Shanmugam Richard M. Low Mark Stamp 《Journal in Computer Virology》2013,9(3):159-170

To evade signature-based detection, metamorphic viruses transform their code before each new infection. Software similarity measures are a potentially useful means of detecting such malware. We can compare a given file to a known sample of metamorphic malware and compute their similarity—if they are sufficiently similar, we classify the file as malware of the same family. In this paper, we analyze an opcode-based software similarity measure inspired by simple substitution cipher cryptanalysis. We show that the technique provides a useful means of classifying metamorphic malware. 相似文献

9.

Chi-squared distance and metamorphic virus detection

Annie H. Toderici Mark Stamp 《Journal in Computer Virology》2013,9(1):1-14

Metamorphic malware changes its internal structure with each generation, while maintaining its original behavior. Current commercial antivirus software generally scan for known malware signatures; therefore, they are not able to detect metamorphic malware that sufficiently morphs its internal structure. Machine learning methods such as hidden Markov models (HMM) have shown promise for detecting hacker-produced metamorphic malware. However, previous research has shown that it is possible to evade HMM-based detection by carefully morphing with content from benign files. In this paper, we combine HMM detection with a statistical technique based on the chi-squared test to build an improved detection method. We discuss our technique in detail and provide experimental evidence to support our claim of improved detection. 相似文献

10.

An affinity-based new local distance function and similarity measure for kNN algorithm 总被引：1，自引：0，他引：1

Gautam Bhattacharya 《Pattern recognition letters》2012,33(3):356-363

In this paper, we propose a modified version of the k-nearest neighbor (kNN) algorithm. We first introduce a new affinity function for distance measure between a test point and a training point which is an approach based on local learning. A new similarity function using this affinity function is proposed next for the classification of the test patterns. The widely used convention of k, i.e., k = [√N] is employed, where N is the number of data used for training purpose. The proposed modified kNN algorithm is applied on fifteen numerical datasets from the UCI machine learning data repository. Both 5-fold and 10-fold cross-validations are used. The average classification accuracy, obtained from our method is found to exceed some well-known clustering algorithms. 相似文献

11.

An interobject distance measure based on medial axes retrieved fromdiscrete distance maps

Forsgren P.-O. Seideman P. 《IEEE transactions on pattern analysis and machine intelligence》1990,12(4):390-397

A method that measures the distance between extended objects of nonregular shape is presented. The distance measure is an average of a set of minimal point-to-point distances between the borders of the objects. The set of points is collected with a well-defined criterion based on processing of distance values on a connected medial axis formed between the objects 相似文献

12.

Robust Hausdorff distance measure for face recognition

Vivek E. P 《Pattern recognition》2007,40(2):431-442

Face is considered to be one of the biometrics in automatic person identification. The non-intrusive nature of face recognition makes it an attractive choice. For face recognition system to be practical, it should be robust to variations in illumination, pose and expression as humans recognize faces irrespective of all these variations. In this paper, an attempt to address these issues is made using a new Hausdorff distance-based measure. The proposed measure represent the gray values of pixels in face images as vectors giving the neighborhood intensity distribution of the pixels. The transformation is expected to be less sensitive to illumination variations besides preserving the appearance of face embedded in the original gray image. While the existing Hausdorff distance-based measures are defined between the binary edge images of faces which contains primarily structural information, the proposed measure gives the dissimilarity between the appearance of faces. An efficient method to compute the proposed measure is presented. The performance of the method on bench mark face databases shows that it is robust to considerable variations in pose, expression and illumination. Comparison with some of the existing Hausdorff distance-based methods shows that the proposed method performs better in many cases. 相似文献

13.

Multi-objective self-adaptive differential evolution with elitist archive and crowding entropy-based diversity measure 总被引：1，自引：1，他引：1

Yao-Nan Wang Liang-Hong Wu Xiao-Fang Yuan 《Soft Computing - A Fusion of Foundations, Methodologies and Applications》2010,14(3):193-209

A self-adaptive differential evolution algorithm incorporate Pareto dominance to solve multi-objective optimization problems is presented. The proposed approach adopts an external elitist archive to retain non-dominated solutions found during the evolutionary process. In order to preserve the diversity of Pareto optimality, a crowding entropy diversity measure tactic is proposed. The crowding entropy strategy is able to measure the crowding degree of the solutions more accurately. The experiments were performed using eighteen benchmark test functions. The experiment results show that, compared with three other multi-objective optimization evolutionary algorithms, the proposed MOSADE is able to find better spread of solutions with better convergence to the Pareto front and preserve the diversity of Pareto optimal solutions more efficiently. 相似文献

14.

An investigation of byte n-gram features for malware classification

Edward Raff Richard Zak Russell Cox Jared Sylvester Paul Yacci Rebecca Ward Anna Tracy Mark McLean Charles Nicholas 《Journal in Computer Virology》2018,14(1):1-20

Malware classification using machine learning algorithms is a difficult task, in part due to the absence of strong natural features in raw executable binary files. Byte n-grams previously have been used as features, but little work has been done to explain their performance or to understand what concepts are actually being learned. In contrast to other work using n-gram features, in this work we use orders of magnitude more data, and we perform feature selection during model building using Elastic-Net regularized Logistic Regression. We compute a regularization path and analyze novel multi-byte identifiers. Through this process, we discover significant previously unreported issues with byte n-gram features that cause their benefits and practicality to be overestimated. Three primary issues emerged from our work. First, we discovered a flaw in how previous corpora were created that leads to an over-estimation of classification accuracy. Second, we discovered that most of the information contained in n-grams stem from string features that could be obtained in simpler ways. Finally, we demonstrate that n-gram features promote overfitting, even with linear models and extreme regularization. 相似文献

15.

基于微处理器的车流量数据的检测与分析

张建王传琦李瑞强闫海波熊伟《微计算机信息》2007,23(35):25-26,6

本文提出了矩阵式红外线车流量识别这个全新的技术，利用微处理器，设计出了经济、高效、准确的车流量检测装置，能实现对被监测路段的车辆行驶方向、车型、多车并行、车流量统计等复杂情形的识别。这有利于获取道路交通情况，促进交通管理。相似文献

16.

An incremental mixed data clustering method using a new distance measure

Fakhroddin Noorbehbahani Sayyed Rasoul Mousavi Abdolreza Mirzaei 《Soft Computing - A Fusion of Foundations, Methodologies and Applications》2015,19(3):731-743

相似文献

17.

Gray Hausdorff distance measure for comparing face images

Vivek E.P. Sudha N. 《Information Forensics and Security, IEEE Transactions on》2006,1(3):342-349

Human face recognition is considered to be one of the toughest problems in the domain of pattern recognition. The variations in face images due to differing expression, pose and illumination are some of the key issues to be addressed in developing a face recognition system. In this paper, a new measure called gray Hausdorff distance (denoted by H/sub pg/) is proposed to compare the gray images of faces directly. An efficient algorithm for computation of the new measure is presented. The computation time is linear in the size of the image. The performance of this measure is evaluated on benchmark face databases. The face recognition system based on the new measure is found to be robust to pose and expression variations, as well as to slight variation in illumination. Comparison studies show that the proposed measure performs better than the existing ones in most cases. 相似文献

18.

An entropy-based algorithm for data elimination in time-driven software instrumentation

Ahmet Özmen^{Author Vitae} 《Journal of Systems and Software》2009,82(5):907-913

While monitoring, instrumented long running parallel applications generate huge amount of instrumentation data. Processing and storing this data incurs overhead, and perturbs the execution. A technique that eliminates unnecessary instrumentation data and lowers the intrusion without loosing any performance information is valuable for tool developers. This paper presents a new algorithm for software instrumentation to measure the amount of information content of instrumentation data to be collected. The algorithm is based on entropy concept introduced in information theory, and it makes selective data collection for a time-driven software monitoring system possible. 相似文献

19.

An entropy-based evaluation method for knowledge bases of medical information systems

《Expert systems with applications》2016

相似文献

20.

Information measure for analyzing specific spiking patterns and applications to LGN bursts

Gaudry KS Reinagel P 《Network (Bristol, England)》2008,19(1):69-94

Neural spiking responses can include a variety of spiking patterns. However, neither the mere presence of the patterns nor the pattern's frequency indicates that the pattern conveys distinct stimulus information. Here, we present an in-depth analysis of a Pattern Information measure, which quantifies how informative it is to distinguish a particular pattern of spikes from either a single spike or an another pattern. (1) We show how a shuffle-controlled estimation method minimizes the impact of sampling bias. (2) We describe how the Pattern Information could arise from time-varying firing rates, and we demonstrate an analysis to determine whether Pattern Information associated with a particular pattern captures structure not contained in the time-varying firing rate. (3) Because patterns may contain several spikes or inter-spike intervals, we extend the Pattern Information measure to determine whether the complete pattern carries information distinct from sub-patterns containing only a fraction of these spikes or intervals. (4) The Pattern Information is applied to determine whether a plurality of patterns carry distinct stimulus information from one another. In particular, we demonstrate these concepts using data from cells of the lateral geniculate nucleus (LGN), thereby extending previous analysis demonstrating that distinguishes between bursts of spikes and single spikes providing visual information. 相似文献