Using new scheduling heuristics based on resource consumption information for increasing throughput on rule‐based spam filtering systems |
| |
Authors: | David Ruano‐Ordás, Jorge Fdez‐Glez, Florentino Fdez‐Riverola, José Ramón Méndez |
| |
Affiliation: | Department of Computer Science, University of Vigo, ESEI, Ourense, Spain |
| |
Abstract: | The sharp increase in spam deliveries since the first half of 2013 has created hard‐to‐solve problems for spam filters. To fight spam adequately, the throughput of spam filtering platforms must be increased. In this context, and given the widespread use of rule‐based filtering frameworks in the spam filtering domain, this work proposes three novel scheduling strategies that optimize the time needed to classify new incoming e‐mails through intelligent management of computational resources, based on Central Processing Unit (CPU) usage and Input/Output (I/O) delays. To demonstrate the suitability of our approaches, our experiments include a comparative study against other successful heuristics previously published in the scientific literature. The results show that one of our heuristics achieves time savings of up to 10% in message filtering while keeping the same classification accuracy. Copyright © 2015 John Wiley & Sons, Ltd. |
| |
Keywords: | rule optimization schedulers; increasing filtering throughput; spam detection; anti‐spam filtering platforms; resource consumption‐based heuristics; Wirebrush4SPAM framework |