A stratified traffic sampling methodology for seeing the big picture
Affiliation: 1. Integrated Circuits and Systems Laboratory, Center for Strategic Technologies of the Northeast, Recife, Brazil; 2. Informatics Center, Federal University of Pernambuco, Recife, Brazil; 1. Informatics Center, Federal University of Pernambuco, Recife, Brazil; 2. Department of Electrical Engineering, University of Brasília, Brazil; 3. Department of Electronics and Systems, Federal University of Pernambuco, Recife, Brazil
Abstract: This work explores two statistical techniques, stratified sampling and cluster analysis, as tools for deriving traffic properties at the flow level. Our results show that selecting samples adequately leads to significant improvements and allows further statistical analysis. Although stratified sampling is a well-known technique, the way we classify the data before sampling is new. We evaluate two partitioning clustering methods, Clustering Large Applications (CLARA) and K-means, and validate their outcomes by using them as thresholds for stratified sampling. We show that dividing the population by flow size yields accurate estimates for both flow size and flow duration. The presented sampling and clustering techniques achieve greater data reduction than existing methods, on the order of 0.1%, while maintaining good accuracy for estimates of the sum, mean, and variance of both flow duration and flow size.
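The idea of using cluster outcomes as stratum thresholds can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how thresholds are derived from the clusters or how samples are allocated, so the log transform of flow sizes, the midpoint rule between adjacent centroids, the proportional allocation, and all function and field names (`kmeans_1d`, `thresholds_from_centroids`, `stratify`, `stratified_mean`, `size`, `duration`) are assumptions made for illustration. Only K-means is shown; CLARA is omitted.

```python
import math
import random
from statistics import mean


def kmeans_1d(values, k, iters=50):
    """Plain Lloyd's algorithm on scalar values; returns sorted centroids."""
    svals = sorted(values)
    # Assumption: start from evenly spaced quantiles of the data.
    centroids = [svals[int((i + 0.5) * len(svals) / k)] for i in range(k)]
    for _ in range(iters):
        groups = [[] for _ in range(k)]
        for v in values:
            nearest = min(range(k), key=lambda j: abs(v - centroids[j]))
            groups[nearest].append(v)
        # Keep the old centroid if a cluster ends up empty.
        new = [mean(g) if g else centroids[j] for j, g in enumerate(groups)]
        if new == centroids:
            break
        centroids = new
    return sorted(centroids)


def thresholds_from_centroids(centroids):
    """Assumption: cut halfway between adjacent centroids."""
    return [(a + b) / 2 for a, b in zip(centroids, centroids[1:])]


def stratify(flows, cuts, key):
    """Assign each flow to the stratum given by how many cuts its key exceeds."""
    strata = [[] for _ in range(len(cuts) + 1)]
    for f in flows:
        strata[sum(key(f) > c for c in cuts)].append(f)
    return strata


def stratified_mean(strata, field, fraction, rng, min_per_stratum=2):
    """Weighted mean of per-stratum sample means (proportional allocation)."""
    total_n = sum(len(s) for s in strata)
    estimate = 0.0
    for s in strata:
        if not s:
            continue
        n = min(len(s), max(min_per_stratum, round(fraction * len(s))))
        sample = rng.sample(s, n)
        estimate += (len(s) / total_n) * mean(f[field] for f in sample)
    return estimate


if __name__ == "__main__":
    rng = random.Random(42)
    # Synthetic heavy-tailed flows, purely for demonstration.
    flows = []
    for _ in range(20000):
        size = 100 * rng.paretovariate(1.2)
        flows.append({"size": size, "duration": rng.expovariate(1.0) * math.log1p(size)})

    log_size = lambda f: math.log(f["size"])
    cuts = thresholds_from_centroids(kmeans_1d([log_size(f) for f in flows], k=3))
    strata = stratify(flows, cuts, key=log_size)
    for field in ("size", "duration"):
        est = stratified_mean(strata, field, fraction=0.01, rng=rng)
        print(field, "estimate:", est, "true:", mean(f[field] for f in flows))
```

Stratifying on size alone and then estimating duration from the same strata mirrors the abstract's claim that flow size is enough to partition the population; how well that works on real traces depends on how strongly size and duration are related.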
This article is indexed in ScienceDirect and other databases.