Journal of Software:2013.24(2):255-265

(东南大学 计算机科学与工程学院,江苏 南京 210096;计算机网络和信息集成教育部重点实验室(东南大学),江苏 南京 210096;School of Information Technology, Illinois State University, Normal 61790-5150, USA)
Estimation Algorithms of the Flow Number from Sampled Packets on Approximate Approaches
CHENG Guang,TANG Yong-Ning
(School of Computer Science and Engineering, Southeast University, Nanjing 210096, China;Key Laboratory of Computer Network and Information Integration (Southeast University), Ministry of Education, Nanjing 210096, China;School of Information Technology, Illinois State University, Normal 61790-5150, USA)
Chart / table
Similar Articles
Article :Browse 2827   Download 3339
Received:March 16, 2012    Revised:August 20, 2012
> 中文摘要: 维护每个报文的流记录需要占用大量测量资源.目前已有多种抽样技术估计网络流统计信息,然而精确地估计出流数统计信息是目前的研究难点.提出了Integral和Iteration 两种基于报文抽样样本估计网络流数的算法.Integral算法只需使用抽样流长为1的流数信息就可以近似推导出未抽样的流数.Iteration算法通过建立迭代函数估计未抽样流数,然后根据未抽样流数和已抽样的流数推断出原始流量的流数.采用CERNET(China education andresearch network)骨干网络链路数据将这两种算法与EM(expectation maximization)算法进行对比,表明Iteration算法具有较好的精度和性能.
中文关键词: 报文抽样  网络测量  流数  二项分布
Abstract:Recording flow statistics for each network packet is resource-intensive. Various sampling techniques are used to estimate flow statistics. However, the estimation accuracy based on the sampling remains a significant challenge. This paper introduces both sampling techniques denoted as Integral and Iteration algorithms, which can accurately infer the number of original flows from the sampled flow records. The Integral algorithm uses only the number of sampled flows with one sampled packet to approximately deduce the number of unsampled flows. The Iteration algorithm can estimate the number of unsampled flows using an iteration method. The number of original flows can be precisely estimated according to both the number of sampled flows and unsampled flows. Both the algorithms are compared to the EM (expectation maximization) algorithm using multiple traffic traces collected from CERNET (China education and research network) backbone. The result shows that the Iteration algorithm is superior to the EM algorithm and can provide highly accurate estimation on the number of original flows.
文章编号:     中图分类号:    文献标志码:
基金项目:国家自然科学基金(60973123);国家重点基础研究发展计划(973)(2009CB320505);江苏省科技计划项目(科技支撑计划——工业部分)(BE2011173) 国家自然科学基金(60973123);国家重点基础研究发展计划(973)(2009CB320505);江苏省科技计划项目(科技支撑计划——工业部分)(BE2011173)
Foundation items:
Reference text:


CHENG Guang,TANG Yong-Ning.Estimation Algorithms of the Flow Number from Sampled Packets on Approximate Approaches.Journal of Software,2013,24(2):255-265