###
DOI:
Journal of Software:2007.18(4):933-942

高维类别属性数据流离群点快速检测算法
周晓云,孙志挥,张柏礼,杨宜东
(东南大学,计算机科学与工程系,江苏,南京,210096)
A Fast Outlier Detection Algorithm for High Dimensional Categorical Data Streams
ZHOU Xiao-Yun,SUN Zhi-Hui,ZHANG Bai-Li,YANG Yi-Dong
()
Abstract
Chart / table
Reference
Similar Articles
Article :Browse 3793   Download 3911
Received:November 02, 2005    Revised:February 23, 2006
> 中文摘要: 提出类别属性数据流数据离群度量--加权频繁模式离群因子(weighted frequent pattern outlier factor,简称WFPOF),并在此基础上给出一种快速数据流离群点检测算法FODFP-Stream(fast outlier detection for high dimensional categorical data streams based on frequent pattern).该算法通过动态发现和维护频繁模式来计算离群度,能够有效地处理高维类别属性数据流,并可进一步扩
Abstract:This paper considers the problem of outlier detection in data stream, proposes a new metric called weighted frequent pattern outlier factor for categorical data streams, and presents a novel fast outlier detection algorithm named FODFP-Stream (fast outlier detection for high dimensional categorical data streams based on frequent pattern). FODFP-Stream computes the outlier measure through discovering and maintaining the frequent patterns dynamically, and can deal with the high dimensional categorical data streams effectively. FODFP-Stream can also be extended to resolve continuous attributes and mixed attributes data streams. The experimental results on synthetic and real data sets show the promising availabilities of the approaches.
文章编号:     中图分类号:    文献标志码:
基金项目:Supported by the National Natural Science Foundation of China under Grant No.70371015(国家自然科学基金);the Doctor Science Research Foundation of the Education Ministry of China under Grant No.20040286009(国家教育部高等学校博士学科点科研基金) Supported by the National Natural Science Foundation of China under Grant No.70371015(国家自然科学基金);the Doctor Science Research Foundation of the Education Ministry of China under Grant No.20040286009(国家教育部高等学校博士学科点科研基金)
Foundation items:
Reference text:

周晓云,孙志挥,张柏礼,杨宜东.高维类别属性数据流离群点快速检测算法.软件学报,2007,18(4):933-942

ZHOU Xiao-Yun,SUN Zhi-Hui,ZHANG Bai-Li,YANG Yi-Dong.A Fast Outlier Detection Algorithm for High Dimensional Categorical Data Streams.Journal of Software,2007,18(4):933-942