A Scalable Classification Algorithm Exploring Database Technology

    Abstract:

    This paper studies an efficient and scalable classification algorithm that tightly integrates classification technology with relational database technology. An approach based on grouping and counting is proposed to build the classifier, and it uses SQL (Structured Query Language), as provided by the relational database system, to implement the major computation tasks. To improve performance, several optimization strategies, a strategy for pruning redundant rules, and a feature selection method integrated into the process of finding classification rules are also proposed. With these methods and strategies, the classification algorithm can quickly find a compact set of classification rules from a large volume of data. In addition to achieving the same classification accuracy as current popular classification algorithms and a high training speed, the algorithm's distinctive features include linear scalability with respect to the number of training samples and the number of attributes, and simplicity of implementation.
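
The abstract does not give the SQL statements themselves, so the following is only a minimal sketch of what "grouping and counting" in the database could look like, not the authors' implementation. It assumes SQLite (through Python's standard `sqlite3` module) in place of whatever database system the paper used. The table `training`, its columns, the helper names `class_counts` and `candidate_rules`, and the `min_confidence` threshold are all hypothetical, chosen for illustration.

```python
import sqlite3


def class_counts(conn, table, attribute, class_col):
    """Count training rows per (attribute value, class label) with one GROUP BY query."""
    # Identifiers are interpolated directly, so this sketch assumes trusted names.
    query = (
        f'SELECT "{attribute}", "{class_col}", COUNT(*) '
        f'FROM "{table}" GROUP BY "{attribute}", "{class_col}"'
    )
    return {(value, label): n for value, label, n in conn.execute(query)}


def candidate_rules(counts, min_confidence=1.0):
    """Turn grouped counts into (attribute value, class label, confidence) rules."""
    # Total number of rows for each attribute value, across all class labels.
    totals = {}
    for (value, _label), n in counts.items():
        totals[value] = totals.get(value, 0) + n
    rules = []
    for (value, label), n in sorted(counts.items()):
        confidence = n / totals[value]
        if confidence >= min_confidence:
            rules.append((value, label, confidence))
    return rules


# Hypothetical toy training set; the database does the counting work.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE training (outlook TEXT, windy TEXT, play TEXT)")
conn.executemany(
    "INSERT INTO training VALUES (?, ?, ?)",
    [
        ("sunny", "no", "no"),
        ("sunny", "yes", "no"),
        ("overcast", "no", "yes"),
        ("overcast", "yes", "yes"),
        ("rain", "no", "yes"),
        ("rain", "yes", "no"),
    ],
)

counts = class_counts(conn, "training", "outlook", "play")
rules = candidate_rules(counts)
for value, label, confidence in rules:
    print(f"IF outlook = {value} THEN play = {label} (confidence {confidence:.2f})")
```

In this sketch each query scans the training table once per attribute, which is at least consistent with the linear scalability the abstract claims, although the paper's own optimization and pruning strategies are not reproduced here.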

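Citation:

Liu Hongyan (刘红岩), Lu Hongjun (陆宏钧), Chen Jian (陈剑). A scalable classification algorithm exploring database technology. Journal of Software (软件学报), 2002, 13(6): 1075-1081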

History
  • Received: July 31, 2000
  • Revised: November 23, 2000