Adjacency Matrix Based Full-Text Indexing Models
DOI:
Author:
Affiliation:

Clc Number:

Fund Project:

  • Article
  • |
  • Figures
  • |
  • Metrics
  • |
  • Reference
  • |
  • Related
  • |
  • Cited by
  • |
  • Materials
  • |
  • Comments
    Abstract:

    With the rapid growth of online text information and user accesses, query-processing efficiency has become the major bottleneck of information retri eval (IR) systems. This paper proposes two new full-text indexing models to impr ove query-processing efficiency of IR systems. By using directed graph to repres ent text string, the adjacency matrix of text string is introduced. Two approach es are proposed to implement the adjacency matrix of text string, which leads to two new full-text indexing models, I.e., adjacency matrix based inverted file and adjacency matrix based PAT array. Query algorithms for the new models are dev eloped and performance comparisons between the new models and the traditional models are carried out. Experiments over real-world text collections are conducted to validate the effectiveness and efficiency of the new models. The new models can improve query-processing efficiency considerably at the cost of much less amount of extra storage overhead compared to the size of original text database, so are suitable for applications of large-scale text databases.

    Reference
    Related
    Cited by
Get Citation

周水庚,胡运发,关佶红.基于邻接矩阵的全文索引模型.软件学报,2002,13(10):1933-1942

Copy
Share
Article Metrics
  • Abstract:
  • PDF:
  • HTML:
  • Cited by:
History
  • Received:January 18,2002
  • Revised:July 01,2002
  • Adopted:
  • Online:
  • Published:
You are the firstVisitors
Copyright: Institute of Software, Chinese Academy of Sciences Beijing ICP No. 05046678-4
Address:4# South Fourth Street, Zhong Guan Cun, Beijing 100190,Postal Code:100190
Phone:010-62562563 Fax:010-62562533 Email:jos@iscas.ac.cn
Technical Support:Beijing Qinyun Technology Development Co., Ltd.

Beijing Public Network Security No. 11040202500063