###
DOI:
Journal of Software:2008.19(10):2706-2719

基于P2P的Web搜索技术
方启明,杨广文,武永卫,郑纬民
(清华大学 计算机科学与技术系 清华信息科学与技术国家实验室(筹),北京 100084)
P2P Web Search Technology
FANG Qi-Ming,YANG Guang-Wen,WU Yong-Wei,ZHENG Wei-Min
()
Abstract
Chart / table
Reference
Similar Articles
Article :Browse 10201   Download 7903
Received:July 30, 2007    Revised:February 25, 2008
> 中文摘要: Web搜索引擎已经成为人们从海量Web信息中快速找到所需信息的重要工具,随着Web数据量的爆炸性增长,传统的集中式搜索引擎已经越来越不能满足人们不断增长的信息获取需求.随着对等网络(peer-to-peer,简称P2P)技术的快速发展,人们提出了基于P2P的Web搜索技术并迅速成为研究热点.研究的目的是对现有的基于P2P的Web搜索技术进行总结,以期为进一步研究指明方向.首先分析了基于P2P的Web搜索面临的诸多挑战;然后重点总结分析了基于P2P的Web搜索的各项关键技术的研究现状,包括系统拓扑结构、数据存放策略、查询路由机制、索引切分策略、数据集选择、相关性排序、网页收集方法等;最后对已有的3个较有特色的基于P2P的Web搜索原型系统进行了介绍.
Abstract:Web search engine has become a very important tool for finding information efficiently from the massive Web data. With the explosive growth of the Web data, traditional centralized search engines become harder to catch up with the growing step of people's information needs. With the rapid development of peer-to-peer (P2P) technology, the notion of P2P Web search has been proposed and quickly becomes a research focus. The goal of this paper is to give a brief summary of current P2P Web search technologies in order to facilitate future research. First, some main challenges for P2P Web search are presented. Then, key techniques for building a feasible and efficient P2P Web search engine are reviewed, including system topology, data placement, query routing, index partitioning, collection selection, relevance ranking and Web crawling. Finally, three recently proposed novel P2P Web search prototypes are introduced.
文章编号:     中图分类号:    文献标志码:
基金项目:Supported by the National Natural Science Foundation of China under Grant Nos.60433040, 90412006, 90412011, 60573110, 60673152, 90612016 (国家自然科学基金); the National Basic Research Program of China under Grant Nos.2003CB317007, 2004CB318000 (国家重点基础研究发展计划(973)); the National High-Tech Research and Development Plan of China under Grant Nos.2006AA01A101, 2006AA01A108, 2006AA01A111 (国家高技术研究发展计划(863)) Supported by the National Natural Science Foundation of China under Grant Nos.60433040, 90412006, 90412011, 60573110, 60673152, 90612016 (国家自然科学基金); the National Basic Research Program of China under Grant Nos.2003CB317007, 2004CB318000 (国家重点基础研究发展计划(973)); the National High-Tech Research and Development Plan of China under Grant Nos.2006AA01A101, 2006AA01A108, 2006AA01A111 (国家高技术研究发展计划(863))
Foundation items:
Reference text:

方启明,杨广文,武永卫,郑纬民.基于P2P的Web搜索技术.软件学报,2008,19(10):2706-2719

FANG Qi-Ming,YANG Guang-Wen,WU Yong-Wei,ZHENG Wei-Min.P2P Web Search Technology.Journal of Software,2008,19(10):2706-2719