###
Journal of Software:2012.23(6):1335-1349

基于网络信息搜索的Web Service 文本描述信息扩充方法
王立杰,李萌,蔡斯博,李戈,谢冰,杨芙清
(北京大学 信息科学技术学院 软件研究所,北京 100871;高可信软件技术教育部重点实验室,北京 100871)
Internet Information Search Based Approach to Enriching Textual Descriptions for Public Web Services
WANG Li-Jie,LI Meng,CAI Si-Bo,LI Ge,XIE Bing,YANG Fu-Qing
(Software Institute, School of Electronics Engineering and Computer Science, Peking University, Beijing 100871, China;Key Laboratory of High Confidence Software Technologies, Ministry of Education, Beijing 100871, China)
Abstract
Chart / table
Reference
Similar Articles
Article :Browse 3680   Download 4125
Received:March 11, 2011    Revised:July 04, 2011
> 中文摘要: 随着Web 服务技术的不断成熟和发展,互联网上出现了大量的公共Web 服务.在使用Web 服务开发软件系统的过程中,其文本描述信息(例如简介和使用说明等)可以帮助服务消费者直观有效地识别和理解Web 服务并加以利用.已有的研究工作大多关注于从Web 服务的WSDL 文件中获取此类信息进行Web 服务的发现或检索,调研发现,互联网上大部分Web 服务的WSDL 文件中普遍缺少甚至没有此类信息.为此,提出一种基于网络信息搜索的从WSDL 文件之外的信息源为Web 服务扩充文本描述信息的方法.从互联网上收集包含目标Web 服务特征标识的相关网页,基于从网页中抽取出的信息片段,利用信息检索技术计算信息片段与目标Web 服务的相关度,并选取相关度较高的文本片段为Web 服务扩充文本描述信息.基于互联网上的真实数据进行的实验,其结果表明,可为约51%的互联网上的Web 服务获取到相关网页,并为这些Web 服务中约88%扩充文本描述信息.收集到的Web 服务及其文本描述信息数据均已公开发布.
Abstract:With the development of Web services technologies, more and more public Web services have been published on the Internet. During the searching and utilizing of these public services, services' textual descriptions (such as introduction and user manual), which are generally expressed in natural language, provide great help for service consumers to locate, understand, and utilize proper Web services. Existing methods for services discovery usually try to obtain such descriptions only from services' WSDL files. However, according to this investigation, lots of Web services do not contain enough textual descriptions in their WSDL files. This paper proposes an approach to enriching textual descriptions for public Web services on the Internet using the information sources outside of WSDL files. Given a Web service, the study collects related Web pages containing its features from the Internet. Then, the enriched descriptions for the service are identified from the Web pages using information retrieval technologies. Experiments conducted on real data indicate that our approach can enrich descriptions for about half of the public services on the Internet effectively. The collected data is publicly available on the Internet.
文章编号:     中图分类号:    文献标志码:
基金项目:国家自然科学基金(60803010); 国家高技术研究发展计划(863)(2007AA010301) 国家自然科学基金(60803010); 国家高技术研究发展计划(863)(2007AA010301)
Foundation items:
Reference text:

王立杰,李萌,蔡斯博,李戈,谢冰,杨芙清.基于网络信息搜索的Web Service 文本描述信息扩充方法.软件学报,2012,23(6):1335-1349

WANG Li-Jie,LI Meng,CAI Si-Bo,LI Ge,XIE Bing,YANG Fu-Qing.Internet Information Search Based Approach to Enriching Textual Descriptions for Public Web Services.Journal of Software,2012,23(6):1335-1349