Journal of Software:2020.31(3):893-908

Survey and Prospect: Data Integration Methodologies
WANG Song,PENG Yu-Wei,LAN Hai,LUO Qian-Wen,PENG Zhi-Yong
(School of Computer Science, Wuhan University, Wuhan 430072, China)
Received:July 20, 2019    Revised:September 10, 2019
Abstract: Data integration plays an important role in data management and analysis. Although more than 30 years have passed since the data integration problem was first proposed and studied in academia, many closely related problems remain unsolved across various fields. This paper surveys the work on data integration from 2001 to the present. Tracing how data integration methods have developed shows not only the efforts earlier researchers made and the research directions they uncovered, but also the origins and evolution of the problems studied in each subfield. Finally, by analyzing the work of recent years, the paper identifies potential future research directions in data integration, as a reference for researchers in related fields.
Key words: big data; data integration; data management; Web table; crowdsourcing
Foundation items:National Key Research and Development Program of China (2016YFB1000701)
