Journal of Software:2014.25(4):813-825

(中国人民大学 信息学院, 北京 100872)
Method for Top-K Query on Big Data in Cloud
CI Xiang,MA You-Zhong,MENG Xiao-Feng
(School of Information, Renmin University of China, Beijing 100872, China)
Chart / table
Similar Articles
Article :Browse 4628   Download 4485
Received:September 10, 2013    Revised:December 18, 2013
> 中文摘要: Top-K查询在搜索引擎、电子商务等领域有着广泛的应用.Top-K查询从海量数据中返回最符合用户需求的前K个结果,主要目的是消除信息过载带来的负面影响.大数据背景下的Top-K查询,给数据管理和分析等方面带来新的挑战.结合MapReduce的特点,从数据划分、数据筛选等方面对云环境下的大数据Top-K查询问题进行深入研究.实验结果表明,该方法具有良好的性能和扩展性.
中文关键词: Top-K查询  云计算  MapReduce
Abstract:Top-K query has been widely used in lots of modern applications such as search engine and e-commerce. Top-K query returns the most relative results for user from massive data, and its main purpose is to eliminate the negative effect of information overload. Top-K query on big data has brought new challenges to data management and analysis. In light of features of MapReduce, this paper presents an in-depth study of Top-K query on big data from the perspective of data partitioning and data filtering. Experimental results show that the proposed approaches have better performance and scalability.
keywords: top-K query  cloud  MapReduce
文章编号:     中图分类号:    文献标志码:
基金项目:国家自然科学基金(61379050,91224008);国家高技术研究发展计划(863)(2013AA013204);高等学校博士学科点专项科研基金(20130004130001) 国家自然科学基金(61379050,91224008);国家高技术研究发展计划(863)(2013AA013204);高等学校博士学科点专项科研基金(20130004130001)
Foundation items:
Reference text:


CI Xiang,MA You-Zhong,MENG Xiao-Feng.Method for Top-K Query on Big Data in Cloud.Journal of Software,2014,25(4):813-825