基于核外计算的Datalog引擎设计与实现
作者:
作者单位:

作者简介:

通讯作者:

左志强,E-mail:zqzuo@nju.edu.cn

中图分类号:

TP311

基金项目:

国家自然科学基金(61802168); 江苏省自然科学基金(BK20191247)


Design and Implementation of the out-of-core Datalog Engine DDL
Author:
Affiliation:

Fund Project:

  • 摘要
  • |
  • 图/表
  • |
  • 访问统计
  • |
  • 参考文献
  • |
  • 相似文献
  • |
  • 引证文献
  • |
  • 资源附件
  • |
  • 文章评论
    摘要:

    随着新兴技术的迅速发展, 领域软件对开发效率提出了新的要求.Datalog语言作为一门具有简洁语法和良好语义的声明式编程语言, 能帮助开发人员快速开发和解决问题, 近年来越来越受到重视与欢迎.但解决真实场景问题时, 现有的单机Datalog引擎计算规模往往受限于内存容量大小, 不具有可扩展性.为解决上述问题, 本文设计并实现了基于核外计算的Datalog引擎.方法首先设计了一系列计算Datalog程序所需的支持核外计算的操作算子, 然后将Datalog程序转换合成带核外计算算子的C++程序, 接着方法设计了基于Hash的分区策略和基于搜索树剪枝的最少置换调度策略, 将相应的分区文件调度执行计算并得到最终结果.基于该方法, 实现了原型工具DDL(Disk-Based DataLog Engine), 并选取广泛应用的真实Datalog程序, 在合成数据集以及真实数据集上进行实验, 实验结果体现了DDL良好性能以及高可扩展性.

    Abstract:

    With the rapid development of emerging technologies, domain software puts forward the new requirements on development efficiency. Datalog as a declarative programming language with concise syntax and good semantics, can help developers to reason and solve complex problems rapidly. However, when solving the real-world problems, the existing single-machine Datalog engines are often limited by the size of memory capacity and have no scalability. In order to solve the above problems, this paper designs and implements Datalog engine based on out-of-core computation. Methods firstly, a series of out-of-core operators are designed, and then the Datalog program is converted into the C++ program with the operators. Then, the partition strategy based on Hash and the minimum replacement scheduling strategy based on search tree pruning are designed. The corresponding partition files are scheduled and computed, and then the final results are generated. Based on this method, the prototype tool DDL(Disk-Based DataLog Engine) is implemented, and widely used real-world Datalog programs are selected to conduct experiments on both synthetic and real-world datasets. The experimental results show that DDL has good performance and high scalability.

    参考文献
    相似文献
    引证文献
引用本文

张奕裕,王归航,左志强,李宣东.基于核外计算的Datalog引擎设计与实现.软件学报,,():0

复制
分享
文章指标
  • 点击次数:
  • 下载次数:
  • HTML阅读次数:
  • 引用次数:
历史
  • 收稿日期:2021-08-09
  • 最后修改日期:2021-10-09
  • 录用日期:
  • 在线发布日期: 2022-01-28
  • 出版日期:
您是第位访问者
版权所有:中国科学院软件研究所 京ICP备05046678号-3
地址:北京市海淀区中关村南四街4号,邮政编码:100190
电话:010-62562563 传真:010-62562533 Email:jos@iscas.ac.cn
技术支持:北京勤云科技发展有限公司

京公网安备 11040202500063号