AlphaQO:鲁棒的学习型查询优化器

doi:10.13328/j.cnki.jos.006452

微信服务号

微信订阅号

首页 > 过刊浏览>2022年第33卷第3期 >814-831. DOI:10.13328/j.cnki.jos.006452

PDF HTML阅读 XML下载导出引用引用提醒

AlphaQO:鲁棒的学习型查询优化器
DOI:
                        10.13328/j.cnki.jos.006452
                    
作者:
                        
                        
                    
作者单位:
作者简介:余翔(1994-),男,博士生,主要研究领域为人工智能驱动的数据库优化技术;
汤南(1981-),男,博士,研究员,主要研究领域为数据准备,数据可视化;
柴成亮(1992-),男,博士,CCF专业会员,主要研究领域为人机协作的数据管理,数据库系统;
孙佶(1994-),男,博士,主要研究领域为分布式相似查询,人工智能和数据库交叉技术;
张辛宁(2000-),男,本科生,主要研究领域为人工智能驱动的数据库优化技术;
李国良(1981-),男,博士,教授,博士生导师,CCF杰出会员,主要研究领域为大数据,数据库,数据科学.
通讯作者:李国良,E-mail:liguoliang@tsinghua.edu.cn
中图分类号:
基金项目:国家自然科学基金（61925205，61632016）；华为和好未来资助项目

AlphaQO: Robust Learned Query Optimizer

Author:

Affiliation:

Fund Project:

摘要

图/表

访问统计

参考文献

相似文献

引证文献

资源附件

文章评论

摘要:

由深度学习驱动的学习型查询优化器正在越来越广泛地受到研究者的关注，这些优化器往往能够取得近似甚至超过传统商业优化器的性能.与传统优化器不同的是，一个成功的学习型优化器往往依赖于足够多的高质量的负载查询作为训练数据.低质量的训练查询会导致学习型优化器在未来的查询上失效.提出了基于强化学习的鲁棒的学习型查询优化器训练框架AlphaQO，提前找到学习型优化器做不好的查询，以提高学习型优化器的鲁棒性.AlphaQO中存在两个重要部分：查询生成器和学习型优化器.查询生成器的目标是生成“难”的查询（传统优化器做得好，但是学习型优化器反而做得不好的查询）.学习型优化器利用这些生成的查询进行测试和训练，并提供反馈让查询生成器进行更新.系统迭代交替的运行上述两个部分，分别进行训练.目的在于在提供尽量少的信息和消耗足够小的时间下找到足够多“难”的并且未见的查询给优化器训练，以提高学习型优化器的鲁棒性.实验结果显示：该生成器会提供越来越难的训练查询给学习型优化器；同时，这些查询能够提升学习型优化器的性能.

Abstract:

Learned database query optimizers, which are typically empowered by (deep) learning models, have attracted significant attention recently, because they can offer similar or even better performance than the state-of-the-art commercial optimizers that require hundreds of expert-hours to tune. A crucial factor of successfully training learned optimizers is training queries. Unfortunately, a good query workload that is sufficient for training learned optimizers is not always available. This study proposes a framework, called AlphaQO, on generating queries for learned optimizers with reinforcement learning (RL). AlphaQO is a loop system that consists of two main components, query generator and learned optimizer. Query generator aims at generating “hard” queries (i.e., those queries that the learned optimizer provides poor estimates). The learned optimizer will be trained using generated queries, as well as providing feedbacks (in terms of numerical rewards) to the query generator. If the generated queries are good, the query generator will get a high reward;otherwise, the query generator will get a low reward. The above process is performed iteratively, with the main goal that within a small budget, the learned optimizer can be trained and generalized well to a wide range of unseen queries. Extensive experiments show that AlphaQO can generate a relatively small number of queries and train a learned optimizer to outperform commercial optimizers. Moreover, learned optimizers need much less queries from AlphaQO than randomly generated queries, in order to well train the learned optimizer.

参考文献

相似文献

引证文献

引用本文

余翔,柴成亮,张辛宁,汤南,孙佶,李国良. AlphaQO:鲁棒的学习型查询优化器.软件学报,2022,33(3):814-831

复制

文章指标

点击次数:
下载次数:
HTML阅读次数:
引用次数:

历史

收稿日期:2021-06-30
最后修改日期:2021-07-31
录用日期:
在线发布日期: 2021-10-21
出版日期: 2022-03-06

微信服务号

微信订阅号

引用本文

分享

文章指标

历史