首页 | 本学科首页   官方微博 | 高级检索  
     

Geno:基于代价的异构融合查询优化器
引用本文:屠要峰,陈小强,周士俊,卞福升,吴非,陈兵.Geno:基于代价的异构融合查询优化器[J].软件学报,2022,33(3):774-796.
作者姓名:屠要峰  陈小强  周士俊  卞福升  吴非  陈兵
作者单位:南京航空航天大学 计算机科学与技术学院, 江苏 南京 211106;中兴通讯股份有限公司, 江苏 南京 210012
基金项目:国家重点研发计划项目(2019YFB2102002);江苏省重点研发计划项目(BE2019012)
摘    要:新型硬件及其构建的环境改变了传统的计算、存储以及网络体系,也改变了上层软件既往的设计假设,特别是通用处理器和专用加速器组成的异构计算架构,改变了数据库系统的底层框架设计和查询优化的代价模型.数据库系统需要针对新型硬件的特性做出适应性调整,以充分发挥新硬件的潜力.提出一种面向CPU/GPU/FPGA异构计算融合的基于代价...

关 键 词:数据库  查询优化  GPU  FPGA  异构计算
收稿时间:2021/6/29 0:00:00
修稿时间:2021/7/31 0:00:00

Geno: Cost-based Heterogeneous Fusion Query Optimizer
TU Yao-Feng,CHEN Xiao-Qiang,ZHOU Shi-Jun,BIAN Fu-Sheng,WU Fei,CHEN Bing.Geno: Cost-based Heterogeneous Fusion Query Optimizer[J].Journal of Software,2022,33(3):774-796.
Authors:TU Yao-Feng  CHEN Xiao-Qiang  ZHOU Shi-Jun  BIAN Fu-Sheng  WU Fei  CHEN Bing
Affiliation:College of Computer Science and Technology, Nanjing University of Aeronautics and Astronautics, Nanjing 211106, China;ZTE Corporation, Nanjing 210012, China
Abstract:The new hardware and its built environment have changed the traditional computing, storage and network systems, and also changed the previous design assumptions of the upper-level software. In particular, the heterogeneous computing architecture composed of general-purpose processors and dedicated accelerators has changed the design of the underlying framework of the database system and the cost model of query optimization. The database system needs to make adaptive adjustments to the characteristics of the new hardware to give full play to the potential of the new hardware. A cost-based query optimizer Geno for CPU/GPU/FPGA heterogeneous computing fusion is proposed, which can flexibly schedule and optimize the use of various computing resources. The main contribution is:finding that adjusting the cost parameters according to the actual hardware capabilities of the system environment can significantly improve the accuracy of the query plan, and proposing a calculation method and calibration tool for the cost of heterogeneous resources; through the estimation of the capabilities of heterogeneous hardware such as GPU and FPGA and the calibration of the actual capabilities of the database system hardware, establish a cost model for query processing in a heterogeneous computing environment; implemented GPU operators and FPGA operators that support selection, projection, connection and aggregation, realized GPU operator pipeline design and FPGA operator pipeline design; solved the operator allocation and scheduling through cost-based evaluation, and generated a heterogeneous collaborative execution plan to realize the collaborative optimization of heterogeneous computing resources to makes full use of the advantages of each heterogeneous resource. Experiments show that the parameter values calibrated by Geno are more compatible with the actual hardware capabilities. Compared with PostgreSQL and GPU database HeteroDB, Geno can generate a more reasonable query plan. In the TPC-H scenario, the execution time of Geno in the case of row storage is 64% to 93% less than that of Postgresql, and 1% to 39% less than that of Hetero-DB; in the case of column storage, Geno''s execution time is 87%~92% less than that of Postgresql, and 1%~81% less than that of Hetero-DB; Compared with row storage, Geno reduces query execution time 32%~89% in the case of column storage.
Keywords:database  query optimization  GPU  FPGA  heterogeneous calculation
点击此处可从《软件学报》浏览原始摘要信息
点击此处可从《软件学报》下载全文
设为首页 | 免责声明 | 关于勤云 | 加入收藏

Copyright©北京勤云科技发展有限公司  京ICP备09084417号