Revisiting Database Indexing for Parallel and Accelerated Computing: A Comprehensive Study and Novel Approaches

Authors:

Abbasi Maryam (1), Bernardo Marco V. (2, 3), Váz Paulo (3), Silva José (3), Martins Pedro (3)

Affiliations:

1. Applied Research Institute, Polytechnic Institute of Coimbra, 3045-093 Coimbra, Portugal

2. Instituto de Telecomunicações, 6201-001 Covilhã, Portugal

3. Department of Informatics, Polytechnic of Viseu, 3504-510 Viseu, Portugal

Abstract

While the importance of indexing strategies for optimizing query performance in database systems is widely acknowledged, the impact of rapidly evolving hardware architectures on indexing techniques remains underexplored. As modern computing systems increasingly leverage parallel processing capabilities, multi-core CPUs, and specialized hardware accelerators, traditional indexing approaches may not fully capitalize on these advancements. This comprehensive experimental study investigates the effects of hardware-conscious indexing strategies tailored for contemporary and emerging hardware platforms. Through rigorous experimentation in a real-world database environment using the industry-standard TPC-H benchmark, this research evaluates the performance implications of indexing techniques specifically designed to exploit parallelism, vectorization, and hardware-accelerated operations. By examining approaches such as cache-conscious B-Tree variants, SIMD-optimized hash indexes, and GPU-accelerated spatial indexing, the study provides insights into the potential performance gains and trade-offs associated with these hardware-aware indexing methods. The findings reveal that hardware-conscious indexing strategies can significantly outperform their traditional counterparts, particularly in data-intensive workloads and large-scale database deployments. Our experiments show improvements ranging from 32.4% to 48.6% in query execution time, depending on the specific technique and hardware configuration. However, the study also highlights the complexity of implementing and tuning these techniques, as they often require intricate code optimizations and a deep understanding of the underlying hardware architecture. Additionally, this research explores the potential of machine learning-based indexing approaches, including reinforcement learning for index selection and neural network-based index advisors.
While these techniques show promise, with performance improvements of up to 48.6% in certain scenarios, their effectiveness varies across different query types and data distributions. By offering a comprehensive analysis and practical recommendations, this research contributes to the ongoing pursuit of database performance optimization in the era of heterogeneous computing. The findings inform database administrators, developers, and system architects on effective indexing practices tailored for modern hardware, while also paving the way for future research into adaptive indexing techniques that can dynamically leverage hardware capabilities based on workload characteristics and resource availability.

Funder

Polytechnic Institute of Viseu

Publisher

MDPI AG

