TY - GEN
T1 - HINT on Steroids
T2 - 27th International Conference on Extending Database Technology, EDBT 2024
AU - Bouros, Panagiotis
AU - Titkov, Artur
AU - Christodoulou, George
AU - Rauch, Christian
AU - Mamoulis, Nikos
PY - 2024
Y1 - 2024
N2 - A wide range of applications manage interval data. HINT was recently proposed to hierarchically index intervals in main memory. The index outperforms competitive structures by a wide margin, but under its current setup, HINT is able to service only single query requests. In practice however, real systems receive a large number of queries at the same time and so, our focus in this paper is on batch query processing. We propose two novel evaluation strategies termed level-based and partition-based, which both work in a per-level fashion, i.e., all queries for an index level are computed before moving to the next level. The new strategies operate in a cache-aware fashion to reduce the cache misses caused by climbing the index hierarchy or accessing multiple partitions per level, and to decrease the total execution time for a query batch. Our experimental analysis with both real and synthetic datasets showed that our batch processing strategies always outperform a baseline that executes queries in a serial fashion, and that partition-based is overall the most efficient strategy.
AB - A wide range of applications manage interval data. HINT was recently proposed to hierarchically index intervals in main memory. The index outperforms competitive structures by a wide margin, but under its current setup, HINT is able to service only single query requests. In practice however, real systems receive a large number of queries at the same time and so, our focus in this paper is on batch query processing. We propose two novel evaluation strategies termed level-based and partition-based, which both work in a per-level fashion, i.e., all queries for an index level are computed before moving to the next level. The new strategies operate in a cache-aware fashion to reduce the cache misses caused by climbing the index hierarchy or accessing multiple partitions per level, and to decrease the total execution time for a query batch. Our experimental analysis with both real and synthetic datasets showed that our batch processing strategies always outperform a baseline that executes queries in a serial fashion, and that partition-based is overall the most efficient strategy.
UR - http://www.scopus.com/inward/record.url?scp=85190968481&partnerID=8YFLogxK
U2 - 10.48786/edbt.2024.38
DO - 10.48786/edbt.2024.38
M3 - Conference contribution
AN - SCOPUS:85190968481
T3 - Advances in Database Technology - EDBT
SP - 440
EP - 446
BT - Advances in Database Technology - EDBT
PB - OpenProceedings.org
Y2 - 25 March 2024 through 28 March 2024
ER -