Vectorizing an In Situ Query Engine

Scientific publication - Conference paper uoadl:3168895

Unit:
NKUA research material
Title:
Vectorizing an In Situ Query Engine
Document languages:
English
Abstract:
Database systems serve a wide range of use cases efficiently, but
require data to be loaded and adapted to the system's execution engine.
This pre-processing step is a bottleneck to the analysis of
increasingly large and heterogeneous datasets. Therefore, numerous
research efforts advocate for querying each dataset in situ, i.e.,
without pre-loading it in a DBMS. On the other hand, performing analysis
over raw data entails numerous overheads because of the potentially
inefficient data representations.
In this paper, we investigate the effect of vector processing on raw
data querying. We enhance the operators of a query engine to use SIMD
operations. Specifically, we examine the effect of SIMD on two different
cases: the scan operators that perform the CPU-intensive task of input
parsing, and the part of the query pipeline that performs a selection
and computes an aggregate. We show that a vectorized approach has a lot
of potential to improve performance, which nevertheless comes with
trade-offs.
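
To make the selection-and-aggregate case concrete, the following is a minimal illustrative sketch, not the authors' implementation. It assumes a hypothetical query of the form SELECT SUM(val) WHERE key < threshold over two 32-bit integer columns, and uses SSE2 intrinsics where the compiler exposes them, falling back to a scalar loop otherwise. The function names and the column layout are assumptions made for illustration only.

```cpp
#include <cstddef>
#include <cstdint>
#include <vector>
#if defined(__SSE2__)
#include <emmintrin.h>  // SSE2 integer intrinsics
#endif

// Hypothetical query: SELECT SUM(val) WHERE key < threshold.
// Scalar reference: one tuple per iteration, with a branch per tuple.
int64_t sum_where_less_scalar(const std::vector<int32_t>& key,
                              const std::vector<int32_t>& val,
                              int32_t threshold) {
    int64_t sum = 0;
    for (std::size_t i = 0; i < key.size(); ++i) {
        if (key[i] < threshold) sum += val[i];
    }
    return sum;
}

// SIMD sketch: evaluates the predicate on four tuples at once and uses the
// resulting mask to zero out non-qualifying values before accumulating.
// Assumes key.size() == val.size(). The per-lane accumulators are 32 bits
// wide, so very large partial sums would wrap; a real engine would need to
// widen them or flush them periodically.
int64_t sum_where_less_simd(const std::vector<int32_t>& key,
                            const std::vector<int32_t>& val,
                            int32_t threshold) {
    const std::size_t n = key.size();
    std::size_t i = 0;
    int64_t sum = 0;
#if defined(__SSE2__)
    const __m128i thr = _mm_set1_epi32(threshold);  // broadcast threshold
    __m128i acc = _mm_setzero_si128();              // four partial sums
    for (; i + 4 <= n; i += 4) {
        __m128i k = _mm_loadu_si128(reinterpret_cast<const __m128i*>(&key[i]));
        __m128i v = _mm_loadu_si128(reinterpret_cast<const __m128i*>(&val[i]));
        __m128i mask = _mm_cmplt_epi32(k, thr);     // all-ones where key < thr
        acc = _mm_add_epi32(acc, _mm_and_si128(mask, v));
    }
    // Horizontal reduction of the four lanes.
    alignas(16) int32_t lanes[4];
    _mm_store_si128(reinterpret_cast<__m128i*>(lanes), acc);
    sum = static_cast<int64_t>(lanes[0]) + lanes[1] + lanes[2] + lanes[3];
#endif
    // Scalar tail (or the whole input when SSE2 is unavailable).
    for (; i < n; ++i) {
        if (key[i] < threshold) sum += val[i];
    }
    return sum;
}
```

The masked version replaces a per-tuple branch with a compare and a bitwise AND, which is one common way to vectorize a filter followed by an aggregate. It also illustrates the kind of trade-off the abstract alludes to: non-qualifying tuples still pass through the addition, so the benefit depends on factors such as selectivity and data layout.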
Publication year:
2016
Authors:
Sioulas, Panagiotis
Ailamaki, Anastasia
Publisher:
ASSOCIATION FOR COMPUTING MACHINERY
Conference title:
SIGMOD'16: PROCEEDINGS OF THE 2016 INTERNATIONAL CONFERENCE ON
MANAGEMENT OF DATA
Pages:
2261-2262
Official URL (Publisher):
DOI:
10.1145/2882903.2914829
The digital material for this record is not available.