Alenka

GPU database engine

Statistics on Alenka

Number of watchers on GitHub: 1103
Number of open issues: 11
Average time to close an issue: about 1 month
Main language: CUDA
Average time to merge a PR: about 18 hours
Open pull requests: 1+
Closed pull requests: 10+
Last commit: over 2 years ago
Repo created: about 7 years ago
Repo last updated: over 1 year ago
Size: 44.1 MB
Organization / Author: antonmks
Contributors: 7

Welcome to Alenka - GPU database engine

What is this?

Alenka is a GPU-based database engine written to exploit the vector-based processing and high bandwidth of modern GPUs.

Requirements

  • CUDA (nvcc) and an NVIDIA GPU
  • bison
  • flex
  • Modern GPU Library (included as submodule)

How to build?

git clone --recursive https://github.com/antonmks/Alenka.git
cd Alenka
make

Features:

  • Vector-based processing
    The CUDA programming model allows a single operation to be applied to an entire set of data at once.

  • Smart compression
    Ultra-fast compression and decompression on the GPU. Database operations run on compressed data.

  • Column-based storage
    Minimizes disk I/O by only accessing the relevant data.

  • Data skipping
    Better performance without indexes.

  • Fast Loading
    A GPU-based CSV parser loads data into the database at very high speed.
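
The README does not say how data skipping is implemented in Alenka. As a rough illustration of the general idea only, the Python sketch below keeps a min/max summary for each segment of a column and skips any segment that cannot contain a match. The names `Segment`, `build_segments` and `filter_less_than`, the segment size, and the sample dates are all hypothetical and are not taken from the Alenka source.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Segment:
    """One chunk of a single column plus its min/max summary."""
    values: List[int]
    lo: int
    hi: int


def build_segments(column: List[int], segment_size: int) -> List[Segment]:
    """Split a column into fixed-size segments and record min/max for each."""
    segments = []
    for start in range(0, len(column), segment_size):
        chunk = column[start:start + segment_size]
        segments.append(Segment(chunk, min(chunk), max(chunk)))
    return segments


def filter_less_than(segments: List[Segment], bound: int) -> Tuple[List[int], int]:
    """Return (matching values, number of segments actually scanned)."""
    matches, scanned = [], 0
    for seg in segments:
        if seg.lo >= bound:
            # No value in this segment can satisfy value < bound: skip it.
            continue
        scanned += 1
        matches.extend(v for v in seg.values if v < bound)
    return matches, scanned


# Hypothetical o_orderdate column, stored in roughly ascending order.
order_dates = [19950101, 19950210, 19950301, 19950314,
               19950315, 19950402, 19950520, 19950611]
segments = build_segments(order_dates, segment_size=4)
matches, scanned = filter_less_than(segments, 19950315)
print(matches, scanned)
```

Skipping of this kind pays off most when the column is stored in an order that correlates with the filter, as dates often are, because whole segments then fall outside the requested range.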

How to use it?

Create your data files:

Run scripts load_orders.sql, load_lineitem.sql and load_customer.sql to create your database files.

Run your queries from a command prompt or use the Alenka JDBC driver from Technica Corporation.

Step 1 - Filter data

OFI := FILTER orders BY o_orderdate < 19950315;

CF := FILTER customers BY c_mktsegment == "BUILDING";

LF := FILTER lineitem BY shipdate > 19950315;

Step 2 - Join data

OLC := SELECT o_orderkey AS o_orderkey, o_orderdate AS o_orderdate, o_shippriority AS o_shippriority, price AS price, discount AS discount FROM LF JOIN OFI ON orderkey = o_orderkey JOIN CF ON o_custkey = c_custkey;

Step 3 - Group data

F := SELECT o_orderkey AS o_orderkey1, o_orderdate AS orderdate1, o_shippriority AS priority, SUM(price*(1-discount)) AS sum_revenue, COUNT(o_orderkey) AS cnt
FROM OLC GROUP BY o_orderkey, o_orderdate, o_shippriority;

Step 4 - Order data

RES := ORDER F BY sum_revenue DESC, orderdate1 ASC;

Step 5 - Save the results

STORE RES INTO 'results.txt' USING ('|') LIMIT 10;
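
To make the meaning of the five steps concrete, here is a minimal Python sketch that applies the same filter, join, group, order and limit logic to a few in-memory rows. It only mirrors what the script computes and says nothing about how Alenka executes it on the GPU. The sample rows are invented for illustration, and the pipe-delimited print at the end is an assumption about the general shape of the output, not Alenka's exact file format.

```python
from collections import defaultdict

# Hypothetical sample rows; column names follow the script above.
orders = [
    {"o_orderkey": 1, "o_custkey": 10, "o_orderdate": 19950301, "o_shippriority": 0},
    {"o_orderkey": 2, "o_custkey": 11, "o_orderdate": 19950310, "o_shippriority": 0},
    {"o_orderkey": 3, "o_custkey": 10, "o_orderdate": 19950401, "o_shippriority": 0},
    {"o_orderkey": 4, "o_custkey": 10, "o_orderdate": 19950305, "o_shippriority": 0},
]
customers = [
    {"c_custkey": 10, "c_mktsegment": "BUILDING"},
    {"c_custkey": 11, "c_mktsegment": "MACHINERY"},
]
lineitem = [
    {"orderkey": 1, "price": 100.0, "discount": 0.5, "shipdate": 19950320},
    {"orderkey": 1, "price": 40.0, "discount": 0.0, "shipdate": 19950316},
    {"orderkey": 1, "price": 999.0, "discount": 0.0, "shipdate": 19950301},
    {"orderkey": 2, "price": 70.0, "discount": 0.0, "shipdate": 19950401},
    {"orderkey": 4, "price": 200.0, "discount": 0.25, "shipdate": 19950330},
]

# Step 1 - filter each table.
ofi = [o for o in orders if o["o_orderdate"] < 19950315]
cf = [c for c in customers if c["c_mktsegment"] == "BUILDING"]
lf = [l for l in lineitem if l["shipdate"] > 19950315]

# Step 2 - join line items to filtered orders, then to filtered customers.
orders_by_key = {o["o_orderkey"]: o for o in ofi}
building_keys = {c["c_custkey"] for c in cf}
olc = []
for l in lf:
    o = orders_by_key.get(l["orderkey"])
    if o is not None and o["o_custkey"] in building_keys:
        olc.append({**o, "price": l["price"], "discount": l["discount"]})

# Step 3 - group by order and aggregate revenue and row count.
groups = defaultdict(lambda: [0.0, 0])
for row in olc:
    key = (row["o_orderkey"], row["o_orderdate"], row["o_shippriority"])
    groups[key][0] += row["price"] * (1 - row["discount"])
    groups[key][1] += 1
f = [{"o_orderkey1": k[0], "orderdate1": k[1], "priority": k[2],
      "sum_revenue": v[0], "cnt": v[1]} for k, v in groups.items()]

# Step 4 - order by revenue descending, then order date ascending.
res = sorted(f, key=lambda r: (-r["sum_revenue"], r["orderdate1"]))

# Step 5 - keep the first 10 rows and print them pipe-delimited.
top = res[:10]
for r in top:
    print("|".join(str(r[c]) for c in
                   ("o_orderkey1", "orderdate1", "priority", "sum_revenue", "cnt")))
```

In this sample, order 2 is dropped because its customer is not in the BUILDING segment, order 3 is dropped by the order date filter, and the line item shipped on 19950301 is dropped by the ship date filter.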

Alenka is licensed under the Apache 2.0 license.

Alenka open issues
  • over 2 years ago: Compilation on Windows
  • over 2 years ago: OpenCL port
  • over 2 years ago: Data partitioning architecture
  • over 2 years ago: Loading error
  • over 2 years ago: Device Errors
  • about 3 years ago: Compile problem
  • about 6 years ago: Is there in AlenkaDB such a thing as NULL?
Alenka open pull requests
  • file organization
Alenka questions on Stack Overflow
  • 'thrust::system::system_error' error when running GPGPU database engine Alenka