This repository was archived by the owner on Apr 23, 2024. It is now read-only.

Milestones

0.6.0 - Proof of concept version 2
The goal of this milestone is to have DataFusion working well enough to run single-threaded SQL queries against CSV and Parquet data sources, supporting projection, selection, cast, type coercion, sort (in memory) and simple aggregates (in memory).
Overdue by 7 year(s)
•
Due by January 6, 2019
•6/6 issues closed
100% complete0 open 6 closed
0.8.0 - Mature SQL Support
Implement JOIN, ORDER BY, UNION, SUBQUERY
No due date
•4/4 issues closed
100% complete0 open 4 closed
0.7.0 - Distributed Queries
Implement distributed processing: - Implement serialization for RecordBatch and Schema (using Arrow IPC) so that data can be persisted to disk and streamed between nodes - Implement basic distributed query planner - Implement serialization for query plans - Docker packaging for worker nodes - Kubernetes to orchestrate cluster
No due date
•27/27 issues closed
100% complete0 open 27 closed

Provide feedback