This repository was archived by the owner on Apr 23, 2024. It is now read-only.
List view
The goal of this milestone is to have DataFusion working well enough to run single-threaded SQL queries against CSV and Parquet data sources, supporting projection, selection, cast, type coercion, sort (in memory) and simple aggregates (in memory).
Overdue by 7 year(s)•Due by January 6, 2019•6/6 issues closedImplement JOIN, ORDER BY, UNION, SUBQUERY
No due date•4/4 issues closedImplement distributed processing: - Implement serialization for RecordBatch and Schema (using Arrow IPC) so that data can be persisted to disk and streamed between nodes - Implement basic distributed query planner - Implement serialization for query plans - Docker packaging for worker nodes - Kubernetes to orchestrate cluster
No due date•27/27 issues closed