This repository was archived by the owner on Apr 23, 2024. It is now read-only.

Milestones

  • The goal of this milestone is to have DataFusion working well enough to run single-threaded SQL queries against CSV and Parquet data sources, supporting projection, selection, cast, type coercion, sort (in memory) and simple aggregates (in memory).

    Overdue by 7 years
    Due by January 6, 2019
    6/6 issues closed
  • Implement JOIN, ORDER BY, UNION, SUBQUERY

    No due date
    4/4 issues closed
  • Implement distributed processing:

    - Implement serialization for RecordBatch and Schema (using Arrow IPC) so that data can be persisted to disk and streamed between nodes
    - Implement basic distributed query planner
    - Implement serialization for query plans
    - Docker packaging for worker nodes
    - Kubernetes to orchestrate cluster

    No due date
    27/27 issues closed