Blog post about 1000 distinct committers / history of the project #21305

@alamb

@nuno-faria just noted in apache/datafusion-site#162 (review):

Also, Datafusion hit recently 1000 unique contributors, in case it's relevant to mention here.

It would be great to write a blog post, similar in spirit to Arrow's 10th anniversary blog post:

https://arrow.apache.org/blog/2026/02/12/arrow-anniversary/

Some neat content would be:

  1. Project history (there is some additional content here): https://docs.google.com/presentation/d/1M9nsHLdOseDvRxaL28KnB7DPkrPEZOk19dpFHo3S6mw/edit
  2. Summarize current users / use cases (there is some content here): https://docs.google.com/presentation/d/1h5HZwG2CQoXIU_Rdzs_cDIOs3lBRHKJi_ISkn4IHVHk/edit?slide=id.g361b54a0025_1_807#slide=id.g361b54a0025_1_807
  3. Look to the future

One interesting use case that @gene-bordegaray and I were talking about recently is that DataFusion's structure allows replacing legacy processing systems incrementally (e.g. from the bottom of their plans up). With a full-on system like Trino, they would likely need to do it all at once, which would be much riskier. I think Datadog falls into this category given their public presentations. I know of other systems that do as well, but I am not sure how much is public.
