Lance-spark support in Gluten #12263

@FelixYBW

Description

There are two ways to add Lance read/write support:

  • Use the Lance connector in Velox and offload the whole read/write operator to the Velox plan: [WIP]Add Lance connector facebookincubator/velox#16556
  • Use the lance-spark datasource. To read, Gluten gets the Arrow data from lance-spark and forwards it to the Velox pipeline. To write, Gluten gets the data from Velox, converts it to Arrow, and forwards it to lance-spark. No C2R/R2C (columnar-to-row / row-to-columnar) conversion happens anywhere in the process.

We are going to try lance-spark as the preferred way.
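
As a rough illustration of the second option's data flow, here is a toy Python model in which data stays columnar from read to write. Everything in it is hypothetical: `ColumnarBatch`, `lance_spark_read`, `velox_pipeline`, and `lance_spark_write` are made-up stand-ins, not Gluten, Velox, or lance-spark APIs, and plain Python lists stand in for Arrow buffers.

```python
from dataclasses import dataclass
from typing import Dict, List


@dataclass
class ColumnarBatch:
    """Toy stand-in for an Arrow record batch: column name -> column values."""
    columns: Dict[str, List[int]]


def lance_spark_read() -> ColumnarBatch:
    # Hypothetical read side: the datasource hands back columnar data.
    return ColumnarBatch({"id": [1, 2, 3], "score": [10, 20, 30]})


def velox_pipeline(batch: ColumnarBatch) -> ColumnarBatch:
    # Hypothetical native stage: keep entries with score >= 20.
    # The selection mask is applied column by column; no row objects are built.
    keep = [s >= 20 for s in batch.columns["score"]]
    filtered = {
        name: [v for v, k in zip(values, keep) if k]
        for name, values in batch.columns.items()
    }
    return ColumnarBatch(filtered)


def lance_spark_write(batch: ColumnarBatch, sink: List[ColumnarBatch]) -> None:
    # Hypothetical write side: the columnar batch is handed over unchanged.
    sink.append(batch)


sink: List[ColumnarBatch] = []
result = velox_pipeline(lance_spark_read())
lance_spark_write(result, sink)
```

The point of the sketch is only the shape of the pipeline: each stage consumes and produces a columnar batch, so there is no step where a row-based representation would be needed.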

Gluten version

None

Metadata

Labels

enhancement (New feature or request)
