Add uploading of datasets. #52

@ManuelAlvarezC

Description

Trello card: https://trello.com/c/tb08vrGi

We need to upload the datasets generated by our data sources so that they are easily accessible to other teams.

To do this, we need to:

  1. Create a set of default parameters for each data source; the set may be empty.
  2. Create a function that, given a data_source, runs it with the defined parameters, stores the result as a CSV, and packs it in a folder with a copy of the audit and metapackage.json files.
  3. Create a function, using this notebook as a template, that takes the path to a data package and uploads it to Kaggle.
  4. Create a function that takes no arguments and iterates through the data sources, generating the data packages and uploading them to Kaggle.
  5. Create a GitHub Action that runs the function from step 4 every 24 hours.
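
A rough sketch of how steps 1, 2 and 4 could fit together, to make the proposal concrete. Everything named here is an assumption for illustration, not existing project code: the `DATA_SOURCES` and `DEFAULT_PARAMETERS` registries, `example_source`, the `audit.md` filename, and the one-directory-per-data-source layout. The uploader from step 3 is passed in as a plain callable so the packaging logic can be exercised without Kaggle credentials.

```python
import csv
import shutil
from pathlib import Path
from typing import Callable, Dict, Iterable, List, Mapping


def example_source(country: str = "ES") -> Iterable[Mapping[str, object]]:
    """Hypothetical data source: returns rows as dictionaries."""
    return [{"country": country, "value": 1}, {"country": country, "value": 2}]


# Hypothetical registries. Step 1: default parameters per data source (may be empty).
DATA_SOURCES: Dict[str, Callable[..., Iterable[Mapping[str, object]]]] = {
    "example_source": example_source,
}
DEFAULT_PARAMETERS: Dict[str, dict] = {"example_source": {}}

# Assumed filenames; only metapackage.json is named in this issue.
EXTRA_FILES = ("audit.md", "metapackage.json")


def build_data_package(name: str, output_root: Path, source_dir: Path) -> Path:
    """Step 2: run a data source, store the result as CSV, copy the extra files."""
    params = DEFAULT_PARAMETERS.get(name, {})
    rows = list(DATA_SOURCES[name](**params))

    package_dir = Path(output_root) / name
    package_dir.mkdir(parents=True, exist_ok=True)

    csv_path = package_dir / f"{name}.csv"
    with csv_path.open("w", newline="") as handle:
        fieldnames = list(rows[0].keys()) if rows else []
        writer = csv.DictWriter(handle, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)

    # Copy the audit and metapackage files if they exist next to the data source.
    for filename in EXTRA_FILES:
        candidate = Path(source_dir) / filename
        if candidate.exists():
            shutil.copy(candidate, package_dir / filename)

    return package_dir


def upload_all(
    upload: Callable[[Path], None],
    output_root: Path = Path("packages"),
    source_root: Path = Path("datasources"),
) -> List[Path]:
    """Step 4: build a package for every registered data source and upload it."""
    uploaded = []
    for name in DATA_SOURCES:
        package_dir = build_data_package(name, output_root, Path(source_root) / name)
        upload(package_dir)  # step 3 would plug in the Kaggle uploader here
        uploaded.append(package_dir)
    return uploaded
```

The arguments on `upload_all` are only there for testability; the zero-argument function described in step 4 could be a thin wrapper that binds the real uploader and paths, and that wrapper is what the scheduled job in step 5 would call.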

Metadata

Assignees: No one assigned

Labels: enhancement (New feature or request)
