Description
Trello card: https://trello.com/c/tb08vrGi
We need to upload the datasets generated by our data sources so that they are easily accessible to other teams.
To do this, we need to:
- Create a set of default parameters for each data source; the set may be empty.
- Create a function that, given a data_source, runs it with the defined parameters, stores the result as a CSV, and packs it in a folder with a copy of the audit and metapackage.json files.
- Create a function, using this notebook as a template, that takes the path to a data package and uploads it to Kaggle.
- Create a function that takes no arguments and iterates through the data sources, generating the data packages and uploading them to Kaggle.
- Create a GitHub Action that runs the function from the previous step every 24 hours.
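
The packaging and iteration steps above could be sketched roughly as follows. This is only an illustration under assumptions: the names `build_data_package` and `publish_all`, the registry shape (name mapped to a callable plus its default parameters), and the idea that a data source returns a list of dict rows are all hypothetical and not taken from the codebase. The Kaggle upload is left as an injected `upload` callable, since the real one should follow the notebook. The no-argument entry point described above would be a thin wrapper that binds the real registry, output folder, and uploader.

```python
import csv
import shutil
from pathlib import Path
from typing import Callable, Dict, Iterable, List, Mapping, Tuple

# Assumption: a data source is a callable returning a list of dict rows.
Rows = List[Mapping[str, object]]
DataSource = Callable[..., Rows]
# Hypothetical registry shape: name -> (data source, default parameters).
Registry = Dict[str, Tuple[DataSource, Mapping[str, object]]]


def build_data_package(
    name: str,
    source: DataSource,
    params: Mapping[str, object],
    out_dir: Path,
    extra_files: Iterable[Path] = (),
) -> Path:
    """Run one data source, store its result as CSV, and copy extra files."""
    package_dir = Path(out_dir) / name
    package_dir.mkdir(parents=True, exist_ok=True)

    # Default parameters may be empty, in which case the source runs bare.
    rows = source(**params)

    fieldnames = list(rows[0].keys()) if rows else []
    with (package_dir / f"{name}.csv").open("w", newline="", encoding="utf-8") as fh:
        writer = csv.DictWriter(fh, fieldnames=fieldnames)
        writer.writeheader()
        writer.writerows(rows)

    # Copy e.g. the audit and metapackage.json files next to the CSV.
    for extra in extra_files:
        shutil.copy(extra, package_dir / Path(extra).name)
    return package_dir


def publish_all(
    sources: Registry,
    out_dir: Path,
    upload: Callable[[Path], None],
    extra_files: Iterable[Path] = (),
) -> List[Path]:
    """Build a package for every registered source and hand it to `upload`."""
    extras = list(extra_files)
    published = []
    for name, (source, params) in sources.items():
        package_dir = build_data_package(name, source, params, out_dir, extras)
        upload(package_dir)  # placeholder for the Kaggle upload function
        published.append(package_dir)
    return published
```

Passing the uploader in as an argument keeps the packaging logic testable without Kaggle credentials; the scheduled job would supply the real upload function.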