Skip to content

version check support for downloaded datasets #97

@HowcanoeWang

Description

@HowcanoeWang

Thanks a lot for making this contribution, I will leave this PR for further discussion.

Before commiting, I run the pytest on my local machine every time. This non-reproducible problem is only caused by not updating the latest dataset in Google drive (and also the local Numpy versions).

To my understanding, the Action creates a new virtual envirnment each time to run the test code. But if is on a brand new environment, the pytest can not download the missing dataset automatically and raises FileNotFoundError in this case.

So this online test workflow may not be suitable for the cache rule. It requires the dataset initialize step for everytime in the Action virtual machine, and needs downloads such data again and again.

Personally prefer contributor running the pytest on their local enviroment, just like what scikit-learn and numpy require for pull request.

Then I need to figure out the following items:

  1. A solution to check the local dataset versions and support auto-update if it is out-of-date.
  2. Edit the 5 tests depend on the 3.3GB Lotus() dataset, using the 300MB TestData() only to decrease the required dataset size for testing

Originally posted by @HowcanoeWang in #87 (comment)

Metadata

Metadata

Assignees

Labels

enhancementNew feature or request

Type

No type

Projects

Status

No status

Relationships

None yet

Development

No branches or pull requests

Issue actions