version check support for downloaded datasets

Thanks a lot for making this contribution, I will leave this PR for further discussion.

Before commiting, I run the pytest on my local machine every time. This non-reproducible problem is only caused by not updating the latest dataset in Google drive (and also the local Numpy versions).

To my understanding, the Action creates a new virtual envirnment each time to run the test code. But if is on a brand new environment, the pytest can not download the missing dataset automatically and raises FileNotFoundError in this case. 

So this online test workflow may not be suitable for the cache rule. It requires the dataset initialize step for everytime in the Action virtual machine, and needs downloads such data again and again.

Personally prefer contributor running the pytest on their local enviroment, just like what `scikit-learn` and `numpy` require for pull request.

Then I need to figure out the following items:

1. A solution to check the local dataset versions and support auto-update if it is out-of-date.
2. Edit the 5 tests depend on the 3.3GB `Lotus()` dataset, using the 300MB `TestData()` only to decrease the required dataset size for testing

_Originally posted by @HowcanoeWang in https://github.com/UTokyo-FieldPhenomics-Lab/EasyIDP/issues/87#issuecomment-1489598614_
            

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

version check support for downloaded datasets #97

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

version check support for downloaded datasets #97

Description

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions