Skip to content

Conversation

@ziotom78
Copy link
Contributor

@ziotom78 ziotom78 commented Jun 6, 2023

Currently the code is extremely slow in loading scientific data from HDF5 files. Even if you are interested in a few seconds of data and know the extrema of the time interval, the whole timestream of data (~4 hours) needs to be loaded in memory.

This PR adds the ability to save a Run-Length Encoded (RLE) version of the time stream of each polarimeter in the same HDF5 file. It is typically done when the SQLite3 database used by DataStorage is created. Once the RLE timeline is saved, it will be used whenever scientific data for a specific range of time are requested either by DataFile.load_sci or DataStorage.load_sci. This should reduce the time needed for I/O considerably, expecially if the chunk of data is small.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants