Skip to content

A range of helpful utilities for Python developers including streaming tabular data, date parsing, JSON and YAML handling, dictionary and list utilities

License

Notifications You must be signed in to change notification settings

OCHA-DAP/hdx-python-utilities

Repository files navigation

Build Status Coverage Status Ruff Downloads

The HDX Python Utilities Library provides a range of helpful utilities for Python developers. Note that these are not specific to HDX.

  1. Easy downloading of files with support for authentication, streaming and hashing
  2. Retrieval of data from url with saving to file or from data previously saved
  3. Date utilities
  4. Loading and saving JSON and YAML (maintaining order)
  5. Loading and saving HXLated csv and/or JSON
  6. Dictionary and list utilities
  7. HTML utilities (inc. BeautifulSoup helper)
  8. Compare files (eg. for testing)
  9. Simple emailing
  10. Easy logging setup and error logging
  11. State utility
  12. Path utilities
  13. URL utilities
  14. Text processing
  15. Stable file hashing
  16. Matching utilities
  17. Encoding utilities
  18. Check valid UUID
  19. Easy building and packaging

For more information, please read the documentation.

This library is part of the Humanitarian Data Exchange (HDX) project. If you have humanitarian related data, please upload your datasets to HDX.

Development

Environment

Development is currently done using Python 3.13. The environment can be created with:

    uv sync

This creates a .venv folder with the versions specified in the project's uv.lock file.

Pre-commit

pre-commit will be installed when syncing uv. It is run every time you make a git commit if you call it like this:

    pre-commit install

With pre-commit, all code is formatted according to ruff guidelines.

To check if your changes pass pre-commit without committing, run:

    pre-commit run --all-files

Packages

uv is used for package management. If you’ve introduced a new package to the source code (i.e. anywhere in src/), please add it to the project.dependencies section of pyproject.toml with any known version constraints.

To add packages required only for testing, add them to the [dependency-groups].

Any changes to the dependencies will be automatically reflected in uv.lock with pre-commit, but you can re-generate the files without committing by executing:

    uv lock --upgrade

Project

uv is used for project management. The project can be built using:

    uv build

Linting and syntax checking can be run with:

    uv run ruff check

To run the tests and view coverage, execute:

    uv run pytest

Documentation

The documentation, including API documentation, is generated using ReadtheDocs and MkDocs with Material. As you change the source code, remember to update the documentation at documentation/index.md.

About

A range of helpful utilities for Python developers including streaming tabular data, date parsing, JSON and YAML handling, dictionary and list utilities

Topics

Resources

License

Stars

Watchers

Forks

Packages

No packages published

Contributors 3

  •  
  •  
  •  

Languages