gsoc26: Format Conversion Layer (Layer 2) + CLI refactor (#59, #61) #62
DhanashreePetare wants to merge 7 commits into
Pull Request
Description
Implements Layer 2 (Format Conversion) for the Databus Python Client download pipeline, bringing it to feature parity with the Java client as described in Frey et al. Users can now convert between RDF serialization formats and tabular formats on the fly during download using the new --format flag. This PR also refactors the compression CLI (Issue #61), replacing --convert-to / --convert-from with a single --compression flag.
Related Issues
Issue #59 (Format and Mapping Conversion Layer — Layer 2)
Issue #61 (Refactor CLI compression)
Type of change
Checklist:
poetry run pytest - all tests passed
poetry run ruff check - no linting errors
What was added:
databusclient/filehandling/format.py — Layer 2 with TripleHandler, QuadHandler, TSDHandler classes using rdflib.Graph, rdflib.Dataset, and list[list[str]] as intermediate representations respectively. Each handler exposes read(), write(), and convert().
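The read/write/convert pattern over an intermediate representation can be sketched roughly as follows for the tabular case. This is a minimal illustration using only the standard library, not the code in format.py: the class name TabularSketch, the "csv"/"tsv" format keys, and the method signatures are all assumptions made for this example. The only detail taken from the description above is that list[list[str]] serves as the intermediate representation.

```python
import csv
import io

# Intermediate representation named in the PR description for tabular data.
Table = list[list[str]]


class TabularSketch:
    """Hypothetical stand-in for a tabular handler; not the PR's TSDHandler."""

    # Assumed format keys, chosen for illustration only.
    DELIMITERS = {"csv": ",", "tsv": "\t"}

    def read(self, text: str, fmt: str) -> Table:
        """Parse delimited text into the intermediate list-of-rows form."""
        reader = csv.reader(io.StringIO(text), delimiter=self.DELIMITERS[fmt])
        return [row for row in reader]

    def write(self, rows: Table, fmt: str) -> str:
        """Serialize the intermediate form back to delimited text."""
        out = io.StringIO()
        writer = csv.writer(
            out, delimiter=self.DELIMITERS[fmt], lineterminator="\n"
        )
        writer.writerows(rows)
        return out.getvalue()

    def convert(self, text: str, source: str, target: str) -> str:
        """Convert by reading into the intermediate form, then writing it."""
        return self.write(self.read(text, source), target)


if __name__ == "__main__":
    handler = TabularSketch()
    print(handler.convert('a,b\n1,"x,y"\n', "csv", "tsv"))
```

Routing every conversion through one intermediate form means each format needs only a reader and a writer, rather than a dedicated converter for every source/target pair. The RDF handlers described above presumably follow the same shape with rdflib.Graph and rdflib.Dataset in place of the row list.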
What was changed:
Tests:
Closes #59
Closes #61
Summary by CodeRabbit
New Features
--compression and --format options
Documentation
Tests