Skip to content

Releases: hplt-project/data-analytics-tool

HPLTAnalytics v1.2

29 Sep 14:59
14036b7

Choose a tag to compare

https://github.com/hplt-project/data-analytics-tool/blob/v1.2/CHANGELOG.md

What's Changed

New Contributors

  • @BoFFire made their first contribution in #50

Full Changelog: v1.1...v1.2

HPLTAnalytics v1.1

06 Aug 17:51
5988792

Choose a tag to compare

HPLTAnalytics v1.0

10 Jun 15:23
56c502b

Choose a tag to compare

HPLT Analytics 0.4

16 Apr 15:28

Choose a tag to compare

What's Changed

Full Changelog: v0.3...v0.4

HPLT Analytics 0.3

14 Mar 12:12
2091fd4

Choose a tag to compare

What's Changed

  • New frontend and graphics
  • Integrate WDS (Web Document Scorer)
  • Integrate HeLIPort
  • Support for more metadata (domains, tlds, collections,...)
  • Support for more languages (tokenization, stopwords...)
  • Added PII detection
  • Lighter-weight PDFs
  • Added samples to frontend
  • Added register labels to frontend
  • Added reports for HPLT v2
  • Added tests
  • Libraries version bumps
  • Other minor fixings

Contributors

Full Changelog: v0.2-ALPHA...v0.3

HPLT Analytics 0.2-ALPHA

11 Apr 15:28
944a6d2

Choose a tag to compare

What's Changed

Full Changelog: v0.1-ALPHA...v0.2-ALPHA

(Changelog)

HPLT Analytics 0.1-ALPHA

30 Aug 11:44
2f7d2d8

Choose a tag to compare

First released version.

Some known issues:

  • No GPU support.
  • Corpus names cannot contain strings such as ".tsv"
  • OOM errors on very big corpora processing.
  • YAML files will be overwritten if a new one is processed with the same name.

Will be fixed on next releases.