Skip to content
Change the repository type filter

All

    Repositories list

    • Python
      0000Updated Apr 30, 2026Apr 30, 2026
    • orqa

      Public
      Jupyter Notebook
      0210Updated Apr 29, 2026Apr 29, 2026
    • radler

      Public
      A novel solution to produce clean samples of datasets with duplicates following a target group distribution
      Python
      2300Updated Mar 16, 2026Mar 16, 2026
    • GitHub repository for the Big Data Management and Governance course labs at the University of Modena and Reggio Emilia, taught by Prof. Giovanni Simonini.
      0000Updated Dec 14, 2024Dec 14, 2024
    • bento

      Public
      The aim of this tool is to compare several frameworks who manage DataFrames on common operations of data preparation.
      Python
      3600Updated Sep 5, 2024Sep 5, 2024
    • A script that given a BibTeX generates a webpage of the publications divided by year
      TeX
      0000Updated Feb 28, 2024Feb 28, 2024
    • sloth

      Public
      Reference paper: "Determining the Largest Overlap between Tables" (Luca Zecchini, Tobias Bleifuß, Giovanni Simonini, Sonia Bergamaschi, Felix Naumann). Proceedi…
      Python
      1500Updated Oct 20, 2023Oct 20, 2023
    • BrewER

      Public
      Reference paper: "Entity Resolution On-Demand" (Giovanni Simonini, Luca Zecchini, Sonia Bergamaschi, Felix Naumann). Proceedings of the VLDB Endowment (PVLDB), …
      Python
      3600Updated Sep 8, 2023Sep 8, 2023
    • Reference paper: Luca Zecchini, Giovanni Simonini, Sonia Bergamaschi, Felix Naumann: "BrewER: Entity Resolution On-Demand". Demo paper submitted to Proceedings …
      Python
      1300Updated Jul 18, 2023Jul 18, 2023
    • sparker

      Public
      SparkER: an Entity Resolution framework for Apache Spark
      Scala
      GNU General Public License v3.0
      19000Updated Oct 12, 2022Oct 12, 2022
    • 0000Updated Mar 15, 2022Mar 15, 2022
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.