SparkER: an Entity Resolution framework for Apache Spark
-
Updated
Mar 29, 2024 - Scala
SparkER: an Entity Resolution framework for Apache Spark
Minoan ER is an Entity Resolution (ER) framework, built by researchers in Crete (the land of the ancient Minoan civilization). Entity resolution aims to identify descriptions that refer to the same entity within or across knowledge bases.
Parallel Meta-blocking in MapReduce
Reproducibility experiments for Generalized Supervised Meta-blocking
Addressed Entity Resolution challenges. Tasks include schema-agnostic blocking, pairwise comparisons, Meta-Blocking graph construction, and Jaccard similarity computation. Deliverables include source code, reports, and reproducibility guidelines in Python
Add a description, image, and links to the meta-blocking topic page so that developers can more easily learn about it.
To associate your repository with the meta-blocking topic, visit your repo's landing page and select "manage topics."