RulER is a tool for Apache Spark that uses a novel technique that allows to find similar records by applying complex joining rules on one or more attributes.
If use this library, please cite:
- Gagliardelli, L., Simonini, G., & Bergamaschi, S. (2020). RulER: Scaling Up Record-level Matching Rules. In EDBT 2020: 23nd International Conference on Extending Database Technology.
A brief presentation about RulER is available by clicking on the image below
For any questions about RulER write us at name.surname@unimore.it
- Luca Gagliardelli
- Giovanni Simonini