diff --git a/README.md b/README.md
index b8ba635666..f697a1755e 100644
--- a/README.md
+++ b/README.md
@@ -16,11 +16,11 @@ Splink is a Python package for probabilistic record linkage (entity resolution)
 ## Key Features
-⚡ **Speed:** Capable of linking a million records on a laptop in around a minute.
-🎯 **Accuracy:** Support for term frequency adjustments and user-defined fuzzy matching logic.
-🌐 **Scalability:** Execute linkage in Python (using DuckDB) or big-data backends like AWS Athena or Spark for 100+ million records.
-🎓 **Unsupervised Learning:** No training data is required for model training.
-📊 **Interactive Outputs:** A suite of interactive visualisations help users understand their model and diagnose problems.
+⚡ **Speed:** Capable of linking a million records on a laptop in around a minute.
+🎯 **Accuracy:** Support for term frequency adjustments and user-defined fuzzy matching logic.
+🌐 **Scalability:** Execute linkage in Python (using DuckDB) or big-data backends like AWS Athena or Spark for 100+ million records.
+🎓 **Unsupervised Learning:** No training data is required for model training.
+📊 **Interactive Outputs:** A suite of interactive visualisations help users understand their model and diagnose problems.
Splink's linkage algorithm is based on Fellegi-Sunter's model of record linkage, with various customisations to improve accuracy.