Releases: gandersen101/spaczz
Releases · gandersen101/spaczz
v0.3.1: Adding SimilarityMatcher
Added an experimental similarity matcher that matches phrase patterns in the text based on word vector similarity.
v0.3.0: Returning Fuzzy Match Information
FuzzyMatcher
and RegexMatcher
now return fuzzy ratio and counts information respectively. The SpaczzRuler
sets this information as custom Span
attributes. Bug fix with spaczz ruler's attr
parameter that prevented it from actually modifying the attr.
v0.2.0: RapidFuzz and ent ids
v0.2.0 Release Notes:
- Fuzzy matching is now performed with RapidFuzz instead of FuzzyWuzzy.
- RapidFuzz is higher performance with a more liberal license.
- The spaczz ruler now automatically sets a custom, boolean, Span attribute on all entities it adds.
- This is set by the
attr
parameter duringSpaczzRuler
instantiation and defaults to: "spaczz_ent". - For example: an entity set by the spaczz ruler will have
ent._.spaczz_ent
set toTrue
.
- This is set by the
- Spaczz ruler patterns now support optional "id" values like spaCy's entity ruler. See this spaCy documentation for usage details.
- Automated Windows testing is now part of the build process.
v01.1. Integrating Read the Docs
Read the Docs integration and pyproject.toml now single source of version truth.
v0.1.0 Initial release
Initial release for spaczz, a package that adds fuzzy matching and additional regex support for spaCy.