Skip to content

Releases: gandersen101/spaczz

v0.3.1: Adding SimilarityMatcher

12 Nov 17:24
89eadfa
Compare
Choose a tag to compare

Added an experimental similarity matcher that matches phrase patterns in the text based on word vector similarity.

v0.3.0: Returning Fuzzy Match Information

23 Sep 15:02
4dafe0a
Compare
Choose a tag to compare

FuzzyMatcher and RegexMatcher now return fuzzy ratio and counts information respectively. The SpaczzRuler sets this information as custom Span attributes. Bug fix with spaczz ruler's attr parameter that prevented it from actually modifying the attr.

v0.2.0: RapidFuzz and ent ids

18 Aug 02:46
Compare
Choose a tag to compare

v0.2.0 Release Notes:

  • Fuzzy matching is now performed with RapidFuzz instead of FuzzyWuzzy.
    • RapidFuzz is higher performance with a more liberal license.
  • The spaczz ruler now automatically sets a custom, boolean, Span attribute on all entities it adds.
    • This is set by the attr parameter during SpaczzRuler instantiation and defaults to: "spaczz_ent".
    • For example: an entity set by the spaczz ruler will have ent._.spaczz_ent set to True.
  • Spaczz ruler patterns now support optional "id" values like spaCy's entity ruler. See this spaCy documentation for usage details.
  • Automated Windows testing is now part of the build process.

v01.1. Integrating Read the Docs

06 Jul 21:52
239107d
Compare
Choose a tag to compare

Read the Docs integration and pyproject.toml now single source of version truth.

v0.1.0 Initial release

06 Jul 20:44
4bda325
Compare
Choose a tag to compare

Initial release for spaczz, a package that adds fuzzy matching and additional regex support for spaCy.