From e1845708bb5a6da0b58eb15f9361e3ee042db8ea Mon Sep 17 00:00:00 2001 From: gandersen101 Date: Wed, 24 Feb 2021 20:33:15 -0600 Subject: [PATCH] Update readme. --- README.md | 4 +++- 1 file changed, 3 insertions(+), 1 deletion(-) diff --git a/README.md b/README.md index e3c8ec5..c6b2fe2 100644 --- a/README.md +++ b/README.md @@ -699,7 +699,9 @@ For example if we are looking for the string `"Ireland"` and have the patterns ` The above problem is twofold. First and foremost, I need to develop a way or ways to compare apples to oranges - fuzzy ratios and fuzzy regex counts. Then I need to figure out how to include match details from the `TokenMatcher` which supports both fuzzy and "fuzzy" regex matches. -For a short-term solution I am having the entity ruler first go through sorted fuzzy matches, then sorted regex matches, and lastly token matches (which will only be sorted by length of match, not quality). There is additional logic in place to filter overlapping matches preserving earlier matches over later ones. This order of priority may not be ideal for everyone but adding a way to change the order (say regex patterns first) would a temporary solution to a temporary problem. +For a short-term solution I am having the entity ruler first go through sorted fuzzy matches, then sorted regex matches, and lastly token matches. Token matches will only be sorted by length of match, not quality, so they may provide inconsistent results. Try to be mindful of your token patterns. + +There is additional logic in place to filter overlapping matches preserving earlier matches over later ones. This order of priority (fuzzy, regex, token) may not be ideal for everyone but adding a way to change the order (say regex patterns first) would a temporary solution to a temporary problem. Please bear with me through these growing pains.