You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I did some code for this over the weekend, but shelved it. I decided it's better to have false positives than false negatives and that there was no way to avoid false negatives (imagine a doc with actual dates redacted).
I'd like to see how many false positives this kind of thing creates. If it's a common pattern to put date strings into redactions, maybe we can take another swing at this.
It seems to be common to put dates under the redaction boxes, as you can see in the highlighted screenshot below:
Note that the date isn't actually relevant semantically to the sentence. Looking throughout the redactions of this document:
You see a pattern that the text is always the same date. When this is the case, we should nuke all such redactions from our list as false positives.
gov.uscourts.cacd.45170.569.9_2.pdf
The text was updated successfully, but these errors were encountered: