-
Notifications
You must be signed in to change notification settings - Fork 1
Simple code to convert pdf/s to image files and use Tesseract OCR on these image files to extract text from them. This code focuses on extracting Batch No. from pharmacy bills using RegEx. None of the actual pdfs and files could be added as all data used was real life/sensitive data.
avinxxsh/realDataOCR
Folders and files
Name | Name | Last commit message | Last commit date | |
---|---|---|---|---|
About
Simple code to convert pdf/s to image files and use Tesseract OCR on these image files to extract text from them. This code focuses on extracting Batch No. from pharmacy bills using RegEx. None of the actual pdfs and files could be added as all data used was real life/sensitive data.
Topics
Stars
Watchers
Forks
Releases
No releases published
Packages 0
No packages published