Ocr converter is an application that helps convert pdf to text/word document
- Install Poppler on your target machine.
It's an open source set of libraries and command line tools, very useful for dealing with PDF files. We used the
pdftocairo
utility to export pdf to images - Install Tessaract OCR Engine
Install the dependencies and devDependencies and start the server.
$ cd ocr-converter
$ npm install
$ node index.js
$ node index.js [command] --filepath [filepath] --destination [destionationpath]
Want to contribute? Great!
Clone the repo!
MIT
Free Software!