omob/ocr-converter

OCR Converter

OCR Converter is an application that converts PDF files to text or Word documents.

Requirements

  1. Install Poppler on the target machine. Poppler is an open-source set of libraries and command-line tools for working with PDF files. This project uses its pdftocairo utility to export PDF pages to images.
  2. Install the Tesseract OCR engine.
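
The two requirements map onto a two-step pipeline: pdftocairo renders PDF pages to images, and Tesseract reads text from those images. The following is a minimal sketch of how the two tools could be invoked from Node.js. It is not the project's actual code. The function names (`isInstalled`, `pdftocairoArgs`, `tesseractArgs`), the 300 DPI resolution, and the `eng` language are all assumptions chosen for illustration.

```javascript
const { execFileSync } = require("node:child_process");

// Returns true if the tool can be started with its version flag.
// Assumption: the tool is expected to be on PATH.
function isInstalled(tool, versionFlag) {
  try {
    execFileSync(tool, [versionFlag], { stdio: "ignore" });
    return true;
  } catch {
    return false;
  }
}

// Arguments for pdftocairo: render every page of pdfPath as a PNG.
// Output files are numbered and share the prefix outPrefix.
// Assumption: 300 DPI is an illustrative default, not a project setting.
function pdftocairoArgs(pdfPath, outPrefix, dpi = 300) {
  return ["-png", "-r", String(dpi), pdfPath, outPrefix];
}

// Arguments for tesseract: read imagePath and write <outBase>.txt.
// Assumption: "eng" is an illustrative default language.
function tesseractArgs(imagePath, outBase, lang = "eng") {
  return [imagePath, outBase, "-l", lang];
}
```

A caller could first check `isInstalled("pdftocairo", "-v")` and `isInstalled("tesseract", "--version")`, then pass the argument arrays to `execFileSync` for each step.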

Installation

Install the dependencies and devDependencies and start the server.

$ cd ocr-converter
$ npm install
$ node index.js

To run using command-line arguments:

$ node index.js [command] --filepath [filepath] --destination [destinationpath]
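
For readers curious how an invocation of this shape can be read in Node.js, here is a minimal sketch using the built-in `parseArgs` helper from `node:util` (available in Node.js 18.3 and later). It is an illustration only: the project's `index.js` may parse its arguments differently, and `parseCli` is a hypothetical name.

```javascript
const { parseArgs } = require("node:util");

// Parse an argument list of the form:
//   [command] --filepath [filepath] --destination [destinationpath]
// Hypothetical helper; the real index.js may use a different approach.
function parseCli(argv) {
  const { values, positionals } = parseArgs({
    args: argv,
    options: {
      filepath: { type: "string" },
      destination: { type: "string" },
    },
    allowPositionals: true, // the leading [command] is a positional argument
  });
  return {
    command: positionals[0],
    filepath: values.filepath,
    destination: values.destination,
  };
}
```

In a script, `parseCli(process.argv.slice(2))` would apply this to the arguments given on the command line.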

Development

Want to contribute? Great!

Clone the repo!

License

MIT

Free Software!
