extracts and analyzes leaflets from www.prospektangebote.de
- the html pages are parsed in a non intelligent way
- the found images are downloaded and
- the images are analyzed via tesseract ocr
TODO: implement search on textfiles
extracts and analyzes leaflets from www.prospektangebote.de
TODO: implement search on textfiles