From e7f1a2d5c5a0c5d5e897225d4465062ebb11b6e1 Mon Sep 17 00:00:00 2001 From: Tom Date: Fri, 24 Apr 2015 09:53:40 -0700 Subject: [PATCH] Update README.md --- README.md | 5 +++++ 1 file changed, 5 insertions(+) diff --git a/README.md b/README.md index 0100fe4b..0de25fea 100644 --- a/README.md +++ b/README.md @@ -88,6 +88,11 @@ options and currently requiresloading training data into HDF5 files, instead of being trained off a list of image files directly (image file-based training will be added to `clstmctc` soon). +The CLSTM project also provides LSTM-based language modeling that works very +well with post-processing and correcting OCR output, as well as solving a number +of other OCR-related tasks, such as dehyphenation or changes in orthography +(see our publications). You can train language models using `clstmtext`. + Generally, your best bet for CLSTM and OCRopy is to rely only on the command line tools; that makes it easy to replace different components. In addition, you should keep your OCR training data in .png/.gt.txt files so that you can easily