transformers-text-recognition

Architecture of transformer-text-recognition model

This project applies a transformer to recognize text in images. The model takes an image as input and outputs the word shown in that image. A convolutional network first extracts features from the input image. The extracted features are then fed to the transformer as its source sequence (the "input sentence"), so the model is trained to translate an image into text.
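
The README does not say how the convolutional features are turned into an "input sentence". A minimal sketch of one common approach, assuming the feature map has shape channels x height x width and that each spatial position becomes one token, could look like this. The function name `feature_map_to_sequence` and the toy values are hypothetical and are not taken from the repository.

```python
from typing import List

# Assumed layout: feature_map[channel][row][column]
FeatureMap = List[List[List[float]]]


def feature_map_to_sequence(feature_map: FeatureMap) -> List[List[float]]:
    """Flatten a C x H x W feature map into H*W tokens of dimension C."""
    channels = len(feature_map)
    height = len(feature_map[0])
    width = len(feature_map[0][0])
    sequence = []
    # Walk positions row by row so the tokens follow reading order.
    for row in range(height):
        for col in range(width):
            token = [feature_map[ch][row][col] for ch in range(channels)]
            sequence.append(token)
    return sequence


# Toy feature map (illustrative values): 2 channels, 1 row, 3 columns.
toy_map = [
    [[1.0, 2.0, 3.0]],
    [[10.0, 20.0, 30.0]],
]
tokens = feature_map_to_sequence(toy_map)
print(len(tokens), len(tokens[0]))  # number of tokens, token dimension
```

Each token plays the role of a word embedding in a translation model, so the transformer encoder can treat the image as a sequence.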

How to run

Download dataset from here
The trained models can be downloaded from here

Install

# Python 3.7
pip install --upgrade pip
pip install -r requirements.txt

Demo

python run_demo_server.py --port PORT --model_folder FOLDER_PATH

PORT: port the server listens on (by default the server runs at http://localhost:9595)
model_folder: folder that contains the trained model
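
As an illustration of the two options above, the sketch below parses `--port` and `--model_folder` with `argparse`. It is not the code in `run_demo_server.py`. The flag names and the default port 9595 come from this README; the help texts, the missing default for `--model_folder`, and the example folder `./checkpoints` are assumptions.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser mirroring the demo options described in this README."""
    parser = argparse.ArgumentParser(description="Demo server options (illustrative)")
    # Default port taken from the README: http://localhost:9595
    parser.add_argument("--port", type=int, default=9595,
                        help="port the server listens on")
    # Assumption: the folder is given explicitly and has no default.
    parser.add_argument("--model_folder", type=str, default=None,
                        help="folder that contains the trained model")
    return parser


# Parse an explicit argument list so the example runs without a real CLI call.
args = build_parser().parse_args(["--model_folder", "./checkpoints"])
print(f"http://localhost:{args.port}", args.model_folder)
```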

Training

python training.py --model_type MODEL_TYPE

model_type:
- 1: transformer-random-trg
- 2: transformer-no-trg
- 3: transformer-no-decoder
- 4: transformer-trg-same-src
- 5: transformer
The trained model is saved to ./checkpoints/{model_type}.pt
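
The mapping from `model_type` to a checkpoint file can be sketched as below. The README does not state whether `{model_type}` is replaced by the numeric id or by the variant name, so the hypothetical helper `checkpoint_path` supports both through an assumed `use_name` switch. The ids and names are the ones listed above.

```python
from pathlib import Path

# Model variants as listed in this README.
MODEL_TYPES = {
    1: "transformer-random-trg",
    2: "transformer-no-trg",
    3: "transformer-no-decoder",
    4: "transformer-trg-same-src",
    5: "transformer",
}


def checkpoint_path(model_type: int, use_name: bool = True,
                    root: str = "./checkpoints") -> Path:
    """Return the assumed checkpoint location for a model type id."""
    if model_type not in MODEL_TYPES:
        raise ValueError(f"model_type must be one of {sorted(MODEL_TYPES)}")
    # Assumption: the file is named after either the variant name or the id.
    label = MODEL_TYPES[model_type] if use_name else str(model_type)
    return Path(root) / f"{label}.pt"


print(checkpoint_path(5).name)
print(checkpoint_path(5, use_name=False).name)
```

Checking the `checkpoints` folder after a training run shows which naming the script actually uses.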

Eval

python evaluate.py --model_type MODEL_TYPE

model_type:
- 1: transformer-random-trg
- 2: transformer-no-trg
- 3: transformer-no-decoder
- 4: transformer-trg-same-src
- 5: transformer

