TE-AI-Cup: ML Automation for Lot History Record Digitization

This project is an end-to-end system that automates the recognition of handwritten data on Lot History Record (LHR) sheets. It combines machine learning models with a web-based UI and exports the extracted data in an SAP-compatible format. It achieves 98% detection accuracy and cuts processing time from 8 hours to 2 minutes.

Key Features

UI-Driven Workflow: An intuitive web interface streamlines user interaction for uploading files and viewing results.
High Accuracy: Powered by YOLO and OpenCV, achieving 98% accuracy in handwritten data recognition.
Time Efficiency: Reduces manual processing from hours to minutes.
SAP Integration: Outputs data in an SAP-compatible format, simplifying enterprise adoption.
Scalability: Deployed on AWS Cloud for reliable and scalable performance.

System Workflow

Backend Workflow

Input Handling:
- Accepts single PDF files or folders via the UI or CLI.
PDF Conversion:
- Converts PDFs into PNG images using convert_pdf_to_png.
Pre-Processing:
- Enhances images using OpenCV for optimal OCR performance.
OCR:
- Extracts handwritten data using DocTR and processes specific table columns.
Image Classification:
- YOLOv8m models detect and classify objects in specified columns.
Validation:
- Validation scripts check that the extracted data is accurate and correctly formatted.
Output Generation:
- Consolidates and formats results into Excel files compatible with SAP.
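
The PDF conversion step can be sketched as follows. This is a minimal illustration, not the project's actual `convert_pdf_to_png`: it assumes the `pdf2image` package from the install list (which needs Poppler), and the `dpi` default, the output file layout, and the helper `page_png_path` are hypothetical.

```python
from pathlib import Path


def page_png_path(pdf_path, out_dir, page_number):
    """Build an output name such as <out_dir>/<pdf stem>_page_001.png (hypothetical layout)."""
    stem = Path(pdf_path).stem
    return Path(out_dir) / f"{stem}_page_{page_number:03d}.png"


def convert_pdf_to_png(pdf_path, out_dir, dpi=300):
    """Render every page of one PDF to a PNG file and return the written paths."""
    # Imported here so the naming helper works even without pdf2image/Poppler installed.
    from pdf2image import convert_from_path

    out_dir = Path(out_dir)
    out_dir.mkdir(parents=True, exist_ok=True)
    written = []
    # convert_from_path returns one PIL image per page.
    for number, page in enumerate(convert_from_path(str(pdf_path), dpi=dpi), start=1):
        target = page_png_path(pdf_path, out_dir, number)
        page.save(target, "PNG")
        written.append(target)
    return written
```

Zero-padded page numbers keep the PNGs in page order when later steps list the output folder alphabetically.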

User Interface (UI)

File Upload:
- Users can upload PDFs or folders directly from the browser.
Real-Time Feedback:
- Displays processing status and logs.
Result Download:
- Outputs can be downloaded directly in Excel format.
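
The upload step could look roughly like the sketch below. It assumes the UI is a Flask app (Flask is in the dependency list); the `/upload` route, the `files` form field, and the `UPLOAD_DIR` setting are hypothetical and are not taken from `ui/app.py`.

```python
from pathlib import Path

from flask import Flask, jsonify, request
from werkzeug.utils import secure_filename

app = Flask(__name__)
app.config["UPLOAD_DIR"] = "uploads"  # hypothetical upload location


@app.route("/upload", methods=["POST"])  # hypothetical route name
def upload():
    """Save uploaded PDFs and report which files were accepted or rejected."""
    upload_dir = Path(app.config["UPLOAD_DIR"])
    saved, rejected = [], []
    # A folder upload from the browser arrives as several files under one field.
    for item in request.files.getlist("files"):
        name = secure_filename(item.filename or "")
        if not name.lower().endswith(".pdf"):
            rejected.append(item.filename)
            continue
        upload_dir.mkdir(parents=True, exist_ok=True)
        item.save(upload_dir / name)
        saved.append(name)
    status = 200 if saved else 400
    return jsonify(saved=saved, rejected=rejected), status
```

Passing names through `secure_filename` before saving prevents a crafted filename from writing outside the upload directory.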

File Structure

main/
│
├── pdf_converter.py     # Converts PDFs into images
├── ocr_det.py           # Handles OCR detection and data extraction
├── tt.py                # Processes images for object tracking
├── firstWord.py         # Detects key words in specific table columns
├── yolo_pred.py         # Implements YOLO-based object detection
├── popSheet.py          # Consolidates and generates Excel output
├── validate.py          # Validates extracted data for accuracy
├── ui/                  # Contains code for the web interface
│   ├── app.py           # Main UI logic
│   ├── templates/       # HTML templates for the web interface
│   └── static/          # CSS and JavaScript for the frontend

Setup Instructions

Prerequisites

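Python: Version 3.8 or later is recommended. The Conda environment below uses 3.8, and current releases of dependencies such as NumPy and Flask no longer support 3.6.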
Anaconda/Miniconda: Recommended for environment management.
Poppler: Necessary for PDF-to-image conversion.

Installation

Clone the Repository:

```
git clone <repository_url>
cd <repository_directory>
```

Set Up Virtual Environment: Open a terminal and navigate to the project directory. Create a new Conda environment:
```
conda create -n lhr-digitization python=3.8
conda activate lhr-digitization
```

Install Required Dependencies:

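```
pip install opencv-python-headless pdf2image "python-doctr[torch]" flask numpy
```
The quotes around `python-doctr[torch]` keep shells such as zsh from treating the square brackets as a glob pattern.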

Install Poppler:

macOS:
```
brew install poppler
```

Linux:
```
sudo apt-get install poppler-utils
```
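
To confirm that Poppler is visible before running a conversion, a small check like the one below can help. It is an illustrative sketch: `pdf2image` calls Poppler's `pdftoppm` tool, so the check looks for that executable on `PATH`. The function name `poppler_available` is hypothetical.

```python
import shutil


def poppler_available(tool="pdftoppm"):
    """Return True if the given Poppler command-line tool is on PATH."""
    return shutil.which(tool) is not None


if __name__ == "__main__":
    if poppler_available():
        print("Poppler found: PDF conversion should work.")
    else:
        print("Poppler not found: install it before converting PDFs.")
```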

Run the Web Interface:
```
python ui/app.py
```
Access the UI at http://127.0.0.1:5000 in your browser.

Running the Backend

Command-Line Interface (CLI)

Process a Single PDF:

```
python main.py --input <file_path> --is_file True
```

Process a Folder:
```
python main.py --input <folder_path>
```
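
The two invocations above imply argument handling along the following lines. This is a sketch of one way to parse them, not the project's actual `main.py`; the helper names `str_to_bool`, `parse_args`, and `collect_pdfs` are hypothetical. Because `--is_file` is passed a literal `True`, the sketch parses the string explicitly: `argparse` with `type=bool` would treat any non-empty string, including `False`, as true.

```python
import argparse
from pathlib import Path


def str_to_bool(value):
    """Parse 'True'/'False'-style strings into a real boolean."""
    lowered = value.strip().lower()
    if lowered in {"true", "1", "yes"}:
        return True
    if lowered in {"false", "0", "no"}:
        return False
    raise argparse.ArgumentTypeError(f"expected a boolean, got {value!r}")


def parse_args(argv=None):
    """Parse --input and --is_file as shown in the commands above."""
    parser = argparse.ArgumentParser(description="Digitize LHR PDFs (illustrative sketch)")
    parser.add_argument("--input", required=True, help="PDF file or folder of PDFs")
    parser.add_argument("--is_file", type=str_to_bool, default=False,
                        help="True when --input is a single PDF")
    return parser.parse_args(argv)


def collect_pdfs(input_path, is_file):
    """Return the PDFs to process: the file itself, or every *.pdf in the folder."""
    path = Path(input_path)
    if is_file:
        return [path]
    return sorted(path.glob("*.pdf"))
```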

Web Interface

Upload PDF files or folders via the UI.
Monitor the real-time progress on the web interface.
Download processed results in Excel format.

Workflow Example

PDF Input:
- A folder containing multiple LHR PDFs is uploaded via the UI.
Processing:
- PDFs are converted into images and processed for OCR and object detection.
Validation:
- Extracted data is validated for accuracy.
Excel Output:
- Processed data is consolidated into Excel files and made available for download.

Additional Notes

Temp Tables:
- Temporary files are created during processing and stored in tempTables_.
Validation:
- The validate.py script ensures the extracted data meets quality requirements.
Customizations:
- Adjust the backend scripts to handle additional use cases or different document layouts.
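
As a starting point for customizing validation, a per-row check might look like the sketch below. The rules in `validate.py` are not described here, so everything in this example is an assumption: the field names `lot_number` and `quantity`, the rule that quantity must be a whole number, and the function name `validate_row` are all hypothetical.

```python
def validate_row(row, required=("lot_number", "quantity")):
    """Return a list of problems found in one extracted row (empty list means valid)."""
    problems = []
    for field in required:
        value = row.get(field)
        if value is None or not str(value).strip():
            problems.append(f"missing {field}")
    quantity = row.get("quantity")
    text = "" if quantity is None else str(quantity).strip()
    # OCR often confuses the letter O with the digit 0, so flag non-numeric quantities.
    if text and not text.isdigit():
        problems.append(f"quantity is not a whole number: {text!r}")
    return problems
```

Returning a list of problems rather than a single boolean makes it possible to show an operator exactly which cells need manual review.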

Repository: yiyaozzz/TE-AI-Cup