This web application lets users upload images containing text in Hindi and English. The app extracts the text using OCR and provides keyword search over the extracted text.
- Upload an image (JPEG, PNG).
- Extract text from images using Tesseract OCR.
- Search for specific keywords in the extracted text.
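
The extraction and search features listed above could be sketched roughly as follows. This is a minimal illustration, not the code in this repository: the function names `extract_text` and `search_keyword` are hypothetical, and the language string `hin+eng` assumes that the Hindi and English Tesseract language data are installed.

```python
def extract_text(image_path: str, languages: str = "hin+eng") -> str:
    """Run Tesseract OCR on an image file and return the recognized text.

    Hypothetical helper. The imports are done lazily so that the search
    helper below can be used even when the OCR packages are not installed.
    """
    from PIL import Image
    import pytesseract

    with Image.open(image_path) as image:
        # "hin+eng" asks Tesseract to use both the Hindi and English models.
        return pytesseract.image_to_string(image, lang=languages)


def search_keyword(text: str, keyword: str) -> list[str]:
    """Return the lines of `text` that contain `keyword`.

    Matching is case-insensitive for English; casefold() leaves
    Devanagari text unchanged, so Hindi keywords match as typed.
    """
    needle = keyword.strip().casefold()
    if not needle:
        return []
    return [
        line.strip()
        for line in text.splitlines()
        if needle in line.casefold()
    ]
```

For example, `search_keyword(extract_text("sample.png"), "नमस्ते")` would return every extracted line containing that word, where `sample.png` is a placeholder file name.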
- Clone this repository: git clone
- Install the required Python packages: pip install -r requirements.txt
- Install Tesseract OCR:
  - On Ubuntu: sudo apt-get install tesseract-ocr
  - On Windows, download and install Tesseract.
- Install the required dependencies and libraries: pip install pytesseract Pillow streamlit torch transformers
- Run the application: streamlit run anu.py
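
Before running the app, it can help to confirm that Tesseract is on the PATH and that both Hindi (`hin`) and English (`eng`) language data are available. The sketch below is not part of this repository; the helper names are hypothetical, and it relies on the `tesseract --list-langs` command. On Ubuntu the Hindi data usually comes from a separate package (`tesseract-ocr-hin`), so a missing `hin` entry is a likely cause of empty Hindi output.

```python
import shutil
import subprocess


def parse_languages(output: str) -> list[str]:
    """Extract language codes from `tesseract --list-langs` output."""
    codes = []
    for line in output.splitlines():
        line = line.strip()
        # Skip blank lines and the "List of available languages ..." header.
        if not line or line.lower().startswith("list of"):
            continue
        codes.append(line)
    return codes


def installed_languages() -> list[str]:
    """Return installed Tesseract language codes, or [] if Tesseract is not found."""
    if shutil.which("tesseract") is None:
        return []
    result = subprocess.run(
        ["tesseract", "--list-langs"],
        capture_output=True,
        text=True,
        check=False,
    )
    # Older Tesseract releases print the list to stderr, so read both streams.
    # Any warning lines Tesseract prints would also end up in this list.
    return parse_languages(result.stdout + "\n" + result.stderr)


def missing_languages(required=("hin", "eng")) -> list[str]:
    """Return the required language codes that Tesseract does not report."""
    available = set(installed_languages())
    return [code for code in required if code not in available]
```

Calling `missing_languages()` returns an empty list when both languages are installed; otherwise it lists the codes that still need to be installed.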
This project is licensed under the MIT License.