Skip to content

Chemical Estrogenicity Classification using OCR and Machine Learning

Notifications You must be signed in to change notification settings

newflux/Estrogenic

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

12 Commits
 
 
 
 
 
 

Repository files navigation

Chemical Estrogenicity Classification using OCR and Machine Learning

This project classifies chemicals as Estrogenic or Non-Estrogenic using Optical Character Recognition (OCR) and Machine Learning (ML). The system takes text or image input, extracts chemical information via OCR, and processes it through an ML model. The deployment is handled using Streamlit for an interactive user interface.

loom-video.mp4

Project Overview

  • Input: Chemical name (via text or image)

  • Processing:

    • Extract text from images using OCR (Tesseract/Pytesseract)
  • Preprocess the extracted text for ML classification
  • Classify chemicals using a trained ML model
  • Output: Classification as Estrogenic or Non-Estrogenic

  • Deployment: Interactive Streamlit web application


Installation

1. Clone the Repository

git clone https://github.com/your-repo/Estrogenic.git
cd Estrogenic

2. Create a Virtual Environment (Optional but Recommended)

python -m venv venv
venv\Scripts\activate

3. Install Dependencies

pip install -r requirements.txt


Usage

1. Run the Streamlit Application

streamlit run app.py

2. Input Options

  • Upload an Image: The system extracts chemical names using OCR and classifies them.
  • Enter Chemical Name: Direct text input for classification.

3. View Results

  • The application displays whether the chemical is Estrogenic or Non-Estrogenic.
  • Confidence scores and additional details are provided.

About

Chemical Estrogenicity Classification using OCR and Machine Learning

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published