Skip to content

Latest commit

 

History

History
30 lines (24 loc) · 1.13 KB

Readme.md

File metadata and controls

30 lines (24 loc) · 1.13 KB

Covid 19 Data Analysis

This repository contains a Python-based implementation of a data preprocessing pipeline. The model performs essential preprocessing tasks required for machine learning workflows. The steps include handling missing data, encoding categorical data, splitting the dataset into training and testing sets, and feature scaling.

Features

  1. Data Preprocessing

    • Handling missing data.
    • Encoding categorical data (both independent and dependent variables).
    • Splitting data into training and testing sets.
    • Feature scaling (standardization).
  2. User-Friendly Interface.

    • Easily adaptable to different datasets.
    • Modular code structure for each preprocessing step.

Dataset

The model uses the Covid_Data.csv file, which contains anonymized COVID-related data. Ensure the dataset is present in the working directory.

File Structure

|-- Covid_Data.csv
|-- Cleaned_Covid_Data.csv
|-- model.ipynb
|-- README.md

License

This project is licensed under the MIT License.

Author

Created by Arpan Surin. Feel free to contact me for any questions or suggestions!