Heart Attack Risk Analysis and Prediction

This project analyzes a dataset related to heart attack occurrences, performs exploratory data analysis (EDA), and builds predictive models to assess an individual’s risk of experiencing a heart attack. The objective is to provide actionable insights and predictive capabilities for healthcare professionals and organizations.

Project Overview

Heart disease is a leading cause of mortality worldwide. This project aims to uncover patterns in heart attack risk factors through EDA and build machine learning models to predict the likelihood of a heart attack based on key features such as cholesterol levels, age, exercise-induced angina, and more.

Dataset Description

The dataset used in this project contains records of patients and their associated health metrics. It includes variables such as cholesterol levels, resting blood pressure, maximum heart rate achieved, and exercise-induced angina.

Source: Heart Attack Analysis & Prediction Dataset

Features and Target Variable

Key Features:
- Age
- Gender
- Cholesterol Levels
- Resting Blood Pressure
- Maximum Heart Rate Achieved
- Exercise-Induced Angina
- Other relevant metrics
Target Variable:
- Output (1 = More likely to have a heart attack, 0 = Less likely)

Methods and Tools

Programming Language: Python
Libraries Used:
- Data Analysis: pandas, numpy
- Data Visualization: matplotlib, seaborn
- Machine Learning: scikit-learn, xgboost, tensorflow
Jupyter Notebook: The analysis and modeling are documented in an interactive notebook.

Project Workflow

Exploratory Data Analysis (EDA):
- Visualization of distributions and correlations among features.
- Identification of key trends and outliers in the dataset.
- Feature selection based on domain knowledge and statistical methods.
Data Preprocessing:
- Handling missing values.
- Standardizing and normalizing data where applicable.
- Splitting the dataset into training and testing subsets.
Modeling:
- Building machine learning models including Logistic Regression, Random Forest, and XGBoost.
- Hyperparameter tuning to optimize performance.
- Evaluating models using metrics such as Accuracy, Precision, Recall, and F1-Score.
Results and Insights:
- Interpreting the feature importance.
- Presenting actionable insights from EDA and model predictions.

Key Results

Accuracy: The chosen model achieved an accuracy of 0.87%.
Key Insights:
- Features chest pain type and maximum heart rate achieved are highly correlated with heart attack risk.
- Patients with exercise-induced angina have a significantly higher risk.

Acknowledgments

Thanks to the dataset contributors for making the data available.

Name		Name	Last commit message	Last commit date
Latest commit History 2 Commits
README.md		README.md
heart-attack-eda-and-modeling.ipynb		heart-attack-eda-and-modeling.ipynb
heart.csv		heart.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Heart Attack Risk Analysis and Prediction

Project Overview

Dataset Description

Features and Target Variable

Methods and Tools

Project Workflow

Key Results

Acknowledgments

About

Languages

AnnaAnastasy/Heart-Attack-Predictions

Folders and files

Latest commit

History

Repository files navigation

Heart Attack Risk Analysis and Prediction

Project Overview

Dataset Description

Features and Target Variable

Methods and Tools

Project Workflow

Key Results

Acknowledgments

About

Topics

Resources

Stars

Watchers

Forks

Languages