forked from abhisheks008/ML-Crate
-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
1 parent
1dfedb7
commit 7b7468a
Showing
17 changed files
with
12,531 additions
and
0 deletions.
There are no files selected for viewing
6,723 changes: 6,723 additions & 0 deletions
6,723
...nalysis and Prediction/Simpsons Episodes Dataset Analysis/Dataset/simpsons_characters.csv
Large diffs are not rendered by default.
Oops, something went wrong.
601 changes: 601 additions & 0 deletions
601
... Analysis and Prediction/Simpsons Episodes Dataset Analysis/Dataset/simpsons_episodes.csv
Large diffs are not rendered by default.
Oops, something went wrong.
4,460 changes: 4,460 additions & 0 deletions
4,460
...Analysis and Prediction/Simpsons Episodes Dataset Analysis/Dataset/simpsons_locations.csv
Large diffs are not rendered by default.
Oops, something went wrong.
Binary file added
BIN
+8.93 MB
...and Prediction/Simpsons Episodes Dataset Analysis/Dataset/simpsons_script_lines(zip_file)
Binary file not shown.
Binary file added
BIN
+41.1 KB
...n/Simpsons Episodes Dataset Analysis/Images/Average IMDb ratings per season.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+42 KB
...psons Episodes Dataset Analysis/Images/Bottom Episodes based on IMDb rating.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+2.02 MB
...diction/Simpsons Episodes Dataset Analysis/Images/Character Lines pie Chart.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+20.7 KB
...ediction/Simpsons Episodes Dataset Analysis/Images/Episode count per season.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+20.7 KB
...nd Prediction/Simpsons Episodes Dataset Analysis/Images/Gender Distribution.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+23.4 KB
...and Prediction/Simpsons Episodes Dataset Analysis/Images/Seasonal Viewrship.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+92.3 KB
...ction/Simpsons Episodes Dataset Analysis/Images/Sentiment Analyis over time.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+44.9 KB
...Simpsons Episodes Dataset Analysis/Images/Top Episodes based on IMDb rating.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+39 KB
...ion/Simpsons Episodes Dataset Analysis/Images/Top Locations across episodes.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+39 KB
...rediction/Simpsons Episodes Dataset Analysis/Images/Top Speaking Characters.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
Binary file added
BIN
+279 KB
...iction/Simpsons Episodes Dataset Analysis/Images/Word Cloud of spoken words.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
666 changes: 666 additions & 0 deletions
666
... Analysis and Prediction/Simpsons Episodes Dataset Analysis/Model/Simpsons_Analysis.ipynb
Large diffs are not rendered by default.
Oops, something went wrong.
81 changes: 81 additions & 0 deletions
81
...ality Rate Analysis and Prediction/Simpsons Episodes Dataset Analysis/Readme.md
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,81 @@ | ||
# Data Analysis Report: The Simpsons Dataset | ||
|
||
## Introduction | ||
This report presents a comprehensive analysis of The Simpsons dataset, encompassing characters, episodes, locations, and script lines. The analysis explores various aspects such as character interactions, episode ratings, viewership trends, sentiment analysis, and additional advanced analyses. | ||
|
||
## Data Loading and Cleaning | ||
The datasets were loaded into Pandas DataFrames, and basic cleaning steps were performed to handle encoding issues. The following datasets were used: | ||
- simpsons_characters | ||
- simpsons_episodes | ||
- simpons_locations | ||
- simpons_script_lines | ||
|
||
DataSets: https://www.kaggle.com/datasets/thedevastator/the-simpsons-episodes-dataset | ||
|
||
## Character Analysis | ||
|
||
### Top Speaking Characters | ||
Identified and visualized the top characters based on the number of lines spoken. | ||
|
||
![Top Speaking Characters](https://github.com/Bayyana-kiran/sdf/assets/99533113/60bd9db2-8bc4-4ffe-b333-75fcd6d8c2f0) | ||
|
||
|
||
### Gender Distribution of Speaking Characters | ||
Explored the distribution of lines spoken by male and female characters. | ||
|
||
![Gender Distribution](https://github.com/Bayyana-kiran/sdf/assets/99533113/3f43f0b7-53c3-4a11-ab76-69044cbfab55) | ||
|
||
|
||
### Character Lines Pie Chart | ||
Visualized the percentage of lines spoken by each character in a pie chart. | ||
|
||
![Character Lines pie Chart](https://github.com/Bayyana-kiran/sdf/assets/99533113/6fa8afb2-0529-40c1-ab50-d177ee9fab87) | ||
|
||
|
||
## Episode Analysis | ||
|
||
### Season-wise Episode Count | ||
Visualized the number of episodes in each season. | ||
|
||
![Episode count per season](https://github.com/Bayyana-kiran/sdf/assets/99533113/632a943e-b57b-4abc-b0ea-1885c3693630) | ||
|
||
|
||
|
||
|
||
### Seasonal Viewership Bar Chart | ||
Visualized the average viewership per season using a bar chart. | ||
|
||
![Seasonal Viewrship](https://github.com/Bayyana-kiran/sdf/assets/99533113/7368c975-fa54-49a4-9ee5-8572ec98726c) | ||
|
||
## Location Analysis | ||
|
||
### Top Locations Across Episodes | ||
Identified and visualized the most popular locations based on the number of lines spoken. | ||
|
||
![Top Locations across episodes](https://github.com/Bayyana-kiran/sdf/assets/99533113/d6fb1949-e18d-4fae-b95e-0017bc36faef) | ||
|
||
|
||
|
||
## Text Analysis | ||
|
||
### Word Cloud of Spoken Words | ||
Created a word cloud to visualize the most frequently used words in the spoken lines. | ||
|
||
![Word Cloud of spoken words](https://github.com/Bayyana-kiran/sdf/assets/99533113/90fbb2aa-7ca3-4a0c-8e14-592581db492a) | ||
|
||
|
||
### Sentiment Analysis Over Time | ||
Analyzed the sentiment of spoken words over time. | ||
|
||
![Sentiment Analyis over time](https://github.com/Bayyana-kiran/sdf/assets/99533113/c4390f94-6e7a-4871-a230-c477f6ac9e6c) | ||
|
||
|
||
## Conclusion | ||
This comprehensive analysis provides insights into character dynamics, episode trends, and sentiment patterns throughout The Simpsons dataset. Further explorations and analyses can be conducted based on specific areas of interest. | ||
|
||
For detailed code and additional analyses, refer to the Python script used for this report. | ||
|
||
--- | ||
Author: Sai Kiran B L S | ||
Github: [Sai Kiran B L S](https://github.com/Bayyana-kiran) | ||
|