Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Performed EDA: Handled Missing Values and Outliers in Stock Price Data #148

Merged
merged 2 commits into from
Oct 22, 2024

Conversation

RB137
Copy link
Contributor

@RB137 RB137 commented Oct 20, 2024

I have handled the data quality issues in the stock price dataset and fixed the problems related to missing values and outliers mentioned in issue #137 :

Fixes: #137

Handling Missing Values:

Used forward fill (ffill) to fill missing values in the Open, High, Low, Close, Adj Close, and Volume columns, ensuring data continuity for time-series analysis.

Outlier Treatment:

Applied the Interquartile Range (IQR) method to detect and cap outliers in the stock price columns, preventing distortion from extreme values and creating a more reliable dataset.
These fixes enhance the dataset's quality, making it ready for further analysis and predictive modeling.

Before:

Screenshot 2024-10-20 015427

After:

image
image

Added documentation to enhance code readability :

image
image

Copy link
Contributor

@github-actions github-actions bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Ensure the PR matches the requirements mentioned in the Contribution guide. The maintainer might get in touch to enusre quality. Thanks for your time

@RB137
Copy link
Contributor Author

RB137 commented Oct 20, 2024

I’ve handled missing values and outliers and added some documentation.
Have a look at it, review it, and kindly accept my PR! @rohitinu6 @Mayureshd-18 @jvedsaqib

Copy link
Collaborator

@Mayureshd-18 Mayureshd-18 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Looks good to me!

@RB137
Copy link
Contributor Author

RB137 commented Oct 20, 2024

@Mayureshd-18 Thanks for the approval! Please add the gssoc-ext, hacktoberfest, hacktoberfest-accepted, and suitable level labels, and kindly merge it to close the issue. @rohitinu6

@Mayureshd-18 Mayureshd-18 added gssoc-ext GSSoC'24 Extended Version hacktoberfest-accepted Hacktoberfest 2024 hacktoberfest Hacktober Collaboration labels Oct 21, 2024
@rohitinu6 rohitinu6 added the level1 10 Points 🥇(GSSoC) label Oct 21, 2024
@RB137
Copy link
Contributor Author

RB137 commented Oct 21, 2024

@rohitinu6 @Mayureshd-18
Thanks for approving. The issue seems fixed, so please merge my PR and close the issue #137.

@RB137
Copy link
Contributor Author

RB137 commented Oct 21, 2024

@rohitinu6 @Mayureshd-18
Also, please label it as level 2, as it matches the difficulty level of data refinement. and outliers handling

@Mayureshd-18
Copy link
Collaborator

@RB137 The assigned level 1 is appropriate based on the root objective which is the refinement of code. Higher levels are reserved for new algos and procedures.
Thanks for understanding! Merging!

@Mayureshd-18 Mayureshd-18 merged commit 52bacc2 into rohitinu6:main Oct 22, 2024
1 check passed
Copy link
Contributor

🎉🎉 Thank you for your contribution! Your PR #148 has been merged! 🎉🎉

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
gssoc-ext GSSoC'24 Extended Version hacktoberfest Hacktober Collaboration hacktoberfest-accepted Hacktoberfest 2024 level1 10 Points 🥇(GSSoC)
Projects
None yet
Development

Successfully merging this pull request may close these issues.

Handling Missing Values and Outliers
3 participants