Employee-Attendance-Data-Analysis

Project Overview

This project involves cleaning and analyzing employee attendance data using Excel and Python. The goal is to standardize data formats, perform ad hoc analysis, and generate insights to understand employee attendance patterns.

Task 1: Data Cleaning (Excel)

  1. Date Standardization: Converted date values to the format YYYY-MM-DD and extracted month names and day types.

  2. Employee ID Cleanup: Removed extra characters from employee IDs, such as '@'.

  3. Name Standardization: Converted employee names to title case.

  4. Status Mapping: Mapped status values to abbreviations:

  • Work From Office (WFO)

  • Work From Home (WFH)

  • Birthday Leave (BL)

  • Menstrual Leave (ML)

  • Paid Leave (PL)

  • Sick Leave (SL)

  • Weekly Off (WO)

  5. Duplicate Removal: Identified and removed duplicate records.
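The cleaning in this project was done in Excel. For readers who prefer a scripted version, the five steps above could be sketched in plain Python roughly as follows. This is an illustration, not the workbook logic: the field names (`date`, `emp_id`, `name`, `status`), the sample rows, the accepted input date formats, and the reading of "day type" as Weekday/Weekend are all assumptions.

```python
from datetime import datetime

# Hypothetical raw rows; field names and values are assumptions for illustration.
RAW_ROWS = [
    {"date": "01/04/2024", "emp_id": "@E001", "name": "asha rao", "status": "Work From Home"},
    {"date": "2024-04-01", "emp_id": "E001@", "name": "ASHA  RAO", "status": "work from home"},
    {"date": "02/04/2024", "emp_id": "E002", "name": "ravi KUMAR", "status": "Sick Leave"},
]

# Status mapping taken from the list above (keys lower-cased for matching).
STATUS_CODES = {
    "work from office": "WFO",
    "work from home": "WFH",
    "birthday leave": "BL",
    "menstrual leave": "ML",
    "paid leave": "PL",
    "sick leave": "SL",
    "weekly off": "WO",
}

# Assumed input formats; slash-separated dates are treated as day-first.
DATE_FORMATS = ("%Y-%m-%d", "%d/%m/%Y")


def parse_date(text):
    """Try each assumed format in turn and return a date object."""
    for fmt in DATE_FORMATS:
        try:
            return datetime.strptime(text.strip(), fmt).date()
        except ValueError:
            continue
    raise ValueError(f"unrecognized date: {text!r}")


def clean_row(row):
    """Apply steps 1-4 to a single raw row."""
    day = parse_date(row["date"])
    return {
        "date": day.isoformat(),                                  # YYYY-MM-DD
        "month": day.strftime("%B"),                              # month name
        "day_type": "Weekend" if day.weekday() >= 5 else "Weekday",
        "emp_id": "".join(ch for ch in row["emp_id"] if ch.isalnum()),
        "name": " ".join(row["name"].split()).title(),            # title case
        "status": STATUS_CODES[row["status"].strip().lower()],
    }


def clean(rows):
    """Clean every row, then drop exact duplicates (step 5), keeping first seen."""
    seen = set()
    cleaned = []
    for row in rows:
        record = clean_row(row)
        key = tuple(record.values())
        if key not in seen:
            seen.add(key)
            cleaned.append(record)
    return cleaned


CLEANED = clean(RAW_ROWS)
for record in CLEANED:
    print(record)
```

Duplicates are removed after standardization on purpose: the first two sample rows only become identical once the date, ID, name, and status have been normalized.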

Task 2: Ad Hoc Analysis (Python)

  1. Distinct Employee Count: Calculated the total count of distinct employee names.

  2. WFH Percentage in May: Computed the work-from-home (WFH) percentage for May.

  3. Attendance Analysis for June: Identified the day of the week with the highest attendance percentage in June.

  4. Employees Above 10% WFH in April: Counted the employees whose WFH percentage exceeded 10% in April.
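The four analyses above can be sketched in plain Python as shown below. This is a minimal sketch under stated assumptions rather than the project's actual script: the sample records are invented, WFH percentage is taken to mean WFH days divided by present days (WFO + WFH), attendance percentage is taken to mean present days divided by all non-Weekly-Off records, and months are matched by number on the assumption that the data covers a single year.

```python
from collections import Counter, defaultdict
from datetime import date

# Hypothetical cleaned records: (date, employee name, status code).
RECORDS = [
    ("2024-04-01", "Asha Rao", "WFH"),
    ("2024-04-02", "Asha Rao", "WFO"),
    ("2024-04-01", "Ravi Kumar", "WFO"),
    ("2024-04-02", "Ravi Kumar", "WFO"),
    ("2024-05-06", "Asha Rao", "WFH"),
    ("2024-05-06", "Ravi Kumar", "WFO"),
    ("2024-05-07", "Asha Rao", "WFO"),
    ("2024-05-07", "Ravi Kumar", "SL"),
    ("2024-06-03", "Asha Rao", "WFO"),
    ("2024-06-03", "Ravi Kumar", "PL"),
    ("2024-06-04", "Asha Rao", "WFO"),
    ("2024-06-04", "Ravi Kumar", "WFH"),
]

PRESENT = {"WFO", "WFH"}  # assumption: both statuses count as attendance


def distinct_employees(records):
    """Analysis 1: number of distinct employee names."""
    return len({name for _, name, _ in records})


def wfh_percentage(records, month):
    """Analysis 2: WFH days as a percentage of present days in the given month."""
    statuses = [s for d, _, s in records if date.fromisoformat(d).month == month]
    present = sum(s in PRESENT for s in statuses)
    return 100.0 * statuses.count("WFH") / present if present else 0.0


def best_attendance_weekday(records, month):
    """Analysis 3: weekday name with the highest attendance percentage."""
    present, total = Counter(), Counter()
    for d, _, status in records:
        day = date.fromisoformat(d)
        if day.month != month or status == "WO":
            continue  # weekly offs are excluded from the denominator (assumption)
        weekday = day.strftime("%A")
        total[weekday] += 1
        present[weekday] += status in PRESENT
    if not total:
        return None
    return max(total, key=lambda name: present[name] / total[name])


def employees_above_wfh(records, month, threshold=10.0):
    """Analysis 4: count of employees whose WFH percentage exceeds the threshold."""
    by_employee = defaultdict(list)
    for record in records:
        if date.fromisoformat(record[0]).month == month:
            by_employee[record[1]].append(record)
    return sum(
        1 for rows in by_employee.values() if wfh_percentage(rows, month) > threshold
    )


print("Distinct employees:", distinct_employees(RECORDS))
print("WFH % in May:", round(wfh_percentage(RECORDS, 5), 2))
print("Best attendance weekday in June:", best_attendance_weekday(RECORDS, 6))
print("Employees above 10% WFH in April:", employees_above_wfh(RECORDS, 4))
```

If the intended denominators differ (for example, WFH days over all working days including leave), only `wfh_percentage` and the `PRESENT` set need to change.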

Tools and Technologies Used

  • Excel: Data cleaning and standardization.

  • Python: Ad hoc analysis and computations.

How to Use

  1. Data Preparation:
  • Load the dataset into Excel for cleaning and standardization.
  2. Ad Hoc Analysis:
  • Use Python scripts to perform the specified analyses and generate insights.
  3. Review Results:
  • Interpret the results to understand attendance patterns and make data-driven decisions.

Conclusion

This project demonstrates effective data cleaning and analysis techniques to provide actionable insights into employee attendance patterns.