Skip to content

WIP Final project for the IBM Data Analyst Professional Certificate.

Notifications You must be signed in to change notification settings

cnoret/ibm-data-analyst-capstone-project

Repository files navigation

If my project has helped you, don't forget to click on the “ ⭐Star” button !

IBM Data Analyst Capstone Project (2024)

Final project for the IBM Data Analyst Professional Certificate.

Project Overview

This Capstone project aims to provide a practical application of the skills acquired throughout the IBM Data Analyst Professional Certificate program. By taking on the role of a Data Analyst at a global IT and Business Services firm, let's analyze various datasets to identify trends in emerging technologies.

Project Phases

  • Data Collection (Module 1): Gather data on in-demand technology skills from sources like job postings, blog posts, and surveys.

  • Data Wrangling (Module 2): Prepare the collected data for analysis by cleaning and organizing it. This includes tasks such as handling missing values, duplicates, and inconsistencies.

  • Data Analysis (Module 3): Apply statistical techniques to analyze the data and extract meaningful insights. This involves identifying top programming languages, database skills, IDEs, and demographic trends among developers.

  • Data Visualization (Module 4): Choose appropriate visualizations (charts, plots, histograms) to effectively communicate the findings and trends.

  • Interactive Dashboard Creation (Module 5): Utilize Cognos to build interactive dashboards that allow for dynamic analysis and presentation of the data.

  • Storytelling and Presentation (Module 6): Develop a compelling narrative to present the findings of your analysis. Use a provided presentation template to structure your story and deliver a persuasive presentation.

Skills summary

Module 1: Data Collection

  • Collecting Data Using APIs
  • Collecting Data Using Webscraping
  • Exploring Data

Module 2: Data Wrangling

  • Finding Missing Values
  • Determine Missing Values
  • Finding Duplicates
  • Removing Duplicates
  • Normalizing Data

Module 3: Exploratory Data Analysis

  • Distribution
  • Outliers
  • Correlation

Module 4: Data Visualization

  • Visualizing Distribution of Data
  • Relationship
  • Composition
  • Comparison

Module 5: Dashboard

  • Creation
  • Dashboards

Module 6: Presentation

  • Presentation of Findings
  • Final Presentation