Skip to content

Latest commit

 

History

History
17 lines (12 loc) · 1.43 KB

README.md

File metadata and controls

17 lines (12 loc) · 1.43 KB

Data Engineering Foundations

Data Engineering Foundations Data science can be generally defined as the process of making data useful, and data engineering is a key part of how and why. If you think of data science like a race car, the data engineers are the pit crew. They’re not driving the car, but they make the car much easier to drive. Data engineers make sure the data flow is running smoothly, monitor systems, anticipate problems, and repair the data pipeline whenever problems arise. They extract and gather data from multiple sources and load it into a single, easy-to-query database. In short, data engineers make data scientists’ lives easier.

In this course, Harshit Tyagi explains the fundamentals of data engineering. He covers key topics like data wrangling, database schema, and developing ETL pipelines. He also details several data engineering tools like Hive, Hadoop, Spark, and Airflow. By the end of this course, it should be abundantly clear why the data engineer is one of the most valuable people in a data-driven organization.

Author

Harshit Tyagi

Learning Plataform

LinkedIn Learning

What I have done?

  • Importing Data from CSV to a database Schema
  • ETL from spark to database