Data Engineering Foundations

Data science can be broadly defined as the process of making data useful, and data engineering is a key part of how that happens. If you think of data science as a race car, the data engineers are the pit crew. They’re not driving the car, but they make the car much easier to drive. Data engineers make sure the data flow is running smoothly, monitor systems, anticipate problems, and repair the data pipeline whenever problems arise. They extract and gather data from multiple sources and load it into a single, easy-to-query database. In short, data engineers make data scientists’ lives easier.

In this course, Harshit Tyagi explains the fundamentals of data engineering. He covers key topics like data wrangling, database schema, and developing ETL pipelines. He also details several data engineering tools like Hive, Hadoop, Spark, and Airflow. By the end of this course, it should be abundantly clear why the data engineer is one of the most valuable people in a data-driven organization.

Author

Harshit Tyagi

Learning Platform

LinkedIn Learning

What I have done

Importing data from CSV into a database
ETL from Spark to a database
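
The notebook's own code is not reproduced here, so the following is only a minimal sketch of what a CSV-to-database import can look like. It assumes an in-memory SQLite database as a stand-in for whatever database the course uses. The `people` table, its `name` and `age` columns, the sample rows, and the `load_csv_into_db` helper are all hypothetical.

```python
import csv
import io
import sqlite3


def load_csv_into_db(csv_file, conn, table="people"):
    """Read rows from a CSV file object and insert them into a table.

    The table name and the name/age columns are illustrative assumptions.
    """
    reader = csv.DictReader(csv_file)
    # Light transform step: trim whitespace and cast age to an integer.
    rows = [(r["name"].strip(), int(r["age"])) for r in reader]
    conn.execute(f"CREATE TABLE IF NOT EXISTS {table} (name TEXT, age INTEGER)")
    # Parameterized inserts keep the values out of the SQL string.
    conn.executemany(f"INSERT INTO {table} (name, age) VALUES (?, ?)", rows)
    conn.commit()
    return len(rows)


# Hypothetical sample data standing in for a CSV file on disk.
sample_csv = io.StringIO("name,age\nAda, 36\n Linus,28\n")
conn = sqlite3.connect(":memory:")
inserted = load_csv_into_db(sample_csv, conn)
result = conn.execute("SELECT name, age FROM people ORDER BY age").fetchall()
print(inserted, result)
```

To read from a real file, an opened file handle such as `open("data.csv", newline="")` could be passed in place of `sample_csv`. An ETL job that starts from Spark would follow the same extract, transform, load shape, but with Spark's own readers and writers instead of the standard library.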

