Ask Ubuntu Logs analysis with PySpark on GCP | Pipeline with Airflow (Cloud Composer)
-
Updated
Jan 22, 2021 - HTML
Ask Ubuntu Logs analysis with PySpark on GCP | Pipeline with Airflow (Cloud Composer)
This project focuses on analyzing the questions on askubuntu.com to find the most common topics asked about in order to better understand what areas of Ubuntu may need more attention for bug fixing and also what features might be good to add in future releases of Ubuntu. To do this, I analyzed public data from askubuntu.com using Azure HDInsight…
distraction free stackexchange questions
Ask Ubuntu Logs analysis with Hadoop, MapReduce 2(Yarn)
Repository for code for Stack Exchange sites.
Add a description, image, and links to the askubuntu topic page so that developers can more easily learn about it.
To associate your repository with the askubuntu topic, visit your repo's landing page and select "manage topics."