Image Source:link
Please also consider reading this blog article for a non-technical overview of this project.
Users download apps for various usage purposes. Given that paid service is usually better at offering pleasant experience, and that free apps are more accesible to everyone, what are the user opinions towards these apps?
More specifically, the following questions are of interest:
- How do the app ratings differ between paid and free apps in general?
- How are the differences distributed across different app categories?
- Are there any categories where the differences are statistically significant?
To expore answers to the above questions, I narrawed the context to Google Play Store and conducted data analysis on the Kaggle dataset Google Play Store.
numpy
pandas
scipy
Download the dataset at Kaggle: Google Play Store Apps.
Extract the data file googleplaystore.csv
and put it in the project root.
DataAnalysis4GooglePlayStore
|---- AnalyzeGoogleStoreApp.ipynb (The code file for data analysis)
|---- AnalyzeGoogleStoreApp.html (Result of code execution)
|---- googleplaystore.csv (data file)
|---- README.md (readme file)
- In general, Paid apps are more highly-rated than free apps, which appears to support the argument that experience of the paid apps is better.
- In most categories, Paid apps achieve higher ratings than free apps, however, in a few categories such as COMMUNICATION, FINANCE and PHOTOGRAPHY, the average ratings of free apps are higher than those of paid apps. Is this because many popular apps in these categories are free, like facebook and whatsapp in the COMMUNICATION category?
- There are four categories (PERSONALIZATION, TOOLS, FAMILY and GAME) where paid apps are rated significantly higher than free apps.
I would like to thank Google Play Store and Lavanya Gupta for offering the wonderful dataset.