Playstore Data Analysis

Analysis of Google Playstore data visualizing the distribution of app downloads, reviews and purchases across categories and genres.

Playstore Data Analysis

Back to HOME

Overview

Objectives

The aim is to build dashboards to visualise the distribution of application downloads, purchases and reviews across individual applications, categories and genres.

Deliverables

Demand Value Deliverable
A dashboard that summarises application downloads To understand which apps, categories and genres are the most popular in terms of downloads A dashboard visualising disribution of downloads across different applications, genres and categories as well as the total download numbers
A dashboard with information on money spent on applications and review activities of applications To understand which are the categories where money is spent the most as well as the customer activities in terms of reviews A dashboard visualising disribution of amount spent and total reviews (per application) across genres and categories

Data Preparation

Data was obtained from googleplaystore.csv dataset taken from kaggle (Author: Lavanya, Updated on 03/02/2019). To acess the source CLICK HERE:

The data transformation and cleaning was done in PostgreSQL. Data with null values were removed for columns with few null values. Rating column contained large amount of null values and they were replaced with zeros. There were certain columns such as price and installs, where numerical information was in text format (with some additional string values such as ‘+’, ‘M’ etc.). The string values were removed and the data were converted to double precision type. The data was exported into a new table (playstore_data_final) which was then used in Tableau for visualisation.

The SQL queries can be found HERE.

Data Visualisation

Data visualisation was done in Tableau. The dashboards were created.

App Downloads Dashboard:

Presents the distribution of application downloads across categories and genres. Provides information on the total download activity as well as the total number of applications. DASHBOARD LINK

Dashboard Image

App Performance Dashboard:

Visualises the distribution of free and paid apps as well as the money spent across categories and genres. The relation of review activities with categories and genres is explored as well. DASHBOARD LINK

Dashboard Image

Conclusions

Note: The dataset was updated in 2019 and the trends are relevant to that time period (The project is for educational purpose and not for making decisions for the current market situation).