Find and Highlight Duplicates in Excel

You can quickly find duplicates in Excel by employing a few different strategies. Often users want to see if there is a duplicate email in a list. Also, retailers may want to see if there have been duplicate transactions over a given period of time. There are tons of scenarios where you may want to […]

VLOOKUP vs INDEX Match

VLOOKUP is probably one of the top 5 functions that asked for Excel professionals to demonstrate for analysis or while interviewing. I can personally attest that early in my career I was asked whether I could do VLOOKUP hundreds of times. I often wonder why VLOOKUP became the go-to function for looking up data when […]

What If Parameter in Power BI

Using What If parameter in Power Bi can easily give you the ability to dynamically transform your data. For example, using this parameter will allow to demonstrates how your data change under various scenarios. For example how much revenue would you have if your products were at 10%, 20% or 30%  of the retail price. […]

Python for Data Science –Free Learning Path

It took me a long time to learn Python. The reasons are aplenty on why it was such a long journey. However, hands down, I can tell you the main reasons are due to the vast amount of flexibility with Python. I tried to learn too much at one time instead of focusing on key […]

Data Visualization –NBA Highest Points Per Game

NBA All Time Point Leaders This Plotly chart was created using a dataset of NBA players stats from basketball-reference.com. It contains player points, rebounds, assists, starts and etc. I create this visualization by building it in Plotly, based on an initial Python plot created using Matplotlib. I wanted to explore using Plotly cause I thought […]

Seaborn Histogram

You can easily create and style a histogram in Seaborn with just a few steps. Let’s get started. You will need a few dependencies to ensure that the plot is shown.  The dependencies that you essentially need to load are Matplotlib and Seaborn. However, let’s load the standards such as Pandas and Numpy also in case […]

Waffle Chart in Tableau

Definition: Waffle chart is a 10 X 10 cell grid in which each cell represents 1 percentage point summing up to total 100%. Waffle charts can be represented with conditional formatting where cells are  highlighted with different colors based on the percentage value of that KPI. There are following used cases of Waffles chart: To […]

Create a Python Heatmap with Seaborn

You can easily create a heatmap using the Seaborn library in Python.  For this tutorial, I’m going to create this using Jupyter Notebooks. The first step is to load the dependencies which are the essential library. You can also Learn Python Data Insights on YouTube import pandas as pd import numpy as np import seaborn […]

Google Store Most Installed Apps

The  Google Apps store has a plethora of different apps ranging across a ton of different categories. Here is a visualization created in Tableau that give you an idea of what are the most popular apps on the platform.       The dataset was sourced a Kaggle.com in the form a CSV. I did some light […]

Create Pivot Tables with Pandas

One of the key actions for any data analyst is to be able to pivot data tables. Luckily Pandas has an excellent function that will allow you to pivot. To create this spreadsheet style pivot table, you will need two dependencies with is Numpy and Pandas. However, in newer iterations, you don’t need Numpy. Let’s […]

1 10 11 12 13 14 17