Finding outliers in your data is easier than you think. You can use statistics concepts to identify the threshold where outliers may fall. One of the most accurate ways is to use the Z-Score. This score is composed of the mean, standard deviation, and current value. You see from the image below how this is […]
Let’s talk about how to do some bi-variate analysis using the Crosstab function from pandas. The Crosstab function is very flexible and allows us to compare two categories. As usual, we’ll work on this practically using a sample dataset. We’re going to analyze a BankChurn dataset to see when our customers left and the different categories […]
Grouping data is one of the most important skills that you would require as a data analyst. Luckily, Pandas has a great function called GroupBy which is extremely flexible and allows you to answer many questions with just one line of code. In this tutorial, we’re going to understand the GroupBy function and subsequently answer […]