This is a follow-up article on using data analysis techniques to explore some interesting insights into Indian Premier League(IPL) matches played from 2008 – 2020. If you are not familiar with Data analysis, I request you to go through my introductory article: Indian Premier League(IPL) Data Analysis from 2008-2020
As a use-case for this article, let us explore which IPL team played the maximum number of matches
Which IPL team played the maximum number of matches?
A most successful team, would naturally be playing the maximum number of matches considering they would be reaching the semi-finals and probably finals on a consistent basis. A less successful team, would naturally be dropping out at the league stage itself due to inconsistent performance due to which they would be lagging in the bottom half of the points table.
Step 1: If we can get the count of matches played by each individual team, we will know which team played the maximum of matches. Notice the column team1
and team2
indicate the teams which has played each game. The sum of all these games gives the total games played by each individual team.
Notice that Mumbai Indians is by the far the most successful team in IPL, having played 203 matches between 2008 and 2020.
Step 2: Let us visualize this data through a bar graph. Let us convert the above series back into dataframe as follows:
Step 3: Let us sort the dataframe from team which played the maximum number of matches to team which played the least number of matches
Step 4: The final step is the data visualization to visualize the data using matplatlib
and seaborn
library
I hope you find this article helpful. Please refer to the project in GitHub https://github.com/kirancshet/IPL_Data_Analysis which includes the Jupyter notebook covering the entire code.
1,210 total views, 2 views today