Tips and tricks can be very useful, especially in the programming world. Sometimes a little hack can be both a time-saver and a lifesaver. A minor shortcut or add-on can prove to be a godsend and a real productivity booster. So, here are some of my favorite tips and tricks that I've used and compiled in this article. Some may be fairly well known and some may be new, but I'm sure they'll come in handy the next time you work on a data analysis project.
Profiling the ‘pandas’ dataframe
Profiling is a process that helps us understand our data, and Pandas Profiling is a Python package that does exactly that. It's a simple and fast way to perform exploratory data analysis of a pandas DataFrame. The pandas df.info() function is normally used as a first step in the EDA process. However, it only gives a very basic overview of the data and doesn't help much with large data sets. The Pandas Profiling package, on the other hand, extends the pandas DataFrame with df.profile_report() for quick data analysis. With a single line of code, it displays a lot of information in an interactive HTML report.
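To make the contrast concrete, here is a minimal sketch that first captures the basic df.info() overview and then builds a profiling report if a profiling package is available. The tiny dataset, the build_profile helper, and the output file name are made up for illustration. The sketch also assumes a reasonably recent release of the package, which is now published as ydata-profiling, so it tries that name first and falls back to the older pandas_profiling import.

```python
import io

import pandas as pd

# Small made-up dataset, purely for illustration.
df = pd.DataFrame(
    {
        "age": [22, 38, 26, 35, None],
        "fare": [7.25, 71.28, 7.92, 53.10, 8.05],
        "embarked": ["S", "C", "S", "S", "Q"],
    }
)

# Basic overview: column dtypes and non-null counts only.
buffer = io.StringIO()
df.info(buf=buffer)
info_text = buffer.getvalue()


def build_profile(frame):
    """Hypothetical helper: return a profiling report object,
    or None if no profiling package is installed."""
    try:
        from ydata_profiling import ProfileReport  # current package name
    except ImportError:
        try:
            from pandas_profiling import ProfileReport  # older package name
        except ImportError:
            return None
    # minimal=True skips the more expensive computations.
    return ProfileReport(frame, title="Profiling report", minimal=True)


if __name__ == "__main__":
    print(info_text)
    report = build_profile(df)
    if report is None:
        print("No profiling package found; only df.info() is available.")
    else:
        # Writes the interactive HTML report to an illustrative file name.
        report.to_file("profile_report.html")
```

Importing the profiling package also registers the df.profile_report() accessor mentioned above, so in a notebook df.profile_report() on its own line is usually enough to display the report inline.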