Back to articles
I Built a Python Package That Automates EDA in One Line

I Built a Python Package That Automates EDA in One Line

via Dev.to PythonChinthaparthi Sridhar

After writing the same pandas code for every new dataset, I decided to automate it and published it on PyPI. The Problem Every data scientist knows this pain. You get a new dataset and start typing: df . head () df . tail () df . info () df . describe () df . isnull (). sum () df . duplicated (). sum () # ... 50 more lines Same code. Every. Single. Time. So I built smarteda — a Python package that runs your entire EDA automatically. Installation pip install smarteda Quick Start import pandas as pd import smarteda df = pd . read_csv ( " your_data.csv " ) # Run everything at once smarteda . analyze ( df ) # Or pick what you need smarteda . basic_eda ( df ) # head, tail, info, describe, shape smarteda . overview ( df ) # shape, memory, dtypes, constant columns smarteda . missing ( df ) # missing values + fill suggestions smarteda . outliers ( df ) # IQR + Z-score + Isolation Forest smarteda . correlations ( df ) # multicollinearity warnings + heatmap smarteda . suggestions ( df ) # smart

Continue reading on Dev.to Python

Opens in a new tab

Read Full Article
6 views

Related Articles