Skip to content

Commit 67a17f1

Browse files
sweetviz library
1 parent 80bba94 commit 67a17f1

15 files changed

+90326
-1
lines changed

pandas-profiling/README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ Pandas profiling is an open source Python module with which we can quickly do an
77
pip install -r requirements.txt
88

99
## Run
10-
python3 eda_pandas-profiling.py
10+
python3 eda_pandas_profiling.py
1111

1212
## Output
1313

File renamed without changes.
File renamed without changes.

sweetviz/.~lock.house_train.csv#

Lines changed: 1 addition & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1 @@
1+
,pradeep,INFBA31703,08.07.2020 09:52,file:///home/pradeep/.config/libreoffice/4;

sweetviz/README.md

Lines changed: 27 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,27 @@
1+
# Sweetviz
2+
3+
Sweetviz is an open source Python library that generates beautiful, high-density visualizations to kickstart EDA (Exploratory Data Analysis) with a single line of code. Output is a fully self-contained HTML application.
4+
5+
The system is built around quickly visualizing target values and comparing datasets. Its goal is to help quick analysis of target characteristics, training vs testing data, and other such data characterization tasks.
6+
7+
## Dependancies
8+
9+
pip install -r requirements.txt
10+
11+
## Run
12+
13+
python3 run eda_sweetviz.py
14+
15+
## Output
16+
17+
Some of the generated outputs are shown below
18+
19+
### Train data
20+
![Alt text](output/summary.png?raw=true "Train data")
21+
### EDA on train data
22+
![Alt text](output/eda.png?raw=true "EDA on train data")
23+
### Dataset comparison
24+
![Alt text](output/summary_compare.png?raw=true "Dataset comparison")
25+
### EDA on comparison result
26+
![Alt text](output/eda_compare.png?raw=true "EDA on comparison result")
27+

sweetviz/eda_sweetviz.py

Lines changed: 10 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,10 @@
1+
import pandas as pd
2+
import sweetviz
3+
train = pd.read_csv("house_train.csv")
4+
test = pd.read_csv("house_test.csv")
5+
train_report = sweetviz.analyze([train, "Train"],target_feat='SalePrice')
6+
train_report.show_html('Report.html')
7+
8+
#comparing two data frames
9+
compare_report = sweetviz.compare([train, "Train"], [test, "Test"], "SalePrice")
10+
compare_report.show_html('Comparison Report.html')

sweetviz/house_test.csv

Lines changed: 1460 additions & 0 deletions
Large diffs are not rendered by default.

sweetviz/house_train.csv

Lines changed: 1461 additions & 0 deletions
Large diffs are not rendered by default.

sweetviz/output/Comparison Report.html

Lines changed: 48408 additions & 0 deletions
Large diffs are not rendered by default.

sweetviz/output/Report.html

Lines changed: 38956 additions & 0 deletions
Large diffs are not rendered by default.

0 commit comments

Comments
 (0)