Seaborn Introduction

import seaborn as sns
import matplotlib as mpl
import matplotlib.pyplot as plt

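def listAttr(obj, search=None):
    """Return the public attribute names of obj, optionally filtered by a case-insensitive substring."""
    # Names starting with an underscore are treated as private and skipped.
    names = [item for item in dir(obj) if not item.startswith("_")]
    if not search:
        return names

    # Lowercase both sides so that a search for "Grid" also matches "FacetGrid".
    search = search.lower()
    return [item for item in names if search in item.lower()]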

listAttr(sns)

['FacetGrid',
 'JointGrid',
 'PairGrid',
 'algorithms',
 'axes_style',
 'axisgrid',
 'barplot',
 'blend_palette',
 'boxenplot',
 'boxplot',
 'categorical',
 'catplot',
 'choose_colorbrewer_palette',
 'choose_cubehelix_palette',
 'choose_dark_palette',
 'choose_diverging_palette',
 'choose_light_palette',
 'clustermap',
 'cm',
 'color_palette',
 'colors',
 'countplot',
 'crayon_palette',
 'crayons',
 'cubehelix_palette',
 'dark_palette',
 'desaturate',
 'despine',
 'displot',
 'distplot',
 'distributions',
 'diverging_palette',
 'dogplot',
 'ecdfplot',
 'external',
 'get_data_home',
 'get_dataset_names',
 'heatmap',
 'histplot',
 'hls_palette',
 'husl_palette',
 'jointplot',
 'kdeplot',
 'light_palette',
 'lineplot',
 'lmplot',
 'load_dataset',
 'matrix',
 'miscplot',
 'move_legend',
 'mpl',
 'mpl_palette',
 'pairplot',
 'palettes',
 'palplot',
 'plotting_context',
 'pointplot',
 'rcmod',
 'regplot',
 'regression',
 'relational',
 'relplot',
 'reset_defaults',
 'reset_orig',
 'residplot',
 'rugplot',
 'saturate',
 'scatterplot',
 'set',
 'set_color_codes',
 'set_context',
 'set_hls_values',
 'set_palette',
 'set_style',
 'set_theme',
 'stripplot',
 'swarmplot',
 'utils',
 'violinplot',
 'widgets',
 'xkcd_palette',
 'xkcd_rgb']

listAttr(sns, "load_dataset")

['load_dataset']
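
The search helper can also be tried without seaborn installed. The following is a minimal sketch, assuming a case-insensitive substring match is what is wanted; list_attr is a hypothetical snake_case variant of the helper above, and the standard library's math module stands in for sns.

```python
import math


def list_attr(obj, search=None):
    """Public attribute names of obj, optionally filtered by a case-insensitive substring."""
    names = [name for name in dir(obj) if not name.startswith("_")]
    if search is None:
        return names
    needle = search.lower()
    return [name for name in names if needle in name.lower()]


# A mixed-case search term still matches lowercase attribute names.
log_names = list_attr(math, "LOG")
print(log_names)
```

Lowercasing both the search term and each attribute name is the design choice here; lowercasing only the search term would silently miss names such as FacetGrid.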

Load an example dataset from the online repository

sns.get_dataset_names()

['anagrams',
 'anscombe',
 'attention',
 'brain_networks',
 'car_crashes',
 'diamonds',
 'dots',
 'dowjones',
 'exercise',
 'flights',
 'fmri',
 'geyser',
 'glue',
 'healthexp',
 'iris',
 'mpg',
 'penguins',
 'planets',
 'seaice',
 'taxis',
 'tips',
 'titanic']

Load dataset

tips = sns.load_dataset('tips')
tips

sns.set(color_codes=True)

ax = sns.scatterplot(x = 'total_bill', y = 'tip', data = tips)

sns.set_style('ticks') 
ax = sns.barplot(x="total_bill", y="tip", data=tips)

ax = sns.scatterplot(x="total_bill", y="tip", hue="day", data=tips)

ax = sns.scatterplot(x="total_bill", y="tip", hue="day", style="time", data=tips)

To enhance a scatterplot with a linear regression model (and its uncertainty), use lmplot():

sns.lmplot(x="total_bill", y="tip", data=tips)

<seaborn.axisgrid.FacetGrid at 0x18fc29ba5e0>
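
To make the regression line less of a black box, here is a rough sketch of the ordinary least-squares arithmetic behind a straight-line fit. It is not seaborn's implementation: fit_line is a hypothetical helper, the data are only the first five rows of the tips printout, and the uncertainty band that lmplot draws is not reproduced.

```python
def fit_line(xs, ys):
    """Ordinary least-squares slope and intercept for paired samples."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# First five (total_bill, tip) pairs from the tips table. Far too few for a
# meaningful fit; they are used only to show the arithmetic.
total_bill = [16.99, 10.34, 21.01, 23.68, 24.59]
tip = [1.01, 1.66, 3.50, 3.31, 3.61]

slope, intercept = fit_line(total_bill, tip)
predicted = [intercept + slope * x for x in total_bill]
residuals = [y - p for y, p in zip(tip, predicted)]
print(f"tip ~ {intercept:.3f} + {slope:.3f} * total_bill")
```

A least-squares fit with an intercept always leaves residuals that sum to zero, which is a quick sanity check on the computation.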

sns.lmplot(x = "total_bill", y = "tip", data = tips, hue = "time")

<seaborn.axisgrid.FacetGrid at 0x18fc2a75b20>

sns.lmplot(x = "total_bill", y = "tip", data = tips, hue="day")

<seaborn.axisgrid.FacetGrid at 0x18fc2a96820>

Specialized categorical plots

sns.catplot(x="day", y="total_bill", hue="smoker", kind="swarm", data=tips);

tips.query("size != 3")

sns.catplot(x="size", y="total_bill", kind="swarm",
            data=tips.query("size != 3"));

C:\ProgramData\Miniconda3\lib\site-packages\seaborn\categorical.py:3540: UserWarning: 9.6% of the points cannot be placed; you may want to decrease the size of the markers or use stripplot.
  warnings.warn(msg, UserWarning)

sns.catplot(x="day", y="total_bill", hue="smoker", kind="violin", data=tips);

sns.catplot(x="day", y="total_bill", hue="smoker",
            kind="bar", data=tips);

g = sns.catplot(x = "total_bill", y = "day",  hue="time", kind = 'box', legend=False, data = tips)
g.add_legend(title = "Meal")

<seaborn.axisgrid.FacetGrid at 0x18fc1b7fe80>

g = sns.catplot(x = "total_bill", y = "day",  hue="time", kind = 'box', legend=False, data = tips)
g.add_legend(title = "Meal")
g.fig.set_size_inches(10.5, 5.5)
g.set_axis_labels("Total bill ($)", "")

<seaborn.axisgrid.FacetGrid at 0x18fc1d1d1c0>

g = sns.catplot(x="total_bill", y="day", hue="time",
                height=3.5, aspect=1.5,
                kind="boxen", legend=False, data=tips);

g = sns.catplot(x="total_bill", y="day", hue="time",
                height=3.5, aspect=1.5,
                kind="box", legend=False, data=tips);
g.add_legend(title="Meal")
g.set_axis_labels("Total bill ($)", "")
g.set(xlim=(0, 60), yticklabels=["Thursday", "Friday", "Saturday", "Sunday"])
g.despine(trim=True)
g.fig.set_size_inches(6.5, 3.5)
g.ax.set_xticks([5, 15, 25, 35, 45, 55], minor=True);
plt.setp(g.ax.get_yticklabels(), rotation=30);

Histograms

# histplot replaces the deprecated distplot; kde=True and stat="density"
# approximate what distplot drew by default.
sns.histplot(tips['total_bill'], kde=True, stat="density");

To build a histogram, first "bin" (or "bucket") the range of values: divide the entire range into a series of intervals, then count how many values fall into each interval.

Reference: https://www.khanacademy.org/math/statistics-probability/displaying-describing-data/quantitative-data-graphs/a/histograms-review

# histplot replaces the deprecated distplot and draws no KDE curve by default.
sns.histplot(tips['total_bill'], bins=20);
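
The binning step described above can be sketched in plain Python. This is an illustrative version that assumes equal-width bins and values that are not all identical; bin_counts is a hypothetical helper, and seaborn's own bin selection may differ.

```python
def bin_counts(values, n_bins):
    """Split [min, max] into n_bins equal-width intervals and count the values in each."""
    low, high = min(values), max(values)
    width = (high - low) / n_bins
    edges = [low + i * width for i in range(n_bins + 1)]
    counts = [0] * n_bins
    for v in values:
        # The maximum value belongs to the last bin, whose right edge is closed.
        index = min(int((v - low) / width), n_bins - 1)
        counts[index] += 1
    return edges, counts


# The ten total_bill values visible in the tips table printout.
bills = [16.99, 10.34, 21.01, 23.68, 24.59, 29.03, 27.18, 22.67, 17.82, 18.78]
edges, counts = bin_counts(bills, n_bins=4)
for left, right, count in zip(edges, edges[1:], counts):
    print(f"{left:6.2f} to {right:6.2f}: {count}")
```

Raising n_bins gives narrower intervals and a more detailed but noisier picture, which is the trade-off the bins argument controls.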

# kde (kernel density estimation) - plotting the shape of a distribution
sns.histplot(tips['total_bill'], kde=True);
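
For intuition about what a kernel density estimate is, the sketch below averages one Gaussian bump per observation. It is a simplified illustration rather than seaborn's implementation: gaussian_kde here is a hypothetical helper, and the bandwidth is picked by hand, whereas seaborn selects one automatically.

```python
import math


def gaussian_kde(samples, bandwidth):
    """Return a function that estimates the density as an average of Gaussian bumps."""
    n = len(samples)
    norm = 1.0 / (n * bandwidth * math.sqrt(2.0 * math.pi))

    def density(x):
        return norm * sum(math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples)

    return density


# The ten total_bill values visible in the tips table printout.
bills = [16.99, 10.34, 21.01, 23.68, 24.59, 29.03, 27.18, 22.67, 17.82, 18.78]
density = gaussian_kde(bills, bandwidth=3.0)  # bandwidth chosen by hand for illustration

# Riemann-sum check that the estimate integrates to roughly 1.
step = 0.1
grid = [i * step for i in range(-200, 700)]  # covers -20 to just under 70
area = sum(density(x) for x in grid) * step
print(f"area under the curve ~ {area:.3f}")
```

A smaller bandwidth makes the curve follow individual points more closely, while a larger one smooths them into a single broad hump.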

tips.time.unique()

['Dinner', 'Lunch']
Categories (2, object): ['Lunch', 'Dinner']

The following plot shows the relationship between five variables in the tips dataset. Three are numeric and two are categorical. Two numeric variables (total_bill and tip) determine the position of each point on the axes, and the third (size) determines the size of each point. One categorical variable (time) splits the dataset across two different axes (facets), and the other (smoker) determines the color and shape of each point.

sns.relplot(x="total_bill", y="tip", col="time",
            hue="smoker", style="smoker", size="size",
            data=tips)

<seaborn.axisgrid.FacetGrid at 0x18fc3c9e0d0>
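
Faceting amounts to partitioning the rows by a categorical column and drawing one panel per group. The sketch below shows that idea in plain Python under a few assumptions: split_into_facets is a hypothetical helper, only rows visible in the tips printout are used, and day stands in for time because every visible row is a Dinner row.

```python
from collections import defaultdict

# Rows visible in the tips printout: (total_bill, tip, smoker, day).
rows = [
    (16.99, 1.01, "No", "Sun"),
    (10.34, 1.66, "No", "Sun"),
    (21.01, 3.50, "No", "Sun"),
    (29.03, 5.92, "No", "Sat"),
    (27.18, 2.00, "Yes", "Sat"),
    (22.67, 2.00, "Yes", "Sat"),
    (18.78, 3.00, "No", "Thur"),
]


def split_into_facets(rows, column_index):
    """Group rows by the value in one categorical column, one group per facet."""
    facets = defaultdict(list)
    for row in rows:
        facets[row[column_index]].append(row)
    return dict(facets)


facets = split_into_facets(rows, column_index=3)  # facet on day
for day, members in facets.items():
    smokers = sum(1 for r in members if r[2] == "Yes")
    print(f"{day}: {len(members)} points, {smokers} of them smokers")
```

Within each facet, a second categorical column (smoker here) can then drive color and marker shape, which is the role hue and style play in the relplot call.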

sns.relplot(x="total_bill", y="tip", col="time",
            hue="smoker", style="smoker", size="size", kind="line", data=tips)

<seaborn.axisgrid.FacetGrid at 0x18fc2ba33d0>

Output of tips (first and last five rows):

     total_bill   tip     sex smoker   day    time  size
0         16.99  1.01  Female     No   Sun  Dinner     2
1         10.34  1.66    Male     No   Sun  Dinner     3
2         21.01  3.50    Male     No   Sun  Dinner     3
3         23.68  3.31    Male     No   Sun  Dinner     2
4         24.59  3.61  Female     No   Sun  Dinner     4
...         ...   ...     ...    ...   ...     ...   ...
239       29.03  5.92    Male     No   Sat  Dinner     3
240       27.18  2.00  Female    Yes   Sat  Dinner     2
241       22.67  2.00    Male    Yes   Sat  Dinner     2
242       17.82  1.75    Male     No   Sat  Dinner     2
243       18.78  3.00  Female     No  Thur  Dinner     2

Output of tips.query("size != 3") (first and last five rows):

     total_bill   tip     sex smoker   day    time  size
0         16.99  1.01  Female     No   Sun  Dinner     2
3         23.68  3.31    Male     No   Sun  Dinner     2
4         24.59  3.61  Female     No   Sun  Dinner     4
5         25.29  4.71    Male     No   Sun  Dinner     4
6          8.77  2.00    Male     No   Sun  Dinner     2
...         ...   ...     ...    ...   ...     ...   ...
237       32.83  1.17    Male    Yes   Sat  Dinner     2
240       27.18  2.00  Female    Yes   Sat  Dinner     2
241       22.67  2.00    Male    Yes   Sat  Dinner     2
242       17.82  1.75    Male     No   Sat  Dinner     2
243       18.78  3.00  Female     No  Thur  Dinner     2
