Data Handling

Input Text

text = """

Once, there were two friends who were crossing the jungle.
After some time, they saw a bear coming towards them.
Then, one of the friends quickly climbed the nearby tree, and the other one did not know how to climb the tree.
So he lays down on the ground, holding his breath.
The bear reaches towards him and sniffs him in the ear.
After some time, the bear left the place, thinking the man was dead.
Now the other friend climbs down and asks his friend, What did bear say to him in his ear?
He replied, "
To be safe from the fake friends."

"""

text

'\n\nOnce, there were two friends who were crossing the jungle.\nAfter some time, they saw a bear coming towards them.\nThen, one of the friends quickly climbed the nearby tree, and the other one did not know how to climb the tree.\nSo he lays down on the ground, holding his breath.\nThe bear reaches towards him and sniffs him in the ear.\nAfter some time, the bear left the place, thinking the man was dead.\nNow the other friend climbs down and asks his friend, What did bear say to him in his ear?\nHe replied, "\nTo be safe from the fake friends."\n\n'

print(text)


Once, there were two friends who were crossing the jungle.
After some time, they saw a bear coming towards them.
Then, one of the friends quickly climbed the nearby tree, and the other one did not know how to climb the tree.
So he lays down on the ground, holding his breath.
The bear reaches towards him and sniffs him in the ear.
After some time, the bear left the place, thinking the man was dead.
Now the other friend climbs down and asks his friend, What did bear say to him in his ear?
He replied, "
To be safe from the fake friends."

mod_text = text.replace(",", "").replace(".", "").replace("?", "").replace("\"", "")
print(mod_text)


Once there were two friends who were crossing the jungle
After some time they saw a bear coming towards them
Then one of the friends quickly climbed the nearby tree and the other one did not know how to climb the tree
So he lays down on the ground holding his breath
The bear reaches towards him and sniffs him in the ear
After some time the bear left the place thinking the man was dead
Now the other friend climbs down and asks his friend What did bear say to him in his ear
He replied 
To be safe from the fake friends

mod_text = text.replace(",", "").replace(".", "").replace("?", "").replace('"', "")
print(mod_text)


Once there were two friends who were crossing the jungle
After some time they saw a bear coming towards them
Then one of the friends quickly climbed the nearby tree and the other one did not know how to climb the tree
So he lays down on the ground holding his breath
The bear reaches towards him and sniffs him in the ear
After some time the bear left the place thinking the man was dead
Now the other friend climbs down and asks his friend What did bear say to him in his ear
He replied 
To be safe from the fake friends

mod_text

'\n\nOnce there were two friends who were crossing the jungle\nAfter some time they saw a bear coming towards them\nThen one of the friends quickly climbed the nearby tree and the other one did not know how to climb the tree\nSo he lays down on the ground holding his breath\nThe bear reaches towards him and sniffs him in the ear\nAfter some time the bear left the place thinking the man was dead\nNow the other friend climbs down and asks his friend What did bear say to him in his ear\nHe replied \nTo be safe from the fake friends\n\n'

mod_text = text.replace(",", "").replace(".", "").replace("?", "").replace('"', ""
            ).replace("\n", " ")
print(mod_text)

  Once there were two friends who were crossing the jungle After some time they saw a bear coming towards them Then one of the friends quickly climbed the nearby tree and the other one did not know how to climb the tree So he lays down on the ground holding his breath The bear reaches towards him and sniffs him in the ear After some time the bear left the place thinking the man was dead Now the other friend climbs down and asks his friend What did bear say to him in his ear He replied  To be safe from the fake friends

mod_text

'  Once there were two friends who were crossing the jungle After some time they saw a bear coming towards them Then one of the friends quickly climbed the nearby tree and the other one did not know how to climb the tree So he lays down on the ground holding his breath The bear reaches towards him and sniffs him in the ear After some time the bear left the place thinking the man was dead Now the other friend climbs down and asks his friend What did bear say to him in his ear He replied  To be safe from the fake friends  '

type(mod_text)

str

words = mod_text.split(" ")

print(words)

['', '', 'Once', 'there', 'were', 'two', 'friends', 'who', 'were', 'crossing', 'the', 'jungle', 'After', 'some', 'time', 'they', 'saw', 'a', 'bear', 'coming', 'towards', 'them', 'Then', 'one', 'of', 'the', 'friends', 'quickly', 'climbed', 'the', 'nearby', 'tree', 'and', 'the', 'other', 'one', 'did', 'not', 'know', 'how', 'to', 'climb', 'the', 'tree', 'So', 'he', 'lays', 'down', 'on', 'the', 'ground', 'holding', 'his', 'breath', 'The', 'bear', 'reaches', 'towards', 'him', 'and', 'sniffs', 'him', 'in', 'the', 'ear', 'After', 'some', 'time', 'the', 'bear', 'left', 'the', 'place', 'thinking', 'the', 'man', 'was', 'dead', 'Now', 'the', 'other', 'friend', 'climbs', 'down', 'and', 'asks', 'his', 'friend', 'What', 'did', 'bear', 'say', 'to', 'him', 'in', 'his', 'ear', 'He', 'replied', '', 'To', 'be', 'safe', 'from', 'the', 'fake', 'friends', '', '']

[]

[]

[ word  for word in words  if len(word) == 0  ]

['', '', '', '', '']

mod_words = [ word  for word in words  if len(word) != 0  ]

print(mod_words)

['Once', 'there', 'were', 'two', 'friends', 'who', 'were', 'crossing', 'the', 'jungle', 'After', 'some', 'time', 'they', 'saw', 'a', 'bear', 'coming', 'towards', 'them', 'Then', 'one', 'of', 'the', 'friends', 'quickly', 'climbed', 'the', 'nearby', 'tree', 'and', 'the', 'other', 'one', 'did', 'not', 'know', 'how', 'to', 'climb', 'the', 'tree', 'So', 'he', 'lays', 'down', 'on', 'the', 'ground', 'holding', 'his', 'breath', 'The', 'bear', 'reaches', 'towards', 'him', 'and', 'sniffs', 'him', 'in', 'the', 'ear', 'After', 'some', 'time', 'the', 'bear', 'left', 'the', 'place', 'thinking', 'the', 'man', 'was', 'dead', 'Now', 'the', 'other', 'friend', 'climbs', 'down', 'and', 'asks', 'his', 'friend', 'What', 'did', 'bear', 'say', 'to', 'him', 'in', 'his', 'ear', 'He', 'replied', 'To', 'be', 'safe', 'from', 'the', 'fake', 'friends']

word_lengths = [len(x)  for x in mod_words ]

print(word_lengths)

[4, 5, 4, 3, 7, 3, 4, 8, 3, 6, 5, 4, 4, 4, 3, 1, 4, 6, 7, 4, 4, 3, 2, 3, 7, 7, 7, 3, 6, 4, 3, 3, 5, 3, 3, 3, 4, 3, 2, 5, 3, 4, 2, 2, 4, 4, 2, 3, 6, 7, 3, 6, 3, 4, 7, 7, 3, 3, 6, 3, 2, 3, 3, 5, 4, 4, 3, 4, 4, 3, 5, 8, 3, 3, 3, 4, 3, 3, 5, 6, 6, 4, 3, 4, 3, 6, 4, 3, 4, 3, 2, 3, 2, 3, 3, 2, 7, 2, 2, 4, 4, 3, 4, 7]

How many

n = len(word_lengths)
n

104

What is sum of all of them

s = sum(word_lengths)
s

417

Average

avg = s / n
avg

4.009615384615385

import numpy as np
import statistics as st

np.mean(word_lengths)

4.009615384615385

st.mean(word_lengths)

4.009615384615385

Median

Median is the middle element in the sorted list

sorted_lengths = sorted(word_lengths)
print(sorted_lengths)

[1, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 2, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 3, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 4, 5, 5, 5, 5, 5, 5, 5, 6, 6, 6, 6, 6, 6, 6, 6, 6, 7, 7, 7, 7, 7, 7, 7, 7, 7, 7, 8, 8]

What is middle position?

n / 2

52.0

As it is even, 52 and 53 are both in the middle

sorted_lengths[52]

4

sorted_lengths[53]

4

(sorted_lengths[52]   + sorted_lengths[53]) / 2

4.0

median = (sorted_lengths[52]   + sorted_lengths[53]) / 2
median

4.0

np.median(word_lengths)

4.0

st.median(word_lengths)

4.0

What is mode?

st.mode(word_lengths)

3

Mode is the item that appears highest number of times

word_lengths.count(0)

0

word_lengths.count(1)

1

word_lengths.count(2)

11

word_lengths.count(3)

37

word_lengths.count(4)

27

word_lengths.count(5)

7

min(word_lengths)

1

max(word_lengths)

8

for x in range(1, 9):
    #print(x)
    print(x,  word_lengths.count(x)    )

1 1
2 11
3 37
4 27
5 7
6 9
7 10
8 2

Mode is 3 as it appears 37 times, the most.

import collections  as cl

cl.Counter(word_lengths)

Counter({4: 27, 5: 7, 3: 37, 7: 10, 8: 2, 6: 9, 1: 1, 2: 11})

So we got the frequency of items.

plot frequency

counter = cl.Counter(word_lengths)

import matplotlib.pyplot as plt

tuples = tuple(   counter.items()     )
tuples

((4, 27), (5, 7), (3, 37), (7, 10), (8, 2), (6, 9), (1, 1), (2, 11))

plt.plot(tuples)

[<matplotlib.lines.Line2D at 0x7fe5737b68b0>,
 <matplotlib.lines.Line2D at 0x7fe5737b6910>]

Text Data Handling

Data Handling

Input Text

How many

What is sum of all of them

Average

Median

What is mode?

plot frequency

kindergarten

Python for kids

Fourier series

Linear Equations

Geometry

Laplace

Vectors

Differential equations

Functions

Jacobian

Lagrangian

Waves

Electromagnetism

Optics

Quantum mechanics concepts

Theory of relativity

Kinematics

Thermodynamics

Formulae

A level physics

Chemistry

English

Geography

Animation

Plotting

SVG

Python

Machine Learning

TensorFlow

PySpark

PyTorch

Natural Language Processing

Others