# Statistics - Class 10 : Notes

Statistics - Class 10 : Notes

(1) Data : It is a collection of facts such as numbers, words, measurements, alphabets, symbols, observations or even just description of things.
For Example: Data include marks of students, present-absent report of students, name of students, runs made by batsman, etc.

(2) Data Organization : The data available in an unorganized form is called as raw data. The extraction of the information from these raw data to give meaning to these data is known as data organization.

(3) Frequency of data : The number of times a particular quantity repeats itself in the given data is known as its frequency.
For ExampleTable below represents number of cars possessed by different families in a society.

Number of cars

No of families

0

4

1

8

2

2

Here, the frequency of families who have one car is 8.

(4) Frequency Distribution Table : The table which represents the number of times a particular quantity is repeated is known as the frequency distribution table.
For ExampleTable below represents number of cars possessed by different families in a society.

Number of cars

Frequency

0

4

1

8

2

2

3

3

4

2

5

1

(5) Mean of Grouped Data : the mean value of a variable is defined as the sum of all the values of the variable divided by the number of values. Suppose, if x1 , x2 ,. . ., xn are observations with respective frequencies f1 , f2 , . . ., fn , then this means observation x1 occurs f1 times, x2 occurs f2 times, and so on. Now, the sum of the values of all the observations = f1x1 + f2x2 + . . . + fnxn , and the number of observations = f1 + f2 + . . . + fn.
Hence, the mean of the data is given by
or

(6) Data Grouping : When the amount of data is huge, then the frequency distribution table for individual observation will result into a large table. In such case, we form group of data and then prepare a table. This type of table is called as grouped frequency distribution.
For ExampleSuppose, we need to prepare a table for Science marks obtained by 60 students in a class. Then preparing table for individual marks will result into a big table, so we will group the data as shown in the table below:

Range of Marks

No of students

0 - 10

2

10-20

9

20-30

22

30-40

20

40-50

6

50-60

1

Total

60

(i) Class Interval or Class: It represents the range in which the data are grouped. For the above example, groups 0-10, 10-20, 20-30, etc. represents class interval.
(ii) Lower class limit: The lowest number occurring in a particular class interval is known as its lower class limit. For the above example, if we consider the class interval 10-20 then 10 is called the lower class limit of that interval.
(iii) Upper class limit: The highest number occurring in a particular class interval is known as its upper class limit. For the above example, if we consider the class interval 10-20 then 20 is called the upper class limit of that interval.
(iv) Width or size of class interval: The difference between the upper class limit and the lower class limit is called as the width or size of class interval. For the above example, if we consider the class interval 10-20, then width or size of this class interval will be 10.
(v) Class mark: The frequency of each class interval is centred around its mid-point. Class mark = (Upper class limit + lower class limit)/2. For the above example, if we consider the class interval 10-20, then class mark will be 15.

(7) Methods to find mean:
(i) Direct Method:
For ExampleA survey was conducted by a group of students as a part of their environment awareness programme, in which they collected the following data regarding the number of plants in 20 houses in a locality. Find the mean number of plants per house.

No of plants

0 - 2

2 - 4

4 – 6

6 - 8

8 - 10

10- 12

12 – 14

No of houses121562

3

We know that, Class mark (xi) = (Upper class limit + lower class limit)/2.

No of plants

No of houses

xi

fixi

0 – 211

1

2 – 4

2

3

6

4 – 615

5

6 – 8

5

7

35

8 – 1069

54

10 – 12

2

11

22

12 – 14313

39

Total

20

162

$\Sigma&space;f_{i}$= 20
$\Sigma f_{i}x_{i}$= 162
$\overline{x}&space;=&space;\Sigma&space;f_{i}x_{i}/\Sigma&space;f_{i}$ = 162/20 = 8.1
Therefore, mean number of plants per house is 8.1

(ii) Assumed Mean Method:
For ExampleThe following distribution shows the daily pocket allowance of children of a locality. The mean pocket allowance is Rs 18. Find the missing frequency f.

Daily pocket allowance

(in Rs)

11 - 13

13 - 15

15 – 17

17 - 19

19 - 21

21- 23

23 – 25

No of children76913f5

4

We know that, Class mark (xi) = (Upper class limit + lower class limit)/2.
Given, mean pocket allowance, $\overline{x}$= 18 Rs.

Daily pocket allowance

(in Rs)

No of children

fi

Class mark

xi

di = xi - 18

fidi

11 – 13

7

12

-6

-42

13 – 15614-4

-24

15 – 17

9

16

-2

-18

17 – 1913180

0

19 – 21

f

20

2

2f

21 – 235224

20

23 – 25

4

24

6

24

Total$\Sigma&space;f_{i}$= 44 + f

2f – 40

From the table, we get,
$\Sigma&space;f_{i}$= 44 + f
$\Sigma f_{i}d_{i}$= 2f - 40
$\overline{x}$= a + $\Sigma f_{i}d_{i}$ /$\Sigma f_{i}$
18 = 18 + (2f – 40)/(44 + f)
0 = (2f – 40)/(44 + f)
2f – 40 = 0
f = 20.
Therefore, the missing frequency is 20.

(iii) Step-deviation method:
For ExampleConsider the following distribution of daily wages of 50 workers of a factory. Find the mean daily wages of the workers of the factory.

Daily wages (in Rs)

100 - 120

120 - 140

140 – 160

160 - 180

180 - 200

No of workers121486

10

We know that, Class mark (xi) = (Upper class limit + lower class limit)/2.
Here, Class size(h) = 20.
Taking 150 as assured mean(a), di, ui and fiui can be calculated as follows:

Daily wages

(in Rs)

No of workers

fi

xi

di = xi - 150

ui = di/20

fiui

100 – 12012110-40-2

-24

120 – 140

14

130

-20

-1

-14

140 – 160815000

0

160 – 180

6

170

20

1

6

180 – 20010190402

20

Total

-12

From the table, we get,
$\Sigma&space;f_{i}$= 50
$\Sigma f_{i}u_{i}$= - 12
Mean $\overline{x}$ = a + $\Sigma f_{i}u_{i}$/$\Sigma&space;f_{i}$
= 150 + (-12/50) 20
= 150 – 24/5
= 145.2
Therefore, the mean daily wage of the workers of the factory is 145.20 Rs.

(8) Mode of Grouped Data:
Modal class: The class interval having highest frequency is called the modal class and Mode is obtained using the modal class.Where
l = lower limit of the modal class,
h = size of the class interval (assuming all class sizes to be equal),
f1 = frequency of the modal class,
f0 = frequency of the class preceding the modal class,
f2 = frequency of the class succeeding the modal class.

For ExampleThe following data gives the information on the observed lifetimes (in hours) of 225 electrical components. Determine the modal lifetimes of the components.

0 - 20

20 - 40

40 - 60

60 - 80

80 - 100

100 – 120

Frequency1035526138

29

For the given data, it can be observed that the maximum class frequency is 61 which belong to class interval 60 – 80.
Therefore, modal class = 60 – 80.
Lower class limit (l) of modal class = 60
Frequency (f1) of modal class = 61
Frequency (f0) of class preceding the modal class = 52
Frequency (f2) of class succeeding the modal class = 38
Class size (h) = 20= 60 + ((61 – 52)/(2 x 61 – 52 – 38) (20)
= 60 + (9/(122 – 90)) (20)
= 60 + 90/16
=65.625
Therefore, modal lifetime of electrical components is 65.625 hours.

(9) Median of Grouped Data: For the given data, we need to have class interval, frequency distribution and cumulative frequency distribution. Then, median is calculated as
Where
l = lower limit of median class,
n = number of observations,
cf = cumulative frequency of class preceding the median class,
f = frequency of median class,
h = class size (assuming class size to be equal)

For ExampleThe following frequency distribution gives the monthly consumption of electricity of 68 consumers of a locality. Find the median of the data.

Monthly consumption

(in units)

No of consumers

65 – 85

4

85 – 105

5

105 – 125

13

125 – 145

20

145 – 165

14

165 – 185

8

185 – 205

4

To find the median of the given data, cumulative frequency is calculated as follows:

Monthly consumption

(in units)

No of consumers

Cumulative frequency

65 – 854

4

85 – 105

5

4 + 5 = 9

105 – 12513

9 + 13 = 22

125 – 145

20

22 + 20 = 42

145 – 16514

42 + 14 = 56

165 – 185

8

56 + 8 = 64

185 – 2054

64 + 4 = 68

From the table, we get n = 68.
The cumulative frequency (cf) is just greater than n/2 (i.e. 68/2 = 34) is 42, belonging to interval 125 – 145.
Therefore, median class = 125 – 145
Lower limit (l) of median class = 125
Class size (h) = 20
Frequency (f) of median class = 20
Cumulative frequency (cf) of class preceding median class = 22.= 125 + ((34 – 22)/20) (20)
= 125 + 12 = 137.
Therefore, median of the given data is 137.

(10) Graphical Representation of Cumulative Frequency Distribution:
For ExampleThe following distribution gives the daily income of 50 workers of a factory. Convert the distribution above to a less than type cumulative frequency distribution, and draw its ogive.

Daily income

(in Rs)

No of workers

100 – 120

12

120 – 140

14

140 – 160

8

160 – 180

6

180 – 200

10

The less than type cumulative frequency distribution is given as follows:

Daily income

(in Rs)

No of workers

Cumulative frequency

100 – 12012

12

120 – 140

14

12 + 14 = 26

140 – 1608

26 + 8 = 34

160 – 180

6

34 + 6 = 40

180 – 20010

40 + 10 = 50

Now, we will draw the ogive curve by plotting points (120, 12), (140, 26), (160, 34), (180, 40), (200, 50).