Содержание

Слайд 2

Part 1 THE MEAN VALUES

Part 1 THE MEAN VALUES

Слайд 3

СHAPTER QUESTIONS Measures of location Types of means Measures of location

СHAPTER QUESTIONS

Measures of location
Types of means
Measures of location for ungrouped data
-

Arithmetic mean
- Harmonic mean
- Geometric mean
- Median and Mode
4. Measures of location for grouped data
- Arithmetic mean
- Harmonic mean
- Geometric mean
- Median and Mode
Слайд 4

Properties to describe numerical data: Central tendency Dispersion Shape Measures calculated

Properties to describe numerical data:
Central tendency
Dispersion
Shape

Measures calculated for:
Sample data
Statistics
Entire population
Parameters

Measures of

location and dispersion
Слайд 5

Measures of location include: Arithmetic mean Harmonic mean Geometric mean Median

Measures of location include:
Arithmetic mean
Harmonic mean
Geometric mean
Median

Mode

Measures of location and dispersion

Слайд 6

Grouped and Ungrouped UNGROUPED or raw data refers to data as

Grouped and Ungrouped

UNGROUPED or raw data refers to data as

they were collected, that is, before they are summarised or organised in any way or form
GROUPED data refers to data summarised in a frequency table
Слайд 7

What is the mean? The mean - is a general indicator

What is the mean?

The mean - is a general indicator characterizing

the typical level of varying trait per unit of qualitatively homogeneous population.
Слайд 8

Statistics derive the formula of the means of the formula of

Statistics derive the formula of the means of the formula of

mean exponential:
We introduce the following definitions
- X-bar - the symbol of the mean
Х1, Х2...Хn – measurement of a data value
f- frequency of a data values​​;
n – population size or sample size.
Слайд 9

There are the following types of mean: If z = -1

There are the following types of mean:
If z = -1

- the harmonic mean,
z = 0 - the geometric mean,
z = +1 - arithmetic mean,
z = +2 - mean square,
z = +3 - mean cubic, etc.
Слайд 10

The higher the degree of z, the greater the value of

The higher the degree of z, the greater the value of

the mean. If the characteristic values ​​are equal, the mean is equal to this constant.
There is the following relation, called the rule the majorizing mean:
Слайд 11

There are two ways of calculating mean: for ungrouped data -

There are two ways of calculating mean:
for ungrouped data -

is calculated as a simple mean
for grouped data -
is calculated weighted mean
Слайд 12

Types of means

Types of means

Слайд 13

Arithmetic mean Arithmetic mean value is called the mean value of

Arithmetic mean

Arithmetic mean value is called the mean value of

the sign, in the calculation of the total volume of which feature in the aggregate remains unchanged
Слайд 14

Characteristics of the arithmetic mean The arithmetic mean has a number

Characteristics of the arithmetic mean

The arithmetic mean has a number

of mathematical properties that can be used to calculate it in a simplified way.
1. If the data values (Xi) to reduce or increase by a constant number (A), the mean, respectively, decrease or increase by a same constant number (A)
Слайд 15

2. If the data values (Xi) divided or multiplied by a

2. If the data values (Xi) divided or multiplied by a

constant number (A), the mean decrease or increase, respectively, in the same amount of time (this feature allows you to change the frequency of specific gravities - relative frequency):
a) when divided by a constant number:
b) when multiplied by a constant number:
Слайд 16

3. If the frequency divided by a constant number, the mean will not change:

3. If the frequency divided by a constant number, the mean

will not change:
Слайд 17

4. Multiplying the mean for the amount of frequency equal to

4. Multiplying the mean for the amount of frequency equal to

the sum of multiplications variants on the frequency:
If
then the following equality holds:
Слайд 18

5.The sum of the deviations of the number in a data

5.The sum of the deviations of the number in a

data value from the mean is zero:
If
then
So
Слайд 19

Measures of location for ungrouped data In calculating summary values for

Measures of location for ungrouped data

In calculating summary values for a

data collection, the best is to find a central, or typical, value for the data.
More important measures of central tendency are presented in this section:
Mean (simple or weighter)
Median and Mode
Слайд 20

Measures of location for ungrouped data ARITHMETIC MEAN - This is

Measures of location for ungrouped data

ARITHMETIC MEAN
-

This is the most commonly used measure.
- The arithmetic mean is a summary value calculated by summing the numerical data values and dividing by the number of values

Sample size

Measures of location for ungrouped data

Слайд 21

sum of observations number of observations Population mean = Measures of

sum of observations
number of observations

Population mean =

Measures of location for

ungrouped data

ARITHMETIC MEAN
This is the most commonly used measure and is also called the mean.

Population size

Xi = observations of the population

∑ = “the sum of”

Mean

Слайд 22

Example - The sales of the six largest restaurant chains are

Example - The sales of the six largest restaurant chains are

presented in table

A mean sales amount of 5.280 $ million is computed using Equation of arithmetical mean simple

Слайд 23

MEDIAN for ungrouped data The median of a data is the

MEDIAN for ungrouped data

The median of a data is the middle

item in a set of observation that are arranged in order of magnitude.
The median is the measure of location most often reported for annual income and property value data.
A few extremely large incomes or property values can inflate the mean.
Слайд 24

MEDIAN Every ordinal-level, interval-level and ratio-level data set has a median

MEDIAN
Every ordinal-level, interval-level and ratio-level data set has a

median
The median is not sensitive to extreme values
The median does not have valuable mathematical properties for use in further computations
Half the values in data set is smaller than median.
Half the values in data set is larger than median.
Order the data from small to large.

Characteristics of the median

Слайд 25

Position of median If n is odd: Median item number =

Position of median

If n is odd:
Median item number

= (n+1)/2
If n is even:
Calculate (n+1)/2
The median is the average of the values before and after (n+1)/2.
Слайд 26

Example The median number of people treated daily at the emergency

Example

The median number of people treated daily at the emergency

room of St. Luke’s Hospital must be determined from the following data for the last six days: 25, 26, 45, 52, 65, 78
Since the data values are arranged from lowest to highest, the median be easily found. If the data values are arranged in a mess, they must rank.
Median item number = (6+1)/2 =3,5
Since the median is item 3,5 in the array, the third and fourth elements need to be averaged: (45+52)/2=48,5. Therefore, 48,5 is the median number of patients treated in hospital emergency room during the six-day period.
Слайд 27

MODE for ungrouped data Is the observation in the data set

MODE for ungrouped data
Is the observation in the data set that

occurs the most frequently.
Order the data from small to large.
If no observation repeats there is no mode.
If one observation occurs more frequently:
Unimodal
If two or more observation occur the same number of times:
Multimodal
Used for nominal scaled variables.
The mode does not have valuable mathematical properties for use in future computations
Слайд 28

The simple mean of the sample of nine measurements is given

The simple mean of the sample of nine measurements is given

by:

2

5

8

5

2

6

Example – Given the following data sample:
2 5 8 −3 5 2 6 5 −4

−3

5

−4

9

Слайд 29

−4 −3 2 2 5 5 5 6 8 Median item

−4 −3 2 2 5 5 5 6 8

Median item number =
(n+1)/2 = (9+1)/2 = 5th measurement

1

2

3

4

5

6

7

8

9

Median =

5

Odd number

The median of the sample of nine measurements is given by:

Example – Given the following data set:
2 5 8 −3 5 2 6 5 −4

Слайд 30

Determine the median of the sample of ten measurements. Order the

Determine the median of the sample of ten measurements.
Order the measurements

Example


Given the following data set:
2 5 8 −3 5 2 6 5 −4 3

−4 −3 2 2 3 5 5 5 6 8

(n+1)/2 = (10+1)/2 = 5,5th measurement

1

2

3

4

5

6

7

8

9

Median = (3+5)/2 = 4

Even number

10

Слайд 31

Determine the mode of the sample of nine measurements. Order the

Determine the mode of the sample of nine measurements.
Order the measurements


Given the following data set:
2 5 8 −3 5 2 6 5 −4

−4 −3 2 2 5 5 5 6 8

Mode = 5
Unimodal

Example

Слайд 32

Determine the mode of the sample of ten measurements. Order the

Determine the mode of the sample of ten measurements.
Order the measurements

Given the following data set:
2 5 8 −3 5 2 6 5 −4 2

−4 −3 2 2 2 5 5 5 6 8

Mode = 2 and 5
Multimodal - bimodal

Example

Слайд 33

Is used if М = const: Harmonic mean is also called

Is used if М = const:
Harmonic mean is also called the

simple mean of the inverse values .

Harmonic mean for ungrouped data

Слайд 34

For example: One student spends on a solution of task 1/3

For example:
One student spends on a solution of task 1/3

hours, the second student – ¼ (quarter) and the third student 1/5 hours. Harmonic mean will be calculated:

Harmonic mean for ungrouped data

Слайд 35

Geometric mean for ungrouped data This value is used as the

Geometric mean for ungrouped data

This value is used as the average

of the relations between the two values, or in the ranks of the distributions presented in the form of a geometric progression.
Слайд 36

Where П – the multiplication of the data value (Xi). n


Where П – the multiplication of the data value (Xi).
n

– power of root

Geometric mean for ungrouped data

Слайд 37

For example, the known data about the rate of growth of

For example, the known data about the rate of growth of

production

Calculate the geometric mean. It is 127 percent:

Geometric mean for ungrouped data

Слайд 38

Measures of location for grouped data ARITHMETIC MEAN Data is given

Measures of location for grouped data

ARITHMETIC MEAN
Data is given

in a frequency table
Only an approximate value of the mean
Слайд 39

Example There are data on seniority hundred employees in the table

Example

There are data on seniority hundred employees in the table

Слайд 40

Average seniority employee is:

Average seniority employee is:

Слайд 41

Harmonic mean for grouped data Harmonic mean - is the reciprocal

Harmonic mean for grouped data

Harmonic mean - is the reciprocal of

the arithmetic mean. Harmonic mean is used when statistical information does not contain frequencies, and presented as
xf = M.
Слайд 42

Harmonic mean for grouped data Harmonic mean is calculated by the formula: where M = xf

Harmonic mean for grouped data

Harmonic mean is calculated by the formula:
where

M = xf
Слайд 43

Example There are data on hárvesting the apples by three teams and on average per worker

Example

There are data on hárvesting the apples by three teams

and on average per worker
Слайд 44

is calculated by the formula: Where fi – frequency of the

is calculated by the formula:
Where fi – frequency of the data

value (Xi)
П – multiplication sign.

Geometric mean for grouped data

Слайд 45

Calculate the geometric mean. It is 127,5% percent: Geometric mean for grouped data EXAMPLE

Calculate the geometric mean. It is 127,5% percent:

Geometric mean for grouped

data

EXAMPLE

Слайд 46

Measures of location for grouped data MEDIAN Data is given in

Measures of location for grouped data

MEDIAN
Data is given in

a frequency table.
First cumulative frequency ≥ n/2 will indicate the median class interval.
Median can also be determined from the ogive.
Слайд 47

Measures of location for grouped data MODE Class interval that has

Measures of location for grouped data

MODE
Class interval that has the

largest frequency value will contain the mode.
Mode is the class midpoint of this class.
Mode must be determined from the histogram.
Слайд 48

Mode is calculated by the formula: where хМо – lower boundary

Mode is calculated by the formula:
where хМо – lower boundary of

the modal interval
i= хМо – xMo+1 - difference between the lower boundary of the modal interval and upper boundary
fMo, fMo-1, fMo+1 – frequencies of the modal interval, of interval foregoing modal interval and of interval following modal interval
Слайд 49

To calculate the mean for the sample of the 48 hours:

To calculate the mean for the sample of the 48 hours:
Determine

the class midpoints

Number of Number of
calls hours fi xi
[2–under 5) 3 3,5
[5–under 8) 4 6,5
[8–under 11) 11 9,5
[11–under 14) 13 12,5
[14–under 17) 9 15,5
[17–under 20) 6 18,5
[20–under 23) 2 21,5 n = 48

Measures of location for grouped data

Example – The following data represents the number of telephone calls received for two days at a municipal call centre. The data was measured per hour.

Слайд 50

Number of Number of xi calls hours fi [2–under 5) 3

Number of Number of xi
calls hours fi
[2–under 5) 3 3,5


[5–under 8) 4 6,5
[8–under 11) 11 9,5
[11–under 14) 13 12,5
[14–under 17) 9 15,5
[17–under 20) 6 18,5
[20–under 23) 2 21,5 n = 48

Measures of location for grouped data

Example – The following data represents the number of telephone calls received for two days at a municipal call centre. The data was measured per hour.

Average number of calls per hour is 12,44.

Слайд 51

To calculate the for the sample median of the 48: hours:

To calculate the for the sample median of the 48: hours:
determine

the cumulative frequencies

Number of Number of
calls hours fi F
[2–under 5) 3 3
[5–under 8) 4 7
[8–under 11) 11 18
[11–under 14) 13 31
[14–under 17) 9 40
[17–under 20) 6 46
[20–under 23) 2 48 n = 48

Measures of location for grouped data

Example – The following data represents the number of telephone calls received for two days at a municipal call centre. The data was measured per hour.

n/2 = 48/2 = 24
The first cumulative frequency ≥ 24

Слайд 52

Number of Number of calls hours fi F [2–under 5) 3

Number of Number of
calls hours fi F
[2–under 5)

3 3
[5–under 8) 4 7
[8–under 11) 11 18
[11–under 14) 13 31
[14–under 17) 9 40
[17–under 20) 6 46
[20–under 23) 2 48 n = 48

Measures of location for grouped data

Example – The following data represents the number of telephone calls received for two days at a municipal call centre. The data was measured per hour.

50% of the time less than 12,38 or 50% of the time more than 12,38 calls per hour.

Слайд 53

Measures of location for grouped data Example – The following data

Measures of location for grouped data

Example – The following data

represents the number of telephone calls received for two days at a municipal call centre. The data was measured per hour.

The median can be determined form the ogive.

n/2 = 48/2 = 24

Median = 12,4 Read at A.

A

Слайд 54

To calculate the for the sample mode of the 48 hours

To calculate the for the sample mode of the 48 hours
The

modal interval

Number of Number of
calls hours fi
[2–under 5) 3
[5–under 8) 4
[8–under 11) 11
[11–under 14) 13
[14–under 17) 9
[17–under 20) 6
[20–under 23) 2 n = 48

Measures of location for grouped data

Example – The following data represents the number of telephone calls received for two days at a municipal call centre. The data was measured per hour.

The highest frequency

Слайд 55

We substitute the data into the formula: Mo = 12,3 So,

We substitute the data into the formula:
Mo = 12,3
So, the most

frequent number of calls per hour = 12.3

MODE

Слайд 56

Measures of location for grouped data Example – The following data

Measures of location for grouped data

Example – The following data

represents the number of telephone calls received for two days at a municipal call centre. The data was measured per hour.

The mode can be determined form the histogram.
Mode = 12,3 Read at A.

A

Слайд 57

Relationship between mean, median, and mode If a distribution is symmetrical:

Relationship between mean, median, and mode

If a distribution is symmetrical:
the mean,

median and mode are the same and lie at centre of distribution

If a distribution is non-symmetrical:
skewed to the left or to the right
three measures differ

A positively skewed distribution
(skewed to the right)

A negatively skewed distribution
(skewed to the left)

Measures of location for grouped data

Слайд 58

Слайд 59

EXAMPLE Consider a study of the hourly wage rates in three

EXAMPLE

Consider a study of the hourly wage rates in three different

companies, For simplicity, assume that they employ the same number of employees: 100 people.
Слайд 60

Слайд 61

So we have three 100-element samples, which have the same average

So we have three 100-element samples, which have the same average

value (35) and the same variability (120). But these are different samples. The diversity of these samples can be seen even better when we draw their histograms.
Слайд 62

The histogram for company I (left chart) is symmetric. The histogram

The histogram for company I (left chart) is symmetric. The histogram

for company II (middle chart) is right skewed. The histogram for company III (right chart) is left skewed. It remains for us to find a way of determining the type of asymmetry (skewness) and “distinguishing” it from symmetry.
Слайд 63

Knowing the median, modal and average values enables us to resolve

Knowing the median, modal and average values enables us to resolve

the problem regarding the symmetry of the distribution of the sample. Hence,
For symmetrical distributions:
x = Me = Mo ,
For right skewed distributions:
x > Me > Mo
For left skewed distributions:
x < Me < Mo .

POSITIONAL CHARACTERISTICS

Слайд 64

We obtain the following relevant indicators (measures) of asymmetry: Index of

We obtain the following relevant indicators (measures) of asymmetry:
Index of

skewness: ;
Standardized skewness ratio:
Coefficient of asymmetry
Слайд 65

Example

Example

Слайд 66

Слайд 67

Слайд 68

The weighted arithmetic mean

The weighted arithmetic mean

Слайд 69

The median

The median

Слайд 70

The mode

The mode