Statistical estimates of random variables

Measures of the central tendency

Mean

The following data-set is given: [71, 50, 48, 67, 53]

The mean of the given data set can be calculated by the following code:

Weighted Mean

Another way to describe the data set [71, 48, 48, 67, 53] is to put it in to a table form:

f f(x)
71 1
48 2
67 1
53 1

The weighted Mean can be calculated by the following code:

Median

The following data set is given: [71, 50, 48, 67, 53]

The middle-most value is the median and it can be calculated by the following code:

The median of the data set [48, 50, 53, 65, 67, 71] is:

Mode

The mode is the value that occurs most frequently.

The mode of the data set [71, 50, 48, 48, 53] can be calculated by the following code:

If the values in a distribution have equal number of occurrences, then there isn’t a single mode.

The set of scores can be bimodal (with two modes), as the following set: [71, 48, 48, 50, 67, 67, 53]

Measures of Variability

Range

The range is calculated by subtracting the lowest value in a data set from the highest value in the data set.

For the dataset [71, 50, 48, 67, 53] the range is:

Standard Deviation

The standard deviation represents the average amount of variability in a data set

The standard deviation of the data set [71, 50, 48, 67, 53] is:

Variance

The variance, is simply the squared standard deviation and for the data set [71, 50, 48, 67, 53] is

Coefficient of Variation

The coefficient of variation is a derivative of two other statistical measures - the standard deviation and the mean and it is the ratio between them

Two datasets are given: [71, 50, 48, 67, 53] and [171, 150, 148, 167, 153]. Their coefficients of variation are: