Descriptive Statistics
: Total number of observations
: Number of classes (for grouped data)
: i-th observation (ungrouped data)
: Absolute frequency of the i-th class
: Cumulative frequency up to the i-th class
: Lower limit of a class
: Class width (length)
: Arithmetic Mean
: Median
: Mode
For Ungrouped Data
Measures of Central Tendency
Arithmetic Mean
Median
Mode
Measures of Dispersion
Range
Variance (population)
Practical formula:
Standard Deviation
Coefficient of Variation
Measures of Shape
Skewness Coefficient
- : Symmetric distribution
- : Positive skew (tail to the right)
- : Negative skew (tail to the left)
Kurtosis Coefficient
- : Mesokurtic distribution (like the normal)
- : Leptokurtic (more peaked)
- : Platykurtic (less peaked)
For Grouped Data (Frequency Tables)
Measures of Central Tendency
Arithmetic Mean
where is the class mark (midpoint).
Median
- : Lower limit of the median class
- : Frequency of the median class
- : Cumulative frequency preceding the median class
- : Width of the median class
Mode
- : Lower limit of the modal class
Measures of Dispersion
Range
Variance
Standard Deviation
Coefficient of Variation
Measures of Shape
Skewness Coefficient
Kurtosis Coefficient
Fractiles (Quantiles) for Grouped Data
General Formula
where:
- : Desired fractile
- : Lower limit of the fractile class
- : Corresponding proportion (e.g., 0.25 for )
- : Frequency of the fractile class
- : Preceding cumulative frequency
- : Width of the fractile class
Types of Fractiles
| Type | Symbol | Proportion (k) | Example |
|---|---|---|---|
| Quartiles | 0.25, 0.50, 0.75 | : | |
| Deciles | 0.10, 0.20, , 0.90 | = Median | |
| Percentiles | 0.01, 0.02, , 0.99 | : |
Important relationships:
- = Median
- ,
Practical Considerations
-
Grouped data: All formulas use the class mark (midpoint) as the representative value of the interval.
-
Median and fractiles: Their calculation first requires identifying the corresponding class by analyzing cumulative frequencies.
-
Mode: A distribution can be unimodal, bimodal, or multimodal. For grouped data, interpolation within the modal class is used.
-
Interpreting the coefficient of variation:
- : Low relative dispersion
- : Moderate dispersion
- : High relative dispersion
-
Units of measurement:
- Variance retains the original units squared
- Standard deviation maintains the original units
- The coefficients of variation, skewness, and kurtosis are dimensionless