Descriptive statistics are typically distinguished from inferential statistics. Descriptive statistics describe the data while inferential statistics, allow us to infer from the sample what is occurring in the population. Inferential statistics are also used to make judgments of the probability that the difference we observe between groups is dependable or that the difference may be due to chance or sampling error in the study.
Descriptive Statistics are used to simplify and present the large amounts of data in a manageable form. Each descriptive statistic reduces the data into a summary. For instance, consider your GPA (Grade Point Average) statistic. This single number describes the general performance of a student across a potentially wide range of courses.
When we describe a large set of observations with a single indicator we risk distorting the original data or losing important detail. The GPA doesn't tell us how difficult the courses were or what field the courses were in. In spite of these limitations, descriptive statistics give us a powerful summary that may enable us to compare across people, groups, or other units.
|
12,34,36,42,52 54,68,72,81,93 |
152,154,155,155,156 158,159,161,163,163 |
The mode is an unstable measure--minor changes in the data can change it substantially, but it our only choice for nominal level data.
Set A:
2, 13, 23, 32, 43
Mean = 24.6 Median = 23
Set A Altered: 12, 13, 23, 32, 143
Mean = 44.6 Median = 23
The mean is affected by extreme scores
(known as outliers), while the median remains the same. When
extreme scores occur in your data, you should report both the mean and
the median as measures of dispersion.
The two most common measures of variability
are the variance and the standard deviation. The standard
deviation shows the relation that a set of scores has to the mean of
the sample. Normal distributions, which are important in both descriptive
and inferential statistics, are completely determined by two "parameters":
the mean and the variance.
The variance
is the average difference of all the scores from the mean score. The variance
describes the heterogeneity of a distribution and is calculated from a
formula using every score in the distribution. It is typically symbolized
as "s2 ". The formula is:
Sum-all-scores(Score - Mean)2 Variance= s2 = ________________________ n - 1The square root of the variance is known as the "standard deviation." It is symbolized by "s".
Square-root(Variance)= s = Standard Deviation.
If the standard deviation of women's gender-role attitude scores was .90 on an 8-point scale and the standard deviation of men's scores was 1.68, you would know that men are more varied in gender-role attitudes than women are. Can you come up with a sociologically sound explanation of why this might be so?
Assuming that the distribution of scores is normal or bell-shaped (or close to it):
Use your browser BACK button to return to
previous page.
Or go to Methods
and Measurements Index or Site
Index.