The T-Test for Differences Between Two Groups


       The Formula Assumptions we must make  Unpaired Data  Paired Data


      The t-test allows us to determine whether the means of two groups are statistically different from each other. This test is appropriate whenever you want to compare the means of two groups.

      When we look at the differences between scores for two groups, we must judge the difference between their means relative to the spread or variability of their scores. The t-test does this.

      First, we must ask: Is the observed difference statistically significant?

      Second: If the difference is statistically significant, then is it a meaningful or substantive difference? Let's say, for example, that Protestants have a mean of 13.6 years of education and Catholics have a mean of 12.4 years. We find that the observed difference is statistically significant. Does the additional year give Protestants an "advantage" over Catholics?  What does the difference mean?  There is no simple way to answer this question. It depends on the situation and what your argument is (based on theoretical expectations). Substantive differences are different from statistically significant differences.

      Statistical Analysis of the t-test

      The formula for the t-test is a ratio. The top part of the equation is the difference between the two means. The bottom part is a measure of the variability or dispersion of the scores. The variability is essentially "noise" that may make it harder for us to see group differences. The formula for the t-test is:
      Group 1 mean - Group 2 mean/ Standard Error(Group 1 mean-Group 2 mean).
       
      t test
      The bottom part of the formula is called the standard error of the difference. To compute it, we take the variance (the standard deviation squared) for each group and divide it by the number of respondents in that group minus 1. We add these two values, then take their square root. The specific formula is shown below:
       
      standard error

      The final combined formula for the t-test is:
       

      t-test formula

      The t-value will be positive if the mean from Group I is larger than the mean of group II and negative if it is smaller. Once you compute the t-value you look up the t-value in a table of significance which tells us whether the ratio is large enough to say that the difference between the groups is significant.  In other words the difference observed is not likely due to chance or sampling error. As with any test of significance, you need to set the alpha level. In most social research, the "rule of thumb" is to set the alpha level at .05. This means that 5% of the time (five times out of a hundred) you would find a statistically significant difference between the means even if there is none ("chance"). The t-test also requires that we determine the degrees of freedom (df) for the test. In the t-test, the degrees of freedom is the sum of the persons in both groups minus 2. Given the alpha level, the df, and the t-value, you can look the t-value up in a standard table of significance (available as an appendix in the back of most statistics texts) to determine whether the t-value is large enough to be significant. If it is, you can conclude that the difference between the means for the two groups is different. Statistical computer programs routinely print the significance test results, saving you from looking them up in a table.



      The t-test is a parametric test.  In order to use a t-test, several assumptions must be met. The further away from meeting these assumptions that we get, the less reliable the test statistic becomes. The assumptions are:
      • observations are independent and not paired (if paired use the paired t Test.).
      • observations for each group are a sample from a population that is normally distributed
      • variances for the two independent groups must be considered in interpretation of the test statistic.
      If these assumptions are not met, it may be better to use a non-parametric test.

      In the unpaired (unmatched) analysis each unit of analysis (person in this case) is observed or measured only once.
      Example: The years of completed education for a group of Protestants and a group of Catholics.

      In the paired (matched) situation, each unit of analysis (person in this case) is observed or measured two or more times.
      Example: The Math SAT score of a group of people is taken before a course in math and then compared to the Math SAT score taken after the course for the same group of people.   This is an example of the pre-test/post-test experiment.

      Note: The t-test, one-way Analysis of Variance (ANOVA) and a form of regression analysis are mathematically equivalent  and would yield identical results.



      Use your browser's BACK button to return to previous page.
      or go to Methods and Measurements
      or the Site Index