non-parametric | educational research techniques

The chi-square test is a non-parametric test that is used in statistic to determine if an observed distribution or model conforms or is similar to an expected distribution or model. In simple terms, this test will tell you if the data you collected is similar to other data or to what you expected.

There are several types of chi-square test such as the Chi-square Test of Independence, which is used for nominal data, and the Goodness-of-Fit Test, which deals with data that is not nominal. This post is about the Goodness-of-Fit Test. The Goodness-of-Fit test compares the distribution of the observed data with an expected distribution.

A unique caveat of chi-square test is that we normally desire as a researcher to make sure we do not reject our model. This is opposite of traditional hypothesis testing which desires often to reject the null hypothesis as this indicates that there is a statistical difference. With chi-square test, we want our observed model to be similar to the values found in the expected model. What this means is that our model represents what is happening in the real-world and is not only theoretical. If we reject the null it means that the model we are trying to create is not similar to expected values that might be found in the real world. In other words, we found something that does not conform to what is expected. If a model does not represent the world, it may not serve much purpose.

Here are the assumptions of Goodness-of-Fit Test

Random selection of subjects
Mutually exclusive categories

Here are the steps

Determine hypothesis
- H0: There is no difference between the observed values/model and the expected values/model
- H1: There is a difference between the observed values/model and the expected values/model
Decide level of significance
Determine degree of freedom to find chi-square critical
Compute for the expected frequencies
Compute chi-square
Make decision to accept or reject null
State conclusion

Here is an example

A principal wants to know if the number of students absent each day of the week is the same. Below are the results for one week.

Day Absents

Monday 17

Tuesday 20

Wednesday 16

Thursday 14

Friday 13

Step 1: Determine Hypothesis

H0: The number of students absent is the same every day
H1: The number of students absent is not the same every day

Step 2: Decide level of significance

0.05

Step 3 Determine chi-square critical region (computer does this for you)

Chi-square critical region = 9.48

Step 4: Compute expected frequencies

Computer does this

Step 5: Compute Chi square (computer does this for you)

Chi-square = 1.87

Step 6: Make decision

Since the computed chi-square of 1,87 is less than the critical chi-square value of 9.48 we do not reject the null hypothesis

Step 7: Conclusion

Since we do not reject the null hypothesis we can say that there is a lack of evidence that there is a difference in the number of absences each day of the week. In other words, the number of students absent each day is the same.

NOTE: There is also a way to do this test when the expected frequencies are unequal

Spearman rank correlation aka ρ is used to measure the strength of the relationship between two variables. You may be already wondering what is the difference between Spearman rank correlation and Person product moment correlation. The difference is that Spearman rank correlation is a non-parametric test while Person product moment correlation is a parametric test.

A non-parametric test does not have to comply with the assumptions of parametric test such as the data being normally distributed. This allows a researcher to still make inferences from data that may not have normality. In addition, non-parametric test are used for data that is at the ordinal or nominal level. In many ways, Spearman correlation and Pearson product moment correlation compliment each other. One is used in non-parametric statistics and the other for parametric statistics and each analyzes the relationship between variables.

If you get suspicious results from your Pearson product moment correlation analysis or your data lacks normality Spearman rank correlation may be useful for you if you still want to determine if there is a relationship between the variables. Spearmen correlation works by ranking the data within each variable. Next, the Pearson product moment correlation is calculated between the two sets of rank variables. Below are the assumptions of Spearman correlation test.

Subjects are randomly selected
Observations are at the ordinal level at least

Below are the steps of Spearman correlation

Setup the hypotheses
1. H0: There is no correlation between the variables
2. H1: There is a correlation between the variables
Set the level of significance
Calculate the degrees of freedom and find the t-critical value (computer does this for you)
Calculate the value of Spearman correlation or ρ (computer does this for you)
Calculate the t-value(computer does this for you) and make a statistical decision
State conclusion

Here is an example

A clerk wants to see if there is a correlation between the overall grade students get on an exam and the number of words they wrote for their essay. Below are the results

Student Grade Words on Essay
1 79 147
2 76 143
3 78 147
4 84 168
5 90 206
6 83 155
7 93 192
8 94 211
9 97 209
10 85 187
11 88 200
12 82 150

Note: The computer will rank the data of each variable with a rank of 1 being the highest value of a variable and a rank 12 being the lowest value of a variable. Remember that the computer does this for you.

Step 1: State hypotheses
H0: There is no relationship between grades and words on the essay
H1: There is a relationship between grades and words on the essay

Step 2: Determine level of significance
Level set to 0.05

Step 3: Determine critical t-value
t = + 2.228 (computer does this for you)

Step 4: Compute Spearman correlation
ρ = 0.97 (computer does this for you)
Note: This correlation is very strong. Remember the strongest relationship possible is + 1

Step 5: Calculate t-value and make a decision
t = 12.62 ( the computer does this for you)
Since the computed t-value of 12.62 is greater than the t-critical value of 2.228 we reject the null hypothesis

Step 6: Conclusion
Since the null hypotheses are rejected, we can conclude that there is evidence that there is a strong relationship between exam grade and the number of words written on an essay. This means that a teacher could tell students they should write longer essays if they want a higher grade on exams

educational research techniques

Research techniques and education

Tag Archives: non-parametric

Chi-Square Goodness-of-Fit-Test

Like this:

Spearman Rank Correlation

Like this:

Share this:

Like this:

Share this:

Like this: