Chapter 7 Three or more means

7.1 One-way ANOVA

The one-way analysis of variance (ANOVA) is used to determine whether there are any statistically significant differences between the means of three or more independent groups.

Examples include:

  • Is there a difference in academic outcomes for pupils from ten different schools?
  • Is there a difference in daily coffee consumption between people in three different countries?

Intuitively, ANOVA is based on comparing the variance (or variation) between the groups, to variation within each particular group. If the ‘between’ variation is much larger than the ‘within’ variation, we are more likely to conclude that the means of the different groups are not equal. If the ‘between’ and ‘within’ variations are more similar in size, then we are less likely to conclude that there is a significant difference between sample means.
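This between/within comparison is exactly what the F statistic formalises: the ratio of the between-group variance to the within-group variance. As a minimal sketch (using small made-up groups, not the chapter's dataset), the calculation by hand looks like this:

# Sketch: the F statistic as between-group variance over within-group variance,
# computed by hand for three small illustrative groups
groups <- split(c(1.2, 0.8, 1.1,  2.3, 2.0, 2.6,  1.5, 1.8, 1.4),
                rep(c('a', 'b', 'c'), each = 3))
n          <- lengths(groups)              # observations per group
grand_mean <- mean(unlist(groups))
ss_between <- sum(n * (sapply(groups, mean) - grand_mean)^2)         # df = k - 1 = 2
ss_within  <- sum(sapply(groups, function(g) sum((g - mean(g))^2)))  # df = N - k = 6
(ss_between / 2) / (ss_within / 6)         # the F statistic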

Why can’t we just compare the means of every possible pair of groups, and see if any differences are statistically significant? The reason is that as the number of groups increases, the more likely we are to see differences that are due to chance alone. This means we are more likely to commit a Type I error, rejecting the null hypothesis (that there is no difference between the means) when the null hypothesis is in fact true.5 An ANOVA controls for this additional risk of Type I errors, maintaining the overall or experimentwise error rate, which is typically \(\alpha\) = 0.05.
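As a quick illustration of this inflation (worked through in footnote 5), the experimentwise error rate for m independent pairwise comparisons, each at \(\alpha\) = 0.05, is \(1 - (1 - \alpha)^m\):

# Experimentwise Type I error rate for all pairwise comparisons among k groups,
# assuming the comparisons are independent and each is tested at alpha = 0.05
alpha <- 0.05
k <- 3
m <- choose(k, 2)    # 3 pairwise comparisons for 3 groups
1 - (1 - alpha)^m    # ~0.143, i.e. about a 14.3% chance of at least one false positive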

It is important to note that the one-way ANOVA is an omnibus test statistic and cannot tell you which specific groups were statistically significantly different from each other, only that at least two groups were different. To determine which specific groups differed from each other, you would need to use a post hoc test.6

Dataset:

To illustrate, we will create a new dataset (mydata_anova1). We assume three groups (A, B and C) of normally-distributed variables, with means of 0, 1 and 0.5 respectively. We also create dummy variables for groups B and C (group A is our reference group, and so does not require an indicator):
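Note that rnorm_fixed() is not a base R function; it is a helper (from Lindeløv's original work) that draws a random normal sample and rescales it to have exactly the specified mean and standard deviation. A minimal sketch of such a helper:

# Helper (not base R): draw N normal values, then rescale the sample so it has
# *exactly* the requested mean and sd, consistent with how it is used below
rnorm_fixed <- function(N, mu = 0, sd = 1) {
  as.numeric(scale(rnorm(N)) * sd + mu)
}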

# Create dataset 'mydata_anova1' which has three groups:
set.seed(40)  # Makes the randomised figures reproducible
N <- 20       # Sample size of 20 for each group
mydata_anova1 <- data.frame(
  value = c(rnorm_fixed(N, mu = 0,   sd = 1),   # Group A
            rnorm_fixed(N, mu = 1,   sd = 1),   # Group B
            rnorm_fixed(N, mu = 0.5, sd = 1)),  # Group C
  group = rep(c('a', 'b', 'c'), each = N)
) %>%
  # Explicitly add indicator/dummy variables
  mutate(group_b = if_else(group == 'b', 1, 0)) %>%  # Group B dummy
  mutate(group_c = if_else(group == 'c', 1, 0))      # Group C dummy

Here’s a sample of six rows from our new dataset, which includes the dummy variables:

Table 7.1: Some randomly selected rows from our dataset
value       group   group_b   group_c
0.5685977   a       0         0
0.5873537   a       0         0
2.2828164   b       1         0
0.5557109   b       1         0
1.2658245   c       0         1
0.1858697   c       0         1

Note that the data used in the remainder of this book varies from that used by Lindeløv in the original version, but the principles being discussed are exactly the same.

ANOVA function:

R provides this test via the Anova() function from the car package, applied to an aov() object: car::Anova(aov()). However, this is simply a ‘wrapper’ around the equivalent linear model, described below, and yields identical results.
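A quick way to see this in R is that the object returned by aov() literally inherits from the "lm" class:

# aov() objects inherit from "lm": the ANOVA is a linear model under the hood
class(aov(value ~ group, data = mydata_anova1))
# [1] "aov" "lm"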

Equivalent linear model:

The linear model assumes that the dependent variable can be predicted with a single mean for each group:

\(y = \beta_0 + \beta_1 \cdot x_1 + \beta_2 \cdot x_2 + ... \qquad H_0: y = \beta_0\)

This assumption is illustrated below, in which there are assumed to be three groups (A, B and C). This extends the case with two groups, as was illustrated in the previous chapter, to cases with three or more groups. In this case, members of group B are identified by the dummy variable \(x_1\), with the coefficient (or slope) of \(\beta_1\), and members of group C are identified by the variable \(x_2\), with the slope of \(\beta_2\).


Figure 7.1: Linear model equivalent to ANOVA with three groups

The null hypothesis of the linear model is that \(\beta_1\) and \(\beta_2\) are both zero; or equivalently, that all groups have the same mean of \(\beta_0\). To test this hypothesis, an F-test is used. The F statistic in a regression is the result of a test where the null hypothesis is that all of the regression coefficients are equal to zero. The F-test compares your full model to one with no predictor variables (the intercept only model), and decides whether your added variables improved the model. If you get a significant result, then whatever coefficients you included in your full model improved the model’s fit (beyond what could be expected by chance alone).

In R, an F-test is carried out every time you run a linear regression, i.e. you do not have to specify it as an additional test.
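To make this concrete, the F-test reported at the bottom of a regression summary can be reproduced as an explicit comparison of the intercept-only model against the full model, using anova():

# The regression F-test, reproduced as an explicit nested-model comparison
intercept_only <- lm(value ~ 1, data = mydata_anova1)
full_model     <- lm(value ~ 1 + group_b + group_c, data = mydata_anova1)
anova(intercept_only, full_model)  # same F statistic and p-value as summary(full_model)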

Comparison:

The following code compares the ANOVA test in R with the identical linear model using dummy variables:

# Anova
car::Anova(aov(value ~ group, data = mydata_anova1))

# Linear model
lm <- lm(value ~ 1 + group_b + group_c, data = mydata_anova1)
lm %>% summary() %>% print(digits = 8)  # show summary output

Here are the results, which are identical:

Table 7.2: One-way ANOVA and equivalent linear model
Test    df   df.residual   F.statistic   p.value
Anova   2    57            5             0.00998
lm      2    57            5             0.00998

Here we would reject the null hypothesis that there were no differences between the means of our groups, at the 0.05 level of significance (because p = 0.00998).

It should be emphasised that the results of the ANOVA and the linear model are identical by construction, as they are both an F-test that compares the full model (with group dummies) to a model with an intercept only.
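Having rejected the omnibus null hypothesis, a post hoc test (see footnote 6) would tell us which pairs of groups differ. For example, base R's Tukey HSD test can be run directly on the aov() object:

# Post hoc pairwise comparisons (Tukey HSD), with p-values adjusted for
# the number of comparisons being made
TukeyHSD(aov(value ~ group, data = mydata_anova1))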

7.1.1 Kruskal-Wallis

The non-parametric version of the ANOVA is the Kruskal-Wallis test. We would need to use this test if our dependent variable was ordinal rather than continuous. We would also use the non-parametric version if other assumptions of the one-way ANOVA did not hold, including (1) that the dependent variable was approximately normally distributed for each category of the independent variable, and (2) homogeneity of variances.

Equivalent linear model:

The Kruskal-Wallis is essentially a one-way ANOVA test on ranks. It can be expressed as the following linear model:

\(rank(y) = \beta_0 + \beta_1 \cdot x_1 + \beta_2 \cdot x_2 + ... \qquad H_0: rank(y) = \beta_0\)
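Here rank() is R's built-in ranking function, which simply replaces each value with its rank order within the sample:

rank(c(2.3, 0.1, 1.7))  # returns 3 1 2: each value is replaced by its rank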

Comparison:

Here is a comparison of the Kruskal-Wallis test and the equivalent linear model (the equivalent ANOVA test is also included for completeness, which we’ve seen is just a ‘wrapper’ around the linear model).

# Kruskal-Wallis
kruskal.test(value ~ group, data = mydata_anova1)

# Linear model on ranks
lm <- lm(rank(value) ~ 1 + group_b + group_c, data = mydata_anova1)
lm %>% summary() %>% print(digits = 8)  # show summary output

# Anova on ranks (which is a wrapper around the linear model above)
car::Anova(aov(rank(value) ~ group, data = mydata_anova1))

Table 7.3: Kruskal-Wallis test and equivalent linear model
Test      df   p.value
Kruskal   2    0.0203
lm        2    0.0177

The p-values of the two tests are similar, though not identical. In this example, we would reject the null hypothesis that all groups had equal means (at the 0.05 level of significance).

7.2 Two-way ANOVA

The two-way ANOVA compares the means of groups that have been split on two independent variables, or ‘factors’.

For example: is there an interaction between gender and educational level on test anxiety among university students? Here gender (males / females) and education level (high school / undergraduate / postgraduate) are your independent variables or factors.

A two-way ANOVA tests three hypotheses:

  • That the population means of the first factor (e.g. each gender) are equal;
  • That the population means of the second factor (e.g. each education level) are equal; and
  • That there is no interaction between the two factors, i.e. that the relationship between anxiety and gender does not depend on education level, or equivalently that the relationship between anxiety and education does not depend on gender.

The first two hypotheses relate to the relationship between each factor and the dependent variable, referred to as ‘main effects’. Each of these is like a one-way ANOVA, but in the context of a larger model. The third hypothesis relates to the ‘interaction effect’. Here we will focus on the interaction effect.

Updated dataset:

To show the modelling in R, we’ll add another factor to our example dataset, mood, which reports whether a person is happy or sad. We also use this to create a dummy variable, mood_happy, which takes the value of 1 if the person is happy or 0 if they are sad.

mydata_anova2 <- mydata_anova1 %>%
  mutate(mood = rep(c('happy', 'sad'), 30)) %>%        # 60 observations in total
  mutate(mood_happy = if_else(mood == 'happy', 1, 0))  # The dummy variable

Here’s a selection of six rows from the updated dataset:

Table 7.4: Some randomly selected rows from our dataset
value       group   group_b   group_c   mood    mood_happy
0.5685977   a       0         0         happy   1
0.5873537   a       0         0         sad     0
2.2828164   b       1         0         happy   1
0.5557109   b       1         0         sad     0
1.2658245   c       0         1         happy   1
0.1858697   c       0         1         sad     0

Equivalent linear model:

The two-way ANOVA can test for the interaction between two factors (let’s ignore the main effects for now). It is equivalent to the following linear model, which is now expressed using matrix notation:

\(y = \beta_0 + \beta_1 X_1 + \beta_2 X_2 + \beta_3 X_1 X_2 \qquad H_0: \beta_3 = 0\)

Here \(X_1\) and \(X_2\) represent the two factors in our model (in this example, the ‘group’ and ‘mood’ of each observation, respectively). Each \(\beta_i\) is a vector of values that relates to the levels within each factor. In our example, \(\beta_1\) will have two values that correspond to the group dummy variables (group_b and group_c) and \(\beta_2\) will be a single value corresponding to the dummy variable for mood (mood_happy). The intercept \(\beta_0\), to which all other \(\beta\)s are relative, is now the mean for the first level of all factors (people in group A who are sad).

\(\beta_3\) is a vector that relates to the interactions of the factors. Here, it will be a vector comprising two values, corresponding to the combinations of the group and mood dummy variables (group_b and mood_happy, and group_c and mood_happy). The null hypothesis (of \(\beta_3 = 0\)) is that all values in this vector are zero, and that there is no interaction between group and mood when explaining the dependent variable, value.
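To make this concrete, the design matrix implied by this model can be inspected with model.matrix(); its six columns are the intercept, the three dummy variables, and the two interaction columns:

# Inspect the design matrix for the interaction model: intercept, group_b,
# group_c, mood_happy, group_b:mood_happy and group_c:mood_happy
head(model.matrix(~ 1 + group_b + group_c + mood_happy +
                    group_b:mood_happy + group_c:mood_happy,
                  data = mydata_anova2))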

To test this null hypothesis, we are going to carry out an F-test of two nested models:

  • a full model, which includes both factors and the interaction terms; and
  • a restricted or null model, which includes both factors but no interaction term.

This is similar to the F-test used in the one-way ANOVA above, but in this case our null model includes the two factors and not just an intercept term.

Comparison:

The code for running the ANOVA, using the package in R, is as follows:

# Two-way ANOVA, built-in function
car::Anova(aov(value ~ mood + group + mood:group, data = mydata_anova2))

The equivalent linear model, which includes an interaction between the two group dummy variables and the mood dummy variable, is specified as follows:

lm(value ~ 1 + group_b + group_c + mood_happy +
     group_b:mood_happy + group_c:mood_happy, data = mydata_anova2)

The results of the above model will give us the p-values of all the interaction terms (in this case, two) and tell us if any of these are statistically significant. But recall that our null hypothesis is that none of the interaction terms are significant, and we can’t rely on individual tests for this because of the increased risk of errors (as explained in the previous section on one-way ANOVAs). This is why we use the two-way ANOVA, which is an F-test that compares the full and null models:

# null model, without interactions
null <- lm(value ~ 1 + group_b + group_c + mood_happy, data = mydata_anova2)

# full model, with interactions
full <- lm(value ~ 1 + group_b + group_c + mood_happy +
             group_b:mood_happy + group_c:mood_happy, data = mydata_anova2)

# ANOVA using the two models above
anova(null, full, test = "F")  # anova() uses an F-test by default,
                               # but here it's made explicit

The results of the two approaches are presented in the table below. This shows that the two approaches are identical F-tests with the same resulting p-value.

Table 7.5: Two-way ANOVA and equivalent linear model
Test    df   df.res   F.value   p.value
ANOVA   2    54       0.2977    0.7437
lm      2    54       0.2977    0.7437

On the basis of the test above, we would fail to reject the null hypothesis that there was no interaction between the two factors in our model (group and mood), at the 0.05 level of significance.

7.3 ANCOVA

This adds a continuous independent variable, or covariate (e.g. age), to the model, in addition to one or more categorical independent variables (e.g. gender or education level).

An analysis of covariance (ANCOVA) evaluates whether the mean of the dependent variable is equal across levels of a categorical independent variable, while statistically controlling for the effects of other continuous variables (e.g. age) that are not of primary interest, known as covariates.

Updated dataset:

Here we will add a covariate to our one-way ANOVA above. In addition to the group dummy variables, we update our dataset to include each subject’s age, which we assume is correlated with the dependent variable, value:

# create a new column with the continuous variable 'age'
mydata_anova3 <- mydata_anova1 %>%
  mutate(age = value + rnorm_fixed(nrow(.), sd = 3))

Here’s a selection of six rows from the updated dataset:

Table 7.6: Some randomly selected rows from our dataset
value       group   group_b   group_c   age
0.5685977   a       0         0         -3.6341563
0.5873537   a       0         0         3.1155833
2.2828164   b       1         0         2.4620050
0.5557109   b       1         0         -3.5019533
1.2658245   c       0         1         0.8547244
0.1858697   c       0         1         -1.0090546

ANOVA function:

An ANCOVA can be carried out using the Anova() function and including the covariate (in this case age) as an independent variable.

car::Anova(aov(value ~ group + age, mydata_anova3))

Equivalent linear model:

The same results can be achieved by using F-tests to compare two sets of linear models: (i) the full model and the nested model which excludes age, and (ii) the full model and the nested model that excludes the group dummy variables. Again, the F-tests are carried out using the anova() function, which uses an F-test by default.

The full model can be formulated as follows:

\(y = \beta_0 + \beta_1 \cdot x_1 + \beta_2 \cdot x_2 + \beta_3 \cdot age\)

where the value of \(y\) varies by group, as represented here by the dummy variables \(x_1\) and \(x_2\), and also by \(age\).

This can be illustrated below. The ANCOVA tests whether there is a difference in the mean of y across the three groups, after controlling for age (the vertical shifts shown by \(\beta_1\) and \(\beta_2\)). It also tests whether the slope \((\beta_3)\) is statistically significant.


Figure 7.2: Linear model equivalent to ANCOVA with three groups
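A figure like this can be sketched from the fitted common-slope model; the following is illustrative plotting code (assuming ggplot2 is loaded), not the book's own:

# Sketch: observed data with fitted parallel lines from the common-slope model
library(ggplot2)
fit <- lm(value ~ group + age, data = mydata_anova3)
mydata_anova3 %>%
  mutate(fitted = fitted(fit)) %>%
  ggplot(aes(x = age, colour = group)) +
  geom_point(aes(y = value)) +  # observed data
  geom_line(aes(y = fitted))    # parallel lines: common slope, group-specific intercepts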

Here we run the linear model in R. The resulting p-values relate to the null hypotheses that age and group (respectively) have no effect on the dependent variable, in this case value.

# full model, with group and age variables
full <- lm(value ~ 1 + group_b + group_c + age, mydata_anova3)

# model without age
null_age <- lm(value ~ 1 + group_b + group_c, mydata_anova3)

# model without groups
null_group <- lm(value ~ 1 + age, mydata_anova3)

# result for age
anova(null_age, full)

# result for group
anova(null_group, full)

Comparison:

The results of the two approaches are presented in the table below:

Table 7.7: ANCOVA and linear model
Term    Model   df   res.df   F.value   p.value
age     Anova   1             5.2002    0.02641
age     lm      1    56       5.2002    0.02641
group   Anova   2             4.6929    0.01305
group   lm      2    56       4.6929    0.01305

Based on these results, we would reject the null hypothesis that there was no relationship between value and group, even after controlling for differences in age. We would also reject the null hypothesis that value was not related to age.

  5. For example, if there were only two groups, we would be carrying out one comparison: the mean of Group A vs the mean of Group B. At a 0.05 level of significance, there would be a 5% chance of a Type I error. If we had three groups, there would be three comparisons (Group A vs Group B, Group A vs Group C, and Group B vs Group C), and we would have a 14.3% (1-0.95^3) chance of a Type I error.

  6. Post hoc tests attempt to control the experimentwise error rate (usually \(\alpha\) = 0.05) in the same manner that the one-way ANOVA is used instead of multiple t-tests. Laerd suggests using the Tukey test (where there is homogeneity of variances in your samples) or the Games-Howell test (where there is not).
