[1] 36.97382
STAT 205: Introduction to Mathematical Statistics
University of British Columbia Okanagan
Estimating a parameter from sample data can involve:
A single number (a point estimate).
An interval of plausible values (a confidence interval).
Often, the goal of an investigation isn’t just parameter estimation, but deciding between two contradictory claims about the parameter.
This fall under statistical inference, specifically hypothesis testing.
In this lecture we will be covering
Terminology:
Significance Level (\(\alpha\)-level)
Rejection Region (RR)
Approaches:
Critical Values
\(p\)-value(next lecture)
A statistical hypothesis, or just hypothesis, is a claim or assertion either about
Today we’ll be focusing on hypothesis testing for single parameters
Null Hypotheses
Definition 1 A null hypothesis (\(H_0\)) represents the status quo or a claim to be tested, often stating that the population mean equals a specific value.
Alternative Hypothesis
Definition 2 The alternative hypothesis (\(H_A\) or \(H_a\) or \(H_1\)) represents an alternative claim and suggests a difference from the null hypothesis, often represented by a range of possible parameter values.
Objective: based on sample information, decide which of the two hypotheses is most likely.
Battery Life
Exercise 1 A battery manufacturer claims that the average battery life of their product is 50 hours.
Null Hypothesis
\(\mu = 50\) hours
Alternative Hypothesis
\(\mu \neq 50\) hours
For a one-sample with \(H_0: \theta = \theta_0\), our alternative hypothesis can take one of the following forms:
Two-tailed test | \(H_A: \theta \neq \theta_0\) |
Left-tailed test | \(H_A: \theta < \theta_0\) |
Right-tailed test | \(H_A: \theta > \theta_0\) |
Hypothesized value
\(\theta_0\) represents the null value (aka hypothesized value) of the population parameter \(\theta\). It practice this will be some specific value based on prior knowledge, historical data, claims, or assumptions about the population parameter.
Voter Turnout in Santa Clara County (iClicker)
Exercise 2 We want to test whether the proportion of registered voters in Santa Clara County who voted in the primary election is more than 30%. Which of the following correctly represents the null and alternative hypotheses?
Average College Duration (iClicker)
Exercise 3 We want to test whether college students take less than five years on average to graduate. Which of the following correctly represents the null and alternative hypotheses?
The null hypothesis (\(H_0\)) always\(^*\) includes an equal sign because it represents the assumed or claimed value of the population parameter that we are testing.
The alternative hypothesis \(H_A\) is what we want to investigate and is based on the research question. It points in the direction of the suspected difference.
Both hypotheses must be written in terms of the population parameters
In statistics, hypothesis-testing problems are formulated so that the null hypothesis is initially assumed to be true.
\(H_0\) will be rejected in favor of \(H_1\) if the sample data provides strong enough “evidence” (we’ll get back to this).
If there is insufficient evidence, we fail to reject \(H_0\), meaning we do not have enough support for \(H_A\).
Important
Failing to reject \(H_0\) but this does not prove \(H_0\) is true—only that we lack strong evidence against it.
TikTok Watching Habits
Exercise 4 (Tiktok) A social media research group claims that the average number of TikTok videos watched per session is 35 videos. However, a digital well-being organization suspects that people are actually watching more than that per session, contributing to increased screen time. To investigate, the organization collects data from 25 randomly selected TikTok users and finds that they watch an average of 38 videos per session. From past research, the population standard deviation is known to be \(\sigma\) = 6 videos.
\[ \begin{align} H_0: \quad \quad\quad \quad\quad \quad \quad \quad\quad \quad & H_1:\quad \quad\quad \quad\quad \quad \end{align} \]
If \(H_0: \mu = 35\) is true and \(\sigma = 6\), we know that \(\bar X \sim N(\mu_\bar{X} = 35, \sigma_\bar{X} = 6/\sqrt{25})\)
If our sample of size 25 produced a sample mean of 34.9, that would be pretty consistent with our null (\(H_0: \mu = 35\)).
If our sample of size 25 produced a sample mean of 35.1, that would be pretty consistent with our null (\(H_0: \mu = 35\)).
If our sample of size 25 produced a sample mean of 36.2, we might consider that a reasonably likely to happen under the null (\(H_0: \mu = 35\)).
If our sample of size 25 produced a sample mean of 37, we might start questining the initial claim (\(H_0: \mu = 35\)).
If our sample of size 25 produced a sample mean of 40, we might stop believing the claim (\(H_0: \mu = 35\)) altogether.
So what would be a reasonably cutoff for separating the unlikely values (that would lead us to reject \(H_0\)) from the values we deem are likely to have occured by random chance?
Critical Value
Definition 3 A critical value is a threshold that defines the boundary for rejecting the null hypothesis (\(H_0\)) in a hypothesis test. It is based on the significance level (\(\alpha\)) and the sampling distribution of the test statistic.
Significance Level
Definition 4 The significance level, denoted by \(\alpha\), represents the probability of rejecting \(H_0\) when it is true. Common1 choices: \(\alpha = 0.05\) (our course default), \(\alpha = 0.01\)
[1] 36.97382
Using \(\alpha = 0.05\) our critical value becomes 36.974.
[1] 1.644854
Figure 1: Alternatively, we find the critical value on the standard normal.
Need \(z\) such that
\[ \begin{align} \Pr(Z > z) & = 0.05\\ \implies \Pr(Z \leq z) &= 0.95\\ \end{align} \]
Since it falls between two values, we take the average
\[ \begin{align} \Pr(Z < 1.64) &= 0.945\\ \Pr(Z < 1.65) &= 0.505\\ \implies \Pr(Z < 1.645) &\approx 0.505 \end{align} \]
Since we assume \(\sigma\) is known, and we assume \(\mu = \mu_0\) (the hypothesized value) we can standardize \(\bar{x}\) using a Z-score transformation. This is called our test statistic.
\[ Z = \dfrac{\bar X - \mu_0}{\sigma/\sqrt{n}} \sim N(0,1) \] \(Z_{obs}\) is the observed test statistic obtained by subbing \(\bar x\) for \(\bar X\).
Test Statistic
Definition 5 A test statistic, a function of the sample data on which the decision (reject \(H_0\) or do not reject \(H_0\)) is to be based. It is typically calculated as:
\[ \begin{align} \text{Test Statistic} &= \dfrac{\text{Point Estimate - Hypothesized value} }{\text{Standard error of Point Estimator}}\\ &= \frac{\hat \theta - \theta_0}{\sigma_{\hat \theta}} \end{align} \]
Null Distribution
Definition 6 The null distribution is the probability distribution of a test statistic under the assumption that the null hypothesis (\(H_0\)) is true.
Reject if \[ \begin{align} &z_{obs} \geq z_{\alpha}\\ &z_{obs} \geq 1.645 \\ \end{align} \]
The rejection region (also called the critical region or RR for short) is the set of values for the test statistic that leads to reject the null hypothesis (\(H_0\)) in favour of the alternative (\(H_A\)). It is determined based on the chosen significance level (\(\alpha\)).
Fail to reject if
\[ \begin{align} &z_{obs} < z_{\alpha}\\ &z_{obs} < 1.645 \\ \end{align} \]
The acceptance region (also called the non-rejection region) is the set of values for the test statistic where we fail to reject the null hypothesis (\(H_0\)). It is the complement of the critical region and represents outcomes that are not extreme enough to provide sufficient evidence against \(H_0\) at the given significance level (\(\alpha\)).
\[ \begin{align} H_0: \mu &= \mu & H_0:& \mu > \mu \end{align} \]
Upper-tailed: Illustration of the hypothesis test for \(H_0: \mu = \mu_0\) versus \(H_A: \mu > \mu_0\). The null distribution follows a standard normal curve. The rejection region (red) corresponds to values where \(H_0\) is rejected at significance level \(\alpha\), while the acceptance region (green) includes values where \(H_0\) is not rejected. The critical value \(z_\alpha\) separates these regions
\[ \begin{align} H_0: \mu &= \mu & H_0:& \mu < \mu \end{align} \]
Lower-tailed: Illustration of the hypothesis test for \(H_0: \mu = \mu_0\) versus \(H_A: \mu < \mu_0\). The null distribution follows a standard normal curve. The rejection region (red) corresponds to values where \(H_0\) is rejected at significance level \(\alpha\), while the acceptance region (green) includes values where \(H_0\) is not rejected. The critical value \(z_\alpha\) separates these regions
Double-sided: Illustration of the hypothesis test for \(H_0: \mu = \mu_0\) versus \(H_A: \mu \neq \mu_0\). The null distribution follows a standard normal curve. The rejection region (red) corresponds to values where \(H_0\) is rejected at significance level \(\alpha\), while the acceptance region (green) includes values where \(H_0\) is not rejected. The critical values \(-z_{\alpha/2}\) and \(z_{\alpha/2}\) separates these regions
Reject if
\[ \begin{align} | z_{obs} | &> z_{\alpha/2}\\ | z_{obs}| &> ? \\ \end{align} \]
Double-sided: Illustration of the hypothesis test for \(H_0: \mu = \mu_0\) versus \(H_A: \mu \neq \mu_0\). The null distribution follows a standard normal curve. The rejection region (red) corresponds to values where \(H_0\) is rejected at significance level \(\alpha\), while the acceptance region (green) includes values where \(H_0\) is not rejected. The critical values \(-z_{\alpha/2}\) and \(z_{\alpha/2}\) separates these regions
Finding the Critical Value (iClicker)
Exercise 5 A two-tailed hypothesis test is conducted at a significance level of \(\alpha = 0.05\). The test statistic follows a standard normal distribution. What is the critical value \(z_{\alpha/2}\) that defines the rejection regions?
Test Procedure using a significance level of \(\alpha\)
State the null and alternative hypotheses \[ \begin{align} H_0: \mu &= \mu_0 & H_1: & \begin{cases} \mu < \mu_0\\ \mu > \mu_0 \\ \mu \neq \mu_0 \end{cases} \end{align} \]
Compute \(Z_{obs} = \dfrac{\bar x - \mu_0}{\frac{\sigma}{\sqrt{n}}}\)
Determine the Critical Value; see Figure 2
Make a Decision
TikTok Watching Habits Revisited
A social media research group claims that the average number of TikTok videos watched per session is 35 videos. However, a digital well-being organization suspects that people are actually watching more than that per session, contributing to increased screen time. To investigate, the organization collects data from 25 randomly selected TikTok users and finds that they watch an average of 38 videos per session. From past research, the population standard deviation is known to be \(\sigma\) = 6 videos.
Using a significance level of \(\alpha\) =0.05, conduct a hypothesis test to determine whether there is enough evidence to conclude that users are watching more than 35 videos per session.
\[ \begin{align} H_0: \mu &= 35 \quad \quad & H_1: \mu &> 35 \end{align} \]
Test statistic \(z_{obs} = \dfrac{\bar x - \mu_0}{\frac{\sigma}{\sqrt{n}}} = \dfrac{38 - 35}{\frac{6}{\sqrt{25}}} = 2.5\)
Determine the Critical Value
Make a Decision