What is Interval Estimation?

A range of plausible values for a population parameter (e.g., mean $μ$ , proportion $p$ ) constructed from sample data.
Contrasts with a point estimate (single value).
Provides a measure of uncertainty or confidence.
Often reported as Confidence Intervals (CIs).

Confidence Interval (CI): Basic Concept

A CI for parameter $θ$ is a random interval $[L (X), U (X)]$ , where $X$ is sample data.
For a $(1 - α) \times 100%$ CI: $P r (L (X) \leq θ \leq U (X)) = 1 - α$
This probability holds for repeated sampling.
Commonly, $α = 0.05$ for a 95% CI.

Types of Confidence Intervals

For Mean (population standard deviation $σ$ known): $\overset{ˉ}{X} \pm z^{*} \cdot \frac{σ}{n}$
For Mean (population standard deviation $σ$ unknown): $\overset{x}{ˉ} \pm t^{*} \cdot \frac{s}{n}$ (where $t^{*}$ is the critical value from the t-distribution)
For Proportion: $\overset{p}{^} \pm z^{*} \cdot \frac{p ^ ( 1 - p ^ )}{n}$ (where $\overset{p}{^}$ is the sample proportion, $z^{*}$ is the critical value from the standard normal distribution)

Example: CI for Mean (Normal Case, Known $σ^{2}$ )

Assumptions: Sample $X = (X_{1}, ..., X_{n})$ from Normal $N (μ, σ^{2})$ , $σ^{2}$ is known.
Steps:
1. Compute sample mean: $\overset{ˉ}{X} = \frac{1}{n} \sum_{i = 1}^{n} X_{i}$
2. Standard Error (SE): $SE = \frac{σ}{n}$
3. $(1 - α) \times 100%$ Z-interval: $[\overset{ˉ}{X} - z_{α /2} \frac{σ}{n}, \overset{ˉ}{X} + z_{α /2} \frac{σ}{n}]$ (e.g., for 95% CI, $α = 0.05$ , $z_{α /2} = z_{0.025} \approx 1.96$ )

Example Calculation (Unknown $σ$ )

Survey: $n = 100$ , $\overset{x}{ˉ} = 170$ cm, $s = 10$ cm. Find 95% CI for population mean $μ$ .
Use t-distribution since $σ$ is unknown.
$SE = \frac{s}{n} = \frac{10}{100} = 1$
Degrees of freedom $df = n - 1 = 99$ .
For 95% CI, $df = 99$ , the critical $t$ -value $t^{*} \approx 1.984$ .
CI = $\overset{x}{ˉ} \pm t^{*} \cdot SE = 170 \pm 1.984 \cdot 1 = 170 \pm 1.984$
CI = $[168.016, 171.984]$

Interpretation of CI

If we repeat the experiment many times, about $(1 - α) \times 100%$ (e.g., 95%) of the computed intervals will contain the true parameter $μ$ .
The parameter $μ$ is fixed; the interval varies from sample to sample.

Choosing Z-Score vs T-Distribution

Use Z-score if: Sample size $n \geq 30$ OR population standard deviation $σ$ is known.
Use T-distribution if: Sample size $n < 30$ AND population standard deviation $σ$ is unknown.

Pivotal Quantity

A function of sample data $X_{1}, ..., X_{n}$ and unknown parameter(s) $θ$ , say $Q (X_{1}, ..., X_{n}; θ)$ .
Its probability distribution is known and does not depend on the unknown parameter(s) $θ$ .
Crucial for constructing exact CIs and hypothesis tests.

Examples of Pivotal Quantities

Normal Distribution, Known Variance $σ^{2}$ : For estimating $μ$ , the pivotal quantity is: $Q = \frac{X ˉ - μ}{σ / n} \sim N (0, 1)$
Normal Distribution, Unknown Variance $σ^{2}$ : For estimating $μ$ , the pivotal quantity is: $Q = \frac{X ˉ - μ}{s / n} \sim t_{n - 1}$ (t-distribution with $n - 1$ degrees of freedom)

Hypothesis Test

Introduction & Role

A fundamental statistical procedure to make inferences about population parameters based on sample data.
Involves formulating hypotheses and using sample data to decide whether to reject the null hypothesis.
Role: Provides a systematic, objective framework for making data-based decisions, quantifying evidence, controlling error probabilities, and standardizing scientific inquiry.

Basic Concepts

How Hypothesis Testing Works

Null Hypothesis ( $H_{0}$ ): The default assumption or statement being tested (e.g., “no effect”, “status quo”). Usually contains equality.
- Formal: $H_{0} : θ = θ_{0}$ (Eq 1)
Alternative Hypothesis ( $H_{a}$ or $H_{1}$ ): The claim you suspect might be true, contradicting $H_{0}$ .
- Formal Types:
  - $H_{a} : θ \neq = θ_{0}$ (two-sided/two-tailed) (Eq 2)
  - $H_{a} : θ > θ_{0}$ (right-sided/right-tailed) (Eq 3)
  - $H_{a} : θ < θ_{0}$ (left-sided/left-tailed) (Eq 4)

Testing Process Overview

Collect data.
Calculate a test statistic (summarizes data relative to $H_{0}$ ).
Make a decision: If the test statistic is “too unusual” assuming $H_{0}$ is true, reject $H_{0}$ in favor of $H_{1}$ . Otherwise, fail to reject $H_{0}$ .

Test Statistics

Numerical summary of sample data used for decision making.
Choice depends on parameter, assumed population distribution, and sample size.
Z-statistic (Mean test, $σ$ known or large $n$ ): $Z = \frac{X ˉ - μ _{0}}{σ / n}$ (Eq 5)
T-statistic (Mean test, $σ$ unknown, small $n$ ): $t = \frac{X ˉ - μ _{0}}{s / n}$ (Eq 6)

Tests for a Single Mean

Z-test (Known $σ$ )

Assumptions: $σ$ known; Population is Normal OR $n$ is large (CLT applies).
Hypotheses: $H_{0} : μ = μ_{0}$ (Eq 7) $H_{a} : μ \neq = μ_{0}$ (or $μ > μ_{0}$ , or $μ < μ_{0}$ ) (Eq 8)
Test Statistic: $Z = \frac{X ˉ - μ _{0}}{σ / n}$ (Eq 9)
Decision Rule (Example: Two-tailed): Reject $H_{0}$ if $∣ Z ∣ > Z_{α /2}$ . Fail to reject $H_{0}$ if $∣ Z ∣ \leq Z_{α /2}$ .

Z-test Example (IQ Scores)

Scenario: Claim $μ > 82$ . $σ = 20$ . Sample: $n = 81, \overset{ˉ}{X} = 90$ . Use $α = 0.05$ .
$H_{0} : μ \leq 82$ , $H_{1} : μ > 82$ (Right-tailed).
Significance Level: $α = 0.05$ . Critical Z-value $Z_{0.05} = 1.645$ .
Test Statistic: $Z = \frac{90 - 82}{20/ 81} = \frac{8}{20/9} = \frac{8}{2.22} = 3.60$ .
Decision: Since $3.60 > 1.645$ , reject $H_{0}$ .
Conclusion: Sufficient evidence supports the claim that the mean IQ score is greater than 82.

T-test (Unknown $σ$ )

Assumptions: $σ$ unknown; Population is Normal OR $n$ is large.
Hypotheses: $H_{0} : μ = μ_{0}$ (Eq 10) $H_{a} : μ \neq = μ_{0}$ (or $μ > μ_{0}$ , or $μ < μ_{0}$ ) (Eq 11)
Test Statistic: $t = \frac{X ˉ - μ _{0}}{s / n}$ (Eq 12) (follows t-distribution with $df = n - 1$ )
Decision Rule (Example: Two-tailed): Reject $H_{0}$ if $∣ t ∣ > t_{α /2, n - 1}$ . Fail to reject $H_{0}$ if $∣ t ∣ \leq t_{α /2, n - 1}$ .

T-test Example (Exam Scores)

Scenario: Claim $μ \geq 75$ . Sample: $n = 16, \overset{ˉ}{X} = 71.5, s = 8.5$ . Use $α = 0.05$ .
$H_{0} : μ \geq 75$ , $H_{1} : μ < 75$ (Left-tailed). (Eq 13, 14)
Significance Level: $α = 0.05$ . Degrees of freedom $df = 16 - 1 = 15$ . Critical t-value $t_{0.05, 15} = - 1.753$ .
Test Statistic: $t = \frac{71.5 - 75}{8.5/ 16} = \frac{- 3.5}{8.5/4} = \frac{- 3.5}{2.125} \approx - 1.647$ . (Eq 15-19)
Decision: Since $- 1.647 > - 1.753$ , fail to reject $H_{0}$ .
Conclusion: Insufficient evidence to conclude the mean score is less than 75.

Tests for Two Means

Z-test for Two Means (Independent Samples, Known $σ_{1}, σ_{2}$ )

Use when: Comparing means from 2 independent populations; $σ_{1}, σ_{2}$ known; populations Normal OR sample sizes large.
Hypotheses: (Often $Δ_{0} = 0$ ) $H_{0} : μ_{1} - μ_{2} = Δ_{0}$ (Eq 20) $H_{a} : μ_{1} - μ_{2} \neq = Δ_{0}$ (or $>, <$ ) (Eq 21)
Test Statistic: $Z = \frac{( X ˉ _{1} - X ˉ _{2} ) - Δ _{0}}{\frac{σ _{1}^{2}}{n _{1}} + \frac{σ _{2}^{2}}{n _{2}}}$ (Eq 22)
Decision Rule (Example: Two-tailed): Reject $H_{0}$ if $∣ Z ∣ > Z_{α /2}$ .

Z-test Two Means Example (Teaching Methods)

Scenario: Compare Method A ( $μ_{1}$ ) vs Method B ( $μ_{2}$ ). Claim: Method B is more effective ( $μ_{2} > μ_{1}$ ).
Data: A: $n_{1} = 45, \overset{ˉ}{X}_{1} = 72, σ_{1} = 12$ . B: $n_{2} = 50, \overset{ˉ}{X}_{2} = 78, σ_{2} = 15$ . Use $α = 0.05$ .
$H_{0} : μ_{1} \geq μ_{2}$ (or $μ_{1} - μ_{2} \geq 0$ ), $H_{1} : μ_{1} < μ_{2}$ (or $μ_{1} - μ_{2} < 0$ ) (Left-tailed). (Eq 23, 24)
Significance Level: $α = 0.05$ . Critical Z-value $Z_{0.05} = - 1.645$ .
Test Statistic: $Z = \frac{( 72 - 78 ) - 0}{\frac{1 2 ^{2}}{45} + \frac{1 5 ^{2}}{50}} = \frac{- 6}{3.2 + 4.5} = \frac{- 6}{7.7} \approx \frac{- 6}{2.77} \approx - 2.17$ . (Eq 25-30)
Decision: Since $- 2.17 < - 1.645$ , reject $H_{0}$ .
Conclusion: Sufficient evidence that Method B is more effective than Method A.

T-test for Two Means (Independent Samples, Unknown but Equal $σ$ )

Use when: Comparing 2 independent means; $σ_{1}, σ_{2}$ unknown but assumed equal; populations Normal OR sample sizes large.
Test Statistic: $t = \frac{( X ˉ _{1} - X ˉ _{2} ) - Δ _{0}}{s _{p}^{2} ( \frac{1}{n _{1}} + \frac{1}{n _{2}} )}$ (Eq 31) (follows t-distribution with $df = n_{1} + n_{2} - 2$ )
Pooled Variance ( $s_{p}^{2}$ ): Estimate of the common variance $σ^{2}$ . $s_{p}^{2} = \frac{( n _{1} - 1 ) s _{1}^{2} + ( n _{2} - 1 ) s _{2}^{2}}{n _{1} + n _{2} - 2}$ (Eq 32)
Degrees of Freedom: $df = n_{1} + n_{2} - 2$ .

T-test Two Means Example (Study Methods)

Scenario: Compare Method 1 ( $μ_{1}$ ) vs Method 2 ( $μ_{2}$ ). Test if scores are different.
Data: M1: $n_{1} = 12, \overset{ˉ}{X}_{1} = 76, s_{1} = 8$ . M2: $n_{2} = 15, \overset{ˉ}{X}_{2} = 82, s_{2} = 7$ . Use $α = 0.05$ . Assume equal variances.
$H_{0} : μ_{1} = μ_{2}$ (or $μ_{1} - μ_{2} = 0$ ), $H_{1} : μ_{1} \neq = μ_{2}$ (or $μ_{1} - μ_{2} \neq = 0$ ) (Two-tailed). (Eq 33, 34)
Significance Level: $α = 0.05$ . $df = 12 + 15 - 2 = 25$ . Critical t-values $t_{0.025, 25} = \pm 2.060$ .
Calculate Pooled Variance: $s_{p}^{2} = \frac{( 11 ) ( 8 ^{2} ) + ( 14 ) ( 7 ^{2} )}{12 + 15 - 2} = \frac{704 + 686}{25} = \frac{1390}{25} = 55.6$ . (Eq 35-39)
Test Statistic: $t = \frac{( 76 - 82 ) - 0}{55.6 ( \frac{1}{12} + \frac{1}{15} )} = \frac{- 6}{55.6 ( 0.0833 + 0.0667 )} = \frac{- 6}{55.6 ( 0.15 )} = \frac{- 6}{8.34} \approx \frac{- 6}{2.89} \approx - 2.08$ . (Eq 40-43)
Decision: Since $∣ - 2.08∣ = 2.08 > 2.060$ , reject $H_{0}$ .
Conclusion: Sufficient evidence that the two study methods produce different test scores (Method 2 avg is higher).

Welch’s T-test (Independent Samples, Unknown and Unequal $σ$ )

Use when: Comparing 2 independent means; $σ_{1}, σ_{2}$ unknown and not assumed equal; populations Normal OR sample sizes large.
Hypotheses: (Often $Δ_{0} = 0$ ) $H_{0} : μ_{1} - μ_{2} = Δ_{0}$ (Eq 44) $H_{a} : μ_{1} - μ_{2} \neq = Δ_{0}$ (or $>, <$ ) (Eq 45)
Test Statistic: $t^{'} = \frac{( X ˉ _{1} - X ˉ _{2} ) - Δ _{0}}{\frac{s _{1}^{2}}{n _{1}} + \frac{s _{2}^{2}}{n _{2}}}$ (Eq 46)
Degrees of Freedom (Welch-Satterthwaite approximation): $df \approx \frac{( \frac{s _{1}^{2}}{n _{1}} + \frac{s _{2}^{2}}{n _{2}} ) ^{2}}{\frac{( s _{1}^{2} / n _{1} ) ^{2}}{n _{1} - 1} + \frac{( s _{2}^{2} / n _{2} ) ^{2}}{n _{2} - 1}}$ (Eq 47) (Often rounded down)
Decision Rule (Example: Two-tailed): Reject $H_{0}$ if $∣ t^{'} ∣ > t_{α /2, df}$ .

Welch’s T-test Example (Drug Recovery Time)

Scenario: Compare New Drug ( $μ_{n e w}$ ) vs Old Drug ( $μ_{o l d}$ ). Test if new drug reduces recovery time ( $μ_{n e w} < μ_{o l d}$ ).
Data: New: $n_{N} = 10, \overset{ˉ}{X}_{N} = 12.6, s_{N} = 2.22$ . Old: $n_{O} = 10, \overset{ˉ}{X}_{O} = 20.7, s_{O} = 2.41$ . Use $α = 0.05$ . Variances not assumed equal.
$H_{0} : μ_{n e w} \geq μ_{o l d}$ (or $μ_{n e w} - μ_{o l d} \geq 0$ ), $H_{1} : μ_{n e w} < μ_{o l d}$ (or $μ_{n e w} - μ_{o l d} < 0$ ) (Left-tailed). (Eq 48, 49)
Significance Level: $α = 0.05$ .
Calculate Test Statistic: $t^{'} = \frac{12.6 - 20.7}{\frac{2.2 2 ^{2}}{10} + \frac{2.4 1 ^{2}}{10}} = \frac{- 8.1}{\frac{4.9284}{10} + \frac{5.8081}{10}} = \frac{- 8.1}{0.493 + 0.581} = \frac{- 8.1}{1.074} \approx \frac{- 8.1}{1.036} \approx - 7.82$ . (Eq 50-53)
Calculate Degrees of Freedom: Let $v_{N} = s_{N}^{2} / n_{N} \approx 0.493$ , $v_{O} = s_{O}^{2} / n_{O} \approx 0.581$ . $df \approx \frac{( v _{N} + v _{O} ) ^{2}}{\frac{v _{N}^{2}}{n _{N} - 1} + \frac{v _{O}^{2}}{n _{O} - 1}} = \frac{( 0.493 + 0.581 ) ^{2}}{\frac{0.49 3 ^{2}}{9} + \frac{0.58 1 ^{2}}{9}} = \frac{1.07 4 ^{2}}{\frac{0.243}{9} + \frac{0.338}{9}} = \frac{1.153}{0.027 + 0.0376} = \frac{1.153}{0.0646} \approx 17.85$ . Use $df = 17$ . (Eq 54-56)
Critical t-value: $t_{0.05, 17} \approx - 1.740$ .
Decision: Since $t^{'} = - 7.82 < - 1.740$ , reject $H_{0}$ .
Conclusion: Strong statistical evidence that the new drug reduces recovery time compared to the old drug. The difference (8.1 days) is also clinically significant. 95% CI for difference: [5.9, 10.3] days.

Quartz 4

Explorer

Interval Estimation & Hypothesis Test

What is Interval Estimation?

Confidence Interval (CI): Basic Concept

Types of Confidence Intervals

Example: CI for Mean (Normal Case, Known $σ^{2}$ )

Example Calculation (Unknown $σ$ )

Interpretation of CI

Choosing Z-Score vs T-Distribution

Pivotal Quantity

Examples of Pivotal Quantities

Hypothesis Test

Introduction & Role

Basic Concepts

How Hypothesis Testing Works

Testing Process Overview

Test Statistics

Tests for a Single Mean

Z-test (Known $σ$ )

Z-test Example (IQ Scores)

T-test (Unknown $σ$ )

T-test Example (Exam Scores)

Tests for Two Means

Z-test for Two Means (Independent Samples, Known $σ_{1}, σ_{2}$ )

Z-test Two Means Example (Teaching Methods)

T-test for Two Means (Independent Samples, Unknown but Equal $σ$ )

T-test Two Means Example (Study Methods)

Welch’s T-test (Independent Samples, Unknown and Unequal $σ$ )

Welch’s T-test Example (Drug Recovery Time)

Graph View

Table of Contents

Backlinks

Quartz 4

Explorer

Interval Estimation & Hypothesis Test

What is Interval Estimation?

Confidence Interval (CI): Basic Concept

Types of Confidence Intervals

Example: CI for Mean (Normal Case, Known σ2)

Example Calculation (Unknown σ)

Interpretation of CI

Choosing Z-Score vs T-Distribution

Pivotal Quantity

Examples of Pivotal Quantities

Hypothesis Test

Introduction & Role

Basic Concepts

How Hypothesis Testing Works

Testing Process Overview

Test Statistics

Tests for a Single Mean

Z-test (Known σ)

Z-test Example (IQ Scores)

T-test (Unknown σ)

T-test Example (Exam Scores)

Tests for Two Means

Z-test for Two Means (Independent Samples, Known σ1​,σ2​)

Z-test Two Means Example (Teaching Methods)

T-test for Two Means (Independent Samples, Unknown but Equal σ)

T-test Two Means Example (Study Methods)

Welch’s T-test (Independent Samples, Unknown and Unequal σ)

Welch’s T-test Example (Drug Recovery Time)

Graph View

Table of Contents

Backlinks

Example: CI for Mean (Normal Case, Known $σ^{2}$ )

Example Calculation (Unknown $σ$ )

Z-test (Known $σ$ )

T-test (Unknown $σ$ )

Z-test for Two Means (Independent Samples, Known $σ_{1}, σ_{2}$ )

T-test for Two Means (Independent Samples, Unknown but Equal $σ$ )

Welch’s T-test (Independent Samples, Unknown and Unequal $σ$ )