What is Interval Estimation?

  • A range of plausible values for a population parameter (e.g., mean , proportion ) constructed from sample data.
  • Contrasts with a point estimate (single value).
  • Provides a measure of uncertainty or confidence.
  • Often reported as Confidence Intervals (CIs).

Confidence Interval (CI): Basic Concept

  • A CI for parameter is a random interval , where is sample data.
  • For a CI:
  • This probability holds for repeated sampling.
  • Commonly, for a 95% CI.

Types of Confidence Intervals

  • For Mean (population standard deviation known):
  • For Mean (population standard deviation unknown): (where is the critical value from the t-distribution)
  • For Proportion: (where is the sample proportion, is the critical value from the standard normal distribution)

Example: CI for Mean (Normal Case, Known )

  • Assumptions: Sample from Normal , is known.
  • Steps:
    1. Compute sample mean:
    2. Standard Error (SE):
    3. Z-interval: (e.g., for 95% CI, , )

Example Calculation (Unknown )

  • Survey: , cm, cm. Find 95% CI for population mean .
  • Use t-distribution since is unknown.
  • Degrees of freedom .
  • For 95% CI, , the critical -value .
  • CI =
  • CI =

Interpretation of CI

  • If we repeat the experiment many times, about (e.g., 95%) of the computed intervals will contain the true parameter .
  • The parameter is fixed; the interval varies from sample to sample.

Choosing Z-Score vs T-Distribution

  • Use Z-score if: Sample size OR population standard deviation is known.
  • Use T-distribution if: Sample size AND population standard deviation is unknown.

Pivotal Quantity

  • A function of sample data and unknown parameter(s) , say .
  • Its probability distribution is known and does not depend on the unknown parameter(s) .
  • Crucial for constructing exact CIs and hypothesis tests.

Examples of Pivotal Quantities

  • Normal Distribution, Known Variance : For estimating , the pivotal quantity is:
  • Normal Distribution, Unknown Variance : For estimating , the pivotal quantity is: (t-distribution with degrees of freedom)

Hypothesis Test

Introduction & Role

  • A fundamental statistical procedure to make inferences about population parameters based on sample data.
  • Involves formulating hypotheses and using sample data to decide whether to reject the null hypothesis.
  • Role: Provides a systematic, objective framework for making data-based decisions, quantifying evidence, controlling error probabilities, and standardizing scientific inquiry.

Basic Concepts

How Hypothesis Testing Works

  • Null Hypothesis (): The default assumption or statement being tested (e.g., “no effect”, “status quo”). Usually contains equality.
    • Formal: (Eq 1)
  • Alternative Hypothesis ( or ): The claim you suspect might be true, contradicting .
    • Formal Types:
      • (two-sided/two-tailed) (Eq 2)
      • (right-sided/right-tailed) (Eq 3)
      • (left-sided/left-tailed) (Eq 4)

Testing Process Overview

  1. Collect data.
  2. Calculate a test statistic (summarizes data relative to ).
  3. Make a decision: If the test statistic is “too unusual” assuming is true, reject in favor of . Otherwise, fail to reject .

Test Statistics

  • Numerical summary of sample data used for decision making.
  • Choice depends on parameter, assumed population distribution, and sample size.
  • Z-statistic (Mean test, known or large ): (Eq 5)
  • T-statistic (Mean test, unknown, small ): (Eq 6)

Tests for a Single Mean

Z-test (Known )

  • Assumptions: known; Population is Normal OR is large (CLT applies).
  • Hypotheses: (Eq 7) (or , or ) (Eq 8)
  • Test Statistic: (Eq 9)
  • Decision Rule (Example: Two-tailed): Reject if . Fail to reject if .

Z-test Example (IQ Scores)

  • Scenario: Claim . . Sample: . Use .
  • , (Right-tailed).
  • Significance Level: . Critical Z-value .
  • Test Statistic: .
  • Decision: Since , reject .
  • Conclusion: Sufficient evidence supports the claim that the mean IQ score is greater than 82.

T-test (Unknown )

  • Assumptions: unknown; Population is Normal OR is large.
  • Hypotheses: (Eq 10) (or , or ) (Eq 11)
  • Test Statistic: (Eq 12) (follows t-distribution with )
  • Decision Rule (Example: Two-tailed): Reject if . Fail to reject if .

T-test Example (Exam Scores)

  • Scenario: Claim . Sample: . Use .
  • , (Left-tailed). (Eq 13, 14)
  • Significance Level: . Degrees of freedom . Critical t-value .
  • Test Statistic: . (Eq 15-19)
  • Decision: Since , fail to reject .
  • Conclusion: Insufficient evidence to conclude the mean score is less than 75.

Tests for Two Means

Z-test for Two Means (Independent Samples, Known )

  • Use when: Comparing means from 2 independent populations; known; populations Normal OR sample sizes large.
  • Hypotheses: (Often ) (Eq 20) (or ) (Eq 21)
  • Test Statistic: (Eq 22)
  • Decision Rule (Example: Two-tailed): Reject if .

Z-test Two Means Example (Teaching Methods)

  • Scenario: Compare Method A () vs Method B (). Claim: Method B is more effective ().
  • Data: A: . B: . Use .
  • (or ), (or ) (Left-tailed). (Eq 23, 24)
  • Significance Level: . Critical Z-value .
  • Test Statistic: . (Eq 25-30)
  • Decision: Since , reject .
  • Conclusion: Sufficient evidence that Method B is more effective than Method A.

T-test for Two Means (Independent Samples, Unknown but Equal )

  • Use when: Comparing 2 independent means; unknown but assumed equal; populations Normal OR sample sizes large.
  • Test Statistic: (Eq 31) (follows t-distribution with )
  • Pooled Variance (): Estimate of the common variance . (Eq 32)
  • Degrees of Freedom: .

T-test Two Means Example (Study Methods)

  • Scenario: Compare Method 1 () vs Method 2 (). Test if scores are different.
  • Data: M1: . M2: . Use . Assume equal variances.
  • (or ), (or ) (Two-tailed). (Eq 33, 34)
  • Significance Level: . . Critical t-values .
  • Calculate Pooled Variance: . (Eq 35-39)
  • Test Statistic: . (Eq 40-43)
  • Decision: Since , reject .
  • Conclusion: Sufficient evidence that the two study methods produce different test scores (Method 2 avg is higher).

Welch’s T-test (Independent Samples, Unknown and Unequal )

  • Use when: Comparing 2 independent means; unknown and not assumed equal; populations Normal OR sample sizes large.
  • Hypotheses: (Often ) (Eq 44) (or ) (Eq 45)
  • Test Statistic: (Eq 46)
  • Degrees of Freedom (Welch-Satterthwaite approximation): (Eq 47) (Often rounded down)
  • Decision Rule (Example: Two-tailed): Reject if .

Welch’s T-test Example (Drug Recovery Time)

  • Scenario: Compare New Drug () vs Old Drug (). Test if new drug reduces recovery time ().
  • Data: New: . Old: . Use . Variances not assumed equal.
  • (or ), (or ) (Left-tailed). (Eq 48, 49)
  • Significance Level: .
  • Calculate Test Statistic: . (Eq 50-53)
  • Calculate Degrees of Freedom: Let , . . Use . (Eq 54-56)
  • Critical t-value: .
  • Decision: Since , reject .
  • Conclusion: Strong statistical evidence that the new drug reduces recovery time compared to the old drug. The difference (8.1 days) is also clinically significant. 95% CI for difference: [5.9, 10.3] days.