intermediate 20 minutes

t-Distribution

Learn about Student's t-distribution, its relationship to the normal distribution, and when to use t instead of z for statistical inference.

On This Page
Advertisement

The Need for the t-Distribution

When we don’t know the population standard deviation (σ), we must estimate it from our sample (s). This estimation introduces extra uncertainty.


What is the t-Distribution?

The t-distribution (also called Student’s t) is similar to the normal distribution but with heavier tails.

t-Statistic

t=xˉμs/nt = \frac{\bar{x} - \mu}{s / \sqrt{n}}

Where:

  • xˉ\bar{x} = sample mean
  • μ = hypothesized population mean
  • s = sample standard deviation
  • n = sample size

Key Properties

PropertyDescription
ShapeSymmetric, bell-shaped
CenterMean = 0
SpreadMore spread than normal (heavier tails)
ParameterDegrees of freedom (df)

Degrees of Freedom

The degrees of freedom (df) control the shape of the t-distribution.

Degrees of Freedom

For a single sample mean: df=n1df = n - 1

Where n is the sample size.

Effect of df on Shape

dfShape
Small (e.g., 3)Very heavy tails, flat peak
Medium (e.g., 10)Moderately heavy tails
Large (e.g., 30+)Nearly normal
df → ∞Exactly normal

t vs z Distribution

Aspectz (Normal)t (Student’s t)
Use whenσ is knownσ is unknown (use s)
TailsLighterHeavier (more extreme values likely)
ShapeFixedChanges with df
Critical valuesAlways sameDepend on df
Large samplesUse zApproaches z
Critical Values Comparison

For 95% confidence (two-tailed):

dft-criticalz-critical
52.5711.96
102.2281.96
302.0421.96
1001.9841.96
1.9601.96

Notice: t-critical is always ≥ z-critical, making intervals wider.


Reading the t-Table

A t-table provides critical values for different:

  • Degrees of freedom (rows)
  • Tail probabilities (columns)
Using the t-Table

Find t-critical for 95% CI with n = 15

  1. df = n - 1 = 15 - 1 = 14
  2. For 95% CI (two-tailed), look up α/2 = 0.025
  3. Find row df = 14, column 0.025
  4. t-critical = 2.145

This means: 95% of the t-distribution (df=14) falls between -2.145 and +2.145.


Confidence Intervals with t

CI for Mean (σ unknown)

xˉ±t×sn\bar{x} \pm t^* \times \frac{s}{\sqrt{n}}

Where t* is the critical value for df = n - 1

Calculating a t-Based CI

Data: n = 20 students, mean = 75, s = 8

Find: 95% confidence interval for population mean

Solution:

  1. df = 20 - 1 = 19
  2. t* for 95% CI, df = 19: 2.093
  3. Margin of error = 2.093 × (8/√20) = 2.093 × 1.789 = 3.74
  4. CI: 75 ± 3.74 = (71.26, 78.74)

Interpretation: We’re 95% confident the population mean is between 71.26 and 78.74.


Hypothesis Testing with t

One-Sample t-Test

Test statistic: t=xˉμ0s/nt = \frac{\bar{x} - \mu_0}{s / \sqrt{n}}

df = n - 1

Compare to t-critical or find p-value from t-distribution.

One-Sample t-Test

Claim: Average commute is 30 minutes Sample: n = 25, mean = 33.2, s = 7.5 Test: Two-tailed at α = 0.05

Solution:

  1. H₀: μ = 30, H₁: μ ≠ 30
  2. t = (33.2 - 30) / (7.5/√25) = 3.2 / 1.5 = 2.13
  3. df = 24, t-critical (two-tailed, 0.05) = 2.064
  4. Since |2.13| > 2.064, reject H₀

Conclusion: Evidence suggests mean commute ≠ 30 minutes.


When to Use t vs z

Flowchart

  1. Are you working with means or proportions?
    • Proportions → use z
    • Means → continue
  2. Do you know population σ?
    • Yes → use z (rare)
    • No → use t

Assumptions for t-Procedures

Sample SizeNormality Requirement
n < 15Data must be nearly normal
15 ≤ n < 30No strong skewness or outliers
n ≥ 30OK even if somewhat skewed

The History of “Student’s t”


Summary

In this lesson, you learned:

  • t-distribution is used when σ is unknown (we estimate with s)
  • Has heavier tails than normal—accounts for extra uncertainty
  • Shape depends on degrees of freedom (df = n - 1)
  • As df increases, t approaches normal (z)
  • Use t-critical values for confidence intervals and hypothesis tests
  • Always valid for inference about means when σ is unknown
  • Requires random sample and approximate normality (or large n)

Practice Problems

1. A sample of n = 12 has mean = 45 and s = 6. What df should you use for a t-procedure?

2. Find the t-critical value for a 90% CI with n = 20. (Hint: df = 19, look up two-tailed α = 0.10)

3. For n = 8, mean = 52, s = 4, construct a 95% CI for μ.

4. Why are t-based confidence intervals wider than z-based intervals (for the same confidence level)?

Click to see answers

1. df = n - 1 = 12 - 1 = 11

2. df = 19, two-tailed 90% means α/2 = 0.05 in each tail t* = 1.729 (from t-table)

3. df = 8 - 1 = 7 t* for 95% CI, df = 7: 2.365

SE = s/√n = 4/√8 = 1.414 ME = 2.365 × 1.414 = 3.34

CI: 52 ± 3.34 = (48.66, 55.34)

4. t-critical values are larger than z-critical values (e.g., t* = 2.26 vs z* = 1.96 for n = 10).

This is because:

  • We’re estimating σ with s, which introduces extra uncertainty
  • The t-distribution has heavier tails to account for this
  • Wider intervals reflect our increased uncertainty

Next Steps

Apply t-distributions in hypothesis testing:

Advertisement

Was this lesson helpful?

Help us improve by sharing your feedback or spreading the word.