t-Distribution
Learn about Student's t-distribution, its relationship to the normal distribution, and when to use t instead of z for statistical inference.
On This Page
The Need for the t-Distribution
When we don’t know the population standard deviation (σ), we must estimate it from our sample (s). This estimation introduces extra uncertainty.
What is the t-Distribution?
The t-distribution (also called Student’s t) is similar to the normal distribution but with heavier tails.
Where:
- = sample mean
- μ = hypothesized population mean
- s = sample standard deviation
- n = sample size
Key Properties
| Property | Description |
|---|---|
| Shape | Symmetric, bell-shaped |
| Center | Mean = 0 |
| Spread | More spread than normal (heavier tails) |
| Parameter | Degrees of freedom (df) |
Degrees of Freedom
The degrees of freedom (df) control the shape of the t-distribution.
For a single sample mean:
Where n is the sample size.
Effect of df on Shape
| df | Shape |
|---|---|
| Small (e.g., 3) | Very heavy tails, flat peak |
| Medium (e.g., 10) | Moderately heavy tails |
| Large (e.g., 30+) | Nearly normal |
| df → ∞ | Exactly normal |
t vs z Distribution
| Aspect | z (Normal) | t (Student’s t) |
|---|---|---|
| Use when | σ is known | σ is unknown (use s) |
| Tails | Lighter | Heavier (more extreme values likely) |
| Shape | Fixed | Changes with df |
| Critical values | Always same | Depend on df |
| Large samples | Use z | Approaches z |
For 95% confidence (two-tailed):
| df | t-critical | z-critical |
|---|---|---|
| 5 | 2.571 | 1.96 |
| 10 | 2.228 | 1.96 |
| 30 | 2.042 | 1.96 |
| 100 | 1.984 | 1.96 |
| ∞ | 1.960 | 1.96 |
Notice: t-critical is always ≥ z-critical, making intervals wider.
Reading the t-Table
A t-table provides critical values for different:
- Degrees of freedom (rows)
- Tail probabilities (columns)
Find t-critical for 95% CI with n = 15
- df = n - 1 = 15 - 1 = 14
- For 95% CI (two-tailed), look up α/2 = 0.025
- Find row df = 14, column 0.025
- t-critical = 2.145
This means: 95% of the t-distribution (df=14) falls between -2.145 and +2.145.
Confidence Intervals with t
Where t* is the critical value for df = n - 1
Data: n = 20 students, mean = 75, s = 8
Find: 95% confidence interval for population mean
Solution:
- df = 20 - 1 = 19
- t* for 95% CI, df = 19: 2.093
- Margin of error = 2.093 × (8/√20) = 2.093 × 1.789 = 3.74
- CI: 75 ± 3.74 = (71.26, 78.74)
Interpretation: We’re 95% confident the population mean is between 71.26 and 78.74.
Hypothesis Testing with t
Test statistic:
df = n - 1
Compare to t-critical or find p-value from t-distribution.
Claim: Average commute is 30 minutes Sample: n = 25, mean = 33.2, s = 7.5 Test: Two-tailed at α = 0.05
Solution:
- H₀: μ = 30, H₁: μ ≠ 30
- t = (33.2 - 30) / (7.5/√25) = 3.2 / 1.5 = 2.13
- df = 24, t-critical (two-tailed, 0.05) = 2.064
- Since |2.13| > 2.064, reject H₀
Conclusion: Evidence suggests mean commute ≠ 30 minutes.
When to Use t vs z
Flowchart
- Are you working with means or proportions?
- Proportions → use z
- Means → continue
- Do you know population σ?
- Yes → use z (rare)
- No → use t
Assumptions for t-Procedures
| Sample Size | Normality Requirement |
|---|---|
| n < 15 | Data must be nearly normal |
| 15 ≤ n < 30 | No strong skewness or outliers |
| n ≥ 30 | OK even if somewhat skewed |
The History of “Student’s t”
Summary
In this lesson, you learned:
- t-distribution is used when σ is unknown (we estimate with s)
- Has heavier tails than normal—accounts for extra uncertainty
- Shape depends on degrees of freedom (df = n - 1)
- As df increases, t approaches normal (z)
- Use t-critical values for confidence intervals and hypothesis tests
- Always valid for inference about means when σ is unknown
- Requires random sample and approximate normality (or large n)
Practice Problems
1. A sample of n = 12 has mean = 45 and s = 6. What df should you use for a t-procedure?
2. Find the t-critical value for a 90% CI with n = 20. (Hint: df = 19, look up two-tailed α = 0.10)
3. For n = 8, mean = 52, s = 4, construct a 95% CI for μ.
4. Why are t-based confidence intervals wider than z-based intervals (for the same confidence level)?
Click to see answers
1. df = n - 1 = 12 - 1 = 11
2. df = 19, two-tailed 90% means α/2 = 0.05 in each tail t* = 1.729 (from t-table)
3. df = 8 - 1 = 7 t* for 95% CI, df = 7: 2.365
SE = s/√n = 4/√8 = 1.414 ME = 2.365 × 1.414 = 3.34
CI: 52 ± 3.34 = (48.66, 55.34)
4. t-critical values are larger than z-critical values (e.g., t* = 2.26 vs z* = 1.96 for n = 10).
This is because:
- We’re estimating σ with s, which introduces extra uncertainty
- The t-distribution has heavier tails to account for this
- Wider intervals reflect our increased uncertainty
Next Steps
Apply t-distributions in hypothesis testing:
- t-Tests - One-sample, two-sample, paired
- Confidence Intervals - Complete CI methods
- T-Test Calculator - Practice calculations
Was this lesson helpful?
Help us improve by sharing your feedback or spreading the word.