EN: OLS regression of one dependent variable on one independent variable — coefficients, R², F-test, hypothesis tests, and confidence intervals.
VN: Hồi quy tuyến tính đơn — hệ số, R², kiểm định F, t, khoảng tin cậy.
Fitted equation: \(\hat{Y} = 2 + 0.8 X\). Predict Y when X = 10. Interpret slope.
OLS minimizes the sum of squared residuals \(\sum \hat{\varepsilon}_i^{2}\). The fitted line passes through \((\bar{X}, \bar{Y})\).
Cov(X,Y) = 12, Var(X) = 8, \(\bar{X}\) = 5, \(\bar{Y}\) = 20. Compute OLS slope and intercept.
SST = 200, SSE = 60. Compute SSR and R².
In simple regression, \(R^{2} = r^{2}\) (square of the Pearson correlation between X and Y).
Pearson correlation r = 0.6 in a simple regression. What is R²?
Smaller SEE → tighter fit. Units of SEE = units of Y.
SSE = 80, n = 22. Compute SEE.
Tests H₀: b₁ = 0 vs Hₐ: b₁ ≠ 0. In simple regression, F = t² of the slope test.
SSR = 120, SSE = 60, n = 32. Compute F.
An OLS regression returns \(\hat{b}_1\) = 0.85 with standard error 0.20, n = 32. Test H₀: b₁ = 0 at α = 5% (two-tailed).
\(\hat{b}_1 = 0.85\), SE(\(\hat{b}_1\)) = 0.20, n = 32, t(α/2, df=30) = 2.042. Build the 95% CI.
Prediction SE \(s_f\) is wider than SEE because it includes uncertainty in both the regression line and the new error.
Predicted Y = 50, prediction SE = 4, t(α/2, df=28) = 2.048. Build the 95% prediction interval.