版权说明:本文档由用户提供并上传,收益归属内容提供方,若内容存在侵权,请进行举报或认领
文档简介
1、Chapter 4 Statistical Properties of the OLS EstimatorsXiAn Institute of Post & TelecommunicationDept of Economic & ManagementProf. Long第1页,共69页。yt = household weekly food expendituresSimple Linear Regression Modelyt = b1 + b2 x t + e tx t = household weekly incomeFor a given level of x t, the expect
2、edlevel of food expenditures will be: E(yt|x t) = b1 + b2 x t4.2第2页,共69页。1. yt = b1 + b2x t + e t2. E(e t) = 0 E(yt) = b1 + b2x t 3. var(e t) = s 2 = var(yt)4. cov(e i,e j) = cov(yi,yj) = 05. x t c for every observation6. e tN(0,s 2) ytN(b1+ b2x t, s 2) Assumptions of the SimpleLinear Regression Mod
3、el4.3第3页,共69页。The population parameters b1 and b2are unknown population constants.The formulas that produce thesample estimates b1 and b2 arecalled the estimators of b1 and b2.When b0 and b1 are used to representthe formulas rather than specific values,they are called estimators of b1 and b2which ar
4、e random variables becausethey are different from sample to sample.4.4第4页,共69页。If the least squares estimators b0 and b1are random variables, then what are theirmeans, variances, covariances andprobability distributions?Compare the properties of alternative estimators to the properties of the least
5、squares estimators. Estimators are Random Variables ( estimates are not )4.5第5页,共69页。 The Expected Values of b1 and b2 The least squares formulas (estimators)in the simple regression case:b2 =nSxiyi - Sxi SyinSxi2 -(Sxi)2 2b1 = y - b2xwhere y = Syi / n and x = Sx i / n (4.1a)(4.1b)4.6第6页,共69页。Substi
6、tute in yi = b1 + b2xi + e ito get:b2 = b2 +nSxiei - Sxi SeinSxi -(Sxi)22The mean of b2 is:Eb2 = b2 +nSxiEei - Sxi SEeinSxi -(Sxi)22Since Eei = 0, then Eb2 = b2 .4.7第7页,共69页。The result Eb2 = b2 means thatthe distribution of b2 is centered at b2.Since the distribution of b2 is centered at b2 ,we say
7、thatb2 is an unbiased estimator of b2. An Unbiased Estimator 4.8第8页,共69页。The unbiasedness result on the previous slide assumes that weare using the correct model.If the model is of the wrong formor is missing important variables,then Eei = 0, then Eb2 = b2 . Wrong Model Specification 4.9第9页,共69页。 Un
8、biased Estimator of the Intercept In a similar manner, the estimator b1of the intercept or constant term can beshown to be an unbiased estimator of b1 when the model is correctly specified.Eb1 = b14.10第10页,共69页。b2 =nSxiyi - Sxi SyinSxi -(Sxi)22(4.3b)(4.3a)Equivalent expressions for b2:Expand and mul
9、tiply top and bottom by n:b2 =S(xi - x )(yi - y )S(xi - x )24.11第11页,共69页。 Variance of b2 Given that both yi and ei have variance s 2,the variance of the estimator b2 is:b2 is a function of the yi values butvar(b2) does not involve yi directly.S(x i - x)s 22var(b2) =4.12第12页,共69页。 Variance of b1 nS(
10、x i - x)2var(b1) = s 2Sx i2the variance of the estimator b1 is:b1 = y - b2xGiven4.13第13页,共69页。 Covariance of b1 and b2 S(x i - x)2cov(b1,b2) = s2- x If x = 0, slope can change without affectingthe variance.4.14第14页,共69页。 What factors determine variance and covariance ?1. s 2: uncertainty about yi va
11、lues uncertainty about b1, b2 and their relationship.2. The more spread out the xi values are then the more confidence we have in b1, b2, etc.3. The larger the sample size, n, the smaller the variances and covariances.4. The variance b1 is large when the (squared) xi values are far from zero (in eit
12、her direction).5. Changing the slope, b2, has no effect on the intercept, b1, when the sample mean is zero. But if sample mean is positive, the covariance between b1 and b2 will be negative, and vice versa.4.15第15页,共69页。Gauss-Markov Theorem Under the first five assumptions of the simple, linear regr
13、ession model, the ordinary least squares estimators b1 and b2 have the smallest variance of all linear and unbiased estimators of b1 and b2. This means that b1and b2 are the Best Linear Unbiased Estimators (BLUE) of b1 and b2.4.16第16页,共69页。implications of Gauss-Markov1. b1 and b2 are “best” within t
14、he class of linear and unbiased estimators.2. “Best” means smallest variance within the class of linear/unbiased.3. All of the first five assumptions must hold to satisfy Gauss-Markov.4. Gauss-Markov does not require assumption six: normality.5. G-Markov is not based on the least squares principle b
15、ut on b1 and b2.4.17第17页,共69页。G-Markov implications (continued)6. If we are not satisfied with restricting our estimation to the class of linear and unbiased estimators, we should ignore the Gauss-Markov Theorem and use some nonlinear and/or biased estimator instead. (Note: a biased or nonlinear est
16、imator could have smaller variance than those satisfying Gauss-Markov.)7. Gauss-Markov applies to the b1 and b2 estimators and not to particular sample values (estimates) of b1 and b2.4.18第18页,共69页。Probability Distribution of Least Squares Estimators b2 N b2 ,S(xi - x)s 22b1 N b1 , nS(x i - x)2s 2 S
17、xi24.19第19页,共69页。 yi and e i normally distributed The least squares estimator of b2 can beexpressed as a linear combination of yis:b2 = S wi yi b1 = y - b2x S(x i - x)2where wi = (x i - x)This means that b1and b2 are normal sincelinear combinations of normals are normal.4.20第20页,共69页。 normally distr
18、ibuted under The Central Limit TheoremIf the first five Gauss-Markov assumptionshold, and sample size, n, is sufficiently large,then the least squares estimators, b1 and b2,have a distribution that approximates thenormal distribution with greater accuracythe larger the value of sample size, n.4.21第2
19、1页,共69页。 Consistency We would like our estimators, b1 and b2, to collapse onto the true population values, b1 and b2, as sample size, n, goes to infinity.One way to achieve this consistency property is for the variances of b1 and b2 to go to zero as n goes to infinity.Since the formulas for the vari
20、ances of the least squares estimators b1 and b2 show that their variances do, in fact, go to zero, then b1 and b2, are consistent estimators of b1 and b2.4.22第22页,共69页。 Estimating the variance of the error term, s 2et = yt - b1 - b2 x tSett =1T2n- 2s 2 = s 2 is an unbiased estimator of s 2 4.23第23页,
21、共69页。The Least Squares Predictor, yo Given a value of the explanatory variable, Xo, we would like to predicta value of the dependent variable, yo.The least squares predictor is:yo = b1 + b2 x o (4.4) 4.24第24页,共69页。Inference in the Simple Regression ModelChapter 55.1第25页,共69页。1. yt = b1 + b2x t + e t
22、2. E(e t) = 0 E(yt) = b1 + b2x t 3. var(e t) = s 2 = var(yt)4. cov(e i,e j) = cov(yi,yj) = 05. x t c for every observation6. e tN(0,s 2) ytN(b1+ b2x t,s 2) Assumptions of the Simple Linear Regression Model5.2第26页,共69页。Probability Distribution of Least Squares Estimators b1 N b1 ,n S(x t - x)2s2 Sx t
23、2b2 N b2 ,S(x t - x)s225.3第27页,共69页。s 2=n - 2et2SUnbiased estimator of the error variance:s 2s 2(n - 2)n - 2cTransform to a chi-square distribution: Error Variance Estimation 5.4第28页,共69页。We make a correct decision if:The null hypothesis is false and we decide to reject it.The null hypothesis is tru
24、e and we decide not to reject it.Our decision is incorrect if:The null hypothesis is true and we decide to reject it. This is a type I error.The null hypothesis is false and we decide not to reject it. This is a type II error.5.5第29页,共69页。b2 N b2 ,S(x t - x)s22Create a standardized normal random var
25、iable, Z, by subtracting the mean of b2 and dividing by its standard deviation:b2 - b2 var(b2)Z = N(0,1)5.6第30页,共69页。Simple Linear Regressionyt = b1 + b2x t + e t where E e t = 0yt N(b1+ b2x t , s 2) since Eyt = b1 + b2x t e t = yt - b1 - b2x t Therefore, e t N(0,s 2) .5.7第31页,共69页。Create a Chi-Squa
26、ree t N(0,s 2) but want N(0,1) .(e t /s) N(0,1) Standard Normal .(e t /s)2 c2(1) Chi-Square .5.8第32页,共69页。Sum of Chi-SquaresSt =1(e t /s)2 = (e1 /s)2 + (e 2 /s)2 +. . .+ (e n /s)2 c2(1) + c2(1) +. . .+c2(1) = c2(N) Therefore, St =1(e t /s)2 c2(N)5.9第33页,共69页。 Since the errors e t = yt - b1 - b2x t a
27、re not observable, we estimate them with the sample residuals e t = yt - b1 - b2x t.Unlike the errors, the sample residuals arenot independent since they use up two degrees of freedom by using b1 and b2 to estimate b1 and b2. We get only n-2 degrees of freedom instead of n.Chi-Square degrees of free
28、dom5.10第34页,共69页。Student-t Distributiont = t(m)ZV / mwhere Z N(0,1)and V c(m)25.11第35页,共69页。t = t(n-2)ZV / (n- 2)where Z = (b2 - b2) var(b2)and var(b2) = s 2S( xi - x )25.12第36页,共69页。t = ZV / (n-2)(b2 - b2) var(b2)t = (n-2) s 2s 2(n- 2)V = (n-2) s 2s 25.13第37页,共69页。var(b2) = s 2S( xi - x )2(b2 - b2)
29、 s 2S( xi - x )2t = =(n-2) s 2s 2( n-2)(b2 - b2) s 2S( xi - x )2 notice thecancellations5.14第38页,共69页。(b2 - b2) s 2S( xi - x )2t = =(b2 - b2) var(b2) t = (b2 - b2) se(b2)5.15第39页,共69页。Students t - statistic t = t (n-2)(b2 - b2) se(b2) t has a Student-t Distribution with n-2 degrees of freedom. 5.16第
30、40页,共69页。Figure 5.1 Student-t Distribution(1-a)t0f(t)-tctca/2a/2red area = rejection region for 2-sided test5.17第41页,共69页。probability statementsP(-tc t tc) = 1 - aP( t tc ) = a/2P(-tc tc) = 1 - a(b2 - b2) se(b2)5.18第42页,共69页。Confidence IntervalsTwo-sided (1-a)x100% C.I. for b1:b1 - ta/2se(b1), b1 +
31、ta/2se(b1)b2 - ta/2se(b2), b2 + ta/2se(b2)Two-sided (1-a)x100% C.I. for b2:5.19第43页,共69页。Student-t vs. Normal Distribution1. Both are symmetric bell-shaped distributions.2. Student-t distribution has fatter tails than the normal.3. Student-t converges to the normal for infinite sample.4. Student-t c
32、onditional on degrees of freedom (df).5. Normal is a good approximation of Student-t for the first few decimal places when df 30 or so.5.20第44页,共69页。Hypothesis Tests1. A null hypothesis, H0.2. An alternative hypothesis, H1.3. A test statistic.4. A rejection region.5.21第45页,共69页。Rejection Rules1. Two
33、-Sided Test: If the value of the test statistic falls in the critical region in either tail of the t-distribution, then we reject the null hypothesis in favor of the alternative.2. Left-Tail Test:If the value of the test statistic falls in the critical region which lies in the left tail of the t-dis
34、tribution, then we reject the null hypothesis in favor of the alternative.2. Right-Tail Test:If the value of the test statistic falls in the critical region which lies in the right tail of the t-distribution, then we reject the null hypothesis in favor of the alternative.5.22第46页,共69页。Format for Hyp
35、othesis Testing1. Determine null and alternative hypotheses.2. Specify the test statistic and its distribution as if the null hypothesis were true.3. Select a and determine the rejection region.4. Calculate the sample value of test statistic.5. State your conclusion.5.23第47页,共69页。practical vs. stati
36、stical significance in economicsPractically but not statistically significant:When sample size is very small, a large average gap between the salaries of men and women might not be statistically significant. Statistically but not practically significant:When sample size is very large, a small correl
37、ation (say, r = 0.00000001) between the winning numbers in the PowerBall Lottery and the Dow-Jones Stock Market Index might be statistically significant.5.24第48页,共69页。Type I and Type II errorsType I error:We make the mistake of rejecting the null hypothesis when it is true.a = P (rejecting H0 when i
38、t is true).Type II error:We make the mistake of failing to reject the null hypothesis when it is false.b = P (failing to reject H0 when it is false).5.25第49页,共69页。Prediction IntervalsA (1-a)x100% prediction interval for yo is:yo tc se( f )se( f ) = var( f )f = yo - yoS(x t - x)2var( f ) = s 2 1 + +1
39、n(x o - x)25.26第50页,共69页。Two-sided Hypothesis5.275.6 HYPOTHESIS TESTING: 1 THE CONFIDENCE-INTERVAL APPROACH第51页,共69页。5.28One-Sided or One-Tail TestH0:2 0.3 and H1:2 0.3Sometimes we have a strong a priori or theoretical expectation (or expectations based on some previous empirical work) that the alte
40、rnative hypothesis is one-sided or unidirectional rather than two-sided, as just discussed.Thus, for our consumptionincome example, one could postulate that H0:2 0.3 and H1:2 0.3Perhaps economic theory or prior empirical work suggests that the marginal propensity to consume is greater than 0.3. Alth
41、ough the procedure to test this hypothesis can be easily derived from earlier works, the actual mechanics are better explained in terms of the test-of-significance approach discussed next.第52页,共69页。5.29 Before concluding our discussion of hypothesis testing, note that the testing procedure just outl
42、ined is known as a two-sided, or two-tail, test-of-significance procedure in that we consider the two extreme tails of the relevant probability distribution, the rejection regions, and reject the nullhypothesis if it lies in either tail. But this happens because our H1 was a two-sided composite hypo
43、thesis; 2= 0.3 means 2 is either greater than or less than 0.3. But suppose prior experience suggests to us that the MPC is expected to be greater than 0.3. In this case we have: H0:2 0.3 and H1:2 0.3. Although H1 is still a composite hypothesis, It is now one-sided.第53页,共69页。 To test this hypothesi
44、s, we use the one-tail test (the right tail), as shown in Figure 5.5. The test procedure is the same as before except that the upper confidence limit or critical value now corresponds to t = t.05, that is, the 5 percent level.As Figure 5.5 shows, we need not consider the lower tail of the t distribu
45、tion in this case. Whether one uses a two- or one-tail test of significance will depend upon how the alternative hypothesis is formulated, which, in turn, may depend upon some a priori considerations or prior empirical experience. 5.30第54页,共69页。5.31第55页,共69页。HYPOTHESIS TESTING: 2 THE TEST-OF-SIGNIFI
46、CANCE APPROACHTesting the Significance of Regression Coefficients: The t Test An alternative but complementary approach to the confidence-intervalmethod of testing statistical hypotheses is the test-of-significance approach developed along independent lines by R. A. Fisher and jointly by Neyman and
47、Pearson. Broadly speaking, a test of significance is a procedure by which sample results are used to verify the truth or falsity of a null hypothesis. The key idea behind tests of significance is that of a test statistic (estimator) and the sampling distribution of such a statistic under the null hy
48、pothesis. The decision to accept or reject H0 is made on the basis of the value of the test statistic obtained from the data at hand. As an illustration, recall that under the normality assumption the variable follows the t distribution with n-2 df. If the value of true 2 is specifiedunder the null
49、hypothesis, the t value can readily be computedfrom the available sample, and therefore it can serve as a test statistic. 5.32第56页,共69页。5.33第57页,共69页。5.34第58页,共69页。5.35第59页,共69页。5.36第60页,共69页。APPLICATION OF REGRESSION ANALYSIS:THE PROBLEM OF PREDICTIONOn the basis of the sample data of Table 3.2 we obtained the following sample regression:Y = 24.4545+0.5091Xi 5.37第61页,共69页。5.38第62页,共69页。第63页,共69页。5.39第64页,共69页。5.40第65页,共69页。SUMMARY
温馨提示
- 1. 本站所有资源如无特殊说明,都需要本地电脑安装OFFICE2007和PDF阅读器。图纸软件为CAD,CAXA,PROE,UG,SolidWorks等.压缩文件请下载最新的WinRAR软件解压。
- 2. 本站的文档不包含任何第三方提供的附件图纸等,如果需要附件,请联系上传者。文件的所有权益归上传用户所有。
- 3. 本站RAR压缩包中若带图纸,网页内容里面会有图纸预览,若没有图纸预览就没有图纸。
- 4. 未经权益所有人同意不得将文件中的内容挪作商业或盈利用途。
- 5. 人人文库网仅提供信息存储空间,仅对用户上传内容的表现方式做保护处理,对用户上传分享的文档内容本身不做任何修改或编辑,并不能对任何下载内容负责。
- 6. 下载文件中如有侵权或不适当内容,请与我们联系,我们立即纠正。
- 7. 本站不保证下载资源的准确性、安全性和完整性, 同时也不承担用户因使用这些下载资源对自己和他人造成任何形式的伤害或损失。
最新文档
- 2024年度山西省高校教师资格证之高等教育心理学过关检测试卷B卷附答案
- 通信行业:6G概念及远景白皮书
- 企业融资协议2024格式
- 2024临时活动场地租赁协议样本
- 2024快递业务重要客户服务协议
- 2024手工礼品定制协议
- 2024年施工协议追加条款格式
- 二手房销售预订协议格式 2024
- 2024年度新款手机租赁协议文本
- 2024年建筑项目分包协议样本
- 建设银行员工劳动合同
- 浙江大学学生社团手册(08)
- 水利水电工程专业毕业设计(共98页)
- 医院医用气体管路的设计计算(2014)
- 人教版统编高中语文“文学阅读与写作”学习任务群编写简介
- SQE质量月报参考格式
- 初中物理实验室课程表
- CTQ-2型支线接触网故障智能切除装置概述
- 砂石料取样试验标准与规范
- 运营管理已完毕第七讲库存
- 罗马数字对照表
评论
0/150
提交评论