In order to determine the effectiveness of auditing programs in promoting energy savings, the NSW Department of Energy has carried out a project in which they ran a regression model on a small number of households.

Authors Avatar

                                                                   

In order to determine the effectiveness of auditing programs in promoting energy savings, the NSW Department of Energy has carried out a project in which they ran a regression model on a small number of households. The households were investigated based on two forms of data: billing data consisting of annual household consumption of electricity and survey data including household demographic characteristics. Based on the econometrics, the Department concluded of a positive effectiveness of the audit program.

In this paper, we are going to review the project in the following parts: Initial data analysis, Critical review of the Department’s report, Proposed econometric approach, Initial result & model evaluation and Final results and main conclusions.

a. Replication

Using Shazam, we successfully replicated the results presented in the Department of Energy’s report. See appendix 1.1 and 1.2 for details of the regressions. The only difference related to scale is that the coefficient for income is 1000 times smaller than that is reported by the Department. The reason comes from the use of the scale where the Department uses 1 dollar to represent as unit while the data displays thousand dollar unit.

b. Significance of the intercept

Interestingly, the result of the regression of model (3) suggests an insignificance of the intercept, i.e. the intercept is not significantly different from 0. To confirm that, the following test is conducted.

Test hypothesis:

H0: β1=0

HA: β1≠0

LOS: α=0.05

Test statistic:  t = -0.1668 < 2    (t stat is from Appendix 1.2)

Conclusion: Do not reject H0 i.e. we conclude that the intercept is not significantly different from 0 at 0.05 LOS.

The above outcome encourages us to run a regression of no intercept. The detailed results are shown in Appendix 1.3 that derives the following model:

             ELEXP = 69.282 ROOMNO  -99.47DAUD +  u 

                   (4.232)      (40.88)     

This model seems to produce better results than the previous one (3). However, comparing the two is not a straight forward process of using (conventional) R2.

•R2 of intercept-presented model vs. R2 of non-constant model

For both types of the regressions (with and without intercept), R2 is like a test of a restriction that β2=0. Yet, under regression with intercept, the test compares how much better Ŷ does than Y-bar while under non-constant model, R2 implies how much better Yhat does than 0. This is because under non-constant model, the regression line is forced to pass through the origin. Therefore, the two types of R2 show different characteristics that may cause problems in comparing between the models using R2.

•Conventional R2 vs. raw moment r2

In non-constant models, SSR ≠ SST – SSE, then the conventional R2 = 1 – SSE can be

                                                                                                                        SST                    

negative. Raw moment r2 is introduced to overcome the problem. It is calculated by       1- SSE . This reflects the true proportion of dependent variable is explained by the

     ∑yi2               

independent variables under non-constant models. r2 is always greater than R2 () .  Although 0<r2<1, it’s not comparable with R2 (as calculated on different basis). Therefore, comparing models based on R2 is sometimes difficult.  The suggestion is that we should stick with intercept-presented model.                                              

c. Discuss features of the full data set

In this part, we expand analysis to the full data set by discussing about its features. Firstly, means of dummy variables gives us proportion data. For example, mean of DAUD is 0.3 showing that there are about 30% of the households audited; similarly 22.6% are using gas and the number of households joined in the previous report accounts for 33.7% of the full sample. These statistics indicate that the sample is quite well split and there is no serious biased selectivity. The others independent variables tell us about income, number of rooms, number of people in the households. Comparing to the initial sample, the following points are drawn. On average, there are more people in a household in the full data set than the sample of 100 households, 2.69 compared to 2.4. The corresponding number of rooms is, therefore, greater, with 6.85 and 6.3 for the full data and the sample respectively. Moreover, households in the full data have higher average income (22276) than the average of households in the small sample (21012). Interestingly, 30% of the sample’s households are using gas compared to about 20% in the full data. Since the differences are relatively small, we conclude that the problem of biased selectivity is not so serious. Details of data can be accessed from Appendix 2a

Join now!

   Firgure 1.Number of people residual plot            Firgure 1.Number of rooms residual plot

                                                                                         Firgure 1.Number of income residual plot

                                                    ...

This is a preview of the whole essay