# Estimation of regression and causal models with and without correction for measurement errors

Our example in the introduction about the factors that explain the level of satisfaction with democracy is based on the ESS Round 61 rotating module proposal [Kri10]. In this chapter, we are not going to use all the variables proposed, but only the variables mentioned in the introduction. These have been evaluated with respect to quality in Chapter 3 and we have corrected the correlation matrix for measurement errors in the previous chapter. We are going to start with the estimation of a simple regression model visualized in Figure 5.1.

Figure 5.1: A regression model to explain satisfaction with democracy

The model presented in this figure illustrates direct effects from all the explanatory and control variables on satisfaction with democracy. The correlations between the exogenous variables are taken into account in the analysis. The regression analysis for these variables can be done in different ways. By downloading the raw data, it could be done in SPSS and Stata or any other statistical package. However, it is also possible to run the regression analysis using the correlation matrices provided above. This can be done using different programs2 but, here, we will illustrate the use of the programs Stata and LISREL. Both programs can perform regression analysis using the raw data; however, as data for the analysis we prefer to use the correlation matrix from Table 4.2 (without corrections) and Table 4.7 (with corrections), as input for our analysis.

• [1] ESS Round 6: European Social Survey (2013): ESS-6 2012 Documentation Report. Edition 2.0. Bergen, European Social Survey Data Archive, Norwegian Social Science Data Services.
• [2] For example Structural Equation Modeling (SEM) programs like: EQS, R, LISREL, Stata, IBM SPSS Amos, etc.
Page 1

# Regression analysis without correction for measurement errors

Below, the regression analysis without correction for measurement errors is done using LISREL and Stata. The estimates obtained using both programs are very similar (except for the rounding off errors). Thus, as both programs provide very similar results, please select which program you want to continue with:

First, we ask LISREL1 to run the regression model without correction for measurement errors by using the data from the correlation matrix shown in Table 4.2 as input. The following LISREL syntax can be used to perform this analysis. For more detailed information about the LISREL procedure and notation, we refer to the LISREL 8 manual [Jör96] and LISREL introductory books [Sar84].

Syntax 5.1: The LISREL syntax for the estimation of the parameters of the regression model without correction for measurement errors

Regression analysis without correction for measurement error !Title

da ni=6 no=1468 ma=km !ni=number of variables no=number of observations ma=matrix

km !km=correlation matrix
1.00
.395 1.00
.268 .474 1.00
.310 .299 .271 1.00
.188 .070 .018 .094 1.00
.163 .227 .174 .064 .009 1.00

labels
satdem free critic equal lrplace inc !Labels of the variables
model ny=1 nx=5 !ny=number of dependent variable nx=number of independent variables
pd !To obtain a path diagram
out nd=3 !out=output nd=number of decimals

In Syntax 5.1, we can see that the correlation matrix and a model have to be specified. In this case, the model is a simple regression model. If this program is run, the result presented in Output 5.1 will be obtained.

Output 5.1: The LISREL output for the regression analysis without correction for measurement errors

Figure 5.2: The estimated standardized effects for the evaluation of democracy without correction for measurement errors

In Figure 5.2, we see that all standardized effects are relatively moderate, and that the variable freedom and fairness of elections (Free) has the largest effect. Second in size is the effect of the variable equality by law (Equal). The asterisks (*) indicate that, according to the analysis, all variables have a significant effect testing on the 5% level. It is also relevant to mention that, together, these variables explain only 22.7%2 of the variance in the dependent variable, which means that 77.3% of the variance is unexplained. This suggests that the model is rather incomplete or incorrect. It is also possible, however, that the explained variance is so small because all the variables contain measurement errors that have not been taken into account.

First, we ask Stata3 to run the regression model without correction for measurement errors by using as input the data from the correlation matrix shown in Table 4.2. The following Stata syntax can be used to perform this analysis. For more detailed information about Stata procedure and notation, we refer to Stata books [Aco13]. Appendix A is especially important if one wants to use the correlation or covariance matrix as the data to be analysed.

Syntax 5.1: The Stata syntax for the estimation of the parameters of the regression model without correction for measurement errors

*Regression analysis without correction for measurement errors

clear all
ssd init satdem free critic equal lrplace inc /*variables*/
ssd set observations 1468 /*observations*/

*Correlation matrix
#delimit ;
ssd set correlations
1.00\
.395 1.00\
.268 .474 1.00\
.310 .299 .271 1.00\
.188 .070 .018 .094 1.00\
.163 .227 .174 .064 .009 1.00;
#delimit cr
save ssdmatrix.dat, replace

*Regression model
clear
use ssdmatrix.dat /*use the correlation matrix input as data*/
ssd list
sem (satdem <- free critic equal lrplace inc), standardized /*model*/
estat eqgof /*Equation-level goodness of fit*/

In Syntax 5.1, we can see that the correlation matrix and the model have to be specified. In this case, the model is a simple regression model. If this program is run, the result presented in Output 5.1 will be obtained.

Output 5.1: The Stata output for the regression analysis without correction for measurement errors

Figure 5.2: The estimated standardized effects for the evaluation of democracy without correction for measurement errors

In Figure 5.2, we see that all standardized effects are relatively moderate, and that the variable freedom and fairness of elections (Free) has the largest effect. Second in size is the effect of the variable equality by law (Equal). The asterisks (*) indicate that, according to the analysis, all variables have a significant effect testing on the 5% level. It is also relevant to mention that, together, these variables explain only 22.7%4 of the variance in the dependent variable, which means that 77.3% of the variance is unexplained. This suggests that the model is rather incomplete or incorrect. It is also possible, however, that the explained variance is so small because all the variables contain measurement errors that have not been taken into account.

• [1] The following illustration and results are based on the LISREL 8.7 software version: Jöreskog, K.G. & Sörbom, D. (2004). LISREL 8.7 for Windows [Computer software]. Skokie, IL: Scientific Software International, Inc.
• [2] The explained variance can be obtained in LISREL from the section Squared Multiple Correlations for Structural Equations (R2).
• [3] The following illustration and results are based on the Stata 12 software version: StataCorp. 2011. Stata Statistical Software: Release 12. College Station, TX: StataCorp LP.
• [4] The explained variance can be obtained in Stata by the command estat eqgof R2.
Page 2

# Regression analysis with correction for measurement error

Now we will run the analysis of the same regression model, but this time with correction for measurement errors. To do so, we just need to use the covariance matrix from Table 4.7 as the data. The rest of the specification remains the same in both LISREL and Stata.

In the same way as before, since both programs provide very similar results, please select which program you want to use to continue the analysis, LISREL1 or Stata2. The syntax of the regression model analysis with correction for measurement errors is available for the statistical computing program, R3, in Syntax A2.1 of Appendix 2.

In the previous chapter, after correction for common method variance in the correlation matrix and putting the qualities on the diagonal of the correlation matrix, we obtained the corrected correlation matrix presented in Table 4.7. LISREL can be asked to analyse this matrix in the same way as we have done above. The input for the regression analysis used in LISREL is presented in Syntax 5.2.

Syntax 5.2: The LISREL syntax for the estimation of the parameters of the regression model with correction for measurement errors

Regression analysis with correction for measurement errors
da ni=6 no=1468 ma=km
km
.710 !The correlation matrix corrected for measurement errors
.395 .643
.268 .333 .604
.310 .160 .114 .605
.112 .070 .018 .094 .682
.163 .277 .174 .064 .009 .624
labels
satdem free critic equal lrplace inc
model ny=1 nx=5
pd
out nd=3

In Syntax 5.2, we can see that the input in LISREL is the same as was used in Syntax 5.1, but this time, since our data is a correlation matrix corrected for measurement error, the diagonal does not contain 1s everywhere. Because ma=km is indicated in the program, the program transforms this matrix into a correlation matrix with 1s on the diagonal. This is exactly the transformation needed to correct for random errors because all correlations are divided by the quality coefficients of the two variables for which the correlation is corrected. Through this process, the program creates the matrix that was presented in Table 4.8. It will be clear that this is far more efficient and less risky than making these corrections for random errors by hand.

In the next step, the program performs the estimation of the regression coefficients on the basis of this correlation matrix corrected for measurement errors. The result is presented in Output 5.2.

Output 5.2: The LISREL output of the regression analysis with correction for measurement errors

Figure 5.3: The estimated standardized effects for the evaluation of democracy with correction for measurement errors

Comparing the results with and without correction for measurement error (see Figure 5.2), we see that there is a huge increase, by almost a factor two, of the effect of the evaluation variables Free, Critic and Equal, and a decrease in the effects of the control variables LRplace and Inc. Moreover, the Income control variable is no longer significant4.

Besides, the explained variance in the dependent variable has increased from 22.7% to 46.6%5 through correction for measurement errors. This also means that we now know that the unexplained variance of 53.4% cannot be due to measurement error, so it must be due to omitted explanatory variables or non-linear relationships between these variables.

In the previous chapter, after correction for common method variance in the correlation matrix and putting the qualities on the diagonal of the correlation matrix, we obtained the corrected covariance matrix presented in Table 4.7. Stata can be asked to analyse this matrix in the same way as we have done above. The regression analysis notation used in Stata is presented in Syntax 5.2.

Syntax 5.2: The Stata syntax for the estimation of the parameters of the regression model with correction for measurement errors

*Regression analysis with correction for measurement errors
clear
ssd init satdem free critic equal lrplace inc
ssd set observations 1468

*Covariance matrix
#delimit ;
ssd set covariance /*The correlation matrix corrected for measurement errors*/
.710\
.395 .643\
.268 .333 .604\
.310 .160 .114 .605\
.112 .070 .018 .094 .682\
.163 .277 .174 .064 .009 .624;
#delimit cr
save ssdmatrix.dat, replace

*Regression analysis
clear
use ssdmatrix.dat
ssd list
sem (satdem <- free critic equal lrplace inc), standardized
estat eqgof /*Equation-level goodness of fit*/

In Syntax 5.2, we can see that the input in Stata is the same as we ran in Syntax 5.1, but this time, since our data is a correlation matrix corrected for measurement error, we will also obtain the regression estimates corrected for measurement errors (see Output 5.2).

Output 5.2: The Stata output of the regression analysis with correction for measurement errors

Figure 5.3: The estimated standardized effects for the evaluation of democracy with correction for measurement errors

Comparing the results with and without correction for measurement error (see Figure 5.2), we see that there is a huge increase, by almost a factor two, of the effect of the evaluation variables Free, Critic and Equal, and a decrease in the effects of the control variables LRplace and Inc. Moreover, the Income control variable is no longer significant6.

Besides, the explained variance in the dependent variable has increased from 22.7% to 46.6%7 through correction for measurement errors. This also means that we now know that the unexplained variance of 53.4% cannot be due to measurement error, so it must be due to omitted explanatory variables or non-linear relationships between these variables.

• [1] The following illustration and results are based on the LISREL 8.7 software version: Jöreskog, K.G. & Sörbom, D. (2004). LISREL 8.7 for Windows (Computer software). Skokie, IL: Scientific Software International, Inc.
• [2] The following illustration and results are based on the Stata 12 software version: StataCorp. 2011. Stata Statistical Software: Release 12. College Station, TX: StataCorp LP.
• [3] R: A Language and Environment for Statistical Computing, R Core Team, R Foundation for Statistical Computing. http://www.R-project.org.
• [4] Although most of the coefficients are significant, it has to be realized that the standard errors in this approach are underestimated for two reasons. The first reason is that the observed correlation is assumed to be a perfect estimate of the population correlation, and the second reason is that the quality estimates are assumed to be perfect measures of the quality of the questions (Obe11). In both cases, the uncertainty of the estimation is ignored. However, in most of the cases, this will not harm the conclusions because the effects are relative large or relative small, as it is the case in this example.
• [5] The explained variance can be obtained in LISREL from the section Squared Multiple Correlations for Structural Equations (R2).
• [6] Although most of the coefficients are significant, it has to be realized that the standard errors in this approach are underestimated for two reasons. The first reason is that the observed correlation is assumed to be a perfect estimate of the population correlation, and the second reason is that the quality estimates are assumed to be perfect measures of the quality of the questions (Obe11). In both cases, the uncertainty of the estimation is ignored. However, in most of the cases, this will not harm the conclusions because the effects are relative large or relative small, as it is the case in this example.
• [7] In Stata, the command estat eqgof will show you the value of R2 (The explained variance).
Page 3

# Estimation of causal model with correction for measurement errors

In the social sciences, causal models are often used in order to estimate direct and indirect effects. For example, in the previous section we found that Income does not have a significant direct effect on satisfaction with democracy when we correct for measurement errors. However, this result can lead to the wrong conclusion that this variable has no effect at all on satisfaction with democracy. It is reasonable to assume that this variable affects the variables Free, Critic and Equal and thereby has an indirect effect on satisfaction with democracy. In order to study this, we have to use a causal model. This model is presented in Figure 5.4.

Figure 5.4: The causal model for the evaluation of democracy

Because the regression analysis with correction for measurement errors indicated that Inc has no significant effect on Satdem, we have omitted this effect in this model. The correlations between the variables Free, Critic and Equal have been omitted in the figure to keep the picture simple, but these correlations have been introduced in the analyses. This type of model is often used in the social sciences. Therefore, we also want to show that it is very easy to estimate this type of model with correction for measurement errors.

Below, we will illustrate how to run the causal model specified above (Figure 5.4), using both LISREL1 and Stata2. As both programs produce very similar results, please select which program you want to continue the analysis with. The syntax of the causal model analysis with correction for measurement errors is available for the statistical computing program R3 in Syntax A2.2 of Appendix 2.

In Figure 5.5, all effects have been indicated using the symbols from LISREL. The betas (be) represent the effects of the explanatory (or endogenous) variables (i.e. Free, Critic and Equal) on satisfaction with democracy (i.e. Satdem). For example, be(1,2) indicates the effect of the variable freedom and fairness of elections on satisfaction with democracy. Similarly, the gammas (ga) represent the effects of the control (or exogenous) variables (i.e. LRplace and Inc) on the other variables in the model. For example, ga(1,1) indicates the effect of the control variable ‘left-right placement’ on satisfaction with democracy, while ga(3,2) indicates the effect of the other control variable, Income (i.e. Inc), on the variable equality by law (i.e. Equal). The effect of the variable Inc on Satdem is specified by a dashed line because it represents an effect that has been omitted because it was not significant in the analysis with correction for measurement errors, but we are going estimate this coefficient in the analysis without corrections.

Figure 5.5: The causal model for the evaluation of democracy with LISREL notation

We do not expect the control variables to completely explain the correlations that exist between the evaluation of democracy questions. We can therefore expect correlations between the disturbance terms of these variables (ς22, ς33 and ς44). These correlations are not indicated in the model but are denoted in LISREL by ps(3,2), ps(4,2) and ps(4,3), while the variances of the disturbances are denoted as ps(1,1), ps(2,2), ps(3,3) and ps(4,4). For details of the procedure, we refer to the LISREL manual [Jör96] and introductions to the program LISREL [Sar84]. First, the LISREL input for this analysis without corrections is presented in Syntax 5.3. We then present the same input corrected for measurement errors (see Syntax 5.4).

Syntax 5.3: The LISREL syntax for the estimation of the parameters of the causal model without correction for measurement errors*

Causal analysis without corrections for measurement errors !Title
data ni=6 no=1468 ma=km !ni=number of variables no=number of observations ma=matrix
km !km=correlation matrix
1.00
.3951.00
.268 .474 1.00
.310 .299 .271 1.00
.188 .070 .018 .094 1.00
.163 .227 .174 .064 .009 1.00
labels
satdem free critic equal lrplace inc !Labels of the variables
model ny=4 nx=2 be=fu,fi ga=fu,fi ps=sy,fi !Causal model ny=dependent variables nx=control variables
free be(1,2) be(1,3) be (1,4) !free=coefficients to be estimated
free ga(2,1) ga(2,2) ga(3,1) ga(3,2) ga(4,1) ga(4,2) ga(1,1)
free ps(1,1) ps(2,2) ps(3,3) ps(4,4) ps(3,2) ps(4,2) ps(4,3)
pd !To obtain a path diagram
out nd=3 !out= output nd=number of decimals

*Note that the effect of Income on Satdem (ga(1,2)) has not been introduced in this syntax without correction for measurement errors.

Syntax 5.4: The LISREL syntax for the estimation of the parameters of the causal model with correction for measurement errors

Causal analysis with correction for measurement errors
data ni=6 no=1468 ma=km
km
.710 !The correlation matrix corrected for measurement errors
.395 .643
.268 .333 .604
.310 .160 .114 .605
.112 .070 .018 .094 .682
.163 .227 .174 .064 .009 .624
labels
satdem free critic equal lrplace inc
model ny=4 nx=2 be=fu,fi ga=fu,fi ps=sy,fi
free be(1,2) be(1,3) be (1,4)
free ga(2,1) ga(2,2) ga(3,1) ga(3,2) ga(4,1) ga(4,2) ga(1,1)
free ps(1,1) ps(2,2) ps(3,3) ps(4,4) ps(3,2) ps(4,2) ps(4,3)
pd
out nd=3

The most important point is that the coefficients that have to be estimated are presented in the lines starting with ‘free’. Comparing these two inputs, we see that only the matrix with the data to be analysed has been changed. Focusing on the input for the model with correction for measurement errors, the effects will be estimated on the basis of the covariance matrix in Table 4.7 (i.e. the matrix with the correlations corrected for cmv and with the qualities on the diagonal). Because we ask in the data line that the matrix to be analysed (ma) should be the correlation matrix (km), Table 4.7 is transformed by the program into the correlation matrix corrected for measurement errors, Table 4.8.

The nice feature of this approach, correcting the correlations for measurement errors before estimating the effects, is that the input for the analysis is exactly the same with and without correction for measurement errors, except for the matrix of correlations that is used in the analysis. This point is illustrated in the input for the analyses with and without correction for measurement errors presented in Syntaxes 5.3 and 5.4.

It is important in the estimation of causal models to test whether the model fits to the data, i.e. that the model is not misspecified. Without going into detail, see [Sar09], we can say that the model fits very well to the data corrected for measurement errors. So there is no reason to change the model.

However, analysing the matrix without correction for measurement errors, the program indicates that the fit of the model is not good. This suggests that the effect, ga(1,2), of the control variable Income (Inc) on satisfaction with democracy has to be introduced in the model. If we do so, this model also fits well to the data and we get the standardized results presented in Table 5.1.

Table 5.1: The LISREL results of the estimation of the causal model presented in Figure 5.4 with and without corrections

In Syntaxes 5.3 and 5.4, we can observe that all standardized effects have been indicated using the Stata notation [Aco13]. We can expect variances between the disturbance terms of the evaluation of democracy questions, which are denoted in Stata as e.free*e.critic, e.free*e.equal and e.critic*e.equal. Comparing the two inputs, we see that only the matrix with the data to be analysed has been changed. Focusing on the input for the model with correction for measurement errors, the effects will be estimated on the basis of the covariance matrix in Table 4.7 (i.e. the matrix with the correlations corrected for common method variance and with the qualities on the diagonal).

Syntax 5.3: The Stata syntax for the estimation of the parameters of the causal model without correction for measurement errors*

*Causal analysis without correction for measurement errors
clear all
ssd init satdem free critic equal lrplace inc /*variables*/
ssd set observations /*observations*/

*Correlation matrix
#delimit ;
ssd set correlations
1.00\
.395 1.00\
.268 .474 1.00\
.310 .299 .271 1.00\
.188 .070 .018 .094 1.00\
.163 .227 .174 .064 .009 1.00;
#delimit cr
save ssdmatrix.dat, replace

*Causal model
clear
use ssdmatrix.dat
ssd list
sem (satdem <- free critic equal lrplace) ///
(lrplace inc -> free critic equal), ///
standardized
estat eqgof /*Equation-level goodness of fit*/

*Note that the effect of Income on Satdem has not been introduced in this syntax without correction for measurement errors.

Syntax 5.4: The Stata syntax for the estimation of the parameters of the causal model with correction for measurement errors

*Causal model with correction for measurement errors
clear all
ssd init satdem free critic equal lrplace inc
ssd set observations 1468
*Covariance matrix
#delimit ;
ssd set covariance /*The correlation matrix corrected for measurement errors */
.710\
.395 .643\
.268 .333 .604\
.310 .160 .114 .605\
.112 .070 .018 .094 .682\
.163 .227 .174 .064 .009 .624;
#delimit cr
save ssdmatrix.dat, replace
*Causal model
clear
use ssdmatrix.dat
ssd list
sem (satdem <- free critic equal lrplace) ///
(lrplace inc -> free critic equal), ///
covariance(e.free*e.equal e.free*e.critic e.critic*e.equal)
sem, standardized
estat eqgof /*Equation-level goodness of fit*/

The nice feature of this approach, correcting the correlations for measurement errors before estimating the effects, is that the input for the analysis is exactly the same with and without correction for measurement errors, except for the matrix of correlations that is used in the analysis. This point is illustrated in the input for the analyses with and without correction for measurement errors presented in Syntaxes 5.3 and 5.4.

However, analysing the matrix without correction for measurement errors, the program indicates that the fit of the model is not good. This suggests that the effect of the control variable income (Inc) on satisfaction with democracy has to be introduced in the model. If we do so, this model also fits well to the data and we get the standardized results presented in Table 5.1.

Table 5.1: The Stata results of the estimation of the causal model presented in Figure 5.4 with and without corrections

If we compare the results with and without correction for measurement error, we see first of all that the model is different. After correction for measurement errors, the standardized effects of the control variable (Inc) on satisfaction with democracy are not significantly different from zero, while, without correction for measurement errors, this standardized effect is necessary in order to achieve a good fit of the model. In the latter case, we say that this variable has a direct effect on satisfaction with democracy, while in the former analysis, we have to conclude that there is no direct effect, only an indirect effect.

Figure 5.6: The standardized coefficients of the causal model for the evaluation of democracy without correction for measurement errors

Figure 5.7: The standardized coefficients of the causal model for the evaluation of democracy with correction for measurement errors

Furthermore, comparing Figures 5.6 and 5.7, we see that nearly all other standardized effects after correction for measurement error are greater than without correction for errors. The difference is in fact close to a factor of two in some cases. All significant effects are indicated in the figures by an asterisk (*). Using this approach we see that, in both analyses, the effect of left-right placement on freedom to criticise is not significant on the 5% significance level. This example again illustrates how different the results can be if one corrects for measurement errors. From the outputs, it can also be seen, as for the regression analysis, that, after correcting for measurement errors, the explained variance in the dependent variable increases from 22.7% to 46.6%.4

So far we have shown how correction for measurement errors can be done by first correcting the correlation matrix. In this way, we obtain the estimates of the standardized coefficients corrected for measurement errors.

In this chapter, we have shown that, after correction of the correlation matrix, the analyses with or without correction for measurement error are exactly the same. Thus, we have shown that analysis with correction for measurement errors is not a problem anymore. We have also seen that the correction for measurement errors should be performed because the results are quite different.

## Exercise 5.1

Repeat the estimation of the regression model for the variables introduced in exercise 3.1 as illustrated in the figure below.

Regression model for attitudes towards immigration

In exercise 4.3, we have obtained the results of the correlation matrices with and without correction for measurement errors. These are reproduced in the following tables.

1) Correlation matrix (n = 1801):

2) Correlation matrix with correction for measurement errors:

Note the large difference between the two matrices. We also expect big differences in the effects. Use this information to compute either in LISREL or Stata the results of the analysis for the explanation of the opinion about immigration by people from outside Europe to the Netherlands, with and without correction for measurement errors.

1. LISREL syntax for the regression analysis without corrections for measurement errors*:
Regression analysis without correction for measurement errors
da ni=4 no=1801 ma=km
km
1.000
-0.351 1.000
-0.402 0.534 1.000
-0.340 0.557 0.530 1.000
labels
impcntr imwbcnt imbgeco imueclt
model ny=1 nx=3
pd
out nd=3
2. LISREL syntax for the regression analysis with corrections for measurement errors*:
Regression analysis with correction for measurement errors
da ni=4 no=1801 ma=km
km
0.763
-0.351 0.639
-0.402 0.440 0.702
-0.340 0.447 0.413 0.641
labels
impcntr imwbcnt imbgeco imueclt
model ny=1 nx=3
pd
out nd=3

*Note that the only difference between the two inputs is the matrix used as input.

1. Stata syntax for the regression analysis without corrections for measurement errors*:
*Regression analysis without correction for measurement errors
clear all
ssd init impcntr imwbcnt imbgeco imueclt
ssd set observations 1801
*Correlation matrix
#delimit ;
ssd set correlations
1.000\
-0.351 1.000\
-0.402 0.534 1.000\
-0.340 0.557 0.530 1.000;
#delimit cr
save ssdmatrix.dat, replace
*Regression model
clear
use ssdmatrix.dat
ssd list
sem (impcntr <- imwbcnt imbgeco imueclt), standardized
estat eqgof
2. Stata syntax for the regression analysis with correction for measurement errors*:
*Regression analysis with correction for measurement errors
clear all
ssd init impcntr imwbcnt imbgeco imueclt
ssd set observations 1801
*Covariance matrix
#delimit ;
ssd set covariances
0.763\
-0.351 0.639\
-0.402 0.440 0.702\
-0.340 0.447 0.413 0.641;
#delimit cr
save ssdmatrix.dat, replace
*Regression model
clear
use ssdmatrix.dat
ssd list
sem (impcntr <- imwbcnt imbgeco imueclt), standardized
estat eqgof

*Note that the only difference between the two inputs is the matrix used as input.

From the results obtained we can observe that the correlations between all variables have increased considerably after correcting the correlation matrix for measurement errors. The corrected correlations are presented in the following table:

The corrected correlations
Allow Better Economy Culture
(B37) Allow 1.000
(B40) Better -0.503 1.000
(B38) Economy -0.549 0.657 1.000
(B39) Culture -0.486 0.698 0.616 1.000

The estimated standardized effects for the regression model without corrections for measurement errors

The estimated standardized effects for the regression model with corrections for measurement errors

Although we expected large differences in the effect estimates with and without correction for measurements error because of the changes in the correlations, the results presented in the figures for the regression model without (the figure above) and with (the figure below) corrections for measurement errors indicate that all effects have increased slightly after correcting for measurement errors. The reason of this small change is that not only the correlations between Allow and the explanatory variables increase considerably after correction for measurement errors (see table with the corrected correlations above) but also the correlations between explanatory variables did so. Because the effects on the variable Allow are direct effects, part of the effect of each of these variables on Allow is taken away by the similar increase of the effects of the other explanatory variables and the correlations between them. This is illustrated by the change in the explained variance (R2) in the model. Without corrections, R2 is quite low (19.7%), while with corrections it increases to 34.9%. This means that these variables together explain much more. However, because of the increase in the correlations between the explanatory variables, the direct effects cannot increase too much. If there was only one explanatory variable instead of three, the effect would have been much larger (see Chapter 7).

## Exercise 5.2:

Taking into account the results obtained in exercise 5.1, repeat the estimation of the causal model presented in this chapter for the variables introduced in exercise 3.1. The correlation matrices with and without corrections obtained in exercise 4.3 were reproduced in the previous exercise. Use this information to compute either in LISREL or Stata the results for the analysis for the causal explanation of the opinion about immigration by people from outside Europe to the Netherlands with and without correction for measurement errors, illustrated in the next model.

Causal model for attitudes towards immigration

1. LISREL syntax for the regression analysis without corrections for measurement errors:
Causal analysis without correction for measurement errors
da ni=4 no=1801 ma=km
km
1.000
-0.351 1.000
-0.402 0.534 1.000
-0.340 0.557 0.530 1.000
labels
impcntr imwbcnt imbgeco imueclt
model ny=2 nx=2 be=fu,fi ga=fu,fi ps=sy,fi
free be(1,2)
free ga(1,1) ga(1,2) ga(2,1) ga(2,2)
free ps(1,1) ps(2,2)
pd
out nd=3
2. LISREL syntax for the regression analysis with corrections for measurement errors:
Causal analysis with correction for measurement errors
da ni=4 no=1801 ma=km
km
0.763
-0.351 0.639
-0.402 0.440 0.702
-0.340 0.447 0.413 0.641
labels
impcntr imwbcnt imbgeco imueclt
model ny=2 nx=2 be=fu,fi ga=fu,fi ps=sy,fi
free be(1,2)
free ga(1,1) ga(1,2) ga(2,1) ga(2,2)
free ps(1,1) ps(2,2)
pd
out nd=3

1. Stata syntax for the regression analysis without corrections for measurement errors:
*Causal analysis without correction for measurement errors
clear all
ssd init impcntr imwbcnt imbgeco imueclt
ssd set observations 1801
*Correlation matrix
#delimit ;
ssd set correlation
1.000\
-0.351 1.000\
-0.402 0.534 1.000\
-0.340 0.557 0.530 1.000;
#delimit cr
save ssdmatrix.dat, replace
*Causal model
clear
use ssdmatrix.dat
ssd list
sem (impcntr <- imwbcnt imbgeco imueclt) ///
(imbgeco imueclt -> imwbcnt), ///
standardized
estat eqgof
2. Stata syntax for the regression analysis with corrections for measurement errors:
*Causal analysis with correction for measurement errors
clear all
ssd init impcntr imwbcnt imbgeco imueclt
ssd set observations 1801
*Covariance matrix
#delimit ;
ssd set covariances
0.763\
-0.351 0.639\
-0.402 0.440 0.702\
-0.340 0.447 0.413 0.641;
#delimit cr
save ssdmatrix.dat, replace
*Causal model
clear
use ssdmatrix.dat
ssd list
sem (impcntr <- imwbcnt imbgeco imueclt) ///
(imbgeco imueclt -> imwbcnt), ///
standardized
estat eqgof

The estimated standardized effects of the causal model without corrections for measurement errors

The estimated standardized effects of the regression model with corrections for measurement errors

The top figure presents the results for the causal model without corrections. The lower figure shows the results for the causal model with corrections. Again, we see from these models that all effects are significant. Comparing these two models, we again see, on the one side, a small increase in the effects after correcting for measurement errors and, on the other side, a considerable increase in the variance explained. For the causal model without corrections, the variance in Allow that is explained is still very low, 19.7%. For the causal model with corrections, the variance in Allow that is explained increases to 34.9%. The reason is the same as in the case of the regression model: the increase in the correlations between the explanatory variables.

• [1] The following illustration and results are based on the LISREL 8.7 software version: Jöreskog, K.G. & Sörbom, D. (2004). LISREL 8.7 for Windows [Computer software]. Skokie, IL: Scientific Software International, Inc.
• [2] The following illustration and results are based on the Stata 12 software version: StataCorp. 2011. Stata Statistical Software: Release 12. College Station, TX: StataCorp LP.
• [3] R: A Language and Environment for Statistical Computing, R Core Team, R Foundation for Statistical Computing.
• [4] The explained variance can be obtained in LISREL from the section Squared Multiple Correlations for Structural Equations (R2). Similarly, in Stata the command estat eqgof will show you the value of the R2.
Page 4
• [Aco13] Acock, A. C. (2013). Discovering Structural Equation Modeling Using Stata, Revised Edition. Stata press.
• [Jör96] Jöreskog, K. G. and Sörbom, D. (1996). LISREL 8 User’s Reference Guide. Scientific Software International.
• [Kri10] Kriesi, H., Molino, L., Magalhaes, P., Alonso, S. and Ferrin, M. (2010).Europeans’ understanding and evaluation of democracy. ESS Round 6 democracy module proposal.
• [Sar09] Saris, W. E., Satorra, A. and Van der Veld, W. M. (2009). Testing Structural Equation Models or Detection of Misspecifications? Structural Equation Modeling: A Multidisciplinary Journal, 16 (4), 561-582.
• [Sar84] Saris, W. E. and Stronkhorst, L. H. (1984). Causal modelling in nonexperimental research: an introduction to the LISREL approach. Sociometric Research Foundation.