Structural Equation Modeling: what is it and what can we use it for? (part 1 of 6)

National Centre for Research Methods (NCRM)
14 Jul 201625:31

Summary

TLDRStructural Equation Modeling (SEM) is a comprehensive framework integrating multivariate techniques from various disciplines. It is particularly useful for analyzing complex constructs and relationships, including indirect effects. SEM uses latent variables and path analysis to model causal systems and correct for measurement errors, with software like AMOS and LISREL available for implementation.

Takeaways

  • 🔍 Structural Equation Modeling (SEM) is a comprehensive framework that integrates various multivariate techniques from disciplines such as psychology, statistics, and econometrics.
  • 📊 SEM is not a single technique but a dynamic modeling environment that evolves to incorporate new ways of fitting models over time.
  • 🎯 SEM is particularly suitable for addressing complex research questions involving multifaceted constructs, such as psychological concepts, and for modeling causal systems with multiple outcomes or dependent variables.
  • 🔍 SEM is adept at handling indirect or mediated effects, where the effect of one variable on another is transmitted through a third variable.
  • 📚 SEM is known by various names, including covariance structure analysis, analysis of moment structures, and LISREL models, reflecting its diverse origins and applications.
  • 💻 There are numerous software packages available for SEM, each with its own advantages and disadvantages, and some offer free versions for students to explore.
  • 🛣 SEM can be thought of as path analysis using latent variables, which are underlying constructs that influence attitudes and behaviors but are not directly observable.
  • 📝 Latent variables are measured using observable indicators, such as questionnaire items, which are believed to be caused by the underlying latent constructs.
  • 🔧 SEM employs a true score equation, where the observed variable is comprised of a true score and error, and the goal is to isolate the true score while removing the error variance.
  • 📈 The use of multiple indicators for latent constructs in SEM allows for the correction of measurement errors and provides a better estimate of the true score.
  • 🔗 Path analysis, a key component of SEM, visually represents models diagrammatically, emphasizing direct, indirect, and total effects in the relationships between variables.

Q & A

  • What is the primary distinction of Structural Equation Modeling (SEM) compared to other statistical techniques?

    -SEM is not a single technique but a general modeling framework that integrates various multivariate techniques into one environment, drawing on disciplines like psychology, statistics, epidemiology, and econometrics.

  • How does SEM differ from traditional statistical methods in terms of the research questions it can address?

    -SEM is particularly suitable for addressing complex research questions involving multifaceted constructs, systems of relationships, causal systems, and indirect or mediated effects, which might be more challenging with traditional statistical methods.

  • What are some alternative names for SEM found in literature, and why might they be used?

    -SEM is also known as covariance structure analysis models, analysis of moment structures, and LISREL models, named after the first software for fitting SEMs. The term causal modeling is sometimes used but is controversial as causality comes from research design, not the statistical model.

  • What are some of the software packages available for conducting SEM analysis?

    -Several software packages are available for SEM, including LISREL, Mplus, EQS, AMOS, CALIS, and R. Each package has its advantages and disadvantages, and some offer free or limited versions for students.

  • How is SEM defined in terms of path analysis and latent variables?

    -SEM can be thought of as path analysis using latent variables, which are hypothetical or not directly observable constructs that are measured using observable indicators.

  • Why are latent variables important in social science research, and how are they measured?

    -Latent variables are important because many social science concepts, like intelligence or trust, are not directly observable. They are measured using observable indicators, such as questionnaire items, which are believed to be caused by the underlying latent constructs.

  • What is the significance of the true score equation X = t + e in SEM?

    -The true score equation represents the idea that the observed variable X is comprised of a true score (t), which is the individual's actual level on the measured construct, and error (e), which includes random and systematic error variance.

  • Why is it necessary to have multiple indicators for latent constructs in SEM?

    -Having multiple indicators is necessary to over-identify the true score equation, allowing for the estimation of true scores and error variance for each indicator, and providing a better measure of the concept by correcting for measurement error.

  • How does the use of multiple indicators benefit the measurement of latent constructs in SEM?

    -Multiple indicators help to capture the complexity of the construct, reduce random error in the measurement, and provide more precise and accurate estimates of the construct, leading to better model estimates and less bias in effect sizes.

  • What is the role of path analysis in SEM, and how does it differ from traditional regression analysis?

    -Path analysis in SEM is used to represent the model diagrammatically, focusing on both direct and indirect effects in the relationships between variables. It differs from traditional regression analysis by providing a visual representation and allowing for the examination of complex causal pathways.

  • How can indirect effects be identified and calculated in SEM?

    -Indirect effects can be identified through path diagrams that show the pathways between variables. They are calculated by multiplying the direct effects along the pathway (e.g., beta2 * beta3 in the example given), and the total effect is the sum of direct and indirect effects.

Outlines

00:00

🧠 Structural Equation Modeling (SEM) Overview

Structural Equation Modeling (SEM) is introduced as a comprehensive framework that integrates various multivariate techniques from disciplines like psychology, statistics, and econometrics. It's not a single technique but a dynamic environment that evolves with new modeling methods. SEM is particularly suitable for complex research questions involving psychological or social constructs that are difficult to measure directly. It allows for the correction of measurement errors and is adept at modeling systems of relationships, causal systems, and indirect or mediated effects.

05:07

🔍 Research Questions and SEM's Versatility

SEM is highlighted as a versatile tool for addressing a wide range of research questions, especially those involving complex constructs and systems of relationships. It's noted for its ability to handle multiple outcomes and dependent variables in a more intricate system, making it ideal for modeling causal systems. SEM is also recognized for its utility in analyzing indirect or mediated effects, where the relationship between variables is not direct but influenced through other variables.

10:07

📚 SEM's Multiple Names and Software Options

The script discusses the various names by which SEM is known in literature, such as covariance structure analysis models and analysis of moment structures, which can be confusing. It also mentions the historical association of SEM with causal modeling, although this is considered controversial since causal inference relies more on research design than on statistical models. The paragraph lists several software packages available for fitting SEMs, including LISREL, Mplus, EQS, AMOS, and R, noting that each has its own advantages and disadvantages.

15:12

🔑 Understanding Latent Variables in SEM

This section delves into the concept of latent variables, which are hypothetical or not directly observable, such as intelligence or trust. The challenge of measuring these variables is addressed, and the use of observable indicators, like questionnaire items, to infer the latent constructs is explained. The true score equation, X = t + e, is introduced to represent the observed variable (X), the true score (t), and the error (e), emphasizing the need to isolate the true score and remove error variance for accurate modeling.

20:14

📈 Path Diagrams and Latent Variable Models

The importance of path diagrams in representing SEM is underscored, as they visually depict the relationships between variables, which is particularly appealing for those less comfortable with equations. The paragraph explains the notation used in path diagrams, including the representation of latent variables, observed variables, error terms, and the directional paths indicating causality. The benefits of using multiple indicators for latent constructs are discussed, such as better coverage of complex concepts and the reduction of random error, leading to more precise measurements and accurate effect sizes.

🛤️ Path Analysis in Structural Equation Modeling

Path analysis is described as a key feature of SEM, emphasizing its visual representation of models through diagrams and its focus on both direct and indirect effects. The standardized notation for path analysis is outlined, with examples provided to illustrate how path diagrams can represent simple and complex models. The ability to decompose regression coefficients into direct, indirect, and total effects is highlighted, showcasing the depth of analysis possible with SEM and path analysis.

Mindmap

Keywords

💡Structural Equation Modelling (SEM)

Structural Equation Modelling, often abbreviated as SEM, is a statistical technique that combines multiple multivariate methods into a single framework. It is used to analyze complex relationships between variables, including latent variables, which are not directly observable. In the video, SEM is described as a dynamic and integrative approach that is suitable for addressing research questions involving complex constructs and causal systems, such as psychological and social concepts.

💡Latent Variables

Latent variables are hypothetical constructs that are not directly observable but are inferred from observable indicators. They are used in SEM to represent underlying concepts such as intelligence or happiness that influence attitudes and behaviors. The video script explains that latent variables are central to SEM because they allow researchers to account for unobserved factors that contribute to the variance in observed data.

💡Path Analysis

Path analysis is a graphical method for representing and analyzing relationships between variables. It is a key component of SEM, allowing researchers to visualize and quantify direct, indirect, and total effects within a model. The script uses path analysis to illustrate how SEM can model complex causal systems and mediated effects, such as how one variable may influence another, which in turn affects a third variable.

💡Measurement Error

Measurement error refers to the inaccuracies in the observed data due to factors other than the true underlying construct being measured. SEM is highlighted in the video as particularly useful for correcting measurement error, which is common in social science research where concepts like happiness or trust are difficult to measure directly.

💡Covariance Structure Analysis

Covariance structure analysis is another term for SEM, emphasizing the analysis of covariance matrices rather than individual variables. The script mentions this term to illustrate the statistical foundation of SEM, which involves examining the relationships between variables in terms of their covariances.

💡Factor Analysis

Factor analysis is a method used to identify underlying factors or constructs that explain the correlations among a set of variables. In the context of SEM, factor analysis helps in reducing a large set of observable indicators to a smaller set of latent variables, which are easier to interpret and manage in the model.

💡Regression Modeling

Regression modeling is a statistical technique used to examine the relationship between one or more independent variables and a dependent variable. The video script notes that SEM incorporates regression modeling to analyze the effects of independent variables on dependent variables within a system of equations.

💡Direct and Indirect Effects

Direct effects are the immediate influence one variable has on another, while indirect effects occur when a variable influences another through a mediator. The script explains that SEM is well-suited for analyzing both direct and indirect effects, which is important for understanding complex causal relationships in research.

💡Software Packages

The script mentions various software packages used for fitting SEMs, such as LISREL, AMOS, and Mplus. These packages provide tools for researchers to implement SEM and analyze data according to the model's requirements, highlighting the practical application of SEM in research.

💡Causal Modeling

Causal modeling is a term sometimes used to describe SEM, although the script notes it can be controversial. It refers to the use of SEM to infer causal relationships between variables. The video emphasizes that while SEM can be used in causal research, the ability to make causal inferences depends more on the research design than the statistical technique itself.

Highlights

Structural Equation Modeling (SEM) is a general modeling framework that integrates various multivariate techniques.

SEM draws on disciplines like measurement theory, factor analysis, path analysis, regression modeling, and simultaneous equations.

SEM is dynamic, often integrating new ways of fitting models as the technique develops over time.

SEM is particularly suitable for complex, multifaceted constructs often related to psychological or social concepts.

SEM can correct for measurement errors, which is beneficial for concepts that are difficult to measure directly.

SEM is well-suited for modeling systems of relationships and causal systems with numerous outcomes or dependent variables.

SEM is often used to address indirect or mediated effects in research questions.

SEMs are known by various names such as covariance structure analysis models and analysis of moment structures.

The term 'causal modeling' for SEMs is controversial as causal inference claims come from research design, not the statistical model.

Multiple software packages are available for fitting SEMs, each with its advantages and disadvantages.

SEM can be thought of as path analysis using latent variables.

Latent variables are hypothetical or not directly observable, such as intelligence or trust.

Observable indicators, like questionnaire items, are used to measure latent variables.

The true score equation represents the relationship between observed variables, true scores, and error.

Path diagrams are key to representing SEM, showing the causal relationships between variables.

Having multiple indicators of latent constructs allows for the estimation of true scores and error variance.

Latent variable models provide benefits such as better coverage of complex concepts and reduction of random error.

Path analysis is visually represented and focuses on direct, indirect, and total effects in the model.

Standardized notation in path analysis includes specific symbols for latent variables, observed variables, and error terms.

Path diagrams can represent complex relationships and decompose effects into direct, indirect, and total components.

Transcripts

play00:02

What is structural equation modelling? Well I think one of the first useful

play00:07

things to understand about SEM as I'll refer to it is it isn't a single

play00:12

technique as such we wouldn't want to compare its to say learning ordinary

play00:20

least squares regression or logistic regression log linear modeling which

play00:25

although these techniques have a number of different aspects we can think of

play00:29

them as if you like

play00:31

single approaches to address research questions I think SEM is much better

play00:38

thought of as a general modeling framework that integrates a number of

play00:44

different multivariate techniques into this overall framework it is a framework

play00:53

which draws on a number of different disciplines it brings together

play00:57

measurement theory from psychology factor analysis also from psychology and

play01:02

statistics, path analysis from epidemiology in biology regression

play01:08

modeling from statistics and simultaneous equations from econometrics

play01:13

and all these different techniques come together to form structural equation

play01:19

modeling as a general modeling environment. And it's also an environment

play01:25

which is somewhat dynamic it is not set in stone at this point in time it is

play01:31

actually often integrating new ways of fitting models as the technique

play01:39

develop over time. |

play01:42

What sort of research questions would SEM be particularly suitable for addressing?. Well I think

play01:49

it's being a general model fitting environment it can address many

play01:55

different kinds of research questions but i think it is particularly suitable in

play01:59

situations where the key constructs the key concepts that a researcher is interested

play02:06

in are complex and multifaceted often relating to psychological social

play02:13

psychological

play02:14

concepts. So these kinds of concept can be quite difficult to measure and are

play02:22

often measured with error and one of the useful aspects of SEM as we'll see is

play02:29

its ability to make corrections for errors of measurement. Other kinds of

play02:35

research questions that SEM is well suited to are ones which specify systems

play02:40

of relationships rather than as we may be used to if we're fitting regression

play02:45

models where we have a single dependent variable and a set of predictors or

play02:51

independent variables, structural equation models may have numerous

play02:56

different outcomes or dependent variables each of which is affecting

play03:00

other dependent variables in a more complex system. So if a researcher is

play03:07

interested in modeling a causal system then structural equation models are

play03:14

particularly suitable. Another kind of research question that structural

play03:20

equation models are often used to address is where the researcher is

play03:24

interested in indirect or mediated effects so in many research questions

play03:32

we're interested in the effect of variable X on variable Y that would be

play03:38

thought of as the direct effect of X on Y but in many research contexts we're

play03:44

interested in more complex kinds of relationships where the first variable X

play03:50

perhaps influences a second variables Z which then has a second effect on

play03:55

Y that would be seen as an indirect effects and SEMs are very well suited

play04:02

to addressing those kinds of mediated research questions. | Now SEMs are known

play04:13

by a number of different names in the existing literature and this can be

play04:17

somewhat confusing. Sometimes they are referred to as covariance structure

play04:24

analysis models this relates to the

play04:27

the fact that with SEMs we're actually analyzing covariance matrices not

play04:34

variables directly. We will come on to that in later films. They're also known as

play04:41

analysis of moment structures this is what gives the software the SEM

play04:46

software Amos its name because this is in recognition of the fact that more

play04:53

modern SEMs analyzed not just covariances but also means so higher order

play04:59

moments. It's also know sometimes that LISREL model which again takes its name

play05:06

from possibly the most well-known software certainly the first software for

play05:11

fitting SEMs which is LISREL. More controversially SEMs have been

play05:17

referred to as causal modeling and they're often certainly have

play05:21

historically been associated with analysis which get a cause and effects

play05:27

but I think that is probably more controversial name to give to any

play05:33

modeling technique because the claims for causal inference will come from the

play05:39

research design rather than the statistical model that we apply to

play05:43

analyze the data. | There are many different software packages that are

play05:48

available for fitting SEMs and this is a list that's changing and growing all the

play05:53

time as I mentioned the probably best known is LISREL which was developed by

play06:00

Joreskog and Sorbom one of the first available packages. Now there are

play06:06

many more software packages available Mplus, EQS, AMOS,CALIS. R is a free package Stata and

play06:16

many of these packages have more limited versions that are available for free for

play06:23

students to download and try to see which one is most suitable I wouldn't

play06:27

want to make a recommendation for any particular software package each one has

play06:32

its own particular advantages and

play06:36

disadvantages . | So what is structural equation modeling? Well there are many

play06:41

possible answers to that question the one that i'm gonna propose in this film

play06:47

is that SEM can be thought of as path analysis using latent variables. Now this

play06:56

definition may not be very helpful to you if you are not very familiar with either

play07:01

path analysis or latent variables so for the remainder of the module I'm gonna

play07:07

run through what path analysis is and what latent variables are. | So what are

play07:15

latent variables well most of the concept that we're interested in social science

play07:20

are not directly observable things like intelligence social capital trust is

play07:27

very impossible to go and put some kind of meter into people and get a direct

play07:33

reading of their level of social capital or trust so this makes these concepts

play07:39

hypothetical or latent as we refer to them we believe that they are latent

play07:45

within people at some level and they they drive attitudes and behavior but we

play07:53

can't actually directly observe them. So we're in a bit of a difficult position

play07:58

if we can't measure these concepts that were interested in but fortunately we

play08:03

can use approaches which measure these latent variables using observable

play08:09

indicators using variables that we can measure directly that we believed to be

play08:15

caused by the underlying latent constructs. So if we think of a

play08:23

questionnaire items a question in a questionnaire that has been administered to a

play08:29

sample of people and this would be a good example of an observable indicator

play08:34

of a latent construct. So let's imagine that this question asked people how happy

play08:41

they are with their lives on a scale

play08:43

1 to 10. Now some people will give higher answers or lower answers there will be variability

play08:49

Variance in this variable across the individuals in the sample. Now we

play08:56

don't think that all of that variability is only to do with people's level of

play09:02

happiness some of it will be so some of the variability will be caused by

play09:07

variability in the true level of happiness across people but there will

play09:12

be other factors that also caused variability possibly to do with the

play09:17

questionnaire design the temperature in the room whether the question is it

play09:22

administered by an interviewer or completed on a computer.

play09:26

These are all other factors that we're not really interested in in what we're

play09:31

trying to measure which is happiness. So some of the variability will be to do with

play09:36

happiness, the latent constructs, but some of the variability will be due to

play09:41

other factors error and unique variance. | So we can summarize these ideas quite

play09:50

simply in this formula the true score equation where X =t + e . So

play09:59

here the measured variable the observed indicator is X and as I said the X

play10:06

the variability in X is comprised of both true score and of error. So the true

play10:15

score is simply where the individual is on a true happiness dimension their

play10:22

true underlying level of happiness. The error comprises two components the first

play10:30

is what we could think of a systematic error this is a bias where perhaps the

play10:35

question is phrased in a way which makes people give higher happiness ratings

play10:41

than their actual level of happiness maybe it's because it's

play10:46

question administered by an interviewer and they don't want to seem unhappy because that is

play10:51

socially undesirable this would be a systematic error.

play10:54

A random error would be one where you're just as likely to overrate as to

play11:01

underrate your happiness so we can think of the the systematic error as

play11:05

being one where the mean of the individual errors doesn't cancel out it

play11:11

doesn't equal 0 where as a random error you are as likely to give a higher as a lower

play11:16

score so the expectation would be that the means the mean of the rrror would

play11:21

cancel out and be zero. So this is all by way of saying that when we measure

play11:27

variable when we measure X ideally what we will be able to isolate would be the

play11:34

t part of the variance the true score and to remove the error variance when we're

play11:40

trying to predict t or use t as a predictor in a model. | So we can now translate this

play11:51

true score equation into a very simple path diagram which is key to

play11:58

representing structural equation models. So here we can see that the the X reads

play12:04

over to being the observed item in the rectangle the t reads over to being that

play12:11

latent variable the true score in the ellipse and the e reads over to being

play12:17

the circle at the top of the diagram the error and the arrows indicate that the

play12:24

observed score is caused by both the true score the latent variable and by

play12:31

other factors the error so we can encapsulate those ideas in this simple

play12:37

path diagram. | It would be nice if we could implement this as a statistical

play12:43

model unfortunately when we only have one indicator of the latent variable

play12:49

this is happiness then this equation is what we would call unidentified. We have

play12:56

more unknown pieces of quantities that we're trying to estimate the t and the e

play13:01

we don't know what they are we would like to estimate them then we have known

play13:05

pieces of information the X we've measured X in our sample we have two

play13:10

unknowns and one known so we can't solve that equation

play13:14

uniquely the equation is unidentified so we can't separate the true score from

play13:19

the error when we only have one measure of the underlying concept. What this then

play13:27

tells us is that we need to have multiple indicators of our latent

play13:31

constructs. When we have multiple indicators then we can start to over

play13:39

identify the true score equation and estimate the quantities of t and e for each

play13:45

indicator so we can apply many different kinds of latent variable

play13:52

models we can use principal components analysis, factor analysis, latent class models

play13:59

depending on the metrics of the observed indicators that we have in our dataset.

play14:05

But what these are all going to do is to provide us with a summary score a reduced

play14:14

set of factors or components relative to the full set of indicators that we start

play14:20

out with and in doing that they will correct for the error in each of the

play14:27

individual indicators and give us a better measure of the true score of the

play14:33

concept. | We can represent this simply here with a a common factor model. Here

play14:41

we have four measured variables let's think of these as questionnaire items

play14:45

again they might be measuring happiness different aspects of happiness are you

play14:50

happy at home with your work with your friends and so on. So we got four

play14:54

indicators of the same underlying latent variable happiness now because they

play15:00

measure the same thing we would generally expect these variables to be

play15:04

correlated in our

play15:06

population and that's what these double-headed arrows indicate. The curve

play15:11

double-headed arrows indicate that the Axis are all correlated with one another

play15:16

that's one way of representing what's going on here.

play15:19

Another way would be to do away with these correlations and add in the

play15:25

underlying latent variable someone's true level of happiness which we've here

play15:30

denoted as eta. In this model now we have happiness latent variable having

play15:38

a causal effect on each of the indicators and that causal effect is

play15:43

what we can think of is that the true score or the t part in our X =t + e

play15:48

equations now if that's the case then we also need to have error

play15:53

terms for each of these equations here that's what we have shown in the diagram there so

play15:59

| So with these multiple indicators we can apply a latent variable in this case

play16:04

of factor model and we can get empirical estimates of these key quantities and here

play16:12

now the lambda coefficients there in this model will refer to as factor

play16:18

loadings and these are the correlation between the factor the e and each of the X

play16:24

variables. Now if these are good if these indicators are good indicators of

play16:30

happiness we would expect these correlations to be high we would expect

play16:33

the correlation between a good indicator of the latent construct and the latent

play16:38

constructs to be close to approaching 1. | So if we are able to measure our

play16:49

constructs with multiple indicators we can apply latent variable models and

play16:56

this brings a number of benefits

play16:57

well firstly the kinds of things that we're interested in ,modeling in social

play17:03

science, are generally complex and multifaceted. If we think of happiness

play17:08

for example, it's difficult to come up with a single question which covers all

play17:13

aspects of a person's

play17:15

individual well-being so we probably need to have multiple indicators to get

play17:20

a good coverage of the concept. As I mentioned it also enables us to remove

play17:27

or least reduce random error in the construct that we are measuring this I think

play17:34

we can convince ourselves that removing error seems to be a good thing to do but

play17:39

more formally we can demonstrate that if we have random error in the dependent

play17:44

variable although it leaves the the estimates in a model unbiased these will

play17:50

be less precisely measured there'll be a noisier measure with wider confidence

play17:55

intervals . More seriously perhaps if we have random error in independent variables

play18:01

then regression coefficients that we estimate using those independent

play18:07

variables will be attenuated they will be smaller than they are in the

play18:11

population systematically smaller tending toward zero so we will

play18:16

underestimate effect sizes and we will falsely fail to reject the null hypothesis

play18:28

| So what is

play18:29

path analysis well again there are many ways that we can answer this question but I

play18:35

think a key feature of path analysis and one that makes it very appealing as part

play18:42

of structural equation modeling for social scientists is that the model that

play18:47

you're wanting to fit to the data is represented diagrammatically rather than

play18:53

in the form of equations. Off course we can represent the structural equation

play18:59

model as a system of equations but we can also represent it as a diagram

play19:05

and this visual aspect again is very appealing for social scientists

play19:09

perhaps less comfortable and less intuitive in their reading of equations.

play19:15

So the standardized notation of path analysis is a very important feature. The

play19:23

path analysis presents regression equations between our measured variables

play19:29

so we're interested again in kind of systems of relationships between

play19:33

multiple observed variables. Now that's important and I'm saying observed

play19:39

variables there because in a standard path analysis we would not be using

play19:44

latent variables but variables which are directly observed again perhaps single

play19:50

questionnaire items other kinds of measure. A third key feature of path

play19:55

analysis is its focus not just on direct effects but also as I was talking about

play20:01

earlier on indirect effects and total effects. So for research questions where

play20:08

we don't have a simple linear model where we 're estimating the effects of some

play20:13

set of predictor variables on an outcome dependent or a criterion that we're

play20:18

interested in the pathways between multiple independent variables and

play20:25

possibly multiple dependent

play20:27

variables. | So in this slide I'm presenting some of the standardized

play20:33

notation the way that we represent different parts of the model using

play20:39

diagrammatic notation. We can see at the top a measured latent variable so latent variable

play20:45

will be presented as an ellipse and an observed or manifest variables

play20:51

such as a questionnaire items that we might use as an indicator of a measured

play20:57

latent variable would be a rectangle and error variance or disturbance term

play21:03

is a small circle and there's a similarity with the measured latent

play21:09

variable they are both circular shaped because an error variance is also a

play21:14

latent variable it's is just that we are not specifying it as measuring anything in

play21:20

particular it is the what's left over the residual or disturbance term.

play21:27

A covariance path where we're specifying that two variables in the model are

play21:32

related or correlated with one another

play21:35

would be represented as a curved double-headed arrow this is a

play21:41

non-directional association i.e. we're not specifying there is any causal link from

play21:46

one variable to another but we want to indicate that they are correlated. And

play21:51

finally the single headed straight arrow represents a directional path or what we

play21:58

would generally think of as employing causality in the model a regression path

play22:05

from one variable to another. | So here are some examples of some simple path

play22:11

diagrams that we could represent in Equation form or using standardized path

play22:18

notation in this simple diagram we can see that the variable X has a causal

play22:25

effect on Y and the D term there is the disturbance term so the the error term in

play22:32

this model we could this is essentially a bivariate regression

play22:37

model we can also write this in that standard equation notation. This second

play22:46

path diagram is somewhat more complicated but really is just adding in

play22:51

a second independent variable X2 so again this is equivalent to a multiple linear

play22:59

regression with two independent variables a dependent variable Y and an

play23:05

error term which in this path diagram is labeled D for the disturbance term. |Now

play23:12

as I mentioned one of the things that path diagrams, path analysis are

play23:17

particularly useful for is for studying not just direct

play23:23

effects but also indirect effects we can say now that we've introduced a more

play23:28

complex relationship between these variables where x1 has a direct effect

play23:35

on x2 but x2 also has a direct effect on Y so we now have an indirect

play23:43

effects of x1 on y through x2. And we can use standard formulae

play23:51

to decompose these regression coefficients indicated by beta1 to beta3 into

play23:58

the direct indirect and total components. So here

play24:04

beta1 represents the direct effect of x1 on y , beta2 is the direct effect of x1

play24:11

on x2 . beta3 now is the direct effect of x2 on y and beta2 x beta3 will

play24:22

give us the indirect effect of x1 on y. And we can also compute from this path

play24:29

diagram the total effect which is the sum of the indirect and the

play24:36

direct effects between one variable and other . So if we take the sum of beta1

play24:42

and the product of beta2 and beta3 this will give us the total effect of x1 on

play24:49

y.

play24:50

| So that's given a very brief overview of both latent variables and path analysis

play24:59

and what I'm encouraging you to think about to understand what we're doing

play25:05

with structural equation models is that when we have a path diagram that

play25:11

includes latent variables rather than just observed variables as we can see in

play25:17

this diagram then we're representing a structural equation model.

Rate This

5.0 / 5 (0 votes)

相关标签
Structural EquationMultivariate TechniquesPath AnalysisCovariance StructureLatent VariablesMeasurement TheorySocial ScienceStatistical ModelingResearch DesignRegression Analysis
您是否需要英文摘要?