Structural Equation Modelling: A Powerful Antibiotic: H. K. Dangi, Ashmeet Kaur and Juhi Jham
Structural Equation Modelling: A Powerful Antibiotic: H. K. Dangi, Ashmeet Kaur and Juhi Jham
Structural Equation Modelling: A Powerful Antibiotic: H. K. Dangi, Ashmeet Kaur and Juhi Jham
Journal of Business Thought, Vol 10, DOI: 10.18311/jbt/2019/23452, April 2019 – March 2020 ISSN (Online) : 2581-8104
Abstract
This article is an attempt to scrutinize the applicability of the widely used statistical technique of Structural Equation Modelling
(SEM). SEM is a comprehensive technique to test the model adequacy. SEM is considered as an important advancement in
social science research as it combines measurement with substantive theories. It has been observed that many studies pay
attention to statistical mechanisation of SEM rather than the assumptions on which it is based. The history of SEM can be
traced to Regression Analysis, Path Analysis and Confirmatory Factor Analysis. SEM is popularly applied because of its use in
estimating multiple dependence relationships. It is able to measure the unobserved variables, define the model representing
the set of relationships and also corrects the measurement error. The technique is commonly applied in disciplines including
sociology, psychology and other fields of behavioural science. The availability of various user-friendly software programmes
like LISREL, AMOS, EQS, Mx, Mplus and PISTE is an added advantage. However, one should be careful while using SEM for
causal inferences. In comparison to other common standard statistical techniques, SEM is based on several assumptions. The
technique requires a priori knowledge of all the parameters to be estimated and a substantial amount of data pertaining to
covariances, variances and path coefficients. It also requires relationships to be specified in the model. The model inherently
assumes temporal precedence and is heavily dependent on researcher’s judgements about exogeneity and directionality.
Normality is yet another important assumption of SEM. The mismatch between data characteristics and assumptions imperils
inference and accuracy. Like antibiotics are a boon to mankind yet one needs to judiciously use them. Similarly, SEM is a
powerful technique however, researchers are suggested to apply cautiously.
Keywords: Confirmatory Factor Analysis, Latent Variables, Path Analysis, Regression Analysis, Structural Equation
Modelling
JEL classification: C36, C38
priori hypothesis is framed in regard to how the vari- istics of using covariance as the basic statistic. Covariance
ables will cluster together on a factor1. It can be used to statistic conveys more information than regression as in
determine if the scale performs in the same manner while the latter, the differences between observed and expected
working with different population groups. The psycho- individual cases are minimized. While, in SEM, the dif-
metric properties of varied versions of the scale can be ferences between observed and expected covariance
compared using CFA. CFA is different from Exploratory matrices are minimized14. The analysis focuses on the fit
Factor Analysis (EFA). In EFA, the software performs its of the data to the theoretical model. SEM allows us to dis-
statistical operations and produces the best combination tinguish between direct and indirect relationships among
of variables clustering together to form a factor, even if it the variables by examining mediation and moderation.
is different from the combinations of variables that have SEM also indicates the group differences. Hence, it can
been hypothesised. In those cases where the model does be used to compare the results of separate models devel-
not fit the data, certain ideas and clues are available to oped for different groups through multiple-group SEM
guide the shuffling of variables so that the model fits the models. The longitudinal data for measuring the change
data in a better way. SEM is a combination of CFA and in the growth of variables over a period of time can be
multiple in a broad sense. collected. It augments in providing new ideas to research-
ers in the field of SEM. The availability of user-friendly
software programmes is an added advantage. Many of the
3. Applications of SEM software programmes are Windows-based and generate
SEM is a multivariate procedure that, as defined by the programme syntax internally, and thus, are easier to
Ullman12, “Allows examination of a set of relationships apply.
between one or more independent variables, either con- SEM is popular in its applications because of the req-
tinuous or discrete, and one or more dependent variables, uisite use of multiple observed variables by researchers
either continuous or discrete.” In SEM, the oval shape for better understanding. It is capable of dealing with
depicts latent variables while squares represent measured the sophisticated theories that are statistically modelled
variables10. Each latent variable, also called construct or and tested. All SEM have three main features3. First is the
unobserved variable, is in fact a small CFA. Lines are used characteristic of estimating multiple dependence rela-
to indicate relationships between variables. Lines have tionships. Second is the representation of unobserved
either one arrow for depicting a hypothesised direct rela- concepts in the relationships and correction of the mea-
tionship between two variables or two arrows indicating a surement errors. And third is the ability to define the
covariance between the two variables13. model representing the set of relationships. Another con-
SEM is considered as an important advancement in tribution to its popularity is the measurement error taken
social science research as it combines measurement with explicitly into account while statistically analysing the
substantive theories. SEM analysis deals with testing of a data. This ensures a greater importance to the reliability
model, testing a hypothesis about a model or modifica- and validity of the observed scores from the measurement
tion of an existing model. The analysis makes it possible instruments.
to simultaneously test all the relationships in case of com-
plex and multidimensional constructs. SEM is commonly
4. Limitations of SEM
applied in disciplines including sociology, psychology
and other behavioural science because of its capability to SEM is a powerful technique for testing models but the
test relationships between latent and measured variables. modelling process, at times, is complicated. The tech-
Further, availability of many user-friendly software have nique requires a priori knowledge of all the parameters
increased the popularity of SEM amongst researchers. to be estimated. Significant amount of data pertaining to
To check model adequacy, goodness of fit test is used5. covariances, variances, path coefficients and the relation-
SEM is useful in understanding the relational data in mul- ships is needed to be specified in the model. With the
tivariate systems and in examining the variances in the availability of user-friendly statistical softwares such as
variables. It can be distinguished from other conventional LISREL, AMOS and EQS, SEM is being used widely It is
methods of statistical analysis due to its distinct character- being excessively reported in social work journals without
adequately validating the assumptions on which the tech- research are mostly composed of less than a few hundred
nique is fundamentally based. SEM inherently assumes cases, thus, violating the basic assumption.
temporal precedence i.e., presumed cause occurs before the It has been observed that in many research studies,
presumed effect6. Temporal precedence must be checked efforts made to model modifications for better goodness
through random assignment of cases to conditions in of fit statistic, are unnecessary. Significant modifications
experimental studies and by measuring cause and effect are made to improve a model fit. These include dropping
relationships at different points over time in non-experi- indicators, allowing cross-loadings and including numer-
mental studies. It is not possible to demonstrate temporal ous correlated error terms, which not only challenge the
precedence if all variables are measured simultaneously, reliability and validity of some studies, but also make it
which is true in case of most of the studies, thus, rendering impossible to replicate the findings with new data.
little justification for the inferences made. Studies that make excessive modifications to a model
It further assumes strong association or co-variation also hamper meaningful interpretations as interpreting
backed by both theories and results of empirical studies. a model with numerous correlated error terms or cross-
This assumption significantly affects the inferences made loadings or both, is inconclusive2. These modifications
as data may indicate spurious association. Association have improved the empirical fit of the model, however,
between two variables could be strong even if there is theoretical consistency is compromised. Testing direc-
no causal relation as both variables might have a com- tionality in relationships is yet another challenge in the
mon third factor causing them. Prior knowledge of causal model. It is the researcher’s hypotheses of causality that
relations is assumed in interpreting path coefficients. In
form the model. Recreation of the variance patterns,
various fields of research like behavioural science, one
observed in nature, is not possible by using SEM since the
barely knows the causal model, rather one hypothesizes
working of the model is limited by the researcher’s choice
the model. If the model fits the data one may conclude
of variables and paths. This makes several models to fit
that model is consistent with the data but one cannot
the data equally well. Eliciting desired results is relatively
claim about the applicability of the model as it is not
easy indicating why the model is overly applied.
proven to be true.
Structural models are heavily dependent on the
researcher’s judgments about exogeneity and directional- 5. Conclusion
ity. Exogeneity implies that the variable presumed to be
exogenous must not affect the endogenous variable in any One should rather be careful while using SEM for causal
other way than prescribed, directly or indirectly. Such a inferences. Assumptions are critical in specification,
variable must be uncorrelated with any other unmeasured analysis and interpretation. All statistical tests make cer-
cause of the endogenous variable. It is difficult to validate tain assumptions about the data or model. It has been
assumptions in the absence of robust empirical evidence. observed that inadequate attention is paid to assumptions.
Multivariate normality of the observed variable is yet Mismatch between data characteristics and assumptions
another important assumption. Before building a model of a particular method used, imperils inference and accu-
it is important to ensure that all the observations must racy of results. Conclusions that are extrapolated from
be drawn from a continuous and multivariate normal a model based on a small sample size are unreliable.
population. The Maximum Likelihood (ML) technique Omission of crucial variables is another major cause of
of approximation, which assumes normality, is used to poorly fit models.
estimate the parameters7. This makes SEM a large sample SEM is being overly applied and hence, grossly misused.
technique. Thus, drawing conclusions from a small sam- Unwarranted usage of SEM makes the validity of inferences
ple size makes them unreliable. questionable. It is being applied without an exhaustive
Use of non-continuous data in the model also leads knowledge of the variables and the model constructed, and
to profusely biased results. Many studies have used this without checking whether, or not, it is justified to use SEM.
technique on dichotomous or ordinal data, which is an Further, the technique is deployed most of the times with-
incorrect estimation method, as it may give inconsistent out validating the compliance of all the assumptions.
results. Estimation via ML technique further requires One may consider applying another suitable statistical
large sample size while sample sizes in social sciences test depending on the data properties. For non-normal
data, alternative methods of estimation like Ordinary 5. Kenny DA, McCoach DB. Effect of the number of vari-
Least Squares can be used with large sample size require- ableson measures of fit in structural equation modeling.
ments. There should be no missing data in any variable. Structural Equation Modeling. 2003; 10(3):333–51. https://
There are various methods dealing with such issues like doi.org/10.1207/S15328007SEM1003_1
Missing Completely at Random approach, Missing at 6. Kline RB. Assumptions in structural equation modeling. R.
H. Hoyle, Ed. Handbook of structural equation modeling.
Random approach and Imputation approach. One may
The Guilford Press; 2012. p. 111–25.
use Partial Least Squares in place of AMOS for forma-
7. Kumar S. Structural equation modeling basic assump-
tive constructs. Thus, if assumptions are not met, suitable tions and concepts: A novices guide. Asian Journal of
alternatives should be explored and employed. Management Sciences. 2015; 03(07):25–8.
Therefore, just like antibiotics, which are a big gift to 8. Lawley D. VI - The estimation of factor loadings by the
mankind, but have been extensively misused resulting in method of maximum likelihood. Proceedings of the Royal
drug resistance; SEM is also widely misused. Society of Edinburgh. 1940; 60(1):64–82. https://doi.
However, if used judiciously, it is a very powerful tech- org/10.1017/S037016460002006X
nique. Thus, there is a need for ensuring a more prudent 9. Streiner DL. Building a better model: An introduc-
use of SEM amongst researchers, warranting it is applied tion to structural equation modelling. Can J Psychiatry.
where apt and for the purpose it is meant for. 2006; 51(5):317–24. PMid: 16986821. https://doi.
org/10.1177/070674370605100507
10. Stoelting R. Structural Equation modeling/Path Analysis.
6. References 2002 Sept. http://userwww.sfsu.edu/efc/classes/biol710/
path/SEMwebpage.htm
1. Anderson JC, Gerbing DW. Structural equation modeling
11. Tomarken AJ, Waller NG. Structural equation modeling:
in practice: A review and recommended two-step approach.
Strengths, limitations and misconceptions. 2005; 1:31–65.
APA PsycNET Direct. 1988; 103(3):411–23. https://doi.
https://doi.org/10.1146/annurev.clinpsy.1.102803.144239
org/10.1037//0033-2909.103.3.411
12. Ullman JB. Structural equation modeling. New York, NY:
2. Guo B, Perron BE, Gillespie DF. A systematic review of
Harper Collins College Publishers; 1996. p. 709–819.
structural equation modelling in social work research. Br.
13. Ullman JB, Bentler PM. Structural equation modelling.
J. Soc. Work 2009; 39(8):1556–74. https://doi.org/10.1093/
Research Methods in Psychology. 2012; 2:663–83. https://
bjsw/bcn101
doi.org/10.1002/9781118133880.hop202023
3. Hair JF, Sarstedt M, Ringle CM, Mena JA. An assessment
14. Valluzzi JL, Larson SL, Miller GE. Indications and limita-
of the use of partial least squares structural equation
tions of structural equation modeling in complex surveys:
modeling in marketing research. Journal of the Academy
Application in the Medical Expenditure Panel Survey
of Marketing Science. 2012; 40:414–33. https://doi.
(MEPS). Agency for Healthcare Research and Quality,
org/10.1007/s11747-011-0261-6
Center for Financing, Access, and Cost Trends. 2003.
4. Hox JJ, Bechger TM. An introduction to structural equa-
tion modeling. Family Science Review. 1998; 11:354–73.