Academia.edu no longer supports Internet Explorer.
To browse Academia.edu and the wider internet faster and more securely, please take a few seconds to upgrade your browser.
NERA Annual Meeting
…
11 pages
1 file
There are many software packages that estimate item response theory parameters and examinee abilities. This study evaluates the open-source ltm package used in R under six conditions on the number of items and examinees. Item and ability estimates were compared with true parameters by, bias, MAD, and RMSE.
Measurement: Interdisciplinary Research and Perspectives, 2019
About 45 R packages to analyze data using item response theory (IRT) have been developed over the last decade. This article introduces these 45 R packages with their descriptions and features. It also describes possible advanced IRT models using R packages, as well as dichotomous and polytomous IRT models, and R packages that contain applications such as differential item functioning and equating are also introduced. Thus, this article helps researchers who plan to use IRT-based analysis to decide on the type of IRT analysis and choose the appropriate R packages.
Applied Psychological Measurement, 2007
Fit of the model to the data is important if the benefits of item response theory (IRT) are to be obtained. In this study, the authors compared model selection results using the likelihood ratio test, two information-based criteria, and two Bayesian methods. An example illustrated the potential for inconsistency in model selection depending on which of the indices was used. Results from a simulation study indicated that the inconsistencies among the indices were common but that model selection was relatively accurate for longer tests administered to larger sample of examinees. The cross-validation log-likelihood (CVLL) appeared to work the best of the five models for the conditions simulated in this study.
Eğitimde ve Psikolojide Ölçme ve Değerlendirme Dergisi, 2020
The aim of this study is twofold. The first one is to investigate the effect of sample size and test length on the estimation of item parameters and their standard errors for the two parameter item response theory (IRT). Another is to provide information about the performance of Mplus, BILOG-MG and R (ltm) programs in terms of parameter estimation under the conditions which were mentioned above. The simulated data were used in this study. The examinee responses were generated by using the open-source program R. After obtaining the data sets, the parameters were estimated in BILOG-MG, Mplus and R (ltm). The accuracy of the item parameters and ability estimates were evaluated under six conditions that differed in the numbers of items and examinees. After looking at the resulting bias and root mean square error (RMSE) values, it can be concluded that Mplus is an unbiased program when compared to BILOG-MG and R (ltm). BILOG-MG can estimate parameters and standard errors close to the true values, when compared to Mplus and R (ltm).
Global Journal of Educational Research, 2017
Item response theory (IRT) is a framework for modeling and analyzing item response data. Item-level modeling gives IRT advantages over classical test theory. The fit of an item score pattern to an item response theory (IRT) models is a necessary condition that must be assessed for further use of item and models that best fit the data. The study investigated item level diagnostic statistics and model-data fit with one-and two-parameter models using IRTPROV3.0 and BILOG-MG V3.0. Ex-post facto design was adopted. The population for the study consisted of 11,538 candidates' responses who took Type L 2014 Unified Tertiary Matriculation Examination (UTME) Mathematics paper in Akwa Ibom State, Nigeria. The sample of 5,192(45%) responses was randomly selected through stratified sampling technique. BILOG-MG V3.0 and IRTPROV3.0 computer software was used to calibrate the candidates' responses. Two research questions were raised to guide the study. Pearson's χ 2 and S-χ 2 statistics as an item fit index for dichotomous item response theory models were used. The outputs from the two computer software were used to answer the questions. The findings revealed that only 1 item fitted 1parameter model in BILOG-MG V3.0 and IRTPRO V3.0. Furthermore, the findings revealed that 26 items fitted 2-parameter models when using BILOG-MG V3.0. Five items fitted 2-parameter models in IRTPRO. It was recommended that the use of more than one IRT software programme offers more useful information for the choice of model that fit the data.
Psychometrics has recently seen the development of complex measurement models to better represent test and item data. Item Response Theory (IRT), in particular, comprises a set of non-linear latent variable models that appear to have several conceptual and empirical properties that make them more valuable in practice than classical test theory methods. However, IRT-based models typically require the availability of costly and computationally- intensive software for estimating parameters and assessing model fit. In this paper, we present a set of SAS Macros called IRT-FIT, which use SAS /IML® and SAS/GRAPH® to estimate, fit, and graph two- and three-parameter IRT models to binary test data. The macros currently developed use Bock and Aitkin's (1981) Marginal Maximum Likelihood (MML) estimation algorithm for fitting models and estimating parameters as the basis for the computations. Additionally, we have extended the MML routines by implementing Bayesian Estimation concepts as su...
Journal of Statistical Software, 2006
The R package ltm has been developed for the analysis of multivariate dichotomous and polytomous data using latent variable models, under the Item Response Theory approach. For dichotomous data the Rasch, the Two-Parameter Logistic, and Birnbaum's Three-Parameter models have been implemented, whereas for polytomous data Semejima's Graded Response model is available. Parameter estimates are obtained under marginal maximum likelihood using the Gauss-Hermite quadrature rule. The capabilities and features of the package are illustrated using two real data examples.
Applied Psychological Measurement, 2011
ETS Research Report Series, 2011
Research Association, American Psychological Association, & National Council for Measurement in Education, 1999) demands evidence of model fit when an item response theory (IRT) model is used to make inferences from a data set. We applied two recently suggested methods for assessing goodness of fit of IRT models-generalized residual analysis (Haberman, 2009) and residual analysis for assessing item fit )-to several operational data sets.
Measurement, 2013
The article provides an overview of goodness-of-fit assessment methods for item response theory (IRT) models. It is now possible to obtain accurate p-values of the overall fit of the model if bivariate information statistics are used. Several alternative approaches are described. As the validity of inferences drawn on the fitted model depends on the magnitude of the misfit, if the model is rejected it is necessary to assess the goodness of approximation. With this aim in mind, a class of root mean squared error of approximation (RMSEA) is described, which makes it possible to test whether the model misfit is below a specific cutoff value. Also, regardless of the outcome of the overall goodness-of-fit assessment, a piece-wise assessment of fit should be performed to detect parts of the model whose fit can be improved. A number of statistics for this purpose are described, including a z statistic for residual means, a mean-and-variance correction to Pearson’s X2 statistic applied to each bivariate subtable separately, and the use of z statistics for residual cross-products.
Communications in Statistics - Simulation and Computation, 2007
Theory of Everything: The Missing Link in Modern Physics, Which Solves all the Problems, 2014
Infolibre, 20 abril 2024
BUDAPEST TRAVEL GUIDE FOR COMPLETE ADVENTURE, 2020
Transilvania, 2024
Psychology and Education: A Multidisciplinary Journal, 2024
Journal of Critical Infrastructure Policy, 2021
Artificial Intelligence, 1996
Forensic Science International: Synergy, 2019
The Journal of General Physiology, 1973
Journal of Health, 2017
Journal of Urological Surgery, 2021
Journal of Breast Health, 2015
Theory and Decision, 2012
Revista Paulista de Pediatria, 2018