Source author record

Silvia Bacci

Silvia Bacci appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Methodology, Applications, Computation, math.ST, Statistics Theory

Catalog footprint

12 works

5 topics

4 close collaborators


preprint, 2021, arXiv

Conditioning on the pre-test versus gain score modeling: revisiting the controversy in a multilevel setting

We consider estimating the effect of a treatment on the progress of subjects tested both before and after treatment assignment. A vast literature compares the competing approaches of modeling the post-test score conditionally on the pre-test score versus modeling the difference, namely the gain score. Our contribution resides in analyzing the merits and drawbacks of the two approaches in a multilevel setting. This is relevant in many fields, for example education with students nested into schools. The multilevel structure raises peculiar issues related to the contextual effects and the distinction between individual-level and cluster-level treatment. We derive approximate analytical results and compare the two approaches by a simulation study. For an individual-level treatment our findings are in line with the literature, whereas for a cluster-level treatment we point out the key role of the cluster mean of the pre-test score, which favors the conditioning approach in settings with large clusters.
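The two competing estimators can be illustrated with a minimal single-level sketch. It deliberately ignores the multilevel structure, contextual effects and cluster-level treatments that the paper studies; the function names and the toy data are hypothetical and chosen only so that the two answers can be checked by hand.

```python
def mean(xs):
    return sum(xs) / len(xs)

def gain_score_effect(pre, post, treated):
    """Difference in mean gain (post - pre) between treated and control units."""
    gains_t = [b - a for a, b, z in zip(pre, post, treated) if z]
    gains_c = [b - a for a, b, z in zip(pre, post, treated) if not z]
    return mean(gains_t) - mean(gains_c)

def conditioning_effect(pre, post, treated):
    """Treatment coefficient in the OLS regression of post on (1, treated, pre).

    With a single binary treatment this equals the difference in post-test
    means minus the pooled within-group slope times the difference in
    pre-test means.
    """
    groups = {True: ([], []), False: ([], [])}
    for a, b, z in zip(pre, post, treated):
        groups[bool(z)][0].append(a)
        groups[bool(z)][1].append(b)
    sxy = sxx = 0.0
    for xs, ys in groups.values():
        mx, my = mean(xs), mean(ys)
        sxy += sum((x - mx) * (y - my) for x, y in zip(xs, ys))
        sxx += sum((x - mx) ** 2 for x in xs)
    slope = sxy / sxx  # pooled within-group slope of post on pre
    (xt, yt), (xc, yc) = groups[True], groups[False]
    return (mean(yt) - mean(yc)) - slope * (mean(xt) - mean(xc))

# Hypothetical noise-free data: post = 1 + 0.5 * pre + 2 * treated,
# with treated units starting from higher pre-test scores.
pre = [2.0, 4.0, 6.0, 1.0, 2.0, 3.0]
treated = [1, 1, 1, 0, 0, 0]
post = [1.0 + 0.5 * x + 2.0 * z for x, z in zip(pre, treated)]

gain_estimate = gain_score_effect(pre, post, treated)
conditional_estimate = conditioning_effect(pre, post, treated)
```

In this toy setting the groups differ at baseline and the pre-test slope is 0.5 rather than 1, so the two estimators disagree: conditioning recovers the coefficient 2 used to generate the data, while the gain score gives 1.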

preprint, 2016, arXiv

Evaluation of student proficiency through a multidimensional finite mixture IRT model

In certain academic systems, a student can enroll for an exam immediately after the end of the teaching period or can postpone it to any later examination session, so that the grade is missing until the exam is attempted. We propose an approach for the evaluation in itinere of a student's proficiency that also accounts for non-attempted exams. The approach is based on considering each exam as an item, so that responding to the item amounts to attempting the exam, and on an Item Response Theory model that includes two latent variables corresponding to the student's ability and the propensity to attempt the exam. In this way, we explicitly account for non-ignorable missing observations, as the indicators of item response also contribute to measuring the ability. The two latent variables are assumed to have a discrete distribution defining latent classes of students that are homogeneous in terms of ability and priority assigned to exams. The model, which also allows for individual covariates in its structural part, is fitted by the Expectation-Maximization algorithm. The approach is illustrated through the analysis of data about the first-year exams of freshmen of the School of Economics at the University of Florence (Italy).

preprint, 2014, arXiv

A multidimensional latent class IRT model for non-ignorable missing responses

We propose a structural equation model, which reduces to a multidimensional latent class item response theory model, for the analysis of binary item responses with non-ignorable missingness. The missingness mechanism is driven by two sets of latent variables: one describing the propensity to respond and the other referring to the abilities measured by the test items. These latent variables are assumed to have a discrete distribution, so as to reduce the number of parametric assumptions regarding the latent structure of the model. Individual covariates may also be included through a multinomial logistic parametrization of the probabilities of each support point of the distribution of the latent variables. Given the discrete nature of this distribution, the proposed model is efficiently estimated by the Expectation-Maximization algorithm. A simulation study is performed to evaluate the finite sample properties of the parameter estimates. Moreover, an application to data from a Students' Entry Test for admission to some university courses is illustrated.

preprint, 2014, arXiv

A multilevel finite mixture item response model to cluster examinees and schools

Within the educational context, a key goal is to assess students' acquired skills and to cluster students according to their ability level. In this regard, a relevant element to be accounted for is the possible effect of the school students come from. To this aim, we provide a methodological tool which takes into account the multilevel structure of the data (i.e., students in schools) in a suitable way. This approach allows us to cluster both students and schools into homogeneous classes of ability and effectiveness, and to assess the effect of certain student and school characteristics on the probability of belonging to such classes. The approach relies on an extended class of multidimensional latent class IRT models characterized by: (i) latent traits defined at student level and at school level, (ii) latent traits represented through random vectors with a discrete distribution, (iii) the inclusion of covariates at student level and at school level, and (iv) a two-parameter logistic parametrization for the conditional probability of a correct response given the ability. The approach is applied to the analysis of data collected by two national tests administered in Italy to middle school students in June 2009: the INVALSI Italian Test and Mathematics Test. Results allow us to study the relationships between observed characteristics and latent trait standing within each latent class at the different levels of the hierarchy. They show that examinees' and schools' expected observed scores, at a given latent trait level, depend on both unobserved (latent class) group membership and observed first- and second-level covariates.
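As a rough illustration of points (ii) and (iv), the sketch below evaluates a two-parameter logistic response probability and averages it over a discrete ability distribution. It covers a single latent trait at student level only, without covariates or a school level; the function names and the numeric values are hypothetical.

```python
import math

def p_correct_2pl(theta, discrimination, difficulty):
    """Two-parameter logistic probability of a correct response given ability theta."""
    return 1.0 / (1.0 + math.exp(-discrimination * (theta - difficulty)))

def marginal_p_correct(support_points, class_weights, discrimination, difficulty):
    """Probability of a correct response averaged over latent classes.

    support_points[c] is the ability level of class c and class_weights[c]
    its probability (the weights are assumed to sum to one).
    """
    return sum(
        w * p_correct_2pl(t, discrimination, difficulty)
        for t, w in zip(support_points, class_weights)
    )

# Hypothetical two-class population, symmetric around the item difficulty.
p_marginal = marginal_p_correct([-1.0, 1.0], [0.5, 0.5], 1.5, 0.0)
```

Because the latent distribution is discrete, the marginal probability is a finite sum rather than an integral, which is what makes the E-step of an EM algorithm straightforward for this class of models.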

preprint, 2012, arXiv

A causal analysis of mother's education on birth inequalities

We propose a causal analysis of the effect of the mother's educational level on the health status of the newborn, in terms of gestational weeks and weight. The analysis is based on a finite mixture structural equation model, the parameters of which have a causal interpretation. The model is applied to a dataset of almost ten thousand deliveries collected in an Italian region. The analysis confirms that standard regression overestimates the impact of education on child health. With respect to the current economic literature, our findings indicate that only high education has positive consequences on child health, implying that policy efforts in education should have benefits for welfare.

preprint, 2012, arXiv

A class of Multidimensional Latent Class IRT models for ordinal polytomous item responses

We propose a class of Item Response Theory models for items with ordinal polytomous responses, which extends an existing class of multidimensional models for dichotomously-scored items measuring more than one latent trait. In the proposed approach, the random vector used to represent the latent traits is assumed to have a discrete distribution with support points corresponding to different latent classes in the population. We also allow for different parameterizations for the conditional distribution of the response variables given the latent traits - such as those adopted in the Graded Response model, in the Partial Credit model, and in the Rating Scale model - depending on both the type of link function and the constraints imposed on the item parameters. For the proposed models we outline how to perform maximum likelihood estimation via the Expectation-Maximization algorithm. Moreover, we suggest a strategy for model selection which is based on a series of steps consisting of selecting specific features, such as the number of latent dimensions, the number of latent classes, and the specific parametrization. In order to illustrate the proposed approach, we analyze data deriving from a study on anxiety and depression as perceived by oncological patients.
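The difference between two of the parameterizations named above can be sketched for a single item and a single latent trait. The snippet uses the textbook forms of the Graded Response model (cumulative, or global, logits) and of the Partial Credit model (adjacent-category, or local, logits); the function names and threshold values are hypothetical and are not taken from the paper.

```python
import math

def logistic(x):
    return 1.0 / (1.0 + math.exp(-x))

def graded_response_probs(theta, discrimination, thresholds):
    """Category probabilities from global (cumulative) logits.

    P(Y >= y | theta) = logistic(discrimination * (theta - thresholds[y - 1]));
    thresholds must be increasing so that all probabilities are non-negative.
    """
    cumulative = [1.0]
    cumulative += [logistic(discrimination * (theta - b)) for b in thresholds]
    cumulative.append(0.0)
    return [cumulative[y] - cumulative[y + 1] for y in range(len(thresholds) + 1)]

def partial_credit_probs(theta, thresholds):
    """Category probabilities from local (adjacent-category) logits.

    log P(Y = y) - log P(Y = y - 1) = theta - thresholds[y - 1].
    """
    scores = [0.0]
    for b in thresholds:
        scores.append(scores[-1] + (theta - b))
    peak = max(scores)  # subtract the maximum for numerical stability
    weights = [math.exp(s - peak) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]

# Hypothetical three-category item evaluated at theta = 0.
grm = graded_response_probs(0.0, 1.0, [-1.0, 1.0])
pcm = partial_credit_probs(0.0, [0.0, 0.0])
```

A Rating Scale variant would additionally constrain the thresholds to share a common set of category offsets across items, which is a restriction on the item parameters rather than a different link function.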

preprint, 2012, arXiv

A comparison of some criteria for states selection in the latent Markov model for longitudinal data

We compare different selection criteria to choose the number of latent states of a multivariate latent Markov model for longitudinal data. This model is based on an underlying Markov chain to represent the evolution of a latent characteristic of a group of individuals over time. Then, the response variables observed at the different occasions are assumed to be conditionally independent given this chain. Maximum likelihood estimation of the model is carried out through an Expectation-Maximization algorithm based on forward-backward recursions which are well known in the hidden Markov literature for time series. The selection criteria we consider in our comparison are based on penalized versions of the maximum log-likelihood or on the posterior probabilities of belonging to each latent state, that is, the conditional probability of the latent state given the observed data. A Monte Carlo simulation study shows that the criteria based on the penalized log-likelihood generally perform better than the classification-based criteria. This is because the latter tend to underestimate the true number of latent states, especially in the univariate case.
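A minimal sketch of the two ingredients mentioned here, the forward recursion and penalized log-likelihood criteria, is given below for one univariate categorical sequence. It is not the paper's implementation: the argument layout, the choice of the number of subjects as the BIC sample size, and the example parameter values are assumptions made for illustration.

```python
import math

def forward_log_likelihood(obs, initial, transition, emission):
    """Log-likelihood of one categorical sequence via the scaled forward recursion.

    initial[u]       : probability of starting in latent state u
    transition[u][v] : probability of moving from state u to state v
    emission[u][y]   : probability of observing category y in state u
    """
    k = len(initial)
    alpha = [initial[u] * emission[u][obs[0]] for u in range(k)]
    loglik = 0.0
    for y in obs[1:]:
        scale = sum(alpha)
        loglik += math.log(scale)
        alpha = [a / scale for a in alpha]  # rescale to avoid underflow
        alpha = [
            sum(alpha[u] * transition[u][v] for u in range(k)) * emission[v][y]
            for v in range(k)
        ]
    return loglik + math.log(sum(alpha))

def aic(loglik, n_params):
    return -2.0 * loglik + 2.0 * n_params

def bic(loglik, n_params, n_subjects):
    # Assumption: the sample size in the penalty is the number of subjects.
    return -2.0 * loglik + n_params * math.log(n_subjects)

# Hypothetical two-state chain whose states emit identically, so the
# likelihood must reduce to a product of marginal category probabilities.
loglik_example = forward_log_likelihood(
    [0, 1, 0],
    [0.5, 0.5],
    [[0.9, 0.1], [0.2, 0.8]],
    [[0.7, 0.3], [0.7, 0.3]],
)
```

In a selection exercise one would fit the model for an increasing number of states, sum the log-likelihood over subjects, and retain the number of states that minimizes the chosen criterion.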

preprint, 2012, arXiv

A multidimensional latent class Rasch model for the assessment of the Health-related Quality of Life

The work describes a multidimensional latent class Rasch model and its application to data about the measurement of some aspects of Health-related Quality of Life and Anxiety and Depression in oncological patients.

preprint, 2012, arXiv

Joint Assessment of the Differential Item Functioning and Latent Trait Dimensionality of Students' National Tests

Within the educational context, students' assessment tests are routinely validated through Item Response Theory (IRT) models which assume unidimensionality and absence of Differential Item Functioning (DIF). In this paper, we investigate if such assumptions hold for two national tests administered in Italy to middle school students in June 2009: the Italian Test and the Mathematics Test. To this aim, we rely on an extended class of multidimensional latent class IRT models characterised by: (i) a two-parameter logistic parameterisation for the conditional probability of a correct response, (ii) latent traits represented through a random vector with a discrete distribution, and (iii) the inclusion of (uniform) DIF to account for students' gender and geographical area. A classification of the items into unidimensional groups is also proposed and represented by a dendrogram, which is obtained from a hierarchical clustering algorithm. The results provide evidence for DIF effects for both Tests. Besides, the assumption of unidimensionality is strongly rejected for the Italian Test, whereas it is reasonable for the Mathematics Test.

preprint, 2012, arXiv

Mixtures of equispaced normal distributions and their use for testing symmetry in univariate data

Given a random sample of observations, mixtures of normal densities are often used to estimate the unknown continuous distribution from which the data come. Here we propose the use of this semiparametric framework for testing symmetry about an unknown value. More precisely, we show how the null hypothesis of symmetry may be formulated in terms of a normal mixture model, with weights about the centre of symmetry constrained to be equal to one another. The resulting model is nested in a more general unconstrained one, with the same number of mixture components and free weights. Therefore, after having maximised the constrained and unconstrained log-likelihoods by means of a suitable algorithm, such as Expectation-Maximisation, symmetry is tested against skewness through a likelihood ratio statistic. The performance of the proposed mixture-based test is illustrated through a Monte Carlo simulation study, where we compare two versions of the test, based on different criteria to select the number of mixture components, with the traditional one based on the third standardised moment. An illustrative example is also given that focuses on real data.
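The structure of the test can be sketched as follows: component means are equispaced around a centre, the null hypothesis forces mirrored weights to be equal, and the statistic compares the two maximised log-likelihoods. The snippet only evaluates log-likelihoods for given parameters and does not perform the maximisation; the common standard deviation, the function names and the numeric values are assumptions made for illustration.

```python
import math

def normal_pdf(x, mean, sd):
    z = (x - mean) / sd
    return math.exp(-0.5 * z * z) / (sd * math.sqrt(2.0 * math.pi))

def equispaced_means(centre, spacing, k):
    """k component means placed symmetrically around `centre`, `spacing` apart."""
    return [centre + spacing * (j - (k - 1) / 2.0) for j in range(k)]

def mixture_loglik(data, weights, centre, spacing, sd):
    """Log-likelihood of a normal mixture with equispaced means and common sd."""
    means = equispaced_means(centre, spacing, len(weights))
    return sum(
        math.log(sum(w * normal_pdf(x, m, sd) for w, m in zip(weights, means)))
        for x in data
    )

def is_symmetric(weights, tol=1e-12):
    """True when weights mirrored about the centre are equal (the null hypothesis)."""
    return all(abs(a - b) <= tol for a, b in zip(weights, reversed(weights)))

def lr_statistic(loglik_unconstrained, loglik_constrained):
    """Likelihood ratio statistic from the two maximised log-likelihoods."""
    return 2.0 * (loglik_unconstrained - loglik_constrained)

# Hypothetical symmetric three-component mixture centred at zero.
symmetric_weights = [0.2, 0.6, 0.2]
sample = [0.3, -1.2, 2.1]
mirrored = [-x for x in sample]
ll_sample = mixture_loglik(sample, symmetric_weights, 0.0, 1.0, 1.0)
ll_mirrored = mixture_loglik(mirrored, symmetric_weights, 0.0, 1.0, 1.0)
```

Under symmetric weights the density is invariant to reflection about the centre, so a sample and its mirror image receive the same log-likelihood; skewed data are what allow the unconstrained fit to improve on the constrained one.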

preprint, 2012, arXiv

MultiLCIRT: An R package for multidimensional latent class item response models

We illustrate a class of Item Response Theory (IRT) models for binary and ordinal polytomous items and we describe an R package for dealing with these models, which is named MultiLCIRT. The models at issue extend traditional IRT models allowing for (i) multidimensionality and (ii) discreteness of latent traits. This class of models also allows for different parameterizations for the conditional distribution of the response variables given the latent traits, depending on both the type of link function and the constraints imposed on the discriminating and the difficulty item parameters. We illustrate how the proposed class of models may be estimated by the maximum likelihood approach via an Expectation-Maximization algorithm, which is implemented in the MultiLCIRT package, and we discuss in detail issues related to model selection. In order to illustrate this package, we analyze two datasets: one concerning binary items and referring to the measurement of ability in mathematics, and the other coming from the administration of ordinal polytomous items for the assessment of anxiety and depression. In the first application, we illustrate how to aggregate items into homogeneous groups through a model-based hierarchical clustering procedure, which is implemented in the proposed package. In the second application, we describe the steps to select a specific model having the best fit in our class of IRT models.
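To make the role of discrete latent traits in the EM algorithm concrete, the following sketch computes the E-step quantity for a unidimensional latent class Rasch model with binary items: the posterior probability of each latent class given one response pattern. It is written in Python for illustration and does not reproduce the MultiLCIRT package or its interface; all names and values are hypothetical.

```python
import math

def rasch_p(theta, difficulty):
    """Rasch probability of a correct response."""
    return 1.0 / (1.0 + math.exp(-(theta - difficulty)))

def posterior_class_probs(responses, support_points, class_weights, difficulties):
    """E-step: posterior probability of each latent class given a 0/1 response pattern.

    support_points[c] is the ability level of class c, class_weights[c] its
    prior probability, difficulties[j] the difficulty of item j.
    """
    joint = []
    for theta, weight in zip(support_points, class_weights):
        likelihood = weight
        for y, b in zip(responses, difficulties):
            p = rasch_p(theta, b)
            likelihood *= p if y == 1 else 1.0 - p
        joint.append(likelihood)
    total = sum(joint)  # manifest probability of the response pattern
    return [j / total for j in joint]

# Hypothetical two-class example: an all-correct pattern on three items.
posterior = posterior_class_probs([1, 1, 1], [-1.0, 1.0], [0.5, 0.5], [0.0, 0.0, 0.0])
```

In a full EM iteration these posteriors would weight the complete-data log-likelihood in the M-step, which updates the class weights, support points and item parameters.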

preprint, 2011, arXiv

Mixture latent autoregressive models for longitudinal data

Many relevant statistical and econometric models for the analysis of longitudinal data include a latent process to account for the unobserved heterogeneity between subjects in a dynamic fashion. Such a process may be continuous (typically an AR(1)) or discrete (typically a Markov chain). In this paper, we propose a model for longitudinal data which is based on a mixture of AR(1) processes with different means and correlation coefficients, but with equal variances. This model belongs to the class of models based on a continuous latent process, and thus it has a natural interpretation in many contexts of application, but it is more flexible than other models in this class, reaching a goodness-of-fit similar to that of a discrete latent process model with a reduced number of parameters. We show how to perform maximum likelihood estimation of the proposed model by the joint use of an Expectation-Maximisation algorithm and a Newton-Raphson algorithm, implemented by means of recursions developed in the hidden Markov literature. We also introduce a simple method to obtain standard errors for the parameter estimates and a criterion to choose the number of mixture components. The proposed approach is illustrated by an application to a longitudinal dataset, coming from the Health and Retirement Study, about self-evaluation of the health status by a sample of subjects. In this application, the response variable is ordinal, and both time-constant and time-varying individual covariates are available.
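The mixture structure described above can be sketched by evaluating the log-density of a path under a mixture of stationary Gaussian AR(1) processes with component-specific means and correlations and a common marginal variance. This treats the process as directly observed, which is a simplification: in the paper the process is latent and the responses are ordinal, so the actual likelihood requires the recursions mentioned in the abstract. Function names and values are hypothetical.

```python
import math

def normal_logpdf(x, mean, variance):
    return -0.5 * (math.log(2.0 * math.pi * variance) + (x - mean) ** 2 / variance)

def ar1_logdensity(series, mean, rho, variance):
    """Log-density of a stationary Gaussian AR(1) path with marginal variance `variance`."""
    logdens = normal_logpdf(series[0], mean, variance)
    innovation_var = variance * (1.0 - rho ** 2)  # keeps the marginal variance constant
    for prev, cur in zip(series, series[1:]):
        logdens += normal_logpdf(cur, mean + rho * (prev - mean), innovation_var)
    return logdens

def mixture_ar1_logdensity(series, weights, means, rhos, variance):
    """Log-density under a mixture of AR(1) components sharing one variance."""
    parts = [
        math.log(w) + ar1_logdensity(series, m, r, variance)
        for w, m, r in zip(weights, means, rhos)
    ]
    peak = max(parts)  # log-sum-exp for numerical stability
    return peak + math.log(sum(math.exp(p - peak) for p in parts))

# Hypothetical path and a two-component mixture.
path = [0.2, 0.5, 0.1]
single = ar1_logdensity(path, 0.0, 0.4, 1.0)
mixed = mixture_ar1_logdensity(path, [0.5, 0.5], [0.0, 0.0], [0.4, 0.4], 1.0)
```

When the two components coincide, the mixture log-density equals that of the single component, which gives a quick consistency check on the implementation.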
