Researcher profile

Silvia Bacci

Silvia Bacci contributes to research discovery and scholarly infrastructure.

Researcher. Affiliation not imported. Open to collaborate.

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
9 works
0 followers
5 topics
4 close collaborators

Published work

9 published item(s)

Preprint, 2021, arXiv

Conditioning on the pre-test versus gain score modeling: revisiting the controversy in a multilevel setting

We consider estimating the effect of a treatment on the progress of subjects tested both before and after treatment assignment. A vast literature compares the competing approaches of modeling the post-test score conditionally on the pre-test score versus modeling the difference, namely the gain score. Our contribution resides in analyzing the merits and drawbacks of the two approaches in a multilevel setting. This is relevant in many fields, for example education with students nested into schools. The multilevel structure raises peculiar issues related to the contextual effects and the distinction between individual-level and cluster-level treatment. We derive approximate analytical results and compare the two approaches by a simulation study. For an individual-level treatment our findings are in line with the literature, whereas for a cluster-level treatment we point out the key role of the cluster mean of the pre-test score, which favors the conditioning approach in settings with large clusters.
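As a rough illustration of the two competing estimators named in this abstract, the sketch below simulates a randomized pre-test/post-test study and estimates the treatment effect both ways. It is deliberately single-level (no clusters, no contextual effects), so it does not reproduce the multilevel analysis of the paper. The data-generating model, all coefficients, and the function names are assumptions made for illustration only.

```python
import random


def simulate(n=5000, effect=2.0, seed=0):
    """Simulate (pre, treat, post) rows for a randomized single-level study."""
    rng = random.Random(seed)
    rows = []
    for _ in range(n):
        pre = rng.gauss(50.0, 10.0)
        treat = 1.0 if rng.random() < 0.5 else 0.0
        # Hypothetical data-generating model: the post-test depends on the pre-test.
        post = 10.0 + 0.8 * pre + effect * treat + rng.gauss(0.0, 5.0)
        rows.append((pre, treat, post))
    return rows


def gain_score_estimate(rows):
    """Gain score approach: difference in mean gain (post - pre) between arms."""
    treated = [post - pre for pre, t, post in rows if t == 1.0]
    control = [post - pre for pre, t, post in rows if t == 0.0]
    return sum(treated) / len(treated) - sum(control) / len(control)


def solve(a, b):
    """Solve the linear system a x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    m = [row[:] + [rhs] for row, rhs in zip(a, b)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(m[r][col]))
        m[col], m[pivot] = m[pivot], m[col]
        for r in range(col + 1, n):
            f = m[r][col] / m[col][col]
            for c in range(col, n + 1):
                m[r][c] -= f * m[col][c]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (m[r][n] - sum(m[r][c] * x[c] for c in range(r + 1, n))) / m[r][r]
    return x


def conditioning_estimate(rows):
    """Conditioning approach: OLS coefficient of treat in post ~ 1 + pre + treat."""
    xs = [(1.0, pre, t) for pre, t, _ in rows]
    ys = [post for _, _, post in rows]
    # Normal equations X'X b = X'y for the three regressors.
    xtx = [[sum(x[i] * x[j] for x in xs) for j in range(3)] for i in range(3)]
    xty = [sum(x[i] * y for x, y in zip(xs, ys)) for i in range(3)]
    return solve(xtx, xty)[2]


if __name__ == "__main__":
    data = simulate()
    print("gain score estimate:  ", gain_score_estimate(data))
    print("conditioning estimate:", conditioning_estimate(data))
```

Under randomization both estimators target the same effect in this toy setup. The differences discussed in the abstract arise from the multilevel structure and the cluster mean of the pre-test score, which this sketch leaves out.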

Preprint, 2012, arXiv

A causal analysis of mother's education on birth inequalities

We propose a causal analysis of the effect of the mother's educational level on the health status of the newborn, in terms of gestational weeks and weight. The analysis is based on a finite mixture structural equation model, the parameters of which have a causal interpretation. The model is applied to a dataset of almost ten thousand deliveries collected in an Italian region. The analysis confirms that standard regression overestimates the impact of education on child health. With respect to the current economic literature, our findings indicate that only high education has positive consequences on child health, implying that policy efforts in education should have benefits for welfare.

Preprint, 2012, arXiv

A class of Multidimensional Latent Class IRT models for ordinal polytomous item responses

We propose a class of Item Response Theory models for items with ordinal polytomous responses, which extends an existing class of multidimensional models for dichotomously-scored items measuring more than one latent trait. In the proposed approach, the random vector used to represent the latent traits is assumed to have a discrete distribution with support points corresponding to different latent classes in the population. We also allow for different parameterizations for the conditional distribution of the response variables given the latent traits - such as those adopted in the Graded Response model, in the Partial Credit model, and in the Rating Scale model - depending on both the type of link function and the constraints imposed on the item parameters. For the proposed models we outline how to perform maximum likelihood estimation via the Expectation-Maximization algorithm. Moreover, we suggest a strategy for model selection which is based on a series of steps consisting of selecting specific features, such as the number of latent dimensions, the number of latent classes, and the specific parametrization. In order to illustrate the proposed approach, we analyze data deriving from a study on anxiety and depression as perceived by oncological patients.
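To make the role of the link function concrete, the sketch below computes response-category probabilities for one ordinal item at a given latent trait value under two standard formulations: global (cumulative) logits, as in a Graded Response type model, and local (adjacent-category) logits, as in a Partial Credit type model. The parameter names `theta`, `gamma`, and `betas` and the exact parameterization are assumptions for illustration and may differ from the notation used in the paper.

```python
import math


def logistic(x):
    """Standard logistic function."""
    return 1.0 / (1.0 + math.exp(-x))


def graded_response_probs(theta, gamma, betas):
    """Category probabilities under global (cumulative) logits.

    Assumes logit P(Y >= y | theta) = gamma * (theta - betas[y - 1])
    for y = 1..len(betas), with gamma > 0 and betas in increasing order.
    """
    cum = [1.0] + [logistic(gamma * (theta - b)) for b in betas] + [0.0]
    # P(Y = y) is the difference between consecutive cumulative probabilities.
    return [cum[y] - cum[y + 1] for y in range(len(betas) + 1)]


def partial_credit_probs(theta, gamma, betas):
    """Category probabilities under local (adjacent-category) logits.

    Assumes log[P(Y = y) / P(Y = y - 1)] = gamma * (theta - betas[y - 1]).
    """
    scores = [0.0]
    for b in betas:
        scores.append(scores[-1] + gamma * (theta - b))
    top = max(scores)  # subtract the maximum for numerical stability
    weights = [math.exp(s - top) for s in scores]
    total = sum(weights)
    return [w / total for w in weights]


if __name__ == "__main__":
    # Hypothetical item with four ordered categories.
    print(graded_response_probs(0.3, 1.2, [-1.0, 0.0, 1.5]))
    print(partial_credit_probs(0.3, 1.2, [-1.0, 0.0, 1.5]))
```

In a latent class version of these models, `theta` would take one of a finite number of support points, one per latent class, rather than a continuous value. A Rating Scale type constraint would additionally force the threshold spacing in `betas` to be shared across items.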

Preprint, 2012, arXiv

A comparison of some criteria for states selection in the latent Markov model for longitudinal data

We compare different selection criteria to choose the number of latent states of a multivariate latent Markov model for longitudinal data. This model is based on an underlying Markov chain to represent the evolution of a latent characteristic of a group of individuals over time. Then, the response variables observed at the different occasions are assumed to be conditionally independent given this chain. Maximum likelihood estimation of the model is carried out through an Expectation-Maximization algorithm based on forward-backward recursions, which are well known in the hidden Markov literature for time series. The selection criteria we consider in our comparison are based on penalized versions of the maximum log-likelihood or on the posterior probabilities of belonging to each latent state, that is, the conditional probability of the latent state given the observed data. A Monte Carlo simulation study shows that the information criteria based on the log-likelihood generally perform better than the classification-based criteria. This is because the latter tend to underestimate the true number of latent states, especially in the univariate case.
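The likelihood that the penalized criteria build on can be evaluated with the forward recursion mentioned in the abstract. The sketch below does this for a univariate, time-homogeneous latent Markov model with one categorical response, and then computes AIC and BIC from a given log-likelihood. It covers likelihood evaluation only, not the EM estimation step or the classification-based criteria, and the function names, the toy parameter values, and the choice of AIC and BIC as the penalized criteria are assumptions for illustration.

```python
import math
from itertools import product


def sequence_likelihood(seq, init, trans, emit):
    """Likelihood of one response sequence via the forward recursion.

    init[u]     : initial probability of latent state u
    trans[u][v] : transition probability from state u to state v
    emit[u][y]  : probability of response category y given state u
    """
    k = len(init)
    alpha = [init[u] * emit[u][seq[0]] for u in range(k)]
    for y in seq[1:]:
        alpha = [
            sum(alpha[u] * trans[u][v] for u in range(k)) * emit[v][y]
            for v in range(k)
        ]
    return sum(alpha)


def log_likelihood(data, init, trans, emit):
    """Log-likelihood of independent sequences (one per individual)."""
    return sum(math.log(sequence_likelihood(s, init, trans, emit)) for s in data)


def aic(loglik, n_params):
    """Akaike information criterion: smaller is better."""
    return -2.0 * loglik + 2.0 * n_params


def bic(loglik, n_params, n):
    """Bayesian information criterion for a sample of n individuals."""
    return -2.0 * loglik + n_params * math.log(n)


if __name__ == "__main__":
    # Hypothetical two-state model with a binary response.
    init = [0.6, 0.4]
    trans = [[0.9, 0.1], [0.2, 0.8]]
    emit = [[0.7, 0.3], [0.1, 0.9]]
    # Sanity check: likelihoods of all length-3 sequences sum to one.
    print(sum(sequence_likelihood(s, init, trans, emit)
              for s in product(range(2), repeat=3)))
```

In practice one would fit the model for each candidate number of states and keep the one with the smallest criterion value. For long sequences the unscaled recursion can underflow, so a scaled or log-domain version would be preferable.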

Preprint, 2012, arXiv

Joint Assessment of the Differential Item Functioning and Latent Trait Dimensionality of Students' National Tests

Within the educational context, students' assessment tests are routinely validated through Item Response Theory (IRT) models which assume unidimensionality and absence of Differential Item Functioning (DIF). In this paper, we investigate if such assumptions hold for two national tests administered in Italy to middle school students in June 2009: the Italian Test and the Mathematics Test. To this aim, we rely on an extended class of multidimensional latent class IRT models characterised by: (i) a two-parameter logistic parameterisation for the conditional probability of a correct response, (ii) latent traits represented through a random vector with a discrete distribution, and (iii) the inclusion of (uniform) DIF to account for students' gender and geographical area. A classification of the items into unidimensional groups is also proposed and represented by a dendrogram, which is obtained from a hierarchical clustering algorithm. The results provide evidence for DIF effects for both Tests. Besides, the assumption of unidimensionality is strongly rejected for the Italian Test, whereas it is reasonable for the Mathematics Test.

Preprint, 2012, arXiv

Mixtures of equispaced normal distributions and their use for testing symmetry in univariate data

Given a random sample of observations, mixtures of normal densities are often used to estimate the unknown continuous distribution from which the data come. Here we propose the use of this semiparametric framework for testing symmetry about an unknown value. More precisely, we show how the null hypothesis of symmetry may be formulated in terms of a normal mixture model, with weights about the centre of symmetry constrained to be equal to one another. The resulting model is nested in a more general unconstrained one, with the same number of mixture components and free weights. Therefore, after having maximised the constrained and unconstrained log-likelihoods by means of a suitable algorithm, such as Expectation-Maximisation, symmetry is tested against skewness through a likelihood ratio statistic. The performance of the proposed mixture-based test is illustrated through a Monte Carlo simulation study, where we compare two versions of the test, based on different criteria to select the number of mixture components, with the traditional one based on the third standardised moment. An illustrative example is also given that focuses on real data.

Preprint, 2012, arXiv

MultiLCIRT: An R package for multidimensional latent class item response models

We illustrate a class of Item Response Theory (IRT) models for binary and ordinal polytomous items and we describe an R package for dealing with these models, which is named MultiLCIRT. The models at issue extend traditional IRT models allowing for (i) multidimensionality and (ii) discreteness of latent traits. This class of models also allows for different parameterizations for the conditional distribution of the response variables given the latent traits, depending on both the type of link function and the constraints imposed on the discriminating and the difficulty item parameters. We illustrate how the proposed class of models may be estimated by the maximum likelihood approach via an Expectation-Maximization algorithm, which is implemented in the MultiLCIRT package, and we discuss in detail issues related to model selection. In order to illustrate this package, we analyze two datasets: one concerning binary items and referring to the measurement of ability in mathematics, and the other one coming from the administration of ordinal polytomous items for the assessment of anxiety and depression. In the first application, we illustrate how to aggregate items into homogeneous groups through a model-based hierarchical clustering procedure, which is implemented in the proposed package. In the second application, we describe the steps to select a specific model having the best fit in our class of IRT models.

Preprint, 2011, arXiv

Mixture latent autoregressive models for longitudinal data

Many relevant statistical and econometric models for the analysis of longitudinal data include a latent process to account for the unobserved heterogeneity between subjects in a dynamic fashion. Such a process may be continuous (typically an AR(1)) or discrete (typically a Markov chain). In this paper, we propose a model for longitudinal data which is based on a mixture of AR(1) processes with different means and correlation coefficients, but with equal variances. This model belongs to the class of models based on a continuous latent process, and thus has a natural interpretation in many contexts of application, but it is more flexible than other models in this class, reaching a goodness-of-fit similar to that of a discrete latent process model, with a reduced number of parameters. We show how to perform maximum likelihood estimation of the proposed model by the joint use of an Expectation-Maximisation algorithm and a Newton-Raphson algorithm, implemented by means of recursions developed in the hidden Markov literature. We also introduce a simple method to obtain standard errors for the parameter estimates and a criterion to choose the number of mixture components. The proposed approach is illustrated by an application to a longitudinal dataset, coming from the Health and Retirement Study, about self-evaluation of the health status by a sample of subjects. In this application, the response variable is ordinal, and both time-constant and time-varying individual covariates are available.