Researcher profile

Salvatore Ingrassia

Salvatore Ingrassia contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
7works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

7 published item(s)

preprint2015arXiv

Model-based clustering via linear cluster-weighted models

A novel family of twelve mixture models with random covariates, nested in the linear $t$ cluster-weighted model (CWM), is introduced for model-based clustering. The linear $t$ CWM was recently presented as a robust alternative to the better known linear Gaussian CWM. The proposed family of models provides a unified framework that also includes the linear Gaussian CWM as a special case. Maximum likelihood parameter estimation is carried out within the EM framework, and both the BIC and the ICL are used for model selection. A simple and effective hierarchical random initialization is also proposed for the EM algorithm. The novel model-based clustering technique is illustrated in some applications to real data. Finally, a simulation study for evaluating the performance of the BIC and the ICL is presented.

preprint2013arXiv

Fitting Bivariate Mixed-Type Data via the Generalized Linear Exponential Cluster-Weighted Model

The cluster-weighted model (CWM) is a mixture model with random covariates which allows for flexible clustering and density estimation of a random vector composed by a response variable and by a set of covariates. In this class of models, the generalized linear exponential CWM is here introduced especially for modeling bivariate data of mixed-type. Its natural counterpart, in the family of latent class models, is also defined. Maximum likelihood parameter estimates are derived using the EM algorithm and model selection is carried out using the Bayesian information criterion (BIC). Artificial and real data are finally considered to exemplify and appreciate the proposed model.

preprint2013arXiv

Maximum likelihood estimation in constrained parameter spaces for mixtures of factor analyzers

Mixtures of factor analyzers are becoming more and more popular in the area of model based clustering of high-dimensional data. According to the likelihood approach in data modeling, it is well known that the unconstrained log-likelihood function may present spurious maxima and singularities and this is due to specific patterns of the estimated covariance structure, when their determinant approaches 0. To reduce such drawbacks, in this paper we introduce a procedure for the parameter estimation of mixtures of factor analyzers, which maximizes the likelihood function in a constrained parameter space. We then analyze and measure its performance, compared to the usual non-constrained approach, via some simulations and applications to real data sets.

preprint2013arXiv

Maximum Likelihood Estimation of Gaussian Cluster Weighted Models and Relationships with Mixtures of Regression

Cluster-weighted modeling (CWM) is a mixture approach for modeling the joint probability of a response variable and a set of explanatory variables. The parameters are estimated by means of the expectation-maximization algorithm according to the maximum likelihood approach. Under Gaussian assumptions, we analyse the complete-data likelihood function of cluster weighted models. Further, under suitable hypotheses we show that the maximization of the likelihood function of Gaussian cluster weighted models leads to the same parameter estimates of finite mixtures of regression and finite mixtures of regression with concomitant variables. In this sense, the latter ones can be considered as nested models of Gaussian cluster weighted models.

preprint2012arXiv

Clustering and Classification via Cluster-Weighted Factor Analyzers

In model-based clustering and classification, the cluster-weighted model constitutes a convenient approach when the random vector of interest constitutes a response variable Y and a set p of explanatory variables X. However, its applicability may be limited when p is high. To overcome this problem, this paper assumes a latent factor structure for X in each mixture component. This leads to the cluster-weighted factor analyzers (CWFA) model. By imposing constraints on the variance of Y and the covariance matrix of X, a novel family of sixteen CWFA models is introduced for model-based clustering and classification. The alternating expectation-conditional maximization algorithm, for maximum likelihood estimation of the parameters of all the models in the family, is described; to initialize the algorithm, a 5-step hierarchical procedure is proposed, which uses the nested structures of the models within the family and thus guarantees the natural ranking among the sixteen likelihoods. Artificial and real data show that these models have very good clustering and classification performance and that the algorithm is able to recover the parameters very well.

preprint2012arXiv

Generalized Linear Gaussian Cluster-Weighted Modeling

Cluster-Weighted Modeling (CWM) is a flexible mixture approach for modeling the joint probability of data coming from a heterogeneous population as a weighted sum of the products of marginal distributions and conditional distributions. In this paper, we introduce a wide family of Cluster Weighted models in which the conditional distributions are assumed to belong to the exponential family with canonical links which will be referred to as Generalized Linear Gaussian Cluster Weighted Models. Moreover, we show that, in a suitable sense, mixtures of generalized linear models can be considered as nested in Generalized Linear Gaussian Cluster Weighted Models. The proposal is illustrated through many numerical studies based on both simulated and real data sets.

preprint2011arXiv

Local statistical modeling by cluster-weighted

We investigate statistical properties of Cluster-Weighted Modeling, which is a framework for supervised learning originally developed in order to recreate a digital violin with traditional inputs and realistic sound. The analysis is carried out in comparison with Finite Mixtures of Regression models. Based on some geometrical arguments, we highlight that Cluster-WeightedModeling provides a quite general framework for local statistical modeling. Theoretical results are illustrated on the ground of some numerical simulations.