Researcher profile

Giuseppe Arbia

Giuseppe Arbia contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2021arXiv

Spatial sampling design to improve the efficiency of the estimation of the critical parameters of the SARS-CoV-2 epidemic

The pandemic linked to COVID-19 infection represents an unprecedented clinical and healthcare challenge for many medical researchers attempting to prevent its worldwide spread. This pandemic also represents a major challenge for statisticians involved in quantifying the phenomenon and in offering timely tools for the monitoring and surveillance of critical pandemic parameters. In a recent paper, Alleva et al. (2020) proposed a two-stage sample design to build a continuous-time surveillance system designed to correctly quantify the number of infected people through an indirect sampling mechanism that could be repeated in several waves over time to capture different target variables in the different stages of epidemic development. The proposed method exploits the indirect sampling (Lavalle, 2007; Kiesl, 2016) method employed in the estimation of rare and elusive populations (Borchers, 2009; Lavallée and Rivest, 2012) and a capture/recapture mechanism (Sudman, 1988; Thompson and Seber, 1996). In this paper, we extend the proposal of Alleva et al. (2020) to include a spatial sampling mechanism (Müller, 1998; Grafström et al., 2012, Jauslin and Tillè, 2020) in the process of data collection to achieve the same level of precision with fewer sample units, thereby facilitating the process of data collection in a situation where timeliness and costs are crucial elements. We present the basic idea of the new sample design, analytically prove the theoretical properties of the associated estimators and show the relative advantages through a systematic simulation study where all the typical elements of an epidemic are accounted for.

preprint2020arXiv

A Note on Early Epidemiological Analysis of Coronavirus Disease 2019 Outbreak using Crowdsourced Data

Crowdsourcing data can prove of paramount importance in monitoring and controlling the spread of infectious diseases. The recent paper by Sun, Chen and Viboud (2020) is important because it contributes to the understanding of the epidemiology and of the spreading of Covid-19 in a period when most of the epidemic characteristics are still unknown. However, the use of crowdsourcing data raises a number of problems from the statistical point of view which run the risk of invalidating the results and of biasing estimation and hypothesis testing. While the work by Sun, Chen and Viboud (2020) has to be commended, given the importance of the topic for worldwide health security, in this paper we deem important to remark the presence of the possible sources of statistical biases and to point out possible solutions to them

preprint2020arXiv

Observed and estimated prevalence of Covid-19 in Italy: Is it possible to estimate the total cases from medical swabs data?

During the current Covid-19 pandemic in Italy, official data are collected with medical swabs following a pure convenience criterion which, at least in an early phase, has privileged the exam of patients showing evident symptoms. However, there are evidences of a very high proportion of asymptomatic patients (e. g. Aguilar et al., 2020; Chugthai et al, 2020; Li, et al., 2020; Mizumoto et al., 2020a, 2020b and Yelin et al., 2020). In this situation, in order to estimate the real number of infected (and to estimate the lethality rate), it should be necessary to run a properly designed sample survey through which it would be possible to calculate the probability of inclusion and hence draw sound probabilistic inference. Some researchers proposed estimates of the total prevalence based on various approaches, including epidemiologic models, time series and the analysis of data collected in countries that faced the epidemic in earlier time (Brogi et al., 2020). In this paper, we propose to estimate the prevalence of Covid-19 in Italy by reweighting the available official data published by the Istituto Superiore di Sanità so as to obtain a more representative sample of the Italian population. Reweighting is a procedure commonly used to artificially modify the sample composition so as to obtain a distribution which is more similar to the population (Valliant et al., 2018). In this paper, we will use post-stratification of the official data, in order to derive the weights necessary for reweighting them using age and gender as post-stratification variables thus obtaining more reliable estimation of prevalence and lethality.

preprint2020arXiv

Post-sampling crowdsourced data to allow reliable statistical inference: the case of food price indices in Nigeria

Sound policy and decision making in developing countries is often limited by the lack of timely and reliable data. Crowdsourced data may provide a valuable alternative for data collection and analysis, e. g. in remote and insecure areas or of poor accessibility where traditional methods are difficult or costly. However, crowdsourced data are not directly usable to draw sound statistical inference. Indeed, its use involves statistical problems because data do not obey any formal sampling design and may also suffer from various non-sampling errors. To overcome this, we propose the use of a special form of post-stratification with which crowdsourced data are reweighted prior their use in an inferential context. An example in Nigeria illustrates the applicability of the method.

preprint2014arXiv

Least quartic Regression Criterion with Application to Finance

This article proposes a new method for the estimation of the parameters of a simple linear regression model which accounts for the role of co-moments in non-Gaussian distributions being based on the minimization of a quartic loss function. Although the proposed method is very general, we examine its application to finance. In fact, in this field the contribution of the co-moments in explaining the return-generating process is of paramount importance when evaluating the systematic risk of an asset within the framework of the Capital Asset Pricing Model (CAPM). The suggested new method contributes to this literature by showing that, in the presence of non-normality, the regression slope can be expressed as a function of the co-kurtosis between the returns of a risky asset and the market proxy. The paper provides an illustration of the method based on some empirical financial data referring to 40 industrial sector assets rates of the Italian stock market.

preprint2013arXiv

A bivariate marginal likelihood specification of spatial econometric modeling of very large datasets

This paper proposes a bivariate marginal likelihood specification of spatial econometrics models that simplifies the derivation of the log-likelihood and leads to a closed form expression for the estimation of the parameters. With respect to the more traditional specifications of spatial autoregressive models, our method avoids the arbitrariness of the specification of a weight matrix, presents analytical and computational advantages and provides interesting interpretative insights. We establish small sample and asymptotic properties of the estimators and we derive the associated Fisher information matrix needed in confidence interval estimation and hypothesis testing.