Researcher profile

Wenceslao González-Manteiga

Wenceslao González-Manteiga contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2022arXiv

A Comparative Review of Specification Tests for Diffusion Models

Diffusion models play an essential role in modeling continuous-time stochastic processes in the financial field. Therefore, several proposals have been developed in the last decades to test the specification of stochastic differential equations. We provide a survey to collect some developments on goodness-of-fit tests for diffusion models and implement these methods to illustrate their finite sample behavior, regarding size and power, by means of a simulation study. We also apply the ideas of distance correlation for testing independence to propose a test for the parametric specification of diffusion models, comparing its performance with the other methods and analyzing the effect of the curse of dimensionality. As real data examples, treasury securities with different maturities are considered.

preprint2022arXiv

Estimation and Specification Test for Diffusion Models with Stochastic Volatility

Given the importance of continuous-time stochastic volatility models to describe the dynamics of interest rates, we propose a goodness-of-fit test for the parametric form of the drift and diffusion functions, based on a marked empirical process of the residuals. The test statistics are constructed using a continuous functional (Kolmogorov-Smirnov and Cramér-von Mises) over the empirical processes. In order to evaluate the proposed tests, we implement a simulation study, where a bootstrap method is considered for the calibration of the tests. As the estimation of diffusion models with stochastic volatility based on discretely sampled data has proven difficult, we address this issue by means of a Monte Carlo study for different estimation procedures. Finally, an application of the procedures to real data is provided.

preprint2022arXiv

Estimators for covariate-adjusted ROC curves with missing biomarkers values

In this paper, we present three estimators of the ROC curve when missing observations arise among the biomarkers. Two of the procedures assume that we have covariates that allow to estimate the propensity and the estimators are obtained using an inverse probability weighting method or a smoothed version of it. The other one assumes that the covariates are related to the biomarkers through a regression model which enables us to construct convolution--based estimators of the distribution and quantile functions. Consistency results are obtained under mild conditions. Through a numerical study we evaluate the finite sample performance of the different proposals. A real data set is also analysed.

preprint2022arXiv

Functional Classification of Bitcoin Addresses

This paper proposes a classification model for predicting the main activity of bitcoin addresses based on their balances. Since the balances are functions of time, we apply methods from functional data analysis; more specifically, the features of the proposed classification model are the functional principal components of the data. Classifying bitcoin addresses is a relevant problem for two main reasons: to understand the composition of the bitcoin market, and to identify addresses used for illicit activities. Although other bitcoin classifiers have been proposed, they focus primarily on network analysis rather than curve behavior. Our approach, on the other hand, does not require any network information for prediction. Furthermore, functional features have the advantage of being straightforward to build, unlike expert-built features. Results show improvement when combining functional features with scalar features, and similar accuracy for the models using those features separately, which points to the functional model being a good alternative when domain-specific knowledge is not available.

preprint2022arXiv

Novel specification tests for additive concurrent model formulation based on martingale difference divergence

Novel significance tests are proposed for the quite general additive concurrent model formulation without the need of model, error structure preliminary estimation or the use of tuning parameters. Making use of the martingale difference divergence coefficient, we propose new tests to measure the conditional mean independence in the concurrent model framework taking under consideration all observed time instants. In particular, global dependence tests to quantify the effect of a group of covariates in the response as well as partial ones to apply covariates selection are introduced. Their asymptotic distribution is obtained on each case and a bootstrap algorithm is proposed to compute its p-values in practice. These new procedures are tested by means of simulation studies and some real datasets analysis.

preprint2021arXiv

A test for comparing conditional ROC curves with multidimensional covariates

The comparison of Receiver Operating Characteristic (ROC) curves is frequently used in the literature to compare the discriminatory capability of different classification procedures based on diagnostic variables. The performance of these variables can be sometimes influenced by the presence of other covariates, and thus they should be taken into account when making the comparison. A new non-parametric test is proposed here for testing the equality of two or more dependent ROC curves conditioned to the value of a multidimensional covariate. Projections are used for transforming the problem into a one-dimensional approach easier to handle. Simulations are carried out to study the practical performance of the new methodology. A real data set of patients with Pleural Effusion is analysed to illustrate this procedure.

preprint2020arXiv

A goodness-of-fit test for the functional linear model with scalar response

In this work, a goodness-of-fit test for the null hypothesis of a functional linear model with scalar response is proposed. The test is based on a generalization to the functional framework of a previous one, designed for the goodness-of-fit of regression models with multivariate covariates using random projections. The test statistic is easy to compute using geometrical and matrix arguments, and simple to calibrate in its distribution by a wild bootstrap on the residuals. The finite sample properties of the test are illustrated by a simulation study for several types of basis and under different alternatives. Finally, the test is applied to two datasets for checking the assumption of the functional linear model and a graphical tool is introduced. Supplementary materials are available online.

preprint2020arXiv

A test for directional-linear independence, with applications to wildfire orientation and size

The relation between wildfire orientation and size is analyzed by means of a nonparametric test for directional-linear independence. The test statistic is designed for assessing the independence between two random variables of different nature, specifically directional (fire orientation, circular or spherical, as particular cases) and linear (fire size measured as burnt area, scalar), based on a directional-linear nonparametric kernel density estimator. In order to apply the proposed methodology in practice, a resampling procedure based on permutations and bootstrap is provided. The finite sample performance of the test is assessed by a simulation study, comparing its behavior with other classical tests for the circular-linear case. Finally, the test is applied to analyze wildfire data from Portugal.

preprint2020arXiv

Bootstrap independence test for functional linear models

Functional data have been the subject of many research works over the last years. Functional regression is one of the most discussed issues. Specifically, significant advances have been made for functional linear regression models with scalar response. Let $(\mathcal{H},<\cdot,\cdot>)$ be a separable Hilbert space. We focus on the model $Y=<Θ,X>+b+\varepsilon$, where $Y$ and $\varepsilon$ are real random variables, $X$ is an $\mathcal{H}$-valued random element, and the model parameters $b$ and $Θ$ are in $\mathbb{R}$ and $\mathcal{H}$, respectively. Furthermore, the error satisfies that $E(\varepsilon|X)=0$ and $E(\varepsilon^2|X)=σ^2<\infty$. A consistent bootstrap method to calibrate the distribution of statistics for testing $H_0: Θ=0$ versus $H_1: Θ\neq 0$ is developed. The asymptotic theory, as well as a simulation study and a real data application illustrating the usefulness of our proposed bootstrap in practice, is presented.

preprint2020arXiv

Central limit theorems for directional and linear random variables with applications

A central limit theorem for the integrated squared error of the directional-linear kernel density estimator is established. The result enables the construction and analysis of two testing procedures based on squared loss: a nonparametric independence test for directional and linear random variables and a goodness-of-fit test for parametric families of directional-linear densities. Limit distributions for both test statistics, and a consistent bootstrap strategy for the goodness-of-fit test, are developed for the directional-linear case and adapted to the directional-directional setting. Finite sample performance for the goodness-of-fit test is illustrated in a simulation study. This test is also applied to datasets from biology and environmental sciences.

preprint2020arXiv

Exploring wind direction and SO2 concentration by circular-linear density estimation

The study of environmental problems usually requires the description of variables with different nature and the assessment of relations between them. In this work, an algorithm for flexible estimation of the joint density for a circular-linear variable is proposed. The method is applied for exploring the relation between wind direction and SO2 concentration in a monitoring station close to a power plant located in Galicia (NW-Spain), in order to compare the effectiveness of precautionary measures for pollutants reduction in two different years.

preprint2020arXiv

Goodness-of-fit tests for functional linear models based on integrated projections

Functional linear models are one of the most fundamental tools to assess the relation between two random variables of a functional or scalar nature. This contribution proposes a goodness-of-fit test for the functional linear model with functional response that neatly adapts to functional/scalar responses/predictors. In particular, the new goodness-of-fit test extends a previous proposal for scalar response. The test statistic is based on a convenient regularized estimator, is easy to compute, and is calibrated through an efficient bootstrap resampling. A graphical diagnostic tool, useful to visualize the deviations from the model, is introduced and illustrated with a novel data application. The R package goffda implements the proposed methods and allows for the reproducibility of the data application.

preprint2020arXiv

Kernel density estimation for directional-linear data

A nonparametric kernel density estimator for directional-linear data is introduced. The proposal is based on a product kernel accounting for the different nature of both (directional and linear) components of the random vector. Expressions for bias, variance and Mean Integrated Squared Error (MISE) are derived, jointly with an asymptotic normality result for the proposed estimator. For some particular distributions, an explicit formula for the MISE is obtained and compared with its asymptotic version, both for directional and directional-linear kernel density estimators. In this same setting a closed expression for the bootstrap MISE is also derived.

preprint2020arXiv

Robust location estimators in regression models with covariates and responses missing at random

This paper deals with robust marginal estimation under a general regression model when missing data occur in the response and also in some of covariates. The target is a marginal location parameter which is given through an $M-$functional. To obtain robust Fisher--consistent estimators, properly defined marginal distribution function estimators are considered. These estimators avoid the bias due to missing values by assuming a missing at random condition. Three methods are considered to estimate the marginal distribution function which allows to obtain the $M-$location of interest: the well-known inverse probability weighting, a convolution--based method that makes use of the regression model and an augmented inverse probability weighting procedure that prevents against misspecification. The robust proposed estimators and the classical ones are compared through a numerical study under different missing models including clean and contaminated samples. We illustrate the estimators behaviour under a nonlinear model. A real data set is also analysed.

preprint2020arXiv

Smoothing-based tests with directional random variables

Testing procedures for assessing specific parametric model forms, or for checking the plausibility of simplifying assumptions, play a central role in the mathematical treatment of the uncertain. No certain answers are obtained by testing methods, but at least the uncertainty of these answers is properly quantified. This is the case for tests designed on the two most general data generating mechanisms in practice: distribution/density and regression models. Testing proposals are usually formulated on the Euclidean space, but important challenges arise in non-Euclidean settings, such as when directional variables (i.e., random vectors on the hypersphere) are involved. This work reviews some of the smoothing-based testing procedures for density and regression models that comprise directional variables. The asymptotic distributions of the revised proposals are presented, jointly with some numerical illustrations justifying the need of employing resampling mechanisms for effective test calibration.

preprint2020arXiv

Testing parametric models in linear-directional regression

This paper presents a goodness-of-fit test for parametric regression models with scalar response and directional predictor, that is, a vector on a sphere of arbitrary dimension. The testing procedure is based on the weighted squared distance between a smooth and a parametric regression estimator, where the smooth regression estimator is obtained by a projected local approach. Asymptotic behavior of the test statistic under the null hypothesis and local alternatives is provided, jointly with a consistent bootstrap algorithm for application in practice. A simulation study illustrates the performance of the test in finite samples. The procedure is applied to test a linear model in text mining.