Researcher profile

Justo Puerto

Justo Puerto contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
13works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

13 published item(s)

preprint2022arXiv

A Network Model for Multiple Selection Questions in Opinion Surveys

Opinion surveys can contain closed questions to which respondents can give multiple answers. We propose to model these data as networks in which vertices are eligible items and arcs are respondents. This representation opens up the possibility of using complex networks methodologies to retrieve information and most prominently, the possibility of using clustering/community detection techniques to reduce data complexity. We will take advantage of the implicit null hypothesis of the modularity function, namely, that items are chosen without any preferential pairing, to show how the hypothesis can be tested through the usual calculation of p-values. We illustrate the methodology applying it to Eurobarometer data. There, a question about national concerns can receive up to two selections. We will show that community clustering groups together concerns that can be interpreted in consistent way and in general terms, such as Economy, Security and Welfare issues. Moreover, we will show that in this way cleavages between social sectors can be determined.

preprint2021arXiv

A combinatorial optimization approach to scenario filtering in portfolio selection

Recent studies stressed the fact that covariance matrices computed from empirical financial time series appear to contain a high amount of noise. This makes the classical Markowitz Mean-Variance Optimization model unable to correctly evaluate the performance associated to selected portfolios. Since the Markowitz model is still one of the most used practitioner-oriented tool, several filtering methods have been proposed in the literature to fix the problem. Among them, the two most promising ones refer to the Random Matrix Theory or to the Power Mapping strategy. The basic idea of these methods is to transform the correlation matrix maintaining the Mean-Variance Optimization model. However, experimental analysis shows that these two strategies are not adequately effective when applied to real financial datasets. In this paper we propose an alternative filtering method based on Combinatorial Optimization. We advance a new Mixed Integer Quadratic Programming model to filter those observations that may influence the performance of a portfolio in the future. We discuss the properties of this new model and we test it on some real financial datasets. We compare the out-of-sample performance of our portfolios with the one of the portfolios provided by the two above mentioned alternative strategies. We show that our method outperforms them. Although our model can be solved efficiently with standard optimization solvers the computational burden increases for large datasets. To overcome this issue we also propose a heuristic procedure that empirically showed to be both efficient and effective.

preprint2020arXiv

A Mathematical Programming approach to Binary Supervised Classification with Label Noise

In this paper we propose novel methodologies to construct Support Vector Machine -based classifiers that takes into account that label noises occur in the training sample. We propose different alternatives based on solving Mixed Integer Linear and Non Linear models by incorporating decisions on relabeling some of the observations in the training dataset. The first method incorporates relabeling directly in the SVM model while a second family of methods combines clustering with classification at the same time, giving rise to a model that applies simultaneously similarity measures and SVM. Extensive computational experiments are reported based on a battery of standard datasets taken from UCI Machine Learning repository, showing the effectiveness of the proposed approaches.

preprint2020arXiv

On hub location problems in geographically flexible networks

In this paper we propose an extension of the Uncapacitated Hub Location Problem where the potential positions of the hubs are not fixed in advance. Instead, they are allowed to belong to a region around an initial discrete set of nodes. We give a general framework in which the collection, transportation and distribution costs are based on norm-based distances and the hub-activation set-up costs depend, not only on the the location of the hub that are opened but also on the size of the region where they are placed. Two alternative mathematical programming formulations are proposed. The first one is a compact formulation while the second one involves a family of constraints of exponential size that we separate efficiently giving rise to a branch-and-cut algorithm. The results of an extensive computational experience are reported showing the advantages of each of the approaches.

preprint2020arXiv

On the multisource hyperplanes location problem to fitting set of points

In this paper we study the problem of locating a given number of hyperplanes minimizing an objective function of the closest distances from a set of points. We propose a general framework for the problem in which norm-based distances between points and hyperplanes are aggregated by means of ordered median functions. A compact Mixed Integer Linear (or Non Linear) programming formulation is presented for the problem and also an extended set partitioning formulation with an exponential number of variables is derived. We develop a column generation procedure embedded within a branch-and-price algorithm for solving the problem by adequately performing its preprocessing, pricing and branching. We also analyze geometrically the optimal solutions of the problem, deriving properties which are exploited to generate initial solutions for the proposed algorithms. Finally, the results of an extensive computational experience are reported. The issue of scalability is also addressed showing theoretical upper bounds on the errors assumed by replacing the original datasets by aggregated versions.

preprint2020arXiv

Proceedings of the X International Workshop on Locational Analysis and Related Problems

The International Workshop on Locational Analysis and Related Problems will take place during January 23-24, 2020 in Seville (Spain). It is organized by the Spanish Location Network and the Location Group GELOCA from the Spanish Society of Statistics and Operations Research(SEIO). The Spanish Location Network is a group of more than 140 researchers from several Spanish universities organized into 7 thematic groups. The Network has been funded by the Spanish Government since 2003. One of the main activities of the Network is a yearly meeting aimed at promoting the communication among its members and between them and other researchers, and to contribute to the development of the location field and related problems. The last meetings have taken place in Cádiz (January 20-February 1, 2019), Segovia (September 27-29, 2017), Málaga (September 14-16, 2016), Barcelona (November 25-28, 2015), Sevilla (October 1-3, 2014), Torremolinos (Málaga, June 19-21, 2013), Granada (May 10-12, 2012), Las Palmas de Gran Canaria (February 2-5, 2011) and Sevilla (February 1-3, 2010). The topics of interest are location analysis and related problems. This includes location models, networks, transportation, logistics, exact and heuristic solution methods, and computational geometry, among others.

preprint2020arXiv

Quid Pro Quo allocations in Production-Inventory games

The concept of Owen point, introduced in Guardiola et al. (2009), is an appealing solution concept that for Production-Inventory games (PI-games) always belongs to their core. The Owen point allows all the players in the game to operate at minimum cost but it does not take into account the cost reduction induced by essential players over their followers (fans). Thus, it may be seen as an altruistic allocation for essential players what can be criticized. The aim this paper is two-fold: to study the structure and complexity of the core of PI-games and to introduce new core allocations for PI-games improving the weaknesses of the Owen point. Regarding the first goal, we advance further on the analysis of PI-games and we analyze its core structure and algorithmic complexity. Specifically, we prove that the number of extreme points of the core of PI-games is exponential on the number of players. On the other hand, we propose and characterize a new core-allocation, the Omega point, which compensates the essential players for their role on reducing the costs of their fans. Moreover, we define another solution concept, the Quid Pro Quo set (QPQ-set) of allocations, which is based on the Owen and Omega points. Among all the allocations in this set, we emphasize what we call the Solomonic QPQ allocation and we provide some necessary conditions for the coincidence of that allocation with the Shapley value and the Nucleolus.

preprint2019arXiv

An optimization model for line planning and timetabling in automated urban metro subway networks

In this paper we present a Mixed Integer Nonlinear Programming model that we developed as part of a pilot study requested by the R&D company Metrolab in order to design tools for finding solutions for line planning and timetable situations in automated urban metro subway networks. Our model incorporates important factors in public transportation systems from both, a cost-oriented and a passenger-oriented perspective, as time-dependent demands, interchange stations, short-turns and technical features of the trains in use. The incoming flows of passengers are modeled by means of piecewise linear demand functions which are parameterized in terms of arrival rates and bulk arrivals. Decisions about frequencies, train capacities, short-turning and timetables for a given planning horizon are jointly integrated to be optimized in our model. Finally, a novel Math-Heuristic approach is proposed to solve the problem. The results of extensive computational experiments are reported to show its applicability and effectiveness to handle real-world subway networks

preprint2019arXiv

Minimal Radius Enclosing Polyellipsoids

In this paper we analyze the extension of the classical smallest enclosing disk problem to the case of the location of a polyellipsoid to fully cover a set of demand points in $\mathbb{R}^d$. We prove that the problem is polynomially solvable in fixed dimension and analyze mathematical programming formulations for it. We also consider some geometric approaches for the problem in case the foci of the polyellipsoids are known. Extensions of the classical algorithm by Elzinga-Hearn are also derived for this new problem. Moreover, we address several extensions of the problem, as the case where the foci of the enclosing polyellipsoid are not given and have to be determined among a potential set of points or the induced covering problems when instead of polyellipsoids, one uses ordered median polyellipsoids. For these problems we also present Mixed Integer (Non) Linear Programming strategies that lead to efficient ways to solve it.

preprint2019arXiv

Optimal arrangements of hyperplanes for multiclass classification

In this paper, we present a novel approach to construct multiclass classifiers by means of arrangements of hyperplanes. We propose different mixed integer (linear and non linear) programming formulations for the problem using extensions of widely used measures for misclassifying observations where the \textit{kernel trick} can be adapted to be applicable. Some dimensionality reductions and variable fixing strategies are also developed for these models. An extensive battery of experiments has been run which reveal the powerfulness of our proposal as compared with other previously proposed methodologies.

preprint2017arXiv

On $\ell_p$-Support Vector Machines and Multidimensional Kernels

In this paper, we extend the methodology developed for Support Vector Machines (SVM) using $\ell_2$-norm ($\ell_2$-SVM) to the more general case of $\ell_p$-norms with $p\ge 1$ ($\ell_p$-SVM). The resulting primal and dual problems are formulated as mathematical programming problems; namely, in the primal case, as a second order cone optimization problem and in the dual case, as a polynomial optimization problem involving homogeneous polynomials. Scalability of the primal problem is obtained via general transformations based on the expansion of functionals in Schauder spaces. The concept of Kernel function, widely applied in $\ell_2$-SVM, is extended to the more general case by defining a new operator called multidimensional Kernel. This object gives rise to reformulations of dual problems, in a transformed space of the original data, which are solved by a moment-sdp based approach. The results of some computational experiments on real-world datasets are presented showing rather good behavior in terms of standard indicators such a \textit{accuracy index} and its ability to classify new data.

preprint2016arXiv

A general framework for locating hyperplanes to fitting set of points

This paper presents a family of new methods for locating/fitting hyperplanes with respect to a given set of points. We introduce a general framework for a family of aggregation criteria of different distance-based errors. The most popular methods found in the specialized literature can be cast within this family as particular choices of the errors and the aggregation criteria. Mathematical programming formulations for these methods are stated and some interesting cases are analyzed. It is also proposed a new goodness of fitting index which extends the classical coefficient of determination. A series of illustrative examples and extensive computational experiments implemented in R are provided to show the performances of some of the proposed methods.