Source author record

Alberto Leon-Garcia

Alberto Leon-Garcia appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Networking and Internet Architecture; Machine Learning; Performance; Artificial Intelligence; Cryptography and Security

Catalog footprint

What is connected

7 works

5 topics

4 close collaborators

preprint 2022 arXiv

Missing Data Estimation in Temporal Multilayer Position-aware Graph Neural Network (TMP-GNN)

GNNs have proven highly effective in node-level, edge-level, and graph-level prediction tasks across several domains. Existing approaches mainly focus on static graphs. However, many graphs change over time: edges may disappear, and node or edge attributes may change from one time step to the next. It is essential to account for such evolution when learning node representations in time-varying graphs. In this paper, we propose a Temporal Multi-layered Position-aware Graph Neural Network (TMP-GNN), a node embedding approach for dynamic graphs that incorporates the interdependence of temporal relations into the embedding computation. We evaluate TMP-GNN on two different representations of temporal multilayered graphs, assessing its performance against the most popular GNNs on node-level prediction tasks. Then, we incorporate TMP-GNN into a deep learning framework to estimate missing data and compare its performance with the corresponding competent GNNs from our former experiment and with a baseline method. Experimental results on four real-world datasets yield up to 58% of lower ROC AUC for the pairwise node classification task, and 96% of lower MAE in missing feature estimation, particularly for graphs with a relatively high number of nodes and lower mean degree of connectivity.

preprint 2021 arXiv

Queue-Learning: A Reinforcement Learning Approach for Providing Quality of Service

End-to-end delay is a critical attribute of quality of service (QoS) in application domains such as cloud computing and computer networks. This metric is particularly important in tandem service systems, where the end-to-end service is provided through a chain of services. Service-rate control is a common mechanism for providing QoS guarantees in service systems. In this paper, we introduce a reinforcement learning-based (RL-based) service-rate controller that provides probabilistic upper-bounds on the end-to-end delay of the system, while preventing the overuse of service resources. In order to have a general framework, we use queueing theory to model the service systems. However, we adopt an RL-based approach to avoid the limitations of queueing-theoretic methods. In particular, we use Deep Deterministic Policy Gradient (DDPG) to learn the service rates (action) as a function of the queue lengths (state) in tandem service systems. In contrast to existing RL-based methods that quantify their performance by the achieved overall reward, which could be hard to interpret or even misleading, our proposed controller provides explicit probabilistic guarantees on the end-to-end delay of the system. The evaluations are presented for a tandem queueing system with non-exponential inter-arrival and service times, the results of which validate our controller's capability in meeting QoS constraints.
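As a rough illustration of the quantity this abstract is concerned with, the sketch below computes end-to-end delays in a tandem of queues and the empirical probability that a delay bound is violated. It assumes single-server FIFO stages with infinite buffers, and the function names `tandem_delays` and `violation_probability` are hypothetical. It is not the paper's DDPG controller; it only shows the delay a service-rate controller of this kind would try to keep under a bound.

```python
def tandem_delays(arrivals, service_times):
    """End-to-end delays in a tandem of single-server FIFO queues.

    arrivals: arrival times at the first stage, in non-decreasing order.
    service_times: service_times[j][k] is the service time of job k at stage j.
    Assumes infinite buffers between stages (an illustrative assumption).
    """
    ready = list(arrivals)  # time each job becomes available to the current stage
    for stage in service_times:
        free_at = float("-inf")  # time the stage's server finishes its previous job
        departures = []
        for t_ready, s in zip(ready, stage):
            start = max(t_ready, free_at)  # wait for the job and for the server
            free_at = start + s
            departures.append(free_at)
        ready = departures  # departures feed the next stage
    return [d - a for d, a in zip(ready, arrivals)]


def violation_probability(delays, d_max):
    """Empirical fraction of jobs whose end-to-end delay exceeds d_max."""
    return sum(d > d_max for d in delays) / len(delays)
```

A probabilistic upper bound of the form P(delay > d_max) <= epsilon can then be checked empirically by comparing `violation_probability(delays, d_max)` against epsilon.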

preprint 2020 arXiv

On the Robustness of Cooperative Multi-Agent Reinforcement Learning

In cooperative multi-agent reinforcement learning (c-MARL), agents learn to cooperatively take actions as a team to maximize a total team reward. We analyze the robustness of c-MARL to adversaries capable of attacking one of the agents on a team. Through the ability to manipulate this agent's observations, the adversary seeks to decrease the total team reward. Attacking c-MARL is challenging for three reasons: first, it is difficult to estimate team rewards or how they are impacted by an agent mispredicting; second, models are non-differentiable; and third, the feature space is low-dimensional. Thus, we introduce a novel attack. The attacker first trains a policy network with reinforcement learning to find a wrong action it should encourage the victim agent to take. Then, the adversary uses targeted adversarial examples to force the victim to take this action. Our results on the StarCraft II multi-agent benchmark demonstrate that c-MARL teams are highly vulnerable to perturbations applied to one of their agents' observations. By attacking a single agent, our attack method has a highly negative impact on the overall team reward, reducing it from 20 to 9.4. As a result, the team's winning rate drops from 98.9% to 0%.

preprint 2020 arXiv

Probabilistic Bounds on the End-to-End Delay of Service Function Chains using Deep MDN

Ensuring the conformance of a service system's end-to-end delay to service level agreement (SLA) constraints is a challenging task that requires statistical measures beyond the average delay. In this paper, we study the real-time prediction of the end-to-end delay distribution in systems with composite services such as service function chains. In order to have a general framework, we use queueing theory to model service systems, while also adopting a statistical learning approach to avoid the limitations of queueing-theoretic methods such as stationarity assumptions or other approximations that are often used to make the analysis mathematically tractable. Specifically, we use deep mixture density networks (MDN) to predict the end-to-end distribution of the delay given the network's state. As a result, our method is sufficiently general to be applied in different contexts and applications. Our evaluations show a good match between the learned distributions and the simulations, which suggests that the proposed method is a good candidate for providing probabilistic bounds on the end-to-end delay of more complex systems where simulations or theoretical methods are not applicable.
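To make the idea of a probabilistic delay bound concrete, here is a minimal sketch of how one could read such a bound off a predicted mixture distribution. It assumes the mixture components are Gaussian, which the abstract does not state, and the inputs `weights`, `means`, and `stds` stand in for whatever a trained MDN would output for a given network state. The function names are hypothetical.

```python
import math


def mixture_cdf(d, weights, means, stds):
    """P(delay <= d) under a Gaussian mixture (assumed component family)."""
    return sum(
        w * 0.5 * (1.0 + math.erf((d - m) / (s * math.sqrt(2.0))))
        for w, m, s in zip(weights, means, stds)
    )


def delay_bound(weights, means, stds, epsilon, tol=1e-9):
    """Smallest d with P(delay > d) <= epsilon, found by bisection on the CDF."""
    # Bracket the answer well outside every component's bulk.
    lo = min(m - 10.0 * s for m, s in zip(means, stds))
    hi = max(m + 10.0 * s for m, s in zip(means, stds))
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if mixture_cdf(mid, weights, means, stds) >= 1.0 - epsilon:
            hi = mid  # bound already satisfied at mid, try a tighter one
        else:
            lo = mid
    return hi
```

For example, `delay_bound([0.7, 0.3], [2.0, 5.0], [0.5, 1.0], epsilon=0.05)` returns a delay value that the predicted distribution exceeds with probability at most 5%. Since a Gaussian mixture places some mass on negative delays, a real system might prefer a component family supported on non-negative values.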

preprint 2020 arXiv

Reinforcement Learning-based Admission Control in Delay-sensitive Service Systems

Ensuring quality of service (QoS) guarantees in service systems is a challenging task, particularly when the system is composed of more fine-grained services, such as service function chains. An important QoS metric in service systems is the end-to-end delay, which becomes even more important in delay-sensitive applications, where the jobs must be completed within a time deadline. Admission control is one way of providing end-to-end delay guarantees, where the controller accepts a job only if it has a high probability of meeting the deadline. In this paper, we propose a reinforcement learning-based admission controller that guarantees a probabilistic upper-bound on the end-to-end delay of the service system, while minimizing the probability of unnecessary rejections. Our controller only uses the queue length information of the network and requires no knowledge about the network topology or system parameters. Since long-term performance metrics are of great importance in service systems, we take an average-reward reinforcement learning approach, which is well suited to infinite horizon problems. Our evaluations verify that the proposed RL-based admission controller is capable of providing probabilistic bounds on the end-to-end delay of the network, without using system model information.

preprint 2016 arXiv

Leveraging Synergy of 5G SDWN and Multi-Layer Resource Management for Network Optimization

Fifth-generation (5G) cellular wireless networks are envisioned to predispose service-oriented, flexible, and spectrum/energy-efficient edge-to-core infrastructure, aiming to offer diverse applications. Convergence of software-defined networking (SDN), software-defined radio (SDR) compatible with multiple radio access technologies (RATs), and virtualization on the concept of 5G software-defined wireless networking (5G-SDWN) is a promising approach to provide such a dynamic network. The principal technique behind the 5G-SDWN framework is the separation of the control and data planes, from the deep core entities to edge wireless access points (APs). This separation allows the abstraction of resources as transmission parameters of each user over the 5G-SDWN. In this user-centric and service-oriented environment, resource management plays a critical role in achieving efficiency and reliability. However, it is natural to wonder if 5G-SDWN can be leveraged to enable converged multi-layer (CML) resource management over the portfolio of resources, and reciprocally, if CML resource management can effectively provide performance enhancement and reliability for 5G-SDWN. We believe that answering these questions and investigating this mutual synergy are not trivial, but multidimensional and complex for 5G-SDWN, which consists of different technologies and also inherits legacy generations of wireless networks. In this paper, we propose a flexible protocol structure for 5G-SDWN based on the three pillars mentioned above, which can handle all the required functionalities in a more cross-layer manner. Based on this, we demonstrate how the general framework of CML resource management can control the end user quality of experience. For two scenarios of 5G-SDWN, we investigate the effects of joint user-association and resource allocation via CML resource management to improve performance in a virtualized network.

preprint 2015 arXiv

Virtualization of Multi-Cell 802.11 Networks: Association and Airtime Control

This paper investigates the virtualization and optimization of a multi-cell WLAN. We consider the station (STA)-access point (AP) association and airtime control for virtualized 802.11 networks to provide service customization and fairness across multiple internet service providers (ISPs) sharing the common physical infrastructure and network capacity. More specifically, an optimization problem is formulated on the STAs' transmission probabilities to maximize the overall network throughput, while providing airtime usage guarantees for the ISPs. Subsequently, an algorithm to reach the optimal solution is developed by applying monomial approximation and geometric programming iteratively. Based on the proposed three-dimensional Markov-chain model of the enhanced distributed channel access (EDCA) protocol, the detailed implementation of the optimal transmission probability is also discussed. The accuracy of the proposed Markov-chain model and the performance of the developed association and airtime control scheme are evaluated through numerical results.
