Source author record

Zaiyue Yang

Zaiyue Yang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Systems and Control eess.SY math.OC Machine Learning Information Theory math.IT

Catalog footprint

What is connected

8works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2023arXiv

Automated deep reinforcement learning for real-time scheduling strategy of multi-energy system integrated with post-carbon and direct-air carbon captured system

The carbon-capturing process with the aid of CO2 removal technology (CDRT) has been recognised as an alternative and a prominent approach to deep decarbonisation. However, the main hindrance is the enormous energy demand and the economic implication of CDRT if not effectively managed. Hence, a novel deep reinforcement learning agent (DRL), integrated with an automated hyperparameter selection feature, is proposed in this study for the real-time scheduling of a multi-energy system coupled with CDRT. Post-carbon capture systems (PCCS) and direct-air capture systems (DACS) are considered CDRT. Various possible configurations are evaluated using real-time multi-energy data of a district in Arizona and CDRT parameters from manufacturers' catalogues and pilot project documentation. The simulation results validate that an optimised soft-actor critic (SAC) algorithm outperformed the TD3 algorithm due to its maximum entropy feature. We then trained four (4) SAC agents, equivalent to the number of considered case studies, using optimised hyperparameter values and deployed them in real time for evaluation. The results show that the proposed DRL agent can meet the prosumers' multi-energy demand and schedule the CDRT energy demand economically without specified constraints violation. Also, the proposed DRL agent outperformed rule-based scheduling by 23.65%. However, the configuration with PCCS and solid-sorbent DACS is considered the most suitable configuration with a high CO2 captured-released ratio of 38.54, low CO2 released indicator value of 2.53, and a 36.5% reduction in CDR cost due to waste heat utilisation and high absorption capacity of the selected sorbent. However, the adoption of CDRT is not economically viable at the current carbon price. Finally, we showed that CDRT would be attractive at a carbon price of 400-450USD/ton with the provision of tax incentives by the policymakers.

preprint2022arXiv

Incentive-aware Electric Vehicle Routing Problem: a Bi-level Model and a Joint Solution Algorithm

Fixed pickup and delivery times can strongly limit the performance of freight transportation. Against this backdrop, fleet operators can use compensation mechanisms such as monetary incentives to buy delay time from their customers, in order to improve the fleet efficiency and ultimately minimize the costs of operation. To make the most of such an operational model, the fleet activities and the incentives should be jointly optimized accounting for the customers' reactions. Against this backdrop, this paper presents an incentive-aware electric vehicle routing scheme in which the fleet operator actively provides incentives to the customers in exchange of pickup or delivery time flexibility. Specifically, we first devise a bi-level model whereby the fleet operator optimizes the routes and charging schedules of the fleet jointly with an incentive rate to reimburse the delivery delays experienced by the customers. At the same time, the customers choose the admissible delays by minimizing a monetarily-weighted combination of the delays minus the reimbursement offered by the operator. Second, we tackle the complexity resulting from the bi-level and nonlinear problem structure with an equivalent transformation method, reformulating the problem as a single-level optimization problem that can be solved with standard mixed-integer linear programming algorithms. We demonstrate the effectiveness of our framework via extensive numerical experiments using VRP-REP data from Belgium. Our results show that by jointly optimizing routes and incentives subject to the customers' preferences, the operational costs can be reduced by up to 5%, whilst customers can save more than 30% in total delivery fees.

preprint2022arXiv

Joint Routing and Charging Problem of Electric Vehicles with Incentive-aware Customers Considering Spatio-temporal Charging Prices

This paper investigates the scheduling problem of a fleet of electric vehicles, providing mobility as a service to a set of time-specified customers, where the operator needs to solve the routing and charging problem jointly for each EV. Hereby we consider incentive-aware customers and propose that the operator offers monetary incentives to customers in exchange for time flexibility. In this way, the fleet operator can achieve a routing and charging schedule with lower costs, whilst the customers receive monetary compensation for their flexibility. Specifically, we first propose a bi-level optimization model whereby the fleet operator optimizes the routing and charging schedule accounting for the spatio-temporal varying charging price, jointly with a monetary incentive to reimburse the delivery time flexibility experienced by the customers. Concurrently the customers choose their own time flexibility by minimizing their own cost. Second, we cope with the computational burden coming from this nonlinear bi-level optimization model with an accurate reformulation approach consisting of the KKT optimality conditions, a Big-M-based linearization method, and the zero duality gap of convex optimization problems. This way, we convert the proposed problem into a single-level optimization problem, which can be solved by a strengthened generalized Benders decomposition method holding a faster convergence rate than the generalized Benders decomposition method. To evaluate the effectiveness of the proposed mathematical model, we carry out numerous simulation experiments by using the VRP-REP data of Belgium. The numerical results showcase that the proposed mathematical model can reduce the delivery fees for the customers together with the cost of operation incurred by the fleet operator.

preprint2020arXiv

Finite-Sample Analysis of Decentralized Temporal-Difference Learning with Linear Function Approximation

Motivated by the emerging use of multi-agent reinforcement learning (MARL) in engineering applications such as networked robotics, swarming drones, and sensor networks, we investigate the policy evaluation problem in a fully decentralized setting, using temporal-difference (TD) learning with linear function approximation to handle large state spaces in practice. The goal of a group of agents is to collaboratively learn the value function of a given policy from locally private rewards observed in a shared environment, through exchanging local estimates with neighbors. Despite their simplicity and widespread use, our theoretical understanding of such decentralized TD learning algorithms remains limited. Existing results were obtained based on i.i.d. data samples, or by imposing an `additional' projection step to control the `gradient' bias incurred by the Markovian observations. In this paper, we provide a finite-sample analysis of the fully decentralized TD(0) learning under both i.i.d. as well as Markovian samples, and prove that all local estimates converge linearly to a small neighborhood of the optimum. The resultant error bounds are the first of its type---in the sense that they hold under the most practical assumptions ---which is made possible by means of a novel multi-step Lyapunov analysis.

preprint2016arXiv

Scheduling of EV Battery Swapping, I: Centralized Solution

We formulate an optimal scheduling problem for battery swapping that assigns to each electric vehicle (EV) a best station to swap its depleted battery based on its current location and state of charge. The schedule aims to minimize total travel distance and generation cost over both station assignments and power flow variables, subject to EV range constraints, grid operational constraints and AC power flow equations. To deal with the nonconvexity of power flow equations and the binary nature of station assignments, we propose a solution based on second-order cone programming (SOCP) relaxation of optimal power flow (OPF) and generalized Benders decomposition. When the SOCP relaxation is exact, this approach computes a globally optimal solution. We evaluate the performance of the proposed algorithm through simulations. The algorithm requires global information and is suitable for cases where the distribution network, stations, and EVs are managed centrally by the same operator. In Part II of the paper, we develop distributed solutions for cases where they are operated by different organizations that do not share private information.

preprint2016arXiv

Scheduling of EV Battery Swapping, II: Distributed Solutions

In Part I of this paper we formulate an optimal scheduling problem for battery swapping that assigns to each electric vehicle (EV) a best station to swap its depleted battery based on its current location and state of charge. The schedule aims to minimize a weighted sum of total travel distance and generation cost over both station assignments and power flow variables, subject to EV range constraints, grid operational constraints and AC power flow equations. We propose there a centralized solution based on the second-order cone programming (SOCP) relaxation of optimal power flow (OPF) and generalized Benders decomposition that is suitable when global information is available. In this paper we propose two distributed solutions based on the alternating direction method of multipliers (ADMM) and dual decomposition respectively that are suitable for cases where the distribution grid, battery stations and EVs are managed by separate entities. Our algorithms allow these entities to make individual decisions but coordinate through privacy-preserving information exchanges to jointly solve an approximate version of the joint battery swapping scheduling and OPF problem. We evaluate our algorithms through simulations.

preprint2016arXiv

Smart Grid Testbed for Demand Focused Energy Management in End User Environments

Successful deployment of smart grids necessitates experimental validities of their state-of-the-art designs in two-way communications, real-time demand response and monitoring of consumers' energy usage behavior. The objective is to observe consumers' energy usage pattern and exploit this information to assist the grid in designing incentives, energy management mechanisms, and real-time demand response protocols; so as help the grid achieving lower costs and improve energy supply stability. Further, by feeding the observed information back to the consumers instantaneously, it is also possible to promote energy efficient behavior among the users. To this end, this paper performs a literature survey on smart grid testbeds around the world, and presents the main accomplishments towards realizing a smart grid testbed at the Singapore University of Technology and Design (SUTD). The testbed is able to monitor, analyze and evaluate smart grid communication network design and control mechanisms, and test the suitability of various communications networks for both residential and commercial buildings. The testbeds are deployed within the SUTD student dormitories and the main university campus to monitor and record end-user energy consumption in real-time, which will enable us to design incentives, control algorithms and real-time demand response schemes. The testbed also provides an effective channel to evaluate the needs on communication networks to support various smart grid applications. In addition, our initial results demonstrate that our testbed can provide an effective platform to identify energy wastage, and prompt the needs of a secure communications channel as the energy usage pattern can provide privacy related information on individual user.

preprint2014arXiv

Three-Party Energy Management With Distributed Energy Resources in Smart Grid

In this paper, the benefits of distributed energy resources (DERs) are considered in an energy management scheme for a smart community consisting of a large number of residential units (RUs) and a shared facility controller (SFC). A non-cooperative Stackelberg game between RUs and the SFC is proposed in order to explore how both entities can benefit, in terms of achieved utility and minimizing total cost respectively, from their energy trading with each other and the grid. From the properties of the game, it is shown that the maximum benefit to the SFC in terms of reduction in total cost is obtained at the unique and strategy proof Stackelberg equilibrium (SE). It is further shown that the SE is guaranteed to be reached by the SFC and RUs by executing the proposed algorithm in a distributed fashion, where participating RUs comply with their best strategies in response to the action chosen by the SFC. In addition, a charging-discharging scheme is introduced for the SFC's storage device (SD) that can further lower the SFC's total cost if the proposed game is implemented. Numerical experiments confirm the effectiveness of the proposed scheme.

Zaiyue Yang

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

Automated deep reinforcement learning for real-time scheduling strategy of multi-energy system integrated with post-carbon and direct-air carbon captured system

Incentive-aware Electric Vehicle Routing Problem: a Bi-level Model and a Joint Solution Algorithm

Joint Routing and Charging Problem of Electric Vehicles with Incentive-aware Customers Considering Spatio-temporal Charging Prices

Finite-Sample Analysis of Decentralized Temporal-Difference Learning with Linear Function Approximation

Scheduling of EV Battery Swapping, I: Centralized Solution

Scheduling of EV Battery Swapping, II: Distributed Solutions

Smart Grid Testbed for Demand Focused Energy Management in End User Environments

Three-Party Energy Management With Distributed Energy Resources in Smart Grid