Researcher profile

Hai Yang

Hai Yang contributes to research discovery and scholarly infrastructure.

Researcher. Affiliation not imported. Open to collaborate.

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
11 works
0 followers
13 topics
4 close collaborators

Published work

11 published items

Preprint, 2026, arXiv

Coordinated Pandemic Control with Large Language Model Agents as Policymaking Assistants

Effective pandemic control requires timely and coordinated policymaking across administrative regions that are intrinsically interdependent. However, human-driven responses are often fragmented and reactive, with policies formulated in isolation and adjusted only after outbreaks escalate, undermining proactive intervention and global pandemic mitigation. To address this challenge, here we propose a large language model (LLM) multi-agent policymaking framework that supports coordinated and proactive pandemic control across regions. Within our framework, each administrative region is assigned an LLM agent as an AI policymaking assistant. The agent reasons over region-specific epidemiological dynamics while communicating with other agents to account for cross-regional interdependencies. By integrating real-world data, a pandemic evolution simulator, and structured inter-agent communication, our framework enables agents to jointly explore counterfactual intervention scenarios and synthesize coordinated policy decisions through a closed-loop simulation process. We validate the proposed framework using state-level COVID-19 data from the United States between April and December 2020, together with real-world mobility records and observed policy interventions. Compared with real-world pandemic outcomes, our approach reduces cumulative infections and deaths by up to 63.7% and 40.1%, respectively, at the individual state level, and by 39.0% and 27.0%, respectively, when aggregated across states. These results demonstrate that LLM multi-agent systems can enable more effective pandemic control with coordinated policymaking...
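To make the idea of cross-regional interdependence concrete, the following is a minimal sketch of a toy two-region SIR simulator in which each region's infection pressure mixes local and imported prevalence. It is not the simulator used in the paper: the function name `simulate_two_regions`, the `coupling` parameter, and all numeric values are hypothetical and chosen only for illustration.

```python
def simulate_two_regions(beta, coupling, gamma=0.1, days=200,
                         population=(1_000_000, 1_000_000), seed=(100, 0)):
    """Toy two-region SIR model (illustrative assumption, not the paper's simulator).

    beta:     per-region transmission rates; a stand-in for policy stringency
    coupling: fraction of infection pressure imported from the other region
    Returns cumulative infections per region after `days` daily steps.
    """
    s = [float(population[r] - seed[r]) for r in range(2)]
    i = [float(seed[r]) for r in range(2)]
    cumulative = [float(seed[r]) for r in range(2)]
    for _ in range(days):
        prevalence = [i[r] / population[r] for r in range(2)]
        new_cases = []
        for r in range(2):
            other = 1 - r
            # Force of infection mixes local and imported prevalence.
            force = beta[r] * ((1 - coupling) * prevalence[r]
                               + coupling * prevalence[other])
            new_cases.append(force * s[r])
        for r in range(2):
            recovered = gamma * i[r]
            s[r] -= new_cases[r]
            i[r] += new_cases[r] - recovered
            cumulative[r] += new_cases[r]
    return cumulative


# Counterfactual comparison: both regions tighten policy versus neither.
baseline = simulate_two_regions(beta=(0.3, 0.3), coupling=0.05)
coordinated = simulate_two_regions(beta=(0.15, 0.15), coupling=0.05)
reduction = 1.0 - sum(coordinated) / sum(baseline)
```

In this sketch, `reduction` plays the role of the relative decrease in cumulative infections against a baseline scenario. A closed-loop setup would re-run such a simulator each time an agent proposes a new intervention.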

Preprint, 2025, arXiv

Automating Traffic Model Enhancement with AI Research Agent

Developing efficient traffic models is crucial for optimizing modern transportation systems. However, current modeling approaches remain labor-intensive and prone to human error due to their dependence on manual workflows. These processes typically involve extensive literature reviews, formula tuning, and iterative testing, which often lead to inefficiencies. To address this, we propose TR-Agent, an AI-powered framework that autonomously develops and refines traffic models through a closed-loop, iterative process. We structure the research pipeline into four key stages: idea generation, theory formulation, theory evaluation, and iterative optimization, and implement TR-Agent with four corresponding modules. These modules collaborate to retrieve knowledge from external sources, generate novel hypotheses, implement and debug models, and evaluate their performance on evaluation datasets. Through iterative feedback and refinement, TR-Agent improves both modeling efficiency and effectiveness. We validate the framework on three representative traffic models: the Intelligent Driver Model (IDM) for car-following behavior, the MOBIL model for lane-changing, and the Lighthill-Whitham-Richards (LWR) speed-density relationship for macroscopic traffic flow modeling. Experimental results show substantial performance gains over the original models. To assess the robustness and generalizability of the improvements, we conduct additional evaluations across multiple real-world datasets, demonstrating consistent performance gains beyond the original development data. Furthermore, TR-Agent produces interpretable explanations for each improvement, enabling researchers to easily verify and extend its results. This makes TR-Agent a valuable assistant for traffic modeling refinement and a promising tool for broader applications in transportation research.
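For readers unfamiliar with the baseline being refined, here is a minimal sketch of the standard IDM acceleration equation. It shows the original car-following model only, not any TR-Agent improvement. The function name and the default parameter values are illustrative assumptions, not values taken from the paper.

```python
import math


def idm_acceleration(v, gap, dv, v0=30.0, T=1.5, a_max=1.0, b=2.0,
                     s0=2.0, delta=4.0):
    """Standard Intelligent Driver Model acceleration in m/s^2.

    v:   follower speed (m/s)
    gap: bumper-to-bumper distance to the leader (m), must be > 0
    dv:  approach rate, follower speed minus leader speed (m/s)
    Defaults (desired speed v0, time headway T, maximum acceleration a_max,
    comfortable deceleration b, minimum gap s0, exponent delta) are
    illustrative assumptions.
    """
    # Desired dynamic gap: minimum gap plus headway and braking terms.
    s_star = s0 + max(0.0, v * T + v * dv / (2.0 * math.sqrt(a_max * b)))
    # Free-road term minus interaction term.
    return a_max * (1.0 - (v / v0) ** delta - (s_star / gap) ** 2)
```

On an effectively empty road the interaction term vanishes, so a stationary vehicle accelerates at roughly `a_max`, while a vehicle closing quickly on a nearby leader gets a strongly negative acceleration.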

Preprint, 2022, arXiv

Formation of episodic jets and associated flares from black hole accretion systems

Episodic ejections of blobs (episodic jets) are widely observed in black hole sources and usually associated with flares. In this paper, by performing and analyzing three-dimensional general relativistic magnetohydrodynamic numerical simulations of accretion flows, we investigate their physical mechanisms. We find that magnetic reconnection occurs in the accretion flow, likely due to the turbulent motion and differential rotation of the accretion flow, resulting in flares and formation of flux ropes. Flux ropes formed inside of 10-15 gravitational radii are found to mainly stay within the accretion flow, while flux ropes formed beyond this radius are ejected outward by magnetic forces and form the episodic jets. These results confirm the basic scenario proposed in Yuan et al. (2009). Moreover, our simulations find that the predicted velocity of the ejected blobs is consistent with observations of Sgr A*, M81, and M87. The whole process is found to occur quasi-periodically, with the period being the orbital time at the radius where the flux rope is formed. The predicted period of flares and ejections is consistent with those found from the light curves or images of Sgr A*, M87, and PKS 1510-089. The possible applications to protostellar accretion systems are discussed.

Preprint, 2022, arXiv

Improved Multi-step FCS-MPCC with Disturbance Compensation for PMSM Drives -- Methods and Experimental Validation

In this paper, an improved multi-step finite control set model predictive current control (FCS-MPCC) strategy with speed loop disturbance compensation is proposed for permanent magnet synchronous machine (PMSM) drive systems. A multi-step prediction mechanism can significantly improve the steady-state performance of the motor system. Because conventional multi-step prediction carries a heavy computational burden, an improved multi-step finite control set model predictive current control (IM MPCC) strategy is proposed by developing a new multi-step prediction mechanism. Furthermore, in order to improve the dynamic response of the system, a disturbance compensation (DC) mechanism based on an extended state observer (ESO) is proposed to estimate and compensate the total disturbance in the speed loop of the PMSM system. Both simulation and experimental results validate the effectiveness of the proposed control strategy.
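The disturbance-estimation idea can be illustrated with a generic second-order linear ESO on a first-order plant. This is a sketch under stated assumptions, not the observer designed in the paper: the plant `dy/dt = b0*u + d`, the bandwidth parameterization `beta1 = 2*omega_o`, `beta2 = omega_o**2`, the forward-Euler discretization, and all numeric values are illustrative choices.

```python
def simulate_eso(d=5.0, b0=1.0, omega_o=200.0, dt=1e-4, steps=2000):
    """Estimate a constant lumped disturbance d with a linear ESO.

    Plant (assumed):    dy/dt  = b0 * u + d
    Observer (assumed): dz1/dt = z2 + b0 * u + beta1 * (y - z1)
                        dz2/dt = beta2 * (y - z1)
    z1 tracks the output y and z2 tracks the disturbance d.
    Returns the final disturbance estimate z2.
    """
    # Both observer poles are placed at -omega_o.
    beta1, beta2 = 2.0 * omega_o, omega_o ** 2
    y = 0.0          # measured output, e.g. a speed signal
    z1 = z2 = 0.0    # observer states
    u = 0.0          # control input held at zero to isolate the observer
    for _ in range(steps):
        e = y - z1
        # Forward-Euler update of the observer, then of the plant.
        z1 += dt * (z2 + b0 * u + beta1 * e)
        z2 += dt * (beta2 * e)
        y += dt * (b0 * u + d)
    return z2
```

In a compensation scheme, the estimate `z2` would be fed forward (scaled by `1 / b0`) to cancel the disturbance in the speed loop. A higher `omega_o` speeds up convergence at the cost of greater sensitivity to measurement noise.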

Preprint, 2022, arXiv

Subtype-Former: a deep learning approach for cancer subtype discovery with multi-omics data

Motivation: Cancer is heterogeneous, which complicates precise, personalized treatment. Accurate subtyping can lead to better survival rates for cancer patients. High-throughput technologies provide multiple omics data for cancer subtyping. However, precise cancer subtyping remains challenging due to the large amount and high dimensionality of omics data. Results: This study proposes Subtype-Former, a deep learning method based on MLP and Transformer Block, to extract the low-dimensional representation of the multi-omics data. K-means and Consensus Clustering are also used to achieve accurate subtyping results. We compared Subtype-Former with the other state-of-the-art subtyping methods across 10 TCGA cancer types. We found that Subtype-Former can perform better on the benchmark datasets of more than 5000 tumors based on the survival analysis. In addition, Subtype-Former also achieved outstanding results in pan-cancer subtyping, which can help analyze the commonalities and differences across various cancer types at the molecular level. Finally, we applied Subtype-Former to the 10 TCGA cancer types. We identified 50 essential biomarkers, which can be used to study targeted cancer drugs and promote the development of cancer treatments in the era of precision medicine.

Preprint, 2022, arXiv

The Accretion flow in M87 is really MAD

The supermassive black holes in most galaxies in the universe are powered by hot accretion flows. Both theoretical analysis and numerical simulations have indicated that, depending on the degree of magnetization, black hole hot accretion flow is divided into two modes, namely SANE (standard and normal evolution) and MAD (magnetically arrested disk). It has been an important question which mode the hot accretion flows in individual sources belong to in reality, SANE or MAD. This issue has been investigated in some previous works, but they all suffer from various uncertainties. By using the measured rotation measure values in the prototype low-luminosity active galactic nucleus M87 at 2, 5, and 8 GHz along the jet at various distances from the black hole, combined with three-dimensional general relativistic magnetohydrodynamic numerical simulations of SANE and MAD, we show in this paper that the rotation measure values predicted by MAD are consistent with observations, while the SANE model overestimates the rotation measure by over two orders of magnitude and is thus ruled out.

Preprint, 2020, arXiv

Competitive ride-sourcing market with a third-party integrator

Recently, some transportation service providers have attempted to integrate the ride services offered by multiple independent ride-sourcing platforms, so that passengers are able to request rides through such third-party integrators or connectors and receive service from any one of the platforms. This novel business model, termed third-party platform-integration in this paper, has the potential to alleviate the cost of market fragmentation due to the demand splitting among multiple platforms. While most existing studies focus on the operation strategies for one single monopolist platform, much less is known about the competition and platform-integration as well as the implications for operation strategy and system efficiency. In this paper, we propose mathematical models to describe the ride-sourcing market with multiple competing platforms and compare system performance metrics between two market scenarios, i.e., with and without platform-integration, at Nash equilibrium as well as social optimum. We find that platform-integration can increase total realized demand and social welfare at both Nash equilibrium and social optimum, but may not necessarily generate a greater profit when vehicle supply is sufficiently large and/or the market is too fragmented. We show that the market with platform-integration generally achieves greater social welfare. On one hand, the integrator in platform-integration is able to generate a thicker market and reduce matching frictions; on the other hand, multiple platforms are still competing by independently setting their prices, which helps to mitigate monopoly mark-up as in the monopoly market.

Preprint, 2020, arXiv

Joint predictions of multi-modal ride-hailing demands: a deep multi-task multigraph learning-based approach

Ride-hailing platforms generally provide various service options to customers, such as solo ride services, shared ride services, etc. It is generally expected that demands for different service modes are correlated, and the prediction of demand for one service mode can benefit from historical observations of demands for other service modes. Moreover, an accurate joint prediction of demands for multiple service modes can help the platforms better allocate and dispatch vehicle resources. Although there is a large stream of literature on ride-hailing demand predictions for one specific service mode, little effort has been devoted to joint predictions of ride-hailing demands for multiple service modes. To address this issue, we propose a deep multi-task multi-graph learning approach, which combines two components: (1) multiple multi-graph convolutional (MGC) networks for predicting demands for different service modes, and (2) multi-task learning modules that enable knowledge sharing across multiple MGC networks. More specifically, two multi-task learning structures are established. The first one is the regularized cross-task learning, which builds cross-task connections among the inputs and outputs of multiple MGC networks. The second one is the multi-linear relationship learning, which imposes a prior tensor normal distribution on the weights of various MGC networks. Although there are no concrete bridges between different MGC networks, the weights of these networks are constrained by each other and subject to a common prior distribution. Evaluated with the for-hire-vehicle datasets in Manhattan, we show that our proposed approach outperforms the benchmark algorithms in prediction accuracy for different ride-hailing modes.

Preprint, 2020, arXiv

Modeling indoor-level non-pharmaceutical interventions during the COVID-19 pandemic: a pedestrian dynamics-based microscopic simulation approach

Mathematical modeling of epidemic spreading has been widely adopted to estimate the threats of epidemic diseases (e.g., the COVID-19 pandemic) as well as to evaluate epidemic control interventions. Indoor places are considered a significant source of epidemic spreading risk, but existing widely-used epidemic spreading models are usually of limited use for indoor places since the dynamic physical distance changes between people are ignored, and the empirical features of essential and non-essential travel are not differentiated. In this paper, we introduce a pedestrian-based epidemic spreading model that is capable of modeling indoor transmission risks of diseases during people's social activities. Using before-and-after mobility data from the University of Maryland COVID-19 Impact Analysis Platform, we find that people tend to spend more time in grocery stores once their travel frequencies are restricted to a low level. In other words, an increase in dwell time could balance the decrease in travel frequencies and satisfy people's demand. Based on the pedestrian-based model and the empirical evidence, combined non-pharmaceutical interventions from different operational levels are evaluated. Numerical simulations show that restrictions on people's travel frequency and open-hours of indoor places may not be universally effective in reducing average infection risks for each pedestrian who visits the place. Entry limitations can be a widely effective alternative, whereas the decision-maker needs to balance the decrease in risky contacts and the increase in queue length outside the place that may impede people from fulfilling their travel needs.

Preprint, 2020, arXiv

Off-Street Parking for TNC Vehicles to Reduce Cruising Traffic

This paper considers off-street parking for the cruising vehicles of transportation network companies (TNCs) to reduce traffic congestion. We propose a novel business model that integrates the shared parking service into the TNC platform. In the proposed model, the platform (a) provides interfaces that connect passengers, drivers and garage operators (commercial or private garages); (b) determines the ride fare, driver payment, and parking rates; (c) matches passengers to TNC vehicles for ride-hailing services; and (d) matches vacant TNC vehicles to unoccupied parking garages to reduce the cruising cost. A queuing-theoretic model is proposed to capture the matching process of passengers, drivers, and parking garages. A market-equilibrium model is developed to capture the incentives of the passengers, drivers, and garage operators. An optimization-based model is formulated to capture the optimal pricing of the TNC platform. Through a realistic case study, we show that the proposed business model will offer a Pareto improvement that benefits all stakeholders, which leads to higher passenger surplus, higher driver surplus, higher garage operator surplus, higher platform profit, and reduced traffic congestion.

Preprint, 2019, arXiv

Predicting origin-destination ride-sourcing demand with a spatio-temporal encoder-decoder residual multi-graph convolutional network

With the rapid development of mobile-internet technologies, on-demand ride-sourcing services have become increasingly popular and largely reshaped the way people travel. Demand prediction is one of the most fundamental components in supply-demand management systems of ride-sourcing platforms. With accurate short-term prediction for origin-destination (OD) demand, the platforms can make precise and timely decisions on real-time matching, idle vehicle reallocations and ride-sharing vehicle routing, etc. Compared to zone-based demand prediction that has been examined by many previous studies, OD-based demand prediction is more challenging. This is mainly due to the complicated spatial and temporal dependencies among demand of different OD pairs. To overcome this challenge, we propose the Spatio-Temporal Encoder-Decoder Residual Multi-Graph Convolutional network (ST-ED-RMGC), a novel deep learning model for predicting ride-sourcing demand of various OD pairs. Firstly, the model constructs OD graphs, which utilize adjacency matrices to characterize the non-Euclidean pair-wise geographical and semantic correlations among different OD pairs. Secondly, based on the constructed graphs, a residual multi-graph convolutional (RMGC) network is designed to encode the context-aware spatial dependencies, and a long short-term memory (LSTM) network is used to encode the temporal dependencies, into a dense vector space. Finally, we reuse the RMGC networks to decode the compressed vector back to OD graphs and predict the future OD demand. Through extensive experiments on the for-hire-vehicle datasets in Manhattan, New York City, we show that our proposed deep learning framework outperforms the state of the art by a significant margin.
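The core operation behind a multi-graph convolution can be sketched in a few lines of NumPy. The example below sums one graph convolution per OD graph (for instance a geographical graph and a semantic graph) and applies a ReLU. It is an illustrative assumption rather than the ST-ED-RMGC architecture: the symmetric normalization with self-loops follows the common GCN convention, residual connections and the LSTM encoder are omitted, and the function names are hypothetical.

```python
import numpy as np


def normalize_adjacency(adj):
    """Return D^{-1/2} (A + I) D^{-1/2}, the self-loop-augmented normalization."""
    a_hat = adj + np.eye(adj.shape[0])
    d_inv_sqrt = 1.0 / np.sqrt(a_hat.sum(axis=1))
    return a_hat * d_inv_sqrt[:, None] * d_inv_sqrt[None, :]


def multi_graph_conv(x, adjs, weights):
    """One multi-graph convolution layer (sketch).

    x:       (n_nodes, f_in) features; here each node stands for one OD pair
    adjs:    list of (n_nodes, n_nodes) adjacency matrices, one per graph
    weights: list of (f_in, f_out) weight matrices, one per graph
    """
    # Aggregate neighbor features on each graph, then sum across graphs.
    out = sum(normalize_adjacency(a) @ x @ w for a, w in zip(adjs, weights))
    return np.maximum(out, 0.0)  # ReLU


# Three OD pairs, two graphs (hypothetical geographical and semantic links).
rng = np.random.default_rng(0)
features = rng.normal(size=(3, 2))
geo = np.array([[0.0, 1.0, 0.0], [1.0, 0.0, 1.0], [0.0, 1.0, 0.0]])
sem = np.array([[0.0, 0.0, 1.0], [0.0, 0.0, 0.0], [1.0, 0.0, 0.0]])
layer_weights = [rng.normal(size=(2, 4)) for _ in range(2)]
hidden = multi_graph_conv(features, [geo, sem], layer_weights)
```

Keeping a separate weight matrix per graph lets the layer weigh geographical and semantic neighbors differently before the results are combined.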