Researcher profile

Junbo Zhao

Junbo Zhao contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
13works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

13 published item(s)

preprint2026arXiv

Table as a Modality for Large Language Models

To migrate the remarkable successes of Large Language Models (LLMs), the community has made numerous efforts to generalize them to the table reasoning tasks for the widely deployed tabular data. Despite that, in this work, by showing a probing experiment on our proposed StructQA benchmark, we postulate that even the most advanced LLMs (such as GPTs) may still fall short of coping with tabular data. More specifically, the current scheme often simply relies on serializing the tabular data, together with the meta information, then inputting them through the LLMs. We argue that the loss of structural information is the root of this shortcoming. In this work, we further propose TAMO, which bears an ideology to treat the tables as an independent modality integrated with the text tokens. The resulting model in TAMO is a multimodal framework consisting of a hypergraph neural network as the global table encoder seamlessly integrated with the mainstream LLM. Empirical results on various benchmarking datasets, including HiTab, WikiTQ, WikiSQL, FeTaQA, and StructQA, have demonstrated significant improvements on generalization with an average relative gain of 42.65%.

preprint2026arXiv

Training-Trajectory-Aware Token Selection

Efficient distillation is a key pathway for converting expensive reasoning capability into deployable efficiency, yet in the frontier regime where the student already has strong reasoning ability, naive continual distillation often yields limited gains or even degradation. We observe a characteristic training phenomenon: even as loss decreases monotonically, all performance metrics can drop sharply at almost the same bottleneck, before gradually recovering. We further uncover a token-level mechanism: confidence bifurcates into steadily increasing Imitation-Anchor Tokens that quickly anchor optimization and other yet-to-learn tokens whose confidence is suppressed until after the bottleneck. And the characteristic that these two types of tokens cannot coexist is the root cause of the failure in continual distillation. To this end, we propose Training-Trajectory-Aware Token Selection (T3S) to reconstruct the training objective at the token level, clearing the optimization path for yet-to-learn tokens. T3 yields consistent gains in both AR and dLLM settings: with only hundreds of examples, Qwen3-8B surpasses DeepSeek-R1 on competitive reasoning benchmarks, Qwen3-32B approaches Qwen3-235B, and T3-trained LLaDA-2.0-Mini exceeds its AR baseline, achieving state-of-the-art performance among all of 16B-scale no-think models.

preprint2022arXiv

Observers for Differential Algebraic Equation Models of Power Networks: Jointly Estimating Dynamic and Algebraic States

Phasor measurement units ({PMUs}) have become instrumental in modern power systems for enabling real-time, wide-area monitoring and control. Accordingly, many studies have investigated efficient and robust dynamic state estimation (DSE) methods in order to accurately compute the dynamic states of generation units. Nonetheless, most of them forego the dynamic-algebraic nature of power networks and only consider their nonlinear dynamic representations. Motivated by the lack of DSE methods based on power network's differential-algebraic equations (DAEs), this paper develops a novel observer-based DSE framework in order to perform simultaneous estimation of the dynamic and algebraic states of multi-machine power networks. Specifically, we leverage the DAE dynamics of a power network around an operating point and combine them with a PMU-based measurement model capable of capturing bus voltages and line currents. The proposed $\mathcal{H}_{\infty}$ observer, which only requires detectability and impulse observability conditions which are satisfied for various power networks, is designed to handle various noise, unknown inputs, and input sensor failures. The results obtained from performing extensive numerical simulations on the IEEE $9$-bus and $39$-bus systems showcase the effectiveness of the proposed approach for DSE purposes.

preprint2022arXiv

Towards Unifying the Label Space for Aspect- and Sentence-based Sentiment Analysis

The aspect-based sentiment analysis (ABSA) is a fine-grained task that aims to determine the sentiment polarity towards targeted aspect terms occurring in the sentence. The development of the ABSA task is very much hindered by the lack of annotated data. To tackle this, the prior works have studied the possibility of utilizing the sentiment analysis (SA) datasets to assist in training the ABSA model, primarily via pretraining or multi-task learning. In this article, we follow this line, and for the first time, we manage to apply the Pseudo-Label (PL) method to merge the two homogeneous tasks. While it seems straightforward to use generated pseudo labels to handle this case of label granularity unification for two highly related tasks, we identify its major challenge in this paper and propose a novel framework, dubbed as Dual-granularity Pseudo Labeling (DPL). Further, similar to PL, we regard the DPL as a general framework capable of combining other prior methods in the literature. Through extensive experiments, DPL has achieved state-of-the-art performance on standard benchmarks surpassing the prior work significantly.

preprint2020arXiv

Analytic Deep Learning-based Surrogate Model for Operational Planning with Dynamic TTC Constraints

The increased penetration of wind power introduces more operational changes of critical corridors and the traditional time-consuming transient stability constrained total transfer capability (TTC) operational planning is unable to meet the real-time monitoring need. This paper develops a more computationally efficient approach to address that challenge via the analytical deep learning-based surrogate model. The key idea is to resort to the deep learning for developing a computationally cheap surrogate model to replace the original time-consuming differential-algebraic constraints related to TTC. However, the deep learning-based surrogate model introduces implicit rules that are difficult to handle in the optimization process. To this end, we derive the Jacobian and Hessian matrices of the implicit surrogate models and finally transfer them into an analytical formulation that can be easily solved by the interior point method. Surrogate modeling and problem reformulation allow us to achieve significantly improved computational efficiency and the yielded solutions can be used for operational planning. Numerical results carried out on the modified IEEE 39-bus system demonstrate the effectiveness of the proposed method in dealing with com-plicated TTC constraints while balancing the computational efficiency and accuracy.

preprint2020arXiv

Data-driven Optimal Power Flow: A Physics-Informed Machine Learning Approach

This paper proposes a data-driven approach for optimal power flow (OPF) based on the stacked extreme learning machine (SELM) framework. SELM has a fast training speed and does not require the time-consuming parameter tuning process compared with the deep learning algorithms. However, the direct application of SELM for OPF is not tractable due to the complicated relationship between the system operating status and the OPF solutions. To this end, a data-driven OPF regression framework is developed that decomposes the OPF model features into three stages. This not only reduces the learning complexity but also helps correct the learning bias. A sample pre-classification strategy based on active constraint identification is also developed to achieve enhanced feature attractions. Numerical results carried out on IEEE and Polish benchmark systems demonstrate that the proposed method outperforms other alternatives. It is also shown that the proposed method can be easily extended to address different test systems by adjusting only a few hyperparameters.

preprint2020arXiv

Distributed Voltage Regulation of Active Distribution System Based on Enhanced Multi-agent Deep Reinforcement Learning

This paper proposes a data-driven distributed voltage control approach based on the spectrum clustering and the enhanced multi-agent deep reinforcement learning (MADRL) algorithm. Via the unsupervised clustering, the whole distribution system can be decomposed into several sub-networks according to the voltage and reactive power sensitivity. Then, the distributed control problem of each sub-network is modeled as Markov games and solved by the enhanced MADRL algorithm, where each sub-network is modeled as an adaptive agent. Deep neural networks are used in each agent to approximate the policy function and the action value function. All agents are centrally trained to learn the optimal coordinated voltage regulation strategy while executed in a distributed manner to make decisions based on only local information. The proposed method can significantly reduce the requirements of communications and knowledge of system parameters. It also effectively deals with uncertainties and can provide online coordinated control based on the latest local information. Comparison results with other existing model-based and data-driven methods on IEEE 33-bus and 123-bus systems demonstrate the effectiveness and benefits of the proposed approach.

preprint2020arXiv

Dynamic State Estimation for Power System Control and Protection

Dynamic state estimation (DSE) accurately tracks the dynamics of a power system and provides the evolution of the system state in real-time. This paper focuses on the control and protection applications of DSE, comprehensively presenting different facets of control and protection challenges arising in modern power systems. It is demonstrated how these challenges are effectively addressed with DSE-enabled solutions. As precursors to these solutions, reformulation of DSE considering both synchrophasor and sampled value measurements and comprehensive comparisons of DSE and observers have been presented. The usefulness and necessity of DSE based solutions in ensuring system stability, reliable protection and security, and resilience by revamping of control and protection methods are shown through examples, practical applications, and suggestions for further development.

preprint2020arXiv

Model-Free Voltage Regulation of Unbalanced Distribution Network Based on Surrogate Model and Deep Reinforcement Learning

Accurate knowledge of the distribution system topology and parameters is required to achieve good voltage controls, but this is difficult to obtain in practice. This paper develops a model-free approach based on the surrogate model and deep reinforcement learning (DRL). We have also extended it to deal with unbalanced three-phase scenarios. The key idea is to learn a surrogate model to capture the relationship between the power injections and voltage fluctuation of each node from historical data instead of using the original inaccurate model affected by errors and uncertainties. This allows us to integrate the DRL with the learned surrogate model. In particular, DRL is applied to learn the optimal control strategy from the experiences obtained by continuous interactions with the surrogate model. The integrated framework contains training three networks, i.e., surrogate model, actor, and critic networks, which fully leverage the strong nonlinear fitting ability of deep learning and DRL for online decision making. Several single-phase approaches have also been extended to deal with three-phase unbalance scenarios and the simulation results on the IEEE 123-bus system show that our proposed method can achieve similar performance as those that use accurate physical models.

preprint2020arXiv

PIANOTREE VAE: Structured Representation Learning for Polyphonic Music

The dominant approach for music representation learning involves the deep unsupervised model family variational autoencoder (VAE). However, most, if not all, viable attempts on this problem have largely been limited to monophonic music. Normally composed of richer modality and more complex musical structures, the polyphonic counterpart has yet to be addressed in the context of music representation learning. In this work, we propose the PianoTree VAE, a novel tree-structure extension upon VAE aiming to fit the polyphonic music learning. The experiments prove the validity of the PianoTree VAE via (i)-semantically meaningful latent code for polyphonic segments; (ii)-more satisfiable reconstruction aside of decent geometry learned in the latent space; (iii)-this model's benefits to the variety of the downstream music generation.

preprint2020arXiv

Real-time LCC-HVDC Maximum Emergency Power Capacity Estimation Based on Local PMU Measurements

The adjustable capacity of a line-commutated-converter High Voltage Direct Current (LCC-HVDC) connected to a power system, called the LCC-HVDC maximum emergency power capability or HVDC-MC for short, plays an important role in determining the response of that system to a large disturbance. However, it is a challenging task to obtain an accurate HVDC-MC due to system model uncertainties as well as to contingencies. To address this problem, this paper proposes to estimate the HVDC-MC using a Thevenin equivalent (TE) of the system seen at the HVDC terminal bus of connection with the power system, whose parameters are estimated by processing positive-sequences voltages and currents of local synchrophasor measurements. The impacts of TE potential changes on the impedance estimation under large disturbance have been extensively investigated and an adaptive screening process of current measurements is developed to reduce the error of TE impedance estimation. The uncertainties of phasor measurements have been further taken into account by resorting to the total least square estimation method. The limitations of the HVDC control characteristics, the voltage-dependent current order limit, the converter capacity, and the AC voltage on HVDC-MC estimation are also considered. The simulations show that the proposed method can accurately track the dynamics of the TE parameters and the real-time HVDC-MC after the large disturbances.

preprint2017arXiv

Robust Power System Dynamic State Estimator with Non-Gaussian Measurement Noise: Part I--Theory

This paper develops the theoretical framework and the equations of a new robust Generalized Maximum-likelihood-type Unscented Kalman Filter (GM-UKF) that is able to suppress observation and innovation outliers while filtering out non-Gaussian measurement noise. Because the errors of the real and reactive power measurements calculated using Phasor Measurement Units (PMUs) follow long-tailed probability distributions, the conventional UKF provides strongly biased state estimates since it relies on the weighted least squares estimator. By contrast, the state estimates and residuals of our GM-UKF are proved to be roughly Gaussian, allowing the sigma points to reliably approximate the mean and the covariance matrices of the predicted and corrected state vectors. To develop our GM-UKF, we first derive a batch-mode regression form by processing the predictions and observations simultaneously, where the statistical linearization approach is used. We show that the set of equations so derived are equivalent to those of the unscented transformation. Then, a robust GM-estimator that minimizes a convex Huber cost function while using weights calculated via Projection Statistics (PS's) is proposed. The PS's are applied to a two-dimensional matrix that consists of serially correlated predicted state and innovation vectors to detect observation and innovation outliers. These outliers are suppressed by the GM-estimator using the iteratively reweighted least squares algorithm. Finally, the asymptotic error covariance matrix of the GM-UKF state estimates is derived from the total influence function. In the companion paper, extensive simulation results will be shown to verify the effectiveness and robustness of the proposed method.

preprint2017arXiv

Robust Power System Dynamic State Estimator with Non-Gaussian Measurement Noise: Part II--Implementation and Results

This paper is the second of a two-part series that discusses the implementation issues and test results of a robust Unscented Kalman Filter (UKF) for power system dynamic state estimation with non-Gaussian synchrophasor measurement noise. The tuning of the parameters of our Generalized Maximum-Likelihood-type robust UKF (GM-UKF) is presented and discussed in a systematic way. Using simulations carried out on the IEEE 39-bus system, its performance is evaluated under different scenarios, including i) the occurrence of two different types of noises following thick-tailed distributions, namely the Laplace or Cauchy probability distributions for real and reactive power measurements; ii) the occurrence of observation and innovation outliers; iii) the occurrence of PMU measurement losses due to communication failures; iv) cyber attacks; and v) strong system nonlinearities. It is also compared to the UKF and the Generalized Maximum-Likelihood-type robust iterated EKF (GM-IEKF). Simulation results reveal that the GM-UKF outperforms the GM-IEKF and the UKF in all scenarios considered. In particular, when the system is operating under stressed conditions, inducing system nonlinearities, the GM-IEKF and the UKF diverge while our GM-UKF does converge. In addition, when the power measurement noises obey a Cauchy distribution, our GM-UKF converges to a state estimate vector that exhibits a much higher statistical efficiency than that of the GM-IEKF; by contrast, the UKF fails to converge. Finally, potential applications and future work of the proposed GM-UKF are discussed in concluding remarks section.