Researcher profile

Shi Chen

Shi Chen contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
18works
0followers
18topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

18 published item(s)

preprint2026arXiv

MooD: Perception-Enhanced Efficient Affective Image Editing via Continuous Valence-Arousal Modeling

Affective Image Editing (AIE) aims to modify visual content to evoke targeted emotions. Although current approaches achieve impressive editing quality, they often overlook inference efficiency, which limits their applicability in computational social scenarios. Moreover, most methods depend on discrete emotion representations, which hinder the continuous modeling of complex human emotions and constrain expressive capabilities in interactive scenarios. To tackle these gaps, we propose MooD, the first framework that directly leverages continuous Valence-Arousal (VA) values as editing instruction for fine-grained and efficient AIE in computational social systems. Specifically, we first introduce a VA-Aware retrieval strategy to bridge vague affective values and detailed visual semantics. Building upon this, MooD integrates visual transfer and perception-enhanced semantic guidance to achieve controllable AIE. Furthermore, considering that existing VA-annotated datasets mainly focus on social scenarios and largely overlook natural scenes, we therefore construct AffectSet, a comprehensive VA-annotated dataset covering diverse scenarios, to support model optimization and evaluation. Extensive qualitative and quantitative experimental results demonstrate that our MooD achieves superior performance in both affective controllability and visual fidelity while maintaining high efficiency. A series of ablation studies further reveal the crucial factors of our design.

preprint2025arXiv

Ultrahigh-Energy Gamma-ray Emission Associated with Black Hole-Jet Systems

Black holes (BH), one of the most intriguing objects in the universe, can manifest themselves through electromagnetic radiation initiated by the accretion flow. Some stellar-mass BHs drive relativistic jets when accreting matter from their companion stars, forming microquasars. Non-thermal emission from the radio to tera-electronvolt (TeV) gamma-ray band has been observed from microquasars, indicating the acceleration of relativistic particles. Here we report detection of four microquasars (SS 433, V4641 Sgr, GRS 1915+105, MAXI J1820+070) of spectrum extending to the ultrahigh-energy (UHE; photon energy $E>100$ TeV) band and one microquasar (Cygnus X-1) of spectrum approaching 100 TeV, using the Large High Altitude Air Shower Observatory (LHAASO). Notably, the total emission associated with SS 433 cannot be interpreted with a single leptonic component. In the UHE band, its emission is in spatial coincidence with a giant atomic cloud, which is consistent with a hadronic origin. An elongated source is discovered from V4641 Sgr with the spectrum continuing up to 800 TeV. The detection of UHE gamma rays demonstrates that accreting BHs and their environments can operate as extremely efficient accelerators of particles out of 1 peta-electronvolt (PeV), suggesting microquasars to be important contributors to Galactic cosmic rays especially around the `knee' region.

preprint2024arXiv

A Pure Integral-Type PLL with a Damping Branch to Enhance the Stability of Grid-Tied Inverter under Weak Grids

In a phase-locked loop (PLL) synchronized inverter, due to the strong nonlinear coupling between the PLL's parame-ters and the operation power angle, the equivalent damping coefficient will quickly deteriorate while the power angle is close to 90° under an ultra-weak grid, which causes the synchronous instability. To address this issue, in this letter, a pure integral-type phase-locked loop (IPLL) with a damping branch is proposed to replace the traditional PI-type PLL. The equivalent damping coefficient of an IPLL-synchronized inverter is decoupled with the steady-state power angle. As a result, the IPLL-synchronized inverter can stably operate under an ultra-weak grid when the equilibrium point exists. Finally, time-domain simulation results verify the effectiveness and correctness of the proposed IPLL.

preprint2023arXiv

Extended Load Flexibility of Utility-Scale P2H Plants: Optimal Production Scheduling Considering Dynamic Thermal and HTO Impurity Effects

In the conversion toward a clear and sustainable energy system, the flexibility of power-to-hydrogen (P2H) production enables the admittance of volatile renewable energies on a utility scale and provides the connected electrical power system with ancillary services. To extend the load flexibility and thus improve the profitability of green hydrogen production, this paper presents an optimal production scheduling approach for utility-scale P2H plants composed of multiple alkaline electrolyzers. Unlike existing works, this work discards the conservative constant steady-state constraints and first leverages the dynamic thermal and hydrogen-to-oxygen (HTO) impurity crossover processes of electrolyzers. Doing this optimizes their effects on the loading range and energy conversion efficiency, therefore improving the load flexibility of P2H production. The proposed multiphysics-aware scheduling model is formulated as mixed-integer linear programming (MILP). It coordinates the electrolyzers' operation state transitions and load allocation subject to comprehensive thermodynamic and mass transfer constraints. A decomposition-based solution method, SDM-GS-ALM, is followingly adopted to address the scalability issue for scheduling large-scale P2H plants composed of tens of electrolyzers. With an experiment-verified dynamic electrolyzer model, case studies up to 22 electrolyzers show that the proposed method remarkably improves the hydrogen output and profit of P2H production powered by either solar or wind energy compared to the existing scheduling approach.

preprint2022arXiv

Attention in Reasoning: Dataset, Analysis, and Modeling

While attention has been an increasingly popular component in deep neural networks to both interpret and boost the performance of models, little work has examined how attention progresses to accomplish a task and whether it is reasonable. In this work, we propose an Attention with Reasoning capability (AiR) framework that uses attention to understand and improve the process leading to task outcomes. We first define an evaluation metric based on a sequence of atomic reasoning operations, enabling a quantitative measurement of attention that considers the reasoning process. We then collect human eye-tracking and answer correctness data, and analyze various machine and human attention mechanisms on their reasoning capability and how they impact task performance. To improve the attention and reasoning ability of visual question answering models, we propose to supervise the learning of attention progressively along the reasoning process and to differentiate the correct and incorrect attention patterns. We demonstrate the effectiveness of the proposed framework in analyzing and modeling attention with better reasoning capability and task performance. The code and data are available at https://github.com/szzexpoi/AiR

preprint2022arXiv

High-frequency limit of the inverse scattering problem: asymptotic convergence from inverse Helmholtz to inverse Liouville

We investigate the asymptotic relation between the inverse problems relying on the Helmholtz equation and the radiative transfer equation (RTE) as physical models, in the high-frequency limit. In particular, we evaluate the asymptotic convergence of a generalized version of inverse scattering problem based on the Helmholtz equation, to the inverse scattering problem of the Liouville equation (a simplified version of RTE). The two inverse problems are connected through the Wigner transform that translates the wave-type description on the physical space to the kinetic-type description on the phase space, and the Husimi transform that models data localized both in location and direction. The finding suggests that impinging tightly concentrated monochromatic beams can indeed provide stable reconstruction of the medium, asymptotically in the high-frequency regime. This fact stands in contrast with the unstable reconstruction for the classical inverse scattering problem when the probing signals are plane-waves.

preprint2022arXiv

Low-rank approximation for multiscale PDEs

Historically, analysis for multiscale PDEs is largely unified while numerical schemes tend to be equation-specific. In this paper, we propose a unified framework for computing multiscale problems through random sampling. This is achieved by incorporating randomized SVD solvers and manifold learning techniques to numerically reconstruct the low-rank features of multiscale PDEs. We use multiscale radiative transfer equation and elliptic equation with rough media to showcase the application of this framework.

preprint2022arXiv

NTIRE 2022 Challenge on Efficient Super-Resolution: Methods and Results

This paper reviews the NTIRE 2022 challenge on efficient single image super-resolution with focus on the proposed solutions and results. The task of the challenge was to super-resolve an input image with a magnification factor of $\times$4 based on pairs of low and corresponding high resolution images. The aim was to design a network for single image super-resolution that achieved improvement of efficiency measured according to several metrics including runtime, parameters, FLOPs, activations, and memory consumption while at least maintaining the PSNR of 29.00dB on DIV2K validation set. IMDN is set as the baseline for efficiency measurement. The challenge had 3 tracks including the main track (runtime), sub-track one (model complexity), and sub-track two (overall performance). In the main track, the practical runtime performance of the submissions was evaluated. The rank of the teams were determined directly by the absolute value of the average runtime on the validation set and test set. In sub-track one, the number of parameters and FLOPs were considered. And the individual rankings of the two metrics were summed up to determine a final ranking in this track. In sub-track two, all of the five metrics mentioned in the description of the challenge including runtime, parameter count, FLOPs, activations, and memory consumption were considered. Similar to sub-track one, the rankings of five metrics were summed up to determine a final ranking. The challenge had 303 registered participants, and 43 teams made valid submissions. They gauge the state-of-the-art in efficient single image super-resolution.

preprint2022arXiv

REX: Reasoning-aware and Grounded Explanation

Effectiveness and interpretability are two essential properties for trustworthy AI systems. Most recent studies in visual reasoning are dedicated to improving the accuracy of predicted answers, and less attention is paid to explaining the rationales behind the decisions. As a result, they commonly take advantage of spurious biases instead of actually reasoning on the visual-textual data, and have yet developed the capability to explain their decision making by considering key information from both modalities. This paper aims to close the gap from three distinct perspectives: first, we define a new type of multi-modal explanations that explain the decisions by progressively traversing the reasoning process and grounding keywords in the images. We develop a functional program to sequentially execute different reasoning steps and construct a new dataset with 1,040,830 multi-modal explanations. Second, we identify the critical need to tightly couple important components across the visual and textual modalities for explaining the decisions, and propose a novel explanation generation method that explicitly models the pairwise correspondence between words and regions of interest. It improves the visual grounding capability by a considerable margin, resulting in enhanced interpretability and reasoning performance. Finally, with our new data and method, we perform extensive analyses to study the effectiveness of our explanation under different settings, including multi-task learning and transfer learning. Our code and data are available at https://github.com/szzexpoi/rex.

preprint2022arXiv

Skyrmions in a magnetic field and $π^0$ domain wall formation in dense nuclear matter

We elucidate magnetic effects in the skyrmion system to probe into the dense nuclear matter. We find a deformed $π^0$ dipole structure of a baryon induced by a magnetic field. We then extend our scope to stacked Skyrme crystal layers to scrutinize phases of magnetized nuclear matter. We observe a quantized magnetic flux and identify a phase transition from a crystalline state to a $π^0$ domain wall corresponding to a topological transmutation from $π_3(S^3)$ to $π_1(S^1)$. We establish the phase diagram, which could be explored also in analogous systems with two-component Bose-Einstein condensates.

preprint2021arXiv

Classical limit for the varying-mass Schrödinger equation with random inhomogeneities

The varying-mass Schrödinger equation (VMSE) has been successfully applied to model electronic properties of semiconductor hetero-structures, for example, quantum dots and quantum wells. In this paper, we consider VMSE with small random heterogeneities, and derive a radiative transfer equation as its asymptotic limit. The main tool is to systematically apply the Wigner transform in the classical regime when the rescaled Planck constant $ε\ll 1$, and expand the Wigner equation to proper orders of $ε$. As a proof of concept, we numerically compute both VMSE and its limiting radiative transfer equation, and show that their solutions agree well in the classical regime.

preprint2021arXiv

Semi-classical limit of an inverse problem for the Schrödinger equation

It is a classical derivation that the Wigner equation, derived from the Schrödinger equation that contains the quantum information, converges to the Liouville equation when the rescaled Planck constant $ε\to0$. Since the latter presents the Newton's second law, the process is typically termed the (semi-)classical limit. In this paper, we study the classical limit of an inverse problem for the Schrödinger equation. More specifically, we show that using the initial condition and final state of the Schrödinger equation to reconstruct the potential term, in the classical regime with $ε\to0$, becomes using the initial and final state to reconstruct the potential term in the Liouville equation. This formally bridges an inverse problem in quantum mechanics with an inverse problem in classical mechanics.

preprint2021arXiv

Simulation on the Transparency of Electrons and Ion Back Flow for a Time Projection Chamber based on Staggered Multiple THGEMs

The IBF and the transparent rate of electrons are two essential indicators of TPC, which affect the energy resolution and counting rate respectively. In this paper, we propose several novel strategies of staggered multi-THGEM to suppress IBF, where the geometry of the first layer THGEM will be optimized to increase the electron transparent rate. By Garfield++ simulation, the electron transparency rate can be more than 90% of single THGEM with a optimized large hole. By simulating these configurations of triple and quadruple THGEM structures, we conclude that the IBF can be reduced to 0.2% level in an optimized configuration denoted as "ACBA". This strategy for staggered THGEM could have potential applications in future TPC projects.

preprint2020arXiv

AiR: Attention with Reasoning Capability

While attention has been an increasingly popular component in deep neural networks to both interpret and boost performance of models, little work has examined how attention progresses to accomplish a task and whether it is reasonable. In this work, we propose an Attention with Reasoning capability (AiR) framework that uses attention to understand and improve the process leading to task outcomes. We first define an evaluation metric based on a sequence of atomic reasoning operations, enabling quantitative measurement of attention that considers the reasoning process. We then collect human eye-tracking and answer correctness data, and analyze various machine and human attentions on their reasoning capability and how they impact task performance. Furthermore, we propose a supervision method to jointly and progressively optimize attention, reasoning, and task performance so that models learn to look at regions of interests by following a reasoning process. We demonstrate the effectiveness of the proposed framework in analyzing and modeling attention with better reasoning capability and task performance. The code and data are available at https://github.com/szzexpoi/AiR

preprint2020arXiv

Deconfinement and CP-breaking at $θ=π$ in Yang-Mills theories and a novel phase for SU(2)

We discuss the deconfinement and the CP-breaking phase transitions at $θ=π$ in Yang-Mills theories. The &#39;t Hooft anomaly matching prohibits the confined phase with CP symmetry and requires $T_{dec}(θ=π) \le T_{CP}$, where $T_{dec}(θ=π)$ and $T_{CP}$ denote the deconfinement and the CP-restoration temperatures, respectively, at $θ=π$. We analytically study these two phase transitions in softly-broken $\mathcal{N}=1$ supersymmetric Yang-Mills theories on small $\mathbb{R}^3\times S^1$ with the periodic boundary condition for gluinos. For most gauge groups except SU(2) in this model, we find that the inequality is saturated, so deconfinement and CP restoration occur simultaneously. We demonstrate special features of the SU(2) gauge theory: There is a finite window of two temperatures, $T_{dec}(π)<T_{CP}$, which indicates the existence of a novel CP-broken deconfined phase. We also discuss an implication of the novel phase for domain walls and their junctions.

preprint2020arXiv

Regularization Approach for Network Modeling of German Power Derivative Market

In this paper we propose a regularization approach for network modeling of German power derivative market. To deal with the large portfolio, we combine high-dimensional variable selection techniques with dynamic network analysis. The estimated sparse interconnectedness of the full German power derivative market, clearly identify the significant channels of relevant potential risk spillovers. Our empirical findings show the importance of interdependence between different contract types, and identify the main risk contributors. We further observe strong pairwise interconnections between the neighboring contracts especially for the spot contracts trading in the peak hours, its implications for regulators and investors are also discussed. The network analysis of the full German power derivative market helps us to complement a full picture of system risk, and have a better understanding of the German power market functioning and environment.

preprint2020arXiv

Strong field frustrated double ionization of argon atoms

Using a three-dimensional semiclassical method, we theoretically investigate frustrated double ionization (FDI) of Ar atoms subjected to strong laser fields. The double-hump photoelectron momentum distribution generated from FDI observed in a recent experiment [S. Larimian et al., Phys. Rev. Research 2, 013021 (2020)] is reproduced by our simulation. We confirm that the observed spectrum is due to recollision. The laser intensity dependence of FDI is investigated. We reveal that the doubly excited states of Ar atoms and excited states of Ar+ are the dominant pathways for producing FDI at relatively low and high intensities, respectively. Our work demonstrates that at modest intensities, FDI is a general strong-field physical process accompanied with nonsequential double ionization and it is an important consequence of recollision.

preprint2020arXiv

Triaging moderate COVID-19 and other viral pneumonias from routine blood tests

The COVID-19 is sweeping the world with deadly consequences. Its contagious nature and clinical similarity to other pneumonias make separating subjects contracted with COVID-19 and non-COVID-19 viral pneumonia a priority and a challenge. However, COVID-19 testing has been greatly limited by the availability and cost of existing methods, even in developed countries like the US. Intrigued by the wide availability of routine blood tests, we propose to leverage them for COVID-19 testing using the power of machine learning. Two proven-robust machine learning model families, random forests (RFs) and support vector machines (SVMs), are employed to tackle the challenge. Trained on blood data from 208 moderate COVID-19 subjects and 86 subjects with non-COVID-19 moderate viral pneumonia, the best result is obtained in an SVM-based classifier with an accuracy of 84%, a sensitivity of 88%, a specificity of 80%, and a precision of 92%. The results are found explainable from both machine learning and medical perspectives. A privacy-protected web portal is set up to help medical personnel in their practice and the trained models are released for developers to further build other applications. We hope our results can help the world fight this pandemic and welcome clinical verification of our approach on larger populations.