Source author record

Zheng Sun

Zheng Sun appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

hep-th, hep-ph, cond-mat.mes-hall, math.NA, Numerical Analysis, physics.optics, Computer Vision, quant-ph, Artificial Intelligence, astro-ph.CO, Computation and Language, cond-mat.mtrl-sci, cond-mat.other, cond-mat.stat-mech, cond-mat.str-el, gr-qc, Machine Learning, physics.comp-ph, q-fin.PM

Catalog footprint

What is connected

24 works

19 topics

4 close collaborators


preprint · 2026 · arXiv

PAGER: Bridging the Semantic-Execution Gap in Point-Precise Geometric GUI Control

Large vision-language models have significantly advanced GUI agents, enabling executable interaction across web, mobile, and desktop interfaces. Yet these gains largely rely on a forgiving region-tolerant paradigm, where many nearby pixels inside the same component remain valid. Precise geometric construction breaks this assumption: actions must land on points in continuous canvas space rather than tolerant regions. Because geometric primitives carry ontological dependencies, a local coordinate error can induce cascading topological failures that distort downstream objects and invalidate the final construction. We identify this regime as precision-sensitive GUI tasks, requiring point-level accuracy, geometry-aware verification, and robustness to dependency-driven error propagation. To benchmark it, we introduce PAGE Bench, with 4,906 problems and over 224K process-supervised, pixel-level GUI actions. We further propose PAGER, a topology-aware agent that decomposes construction into dependency-structured planning and pixel-level execution. Pixel-grounded supervised tuning establishes executable action grammar, while precision-aligned reinforcement learning mitigates rollout-induced exposure bias through state-conditioned geometric feedback. Experiments reveal a pronounced Semantic-Execution Gap: general multimodal models can exceed 88% action type accuracy yet remain below 6% task success. PAGER closes this gap, delivering 4.1x higher task success than the strongest evaluated general baseline and raising step success rate from below 9% for GUI-specialized agents to over 62%, establishing a new state of the art for point-precise GUI control.

preprint · 2022 · arXiv

On Energy Laws and Stability of Runge--Kutta Methods for Linear Seminegative Problems

This paper presents a systematic theoretical framework to derive the energy identities of general implicit and explicit Runge--Kutta (RK) methods for linear seminegative systems. It generalizes the stability analysis of explicit RK methods in [Z. Sun and C.-W. Shu, SIAM J. Numer. Anal., 57 (2019), pp. 1158-1182]. The established energy identities provide a precise characterization of whether and how the energy dissipates in the RK discretization, thereby leading to weak and strong stability criteria of RK methods. Furthermore, we discover a unified energy identity for all the diagonal Padé approximations, based on an analytical Cholesky-type decomposition of a class of symmetric matrices. The structure of the matrices is very complicated, rendering the discovery of the unified energy identity and the proof of the decomposition highly challenging. Our proofs involve the construction of technical combinatorial identities and novel techniques from the theory of hypergeometric series. Our framework is motivated by a discrete analogue of the integration-by-parts technique and a series expansion of the continuous energy law. In some special cases, our analyses establish a close connection between the continuous and discrete energy laws, enhancing our understanding of their intrinsic mechanisms. Several specific examples of implicit methods are given to illustrate the discrete energy laws. A few numerical examples further confirm the theoretical properties.

preprint · 2022 · arXiv

Supersymmetry and R-symmetries in Wess-Zumino models: properties and model dataset construction

The Nelson-Seiberg theorem and its extensions relate supersymmetry breaking and R-symmetries in Wess-Zumino models. However, their applicability may be limited by previously found non-generic counterexamples. Constructing a dataset of R-symmetric Wess-Zumino models is useful for studying the occurrence of such counterexamples as well as for other purposes. This work gives a pedagogical review of the basics of supersymmetry in (3+1) dimensions, Wess-Zumino models and their supergravity extensions, and the Nelson-Seiberg theorem and its extensions. We present a preliminary construction of the dataset of R-symmetric Wess-Zumino models with up to 5 chiral fields. Among 925 models in total, 20 with non-generic R-charges are counterexamples to both the Nelson-Seiberg theorem and its extensions. Thus the dataset gives an estimate of the accuracy of the field counting method based on these theorems. More constructions and applications of the dataset are expected in future work.

preprint · 2022 · arXiv

TSRFormer: Table Structure Recognition with Transformers

We present a new table structure recognition (TSR) approach, called TSRFormer, to robustly recognize the structures of complex tables with geometrical distortions from various table images. Unlike previous methods, we formulate table separation line prediction as a line regression problem instead of an image segmentation problem and propose a new two-stage DETR-based separator prediction approach, dubbed Separator REgression TRansformer (SepRETR), to predict separation lines from table images directly. To make the two-stage DETR framework work efficiently and effectively for the separation line prediction task, we propose two improvements: 1) a prior-enhanced matching strategy to solve the slow convergence issue of DETR; 2) a new cross-attention module to sample features from a high-resolution convolutional feature map directly so that high localization accuracy is achieved with low computational cost. After separation line prediction, a simple relation-network-based cell merging module is used to recover spanning cells. With these new techniques, our TSRFormer achieves state-of-the-art performance on several benchmark datasets, including SciTSR, PubTabNet and WTW. Furthermore, we have validated the robustness of our approach to tables with complex structures, borderless cells, large blank spaces, empty or spanning cells, as well as distorted or even curved shapes, on a more challenging real-world in-house dataset.

preprint · 2021 · arXiv

A formal notion of genericity and term-by-term vanishing superpotentials at supersymmetric vacua from R-symmetric Wess-Zumino models

It is known in previous literature that if a Wess-Zumino model with an R-symmetry gives a supersymmetric vacuum, the superpotential vanishes at the vacuum. In this work, we establish a formal notion of genericity, and show that if the R-symmetric superpotential has generic coefficients, the superpotential vanishes term-by-term at a supersymmetric vacuum. This result constrains the form of the superpotential which leads to a supersymmetric vacuum. It may contribute to a refined classification of R-symmetric Wess-Zumino models, and find applications in string constructions of vacua with small superpotentials. A similar result for a scalar potential system with a scaling symmetry is discussed.

preprint · 2021 · arXiv

Femtosecond dynamics of a polariton bosonic cascade at room temperature

Whispering gallery modes in a microwire are characterized by a nearly equidistant energy spectrum. In the strong exciton-photon coupling regime, this system represents a bosonic cascade: a ladder of discrete energy levels that sustains stimulated transitions between neighboring steps. In this work, using a femtosecond angle-resolved spectroscopic imaging technique, the ultrafast dynamics of polaritons in a bosonic cascade based on a one-dimensional ZnO whispering gallery microcavity is explicitly visualized. A clear ladder-form build-up process from higher to lower energy branches of the polariton condensates is observed, which is well reproduced by modeling using rate equations. Moreover, the polariton parametric scattering dynamics are distinguished on a timescale of hundreds of femtoseconds. Our understanding of the femtosecond condensation and scattering dynamics paves the way towards ultrafast coherent control of polaritons at room temperature, which is promising for high-speed all-optical integrated applications.

preprint · 2020 · arXiv

Error analysis of Runge--Kutta discontinuous Galerkin methods for linear time-dependent partial differential equations

In this paper, we present error estimates of fully discrete Runge--Kutta discontinuous Galerkin (DG) schemes for linear time-dependent partial differential equations. The analysis applies to explicit Runge--Kutta time discretizations of any order. For spatial discretization, a general discrete operator is considered, which covers various DG methods, such as the upwind-biased DG method, the central DG method, the local DG method and the ultra-weak DG method. We obtain error estimates for stable and consistent fully discrete schemes, if the solution is sufficiently smooth and a spatial operator with certain properties exists. Applications to schemes for hyperbolic conservation laws, the heat equation, the dispersive equation and the wave equation are discussed. In particular, we provide an alternative proof of optimal error estimates of local DG methods for equations with high order derivatives in one dimension, which does not rely on energy inequalities of auxiliary unknowns.

preprint · 2020 · arXiv

Joint Bilateral Learning for Real-time Universal Photorealistic Style Transfer

Photorealistic style transfer is the task of transferring the artistic style of an image onto a content target, producing a result that is plausibly taken with a camera. Recent approaches, based on deep neural networks, produce impressive results but are either too slow to run at practical resolutions, or still contain objectionable artifacts. We propose a new end-to-end model for photorealistic style transfer that is both fast and inherently generates photorealistic results. The core of our approach is a feed-forward neural network that learns local edge-aware affine transforms that automatically obey the photorealism constraint. When trained on a diverse set of images and a variety of styles, our model can robustly apply style transfer to an arbitrary pair of input images. Compared to the state of the art, our method produces visually superior results and is three orders of magnitude faster, enabling real-time performance at 4K on a mobile phone. We validate our method with ablation and user studies.

preprint · 2020 · arXiv

Observation of the Interlayer Exciton Gases in WSe$_2$ -p: WSe$_2$ Heterostructures

Interlayer excitons (IXs) possess a much longer lifetime than intralayer excitons due to the spatial separation of the electrons and holes; hence, they have been pursued to create exciton condensates for decades. The recent emergence of two-dimensional (2D) materials, such as transition metal dichalcogenides (TMDs), and of their van der Waals heterostructures (HSs), in which two different 2D materials are layered together, has created new opportunities to study IXs. Here we present the observation of IX gases within two stacked structures consisting of hBN/WSe$_2$/hBN/p: WSe$_2$/hBN. The IX energy of the two different structures differed by 82 meV due to the different thickness of the hBN spacer layer between the TMD layers. We demonstrate that the lifetime of the IXs is shortened when the temperature and the pump power increase. We attribute this nonlinear behavior to an Auger process.

preprint · 2020 · arXiv

Predicting quantum many-body dynamics with transferable neural networks

Machine learning (ML) architectures such as convolutional neural networks (CNNs) have garnered considerable recent attention in the study of quantum many-body systems. However, advanced ML approaches such as transfer learning have seldom been applied in such contexts. Here we demonstrate that an efficient and transferable sequence learning framework based on a simple recurrent unit (SRU) is capable of learning and accurately predicting the time evolution of the one-dimensional (1D) Ising model with simultaneous transverse and parallel magnetic fields, as quantitatively corroborated by relative entropy measurements and magnetization between the predicted and exact state distributions. At a cost of constant computational complexity, a larger many-body state evolution was predicted in an autoregressive way from just one initial state, without any guidance or knowledge of any Hamiltonian. Our work paves the way for future applications of advanced ML methods in quantum many-body dynamics with knowledge only from a smaller system.

preprint · 2020 · arXiv

The Nelson-Seiberg theorem generalized with nonpolynomial superpotentials

The Nelson-Seiberg theorem relates R-symmetries to F-term supersymmetry breaking, and provides a guiding rule for new physics model building beyond the Standard Model. A revision of the theorem gives a necessary and sufficient condition for supersymmetry breaking in models with polynomial superpotentials. This work revisits the theorem to include models with nonpolynomial superpotentials. With a generic R-symmetric superpotential, a singularity at the origin of the field space implies both R-symmetry breaking and supersymmetry breaking. We give a generalized necessary and sufficient condition for supersymmetry breaking which applies to both perturbative and nonperturbative models.

preprint · 2019 · arXiv

On structure-preserving discontinuous Galerkin methods for Hamiltonian partial differential equations: Energy conservation and multi-symplecticity

In this paper, we present and study discontinuous Galerkin (DG) methods for one-dimensional multi-symplectic Hamiltonian partial differential equations. We particularly focus on semi-discrete schemes with spatial discretization only, and show that the proposed DG methods can simultaneously preserve the multi-symplectic structure and energy conservation with a general class of numerical fluxes, which includes the well-known central and alternating fluxes. Applications to the wave equation, the Benjamin-Bona-Mahony equation, the Camassa-Holm equation, the Korteweg-de Vries equation and the nonlinear Schrödinger equation are discussed. Some numerical results are provided to demonstrate the accuracy and long time behavior of the proposed methods. Numerically, we observe that certain choices of numerical fluxes in the discussed class may help achieve better accuracy compared with the commonly used ones including the central fluxes.

preprint · 2016 · arXiv

Deep Recurrent Convolutional Neural Network: Improving Performance For Speech Recognition

Deep learning approaches have been widely applied to sequence modeling problems. In automatic speech recognition (ASR), performance has improved significantly with larger speech corpora and deeper neural networks. In particular, recurrent neural networks and deep convolutional neural networks have been applied to ASR successfully. Given the emerging problem of training speed, we build a novel deep recurrent convolutional network for acoustic modeling and then apply deep residual learning to it. Our experiments show that it has not only faster convergence but also better recognition accuracy than a traditional deep convolutional recurrent network. In the experiments, we compare the convergence speed of our novel deep recurrent convolutional networks and traditional deep convolutional recurrent networks. With faster convergence, our novel deep recurrent convolutional networks reach comparable performance. We further show that applying deep residual learning can boost the convergence speed of our novel deep recurrent convolutional networks. Finally, we evaluate all our experimental networks by phoneme error rate (PER) with our proposed bidirectional statistical n-gram language model. Our evaluation results show that our newly proposed deep recurrent convolutional network with deep residual learning reaches the best PER of 17.33% with the fastest convergence speed on the TIMIT database. The outstanding performance of our novel deep recurrent convolutional neural network with deep residual learning indicates that it could potentially be adopted for other sequential problems.

preprint · 2015 · arXiv

The Renormalizable Three-Term Polynomial Inflation with Large Tensor-to-Scalar Ratio

We systematically study the renormalizable three-term polynomial inflation in the supersymmetric and non-supersymmetric models. The supersymmetric inflaton potentials can be realized in supergravity theory, and only have two independent parameters. We show that the general renormalizable supergravity model is equivalent to one kind of our supersymmetric models. We find that the spectral index and tensor-to-scalar ratio can be consistent with the Planck and BICEP2 results, but the running of spectral index is always out of the $2\sigma$ range. If we do not consider the BICEP2 experiment, these inflationary models can be highly consistent with the Planck observations and saturate its upper bound on the tensor-to-scalar ratio ($r \le 0.11$). Thus, our models can be tested at the future Planck and QUBIC experiments.

preprint · 2015 · arXiv

Weighted Elastic Net Penalized Mean-Variance Portfolio Design and Computation

It is well known that the out-of-sample performance of Markowitz's mean-variance portfolio criterion can be negatively affected by estimation errors in the mean and covariance. In this paper we address the problem by regularizing the mean-variance objective function with a weighted elastic net penalty. We show that the use of this penalty can be motivated by a robust reformulation of the mean-variance criterion that directly accounts for parameter uncertainty. With this interpretation of the weighted elastic net penalty we derive data driven techniques for calibrating the weighting parameters based on the level of uncertainty in the parameter estimates. We test our proposed technique on US stock return data and our results show that the calibrated weighted elastic net penalized portfolio outperforms both the unpenalized portfolio and uniformly weighted elastic net penalized portfolio. This paper also introduces a novel Adaptive Support Split-Bregman approach which leverages the sparse nature of $\ell_{1}$ penalized portfolios to efficiently compute a solution of our proposed portfolio criterion. Numerical results show that this modification to the Split-Bregman algorithm results in significant improvements in computational speed compared with other techniques.

preprint · 2014 · arXiv

Preferred hierarchy scales from the product landscape

The product landscape method has been recently proposed to solve hierarchy problems such as the cosmological constant problem. We suggest that the parameter distribution on logarithmic scales should be used as a benchmark for hierarchy, and the preferred hierarchy scales can be obtained from the distribution peak. It is shown that generating hierarchy from purely product distribution is very inefficient. To achieve a reasonably acceptable efficiency, other effects such as accumulation of weak hierarchy in the effective theory should be incorporated.

preprint · 2014 · arXiv

Strong light-matter coupling in two-dimensional atomic crystals

Two-dimensional (2D) atomic crystals of graphene and transition metal dichalcogenides have emerged as a class of materials that show strong light-matter interaction. This interaction can be further controlled by embedding such materials into optical microcavities. When the interaction is engineered to be stronger than the dissipation of light and matter entities, one approaches the strong coupling regime, resulting in the formation of half-light half-matter bosonic quasiparticles called microcavity polaritons. Here we report evidence of strong light-matter coupling and formation of microcavity polaritons in a two-dimensional atomic crystal of molybdenum disulphide (MoS2) embedded inside a dielectric microcavity at room temperature. A Rabi splitting of 46 meV and highly directional emission are observed from the MoS2 microcavity owing to the coupling between the 2D excitons and the cavity photons. Realizing strong coupling effects at room temperature in a disorder-free potential landscape is central to the development of practical polaritonic circuits and switches.

preprint · 2014 · arXiv

The Nelson-Seiberg theorem revised

The well-accepted Nelson-Seiberg theorem relates R-symmetries to supersymmetry (SUSY) breaking vacua, and provides a guideline for SUSY model building which is the most promising physics beyond the Standard Model. In the case of Wess-Zumino models with perturbative superpotentials, we revise the theorem to a combined necessary and sufficient condition for SUSY breaking which can be easily checked before solving the vacuum. The revised theorem provides a powerful tool to construct either SUSY breaking or SUSY vacua, and offers many practicable applications in low energy SUSY model building and string phenomenology.

preprint · 2012 · arXiv

Effect of Monolayer Thickness Fluctuations on Coherent Exciton Coupling in Single Quantum Wells

Monolayer fluctuations in the thickness of a semiconductor quantum well (QW) lead to three types of excitons, located in the narrower, average and thicker regions of the QW, which are clearly resolved in optical spectra. Whether or not these excitons are coherently coupled via Coulomb interactions is a long-standing debate. We demonstrate that different types of disorder in QWs distinctly affect the coherent coupling and that the coupling strength can be quantitatively measured using optical two-dimensional Fourier transform spectroscopy. We prove experimentally and theoretically that in narrow quantum wells the coherent coupling occurs predominantly between excitons residing in the disorder-free areas of the QWs and those residing in the plateau-type disorder. In contrast, excitons localized in the fault-type disorder potentials do not coherently couple to other excitons.

preprint · 2012 · arXiv

Low energy supersymmetry from R-symmetries

In a generic setting of Wess-Zumino models, we prove that the existence of a supersymmetric vacuum with a vanishing superpotential can be a consequence of a continuous or discrete R-symmetry when invariant fields are not less than fields transforming in the same way as the superpotential under the R-symmetry. The realization in string theory is discussed. We show that a rich landscape of low energy supersymmetric vacua can be found in the Type IIB flux compactification setup ready for the KKLT construction of de Sitter vacua in string theory.

preprint · 2011 · arXiv

Continuous degeneracy of non-supersymmetric vacua

In global supersymmetric Wess-Zumino models with minimal Kähler potentials, F-type supersymmetry breaking always yields instability or continuous degeneracy of non-supersymmetric vacua. As a generalization of O'Raifeartaigh's original result, the existence of instability or degeneracy is true to any higher order corrections at tree level for models even with non-renormalizable superpotentials. The degeneracy generically coincides with the R-axion direction under some assumptions of R-charge assignment, but generally requires neither R-symmetries nor any assumption of generic superpotentials. The result also confirms the well-known fact that tree level supersymmetry breaking is a very rare occurrence in global supersymmetric theories with minimal Kähler potentials. The implication for the effective field theory method in the landscape is discussed, and we point out that choosing models with minimal Kähler potentials may result in unexpected answers to the vacuum statistics. Supergravity theories or theories with non-minimal Kähler potentials in general do not suffer from the existence of instability or degeneracy. But very strong gauge dynamics or small compactification dimension reduces the Kähler potential from non-minimal to minimal, and the gravity decoupling limit reduces supergravity to global supersymmetry. Instability or degeneracy may appear in these limits. Away from these limits, a large number of non-SUSY vacua may still be found in an intermediate region.

preprint · 2011 · arXiv

Spin Selective Purcell Effect in a Quantum Dot Microcavity System

We demonstrate the selective coupling of a single quantum dot exciton spin state with the cavity mode in a quantum dot-micropillar cavity system. By tuning an external magnetic field, the Zeeman-split exciton spin states couple differently with the cavity due to field-manipulated energy detuning. We found a 26-fold increase in the emission intensity of the spin-up exciton state with respect to the spin-down exciton state at resonance due to the Purcell effect, which gives rise to the selective enhancement of light emission with a circular polarization degree of up to 93%. A four-level rate equation model is developed and agrees well quantitatively with our experimental data. Our results pave the way for the realization of future quantum light sources and quantum information processing applications.

preprint · 2011 · arXiv

Tree level spontaneous R-symmetry breaking in O'Raifeartaigh models

We show that in O'Raifeartaigh models of spontaneous supersymmetry breaking, R-symmetries can be broken by non-zero values of fields at tree level, rather than by vacuum expectation values of pseudomoduli at loop level. As a complement to the recent result by Shih, we show that there must be a field in the theory with R-charge different from zero and two in order for R-symmetry breaking to occur, no matter whether the breaking happens at tree or loop level. We review the example by CDFM, and construct two types of tree level R-symmetry breaking models with a wide range of parameters and free of the runaway problem. The R-symmetry is broken everywhere on the pseudomoduli space in these models. This provides a rich set of candidates for SUSY model building and phenomenology.

preprint · 2011 · arXiv

Vacuum statistics and parameter tuning for F-term supersymmetry breaking

We carry out a model-independent EFT method study on the vacuum statistics of general F-term SUSY breaking models. Assuming a smooth distribution of Lagrangian parameters, SUSY breaking vacua are rare in global SUSY models with a canonical Kähler potential, and have a peaked distribution near the cut-off of the SUSY breaking scale in both global SUSY and SUGRA models with a general Kähler potential. After including different mass scales in the Lagrangian, we compare the total number of SUSY and non-SUSY vacua and estimate quantitatively the rareness of SUSY breaking. The EFT method provides a general view to the amount of parameter tuning needed for a metastable SUSY breaking vacuum. The tuning also indicates the importance of R-symmetries in SUSY breaking even for metastable SUSY breaking.
