Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
20works
0followers
20topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

20 published item(s)

preprint2026arXiv

Beyond Uniform SVD:Dual-Level Optimization across Columns and Modules for LLM Compression

Low-rank decomposition, particularly Singular Value Decomposition (SVD), is a pivotal technique for mitigating the storage and computational demands of Large Language Models (LLMs). However, prevalent SVD-based approaches overlook the critical phenomenon that decomposition errors exhibit significant disparity across different components of the parameter matrix, often leading to suboptimal approximation. Furthermore, existing methods lack a direct metric to evaluate the importance of individual weight matrices. To address these limitations, we propose Duo-SVD (Dual-level Optimization SVD), a novel training-free framework that synergizes optimization at both the column and the module levels. First, Duo-SVD incorporates a Column-Preserving Strategy that explicitly retains columns exhibiting high decomposition errors, while applying low-rank approximation solely to those with lower errors. Second, at the module level, we employ a Module-Adaptive Allocation Strategy that formulates ratio allocation as a global constrained optimization problem based on perturbation-induced model deviation. Extensive experiments demonstrate that Duo-SVD consistently outperforms state-of-the-art SVD-based baselines and structured pruning methods, establishing it as a superior paradigm for efficient LLM compression.

preprint2026arXiv

Learning Perturbations to Extrapolate Your LLM

Recent advancements in large language models demonstrate that injecting perturbations can substantially enhance extrapolation performance. However, current approaches often rely on discrete perturbations with fixed designs, which limits their flexibility. In this work, we propose a framework where token prefixes are perturbed by a learnable transformation of a continuous latent vector within an embedding space. To overcome the challenge of an intractable marginal likelihood, we derive unbiased estimating equations for model parameters and optimize them via stochastic gradient descent. We establish the statistical properties of the resulting estimator in over-parameterized regimes. Empirical evaluations on both synthetic and real-world datasets demonstrate that our proposal yields significant gains in out-of-domain settings over a range of state-of-the-art baseline methods.

preprint2026arXiv

Optical Continuum Light Curves and Bolometric Energy Estimates of Solar White-light Flares

Solar white-light flares (WLFs) are solar flares exhibiting enhanced emission in the optical continuum. They are critical for understanding energy release and transport mechanisms in solar flares and for conducting comparative studies with stellar WLFs. However, the scarcity of accurately and reliably measured optical continuum light curves for solar WLFs significantly hampers related studies. Based on the optimized solar WLF identification method, we construct a dataset of optical continuum light curves for 70 solar WLFs using 6173 Å continuum intensity images from the Solar Dynamics Observatory. Moreover, for each solar WLF event, we also provide the location of the white-light emission enhancement signals and key parameters including bolometric energies and durations derived from both the traditional fixed-temperature blackbody model and the refined variable-temperature blackbody model. This dataset will serve as a valuable resource for future statistical investigations of solar WLFs and for comparative studies between solar and stellar flares.

preprint2026arXiv

Robust Sequential Experimental Design for A/B Testing

Experimental design has emerged as a powerful approach for improving the sample efficiency of A/B testing, yet existing designs rely critically on correctly specified models. We study robust sequential experimental design under model misspecification and develop a unified framework that covers both contextual bandit and dynamic settings. Theoretically, we prove that our design bounds the worst-case mean squared error of the estimated treatment effect. Empirically, we demonstrate the effectiveness of the proposed approach using synthetic and real-world datasets from a leading technology company.

preprint2022arXiv

A Deep Learning Approach to Predicting Ventilator Parameters for Mechanically Ventilated Septic Patients

We develop a deep learning approach to predicting a set of ventilator parameters for a mechanically ventilated septic patient using a long and short term memory (LSTM) recurrent neural network (RNN) model. We focus on short-term predictions of a set of ventilator parameters for the septic patient in emergency intensive care unit (EICU). The short-term predictability of the model provides attending physicians with early warnings to make timely adjustment to the treatment of the patient in the EICU. The patient specific deep learning model can be trained on any given critically ill patient, making it an intelligent aide for physicians to use in emergent medical situations.

preprint2022arXiv

A Necessary and Sufficient Condition for Complete phase synchronization of high-dimensional nonidentical Kuramoto oscillators

For original Kuramoto models with nonidentical oscillators, it is impossible to realize complete phase synchronization. However, this paper reveals that complete phase synchronization can be achieved for a large class of high-dimensional Kuramoto models with nonidentical oscillators. Under the topology of strongly connected digraphs, a necessary and sufficient condition for complete phase synchronization is obtained. Finally, some simulations are provided to validate the obtained theoretic results.

preprint2022arXiv

A New Magnetic Parameter of Active Regions Distinguishing Large Eruptive and Confined Solar Flares

With the aim of investigating how the magnetic field in solar active regions (ARs) controls flare activity, i.e., whether a confined or eruptive flare occurs, we analyze 106 flares of Geostationary Operational Environmental Satellite (GOES) class $\geq$M1.0 during 2010$-$2019. We calculate mean characteristic twist parameters $α$$_{FPIL}$ within the "flaring polarity inversion line" region and $α$$_\mathrm{HFED}$ within the area of high photospheric magnetic free energy density, which both provide measures of the nonpotentiality of AR core region. Magnetic twist is thought to be related to the driving force of electric current-driven instabilities, such as the helical kink instability. We also calculate total unsigned magnetic flux ($Φ$$_\mathrm{AR}$) of ARs producing the flare, which describes the strength of the background field confinement. By considering both the constraining effect of background magnetic fields and the magnetic non-potentiality of ARs, we propose a new parameter $α$/$Φ$$_\mathrm{AR}$ to measure the probability for a large flare to be associated with a coronal mass ejection (CME). We find that in about 90\% of eruptive flares, $α$$_\mathrm{FPIL}$/$Φ$$_\mathrm{AR}$ and $α$$_\mathrm{HFED}$/$Φ$$_\mathrm{AR}$ are beyond critical values (2.2$\times$$10^{-24}$ and 3.2$\times$$10^{-24}$ Mm$^{-1}$ Mx$^{-1}$), whereas they are less than critical values in $\sim$ 80\% of confined flares. This indicates that the new parameter $α$/$Φ$$_\mathrm{AR}$ is well able to distinguish eruptive flares from confined flares. Our investigation suggests that the relative measure of magnetic nonpotentiality within the AR core over the restriction of the background field largely controls the capability of ARs to produce eruptive flares.

preprint2022arXiv

Continuous-stage symplectic adapted exponential methods for charged-particle dynamics with arbitrary electromagnetic fields

This paper is devoted to the numerical symplectic approximation of the charged-particle dynamics (CPD) with arbitrary electromagnetic fields. By utilizing continuous-stage methods and exponential integrators, a general class of symplectic methods is formulated for CPD under a homogeneous magnetic field. Based on the derived symplectic conditions, two practical symplectic methods up to order four are constructed where the error estimates show that the proposed second order scheme has a uniform accuracy in the position w.r.t. the strength of the magnetic field. Moreover, the symplectic methods are extended to CPD under non-homogeneous magnetic fields and three algorithms are formulated. Rigorous error estimates are investigated for the proposed methods and one method is proved to have a uniform accuracy in the position w.r.t. the strength of the magnetic field. Numerical experiments are provided for CPD under homogeneous and non-homogeneous magnetic fields, and the numerical results support the theoretical analysis and demonstrate the remarkable numerical behavior of our methods.

preprint2022arXiv

Dynamic Property and Magnetic Nonpotentiality of Two Types of Confined Solar Flares

We analyze 152 large confined flares (GOES class $\geq$M1.0 and $\leq$$45^{\circ}$ from disk center) during 2010$-$2019, and classify them into two types according to the criterion taken from the work of Li et al. (2019). &#34;Type I&#34; flares are characterized by slipping motions of flare loops and ribbons and a stable filament underlying the flare loops. &#34;Type II&#34; flares are associated with the failed eruptions of the filaments, which can be explained by the classical 2D flare model. A total of 59 flares are &#34;Type I&#34; flares (about 40\%) and 93 events are &#34;Type II&#34; flares (about 60\%). There are significant differences in distributions of the total unsigned magnetic flux ($Φ$$_\mathrm{AR}$) of active regions (ARs) producing the two types of confined flares, with &#34;Type I&#34; confined flares from ARs with a larger $Φ$$_{AR}$ than &#34;Type II&#34;. We calculate the mean shear angle $Ψ$$_\mathrm{HFED}$ within the core of an AR prior to the flare onset, and find that it is slightly smaller for &#34;Type I&#34; flares than that for &#34;Type II&#34; events. The relative non-potentiality parameter $Ψ$$_\mathrm{HFED}$/$Φ$$_\mathrm{AR}$ has the best performance in distinguishing the two types of flares. About 73\% of &#34;Type I&#34; confined flares have $Ψ$$_\mathrm{HFED}$/$Φ$$_\mathrm{AR}$$<$1.0$\times$$10^{-21}$ degree Mx$^{-1}$, and about 66\% of &#34;Type II&#34; confined events have $Ψ$$_\mathrm{HFED}$/$Φ$$_\mathrm{AR}$$\geq$1.0$\times$$10^{-21}$ degree Mx$^{-1}$. We suggest that &#34;Type I&#34; confined flares cannot be explained by the standard flare model in 2D/3D, and the occurrence of multiple slipping magnetic reconnections within the complex magnetic systems probably leads to the observed flare.

preprint2022arXiv

Floquet engineering Hz-Level Rabi Spectra in Shallow Optical Lattice Clock

Quantum metrology with ultra-high precision usually requires atoms prepared in an ultra-stable environment with well-defined quantum states. Thus, in optical lattice clock systems deep lattice potentials are used to trap ultra-cold atoms. However, decoherence, induced by Raman scattering and higher order light shifts, can significantly be reduced if atomic clocks are realized in shallow optical lattices. On the other hand, in such lattices, tunneling among different sites can cause additional dephasing and strongly broadening of the Rabi spectrum. Here, in our experiment, we periodically drive a shallow $^{87}$Sr optical lattice clock. Counter intuitively, shaking the system can deform the wide broad spectral line into a sharp peak with 5.4Hz line-width. With careful comparison between the theory and experiment, we demonstrate that the Rabi frequency and the Bloch bands can be tuned, simultaneously and independently. Our work not only provides a different idea for quantum metrology, such as building shallow optical lattice clock in outer space, but also paves the way for quantum simulation of new phases of matter by engineering exotic spin orbit couplings.

preprint2022arXiv

Snowmass2021 Cosmic Frontier White Paper: Prospects for obtaining Dark Matter Constraints with DESI

Despite efforts over several decades, direct-detection experiments have not yet led to the discovery of the dark matter (DM) particle. This has led to increasing interest in alternatives to the Lambda CDM (LCDM) paradigm and alternative DM scenarios (including fuzzy DM, warm DM, self-interacting DM, etc.). In many of these scenarios, DM particles cannot be detected directly and constraints on their properties can ONLY be arrived at using astrophysical observations. The Dark Energy Spectroscopic Instrument (DESI) is currently one of the most powerful instruments for wide-field surveys. The synergy of DESI with ESA&#39;s Gaia satellite and future observing facilities will yield datasets of unprecedented size and coverage that will enable constraints on DM over a wide range of physical and mass scales and across redshifts. DESI will obtain spectra of the Lyman-alpha forest out to z~5 by detecting about 1 million QSO spectra that will put constraints on clustering of the low-density intergalactic gas and DM halos at high redshift. DESI will obtain radial velocities of 10 million stars in the Milky Way (MW) and Local Group satellites enabling us to constrain their global DM distributions, as well as the DM distribution on smaller scales. The paradigm of cosmological structure formation has been extensively tested with simulations. However, the majority of simulations to date have focused on collisionless CDM. Simulations with alternatives to CDM have recently been gaining ground but are still in their infancy. While there are numerous publicly available large-box and zoom-in simulations in the LCDM framework, there are no comparable publicly available WDM, SIDM, FDM simulations. DOE support for a public simulation suite will enable a more cohesive community effort to compare observations from DESI (and other surveys) with numerical predictions and will greatly impact DM science.

preprint2022arXiv

Various Activities above Sunspot Light Bridges in IRIS Observations: Classification and Comparison

Light bridges (LBs) are among the most striking sub-structures in sunspots, where various activities have been revealed by recent high-resolution observations from the Interface Region Imaging Spectrograph (IRIS). According to the variety of physical properties, we classified these activities into four distinct categories: transient brightening (TB), intermittent jet (IJ), type-I light wall (LW-I), and type-II light wall (LW-II). In IRIS 1400/1330 Å observations, TBs are characterized by abrupt emission enhancements, and IJs appear as collimated plasma ejections with a width of 1-2 Mm at some LB sites. Most observed TBs are associated with IJs and show superpositions of some chromosphere absorption lines on enhanced and broadened wings of C II and Si IV lines, which could be driven by intermittent magnetic reconnection in the lower atmosphere. LW-I and LW-II are wall-shaped structures with bright fronts above the whole LB. An LW-I has a continuous oscillating front with a typical height of several Mm and an almost stationary period of 4-5 minutes. On the contrary, an LW-II has a indented front with a height of over 10 Mm, which has no stable period and is accompanied by recurrent TBs in the entire LB. These results support that LW-IIs are driven by frequent reconnection occurring along the whole LB due to large-scale magnetic flux emergence or intrusion, rather than the leakage of waves producing LW-Is. Our observations reveal a highly dynamical scenario of activities above LBs driven by different basic physical processes, including magneto-convection, magnetic reconnection, and wave leakage.

preprint2021arXiv

Deep Reinforcement Learning-based Task Offloading in Satellite-Terrestrial Edge Computing Networks

In remote regions (e.g., mountain and desert), cellular networks are usually sparsely deployed or unavailable. With the appearance of new applications (e.g., industrial automation and environment monitoring) in remote regions, resource-constrained terminals become unable to meet the latency requirements. Meanwhile, offloading tasks to urban terrestrial cloud (TC) via satellite link will lead to high delay. To tackle above issues, Satellite Edge Computing architecture is proposed, i.e., users can offload computing tasks to visible satellites for executing. However, existing works are usually limited to offload tasks in pure satellite networks, and make offloading decisions based on the predefined models of users. Besides, the runtime consumption of existing algorithms is rather high. In this paper, we study the task offloading problem in satellite-terrestrial edge computing networks, where tasks can be executed by satellite or urban TC. The proposed Deep Reinforcement learning-based Task Offloading (DRTO) algorithm can accelerate learning process by adjusting the number of candidate locations. In addition, offloading location and bandwidth allocation only depend on the current channel states. Simulation results show that DRTO achieves near-optimal offloading cost performance with much less runtime consumption, which is more suitable for satellite-terrestrial network with fast fading channel.

preprint2021arXiv

Performance of Kitt Peak&#39;s Mayall 4-meter Telescope During DESI Commissioning

In preparation for the Dark Energy Spectroscopic Instrument (DESI), a new top end was installed on the Mayall 4-meter telescope at Kitt Peak National Observatory. The refurbished telescope and the DESI instrument were successfully commissioned on sky between 2019 October and 2020 March. Here we describe the pointing, tracking and imaging performance of the Mayall telescope equipped with its new DESI prime focus corrector, as measured by six guider cameras sampling the outer edge of DESI&#39;s focal plane. Analyzing ~500,000 guider images acquired during commissioning, we find a median delivered image FWHM of 1.1 arcseconds (in the r-band at 650 nm), with the distribution extending to a best-case value of ~0.6 arcseconds. The point spread function is well characterized by a Moffat profile with a power-law index of $β$ ~ 3.5 and little dependence of $β$ on FWHM. The shape and size of the PSF delivered by the new corrector at a field angle of 1.57 degrees are very similar to those measured with the old Mayall corrector on axis. We also find that the Mayall achieves excellent pointing accuracy (several arcseconds RMS) and minimal open-loop tracking drift (< 1 milliarcsecond per second), improvements on the telecope&#39;s pre-DESI performance. In the future, employing DESI&#39;s active focus adjustment capabilities will likely further improve the Mayall/DESI delivered image quality.

preprint2020arXiv

Community Detection on Mixture Multi-layer Networks via Regularized Tensor Decomposition

We study the problem of community detection in multi-layer networks, where pairs of nodes can be related in multiple modalities. We introduce a general framework, i.e., mixture multi-layer stochastic block model (MMSBM), which includes many earlier models as special cases. We propose a tensor-based algorithm (TWIST) to reveal both global/local memberships of nodes, and memberships of layers. We show that the TWIST procedure can accurately detect the communities with small misclassification error as the number of nodes and/or the number of layers increases. Numerical studies confirm our theoretical findings. To our best knowledge, this is the first systematic study on the mixture multi-layer networks using tensor decomposition. The method is applied to two real datasets: worldwide trading networks and malaria parasite genes networks, yielding new and interesting findings.

preprint2020arXiv

Confirmed width-Eiso and width-Liso relations in GRB: comparison with the Amati and Yonetoku relations

In this paper, we select a sample including 141 BEST time-integrated F spectra and 145 BEST peak flux P spectra observed by the Konus-Wind with known redshift to recheck the connection between the spectral width and $E_{iso}$ as well as $L_{iso}$. We define six types of absolute spectral widths. It is found that all of the rest-frame absolute spectral widths are strongly positive correlated with $E_{iso}$ as well as $L_{iso}$ for the long burst for both the F and P spectra. All of the short bursts are the outliers for width-$E_{iso}$ relation and most of the short bursts are consistent with the long bursts for the width-$L_{iso}$ relation for both F and P spectra. Moreover, all of the location energy, $E_{2}$ and $E_{1}$, corresponding to various spectral widths are also positive correlated with $E_{iso}$ as well as $L_{iso}$. We compare all of the relations with the Amati and Yonetoku relations and find the width-$E_{iso}$ and width-$L_{iso}$ relations when the widths are at about 90\% maximum of the $EF_{E}$ spectra almost overlap with Amati relation and Yonetoku relation, respectively. The correlations of $E_{2}-E_{iso}$, $E_{1}-E_{iso}$ and $E_{2}-L_{iso}$, $E_{1}-L_{iso}$ when the location energies are at 99\% maximum of the $EF_{E}$ spectra are very close to the Amati and Yonetoku relations, respectively. Therefore, we confirm the existence of tight width-$E_{iso}$ and width-$L_{iso}$ relations for long bursts. We further show that the spectral shape is indeed related to $E_{iso}$ and $L_{iso}$. The Amati and Yonetoku relations are not necessarily the best relationships to relate the energy to the $E_{iso}$ and $L_{iso}$. They may be the special cases of the width-$E_{iso}$ and width-$L_{iso}$ relations or the energy-$E_{iso}$ and energy-$L_{iso}$ relations.

preprint2020arXiv

Dark Energy Survey Year 1 Results: Cosmological Constraints from Cluster Abundances and Weak Lensing

We perform a joint analysis of the counts and weak lensing signal of redMaPPer clusters selected from the Dark Energy Survey (DES) Year 1 dataset. Our analysis uses the same shear and source photometric redshifts estimates as were used in the DES combined probes analysis. Our analysis results in surprisingly low values for $S_8 =σ_8(Ω_{\rm m}/0.3)^{0.5}= 0.65\pm 0.04$, driven by a low matter density parameter, $Ω_{\rm m}=0.179^{+0.031}_{-0.038}$, with $σ_8-Ω_{\rm m}$ posteriors in $2.4σ$ tension with the DES Y1 3x2pt results, and in $5.6σ$ with the Planck CMB analysis. These results include the impact of post-unblinding changes to the analysis, which did not improve the level of consistency with other data sets compared to the results obtained at the unblinding. The fact that multiple cosmological probes (supernovae, baryon acoustic oscillations, cosmic shear, galaxy clustering and CMB anisotropies), and other galaxy cluster analyses all favor significantly higher matter densities suggests the presence of systematic errors in the data or an incomplete modeling of the relevant physics. Cross checks with X-ray and microwave data, as well as independent constraints on the observable--mass relation from SZ selected clusters, suggest that the discrepancy resides in our modeling of the weak lensing signal rather than the cluster abundance. Repeating our analysis using a higher richness threshold ($λ\ge 30$) significantly reduces the tension with other probes, and points to one or more richness-dependent effects not captured by our model.

preprint2020arXiv

Explicit symplectic adapted exponential integrators for charged-particle dynamics in a strong and constant magnetic field

This paper studies explicit symplectic adapted exponential integrators for solving charged-particle dynamics in a strong and constant magnetic field. We first formulate the scheme of adapted exponential integrators and then derive its symplecticity conditions. Based on the symplecticity conditions, we propose five practical explicit symplectic adapted exponential integrators. Two numerical experiments are carried out and the numerical results demonstrate the remarkable numerical behavior of the new methods.

preprint2020arXiv

Magnetic Flux of Active Regions Determining the Eruptive Character of Large Solar Flares

We establish the largest eruptive/confined flare database to date and analyze 322 flares of \emph{GOES} class M1.0 and larger that occurred during 2010$-$2019, i.e., almost spanning the entire solar cycle 24. We find that the total unsigned magnetic flux ($Φ$$_{AR}$) of active regions (ARs) is a key parameter in governing the eruptive character of large flares, with the proportion of eruptive flares exhibiting a strong anti-correlation with $Φ$$_{AR}$. This means that an AR containing a large magnetic flux has a lower probability for the large flares it produces to be associated with a coronal mass ejection (CME). This finding is supported by the high positive correlation we obtained between the critical decay index height and $Φ$$_{AR}$, implying that ARs with a larger $Φ$$_{AR}$ have a stronger magnetic confinement. Moreover, the confined flares originating from ARs larger than 1.0$\times$$10^{23}$ Mx have several characteristics in common: stable filament, slipping magnetic reconnection and strongly sheared post-flare loops. Our findings reveal new relations between the magnetic flux of ARs and the occurrence of CMEs in association with large flares. These relations obtained here provide quantitative criteria for forecasting CMEs and adverse space weather, and have also important implications for &#34;superflares&#34; on solar-type stars and stellar CMEs. The link of database is https://doi.org/10.12149/101030.

preprint2020arXiv

Magnetic Gradient: A Natural Driver of Solar Eruptions

It is well-known that there is a gradient, there will drive a flow inevitably. For example, a density-gradient may drive a diffusion flow, an electrical potential-gradient may drive an electric current in plasmas, etc. Then, what will be driven when a magnetic-gradient occurs in solar atmospheric plasmas? Considering the ubiquitous distribution of magnetic-gradient in solar plasma loops, this work demonstrates that magnetic-gradient pumping (MGP) mechanism is valid even in the partial ionized solar photosphere, chromosphere as well as in the corona. It drives energetic particle flows which carry and convey kinetic energy from the underlying atmosphere to move upwards, accumulate around the looptop and increase there temperature and pressure, and finally lead to eruptions around the looptop by triggering ballooning instabilities. This mechanism may explain the formation of the observing hot cusp-structures above flaring loops in most preflare phases, therefore, the magnetic-gradient should be a natural driver of solar eruptions. Furthermore, we may also apply to understand many other astrophysical phenomena, such as the temperature distribution above sunspots, the formation of solar plasma jets, type-II spicule, and fast solar wind above coronal holes, as well as the fast plasma jets related to white dwarfs, neutron stars and black holes.