Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
38works
0followers
19topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

38 published item(s)

preprint2026arXiv

HLS-Seek: QoR-Aware Code Generation for High-Level Synthesis via Proxy Comparative Reward Reinforcement Learning

High-Level Synthesis (HLS) compiles algorithmic C/C++ descriptions into hardware, with Quality of Results (QoR) -- latency and resource utilization -- critically governed by pragma configurations and code structure. Existing LLM-based HLS approaches train for functional correctness but ignore QoR entirely. We observe that reinforcement learning (RL) for HLS does not require absolute synthesis results -- only relative comparisons between candidates. Based on this insight, we propose \textbf{HLS-Seek}, a QoR-aware NL-to-HLS framework that replaces expensive synthesis-in-the-loop RL with a comparative proxy reward model achieving 99.53\% Pareto-dominance accuracy. To prevent reward hacking, we introduce \textit{uncertainty-aware Monte Carlo (MC) dropout switching} that selectively invokes real Vitis HLS synthesis for low-confidence candidates and online updates the proxy, creating a self-improving reward system. HLS-Seek achieves 81.5\% syntax correctness pass@1 and 81.4\% Func@5 on HLS-eval with only 7B parameters, surpassing GPT-5.1 and other frontier models while achieving 8.5$\times$ faster training than real-reward RL. On QoR evaluation, HLS-Seek achieves the lowest latency on 16/30 kernels and Pareto-dominates HLS-specific baselines on 9 kernels.

preprint2026arXiv

RidgeWalker: Perfectly Pipelined Graph Random Walks on FPGAs

Graph Random Walks (GRWs) offer efficient approximations of key graph properties and have been widely adopted in many applications. However, GRW workloads are notoriously difficult to accelerate due to their strong data dependencies, irregular memory access patterns, and imbalanced execution behavior. While recent work explores FPGA-based accelerators for GRWs, existing solutions fall far short of hardware potential due to inefficient pipelining and static scheduling. This paper presents RidgeWalker, a high-performance GRW accelerator designed for datacenter FPGAs. The key insight behind RidgeWalker is that the Markov property of GRWs allows decomposition into stateless, fine-grained tasks that can be executed out-of-order without compromising correctness. Building on this, RidgeWalker introduces an asynchronous pipeline architecture with a feedback-driven scheduler grounded in queuing theory, enabling perfect pipelining and adaptive load balancing. We prototype RidgeWalker on datacenter FPGAs and evaluated it across a range of GRW algorithms and real-world graph datasets. Experimental results demonstrate that RidgeWalker achieves an average speedup of 7.0x over state-of-the-art FPGA solutions and 8.1x over GPU solutions, with peak speedups of up to 71.0x and 22.9x, respectively. The source code is publicly available at https://github.com/Xtra-Computing/RidgeWalker.

preprint2022arXiv

A model of double coronal hard X-ray sources in solar flares

A number of double coronal X-ray sources have been observed during solar flares by RHESSI, where the two sources reside at different sides of the inferred reconnection site. However, where and how are these X-ray-emitting electrons accelerated remains unclear. Here we present the first model of the double coronal hard X-ray (HXR) sources, where electrons are accelerated by a pair of termination shocks driven by bi-directional fast reconnection outflows. We model the acceleration and transport of electrons in the flare region by numerically solving the Parker transport equation using velocity and magnetic fields from the macroscopic magnetohydrodynamic simulation of a flux rope eruption. We show that electrons can be efficiently accelerated by the termination shocks and high-energy electrons mainly concentrate around the two shocks. The synthetic HXR emission images display two distinct sources extending to $>$100 keV below and above the reconnection region, with the upper source much fainter than the lower one. The HXR energy spectra of the two coronal sources show similar spectral slopes, consistent with the observations. Our simulation results suggest that the flare termination shock can be a promising particle acceleration mechanism in explaining the double-source nonthermal emissions in solar flares.

preprint2022arXiv

Birth places of extreme ultraviolet waves driven by impingement of solar jets upon coronal loops

Solar extreme ultraviolet (EUV) waves are large-scale propagating disturbances in the corona. It is generally believed that the vital key for the formation of EUV waves is the rapid expansion of the loops that overlie erupting cores in solar eruptions, such as coronal mass ejections (CMEs) and solar jets. However, the details of the interaction between the erupting cores and overlying loops are not clear, because that the overlying loops are always instantly opened after the energetic eruptions. Here, we present three typical jet-driven EUV waves without CME to study the interaction between the jets and the overlying loops that remained closed during the events. All three jets emanated from magnetic flux cancelation sites in source regions. Interestingly, after the interactions between jets and overlying loops, three EUV waves respectively formed ahead of the top, the near end (close to the jet source), and the far (another) end of the overlying loops. According to the magnetic field distribution of the loops extrapolated from Potential Field Source Surface method, it is confirmed that the birth places of three jet-driven EUV waves were around the weakest magnetic field strength part of the overlying loops. We suggest that the jet-driven EUV waves preferentially occur at the weakest part of the overlying loops, and the location can be subject to the magnetic field intensity around the ends of the loops.

preprint2022arXiv

Compilation and Optimizations for Efficient Machine Learning on Embedded Systems

Deep Neural Networks (DNNs) have achieved great success in a variety of machine learning (ML) applications, delivering high-quality inferencing solutions in computer vision, natural language processing, and virtual reality, etc. However, DNN-based ML applications also bring much increased computational and storage requirements, which are particularly challenging for embedded systems with limited compute/storage resources, tight power budgets, and small form factors. Challenges also come from the diverse application-specific requirements, including real-time responses, high-throughput performance, and reliable inference accuracy. To address these challenges, we introduce a series of effective design methodologies, including efficient ML model designs, customized hardware accelerator designs, and hardware/software co-design strategies to enable efficient ML applications on embedded systems.

preprint2022arXiv

Double-power-law feature of energetic particles accelerated at coronal shocks

Recent observations have shown that in many large solar energetic particle (SEP) events the event-integrated differential spectra resemble double power laws. We perform numerical modeling of particle acceleration at coronal shocks propagating through a streamer-like magnetic field by solving the Parker transport equation, including protons and heavier ions. We find that for all ion species the energy spectra integrated over the simulation domain can be described by a double power law, and the break energy depends on the ion charge-to-mass ratio as $E_B \sim (Q/A)^α$, with $α$ varying from 0.16 to 1.2 by considering different turbulence spectral indices. We suggest that the double power law distribution may emerge as a result of the superposition of energetic particles from different source regions where the acceleration rates differ significantly due to particle diffusion. The diffusion and mixing of energetic particles could also provide an explanation for the increase of Fe/O at high energies as observed in some SEP events. Although further mixing processes may occur, our simulations indicate that either power-law break or rollover can occur near the Sun and predict that the spectral forms vary significantly along the shock front, which may be examined by upcoming near-Sun SEP measurements from Parker Solar Probe and Solar Orbiter.

preprint2022arXiv

Formation and Immediate Deformation of a Small Filament Through Intermittent Magnetic Interactions

It is generally believed that filament formation involves a process of the accumulation of magnetic energy. However, in this paper we discuss the idea that filaments will not erupt and will only deform when the stored magnetic energy is released gradually. Combining high-quality observations from Solar Dynamics Observatory and other instruments, we present the formation and immediate deformation of a small filament (F1) in the active region (AR) 12760 on 28-30 April 2020. Before the filament formation, three successive dipoles quickly emerged with separation motions in the center of AR 12760. Due to the magnetic interaction between magnetic dipoles and pre-existing positive polarities, coronal brightenings consequently appeared in the overlying atmosphere. Subsequently, because of the continuous cancellation of magnetic flux that happened around the adjacent ends of F1 and another nearby filament (F2), the magnetic reconections occurred intermittently occurred between F1 and F2. Finally, F1 lessened in the shear, and F2 became shorter. All the results show that the formation of F1 was closely associated with intermittent interactions between the sequence of emerging dipoles and pre-existing magnetic polarities, and the immediate deformation of F1 was intimately related to intermittent interactions between F1 and F2. We also suggest that the intermittent magnetic interactions driven by the continuous magnetic activities (magnetic-flux emergence, cancellation, and convergence) play an important role in the formation and deformation of filaments.

preprint2022arXiv

Harmonic Electron Cyclotron Maser Emission along the Coronal Loop

Efficient radiation at second and/or higher harmonics of Wce has been suggested to circumvent the escaping difficulty of the electron cyclotron maser emission mechanism when it is applied to solar radio bursts, such as spikes. In our earlier study, we developed a three-step numerical scheme to connect the dynamics of energetic electrons within a large-scale coronal loop structure with the microscale kinetic instability energized by the obtained nonthermal velocity distribution and found that direct and efficient harmonic X-mode (X2 for short) emission can be achieved due to the strip-like features of the distribution. That study only considered the radiation from the loop top at a specific time. Here we present the emission properties along the loop at different locations and timings. We found that, in accordance with our earlier results, few to several strip-like features can appear in all cases, and the first two strips play the major role in exciting X2 and Z (i.e., the slow extraordinary mode) that propagate quasi-perpendicularly. For the four sections along the loop, significant excitation of X2 is observed from the upper two sections, and the strongest emission is from the top section. In addition, significant excitation of Z is observed for all loop sections, while there is no significant emission of the fundamental X mode. The study provides new insight into coherent maser emission along the coronal loop structure during solar flares.

preprint2022arXiv

Machine learning-enabled high-entropy alloy discovery

High-entropy alloys are solid solutions of multiple principal elements, capable of reaching composition and feature regimes inaccessible for dilute materials. Discovering those with valuable properties, however, relies on serendipity, as thermodynamic alloy design rules alone often fail in high-dimensional composition spaces. Here, we propose an active-learning strategy to accelerate the design of novel high-entropy Invar alloys in a practically infinite compositional space, based on very sparse data. Our approach works as a closed-loop, integrating machine learning with density-functional theory, thermodynamic calculations, and experiments. After processing and characterizing 17 new alloys (out of millions of possible compositions), we identified 2 high-entropy Invar alloys with extremely low thermal expansion coefficients around 2*10-6 K-1 at 300 K. Our study thus opens a new pathway for the fast and automated discovery of high-entropy alloys with optimal thermal, magnetic and electrical properties.

preprint2022arXiv

Nuclear spin self compensation system for moving MEG sensing with optical pumped atomic spin co-magnetometer

Recording the moving MEGs of a person in which a person's head could move freely as we record the brain's magnetic field is a hot topic in recent years. Traditionally, atomic magnetometers are utilized for moving MEGs recording and a large compensation coil system is utilized for background magnetic field compensation. Here we described a new potential candidate: an optically pumped atomic co-magnetometer(OPACM) for moving MEGs recording. In the OPACM, hyper-polarized nuclear spins could produce a magnetic field which will shield the background fluctuation low frequency magnetic field noise while the the fast changing MEGs signal could be recorded. The nuclear spins look like an automatic magnetic field shields and dynamically compensate the fluctuated background magnetic field noise. In this article, the magnetic field compensation is studied theoretically and we find that the compensation is closely related to several parameters such as the electron spin magnetic field, the nuclear spin magnetic field and the holding magnetic field. Based on the model, the magnetic field compensation could be optimized. We also experimentally studied the magnetic field compensation and the responses of the OPACM to different frequencies of magnetic field are measured. We show that the OPACM owns a clear suppression of low frequency magnetic field under 1Hz and response to magnetic field's frequencies around the band of the MEGs. Magnetic field sensitivity of $3fT/Hz^{1/2}$ has been achieved. Finally, we do a simulation for the OPACM as it is utilized for moving MEGs recording. For comparison, the traditional compensation system for moving MEGs recording is based on a coil which is around 2m in dimension while our compensation system is only 2mm in dimension. Moreover, our compensation system could work in situ and will not affect each other.

preprint2022arXiv

On the Nature of the Three-part Structure of Solar Coronal Mass Ejections

Coronal mass ejections (CMEs) result from eruptions of magnetic flux ropes (MFRs) and can possess a three-part structure in white-light coronagraphs, including a bright front, dark cavity and bright core. In the traditional opinion, the bright front forms due to the plasma pileup along the MFR border, the cavity represents the cross section of the MFR, and the bright core corresponds to the erupted prominence. However, this explanation on the nature of the three-part structure is being challenged. In this paper, we report an intriguing event occurred on 2014 June 14 that was recorded by multiple space- and ground-based instruments seamlessly, clearly showing that the CME front originates from the plasma pileup along the magnetic arcades overlying the MFR, and the core corresponds to a hot-channel MFR. Thus the dark cavity is not an MFR, instead it is a low-density zone between the CME front and a trailing MFR. These observations are consistent with a new explanation on the CME structure. If the new explanation is correct, most (if not all) CMEs should exhibit the three-part appearance in their early eruption stage. To examine this prediction, we make a survey study of all CMEs in 2011 and find that all limb events have the three-part feature in the low corona, regardless of their appearances in the high corona. Our studies suggest that the three-part structure is the intrinsic structure of CMEs, which has fundamental importance for understanding CMEs.

preprint2022arXiv

Plasma Emission Induced By Electron Beam in Weakly Magnetized Plasmas

Previous studies on the beam-driven plasma emission process were done mainly for unmagnetized plasmas. Here we present fully-kinetic electromagnetic particle-in-cell simulations to investigate such process in weakly-magnetized plasmas of the solar corona conditions. The primary mode excited is the beam-Langmuir (BL) mode via the classical bump-on-tail instability. Other modes include the whistler (W) mode excited by the electron cyclotron resonance instability, the generalized Langmuir (GL) waves that include a superluminal Z-mode component with smaller wave number $k$ and a thermal Langmuir component with larger $k$, and the fundamental (F) and harmonic (H) branches of plasma emission. Further simulations of different mass and temperature ratios of electrons and protons indicate that the GL mode and the two escaping modes (F and H) correlate positively with the BL mode in intensity, supporting that they are excited through nonlinear wave-wave coupling processes involving the BL mode. We suggest that the dominant process is the decay of the primary BL mode. This is consistent with the standard theory of plasma emission. Yet, the other possibility of the Z+W$\rightarrow$O--F coalescing process for the F emission cannot be ruled out completely.

preprint2022arXiv

Random diffusivity processes in an external force field

Brownian yet non-Gaussian processes have recently been observed in numerous biological systems and the corresponding theories have been built based on random diffusivity models. Considering the particularity of random diffusivity, this paper studies the effect of an external force acting on two kinds of random diffusivity models whose difference is embodied in whether the fluctuation-dissipation theorem is valid. Based on the two random diffusivity models, we derive the Fokker-Planck equations with an arbitrary external force, and analyse various observables in the case with a constant force, including the Einstein relation, the moments, the kurtosis, and the asymptotic behaviors of the probability density function of particle's displacement at different time scales. Both the theoretical results and numerical simulations of these observables show significant difference between the two kinds of random diffusivity models, which implies the important role of the fluctuation-dissipation theorem in random diffusivity systems.

preprint2022arXiv

ReGraph: Scaling Graph Processing on HBM-enabled FPGAs with Heterogeneous Pipelines

The use of FPGAs for efficient graph processing has attracted significant interest. Recent memory subsystem upgrades including the introduction of HBM in FPGAs promise to further alleviate memory bottlenecks. However, modern multi-channel HBM requires much more processing pipelines to fully utilize its bandwidth potential. Existing designs do not scale well, resulting in underutilization of the HBM facilities even when all other resources are fully consumed. In this paper, we re-examined the graph processing workloads and found much diversity in processing. We also found that the diverse workloads can be easily classified into two types, namely dense and sparse partitions. This motivates us to propose a resource-efficient heterogeneous pipeline architecture. Our heterogeneous architecture comprises of two types of pipelines: Little pipelines to process dense partitions with good locality and Big pipelines to process sparse partitions with the extremely poor locality. Unlike traditional monolithic pipeline designs, the heterogeneous pipelines are tailored for more specific memory access patterns, and hence are more lightweight, allowing the architecture to scale up to more effectively with limited resources. In addition, we propose a model-guided task scheduling method that schedules partitions to the right pipeline types, generates the most efficient pipeline combination and balances workloads. Furthermore, we develop an automated open-source framework, called ReGraph, which automates the entire development process. ReGraph outperforms state-of-the-art FPGA accelerators by up to 5.9 times in terms of performance and 12times in terms of resource efficiency.

preprint2022arXiv

The deformation of an erupting magnetic flux rope in a confined solar flare

Magnetic flux ropes (MFRs), sets of coherently twisted magnetic field lines, are believed as core structures of various solar eruptions. Their evolution plays an important role to understand the physical mechanisms of solar eruptions, and can shed light on adverse space weather near the Earth. However, the erupting MFRs are occasionally prevented by strong overlying magnetic fields, and the MFR evolution during the descending phase in the confined cases is lack of attention. Here, we present the deformation of an erupting MFR accompanied by a confined double-peaked solar flare. The first peak corresponded to the MFR eruption in a standard flare model, and the second peak was closely associated with the flashings of an underlying sheared arcade (SA), the reversal slipping motion of the L-shaped flare ribbon, the falling of the MFR, and the shifting of top of filament threads. All results suggest that the confined MFR eruption involved in two-step magnetic reconnection presenting two distinct episodes of energy release in the flare impulsive phase, and the latter magnetic reconnection between the confined MFR and the underlying SA caused the deformation of MFR.

preprint2022arXiv

Toward a Unified Explanation for the Three-part Structure of Solar Coronal Mass Ejections

Coronal mass ejections (CMEs) are associated with the eruption of magnetic flux ropes (MFRs), which usually appear as hot channels in active regions and coronal cavities in quiet-Sun regions. CMEs often exhibit the classical three-part structure in the lower corona when imaged with white-light coronagraphs, including the bright front, dark cavity, and bright core. The bright core and dark cavity have been regarded as the erupted prominence and MFR, respectively, for several decades. However, recent studies clearly demonstrated that both the prominence and hot-channel MFR can be observed as the CME core. The current research presents a three-part CME resulted from the eruption of a coronal prominence cavity on 2010 October 7 with observations from two vantage perspectives, i.e., edge-on from the Earth and face-on from the Solar Terrestrial Relations Observatory (STEREO). Our observations illustrates two important results: (1) For the first time, the erupting coronal cavity is recorded as a channel-like structure in the extreme-ultraviolet passband, analogous to the hot-channel morphology, and is dubbed as warm channel; (2) Both the prominence and warm-channel MFR (coronal cavity) in the extreme-ultraviolet passbands evolve into the CME core in the white-light coronagraphs of STEREO-A. The results support that we are walking toward a unified explanation for the three-part structure of CMEs, in which both prominences and MFRs (hot or warm channels) are responsible for the bright core.

preprint2022arXiv

Twin extreme ultraviolet waves in the solar corona

Solar extreme ultraviolet (EUV) waves are spectacular propagating disturbances with EUV enhancements in annular shapes in the solar corona. These EUV waves carry critical information about the coronal magnetised plasma that can shed light on the elusive physical parameters (e.g. the magnetic field strength) by global solar coronal magneto-seismology. EUV waves are closely associated with a wide range of solar atmospheric eruptions, from violent flares and coronal mass ejections (CMEs) to less energetic plasma jets or mini-filament eruptions. However, the physical nature and driving mechanism of EUV waves is still controversial. Here, we report the unique discovery of twin EUV waves (TEWs) that were formed in a single eruption with observations from two different perspectives. In all earlier studies, a single eruption was associated at most with a single EUV wave. The newly found TEWs urge to re-visit our theoretical understanding about the underlying formation mechanism(s) of coronal EUV waves. Two distinct scenarios of TEWs were found. In the first scenario, the two waves were separately associated with a filament eruption and a precursor jet, while in another scenario the two waves were successively associated with a filament eruption. Hence, we label these distinguished scenarios as "fraternal TEWs" and "identical TEWs", respectively. Further, we also suggest that impulsive lateral expansions of two distinct groups of coronal loops are critical to the formation of TEWs in a single eruption.

preprint2022arXiv

VertXNet: Automatic Segmentation and Identification of Lumbar and Cervical Vertebrae from Spinal X-ray Images

Manual annotation of vertebrae on spinal X-ray imaging is costly and time-consuming due to bone shape complexity and image quality variations. In this study, we address this challenge by proposing an ensemble method called VertXNet, to automatically segment and label vertebrae in X-ray spinal images. VertXNet combines two state-of-the-art segmentation models, namely U-Net and Mask R-CNN to improve vertebrae segmentation. A main feature of VertXNet is to also infer vertebrae labels thanks to its Mask R-CNN component (trained to detect 'reference' vertebrae) on a given spinal X-ray image. VertXNet was evaluated on an in-house dataset of lateral cervical and lumbar X-ray imaging for ankylosing spondylitis (AS) patients. Our results show that VertXNet can accurately label spinal X-rays (mean Dice of 0.9). It can be used to circumvent the lack of annotated vertebrae without requiring human expert review. This step is crucial to investigate clinical associations by solving the lack of segmentation, a common bottleneck for most computational imaging projects.

preprint2021arXiv

A single beam Cs-Ne SERF magnetometer with differential laser power noise suppression method

We describe a single beam compact Spin Exchange Relaxation Free(SERF) magnetometer whose configuration is compatible with the silicon-glass bonding micro-machining method. A cylindrical vapor cell with 3mm diameter and 3mm in length is utilized in the magnetometer. In order to reduce the wall relaxation which could not be neglected in micro-machined SERF magnetometer, 3 Amagats(1Amagat=2.69$\times$ 10$^{19}$/cm$^3$) neon buffer gas is filled in the vapor cell and this is the first demonstration of a Cs-Ne SERF magnetometer. We also did a simulation to show that neon is a better buffer gas than nitrogen and helium which is typical utilized in vapor cells. In order to reduce the laser amplitude noise and the large background detection offset which is reported to be the main noise source of a single beam absorption SERF magnetometer, we developed a laser power differential method and a factor of 2 improvement of the power noise suppression has been demonstrated. Finally, we did an optimization of the magnetometer and sensitivity of 40$fT/Hz^{1/2}$@30Hz has been achieved.

preprint2021arXiv

Comparison of Helium Abundance between ICMEs and Solar Wind near 1 AU

The Helium abundance, defined as $A_{He}=n_{He}/n_{H}\times 100$, is $\sim$8.5 in the photosphere and seldom exceeds 5 in fast solar wind. Previous statistics have demonstrated that $A_{He}$ in slow solar wind correlates tightly with sunspot number. However, less attention is paid to the solar cycle dependence of $A_{He}$ within interplanetary coronal mass ejections (ICMEs) and comparing the $A_{He}$ characteristics of ICMEs and solar wind. In this paper we conduct a statistical comparison of Helium abundance between ICMEs and solar wind near 1 AU with observations of \textit{Advanced Composition Explorer} from 1998 to 2019, and find that the ICME $A_{He}$ also exhibits the obvious solar cycle dependence. Meanwhile, we find that the $A_{He}$ is obviously higher within ICMEs compared to solar wind, and the means within 37\% and 12\% of ICMEs exceed 5 and 8.5, respectively. It is interesting to answer where and how the high Helium abundance originates. Our statistics demonstrate that 21\% (3\%) of ICME (slow wind) $A_{He}$ data points exceed 8.5 around solar maximum, which decreases dramatically near minimum, while no such high $A_{He}$ values appear in the fast wind throughout the whole solar cycle. This indicates that the high $A_{He}$ (e.g., $>$8.5) emanates from active regions as more ICMEs and slow wind originates from active regions around maximum, and supports that both active regions and quiet-Sun regions are the sources of slow wind. We suggest that the high $A_{He}$ from active regions could be explained by means of the magnetic loop confinement model and/or photoionization effect.

preprint2021arXiv

Ergodic property of random diffusivity system with trapping events

Brownian yet non-Gaussian phenomenon has recently been observed in many biological and active matter systems. The main idea of explaining this phenomenon is to introduce a random diffusivity for particles moving in inhomogeneous environment. This paper considers a Langevin system containing a random diffusivity and an $α$-stable subordinator with $α<1$. This model describes the particle&#39;s motion in complex media where both the long trapping events and random diffusivity exist. We derive the general expressions of ensemble- and time-averaged mean-squared displacements which only contain the values of the inverse subordinator and diffusivity. Further taking specific time-dependent diffusivity, we obtain the analytic expressions of ergodicity breaking parameter and probability density function of the time-averaged mean-squared displacement. The results imply the nonergodicity of the random diffusivity model for any kind of diffusivity, including the critical case where the model presenting normal diffusion.

preprint2021arXiv

Novel anomalous diffusion phenomena of underdamped Langevin equation with random parameters

The diffusion behavior of particles moving in complex heterogeneous environment is a very topical issue. We characterize particle&#39;s trajectory via an underdamped Langevin system driven by a Gaussian white noise with a time dependent diffusivity of velocity, together with a random relaxation timescale $τ$ to parameterize the effect of complex medium. We mainly concern how the random parameter $τ$ influences the diffusion behavior and ergodic property of this Langevin system. Besides, the comparison between the fixed and random initial velocity $v_0$ is conducted to show the effect of different initial ensembles. The heavy-tailed distribution of $τ$ with finite mean is found to suppress the decay rate of the velocity correlation function and promote the diffusion behavior, playing a competition role to the time dependent diffusivity. More interestingly, a random $v_0$ with a specific distribution depending on random $τ$ also enhances the diffusion. Both the random parameters $τ$ and $v_0$ influence the dynamics of the Langevin system in an non-obvious way, which cannot be ignored even they has finite moments.

preprint2021arXiv

PIC Simulation of Double Plasma Resonance and Zebra Pattern of Solar Radio Bursts

Latest study reports that plasma emission can be generated by energetic electrons of DGH distribution via the electron cyclotron maser instability (ECMI) in plasmas characterized by a large ratio of plasma oscillation frequency to electron gyro-frequency ($ω_{pe}/Ω_{ce}$). In this study, on the basis of the ECMI-plasma emission mechanism, we examine the double plasma resonance (DPR) effect and the corresponding plasma emission at both harmonic (H) and fundamental (F) bands using PIC simulations with various $ω_{pe}/Ω_{ce}$. This allows us to directly simulate the feature of zebra pattern (ZP) observed in solar radio bursts for the first time. We find that (1) the simulations reproduce the DPR effect nicely for the upper hybrid (UH) and Z modes, as seen from their variation of intensity and linear growth rate with $ω_{pe}/Ω_{ce}$, (2) the intensity of the H emission is stronger than that of the F emission by $\sim$ 2 orders of magnitude and vary periodically with increasing $ω_{pe}/Ω_{ce}$, while the F emission is too weak to be significant, therefore we suggest that it is the H emission accounting for solar ZPs, (3) the peak-valley contrast of the total intensity of H is $\sim 4$, and the peak lies around integer values of $ω_{pe}/Ω_{ce}$ (= 10 and 11) for the present parameter setup. We also evaluate the effect of energy of energetic electrons on the characteristics of ECMI-excited waves and plasma radiation. The study provides novel insight on the physical origin of ZPs of solar radio bursts.

preprint2021arXiv

Quadrupolar interaction induced frequency shift of 131Xe nuclear spins on the surface of silicon

The combination of micro-machined technology with the Atomic Spin Gyroscope(ASG) devices could fabricated Chip Scale Atomic Spin Gyroscope(CASG). The core of the gyroscope is a micro-machined vapor cell which contains alkali metal and isotope enriched noble gases such as 129Xe and 131Xe. The quadrupolar frequency shift of 131Xe is key parameters which could affect the drift of the ASG and is related to the material of the cell in which they are contained. In micro machined technology, the typical utilized material is silicon. In this article, we studied the electric quadrupolar frequency shift of 131Xe atoms with the silicon wall of the micro-machined vapor cell. A cylinder micro-machined vapor cell is utilized in the experiment and a large part of the inner cell surface is composed of silicon material. We studied the temperature dependence of the 129Xe spin relaxation and 131Xe frequency shifts to evaluate the interaction of the nuclear spin with container wall and the alkali metal atoms. The results show that the average twisted angle of the 131Xe nuclear spins as they collide with the silicon wall is measured to be 29 *10^-6 rad. The desorption energy for the 131Xe nuclear spin to escape from the silicon surface is Esi = 0.009eV . This study could help to improve the bias stability of the CASG which is a key parameter for the gyroscope as well as may developes a method to study the surface property of various material.

preprint2020arXiv

An Extreme Ultraviolet Wave Associated with A Solar Filament Activation

Extreme ultraviolet (EUV) waves are impressive coronal propagating disturbances. They are closely associated with various eruptions, and can used for the global coronal seismology and the acceleration of solar energetic particles. Hence, the study of EUV waves plays an important role in solar eruptions and Space Weather. Here we present an EUV wave associated with a filament activation that did not evolve into any eruption. Due to the continuous magnetic flux emergence and cancellation around its one end, the filament rose with untwisting motion, and the filament mass flowed towards another end along the rising fields. Intriguingly, following the filament activation, an EUV wave formed with a fast constant speed ($\sim$500 km s$^{-1}$) ahead of the mass flow, and the overlying coronal loops expanded both in lateral and radial directions. Excluding the possibility of a remote flare and an absent coronal mass ejection, we suggest that the EUV wave was only closely associated with the filament activation. Furthermore, their intimate spacial and temporal relationship indicates that the EUV wave was likely directly triggered by the lateral expansion of overlying loops. We propose that the EUV wave can be interpreted as linear fast-mode wave, and the most vital key for the successful generation of the EUV wave is the impulsive early-phase lateral expansion of overlying loops that was driven by the activated filament mass flow without any eruption.

preprint2020arXiv

Dynamical modulation of solar flare electron acceleration due to plasmoid-shock interactions in the looptop region

A fast-mode shock can form in the front of reconnection outflows and has been suggested as a promising site for particle acceleration in solar flares. Recent development of magnetic reconnection has shown that numerous plasmoids can be produced in a large-scale current layer. Here we investigate the dynamical modulation of electron acceleration in the looptop region when plasmoids intermittently arrive at the shock by combining magnetohydrodynamics simulations with a particle kinetic model. As plasmoids interact with the shock, the looptop region exhibits various compressible structures that modulate the production of energetic electrons. The energetic electron population varies rapidly in both time and space. The number of 5$-$10 keV electrons correlates well with the area with compression, while that of $>$50 keV electrons shows good correlation with strong compression area but only moderate correlation with shock parameters. We further examine the impacts of the first plasmoid, which marks the transition from a quasi-steady shock front to a distorted and dynamical shock. The number of energetic electrons is reduced by $\sim 20\%$ at 15$-$25 keV and nearly 40\% for 25$-$50 keV, while the number of 5$-$10 keV electrons increases. In addition, the electron energy spectrum above 10 keV evolves softer with time. We also find double or even multiple distinct sources can develop in the looptop region when the plasmoids move across the shock. Our simulations have strong implications to the interpretation of nonthermal looptop sources, as well as the commonly observed fast temporal variations in flare emissions, including the quasi-periodic pulsations.

preprint2020arXiv

EDD: Efficient Differentiable DNN Architecture and Implementation Co-search for Embedded AI Solutions

High quality AI solutions require joint optimization of AI algorithms and their hardware implementations. In this work, we are the first to propose a fully simultaneous, efficient differentiable DNN architecture and implementation co-search (EDD) methodology. We formulate the co-search problem by fusing DNN search variables and hardware implementation variables into one solution space, and maximize both algorithm accuracy and hardware implementation quality. The formulation is differentiable with respect to the fused variables, so that gradient descent algorithm can be applied to greatly reduce the search time. The formulation is also applicable for various devices with different objectives. In the experiments, we demonstrate the effectiveness of our EDD methodology by searching for three representative DNNs, targeting low-latency GPU implementation and FPGA implementations with both recursive and pipelined architectures. Each model produced by EDD achieves similar accuracy as the best existing DNN models searched by neural architecture search (NAS) methods on ImageNet, but with superior performance obtained within 12 GPU-hour searches. Our DNN targeting GPU is 1.40x faster than the state-of-the-art solution reported in Proxyless, and our DNN targeting FPGA delivers 1.45x higher throughput than the state-of-the-art solution reported in DNNBuilder.

preprint2020arXiv

GPO: Global Plane Optimization for Fast and Accurate Monocular SLAM Initialization

Initialization is essential to monocular Simultaneous Localization and Mapping (SLAM) problems. This paper focuses on a novel initialization method for monocular SLAM based on planar features. The algorithm starts by homography estimation in a sliding window. It then proceeds to a global plane optimization (GPO) to obtain camera poses and the plane normal. 3D points can be recovered using planar constraints without triangulation. The proposed method fully exploits the plane information from multiple frames and avoids the ambiguities in homography decomposition. We validate our algorithm on the collected chessboard dataset against baseline implementations and present extensive analysis. Experimental results show that our method outperforms the fine-tuned baselines in both accuracy and real-time.

preprint2020arXiv

HaoCL: Harnessing Large-scale Heterogeneous Processors Made Easy

The pervasive adoption of Deep Learning (DL) and Graph Processing (GP) makes it a de facto requirement to build large-scale clusters of heterogeneous accelerators including GPUs and FPGAs. The OpenCL programming framework can be used on the individual nodes of such clusters but is not intended for deployment in a distributed manner. Fortunately, the original OpenCL semantics naturally fit into the programming environment of heterogeneous clusters. In this paper, we propose a heterogeneity-aware OpenCL-like (HaoCL) programming framework to facilitate the programming of a wide range of scientific applications including DL and GP workloads on large-scale heterogeneous clusters. With HaoCL, existing applications can be directly deployed on heterogeneous clusters without any modifications to the original OpenCL source code and without awareness of the underlying hardware topologies and configurations. Our experiments show that HaoCL imposes a negligible overhead in a distributed environment, and provides near-linear speedups on standard benchmarks when computation or data size exceeds the capacity of a single node. The system design and the evaluations are presented in this demo paper.

preprint2020arXiv

Lévy-walk-like Langevin dynamics affected by a time-dependent force

Lévy walk is a popular and more `physical&#39; model to describe the phenomena of superdiffusion, because of its finite velocity. The movements of particles are under the influences of external potentials almost at anytime and anywhere. In this paper, we establish a Langevin system coupled with a subordinator to describe the Lévy walk in the time-dependent periodic force field. The effects of external force are detected and carefully analyzed, including nonzero first moment (even though the force is periodic), adding an additional dispersion on the particle position, the consistent influence on the ensemble- and time-averaged mean-squared displacement, etc. Besides, the generalized Klein-Kramers equation is obtained, not only for the time-dependent force but also for space-dependent one.

preprint2020arXiv

Phonon-induced anomalous gauge potential for photonic isolation in frequency space

Photonic gauge potentials are crucial for manipulating charge-neutral photons like their counterpart electrons in the electromagnetic field, allowing analogous Aharonov-Bohm effect in photonics and paving the way for critical applications like photonic isolation. Normally, a gauge potential exhibits phase inversion along two opposite propagation paths. Here we experimentally demonstrate phonon-induced anomalous gauge potentials with non-inverted gauge phases in a spatial-frequency space, where quasi-phase-matched nonlinear Brillouin scatterings enable such unique direction-dependent gauge phases. Based on this scheme, we construct photonic isolators in the frequency domain permitting nonreciprocal propagation of light along the frequency axis, where coherent phase control in the photonic isolator allows switching completely the directionality through an Aharonov-Bohm interferometer. Moreover, similar coherent controlled unidirectional frequency conversions are also illustrated. These results may offer a unique platform for a compact, integrated solution to implement synthetic-dimension devices for on-chip optical signal processing.

preprint2020arXiv

TAG : Type Auxiliary Guiding for Code Comment Generation

Existing leading code comment generation approaches with the structure-to-sequence framework ignores the type information of the interpretation of the code, e.g., operator, string, etc. However, introducing the type information into the existing framework is non-trivial due to the hierarchical dependence among the type information. In order to address the issues above, we propose a Type Auxiliary Guiding encoder-decoder framework for the code comment generation task which considers the source code as an N-ary tree with type information associated with each node. Specifically, our framework is featured with a Type-associated Encoder and a Type-restricted Decoder which enables adaptive summarization of the source code. We further propose a hierarchical reinforcement learning method to resolve the training difficulties of our proposed framework. Extensive evaluations demonstrate the state-of-the-art performance of our framework with both the auto-evaluated metrics and case studies.

preprint2020arXiv

The initiation of a solar streamer blowout coronal mass ejection arising from the streamer flank

Streamer blowout (SBO) coronal mass ejections (CMEs) represent a particular class of CMEs that are characterized by a gradual swelling of the overlying streamer and a slow CME containing a flux-rope structure. SBO CMEs arising from the streamer flank fall into a special category of SBO CMEs involving three lower arches under the higher streamer arcade. However, the initiation mechanism for this special category of SBO CMEs remains elusive, due to the observational limitations. Here we report critical observations of a SBO CME associated with the eruption of a polar crown filament that originated from the streamer flank. The filament slowly rose toward the solar equator with the writhing motion, and underwent a sudden acceleration before its eruption. Interestingly, during the rising, the filament fields experienced gradual external reconnections, which is evidenced by the dip-shaped bottom of the enveloping flux-rope structure changing from a smooth concave, the slow inflows ($\sim$1.8 km s$^{-1}$) from both the filament fields and the coronal loops beneath, and the persistent brightenings around the interface between the filament fields and the coronal loops beneath. The newly formed lower loops at the filament source and the Y-shaped structure in the stretched tail fields indicate the internal reconnections for the filament eruption. The clear signatures of the external and internal reconnections shed light on the initiation mechanisms of SBO CMEs.

preprint2020arXiv

VecQ: Minimal Loss DNN Model Compression With Vectorized Weight Quantization

Quantization has been proven to be an effective method for reducing the computing and/or storage cost of DNNs. However, the trade-off between the quantization bitwidth and final accuracy is complex and non-convex, which makes it difficult to be optimized directly. Minimizing direct quantization loss (DQL) of the coefficient data is an effective local optimization method, but previous works often neglect the accurate control of the DQL, resulting in a higher loss of the final DNN model accuracy. In this paper, we propose a novel metric called Vector Loss. Based on this new metric, we develop a new quantization solution called VecQ, which can guarantee minimal direct quantization loss and better model accuracy. In addition, in order to speed up the proposed quantization process during model training, we accelerate the quantization process with a parameterized probability estimation method and template-based derivation calculation. We evaluate our proposed algorithm on MNIST, CIFAR, ImageNet, IMDB movie review and THUCNews text data sets with numerical DNN models. The results demonstrate that our proposed quantization solution is more accurate and effective than the state-of-the-art approaches yet with more flexible bitwidth support. Moreover, the evaluation of our quantized models on Saliency Object Detection (SOD) tasks maintains comparable feature extraction quality with up to 16$\times$ weight size reduction.

preprint2019arXiv

Langevin picture of Lévy walk in a constant force field

Lévy walk is a practical model and has wide applications in various fields. Here we focus on the effect of an external constant force on the Lévy walk with the exponent of the power-law distributed flight time $α\in(0,2)$. We add the term $Fη(s)$ ($η(s)$ is the Lévy noise) on a subordinated Langevin system to characterize such a constant force, being effective on the velocity process for all physical time after the subordination. We clearly show the effect of the constant force $F$ on this Langevin system and find this system is like the continuous limit of the collision model. The first moments of velocity processes for these two models are consistent. In particular, based on the velocity correlation function derived from our subordinated Langevin equation, we investigate more interesting statistical quantities, such as the ensemble- and time-averaged mean squared displacements. Under the influence of constant force, the diffusion of particles becomes faster. Finally, the super-ballistic diffusion and the non-ergodic behavior are verified by the simulations with different $α$.

preprint2019arXiv

Strong anomalous diffusion in two-state process with Lévy walk and Brownian motion

Strong anomalous diffusion phenomena are often observed in complex physical and biological systems, which are characterized by the nonlinear spectrum of exponents $qν(q)$ by measuring the absolute $q$-th moment $\langle |x|^q\rangle$. This paper investigates the strong anomalous diffusion behavior of a two-state process with Lévy walk and Brownian motion, which usually serves as an intermittent search process. The sojourn times in Lévy walk and Brownian phases are taken as power law distributions with exponents $α_+$ and $α_-$, respectively. Detailed scaling analyses are performed for the coexistence of three kinds of scalings in this system. Different from the pure Lévy walk, the phenomenon of strong anomalous diffusion can be observed for this two-state process even when the distribution exponent of Lévy walk phase satisfies $α_+<1$, provided that $α_-<α_+$. When $α_+<2$, the probability density function (PDF) in the central part becomes a combination of stretched Lévy distribution and Gaussian distribution due to the long sojourn time in Brownian phase, while the PDF in the tail part (in the ballistic scaling) is still dominated by the infinite density of Lévy walk.

preprint2019arXiv

The Acceleration and Confinement of Energetic Electrons by a Termination Shock in a Magnetic Trap: An Explanation for Nonthermal Loop-top Sources during Solar Flares

Nonthermal loop-top sources in solar flares are the most prominent observational signature that suggests energy release and particle acceleration in the solar corona. Although several scenarios for particle acceleration have been proposed, the origin of the loop-top sources remains unclear. Here we present a model that combines a large-scale magnetohydrodynamic simulation of a two-ribbon flare with a particle acceleration and transport model for investigating electron acceleration by a fast-mode termination shock at the looptop. Our model provides spatially resolved electron distribution that evolves in response to the dynamic flare geometry. We find a concave-downward magnetic structure located below the flare termination shock, induced by the fast reconnection downflows. It acts as a magnetic trap to confine the electrons at the looptop for an extended period of time. The electrons are energized significantly as they cross the shock front, and eventually build up a power-law energy spectrum extending to hundreds of keV. We suggest that this particle acceleration and transport scenario driven by a flare termination shock is a viable interpretation for the observed nonthermal loop-top sources.

preprint2019arXiv

Theory of relaxation dynamics for anomalous diffusion processes in harmonic potential

Optical tweezers setup is often used to probe the motion of individual tracer particle, which promotes the study of relaxation dynamics of a generic process confined in a harmonic potential. We uncover the dependence of ensemble- and time-averaged mean square displacements of confined processes on the velocity correlation function $C(t,t+τ)$ of the original process. With two different scaling forms of $C(t,t+τ)$ for small $τ$ and large $τ$, the stationary value and the relaxation behaviors can be obtained immediately. The gotten results are valid for a large amount of anomalous diffusion processes, including fractional Brownian motion, scaled Brownian motion, and the multi-scale Lévy walk with different exponents of running time distribution.