Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
37works
0followers
15topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

37 published item(s)

preprint2026arXiv

HCInfer: An Efficient Inference System via Error Compensation for Resource-Constrained Devices

LLMs often struggle with memory-constrained deployment on consumer-grade hardware due to their massive parameter sizes. While existing solutions such as model compression and offloading improve deployment feasibility, they often suffer from substantial accuracy degradation or severe throughput bottlenecks. Recent error compensation methods recover accuracy through auxiliary LoRA-style branches, and we observe that these branches are inherently amenable to offloading: they require substantial parameter storage but access only a small subset of compensation parameters during each inference step. Motivated by this opportunity, we propose HCInfer, a heterogeneous inference system that offloads residual compensation to the CPU while executing the compressed backbone on the GPU, and further introduces an asynchronous compensation pipeline and sensitivity-aware dynamic rank allocation to hide compensation overhead and maximize accuracy recovery. Experimental results show that HCInfer achieves a maximum accuracy improvement of 5.2% on downstream tasks compared to compression model and sustaining a maximum speedup of 10.4x compared to full-precision model.

preprint2026arXiv

Inferring Causal Graph Temporal Logic Formulas to Expedite Reinforcement Learning in Temporally Extended Tasks

Decision-making tasks often unfold on graphs with spatial-temporal dynamics. Black-box reinforcement learning often overlooks how local changes spread through network structure, limiting sample efficiency and interpretability. We present GTL-CIRL, a closed-loop framework that simultaneously learns policies and mines Causal Graph Temporal Logic (Causal GTL) specifications. The method shapes rewards with robustness, collects counterexamples when effects fail, and uses Gaussian Process (GP) driven Bayesian optimization to refine parameterized cause templates. The GP models capture spatial and temporal correlations in the system dynamics, enabling efficient exploration of complex parameter spaces. Case studies in gene and power networks show faster learning and clearer, verifiable behavior compared to standard RL baselines.

preprint2024arXiv

Simultaneous observations of a breakout current sheet and a flare current sheet in a coronal jet event

Previous studies have revealed that solar coronal jets triggered by the eruption of mini-filaments (MFs) conform to the famous magnetic-breakout mechanism. In such scenario, a breakout current sheet (BCS) and a flare current sheet (FCS) should be observed during the jets. With high spatial and temporal resolution data from the SDO, the NVST, the RHESSI, the Wind, and the GOES, we present observational evidence of a BCS and a FCS formation during coronal jets driven by a MF eruption occurring in the active region NOAA 11726 on 2013 April 21. Magnetic field extrapolation show that the MF was enclosed by a fan-spine magnetic structure. The MF was activated by flux cancellation under it, and then slowly rose. A BCS formed when the magnetic fields wrapping the MF squeezed to antidirectional external open fields. Simultaneously, one thin bright jet and two bidirectional jet-like structures were observed. As the MF erupted as a blowout jet, a FCS was formed when the two distended legs inside the MF field came together. One end of the FCS connected the post-flare loops. The peak temperature of BCS was calculated to be 2.5 MK. The length, width and peak temperature of FCS was calculated to be 4.35-4.93 Mm, 1.31-1.45 Mm, and 2.5 MK, respectively. The magnetic reconnection rate associated with the FCS was estimated to be from 0.266 to 0.333. This event also related to a type III radio burst, indicating its influence on interplanetary space. These observations support the scenario of the breakout model as the trigger mechanism of coronal jets, and flux cancellation was the driver of this event.

preprint2024arXiv

The Decay of Two Adjacent Sunspots Associated with Moving Magnetic Features

The relationship between the decay of sunspots and moving magnetic features (MMFs) plays an important role in understanding the evolution of active regions. We present observations of two adjacent sunspots, the gap between them, and a lot of MMFs propagating from the gap and the sunspots&#39; outer edges in NOAA Active Region 13023. The MMFs are divided into two types based on their magnetic field inclination angle: vertical (0°<γ<45°) and horizontal (45°<γ<90°) MMFs (V-MMFs and H-MMFs, respectively). The main results are as follows: (1) the mean magnetic flux decay rates of the two sunspots are -1.7*10^20 and -1.4*10^20 Mx/day; (2) the magnetic flux generation rate of all MMFs is calculated to be -1.9 *10^21 Mx/day, which is on average 5.6 times higher than the total magnetic flux loss rate of the sunspots; (3) the magnetic flux of V-MMFs (including a pore separated from the sunspots) is 1.4 times larger than the total lost magnetic flux of the two sunspots, and in a later stage when the pore has passed through the reference ellipse, the magnetic flux generation rate of the V-MMFs is almost the same as the magnetic flux loss rate of the sunspots; and (4) within the gap, the magnetic flux of V-MMFs is one third of the total magnetic flux. Few V-MMFs stream out from the sunspots at the nongap region. All observations suggest that MMFs with vertical magnetic fields are closely related to the disintegration of the sunspot, and most of the MMFs from the gap may originate directly from the sunspot umbra.

preprint2023arXiv

Constrained Active Classification Using Partially Observable Markov Decision Processes

In this work, we study the problem of actively classifying the attributes of dynamical systems characterized as a finite set of Markov decision process (MDP) models. We are interested in finding strategies that actively interact with the dynamical system and observe its reactions so that the attribute of interest is classified efficiently with high confidence. We present a decision-theoretic framework based on partially observable Markov decision processes (POMDPs). The proposed framework relies on assigning a classification belief (a probability distribution) to the attributes of interest. Given an initial belief, a confidence level over which a classification decision can be made, a cost bound, safe belief sets, and a finite time horizon, we compute POMDP strategies leading to classification decisions. We present three different algorithms to compute such strategies. The first algorithm computes the optimal strategy exactly by value iteration. To overcome the computational complexity of computing the exact solutions, we propose a second algorithm based on adaptive sampling and a third based on a Monte Carlo tree search to approximate the optimal probability of reaching a classification decision. We illustrate the proposed methodology using examples from medical diagnosis, security surveillance, and wildlife classification.

preprint2023arXiv

Onset mechanism of an inverted U-shaped solar filament eruption revealed by NVST, SDO, and STEREO-A observations

Utilizing observations from the New Vacuum Solar Telescope (NVST), Solar Dynamics Observatory (SDO), and Solar Terrestrial Relations Observatory-Ahead (STEREO-A), we investigate the event from two distinct observational perspectives: on the solar disk using NVST and SDO, and on the solar limb using STEREO-A. We employ both a non-linear force-free field model and a potential field model to reconstruct the coronal magnetic field, aiming to understand its magnetic properties. Two precursor jet-like activities were observed before the eruption, displaying an untwisted rotation. The second activity released an estimated twist of over two turns. During these two jet-like activities, Y-shaped brightenings, newly emerging magnetic flux accompanied by magnetic cancellation, and the formation of newly moving fibrils were identified. Combining these observational features, it can be inferred that these two precursor jet-like activities released the magnetic field constraining the filament and were triggered by newly emerging magnetic flux. Before the filament eruption, it was observed that some moving flows had been ejected from the site as the onset of two jet-like activities, indicating the same physical process as two jet-like activities. Extrapolations revealed that the filament laid under the height of the decay index of 1.0 and had strong magnetic field (540 Gauss) and a high twisted number (2.4 turns) before the eruption. An apparent rotational motion was observed during the filament eruption. We deduce that the solar filament, exhibiting an inverted U-shape, is a significantly twisted flux rope. The eruption of the filament was initiated by the release of constraining magnetic fields through continuous magnetic reconnection. This reconnection process was triggered by the emergence of newly magnetic flux.

preprint2023arXiv

Propagation of spin channel waves

A mutilated model is constructed to approximate the collision term of spin Boltzmann equation that incorporates newly appearing collisional invariants i.e, the total angular momentum. With recourse to degenerate perturbation theory, the dispersion relations of hydrodynamic modes are formulated, among which spin modes are responsible for spin equilibration. We find that the non-locality does not change the sound speed but slows down the propagation of spin channel waves. The damping rates of spin modes are close to those of spinless modes over a reasonable parameter value range. The results reveal that both spin and momentum should be treated simultaneously in a unified transport framework. In the nonrelativistic limit, the short-wavelength behavior for normal modes is also explored and there exists a critical point for every distinct discrete mode over which only quasiparticle modes contribute.

preprint2022arXiv

A partial filament eruption in three steps induced by external magnetic reconnection

We present an investigation of partial filament eruption on 2012 June 17 in the active region NOAA 11504. For the first time, we observed the vertical splitting process during the partial eruption with high resolution narrow band images at 10830 . The active filament was rooted in a small sunspot of the active region. Particularly, it underwent the partial eruption in three steps, i.e. the precursor, the first eruption, and the second eruption, while the later two were associated with a C1.0 flare and a C3.9 flare, respectively. During the precursor, slow magnetic reconnection took place between the filament and the adjoining loops that also rooted in the sunspot. The continuous reconnection not only caused the filament to split into three groups of threads vertically but also formed a new filament, which was growing and accompanied brightening took place around the site. Subsequently, the growing filament erupted together with one group splitted threads, resulted in the first eruption. At the beginning of the first eruption, a subsequent magnetic reconnection occurred between the erupting splitted threads and another ambient magnetic loop. After about three minutes, the second eruption occurred as a result of the eruption of two larger unstable filaments induced by the magnetic reconnection. The high-resolution observation provides a direct evidence that magnetic reconnection between filament and its ambient magnetic fields could induce the vertical splitting of the filament, resulting in partial eruption.

preprint2022arXiv

All-Around Real Label Supervision: Cyclic Prototype Consistency Learning for Semi-supervised Medical Image Segmentation

Semi-supervised learning has substantially advanced medical image segmentation since it alleviates the heavy burden of acquiring the costly expert-examined annotations. Especially, the consistency-based approaches have attracted more attention for their superior performance, wherein the real labels are only utilized to supervise their paired images via supervised loss while the unlabeled images are exploited by enforcing the perturbation-based \textit{&#34;unsupervised&#34;} consistency without explicit guidance from those real labels. However, intuitively, the expert-examined real labels contain more reliable supervision signals. Observing this, we ask an unexplored but interesting question: can we exploit the unlabeled data via explicit real label supervision for semi-supervised training? To this end, we discard the previous perturbation-based consistency but absorb the essence of non-parametric prototype learning. Based on the prototypical network, we then propose a novel cyclic prototype consistency learning (CPCL) framework, which is constructed by a labeled-to-unlabeled (L2U) prototypical forward process and an unlabeled-to-labeled (U2L) backward process. Such two processes synergistically enhance the segmentation network by encouraging more discriminative and compact features. In this way, our framework turns previous \textit{&#34;unsupervised&#34;} consistency into new \textit{&#34;supervised&#34;} consistency, obtaining the \textit{&#34;all-around real label supervision&#34;} property of our method. Extensive experiments on brain tumor segmentation from MRI and kidney segmentation from CT images show that our CPCL can effectively exploit the unlabeled data and outperform other state-of-the-art semi-supervised medical image segmentation methods.

preprint2022arXiv

Bio-inspired Intelligence with Applications to Robotics: A Survey

In the past decades, considerable attention has been paid to bio-inspired intelligence and its applications to robotics. This paper provides a comprehensive survey of bio-inspired intelligence, with a focus on neurodynamics approaches, to various robotic applications, particularly to path planning and control of autonomous robotic systems. Firstly, the bio-inspired shunting model and its variants (additive model and gated dipole model) are introduced, and their main characteristics are given in detail. Then, two main neurodynamics applications to real-time path planning and control of various robotic systems are reviewed. A bio-inspired neural network framework, in which neurons are characterized by the neurodynamics models, is discussed for mobile robots, cleaning robots, and underwater robots. The bio-inspired neural network has been widely used in real-time collision-free navigation and cooperation without any learning procedures, global cost functions, and prior knowledge of the dynamic environment. In addition, bio-inspired backstepping controllers for various robotic systems, which are able to eliminate the speed jump when a large initial tracking error occurs, are further discussed. Finally, the current challenges and future research directions are discussed in this paper.

preprint2022arXiv

Consensus Formation Tracking for Multiple AUV Systems Using Distributed Bioinspired Sliding Mode Control

Consensus formation tracking of multiple autonomous underwater vehicles (AUVs) subject to nonlinear and uncertain dynamics is a challenging problem in robotics. To tackle this challenge, a distributed bioinspired sliding mode controller is proposed in this paper. First, the conventional sliding mode controller (SMC) is presented, and the consensus problem is addressed on the basis of graph theory. Next, to tackle the high frequency chattering issue in SMC scheme and meanwhile improve the robustness to the noises, a bioinspired approach is introduced, in which a neural dynamic model is employed to replace the nonlinear sign or saturation function in the synthesis of conventional sliding mode controllers. Furthermore, the input-to-state stability of the resulting closed-loop system is proved in the presence of bounded lumped disturbance by the Lyapunov stability theory. Finally, simulation experiments are conducted to demonstrate the effectiveness of the proposed distributed formation control protocol.

preprint2022arXiv

Deformer: Towards Displacement Field Learning for Unsupervised Medical Image Registration

Recently, deep-learning-based approaches have been widely studied for deformable image registration task. However, most efforts directly map the composite image representation to spatial transformation through the convolutional neural network, ignoring its limited ability to capture spatial correspondence. On the other hand, Transformer can better characterize the spatial relationship with attention mechanism, its long-range dependency may be harmful to the registration task, where voxels with too large distances are unlikely to be corresponding pairs. In this study, we propose a novel Deformer module along with a multi-scale framework for the deformable image registration task. The Deformer module is designed to facilitate the mapping from image representation to spatial transformation by formulating the displacement vector prediction as the weighted summation of several bases. With the multi-scale framework to predict the displacement fields in a coarse-to-fine manner, superior performance can be achieved compared with traditional and learning-based approaches. Comprehensive experiments on two public datasets are conducted to demonstrate the effectiveness of the proposed Deformer module as well as the multi-scale framework.

preprint2022arXiv

Double-Uncertainty Guided Spatial and Temporal Consistency Regularization Weighting for Learning-based Abdominal Registration

In order to tackle the difficulty associated with the ill-posed nature of the image registration problem, regularization is often used to constrain the solution space. For most learning-based registration approaches, the regularization usually has a fixed weight and only constrains the spatial transformation. Such convention has two limitations: (i) Besides the laborious grid search for the optimal fixed weight, the regularization strength of a specific image pair should be associated with the content of the images, thus the &#34;one value fits all&#34; training scheme is not ideal; (ii) Only spatially regularizing the transformation may neglect some informative clues related to the ill-posedness. In this study, we propose a mean-teacher based registration framework, which incorporates an additional temporal consistency regularization term by encouraging the teacher model&#39;s prediction to be consistent with that of the student model. More importantly, instead of searching for a fixed weight, the teacher enables automatically adjusting the weights of the spatial regularization and the temporal consistency regularization by taking advantage of the transformation uncertainty and appearance uncertainty. Extensive experiments on the challenging abdominal CT-MRI registration show that our training strategy can promisingly advance the original learning-based method in terms of efficient hyperparameter tuning and a better tradeoff between accuracy and smoothness.

preprint2022arXiv

Incomplete electromagnetic response of hot QCD matter

The electromagnetic response of hot QCD matter to decaying external magnetic fields is investigated. We examine the validity of Ohm&#39;s law and find that the induced electric current increases from zero and relaxes towards the value from Ohm&#39;s law. The relaxation time is larger than the lifetime of the external magnetic field for the QCD matter in relativistic heavy-ion collisions. The lower than expected electric current significantly suppresses the induced magnetic field and makes the electromagnetic response incomplete. We demonstrate the incomplete electromagnetic response of hot QCD matter by calculations employing the parton transport model combined with the solution of Maxwell&#39;s equations. Our results show a strong suppression by two orders of magnitude in the magnetic field, relatively to calculations assuming the validity of Ohm&#39;s law. This may undermine experimental efforts to measure magnetic-field-related effects in heavy-ion collisions.

preprint2022arXiv

Joint Inference of Reward Machines and Policies for Reinforcement Learning

Incorporating high-level knowledge is an effective way to expedite reinforcement learning (RL), especially for complex tasks with sparse rewards. We investigate an RL problem where the high-level knowledge is in the form of reward machines, i.e., a type of Mealy machine that encodes the reward functions. We focus on a setting in which this knowledge is a priori not available to the learning agent. We develop an iterative algorithm that performs joint inference of reward machines and policies for RL (more specifically, q-learning). In each iteration, the algorithm maintains a hypothesis reward machine and a sample of RL episodes. It derives q-functions from the current hypothesis reward machine, and performs RL to update the q-functions. While performing RL, the algorithm updates the sample by adding RL episodes along which the obtained rewards are inconsistent with the rewards based on the current hypothesis reward machine. In the next iteration, the algorithm infers a new hypothesis reward machine from the updated sample. Based on an equivalence relationship we defined between states of reward machines, we transfer the q-functions between the hypothesis reward machines in consecutive iterations. We prove that the proposed algorithm converges almost surely to an optimal policy in the limit if a minimal reward machine can be inferred and the maximal length of each RL episode is sufficiently long. The experiments show that learning high-level knowledge in the form of reward machines can lead to fast convergence to optimal policies in RL, while standard RL methods such as q-learning and hierarchical RL methods fail to converge to optimal policies after a substantial number of training steps in many tasks.

preprint2022arXiv

Magnetic field induced hair structure in charmonium gluon-dissociation

We study electromagnetic field effect on charmonium gluon-dissociation in quark-gluon plasma. With the effective Hamiltonian derived from QCD multipole expansion under an external electromagnetic field, we first solve the two-body Schrödinger equation for a pair of charm quarks with mean field potentials for color and electromagnetic interactions and obtain the charmonium binding energies and wave functions, and then calculate the gluon-dissociation cross-section and decay width by taking the color electric and magnetic dipole interactions as perturbations above the mean field and employing Fermi&#39;s Golden Rule. Considering the charmonium deformation in magnetic field, the discrete Landau energy levels make the dissociation cross-section grow hair, and the electric dipole channel is significantly changed, especially for the $P-$wave states $χ_{c0}$ and $χ_{c\pm}$. From our numerical calculation, the magnetic field strength $eB=5~m_π^2$ already changes the gluon dissociation strongly, which may indicate measurable effects in high-energy nuclear collisions.

preprint2022arXiv

Non-Parametric Neuro-Adaptive Coordination of Multi-Agent Systems

We develop a learning-based algorithm for the distributed formation control of networked multi-agent systems governed by unknown, nonlinear dynamics. Most existing algorithms either assume certain parametric forms for the unknown dynamic terms or resort to unnecessarily large control inputs in order to provide theoretical guarantees. The proposed algorithm avoids these drawbacks by integrating neural network-based learning with adaptive control in a two-step procedure. In the first step of the algorithm, each agent learns a controller, represented as a neural network, using training data that correspond to a collection of formation tasks and agent parameters. These parameters and tasks are derived by varying the nominal agent parameters and the formation specifications of the task in hand, respectively. In the second step of the algorithm, each agent incorporates the trained neural network into an online and adaptive control policy in such a way that the behavior of the multi-agent closed-loop system satisfies a user-defined formation task. Both the learning phase and the adaptive control policy are distributed, in the sense that each agent computes its own actions using only local information from its neighboring agents. The proposed algorithm does not use any a priori information on the agents&#39; unknown dynamic terms or any approximation schemes. We provide formal theoretical guarantees on the achievement of the formation task.

preprint2022arXiv

Non-Parametric Neuro-Adaptive Formation Control

We develop a learning-based algorithm for the distributed formation control of networked multi-agent systems governed by unknown, nonlinear dynamics. Most existing algorithms either assume certain parametric forms for the unknown dynamic terms or resort to unnecessarily large control inputs in order to provide theoretical guarantees. The proposed algorithm avoids these drawbacks by integrating neural network-based learning with adaptive control in a two-step procedure. In the first step of the algorithm, each agent learns a controller, represented as a neural network, using training data that correspond to a collection of formation tasks and agent parameters. These parameters and tasks are derived by varying the nominal agent parameters and a user-defined formation task to be achieved, respectively. In the second step of the algorithm, each agent incorporates the trained neural network into an online and adaptive control policy in such a way that the behavior of the multi-agent closed-loop system satisfies the user-defined formation task. Both the learning phase and the adaptive control policy are distributed, in the sense that each agent computes its own actions using only local information from its neighboring agents. The proposed algorithm does not use any a priori information on the agents&#39; unknown dynamic terms or any approximation schemes. We provide formal theoretical guarantees on the achievement of the formation task.

preprint2022arXiv

Reconfiguration and eruption of a solar filament by magnetic reconnection with an emerging magnetic field

Both observations and simulations suggest that the solar filament eruption is closely related to magnetic flux emergence. It is thought that the eruption is triggered by magnetic reconnection between the filament and the emerging flux. However, the details of such a reconnection are rarely presented. In this study, we report the detailed reconnection between a filament and its nearby emerging fields, that led to the reconfiguration and subsequent partial eruption of the filament located over the polarity inversion line of active region 12816. Before the reconnection, we observed repeated brightenings in the filament at a location that overlies a site of magnetic flux cancellation. Plasmoids form at this brightening region, and propagate bi-directionally along the filament. These indicate the tether-cutting reconnection that results in the formation and eruption of a flux rope. To the northwest of the filament, magnetic fields emerge, and reconnect with the context ones, resulting in repeated jets. Afterwards, another magnetic fields emerge near the northwestern filament endpoints, and reconnect with the filament, forming the newly reconnected filament and loops. Current sheet repeatedly occurs at the interface, with the mean temperature and emission measure of 1.7 MK and 1.1$\times$10$^{28}$ cm$^{-5}$. Plasmoids form in the current sheet, and propagate along it and further along the newly reconnected filament and loops. The newly reconnected filament then erupts, while the unreconnected filament remains stable. We propose that besides the orientation of emerging fields, some other parameters, such as the position, distance, strength, and area, are also crucial for triggering the filament eruption.

preprint2022arXiv

Seeking Common Ground While Reserving Differences: Multiple Anatomy Collaborative Framework for Undersampled MRI Reconstruction

Recently, deep neural networks have greatly advanced undersampled Magnetic Resonance Image (MRI) reconstruction, wherein most studies follow the one-anatomy-one-network fashion, i.e., each expert network is trained and evaluated for a specific anatomy. Apart from inefficiency in training multiple independent models, such convention ignores the shared de-aliasing knowledge across various anatomies which can benefit each other. To explore the shared knowledge, one naive way is to combine all the data from various anatomies to train an all-round network. Unfortunately, despite the existence of the shared de-aliasing knowledge, we reveal that the exclusive knowledge across different anatomies can deteriorate specific reconstruction targets, yielding overall performance degradation. Observing this, in this study, we present a novel deep MRI reconstruction framework with both anatomy-shared and anatomy-specific parameterized learners, aiming to &#34;seek common ground while reserving differences&#34; across different anatomies.Particularly, the primary anatomy-shared learners are exposed to different anatomies to model flourishing shared knowledge, while the efficient anatomy-specific learners are trained with their target anatomy for exclusive knowledge. Four different implementations of anatomy-specific learners are presented and explored on the top of our framework in two MRI reconstruction networks. Comprehensive experiments on brain, knee and cardiac MRI datasets demonstrate that three of these learners are able to enhance reconstruction performance via multiple anatomy collaborative learning.

preprint2022arXiv

Statistical analysis of circular-ribbon flares

Circular-ribbon flares (CFs) are a special type of solar flares owing to their particular magnetic topology. In this paper, we conducted a comprehensive statistical analysis of 134 CFs from 2011 September to 2017 June, including four B-class, 82 C-class, 40 M-class, and eight X-class flares, respectively. The flares were observed by the Atmospheric Imaging Assembly (AIA) on board the Solar Dynamics Observatory (SDO) spacecraft. The physical properties of CFs are derived, including the location, area ($A_{CF}$), equivalent radius ($r_{CF}$) assuming a semi-spherical fan dome, lifetime ($τ_{CF}$), and peak SXR flux in 1$-$8 Å. It is found that all CFs are located in active regions, with the latitudes between -30$^\circ$ and 30$^\circ$. The distributions of areas and lifetimes could be fitted with a log-normal function. There is a positive correlation between the lifetime and area. The peak SXR flux in 1$-$8 Å is well in accord with a power-law distribution with an index of $-$1.42. For the 134 CFs, 57\% of them are accompanied by remote brightenings or ribbons. A positive correlation exists between the total length ($L_{RB}$) and average distance ($D_{RB}$) of remote brightenings. About 47\% and 51\% of the 134 CFs are related to type III radio bursts and jets, respectively. The association rates are independent of flare energies. About 38\% of CFs are related to mini-filament eruptions, and the association rates increase with flare classes. Only 28\% of CFs are related to CMEs, meaning that a majority of them are confined rather than eruptive events. There is a positive correlation between the CME speed and peak SXR flux in 1$-$8 Å, and faster CMEs tend to be wider.

preprint2022arXiv

Sunspot shearing and sudden retraction motion associated with the 2013 August 17 M3.3 Flare

In this Letter, we give a detailed analysis to the M3.3 class flare that occurred on August 17, 2013 (SOL2013-08-17T18:16). It presents a clear picture of mutual magnetic interaction initially from the photosphere to the corona via the abrupt rapid shearing motion of a small sunspot before the flare, and then suddenly from the corona back to the photosphere via the sudden retraction motion of the same sunspot during the flare impulsive phase. About 10 hours before the flare, a small sunspot in the active region NOAA 11818 started to move northeast along a magnetic polarity inversion line (PIL), creating a shearing motion that changed the quasi-static state of the active region. A filament right above the PIL was activated following the movement of the sunspot and then got partially erupted. The eruption eventually led to the M3.3 flare. The sunspot was then suddenly pulled back to the opposite direction upon the flare onset. During the backward motion, the Lorentz force underwent a simultaneous impulsive change both in magnitude and direction. Its directional change is found to be conformable with the retraction motion. The observation provides direct evidence for the role of the shearing motion of the sunspot in powering and triggering the flare. It especially confirms that the abrupt motion of a sunspot during a solar flare is the result of a back reaction caused by the reconfiguration of the coronal magnetic field.

preprint2022arXiv

Towards Better Dermoscopic Image Feature Representation Learning for Melanoma Classification

Deep learning-based melanoma classification with dermoscopic images has recently shown great potential in automatic early-stage melanoma diagnosis. However, limited by the significant data imbalance and obvious extraneous artifacts, i.e., the hair and ruler markings, discriminative feature extraction from dermoscopic images is very challenging. In this study, we seek to resolve these problems respectively towards better representation learning for lesion features. Specifically, a GAN-based data augmentation (GDA) strategy is adapted to generate synthetic melanoma-positive images, in conjunction with the proposed implicit hair denoising (IHD) strategy. Wherein the hair-related representations are implicitly disentangled via an auxiliary classifier network and reversely sent to the melanoma-feature extraction backbone for better melanoma-specific representation learning. Furthermore, to train the IHD module, the hair noises are additionally labeled on the ISIC2020 dataset, making it the first large-scale dermoscopic dataset with annotation of hair-like artifacts. Extensive experiments demonstrate the superiority of the proposed framework as well as the effectiveness of each component. The improved dataset publicly avaliable at https://github.com/kirtsy/DermoscopicDataset.

preprint2022arXiv

Trust It or Not: Confidence-Guided Automatic Radiology Report Generation

Medical imaging plays a pivotal role in diagnosis and treatment in clinical practice. Inspired by the significant progress in automatic image captioning, various deep learning (DL)-based methods have been proposed to generate radiology reports for medical images. Despite promising results, previous works overlook the uncertainties of their models and are thus unable to provide clinicians with the reliability/confidence of the generated radiology reports to assist their decision-making. In this paper, we propose a novel method to explicitly quantify both the visual uncertainty and the textual uncertainty for DL-based radiology report generation. Such multi-modal uncertainties can sufficiently capture the model confidence degree at both the report level and the sentence level, and thus they are further leveraged to weight the losses for more comprehensive model optimization. Experimental results have demonstrated that the proposed method for model uncertainty characterization and estimation can produce more reliable confidence scores for radiology report generation, and the modified loss function, which takes into account the uncertainties, leads to better model performance on two public radiology report datasets. In addition, the quality of the automatically generated reports was manually evaluated by human raters and the results also indicate that the proposed uncertainties can reflect the variance of clinical diagnosis.

preprint2022arXiv

Weighted Graph-Based Signal Temporal Logic Inference Using Neural Networks

Extracting spatial-temporal knowledge from data is useful in many applications. It is important that the obtained knowledge is human-interpretable and amenable to formal analysis. In this paper, we propose a method that trains neural networks to learn spatial-temporal properties in the form of weighted graph-based signal temporal logic (wGSTL) formulas. For learning wGSTL formulas, we introduce a flexible wGSTL formula structure in which the user&#39;s preference can be applied in the inferred wGSTL formulas. In the proposed framework, each neuron of the neural networks corresponds to a subformula in a flexible wGSTL formula structure. We initially train a neural network to learn the wGSTL operators and then train a second neural network to learn the parameters in a flexible wGSTL formula structure. We use a COVID-19 dataset and a rain prediction dataset to evaluate the performance of the proposed framework and algorithms. We compare the performance of the proposed framework with three baseline classification methods including K-nearest neighbors, decision trees, support vector machine, and artificial neural networks. The classification accuracy obtained by the proposed framework is comparable with the baseline classification methods.

preprint2021arXiv

Magneto-acoustic oscillations observed in a solar plage region

We gave an extensive study for the quasi-periodic perturbations on the time profiles of the line of sight (LOS) magnetic field in 10x10 sub-areas in a solar plage region (corresponds to a facula on the photosphere). The perturbations are found to be associated with enhancement of He I 10830 A absorption in a moss region, which is connected to loops with million-degree plasma. FFT analysis to the perturbations gives a kind of spectrum similar to that of Doppler velocity: a number of discrete periods around 5 minutes. The amplitudes of the magnetic perturbations are found to be proportional to magnetic field strength over these sub-areas. In addition, magnetic perturbations lag behind a quarter of cycle in phase with respect to the p-mode Doppler velocity. We show that the relationships can be well explained with an MHD solution for the magneto-acoustic oscillations in high-\b{eta} plasma. Observational analysis also shows that, for the two regions with the stronger and weaker magnetic field, the perturbations are always anti-phased. All findings show that the magnetic perturbations are actually magneto-acoustic oscillations on the solar surface, the photosphere, powered by p-mode oscillations. The findings may provide a new diagnostic tool for exploring the relationship between magneto-acoustic oscillations and the heating of solar upper atmosphere, as well as their role in helioseismology.

preprint2021arXiv

Temporal-Logic-Based Intermittent, Optimal, and Safe Continuous-Time Learning for Trajectory Tracking

In this paper, we develop safe reinforcement-learning-based controllers for systems tasked with accomplishing complex missions that can be expressed as linear temporal logic specifications, similar to those required by search-and-rescue missions. We decompose the original mission into a sequence of tracking sub-problems under safety constraints. We impose the safety conditions by utilizing barrier functions to map the constrained optimal tracking problem in the physical space to an unconstrained one in the transformed space. Furthermore, we develop policies that intermittently update the control signal to solve the tracking sub-problems with reduced burden in the communication and computation resources. Subsequently, an actor-critic algorithm is utilized to solve the underlying Hamilton-Jacobi-Bellman equations. Finally, we support our proposed framework with stability proofs and showcase its efficacy via simulation results.

preprint2020arXiv

Adaptive Teaching of Temporal Logic Formulas to Learners with Preferences

Machine teaching is an algorithmic framework for teaching a target hypothesis via a sequence of examples or demonstrations. We investigate machine teaching for temporal logic formulas -- a novel and expressive hypothesis class amenable to time-related task specifications. In the context of teaching temporal logic formulas, an exhaustive search even for a myopic solution takes exponential time (with respect to the time span of the task). We propose an efficient approach for teaching parametric linear temporal logic formulas. Concretely, we derive a necessary condition for the minimal time length of a demonstration to eliminate a set of hypotheses. Utilizing this condition, we propose a myopic teaching algorithm by solving a sequence of integer programming problems. We further show that, under two notions of teaching complexity, the proposed algorithm has near-optimal performance. The results strictly generalize the previous results on teaching preference-based version space learners. We evaluate our algorithm extensively under a variety of learner types (i.e., learners with different preference models) and interactive protocols (e.g., batched and adaptive). The results show that the proposed algorithms can efficiently teach a given target temporal logic formula under various settings, and that there are significant gains of teaching efficacy when the teacher adapts to the learner&#39;s current hypotheses or uses oracles.

preprint2020arXiv

Adversarial Uni- and Multi-modal Stream Networks for Multimodal Image Registration

Deformable image registration between Computed Tomography (CT) images and Magnetic Resonance (MR) imaging is essential for many image-guided therapies. In this paper, we propose a novel translation-based unsupervised deformable image registration method. Distinct from other translation-based methods that attempt to convert the multimodal problem (e.g., CT-to-MR) into a unimodal problem (e.g., MR-to-MR) via image-to-image translation, our method leverages the deformation fields estimated from both: (i) the translated MR image and (ii) the original CT image in a dual-stream fashion, and automatically learns how to fuse them to achieve better registration performance. The multimodal registration network can be effectively trained by computationally efficient similarity metrics without any ground-truth deformation. Our method has been evaluated on two clinical datasets and demonstrates promising results compared to state-of-the-art traditional and learning-based methods.

preprint2020arXiv

Do Public Datasets Assure Unbiased Comparisons for Registration Evaluation?

With the increasing availability of new image registration approaches, an unbiased evaluation is becoming more needed so that clinicians can choose the most suitable approaches for their applications. Current evaluations typically use landmarks in manually annotated datasets. As a result, the quality of annotations is crucial for unbiased comparisons. Even though most data providers claim to have quality control over their datasets, an objective third-party screening can be reassuring for intended users. In this study, we use the variogram to screen the manually annotated landmarks in two datasets used to benchmark registration in image-guided neurosurgeries. The variogram provides an intuitive 2D representation of the spatial characteristics of annotated landmarks. Using variograms, we identified potentially problematic cases and had them examined by experienced radiologists. We found that (1) a small number of annotations may have fiducial localization errors; (2) the landmark distribution for some cases is not ideal to offer fair comparisons. If unresolved, both findings could incur bias in registration evaluation.

preprint2020arXiv

Dynamic Sparse Training: Find Efficient Sparse Network From Scratch With Trainable Masked Layers

We present a novel network pruning algorithm called Dynamic Sparse Training that can jointly find the optimal network parameters and sparse network structure in a unified optimization process with trainable pruning thresholds. These thresholds can have fine-grained layer-wise adjustments dynamically via backpropagation. We demonstrate that our dynamic sparse training algorithm can easily train very sparse neural network models with little performance loss using the same number of training epochs as dense models. Dynamic Sparse Training achieves the state of the art performance compared with other sparse training algorithms on various network architectures. Additionally, we have several surprising observations that provide strong evidence for the effectiveness and efficiency of our algorithm. These observations reveal the underlying problems of traditional three-stage pruning algorithms and present the potential guidance provided by our algorithm to the design of more compact network architectures.

preprint2020arXiv

Policy Synthesis for Factored MDPs with Graph Temporal Logic Specifications

We study the synthesis of policies for multi-agent systems to implement spatial-temporal tasks. We formalize the problem as a factored Markov decision process subject to so-called graph temporal logic specifications. The transition function and the spatial-temporal task of each agent depend on the agent itself and its neighboring agents. The structure in the model and the specifications enable to develop a distributed algorithm that, given a factored Markov decision process and a graph temporal logic formula, decomposes the synthesis problem into a set of smaller synthesis problems, one for each agent. We prove that the algorithm runs in time linear in the total number of agents. The size of the synthesis problem for each agent is exponential only in the number of neighboring agents, which is typically much smaller than the number of agents. We demonstrate the algorithm in case studies on disease control and urban security. The numerical examples show that the algorithm can scale to hundreds of agents.

preprint2020arXiv

Policy Synthesis for Switched Linear Systems with Markov Decision Process Switching

We study the synthesis of mode switching protocols for a class of discrete-time switched linear systems in which the mode jumps are governed by Markov decision processes (MDPs). We call such systems MDP-JLS for brevity. Each state of the MDP corresponds to a mode in the switched system. The probabilistic state transitions in the MDP represent the mode transitions. We focus on finding a policy that selects the switching actions at each mode such that the switched system that follows these actions is guaranteed to be stable. Given a policy in the MDP, the considered MDP-JLS reduces to a Markov jump linear system (MJLS). {We consider both mean-square stability and stability with probability one. For mean-square stability, we leverage existing stability conditions for MJLSs and propose efficient semidefinite programming formulations to find a stabilizing policy in the MDP. For stability with probability one, we derive new sufficient conditions and compute a stabilizing policy using linear programming. We also extend the policy synthesis results to MDP-JLS with uncertain mode transition probabilities.

preprint2020arXiv

Robust Inference and Verification of Temporal Logic Classifier-in-the-loop Systems

Autonomous systems embedded with machine learning modules often rely on deep neural networks for classifying different objects of interest in the environment or different actions or strategies to take for the system. Due to the non-linearity and high-dimensionality of deep neural networks, the interpretability of the autonomous systems is compromised. Besides, the machine learning methods in autonomous systems are mostly data-intensive and lack commonsense knowledge and reasoning that are natural to humans. In this paper, we propose the framework of temporal logic classifier-in-the-loop systems. The temporal logic classifiers can output different actions to take for an autonomous system based on the environment, such that the behavior of the autonomous system can satisfy a given temporal logic specification. Our approach is robust and provably-correct, as we can prove that the behavior of the autonomous system can satisfy a given temporal logic specification in the presence of (bounded) disturbances.

preprint2020arXiv

Temporal Logic Inference for Hybrid System Observation with Spatial and Temporal Uncertainties

In this paper, we present a mechanism for building hybrid system observers to differentiate between specific positions of the hybrid system. The mechanism is designed through inferring metric temporal logic (MTL) formulae from simulated trajectories from the hybrid system. We first approximate the system behavior by simulating finitely many trajectories with timerobust tube segments around them. These time-robust tube segments account for both spatial and temporal uncertainties that exist in the hybrid system with initial state variations. The inferred MTL formulae classify different time-robust tube segments and thus can be used for classifying the hybrid system behaviors in a provably correct fashion. We implement our approach on a model of a smart building testbed to distinguish two cases of room occupancy.

preprint2019arXiv

Calculation of anisotropic transport coefficients for an ultrarelativistic Boltzmann gas in a magnetic field within a kinetic approach

According to the Kubo formulas we employ the (3+1)-d parton cascade, Boltzmann approach of multiparton scatterings (BAMPS), to calculate the anisotropic transport coefficients (shear viscosity and electric conductivity) for an ultrarelativistic Boltzmann gas in the presence of a magnetic field. The results are compared with those recently obtained by using the Grad&#39;s approximation. We find good agreements between both results, which confirms the general use of the derived Kubo formulas for calculating the anisotropic transport coefficients of quark-gluon plasma in a magnetic field.

preprint2019arXiv

Recurrent Two-Sided Loop Jets Caused by Magnetic Reconnection between Erupting Minifilaments and Nearby Large Filament

Using high spatial and temporal data from the New Vacuum Solar Telescope (NVST) and the Solar Dynamics Observatory (SDO), we present unambiguous observations of recurrent two-sided loop jets caused by magnetic reconnection between erupting minifilaments and nearby large filament. The observations demonstrate that three two-sided loop jets, which ejected along the large filament in opposite directions, had similar appearance and originated from the same region. We find that a minifilament erupted and drove the first jet. It reformed at the same neutral line later, and then underwent partial and total eruptions, drove the second and third jets, respectively. In the course of the jets, cool plasma was injected into the large filament. Furthermore, persistent magnetic flux cancelation occurred at the neutral line under the minifilament before its eruption and continued until the end of the observation. We infer that magnetic flux cancellation may account for building and then triggering the minifilament to erupt to produce the two-sided loop jets. This observation not only indicates that two-sided loop jets can be driven by minifilament eruptions, but also sheds new light on our understanding of the recurrent mechanism of two-sided loop jets.