Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
40works
0followers
29topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

40 published item(s)

preprint2026arXiv

Echo-α: Large Agentic Multimodal Reasoning Model for Ultrasound Interpretation

Ultrasound interpretation requires both precise lesion localization and holistic clinical reasoning, yet existing methods typically excel at only one of these capabilities: specialized detectors offer strong localization but limited reasoning, whereas multimodal large language models (MLLMs) provide flexible reasoning but weak grounding in specialized medical domains. We present Echo-α, an agentic multimodal reasoning model for ultrasound interpretation that unifies these strengths within an invoke-and-reason framework. Echo-α is trained to coordinate organ-specific detector outputs, integrate them with global visual context, and convert the resulting evidence into grounded diagnostic decisions beyond detector-only inference. This behavior is established through a nine-task supervised curriculum and then refined by sequential reinforcement learning under different reward trade-offs, yielding Echo-α-Grounding for lesion anchoring and Echo-α-Diagnosis for final diagnosis. On multi-center renal and breast ultrasound benchmarks, Echo-α outperforms competitive baselines on both grounding and diagnosis. In particular, on cross-center test sets, Echo-α-Grounding attains 56.73%/43.78% F1@0.5 and Echo- α-Diagnosis reaches 74.90%/49.20% overall accuracy on renal/breast ultrasound. These results suggest that agentic multimodal reasoning can turn specialized detectors into verifiable clinical evidence, offering a practical route toward ultrasound AI systems that are more accurate, interpretable, and transferable. The repository is at https://github.com/MiliLab/Echo-Alpha.

preprint2026arXiv

High-Performance KV$_3$Sb$_5$/WSe$_2$ van der Waals Photodetectors

Kagome metals AV$_3$Sb$_5$ (A = K, Rb, Cs) have recently emerged as a promising platform for exploring correlated and topological quantum states, yet their potential for optoelectronic applications remains largely unexplored. Here, we report high-performance photodetectors based on van der Waals KV$_3$Sb$_5$/WSe$_2$ heterojunctions. A high-quality Schottky interface readily forms between KV$_3$Sb$_5$ and WSe$_2$, enabling efficient separation and transport of photoinduced carriers. Under 520 nm illumination, the device achieves an open-circuit voltage up to 0.6 V, a responsivity of 809 mA/W, and a fast response time of 18.3 us. This work demonstrates the promising optoelectronic applications of Kagome metals and highlights the potential of KV$_3$Sb$_5$-based van der Waals heterostructures for high-performance photodetection.

preprint2026arXiv

LoopVLA: Learning Sufficiency in Recurrent Refinement for Vision-Language-Action Models

Current Vision-Language-Action (VLA) models typically treat the deepest representation of a vision-language backbone as universally optimal for action prediction. However, robotic manipulation is composed of many frequent closed-loop spatial adjustments, for which excessive abstraction may waste computation and weaken low-level geometric cues essential for precise control. Existing early-exit strategies attempt to reduce computation by stopping at predefined layers or applying heuristic rules such as action consistency, but they do not directly answer when a representation is actually sufficient for action. In this paper, we present LoopVLA, a recurrent VLA architecture that jointly learns representation refinement, action prediction, and sufficiency estimation. LoopVLA iteratively applies a shared Transformer block to refine multimodal tokens, and at each iteration produces both a candidate action and a sufficiency score that estimates whether further refinement is necessary. By sharing parameters across iterations, LoopVLA decouples refinement from absolute layer indices and grounds sufficiency estimation in the evolving representation itself. Since sufficiency has no direct supervision, we introduce a self-supervised distribution alignment objective, where intermediate confidence scores are trained to match the relative action quality across refinement steps, thereby linking sufficiency learning to policy optimization signals. Experiments on LIBERO, LIBERO-Plus, and VLA-Arena show that LoopVLA pushes the efficiency-performance frontier of VLA policies, reducing parameters by 45% and improving inference throughput by up to 1.7 times while matching or outperforming strong baselines in task success.

preprint2026arXiv

MFAI: A Scalable Bayesian Matrix Factorization Approach to Leveraging Auxiliary Information

In various practical situations, matrix factorization methods suffer from poor data quality, such as high data sparsity and low signal-to-noise ratio (SNR). Here, we consider a matrix factorization problem by utilizing auxiliary information, which is massively available in real-world applications, to overcome the challenges caused by poor data quality. Unlike existing methods that mainly rely on simple linear models to combine auxiliary information with the main data matrix, we propose to integrate gradient boosted trees in the probabilistic matrix factorization framework to effectively leverage auxiliary information (MFAI). Thus, MFAI naturally inherits several salient features of gradient boosted trees, such as the capability of flexibly modeling nonlinear relationships and robustness to irrelevant features and missing values in auxiliary information. The parameters in MFAI can be automatically determined under the empirical Bayes framework, making it adaptive to the utilization of auxiliary information and immune to overfitting. Moreover, MFAI is computationally efficient and scalable to large datasets by exploiting variational inference. We demonstrate the advantages of MFAI through comprehensive numerical results from simulation studies and real data analyses. Our approach is implemented in the R package mfair available at https://github.com/YangLabHKUST/mfair.

preprint2026arXiv

Pairing-free Group-level Knowledge Distillation for Robust Gastrointestinal Lesion Classification in White-Light Endoscopy

White-Light Imaging (WLI) is the standard for endoscopic cancer screening, but Narrow-Band Imaging (NBI) offers superior diagnostic details. A key challenge is transferring knowledge from NBI to enhance WLI-only models, yet existing methods are critically hampered by their reliance on paired NBI-WLI images of the same lesion, a costly and often impractical requirement that leaves vast amounts of clinical data untapped. In this paper, we break this paradigm by introducing PaGKD, a novel Pairing-free Group-level Knowledge Distillation framework that that enables effective cross-modal learning using unpaired WLI and NBI data. Instead of forcing alignment between individual, often semantically mismatched image instances, PaGKD operates at the group level to distill more complete and compatible knowledge across modalities. Central to PaGKD are two complementary modules: (1) Group-level Prototype Distillation (GKD-Pro) distills compact group representations by extracting modality-invariant semantic prototypes via shared lesion-aware queries; (2) Group-level Dense Distillation (GKD-Den) performs dense cross-modal alignment by guiding group-aware attention with activation-derived relation maps. Together, these modules enforce global semantic consistency and local structural coherence without requiring image-level correspondence. Extensive experiments on four clinical datasets demonstrate that PaGKD consistently and significantly outperforms state-of-the-art methods, achieving relative AUC improvements of 3.3%, 1.1%, 2.8%, and 3.2%, respectively, establishing a new direction for cross-modal learning from unpaired data.

preprint2025arXiv

Evidence of anisotropic three-dimensional weak-localization in TiSe$_{2}$ nanoflakes

TiSe$_2$ is a typical transition-metal dichalcogenide known for its charge-density wave order. In this study, we report the observation of an unusual anisotropic negative magnetoresistance in exfoliated TiSe$_2$ nanoflakes at low temperatures. Unlike the negative magnetoresistance reported in most other transition-metal dichalcogenides, our results cannot be explained by either the conventional two-dimensional weak localization effect or the Kondo effect. A comprehensive analysis of the data suggests that the observed anisotropic negative magnetoresistance in TiSe$_2$ flakes is most likely caused by the three-dimensional weak localization effect. Our findings contribute to a deeper understanding of the phase-coherent transport processes in TiSe$_2$.

preprint2024arXiv

Nonvolatile optical control of interlayer stacking order in 1T-TaS2

Nonvolatile optical manipulation of material properties on demand is a highly sought-after feature in the advancement of future optoelectronic applications. While the discovery of such metastable transition in various materials holds good promise for achieving this goal, their practical implementation is still in the nascent stage. Here, we unravel the nature of the ultrafast laser-induced hidden state in 1T-TaS2 by systematically characterizing the electronic structure evolution throughout the reversible transition cycle. We identify it as a mixed-stacking state involving two similarly low-energy interlayer orders, which is manifested as the charge density wave phase disruption. Furthermore, our comparative experiments utilizing the single-pulse writing, pulse-train erasing and pulse-pair control explicitly reveal the distinct mechanism of the bidirectional transformations -- the ultrafast formation of the hidden state is initiated by a coherent phonon which triggers a competition of interlayer stacking orders, while its recovery to the initial state is governed by the progressive domain coarsening. Our work highlights the deterministic role of the competing interlayer orders in the nonvolatile phase transition in the layered material 1T-TaS2, and promises the coherent control of the phase transition and switching speed. More importantly, these results establish all-optical engineering of stacking orders in low-dimensional materials as a viable strategy for achieving desirable nonvolatile electronic devices.

preprint2024arXiv

Superconductivity at Pd/Bi$_2$Se$_3$ Interfaces Due to Self-Formed PdBiSe Interlayers

Understanding the physical and chemical processes at the interface of metals and topological insulators is crucial for developing the next generation of topological quantum devices. Here we report the discovery of robust superconductivity in Pd/Bi$_2$Se$_3$ bilayers fabricated by sputtering Pd on the surface of Bi$_2$Se$_3$. Through transmission electron microscopy measurements, we identify that the observed interfacial superconductivity originates from the diffusion of Pd into Bi$_2$Se$_3$. In the diffusion region, Pd chemically reacts with Bi$_2$Se$_3$ and forms a layer of PdBiSe, a known su-perconductor with a bulk transition temperature of 1.5 K. Our work provides a method for in-troducing superconductivity into Bi$_2$Se$_3$, laying the foundation for developing sophisticated Bi$_2$Se$_3$-based topological devices.

preprint2023arXiv

Pressure-Induced Superconductivity in Topological Heterostructure (PbSe)5(Bi2Se3)6

Recently, the natural heterostructure of (PbSe)5(Bi2Se3)6 has been theoretically predicted and experimentally confirmed as a topological insulator. In this work, we induce superconductivity in (PbSe)5(Bi2Se3)6 by implementing high pressure. As increasing pressure up to 10 GPa, superconductivity with Tc ~ 4.6 K suddenly appears, followed by an abrupt decrease. Remarkably, upon further compression above 30 GPa, a new superconducting state arises, where pressure raises the Tc to an unsaturated 6.0 K within the limit of our research. Combining XRD and Raman spectroscopies, we suggest that the emergence of two distinct superconducting states occurs concurrently with the pressure-induced structural transition in this topological heterostructure (PbSe)5(Bi2Se3)6.

preprint2022arXiv

A deep learning-based model reduction (DeePMR) method for simplifying chemical kinetics

A deep learning-based model reduction (DeePMR) method for simplifying chemical kinetics is proposed and validated using high-temperature auto-ignitions, perfectly stirred reactors (PSR), and one-dimensional freely propagating flames of n-heptane/air mixtures. The mechanism reduction is modeled as an optimization problem on Boolean space, where a Boolean vector, each entry corresponding to a species, represents a reduced mechanism. The optimization goal is to minimize the reduced mechanism size given the error tolerance of a group of pre-selected benchmark quantities. The key idea of the DeePMR is to employ a deep neural network (DNN) to formulate the objective function in the optimization problem. In order to explore high dimensional Boolean space efficiently, an iterative DNN-assisted data sampling and DNN training procedure are implemented. The results show that DNN-assistance improves sampling efficiency significantly, selecting only $10^5$ samples out of $10^{34}$ possible samples for DNN to achieve sufficient accuracy. The results demonstrate the capability of the DNN to recognize key species and reasonably predict reduced mechanism performance. The well-trained DNN guarantees the optimal reduced mechanism by solving an inverse optimization problem. By comparing ignition delay times, laminar flame speeds, temperatures in PSRs, the resulting skeletal mechanism has fewer species (45 species) but the same level of accuracy as the skeletal mechanism (56 species) obtained by the Path Flux Analysis (PFA) method. In addition, the skeletal mechanism can be further reduced to 28 species if only considering atmospheric, near-stoichiometric conditions (equivalence ratio between 0.6 and 1.2). The DeePMR provides an innovative way to perform model reduction and demonstrates the great potential of data-driven methods in the combustion area.

preprint2022arXiv

Accurate Scoliosis Vertebral Landmark Localization on X-ray Images via Shape-constrained Multi-stage Cascaded CNNs

Vertebral landmark localization is a crucial step for variant spine-related clinical applications, which requires detecting the corner points of 17 vertebrae. However, the neighbor landmarks often disturb each other for the homogeneous appearance of vertebrae, which makes vertebral landmark localization extremely difficult. In this paper, we propose multi-stage cascaded convolutional neural networks (CNNs) to split the single task into two sequential steps, i.e., center point localization to roughly locate 17 center points of vertebrae, and corner point localization to find 4 corner points for each vertebra without distracted by others. Landmarks in each step are located gradually from a set of initialized points by regressing offsets via cascaded CNNs. Principal Component Analysis (PCA) is employed to preserve a shape constraint in offset regression to resist the mutual attraction of vertebrae. We evaluate our method on the AASCE dataset that consists of 609 tight spinal anterior-posterior X-ray images and each image contains 17 vertebrae composed of the thoracic and lumbar spine for spinal shape characterization. Experimental results demonstrate our superior performance of vertebral landmark localization over other state-of-the-arts with the relative error decreasing from 3.2e-3 to 7.2e-4.

preprint2022arXiv

An Upper Limit of Decaying Rate with Respect to Frequency in Deep Neural Network

Deep neural network (DNN) usually learns the target function from low to high frequency, which is called frequency principle or spectral bias. This frequency principle sheds light on a high-frequency curse of DNNs -- difficult to learn high-frequency information. Inspired by the frequency principle, a series of works are devoted to develop algorithms for overcoming the high-frequency curse. A natural question arises: what is the upper limit of the decaying rate w.r.t. frequency when one trains a DNN? In this work, our theory, confirmed by numerical experiments, suggests that there is a critical decaying rate w.r.t. frequency in DNN training. Below the upper limit of the decaying rate, the DNN interpolates the training data by a function with a certain regularity. However, above the upper limit, the DNN interpolates the training data by a trivial function, i.e., a function is only non-zero at training data points. Our results indicate a better way to overcome the high-frequency curse is to design a proper pre-condition approach to shift high-frequency information to low-frequency one, which coincides with several previous developed algorithms for fast learning high-frequency information. More importantly, this work rigorously proves that the high-frequency curse is an intrinsic difficulty of DNNs.

preprint2022arXiv

Carrier Injection and Manipulation of Charge-Density Wave in Kagome Superconductor CsV3Sb5

Kagome metals AV3Sb5 (A = K, Rb, and Cs) exhibit a unique superconducting ground state coexisting with charge-density wave (CDW), whereas how these characteristics are affected by carrier doping remains unexplored because of the lack of an efficient carrier-doping method. Here we report successful electron doping to CsV3Sb5 by Cs dosing, as visualized by angle-resolved photoemission spectroscopy. We found that the electron doping with Cs dosing proceeds in an orbital-selective way, as characterized by a marked increase in electron filling of the Sb 5pz and V 3dxz/yz bands as opposed to relatively insensitive nature of the V 3dxy/x2-y2 bands. By monitoring the temperature evolution of the CDW gap around the M point, we found that the CDW can be completely killed by Cs dosing while keeping the saddle point with the V 3dxy/x2-y2 character almost pinned at the Fermi level. The present result suggests a crucial role of multi-orbital effect to the occurrence of CDW, and provides an important step toward manipulating the CDW and superconductivity in AV3Sb5.

preprint2022arXiv

Joint Progressive and Coarse-to-fine Registration of Brain MRI via Deformation Field Integration and Non-Rigid Feature Fusion

Registration of brain MRI images requires to solve a deformation field, which is extremely difficult in aligning intricate brain tissues, e.g., subcortical nuclei, etc. Existing efforts resort to decomposing the target deformation field into intermediate sub-fields with either tiny motions, i.e., progressive registration stage by stage, or lower resolutions, i.e., coarse-to-fine estimation of the full-size deformation field. In this paper, we argue that those efforts are not mutually exclusive, and propose a unified framework for robust brain MRI registration in both progressive and coarse-to-fine manners simultaneously. Specifically, building on a dual-encoder U-Net, the fixed-moving MRI pair is encoded and decoded into multi-scale deformation sub-fields from coarse to fine. Each decoding block contains two proposed novel modules: i) in Deformation Field Integration (DFI), a single integrated sub-field is calculated, warping by which is equivalent to warping progressively by sub-fields from all previous decoding blocks, and ii) in Non-rigid Feature Fusion (NFF), features of the fixed-moving pair are aligned by DFI-integrated sub-field, and then fused to predict a finer sub-field. Leveraging both DFI and NFF, the target deformation field is factorized into multi-scale sub-fields, where the coarser fields alleviate the estimate of a finer one and the finer field learns to make up those misalignments insolvable by previous coarser ones. The extensive and comprehensive experimental results on both private and public datasets demonstrate a superior registration performance of brain MRI images over progressive registration only and coarse-to-fine estimation only, with an increase by at most 8% in the average Dice.

preprint2022arXiv

Linear isometric invariants of bounded domains

We introduce two new conditions for bounded domains, namely $A^p$-completeness and boundary blow down type, and show that, for two bounded domains $D_1$ and $D_2$ that are $A^p$-complete and not of boundary blow down type, if there exists a linear isometry from $A^p(D_1)$ to $A^{p}(D_2)$ for some real number $p>0$ with $p\neq $ even integers, then $D_1$ and $D_2$ must be holomorphically equivalent, where for a domain $D$, $A^p(D)$ denotes the space of $L^p$ holomorphic functions on $D$.

preprint2022arXiv

TensorFHE: Achieving Practical Computation on Encrypted Data Using GPGPU

In this paper, we propose TensorFHE, an FHE acceleration solution based on GPGPU for real applications on encrypted data. TensorFHE utilizes Tensor Core Units (TCUs) to boost the computation of Number Theoretic Transform (NTT), which is the part of FHE with highest time-cost. Moreover, TensorFHE focuses on performing as many FHE operations as possible in a certain time period rather than reducing the latency of one operation. Based on such an idea, TensorFHE introduces operation-level batching to fully utilize the data parallelism in GPGPU. We experimentally prove that it is possible to achieve comparable performance with GPGPU as with state-of-the-art ASIC accelerators. TensorFHE performs 913 KOPS and 88 KOPS for NTT and HMULT (key FHE kernels) within NVIDIA A100 GPGPU, which is 2.61x faster than state-of-the-art FHE implementation on GPGPU; Moreover, TensorFHE provides comparable performance to the ASIC FHE accelerators, which makes it even 2.9x faster than the F1+ with a specific workload. Such a pure software acceleration based on commercial hardware with high performance can open up usage of state-of-the-art FHE algorithms for a broad set of applications in real systems.

preprint2022arXiv

Three-dimensional energy gap and origin of charge-density wave in kagome superconductor KV3Sb5

Kagome lattices offer a fertile ground to explore exotic quantum phenomena associated with electron correlation and band topology. The recent discovery of superconductivity coexisting with charge-density wave (CDW) in the kagome metals KV3Sb5, RbV3Sb5, and CsV3Sb5 suggests an intriguing entanglement of electronic order and superconductivity. However, the microscopic origin of CDW, a key to understanding the superconducting mechanism and its possible topological nature, remains elusive. Here, we report angle-resolved photoemission spectroscopy of KV3Sb5 and demonstrate a substantial reconstruction of Fermi surface in the CDW state that accompanies the formation of small three-dimensional pockets. The CDW gap exhibits a periodicity of undistorted Brillouin zone along the out-of-plane wave vector, signifying a dominant role of the in-plane inter-saddle-point scattering to the mechanism of CDW. The characteristics of experimental band dispersion can be captured by first-principles calculations with the inverse star-of-David structural distortion. The present result indicates a direct link between the low-energy excitations and CDW, and puts constraints on the microscopic theory of superconductivity in alkali-metal kagome lattices.

preprint2022arXiv

Tuning the competition between superconductivity and charge order in kagome superconductor Cs(V1-xNbx)3Sb5

The recently discovered coexistence of superconductivity and charge density wave order in the kagome systems AV3Sb5 (A = K, Rb, Cs) has stimulated enormous interest. According to theory, a vanadium-based kagome system may host a flat band, nontrivial linear dispersive Dirac surface states and electronic correlation. Despite intensive investigations, it remains controversial about the origin of the charge density wave (CDW) order, how does the superconductivity relate to the CDW, and whether the anomalous Hall effect (AHE) arises primarily from the kagome lattice or the CDW order. We report an extensive investigation on Cs(V1-xNbx)3Sb5 samples with systematic Nb doping. Our results show that the Nb doping induces apparent suppression of CDW order and promotes superconductivity; meanwhile, the AHE and magnetoresistance (MR) will be significantly weakened together with the CDW order. Combining with our density functional calculations, we interpret these effects by an antiphase shift of the Fermi energy with respect to the saddle points near M and the Fermi surface centered around Γ. It is found that the former depletes the filled states for the CDW instability and worsens the nesting condition for CDW order; while the latter lifts the Fermi level upward and enlarges the Fermi surface surrounding the Γ point, and thus promotes superconductivity. Our results uncover a delicate but unusual competition between the CDW order and superconductivity.

preprint2021arXiv

Electron-phonon coupling in the charge density wave state of CsV$_3$Sb$_5$

Metallic materials with kagome lattice structure are interesting because their electronic structures can host flat bands, Dirac cones, and van Hove singularities, resulting in strong electron correlations, nontrivial band topology, charge density wave (CDW), and unconventional superconductivity. Recently, kagome lattice compounds AV$_3$Sb$_5$ (A = K, Rb, Cs) are found to have intertwined CDW order and superconductivity. The origin of the CDW has been suggested to be purely electronic, arising from Fermi-surface instabilities of van Hove singularity (saddle point) near the M points. Here we use neutron scattering experiments to demonstrate that the CDW order in CsV$_3$Sb$_5$ is associated with static lattice distortion and a sudden hardening of the B3u longitudinal optical phonon mode, thus establishing that electron-phonon coupling must also play an important role in the CDW order of AV$_3$Sb$_5$.

preprint2021arXiv

No observation of chiral flux current in the topological kagome metal CsV$_{3}$Sb$_{5}$

Compounds with kagome lattice usually host many exotic quantum states, including the quantum spin liquid, non-trivial topological Dirac bands and a strongly renormalized flat band, etc. Recently an interesting vanadium based kagome family $A$V$_{3}$Sb$_{5}$ ($A$ = K, Rb, or Cs) was discovered, and these materials exhibit multiple interesting properties, including unconventional saddle-point driving charge density wave (CDW) state, superconductivity, etc. Furthermore, some experiments show anomalous Hall effect which inspires that there might be some chiral flux current states. Here we report scanning tunneling measurements by using spin-polarized tips. Although we have observed clearly the $2a_0\times2a_0$ CDW and $4a_0$ stripe orders, the well-designed experiments with refined spin-polarized tips do not reveal any trace of the chiral flux current phase in CsV$_3$Sb$_5$ within the limits of experimental accuracy. No observation of the local magnetic moment in our experiments may put an upper bound constraint on the magnitude of magnetic moments induced by the possible chiral loop current which has a time-reversal symmetry breaking along $c$-axis in CsV$_{3}$Sb$_{5}$.

preprint2020arXiv

Chat as Expected: Learning to Manipulate Black-box Neural Dialogue Models

Recently, neural network based dialogue systems have become ubiquitous in our increasingly digitalized society. However, due to their inherent opaqueness, some recently raised concerns about using neural models are starting to be taken seriously. In fact, intentional or unintentional behaviors could lead to a dialogue system to generate inappropriate responses. Thus, in this paper, we investigate whether we can learn to craft input sentences that result in a black-box neural dialogue model being manipulated into having its outputs contain target words or match target sentences. We propose a reinforcement learning based model that can generate such desired inputs automatically. Extensive experiments on a popular well-trained state-of-the-art neural dialogue model show that our method can successfully seek out desired inputs that lead to the target outputs in a considerable portion of cases. Consequently, our work reveals the potential of neural dialogue models to be manipulated, which inspires and opens the door towards developing strategies to defend them.

preprint2020arXiv

Crystal structure and distortion of superconducting Cu$_x$Bi$_2$Se$_{3}$

The crystal structure of the candidate topological superconductor Cu$_x$Bi$_2$Se$_3$ was studied by single-crystal neutron diffraction using samples obtained by inserting the Cu dopant electrochemically. Neither structural refinements nor calculated scattering-density maps find a significant occupation of Cu at the intercalation site between the quintuple layers of Bi$_2$Se$_3$. Following Bragg reflection intensities as function of temperature, there is no signature of a structural phase transition between 295 and 2 K. However, the analysis of large sets of Bragg reflections indicates a small structural distortion breaking the rotational axis due to small displacements of the Bi ions.

preprint2020arXiv

Deriving AC OPF Solutions via Proximal Policy Optimization for Secure and Economic Grid Operation

Optimal power flow (OPF) is a very fundamental but vital optimization problem in the power system, which aims at solving a specific objective function (ex.: generator costs) while maintaining the system in the stable and safe operations. In this paper, we adopted the start-of-the-art artificial intelligence (AI) techniques to train an agent aiming at solving the AC OPF problem, where the nonlinear power balance equations are considered. The modified IEEE-14 bus system were utilized to validate the proposed approach. The testing results showed a great potential of adopting AI techniques in the power system operations.

preprint2020arXiv

Distributed Frequency Emergency Control with Coordinated Edge Intelligence

Developing effective strategies to rapidly support grid frequency while minimizing loss in case of severe contingencies is an important requirement in power systems. While distributed responsive load demands are commonly adopted for frequency regulation, it is difficult to achieve both rapid response and global accuracy in a practical and cost-effective manner. In this paper, the cyber-physical design of an Internet-of-Things (IoT) enabled system, called Grid Sense, is presented. Grid Sense utilizes a large number of distributed appliances for frequency emergency support. It features a local power loss $ΔP$ estimation approach for frequency emergency control based on coordinated edge intelligence. The specifically designed smart outlets of Grid Sense detect the frequency disturbance event locally using the parameters sent from the control center to estimate active power loss in the system and to make rapid and accurate switching decisions soon after a severe contingency. Based on a modified IEEE 24-bus system, numerical simulations and hardware experiments are conducted to demonstrate the frequency support performance of Grid Sense in the aspects of accuracy and speed. It is shown that Grid Sense equipped with its local $ΔP$-estimation frequency control approach can accurately and rapidly prevent the drop of frequency after a major power loss.

preprint2020arXiv

Evaluating Load Models and Their Impacts on Power Transfer Limits

Power transfer limits or transfer capability (TC) directly relate to the system operation and control as well as electricity markets. As a consequence, their assessment has to comply with static constraints, such as line thermal limits, and dynamic constraints, such as transient stability limits, voltage stability limits and small-signal stability limits. Since the load dynamics have substantial impacts on power system transient stability, load models are one critical factor that affects the power transfer limits. Currently, multiple load models have been proposed and adopted in the industry and academia, including the ZIP model, ZIP plus induction motor composite model (ZIP + IM) and WECC composite load model (WECC CLM). Each of them has its unique advantages, but their impacts on the power transfer limits are not yet adequately addressed. One existing challenge is fitting the high-order nonlinear models such as WECC CLM. In this study, we innovatively adopt double deep Q-learning Network (DDQN) agent as a general load modeling tool in the dynamic assessment procedure and fit the same transient field measurements into different load models. A comprehensive evaluation is then conducted to quantify the load models' impacts on the power transfer limits. The simulation environment is the IEEE-39 bus system constructed in Transient Security Assessment Tool (TSAT).

preprint2020arXiv

Extended Prony Analysis on Power System Oscillation Under a Near-Resonance Condition

Power system oscillations under a large disturbance often exhibit distorted waveforms as captured by increasingly deployed phasor measurement units. One cause is the occurrence of a near-resonance condition among several dominant modes that are influenced by nonlinear transient dynamics of generators. This paper proposes an Extended Prony Analysis method for measurement-based modal analysis. Based on the normal form theory, it compares analyses on transient and post-transient waveforms to distinguish a resonance mode caused by a near-resonance condition from natural modes so that the method can give more accurate modal properties than a traditional Prony Analysis method, especially for large disturbances. The new method is first demonstrated in detail on Kundur's two-area system and then tested on the IEEE 39-bus system to show its performance under a near-resonance condition.

preprint2020arXiv

Graph Computing based Distributed State Estimation with PMUs

Power system state estimation plays a fundamental and critical role in the energy management system (EMS). To achieve a high performance and accurate system states estimation, a graph computing based distributed state estimation approach is proposed in this paper. Firstly, a power system network is divided into multiple areas. Reference buses are selected with PMUs being installed at these buses for each area. Then, the system network is converted into multiple independent areas. In this way, the power system state estimation could be conducted in parallel for each area and the estimated system states are obtained without compromise of accuracy. IEEE 118-bus system and MP 10790-bus system are employed to verify the results accuracy and present the promising computation performance.

preprint2020arXiv

Implications of Stahl's Theorems to Holomorphic Embedding Pt. 1: Theoretical Convergence

What has become known as Stahl's Theorem in power engineering circles has been used to justify a convergence guarantee of the Holormorphic Embedding Method (HEM) as it applies to the power flow (PF) problem. In this two-part paper, we examine in more detail the implications of Stahl's theorems to both theoretcial and numerical convergence for a wider range of problems to which these theorems are now being applied. In Pt. 1, we introduce the theorem using the necessary mathematical parlance and then translate the language to show its implications to convergence of nonlinear problems in general and the PF problem specifically. We show that among other possibilities the existence of the Chebotarev points, which are embedding specific, are a possible theoretical impediment to convergence. Numerical impediments to convergences are discussed in the companion paper.

preprint2020arXiv

Implications of Stahl's Theorems to Holomorphic Embedding Pt. 2: Numerical Convergence

What has become known as Stahl's Theorem in power-engineering circles has been used to justify a convergence guarantee of the Holomorphic Embedding Method (HEM) as it applies to the power-flow problem. In this, the second part of a two-part paper, we examine implications to numerical convergence of HEM and the numerical properties of a Padé approximant algorithm. We show that even if the convergence domain is identical to the function's domain, numerical convergence of the sequence of Padé approximants computed with finite precision is not guaranteed. We also show that the study of convergence properties of the Padé approximant is the study of the location of branch-points of the function, which dictate branch-cut topology and capacity and, therefore, convergence rate. We show how poorly chosen embeddings can prevent numerical convergence.

preprint2020arXiv

Multi-Scale One-Class Recurrent Neural Networks for Discrete Event Sequence Anomaly Detection

Discrete event sequences are ubiquitous, such as an ordered event series of process interactions in Information and Communication Technology systems. Recent years have witnessed increasing efforts in detecting anomalies with discrete-event sequences. However, it still remains an extremely difficult task due to several intrinsic challenges including data imbalance issues, the discrete property of the events, and sequential nature of the data. To address these challenges, in this paper, we propose OC4Seq, a multi-scale one-class recurrent neural network for detecting anomalies in discrete event sequences. Specifically, OC4Seq integrates the anomaly detection objective with recurrent neural networks (RNNs) to embed the discrete event sequences into latent spaces, where anomalies can be easily detected. In addition, given that an anomalous sequence could be caused by either individual events, subsequences of events, or the whole sequence, we design a multi-scale RNN framework to capture different levels of sequential patterns simultaneously. Experimental results on three benchmark datasets show that OC4Seq consistently outperforms various representative baselines by a large margin. Moreover, through both quantitative and qualitative analysis, the importance of capturing multi-scale sequential patterns for event anomaly detection is verified.

preprint2020arXiv

Observation of inverted band structure in topological Dirac-semimetal candidate CaAuAs

We have performed high-resolution angle-resolved photoemission spectroscopy of ternary pnictide CaAuAs which is predicted to be a three-dimensional topological Dirac semimetal (TDS). By accurately determining the bulk-band structure, we have revealed the coexistence of three-dimensional and quasi-two-dimensional Fermi surfaces with dominant hole carriers. The band structure around the Brillouin-zone center is characterized by an energy overlap between hole and electron pockets, in excellent agreement with first-principles band-structure calculations. This indicates the occurrence of bulk-band inversion, supporting the TDS state in CaAuAs. Because of the high tunability in the chemical composition besides the TDS nature, CaAuAs provides a precious opportunity for investigating the quantum phase transition from TDS to other exotic topological phases.

preprint2020arXiv

Online Low Frequency Oscillation Detection and Analysis System with an Ensemble Filter

The widespread deployment of phasor measurement unit (PMU) overpower systems makes it possible to monitor and analyze grid dynamics in real-time. Low-frequency oscillation is harmful to power system equipment and operation, and in the worst-case scenario may lead to cascading failures. Therefore, it is critical to detect and identify them as soon as they appear. This paper presents an online low-frequency oscillation detection and analysis (LFODA) system, which has the merit of significantly reducing the chance of false alarm via a voting schema and a time-serial filter. A novel algorithm based on density-based spatial clustering of applications with noise (DBSCAN) is proposed to classify oscillation modes as well as to group their corresponding buses/monitoring sites. Performance of the LFODA system is evaluated through experiments using both simulated and real-world PMU data.

preprint2020arXiv

Positivity of holomorphic vector bundles in terms of $L^p$-conditions of $\bar\partial$

We study the positivity properties of Hermitian (or even Finsler) holomorphic vector bundles in terms of $L^p$-estimates of $\bar\partial$ and $L^p$-extensions of holomorphic objects. To this end, we introduce four conditions, called the optimal $L^p$-estimate condition, the multiple coarse $L^p$-estimate condition, the optimal $L^p$-extension condition, and the multiple coarse $L^p$-extension condition, for a Hermitian (or Finsler) vector bundle $(E,h)$. The main result of the present paper is to give a characterization of the Nakano positivity of $(E,h)$ via the optimal $L^2$-estimate condition. We also show that $(E,h)$ is Griffiths positive if it satisfies the multiple coarse $L^p$-estimate condition for some $p>1$, the optimal $L^p$-extension condition, or the multiple coarse $L^p$-extension condition for some $p>0$. These results can be roughly viewed as converses of Hörmander's $L^2$-estimate of $\bar\partial$ and Ohsawa-Takegoshi type extension theorems. As an application of the main result, we get a totally different method to Nakano positivity of direct image sheaves of twisted relative canonical bundles associated to holomorphic families of complex manifolds.

preprint2020arXiv

Siamese Neural Networks for Class Activity Detection

Classroom activity detection (CAD) aims at accurately recognizing speaker roles (either teacher or student) in classrooms. A CAD solution helps teachers get instant feedback on their pedagogical instructions. However, CAD is very challenging because (1) classroom conversations contain many conversational turn-taking overlaps between teachers and students; (2) the CAD model needs to be generalized well enough for different teachers and students; and (3) classroom recordings may be very noisy and low-quality. In this work, we address the above challenges by building a Siamese neural framework to automatically identify teacher and student utterances from classroom recordings. The proposed model is evaluated on real-world educational datasets. The results demonstrate that (1) our approach is superior on the prediction tasks for both online and offline classroom environments; and (2) our framework exhibits robustness and generalization ability on new teachers (i.e., teachers never appear in training data).

preprint2020arXiv

Signature of band inversion in the antiferromagnetic phase of axion insulator candidate EuIn2As2

We have performed angle-resolved photoemission spectroscopy on EuIn2As2 which is predicted to be an axion insulator in the antiferromagnetic state. By utilizing soft-x-ray and vacuum-ultraviolet photons, we revealed a three-dimensional hole pocket centered at the Gamma point of bulk Brillouin zone together with a heavily hole-doped surface state in the paramagnetic phase. Upon entering the antiferromagnetic phase, the band structure exhibits a marked reconstruction characterized by the emergence of a "M"-shaped bulk band near the Fermi level. The qualitative agreement with first-principles band-structure calculations suggests the occurrence of bulk-band inversion at the Gamma point in the antiferromagnetic phase. We suggest that EuIn2As2 provides a good opportunity to study the exotic quantum phases associated with possible axion-insulator phase.

preprint2020arXiv

Two-stage WECC Composite Load Modeling: A Double Deep Q-Learning Networks Approach

With the increasing complexity of modern power systems, conventional dynamic load modeling with ZIP and induction motors (ZIP + IM) is no longer adequate to address the current load characteristic transitions. In recent years, the WECC composite load model (WECC CLM) has shown to effectively capture the dynamic load responses over traditional load models in various stability studies and contingency analyses. However, a detailed WECC CLM model typically has a high degree of complexity, with over one hundred parameters, and no systematic approach to identifying and calibrating these parameters. Enabled by the wide deployment of PMUs and advanced deep learning algorithms, proposed here is a double deep Q-learning network (DDQN)-based, two-stage load modeling framework for the WECC CLM. This two-stage method decomposes the complicated WECC CLM for more efficient identification and does not require explicit model details. In the first stage, the DDQN agent determines an accurate load composition. In the second stage, the parameters of the WECC CLM are selected from a group of Monte-Carlo simulations. The set of selected load parameters is expected to best approximate the true transient responses. The proposed framework is verified using an IEEE 39-bus test system on commercial simulation platforms.

preprint2019arXiv

Generalized Anderson's theorem for superconductors derived from topological insulators

A well-known result in unconventional superconductivity is the fragility of nodal superconductors against nonmagnetic impurities. Despite this common wisdom, Bi$_2$Se$_3$-based topological superconductors have recently displayed unusual robustness against disorder. Here we provide a theoretical framework which naturally explains what protects Cooper pairs from strong scattering in complex superconductors. Our analysis is based on the concept of superconducting fitness and generalizes the famous Anderson's theorem into superconductors having multiple internal degrees of freedom. For concreteness, we report on the extreme example of the Cu$_x$(PbSe)$_5$(Bi$_2$Se$_3$)$_6$ superconductor, where thermal conductivity measurements down to 50 mK not only give unambiguous evidence for the existence of nodes, but also reveal that the energy scale corresponding to the scattering rate is orders of magnitude larger than the superconducting energy gap. This provides a most spectacular case of the generalized Anderson's theorem protecting a nodal superconductor.

preprint2019arXiv

Uniaxial-strain Control of Nematic Superconductivity in Sr$_{x}$Bi$_2$Se$_3$

Nematic states are characterized by rotational symmetry breaking without translational ordering. Recently, nematic superconductivity, in which the superconducting gap spontaneously lifts the rotational symmetry of the lattice, has been discovered. However the pairing mechanism and the mechanism determining the nematic orientation remain unresolved. A first step is to demonstrate control of the nematicity, through application of an external symmetry-breaking field, to determine the sign and strength of coupling to the lattice. Here, we report for the first time control of the nematic orientation of the superconductivity of Sr$_x$Bi$_2$Se$_3$, through externally-applied uniaxial stress. The suppression of subdomains indicates that it is the $Δ_{4y}$ state that is most favoured under compression along the basal Bi-Bi bonds. These results provide an inevitable step towards understanding the microscopic origin of the unique topological nematic superconductivity.