Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
57works
0followers
38topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

57 published item(s)

preprint2026arXiv

CCD-Level and Load-Aware Thread Orchestration for In-Memory Vector ANNS on Multi-Core CPUs

Vector approximate nearest neighbor search (ANNS) underpins search engines, recommendation systems, and advertising services. Recent advances in ANNS indexes make CPU a cost-effective choice for serving million-scale, in-memory vector search, yet per-core throughput remains constrained by memory access latency of vector reading and the compute intensity of distance evaluations in production deployments. With the growing scale of the business and advances in hardware, modern CCD-based multi-core CPUs have been widely deployed for high throughput in our services. However, we find that simply increasing core counts does not yield optimal performance scaling. To improve the efficiency of more cores from the CCD-based architecture, we analyze the distributions of real-world requests in our production environments. We observe high access locality in vector search in our online services and low cache utilization, resulting from overlooking the multi-chiplet nature of CCD based CPUs. Hence, we propose a workload- and hardware-aware thread orchestration framework at CCD-level that (i) provides a uniform interface for both inter-query parallel HNSW search and intra-query parallel IVF search, (ii) achieves cache-friendly and workload-adaptive mapping of task dispatching, and (iii) employs CCD-aware task stealing to address load imbalance. Applied to real production workloads from search, recommendation, and advertising services of Xiaohongshu (RedNote), our approach delivers up to 3.7x higher throughput and 30-90% reductions in P50 and P999 latency. In detail, compared with the original framework, the cache-miss ratio decreases by 6-30%, and the total CPU stall is reduced by 20-80%.

preprint2026arXiv

Context-Free Grammar Inference for Complex Programming Languages in Black Box Settings

Grammar inference for complex programming languages remains a significant challenge, as existing approaches fail to scale to real world datasets within practical time constraints. In our experiments, none of the state-of-the-art tools, including Arvada, Treevada and Kedavra were able to infer grammars for complex languages such as C, C++, and Java within 48 hours. Arvada and Treevada perform grammar inference directly on full-length input examples, which proves inefficient for large files commonly found in such languages. While Kedavra introduces data decomposition to create shorter examples for grammar inference, its lexical analysis still relies on the original inputs. Additionally, its strict no-overgeneralization constraint limits the construction of complex grammars. To overcome these limitations, we propose Crucio, which builds a decomposition forest to extract short examples for lexical and grammar inference via a distributional matrix. Experimental results show that Crucio is the only method capable of successfully inferring grammars for complex programming languages (where the number of nonterminals is up to 23x greater than in prior benchmarks) within reasonable time limits. On the prior simple benchmark, Crucio achieves an average recall improvement of 1.37x and 1.19x over Treevada and Kedavra, respectively, and improves F1 scores by 1.21x and 1.13x.

preprint2026arXiv

DSA-Tokenizer: Disentangled Semantic-Acoustic Tokenization via Flow Matching-based Hierarchical Fusion

Speech tokenizers serve as the cornerstone of discrete Speech Large Language Models (Speech LLMs). Existing tokenizers either prioritize semantic encoding, fuse semantic content with acoustic style inseparably, or achieve incomplete semantic-acoustic disentanglement. To achieve better disentanglement, we propose DSA-Tokenizer, which explicitly disentangles speech into discrete semantic and acoustic tokens via distinct optimization constraints. Specifically, semantic tokens are supervised by ASR to capture linguistic content, while acoustic tokens focus on mel-spectrograms restoration to encode style. To eliminate rigid length constraints between the two sequences, we introduce a hierarchical Flow-Matching decoder that further improve speech generation quality. Furthermore, We employ a joint reconstruction-recombination training strategy to enforce this separation. DSA-Tokenizer enables high fidelity reconstruction and flexible recombination through robust disentanglement, facilitating controllable generation in speech LLMs. Our analysis highlights disentangled tokenization as a pivotal paradigm for future speech modeling. Audio samples are avaialble at https://anonymous.4open.science/w/DSA_Tokenizer_demo/. The code and model will be made publicly available after the paper has been accepted.

preprint2026arXiv

Multi-Agent Collaborative Reward Design for Enhancing Reasoning in Reinforcement Learning

We present CRM (Multi-Agent Collaborative Reward Model), a framework that replaces a single black-box reward model with a coordinated team of specialist evaluators to improve robustness and interpretability in RLHF. Conventional reward models struggle to jointly optimize multiple, sometimes conflicting, preference dimensions (e.g., factuality, helpfulness, safety) and offer limited transparency into why a score is assigned. CRM addresses these issues by decomposing preference evaluation into domain-specific agents that each produce partial signals, alongside global evaluators such as ranker-based and embedding-similarity rewards. A centralized aggregator fuses these signals at each timestep, balancing factors like step-wise correctness, multi-agent agreement, and repetition penalties, yielding a single training reward compatible with standard RL pipelines. The policy is optimized with advantage-based updates (e.g., GAE), while a value model regresses to the aggregated reward, enabling multi-perspective reward shaping without requiring additional human annotations beyond those used to train the evaluators. To support training and assessment, we introduce rewardBench, a benchmark and training suite aligned with the collaborative structure of CRM. Together, CRM and rewardBench provide a practical, modular path to more transparent reward modeling and more stable optimization.

preprint2026arXiv

Optimizing View Change for Byzantine Fault Tolerance in Parallel Consensus

The parallel Byzantine Fault Tolerant (BFT) protocol is viewed as a promising solution to address the consensus scalability issue of the permissioned blockchain. One of the main challenges in parallel BFT is the view change process that happens when the leader node fails, which can lead to performance bottlenecks. Existing parallel BFT protocols typically rely on passive view change mechanisms with blind leader rotation. Such approaches frequently select unavailable or slow nodes as leaders, resulting in degraded performance. To address these challenges, we propose a View Change Optimization (VCO) model based on mixed integer programming that optimizes leader selection and follower reassignment across parallel committees by considering communication delays and failure scenarios. We applied a decomposition method with efficient subproblems and improved benders cuts to solve the VCO model. Leveraging the results of improved decomposition solution method, we propose an efficient iterative backup leader selection algorithm as views proceed. By performing experiments in Microsoft Azure cloud environments, we demonstrate that the VCO-driven parallel BFT outperforms existing configuration methods under both normal operation and faulty condition. The results show that the VCO model is effective as network size increases, making it a suitable solution for high-performance parallel BFT systems.

preprint2026arXiv

Simulating decoherence of two coupled spins using the generalized cluster correlation expansion

We simulate the coherence of two coupled electron spins interacting with a bath of nuclei using the generalized cluster correlation expansion (gCCE) method. An exchange interaction between the electrons facilitates a family of entangling gates that can be spoiled by nuclear-induced dephasing. Consequently, we study the dephasing of the coherent two-electron system by characterizing the $T_2$ and $T_2^*$ of the two-electron reduced density matrix for various system parameters in the range mimicking magnetic molecules, including magnetic field strength and orientation, exchange interaction strength, distance between the two spins, minimum distance between electron and nuclei and between nuclei, and nuclei density. We find the optimal regime for each parameter in which the coherence time is maximized and provide a physical understanding of it.

preprint2025arXiv

LLHA-Net: A Hierarchical Attention Network for Two-View Correspondence Learning

Establishing the correct correspondence of feature points is a fundamental task in computer vision. However, the presence of numerous outliers among the feature points can significantly affect the matching results, reducing the accuracy and robustness of the process. Furthermore, a challenge arises when dealing with a large proportion of outliers: how to ensure the extraction of high-quality information while reducing errors caused by negative samples. To address these issues, in this paper, we propose a novel method called Layer-by-Layer Hierarchical Attention Network, which enhances the precision of feature point matching in computer vision by addressing the issue of outliers. Our method incorporates stage fusion, hierarchical extraction, and an attention mechanism to improve the network's representation capability by emphasizing the rich semantic information of feature points. Specifically, we introduce a layer-by-layer channel fusion module, which preserves the feature semantic information from each stage and achieves overall fusion, thereby enhancing the representation capability of the feature points. Additionally, we design a hierarchical attention module that adaptively captures and fuses global perception and structural semantic information using an attention mechanism. Finally, we propose two architectures to extract and integrate features, thereby improving the adaptability of our network. We conduct experiments on two public datasets, namely YFCC100M and SUN3D, and the results demonstrate that our proposed method outperforms several state-of-the-art techniques in both outlier removal and camera pose estimation. Source code is available at http://www.linshuyuan.com.

preprint2023arXiv

Entanglement structure in the volume-law phase of hybrid quantum automaton circuits

We study entanglement fluctuations and quantum error correction in the weakly monitored volume-law phase of quantum automaton circuits subject to repeated local measurements. We numerically observe that the entanglement entropy exhibits strong fluctuation with the exponent close to the ``growth exponent'' of the Kardar-Parisi-Zhang (KPZ) universality class, the same as other local random circuits studied previously. We also investigate the dynamically generated quantum error correction code in the purification process and show that this model has different contiguous code distances for two types of errors that exhibit similar sublinear power-law scaling. We give an interpretation of these results by mapping them to various quantities in a classical particle model. We demonstrate that the subleading correction term of the entanglement entropy and the sublinear power-law scaling of the contiguous code distance in the volume-law phase are both the emergent phenomena of the hybrid random dynamics. Finally, we show that this classical particle dynamics itself has a type of error correction ability and can dynamically generate a classical linear code.

preprint2023arXiv

Holistic Multi-Slice Framework for Dynamic Simultaneous Multi-Slice MRI Reconstruction

Dynamic Magnetic Resonance Imaging (dMRI) is widely used to assess various cardiac conditions such as cardiac motion and blood flow. To accelerate MR acquisition, techniques such as undersampling and Simultaneous Multi-Slice (SMS) are often used. Special reconstruction algorithms are needed to reconstruct multiple SMS image slices from the entangled information. Deep learning (DL)-based methods have shown promising results for single-slice MR reconstruction, but the addition of SMS acceleration raises unique challenges due to the composite k-space signals and the resulting images with strong inter-slice artifacts. Furthermore, many dMRI applications lack sufficient data for training reconstruction neural networks. In this study, we propose a novel DL-based framework for dynamic SMS reconstruction. Our main contributions are 1) a combination of data transformation steps and network design that effectively leverages the unique characteristics of undersampled dynamic SMS data, and 2) an MR physics-guided transfer learning strategy that addresses the data scarcity issue. Thorough comparisons with multiple baseline methods illustrate the strengths of our proposed methods.

preprint2022arXiv

Accelerating Representation Learning with View-Consistent Dynamics in Data-Efficient Reinforcement Learning

Learning informative representations from image-based observations is of fundamental concern in deep Reinforcement Learning (RL). However, data-inefficiency remains a significant barrier to this objective. To overcome this obstacle, we propose to accelerate state representation learning by enforcing view-consistency on the dynamics. Firstly, we introduce a formalism of Multi-view Markov Decision Process (MMDP) that incorporates multiple views of the state. Following the structure of MMDP, our method, View-Consistent Dynamics (VCD), learns state representations by training a view-consistent dynamics model in the latent space, where views are generated by applying data augmentation to states. Empirical evaluation on DeepMind Control Suite and Atari-100k demonstrates VCD to be the SoTA data-efficient algorithm on visual control tasks.

preprint2022arXiv

Automatic Comment Generation via Multi-Pass Deliberation

Deliberation is a common and natural behavior in human daily life. For example, when writing papers or articles, we usually first write drafts, and then iteratively polish them until satisfied. In light of such a human cognitive process, we propose DECOM, which is a multi-pass deliberation framework for automatic comment generation. DECOM consists of multiple Deliberation Models and one Evaluation Model. Given a code snippet, we first extract keywords from the code and retrieve a similar code fragment from a pre-defined corpus. Then, we treat the comment of the retrieved code as the initial draft and input it with the code and keywords into DECOM to start the iterative deliberation process. At each deliberation, the deliberation model polishes the draft and generates a new comment. The evaluation model measures the quality of the newly generated comment to determine whether to end the iterative process or not. When the iterative process is terminated, the best-generated comment will be selected as the target comment. Our approach is evaluated on two real-world datasets in Java (87K) and Python (108K), and experiment results show that our approach outperforms the state-of-the-art baselines. A human evaluation study also confirms the comments generated by DECOM tend to be more readable, informative, and useful.

preprint2022arXiv

BugListener: Identifying and Synthesizing Bug Reports from Collaborative Live Chats

In community-based software development, developers frequently rely on live-chatting to discuss emergent bugs/errors they encounter in daily development tasks. However, it remains a challenging task to accurately record such knowledge due to the noisy nature of interleaved dialogs in live chat data. In this paper, we first formulate the task of identifying and synthesizing bug reports from community live chats, and propose a novel approach, named BugListener, to address the challenges. Specifically, BugListener automates three sub-tasks: 1) Disentangle the dialogs from massive chat logs by using a Feed-Forward neural network; 2) Identify the bug-report dialogs from separated dialogs by modeling the original dialog to the graph-structured dialog and leveraging the graph neural network to learn the contextual information; 3) Synthesize the bug reports by utilizing the TextCNN model and Transfer Learning network to classify the sentences into three groups: observed behaviors (OB), expected behaviors (EB), and steps to reproduce the bug (SR). BugListener is evaluated on six open source projects. The results show that: for bug report identification, BugListener achieves the average F1 of 74.21%, improving the best baseline by 10.37%; and for bug report synthesis task, BugListener could classify the OB, EB, and SR sentences with the F1 of 67.37%, 87.14%, and 65.03%, improving the best baselines by 7.21%, 7.38%, 5.30%, respectively. A human evaluation also confirms the effectiveness of BugListener in generating relevant and accurate bug reports. These demonstrate the significant potential of applying BugListener in community-based software development, for promoting bug discovery and quality improvement.

preprint2022arXiv

Characterizing Sensor Leaks in Android Apps

While extremely valuable to achieve advanced functions, mobile phone sensors can be abused by attackers to implement malicious activities in Android apps, as experimentally demonstrated by many state-of-the-art studies. There is hence a strong need to regulate the usage of mobile sensors so as to keep them from being exploited by malicious attackers. However, despite the fact that various efforts have been put in achieving this, i.e., detecting privacy leaks in Android apps, we have not yet found approaches to automatically detect sensor leaks in Android apps. To fill the gap, we designed and implemented a novel prototype tool, SEEKER, that extends the famous FlowDroid tool to detect sensor-based data leaks in Android apps. SEEKER conducts sensor-focused static taint analyses directly on the Android apps' bytecode and reports not only sensor-triggered privacy leaks but also the sensor types involved in the leaks. Experimental results using over 40,000 real-world Android apps show that SEEKER is effective in detecting sensor leaks in Android apps, and malicious apps are more interested in leaking sensor data than benign apps.

preprint2022arXiv

CoCA-MDD: A Coupled Cross-Attention based Framework for Streaming Mispronunciation Detection and Diagnosis

Mispronunciation detection and diagnosis (MDD) is a popular research focus in computer-aided pronunciation training (CAPT) systems. End-to-end (e2e) approaches are becoming dominant in MDD. However an e2e MDD model usually requires entire speech utterances as input context, which leads to significant time latency especially for long paragraphs. We propose a streaming e2e MDD model called CoCA-MDD. We utilize conv-transformer structure to encode input speech in a streaming manner. A coupled cross-attention (CoCA) mechanism is proposed to integrate frame-level acoustic features with encoded reference linguistic features. CoCA also enables our model to perform mispronunciation classification with whole utterances. The proposed model allows system fusion between the streaming output and mispronunciation classification output for further performance enhancement. We evaluate CoCA-MDD on publicly available corpora. CoCA-MDD achieves F1 scores of 57.03% and 60.78% for streaming and fusion modes respectively on L2-ARCTIC. For phone-level pronunciation scoring, CoCA-MDD achieves 0.58 Pearson correlation coefficient (PCC) value on SpeechOcean762.

preprint2022arXiv

Constructing dynamic residential energy lifestyles using Latent Dirichlet Allocation

The rapid expansion of Advanced Meter Infrastructure (AMI) has dramatically altered the energy information landscape. However, our ability to use this information to generate actionable insights about residential electricity demand remains limited. In this research, we propose and test a new framework for understanding residential electricity demand by using a dynamic energy lifestyles approach that is iterative and highly extensible. To obtain energy lifestyles, we develop a novel approach that applies Latent Dirichlet Allocation (LDA), a method commonly used for inferring the latent topical structure of text data, to extract a series of latent household energy attributes. By doing so, we provide a new perspective on household electricity consumption where each household is characterized by a mixture of energy attributes that form the building blocks for identifying a sparse collection of energy lifestyles. We examine this approach by running experiments on one year of hourly smart meter data from 60,000 households and we extract six energy attributes that describe general daily use patterns. We then use clustering techniques to derive six distinct energy lifestyle profiles from energy attribute proportions. Our lifestyle approach is also flexible to varying time interval lengths, and we test our lifestyle approach seasonally (Autumn, Winter, Spring, and Summer) to track energy lifestyle dynamics within and across households and find that around 73% of households manifest multiple lifestyles across a year. These energy lifestyles are then compared to different energy use characteristics, and we discuss their practical applications for demand response program design and lifestyle change analysis.

preprint2022arXiv

Deep Statistic Shape Model for Myocardium Segmentation

Accurate segmentation and motion estimation of myocardium have always been important in clinic field, which essentially contribute to the downstream diagnosis. However, existing methods cannot always guarantee the shape integrity for myocardium segmentation. In addition, motion estimation requires point correspondence on the myocardium region across different frames. In this paper, we propose a novel end-to-end deep statistic shape model to focus on myocardium segmentation with both shape integrity and boundary correspondence preserving. Specifically, myocardium shapes are represented by a fixed number of points, whose variations are extracted by Principal Component Analysis (PCA). Deep neural network is used to predict the transformation parameters (both affine and deformation), which are then used to warp the mean point cloud to the image domain. Furthermore, a differentiable rendering layer is introduced to incorporate mask supervision into the framework to learn more accurate point clouds. In this way, the proposed method is able to consistently produce anatomically reasonable segmentation mask without post processing. Additionally, the predicted point cloud guarantees boundary correspondence for sequential images, which contributes to the downstream tasks, such as the motion estimation of myocardium. We conduct several experiments to demonstrate the effectiveness of the proposed method on several benchmark datasets.

preprint2022arXiv

DeepFD: Automated Fault Diagnosis and Localization for Deep Learning Programs

As Deep Learning (DL) systems are widely deployed for mission-critical applications, debugging such systems becomes essential. Most existing works identify and repair suspicious neurons on the trained Deep Neural Network (DNN), which, unfortunately, might be a detour. Specifically, several existing studies have reported that many unsatisfactory behaviors are actually originated from the faults residing in DL programs. Besides, locating faulty neurons is not actionable for developers, while locating the faulty statements in DL programs can provide developers with more useful information for debugging. Though a few recent studies were proposed to pinpoint the faulty statements in DL programs or the training settings (e.g. too large learning rate), they were mainly designed based on predefined rules, leading to many false alarms or false negatives, especially when the faults are beyond their capabilities. In view of these limitations, in this paper, we proposed DeepFD, a learning-based fault diagnosis and localization framework which maps the fault localization task to a learning problem. In particular, it infers the suspicious fault types via monitoring the runtime features extracted during DNN model training and then locates the diagnosed faults in DL programs. It overcomes the limitations by identifying the root causes of faults in DL programs instead of neurons and diagnosing the faults by a learning approach instead of a set of hard-coded rules. The evaluation exhibits the potential of DeepFD. It correctly diagnoses 52% faulty DL programs, compared with around half (27%) achieved by the best state-of-the-art works. Besides, for fault localization, DeepFD also outperforms the existing works, correctly locating 42% faulty programs, which almost doubles the best result (23%) achieved by the existing works.

preprint2022arXiv

DProQ: A Gated-Graph Transformer for Protein Complex Structure Assessment

Proteins interact to form complexes to carry out essential biological functions. Computational methods have been developed to predict the structures of protein complexes. However, an important challenge in protein complex structure prediction is to estimate the quality of predicted protein complex structures without any knowledge of the corresponding native structures. Such estimations can then be used to select high-quality predicted complex structures to facilitate biomedical research such as protein function analysis and drug discovery. We challenge this significant task with DProQ, which introduces a gated neighborhood-modulating Graph Transformer (GGT) designed to predict the quality of 3D protein complex structures. Notably, we incorporate node and edge gates within a novel Graph Transformer framework to control information flow during graph message passing. We train and evaluate DProQ on four newly-developed datasets that we make publicly available in this work. Our rigorous experiments demonstrate that DProQ achieves state-of-the-art performance in ranking protein complex structures.

preprint2022arXiv

DyLex: Incorporating Dynamic Lexicons into BERT for Sequence Labeling

Incorporating lexical knowledge into deep learning models has been proved to be very effective for sequence labeling tasks. However, previous works commonly have difficulty dealing with large-scale dynamic lexicons which often cause excessive matching noise and problems of frequent updates. In this paper, we propose DyLex, a plug-in lexicon incorporation approach for BERT based sequence labeling tasks. Instead of leveraging embeddings of words in the lexicon as in conventional methods, we adopt word-agnostic tag embeddings to avoid re-training the representation while updating the lexicon. Moreover, we employ an effective supervised lexical knowledge denoising method to smooth out matching noise. Finally, we introduce a col-wise attention based knowledge fusion mechanism to guarantee the pluggability of the proposed framework. Experiments on ten datasets of three tasks show that the proposed framework achieves new SOTA, even with very large scale lexicons.

preprint2022arXiv

Efficient, Interpretable Graph Neural Network Representation for Angle-dependent Properties and its Application to Optical Spectroscopy

Graph neural networks are attractive for learning properties of atomic structures thanks to their intuitive graph encoding of atoms and bonds. However, conventional encoding does not include angular information, which is critical for describing atomic arrangements in disordered systems. In this work, we extend the recently proposed ALIGNN encoding, which incorporates bond angles, to also include dihedral angles (ALIGNN-d). This simple extension leads to a memory-efficient graph representation that captures the complete geometry of atomic structures. ALIGNN-d is applied to predict the infrared optical response of dynamically disordered Cu(II) aqua complexes, leveraging the intrinsic interpretability to elucidate the relative contributions of individual structural components. Bond and dihedral angles are found to be critical contributors to the fine structure of the absorption response, with distortions representing transitions between more common geometries exhibiting the strongest absorption intensity. Future directions for further development of ALIGNN-d are discussed.

preprint2022arXiv

EGR: Equivariant Graph Refinement and Assessment of 3D Protein Complex Structures

Protein complexes are macromolecules essential to the functioning and well-being of all living organisms. As the structure of a protein complex, in particular its region of interaction between multiple protein subunits (i.e., chains), has a notable influence on the biological function of the complex, computational methods that can quickly and effectively be used to refine and assess the quality of a protein complex's 3D structure can directly be used within a drug discovery pipeline to accelerate the development of new therapeutics and improve the efficacy of future vaccines. In this work, we introduce the Equivariant Graph Refiner (EGR), a novel E(3)-equivariant graph neural network (GNN) for multi-task structure refinement and assessment of protein complexes. Our experiments on new, diverse protein complex datasets, all of which we make publicly available in this work, demonstrate the state-of-the-art effectiveness of EGR for atomistic refinement and assessment of protein complexes and outline directions for future work in the field. In doing so, we establish a baseline for future studies in macromolecular refinement and structure analysis.

preprint2022arXiv

Entanglement phase transitions in random stabilizer tensor networks

We explore a class of random tensor network models with "stabilizer" local tensors which we name Random Stabilizer Tensor Networks (RSTNs). For RSTNs defined on a two-dimensional square lattice, we perform extensive numerical studies of entanglement phase transitions between volume-law and area-law entangled phases of the one-dimensional boundary states. These transitions occur when either (a) the bond dimension $D$ of the constituent tensors is varied, or (b) the tensor network is subject to random breaking of bulk bonds, implemented by forced measurements. In the absence of broken bonds, we find that the RSTN supports a volume-law entangled boundary state with bond dimension $D\geq3$ where $D$ is a prime number, and an area-law entangled boundary state for $D=2$. Upon breaking bonds at random in the bulk with probability $p$, there exists a critical measurement rate $p_c$ for each $D\geq 3$ above which the boundary state becomes area-law entangled. To explore the conformal invariance at these entanglement transitions for different prime $D$, we consider tensor networks on a finite rectangular geometry with a variety of boundary conditions, and extract universal operator scaling dimensions via extensive numerical calculations of the entanglement entropy, mutual information and mutual negativity at their respective critical points. Our results at large $D$ approach known universal data of percolation conformal field theory, while showing clear discrepancies at smaller $D$, suggesting a distinct entanglement transition universality class for each prime $D$. We further study universal entanglement properties in the volume-law phase and demonstrate quantitative agreement with the recently proposed description in terms of a directed polymer in a random environment.

preprint2022arXiv

Hydrodynamic theory of scrambling in chaotic long-range interacting systems

The Fisher-Kolmogorov-Petrovsky-Piskunov (FKPP) equation provides a mean-field theory of out-of-time-ordered commutators in locally interacting quantum chaotic systems at high energy density; in the systems with power-law interactions, the corresponding fractional-derivative FKPP equation provides an analogous mean-field theory. However, the fractional FKPP description is potentially subject to strong quantum fluctuation effects, so it is not clear a priori if it provides a suitable effective description for generic chaotic systems with power-law interactions. Here we study this problem using a model of coupled quantum dots with interactions decaying as $\frac{1}{r^α}$, where each dot hosts $N$ degrees of freedom. The large $N$ limit corresponds to the mean-field description, while quantum fluctuations contributing to the OTOC can be modeled by $\frac{1}{N}$ corrections consisting of a cutoff function and noise. Within this framework, we show that the parameters of the effective theory can be chosen to reproduce the butterfly light cone scalings that we previously found for $N=1$ and generic finite $N$. In order to reproduce these scalings, the fractional index $μ$ in the FKPP equation needs to be shifted from the naïve value of $μ= 2α- 1$ to a renormalized value $μ= 2α- 2$. We provide supporting analytic evidence for the cutoff model and numerical confirmation for the full fractional FKPP equation with cutoff and noise.

preprint2022arXiv

Invertible Sharpening Network for MRI Reconstruction Enhancement

High-quality MRI reconstruction plays a critical role in clinical applications. Deep learning-based methods have achieved promising results on MRI reconstruction. However, most state-of-the-art methods were designed to optimize the evaluation metrics commonly used for natural images, such as PSNR and SSIM, whereas the visual quality is not primarily pursued. Compared to the fully-sampled images, the reconstructed images are often blurry, where high-frequency features might not be sharp enough for confident clinical diagnosis. To this end, we propose an invertible sharpening network (InvSharpNet) to improve the visual quality of MRI reconstructions. During training, unlike the traditional methods that learn to map the input data to the ground truth, InvSharpNet adapts a backward training strategy that learns a blurring transform from the ground truth (fully-sampled image) to the input data (blurry reconstruction). During inference, the learned blurring transform can be inverted to a sharpening transform leveraging the network's invertibility. The experiments on various MRI datasets demonstrate that InvSharpNet can improve reconstruction sharpness with few artifacts. The results were also evaluated by radiologists, indicating better visual quality and diagnostic confidence of our proposed method.

preprint2022arXiv

Length L-function for Network-Constrained Point Data

Network constrained points are referred to as points restricted to road networks, such as taxi pick up and drop off locations. A significant pattern of network constrained points is referred to as an aggregation; e.g., the aggregation of pick up points may indicate a high taxi demand in a particular area. Although the network K function using the shortest path network distance has been proposed to detect point aggregation, its statistical unit is still radius based. R neighborhood, in particular, has inconsistent network length owing to the complex configuration of road networks which cause unfair counts and identification errors in networks (e.g., the length of the r neighborhood located at an intersection is longer than that on straight roads, which may include more points). In this study, we derived the length L function for network constrained points to identify the aggregation by designing a novel neighborhood as the statistical unit; the total length of this is consistent throughout the network. Compared to the network K function, our method can detect a true to life aggregation scale, identify the aggregation with higher network density, as well as identify the aggregations that the network K function cannot. We validated our method using taxi trips pick up location data within Zhongguancun Area in Beijing, analyzing differences in maximal aggregation between workdays and weekends to understand taxi demand in the morning and evening peak.

preprint2022arXiv

Low-bit Shift Network for End-to-End Spoken Language Understanding

Deep neural networks (DNN) have achieved impressive success in multiple domains. Over the years, the accuracy of these models has increased with the proliferation of deeper and more complex architectures. Thus, state-of-the-art solutions are often computationally expensive, which makes them unfit to be deployed on edge computing platforms. In order to mitigate the high computation, memory, and power requirements of inferring convolutional neural networks (CNNs), we propose the use of power-of-two quantization, which quantizes continuous parameters into low-bit power-of-two values. This reduces computational complexity by removing expensive multiplication operations and with the use of low-bit weights. ResNet is adopted as the building block of our solution and the proposed model is evaluated on a spoken language understanding (SLU) task. Experimental results show improved performance for shift neural network architectures, with our low-bit quantization achieving 98.76 \% on the test set which is comparable performance to its full-precision counterpart and state-of-the-art solutions.

preprint2022arXiv

Mining Android API Usage to Generate Unit Test Cases for Pinpointing Compatibility Issues

Despite being one of the largest and most popular projects, the official Android framework has only provided test cases for less than 30% of its APIs. Such a poor test case coverage rate has led to many compatibility issues that can cause apps to crash at runtime on specific Android devices, resulting in poor user experiences for both apps and the Android ecosystem. To mitigate this impact, various approaches have been proposed to automatically detect such compatibility issues. Unfortunately, these approaches have only focused on detecting signature-induced compatibility issues (i.e., a certain API does not exist in certain Android versions), leaving other equally important types of compatibility issues unresolved. In this work, we propose a novel prototype tool, JUnitTestGen, to fill this gap by mining existing Android API usage to generate unit test cases. After locating Android API usage in given real-world Android apps, JUnitTestGen performs inter-procedural backward data-flow analysis to generate a minimal executable code snippet (i.e., test case). Experimental results on thousands of real-world Android apps show that JUnitTestGen is effective in generating valid unit test cases for Android APIs. We show that these generated test cases are indeed helpful for pinpointing compatibility issues, including ones involving semantic code changes.

preprint2022arXiv

MSE-Based Transceiver Designs for RIS-Aided Communications With Hardware Impairments

It is challenging to precisely configure the phase shifts of the reflecting elements at the reconfigurable intelligent surface (RIS) due to inherent hardware impairments (HIs). In this paper, the mean square error (MSE) performance is investigated in an RIS-aided single-user multiple-input multipleoutput (MIMO) communication system with transceiver HIs and RIS phase noise. We aim to jointly optimize the transmit precoder, linear received equalizer, and RIS reflecting matrices to minimize the MSE. To tackle this problem, an iterative algorithm is proposed, wherein the beamforming matrices are alternately optimized. Specifically, for the beamforming optimization subproblem, we derive the closed-form expression of the optimal precoder and equalizer matrices. Then, for the phase shift optimization subproblem, an efficient algorithm based on the majorization-minimization (MM) method is proposed. Simulation results show that the proposed MSE-based RIS-aided transceiver scheme dramatically outperforms the conventional system algorithms that do not consider HIs at both the transceiver and the RIS.

preprint2022arXiv

Robust Landmark-based Stent Tracking in X-ray Fluoroscopy

In clinical procedures of angioplasty (i.e., open clogged coronary arteries), devices such as balloons and stents need to be placed and expanded in arteries under the guidance of X-ray fluoroscopy. Due to the limitation of X-ray dose, the resulting images are often noisy. To check the correct placement of these devices, typically multiple motion-compensated frames are averaged to enhance the view. Therefore, device tracking is a necessary procedure for this purpose. Even though angioplasty devices are designed to have radiopaque markers for the ease of tracking, current methods struggle to deliver satisfactory results due to the small marker size and complex scenes in angioplasty. In this paper, we propose an end-to-end deep learning framework for single stent tracking, which consists of three hierarchical modules: U-Net based landmark detection, ResNet based stent proposal and feature extraction, and graph convolutional neural network (GCN) based stent tracking that temporally aggregates both spatial information and appearance features. The experiments show that our method performs significantly better in detection compared with the state-of-the-art point-based tracking models. In addition, its fast inference speed satisfies clinical requirements.

preprint2022arXiv

The Astrometric Performance Test of 80-cm Telescope at Yaoan Station and Precise CCD Positions of Apophis

The 80-cm azimuthal telescope is newly mounted at Yaoan Station, Purple Mountain Observatory in 2018. The astrometric performance of the telescope is tested in the following three aspects. (a) The geometric distortion of its CCD attached. It is stable in both a single epoch and multi epochs. Eight distortion solutions are derived over about one year. The maximum values range from 0.75 to 0.79 pixel and the median values range from 0.14 to 0.16 pixel. (b) The limit magnitude of stars. About 20.5 magnitude (Gaia-G) stars can be detected with Johnson-V filter exposured in 300 seconds. The astrometric error of about 20.5 magnitude stars is estimated at 0.14 arcsec using the fitted sigmoidal function. (c) The astrometric accuracy and the precision of stacked fast-moving faint object. 24 stacked frames of the potentially hazardous asteroid (PHA) (99942) Apophis are derived on April 14 and 15, 2021 (fainter than 18 mag) based on the ephemeris shifts. During data reduction, the newest Gaia EDR3 Catalog and Jet Propulsion Laboratory Horizons ephemeris are referenced as theoretical positions of stars and Apophis, respectively. Our results show that the mean (O-C)s (observed minus computed) of Apophis are -0.018 and 0.020 arcsec in right ascention and declination, and the dispersions are estimated at 0.094 and 0.085 arcsec, respectively, which show the consistency of the stacked results by Astrometrica.

preprint2022arXiv

Universal Entanglement Transitions of Free Fermions with Long-range Non-unitary Dynamics

Non-unitary evolution can give rise to novel steady states classified by their entanglement properties. In this work, we aim to understand its interplay with long-range hopping that decays with $r^{-α}$ in free-fermion systems. We first study two solvable Brownian models with long-range non-unitary dynamics: a large-$N$ SYK$_2$ chain and a single-flavor fermion chain and we show that they share the same phase diagram. When $α>0.5$, we observe two critical phases with subvolume entanglement scaling: (i) $α>1.5$, a logarithmic phase with dynamical exponent $z=1$ and logarithmic subsystem entanglement, and (ii) $0.5<α<1.5$, a fractal phase with $z=\frac{2α-1}{2}$ and subsystem entanglement $S_A\propto L_A^{1-z}$, where $L_A$ is the length of the subsystem $A$. These two phases cannot be distinguished by the purification dynamics, in which the entropy always decays as $L/T$. We then confirm that the results are also valid for the static SYK$_2$ chain, indicating the phase diagram is universal for general free-fermion systems. We also discuss phase diagrams in higher dimensions and the implication in measurement-induced phase transitions.

preprint2021arXiv

Enhanced Coalbed Methane Extraction by Geothermal Stimulation in Deep Coal Mines: An Appraisal

Coalbed methane embedded in coal seams, is an unconventional energy resource as well as a hazardous gas existing in mining industries, which attracts lots of global attention. As the largest coal producer, the mining industry in China had to deal with many hazards induced by methane for decades. To solve this issue, underground methane extraction is commonly used in underground coal mines. However, underground methane extraction is hampered by low production rate and low efficiency because of slow gas emission from coal primarily controlled by gas desorption and permeability. It is well known that temperature has a great impact on gas sorption. The higher the temperature the larger the desorption rate. As the depth of coal mines increases beyond 1000m coal mines suffer elevated air temperatures caused by the natural geothermal gradient. The elevated temperature in such mines provides a potential economical way for geothermal energy extraction and utilization in deep coal mines which can largely cut the expenses of installation and operation maintenance. Therefore, a novel method is proposed to enhance underground methane extraction by deep heat stimulation. This paper mainly presents an assessment of previous and ongoing research in the related field and provides a first feasibility analysis of this method applied in the underground environment. The technique proposed in this early appraisal is deemed significant for coalbed methane drainage enhancing the productivity of deep coal mines by geothermal technology and can also be extended for many applications in relevant areas such as shale gas, and tight oil.

preprint2021arXiv

Group-Skeleton-Based Human Action Recognition in Complex Events

Human action recognition as an important application of computer vision has been studied for decades. Among various approaches, skeleton-based methods recently attract increasing attention due to their robust and superior performance. However, existing skeleton-based methods ignore the potential action relationships between different persons, while the action of a person is highly likely to be impacted by another person especially in complex events. In this paper, we propose a novel group-skeleton-based human action recognition method in complex events. This method first utilizes multi-scale spatial-temporal graph convolutional networks (MS-G3Ds) to extract skeleton features from multiple persons. In addition to the traditional key point coordinates, we also input the key point speed values to the networks for better performance. Then we use multilayer perceptrons (MLPs) to embed the distance values between the reference person and other persons into the extracted features. Lastly, all the features are fed into another MS-G3D for feature fusion and classification. For avoiding class imbalance problems, the networks are trained with a focal loss. The proposed algorithm is also our solution for the Large-scale Human-centric Video Analysis in Complex Events Challenge. Results on the HiEve dataset show that our method can give superior performance compared to other state-of-the-art methods.

preprint2021arXiv

Measurement-induced phase transitions in quantum automaton circuits

We study the entanglement dynamics in a generic quantum automaton circuit subjected to projective measurements. We design an efficient algorithm which not only allows us to perform large scale simulation for the Rényi entropy but also provides a physical picture for the entanglement dynamics, which can be interpreted in terms of a classical bit-string model which belongs to the directed percolation universality class. We study the purification dynamics of a state formed by EPR pairs, and the growth of entanglement starting from a product state. In both cases, we verify numerically that the dynamics is in the universality class of classical directed percolation.

preprint2021arXiv

Multifractality in non-unitary random dynamics

We explore the multifractality of the steady state wave function in non-unitary random quantum dynamics in one dimension. We focus on two classes of random systems: the hybrid Clifford circuit model and the non-unitary free fermion dynamics. In the hybrid Clifford model, we map the measurement driven transition to an Anderson localization transition in an effective graph space by using properties of the stabilizer state. We show that the volume law phase with nonzero measurement rate is non-ergodic in the graph space and exhibits weak multifractal behavior. We apply the same method to the hybrid Clifford quantum automaton circuit and obtain similar multifractality in the volume law phase. For the non-unitary random free fermion system with a critical steady state, we compute the moments of the probability distribution of the single particle wave function and demonstrate that it is also weakly multifractal and has strong variations in real space.

preprint2021arXiv

Non-unitary dynamics of Sachdev-Ye-Kitaev chain

We construct a series of one-dimensional non-unitary dynamics consisting of both unitary and imaginary evolutions based on the Sachdev-Ye-Kitaev model. Starting from a short-range entangled state, we analyze the entanglement dynamics using the path integral formalism in the large $N$ limit. Among all the results that we obtain, two of them are particularly interesting: (1) By varying the strength of the imaginary evolution, the interacting model exhibits a first order phase transition from the highly entangled volume law phase to an area law phase; (2) The one-dimensional free fermion model displays an extensive critical regime with emergent two-dimensional conformal symmetry.

preprint2020arXiv

An Investigation of Few-Shot Learning in Spoken Term Classification

In this paper, we investigate the feasibility of applying few-shot learning algorithms to a speech task. We formulate a user-defined scenario of spoken term classification as a few-shot learning problem. In most few-shot learning studies, it is assumed that all the N classes are new in a N-way problem. We suggest that this assumption can be relaxed and define a N+M-way problem where N and M are the number of new classes and fixed classes respectively. We propose a modification to the Model-Agnostic Meta-Learning (MAML) algorithm to solve the problem. Experiments on the Google Speech Commands dataset show that our approach outperforms the conventional supervised learning approach and the original MAML.

preprint2020arXiv

Anatomy-Aware Cardiac Motion Estimation

Cardiac motion estimation is critical to the assessment of cardiac function. Myocardium feature tracking (FT) can directly estimate cardiac motion from cine MRI, which requires no special scanning procedure. However, current deep learning-based FT methods may result in unrealistic myocardium shapes since the learning is solely guided by image intensities without considering anatomy. On the other hand, motion estimation through learning is challenging because ground-truth motion fields are almost impossible to obtain. In this study, we propose a novel Anatomy-Aware Tracker (AATracker) for cardiac motion estimation that preserves anatomy by weak supervision. A convolutional variational autoencoder (VAE) is trained to encapsulate realistic myocardium shapes. A baseline dense motion tracker is trained to approximate the motion fields and then refined to estimate anatomy-aware motion fields under the weak supervision from the VAE. We evaluate the proposed method on long-axis cardiac cine MRI, which has more complex myocardium appearances and motions than short-axis. Compared with other methods, AATracker significantly improves the tracking performance and provides visually more realistic tracking results, demonstrating the effectiveness of the proposed weakly-supervision scheme in cardiac motion estimation.

preprint2020arXiv

Computational Performance of a Germline Variant Calling Pipeline for Next Generation Sequencing

With the booming of next generation sequencing technology and its implementation in clinical practice and life science research, the need for faster and more efficient data analysis methods becomes pressing in the field of sequencing. Here we report on the evaluation of an optimized germline mutation calling pipeline, HummingBird, by assessing its performance against the widely accepted BWA-GATK pipeline. We found that the HummingBird pipeline can significantly reduce the running time of the primary data analysis for whole genome sequencing and whole exome sequencing while without significantly sacrificing the variant calling accuracy. Thus, we conclude that expansion of such software usage will help to improve the primary data analysis efficiency for next generation sequencing.

preprint2020arXiv

Conv-Transformer Transducer: Low Latency, Low Frame Rate, Streamable End-to-End Speech Recognition

Transformer has achieved competitive performance against state-of-the-art end-to-end models in automatic speech recognition (ASR), and requires significantly less training time than RNN-based models. The original Transformer, with encoder-decoder architecture, is only suitable for offline ASR. It relies on an attention mechanism to learn alignments, and encodes input audio bidirectionally. The high computation cost of Transformer decoding also limits its use in production streaming systems. To make Transformer suitable for streaming ASR, we explore Transducer framework as a streamable way to learn alignments. For audio encoding, we apply unidirectional Transformer with interleaved convolution layers. The interleaved convolution layers are used for modeling future context which is important to performance. To reduce computation cost, we gradually downsample acoustic input, also with the interleaved convolution layers. Moreover, we limit the length of history context in self-attention to maintain constant computation cost for each decoding step. We show that this architecture, named Conv-Transformer Transducer, achieves competitive performance on LibriSpeech dataset (3.6\% WER on test-clean) without external language models. The performance is comparable to previously published streamable Transformer Transducer and strong hybrid streaming ASR systems, and is achieved with smaller look-ahead window (140~ms), fewer parameters and lower frame rate.

preprint2020arXiv

Data-Rate Driven Transmission Strategy for Deep Learning Based Communication Systems

Deep learning (DL) based autoencoder is a promising architecture to implement end-to-end communication systems. One fundamental problem of such systems is how to increase the transmission rate. Two new schemes are proposed to address the limited data rate issue: adaptive transmission scheme and generalized data representation (GDR) scheme. In the first scheme, an adaptive transmission is designed to select the transmission vectors for maximizing the data rate under different channel conditions. The block error rate (BLER) of the first scheme is 80% lower than that of the conventional one-hot vector scheme. This implies that higher data rate can be achieved by the adaptive transmission scheme. In the second scheme, the GDR replaces the conventional one-hot representation. The GDR scheme can achieve higher data rate than the conventional one-hot vector scheme with comparable BLER performance. For example, when the vector size is eight, the proposed GDR scheme can double the date rate of the one-hot vector scheme. Besides, the joint scheme of the two proposed schemes can create further benefits. The effect of signal-to-noise ratio (SNR) is analyzed for these DL-based communication systems. Numerical results show that training the autoencoder using data set with various SNR values can attain robust BLER performance under different channel conditions.

preprint2020arXiv

Effective Data Fusion with Generalized Vegetation Index: Evidence from Land Cover Segmentation in Agriculture

How can we effectively leverage the domain knowledge from remote sensing to better segment agriculture land cover from satellite images? In this paper, we propose a novel, model-agnostic, data-fusion approach for vegetation-related computer vision tasks. Motivated by the various Vegetation Indices (VIs), which are introduced by domain experts, we systematically reviewed the VIs that are widely used in remote sensing and their feasibility to be incorporated in deep neural networks. To fully leverage the Near-Infrared channel, the traditional Red-Green-Blue channels, and Vegetation Index or its variants, we propose a Generalized Vegetation Index (GVI), a lightweight module that can be easily plugged into many neural network architectures to serve as an additional information input. To smoothly train models with our GVI, we developed an Additive Group Normalization (AGN) module that does not require extra parameters of the prescribed neural networks. Our approach has improved the IoUs of vegetation-related classes by 0.9-1.3 percent and consistently improves the overall mIoU by 2 percent on our baseline.

preprint2020arXiv

Emergent conformal symmetry in non-unitary random dynamics of free fermions

We present random quantum circuit models for non-unitary quantum dynamics of free fermions in one spatial dimension. Numerical simulations reveal that the dynamics tends towards steady states with logarithmic violations of the entanglement area law and power law correlation functions. Moreover, starting with a short-range entangled many-body state, the dynamical evolution of entanglement and correlations quantitatively agrees with the predictions of two-dimensional conformal field theory with a space-like time direction. We argue that this behavior is generic in non-unitary free quantum dynamics with time-dependent randomness, and show that the emergent conformal dynamics of two-point functions arises out of a simple &#34;nonlinear master equation&#34;.

preprint2020arXiv

First-principles calculation of gate-tunable ferromagnetism in magic-angle twisted bilayer graphene under pressure

Magic-angle twisted bilayer graphene (MATBG) is notable as a highly tunable platform for investigating strongly correlated phenomena such as high-$T_c$ superconductivity and quantum spin liquids, due to easy control of doping level through gating and sensitive dependence of the magic angle on hydrostatic pressure. Experimental observations of correlated insulating states, unconventional superconductivity and ferromagnetism in MATBG indicate that this system exhibits rich exotic phases. In this work, using density functional theory calculations in conjunction with the effective screening medium method, we find the MATBG under pressure at a twisting angle of $2.88\unicode{xb0}$ and simulate how its electronic states evolve when doping level and out-of-plane electric field are gate-tuned. Our calculations show that, at doping levels between two electrons and four holes per moiré unit cell, a ferromagnetic solution with spin density localized at AA stacking sites is lower in energy than the nonmagnetic solution. The magnetic moment of this ferromagnetic state decreases with both electron and hole doping and vanishes at four electrons/holes doped per moiré unit cell. Hybridization between the flat bands at the Fermi level and the surrounding dispersive bands can take place at finite doping. Moreover, upon increasing the out-of-plane electric field at zero doping, a transition from the ferromagnetic state to the nonmagnetic one is seen. We also analyze the interlayer bonding character due to the flat bands via Wannier functions. Finally, we report trivial band topology of the flat bands in the ferromagnetic state at a certain doping level.

preprint2020arXiv

FOAL: Fast Online Adaptive Learning for Cardiac Motion Estimation

Motion estimation of cardiac MRI videos is crucial for the evaluation of human heart anatomy and function. Recent researches show promising results with deep learning-based methods. In clinical deployment, however, they suffer dramatic performance drops due to mismatched distributions between training and testing datasets, commonly encountered in the clinical environment. On the other hand, it is arguably impossible to collect all representative datasets and to train a universal tracker before deployment. In this context, we proposed a novel fast online adaptive learning (FOAL) framework: an online gradient descent based optimizer that is optimized by a meta-learner. The meta-learner enables the online optimizer to perform a fast and robust adaptation. We evaluated our method through extensive experiments on two public clinical datasets. The results showed the superior performance of FOAL in accuracy compared to the offline-trained tracking method. On average, the FOAL took only $0.4$ second per video for online optimization.

preprint2020arXiv

Hydrodynamics in lattice models with continuous non-Abelian symmetries

We develop a systematic effective field theory of hydrodynamics for many-body systems on the lattice with global continuous non-Abelian symmetries. Models with continuous non-Abelian symmetries are ubiquitous in physics, arising in diverse settings ranging from hot nuclear matter to cold atomic gases and quantum spin chains. In every dimension and for every flavor symmetry group, the low energy theory is a set of coupled noisy diffusion equations. Independence of the physics on the choice of canonical or microcanonical ensemble is manifest in our hydrodynamic expansion, even though the ensemble choice causes an apparent shift in quasinormal mode spectra. We use our formalism to explain why flavor symmetry is qualitatively different from hydrodynamics with other non-Abelian conservation laws, including angular momentum and charge multipoles. As a significant application of our framework, we study spin and energy diffusion in classical one-dimensional SU(2)-invariant spin chains, including the Heisenberg model along with multiple generalizations. We argue based on both numerical simulations and our effective field theory framework that non-integrable spin chains on a lattice exhibit conventional spin diffusion, in contrast to some recent predictions that diffusion constants grow logarithmically at late times. We show that the apparent enhancement of diffusion is due to slow equilibration caused by (non-Abelian) hydrodynamic fluctuations.

preprint2020arXiv

Motion Pyramid Networks for Accurate and Efficient Cardiac Motion Estimation

Cardiac motion estimation plays a key role in MRI cardiac feature tracking and function assessment such as myocardium strain. In this paper, we propose Motion Pyramid Networks, a novel deep learning-based approach for accurate and efficient cardiac motion estimation. We predict and fuse a pyramid of motion fields from multiple scales of feature representations to generate a more refined motion field. We then use a novel cyclic teacher-student training strategy to make the inference end-to-end and further improve the tracking performance. Our teacher model provides more accurate motion estimation as supervision through progressive motion compensations. Our student model learns from the teacher model to estimate motion in a single step while maintaining accuracy. The teacher-student knowledge distillation is performed in a cyclic way for a further performance boost. Our proposed method outperforms a strong baseline model on two public available clinical datasets significantly, evaluated by a variety of metrics and the inference time. New evaluation metrics are also proposed to represent errors in a clinically meaningful manner.

preprint2020arXiv

Probabilistic Load-Margin Assessment using Vine Copula and Gaussian Process Emulation

The increasing penetration of renewable energy along with the variations of the loads bring large uncertainties in the power system states that are threatening the security of power system planning and operation. Facing these challenges, this paper proposes a cost-effective, nonparametric method to quantify the impact of uncertain power injections on the load margins. First, we propose to generate system uncertain inputs via a novel vine copula due to its capability in simulating complex multivariate highly dependent model inputs. Furthermore, to reduce the prohibitive computational time required in the traditional Monte-Carlo method, we propose to use a nonparametric, Gaussian-process-emulator-based reduced-order model to replace the original complicated continuation power-flow model. This emulator allows us to execute the time-consuming continuation power-flow solver at the sampled values with a negligible computational cost. The simulations conducted on the IEEE 57-bus system, to which correlated renewable generation are attached, reveal the excellent performance of the proposed method.

preprint2020arXiv

Quantum butterfly effect in polarized Floquet systems

We explore quantum dynamics in Floquet many-body systems with local conservation laws in one spatial dimension, focusing on sectors of the Hilbert space which are highly polarized. We numerically compare the predicted charge diffusion constants and quantum butterfly velocity of operator growth between models of chaotic Floquet dynamics (with discrete spacetime translation invariance) and random unitary circuits which vary both in space and time. We find that for small but finite polarization per length (in the thermodynamic limit), the random unitary circuit correctly predicts the scaling of the butterfly velocity but incorrectly predicts the scaling of the diffusion constant. We argue that this is a consequence of quantum coherence on short time scales. Our work clarifies the settings in which random unitary circuits provide correct physical predictions for non-random chaotic systems, and sheds light into the origin of the slow down of the butterfly effect in highly polarized systems or at low temperature.

preprint2020arXiv

Real-Space Imaging of the Ordered Small Molecule Orientations in Porous Frameworks by Electron Microscopy

The real-space imaging of small molecules is always challenging under the electron microscopes, but highly demanded for investigating various nanoscale interactions, such as hydrogen bond and van der Waals (vdW) force. Especially, identifying the host-guest interactions in porous materials directly at the molecular level will bring a deeper insight into the behaviors of guest molecules during the sorption, catalysis, gas separation and energy storage. In this work, we directly resolved the ordered configurations of p-xylenes (PXs) adsorbed in ZSM-5 frameworks by the scanning transmission electron microscopy (STEM) with the integrated differential phase contrast (iDPC) technique to identify the host-guest vdW interactions. Based on these observations, we revealed that the PXs in one straight channel modified the channel geometry with a coherent orientation. And the adjacent straight channels were deformed up to 8.8% along the different directions corresponding to three dominant PX configurations, resulting a negligible overall expansion of ZSM-5 lattices. Then, we could also image the disorder and desorption of PXs in ZSM-5 channels during the in situ heating. This work not only helped us to study the host-guest vdW interactions and the sorption behaviors of PXs in ZSM-5, but also provided an efficient tool for further imaging and studying other single-molecule behaviors under STEMs.

preprint2020arXiv

Real-Time Cardiac Cine MRI with Residual Convolutional Recurrent Neural Network

Real-time cardiac cine MRI does not require ECG gating in the data acquisition and is more useful for patients who can not hold their breaths or have abnormal heart rhythms. However, to achieve fast image acquisition, real-time cine commonly acquires highly undersampled data, which imposes a significant challenge for MRI image reconstruction. We propose a residual convolutional RNN for real-time cardiac cine reconstruction. To the best of our knowledge, this is the first work applying deep learning approach to Cartesian real-time cardiac cine reconstruction. Based on the evaluation from radiologists, our deep learning model shows superior performance than compressed sensing.

preprint2020arXiv

Resolving asset pricing puzzles using price-impact

We solve in closed-form an equilibrium model in which a finite number of exponential investors continuously consume and trade with price-impact. Compared to the analogous Pareto-efficient equilibrium model, price-impact has an amplification effect on risk-sharing distortions that helps resolve the interest rate puzzle and the stock-price volatility puzzle and, to a lesser extent, affects the equity premium puzzle.

preprint2020arXiv

Revealing the configurations and host-guest interactions of small aromatics confined in porous frameworks by electron microscopy

Directly imaging the configurations of small molecules at the ambient temperatures will greatly promote the study of their chemical and physical properties, including the host-guest interactions of organics in porous materials during the adsorption, catalysis and energy storage. However, due to the current challenges on the small-molecule imaging by the (scanning) transmission electron microscopy ((S)TEM), we still have a lack of the molecular-level understandings on the host-guest interactions and other molecular behaviors. Here, we achieved the STEM imaging of various small aromatics confined in the MFI-type zeolite frameworks by using the integrated differential phase contrast (iDPC) technique. Due to the strong confinement effect in MFI channels, the 1D solid-like aromatic columns showed the coherent configurations, which were clearly resolved by enhancing the host-guest interactions. Then, we also evaluated the strength of host-guest interactions directly by the image analysis and revealed the desorption behaviors of confined aromatics during the in-situ heating process. These results not only helped us to reveal the configurations and host-guest interactions of small aromatics during the adsorption/desorption in porous materials, but also expanded the applications of STEM to further study other molecular behaviors in the real space.

preprint2020arXiv

Subsystem Rényi Entropy of Thermal Ensembles for SYK-like models

The Sachdev-Ye-Kitaev model is an $N$-modes fermionic model with infinite range random interactions. In this work, we study the thermal Rényi entropy for a subsystem of the SYK model using the path-integral formalism in the large-$N$ limit. The results are consistent with exact diagonalization [1] and can be well approximated by thermal entropy with an effective temperature [2] when subsystem size $M\leq N/2$. We also consider generalizations of the SYK model with quadratic random hopping term or $U(1)$ charge conservation.

preprint2020arXiv

The 1st Agriculture-Vision Challenge: Methods and Results

The first Agriculture-Vision Challenge aims to encourage research in developing novel and effective algorithms for agricultural pattern recognition from aerial images, especially for the semantic segmentation task associated with our challenge dataset. Around 57 participating teams from various countries compete to achieve state-of-the-art in aerial agriculture semantic segmentation. The Agriculture-Vision Challenge Dataset was employed, which comprises of 21,061 aerial and multi-spectral farmland images. This paper provides a summary of notable methods and results in the challenge. Our submission server and leaderboard will continue to open for researchers that are interested in this challenge dataset and task; the link can be found here.

preprint2020arXiv

The operator Lévy flight: light cones in chaotic long-range interacting systems

We argue that chaotic power-law interacting systems have emergent limits on information propagation, analogous to relativistic light cones, which depend on the spatial dimension $d$ and the exponent $α$ governing the decay of interactions. Using the dephasing nature of quantum chaos, we map the problem to a stochastic model with a known phase diagram. A linear light cone results for $α\ge d+1/2$. We also provide a Lévy flight (long-range random walk) interpretation of the results and show consistent numerical data for 1d long-range spin models with 200 sites.

preprint2020arXiv

Towards an Astronomical Science Platform: Experiences and Lessons Learned from Chinese Virtual Observatory

In the era of big data astronomy, next generation telescopes and large sky surveys produce data sets at the TB or even PB level. Due to their large data volumes, these astronomical data sets are extremely difficult to transfer and analyze using personal computers or small clusters. In order to offer better access to data, data centers now generally provide online science platforms that enable analysis close to the data. The Chinese Virtual Observatory (China-VO) is one of the member projects in the International Virtual Observatory Alliance and it is dedicated to providing a research and education environment where globally distributed astronomy archives are simple to find, access, and interoperate. In this study, we summarize highlights of the work conducted at the China-VO, as well the experiences and lessons learned during the full life-cycle management of astronomical data. Finally, We discuss the challenges and future trends for astronomical science platforms.