Source author record

Xu Liu

Xu Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Catalog footprint

What is connected

54 works

29 topics

4 close collaborators

preprint, 2026, arXiv

A General Framework for Multimodal LLM-Based Multimedia Understanding in Large-Scale Recommendation Systems

Conventional recommendation systems frequently fail to fully exploit the high-dimensional semantic signals inherent in multimedia content, thereby limiting the fidelity of user preference modeling. While Multimodal Large Language Models (MM-LLMs) offer robust mechanisms for interpreting such complex data, their integration into latency-constrained, industrial-scale architectures remains a significant challenge. To address this, we propose a generalized framework for MM-LLM-driven multimedia understanding. Our methodology employs a tripartite architecture encompassing content interpretation, representation extraction, and systematic pipeline integration, instantiated via a LLaMA2-based model that generates descriptive captions subsequently ingested as tokenized categorical features. Empirical evaluation demonstrates the efficacy of this approach, yielding a $0.35\%$ increase in offline AUC and a $0.02\%$ improvement in online metrics at scale, substantiating the practical viability of leveraging MM-LLMs to enhance large-scale recommendation performance.
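
The abstract says generated captions are "ingested as tokenized categorical features" but does not say how. A minimal sketch of one common way to do this, assuming a simple regex tokenizer and feature hashing; the function name `caption_to_feature_ids` and the bucket count are hypothetical and not taken from the paper:

```python
import hashlib
import re

NUM_BUCKETS = 1_000_003  # hypothetical size of the hashed feature space


def caption_to_feature_ids(caption: str, num_buckets: int = NUM_BUCKETS) -> list[int]:
    """Lowercase the caption, split it into alphanumeric tokens, and map
    each token to a stable bucket ID usable as a categorical feature."""
    tokens = re.findall(r"[a-z0-9]+", caption.lower())
    ids = []
    for tok in tokens:
        # md5 gives the same value in every process, unlike Python's salted hash().
        digest = hashlib.md5(tok.encode("utf-8")).digest()
        ids.append(int.from_bytes(digest[:8], "big") % num_buckets)
    return ids


ids = caption_to_feature_ids("A dog catches a frisbee on the beach")
print(len(ids))  # one ID per token
```

Repeated tokens map to the same ID, so a downstream embedding table can treat each ID like any other sparse categorical value.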

preprint, 2026, arXiv

Intelligent Nano-Fingerprinting: An Efficient and Precise Approach for Liquid Biopsy

Biological matrices are rich in information related to life processes, serving as invaluable media for assessing an individual's overall physiological status and its dynamic fluctuations, as well as crucial foundations for disease diagnosis. However, the inherent complexity of these matrices, coupled with our incomplete understanding of their full composition, presents significant challenges for comprehensive analysis and accurate diagnostic interpretation. The advent of single-molecule technologies has revolutionized biomedical research, enabling the direct observation of life processes at the molecular scale. We have proposed an Intelligent Nano-Fingerprinting strategy based on single-molecule nanopore technology, designed to capture the global molecular fingerprints of complex plasma matrices. Furthermore, we developed an intelligent algorithmic model capable of achieving precise classification of plasma samples. This approach is characterized by its simplicity, efficiency, and considerable potential for large-scale adoption and transferable applications.

preprint, 2026, arXiv

Learning Geometric Invariance for Gait Recognition

The goal of gait recognition is to extract identity-invariant features of an individual under various gait conditions, e.g., cross-view and cross-clothing. Most gait models strive to implicitly learn the common traits across different gait conditions in a data-driven manner to pull different gait conditions closer for recognition. However, relatively few studies have explicitly explored the inherent relations between different gait conditions. For this purpose, we attempt to establish connections among different gait conditions and propose a new perspective to achieve gait recognition: variations in different gait conditions can be approximately viewed as a combination of geometric transformations. In this case, all we need is to determine the types of geometric transformations and achieve geometric invariance, then identity invariance naturally follows. As an initial attempt, we explore three common geometric transformations (i.e., Reflect, Rotate, and Scale) and design a $\mathcal{R}$eflect-$\mathcal{R}$otate-$\mathcal{S}$cale invariance learning framework, named ${\mathcal{RRS}}$-Gait. Specifically, it first flexibly adjusts the convolution kernel based on the specific geometric transformations to achieve approximate feature equivariance. Then these three equivariant-aware features are respectively fed into a global pooling operation for final invariance-aware learning. Extensive experiments on four popular gait datasets (Gait3D, GREW, CCPG, SUSTech1K) show superior performance across various gait conditions.
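
The step from equivariant features to invariance through global pooling can be illustrated on a toy 1D signal. This is only a sketch of the general principle for the Reflect case, not the RRS-Gait architecture: it assumes a plain cross-correlation, a kernel paired with its mirrored copy, and global max pooling, and all names are hypothetical.

```python
def correlate(signal, kernel):
    """Valid-mode 1D cross-correlation, returned as a list of responses."""
    m = len(kernel)
    return [
        sum(signal[i + j] * kernel[j] for j in range(m))
        for i in range(len(signal) - m + 1)
    ]


def reflect_invariant_feature(signal, kernel):
    # Responses to the kernel and to its mirrored copy: mirroring the input
    # swaps the two response maps and reverses their positions (equivariance).
    responses = correlate(signal, kernel) + correlate(signal, kernel[::-1])
    # Global max pooling discards position and orientation, so the pooled
    # value is unchanged when the input is mirrored (invariance).
    return max(responses)


x = [0, 1, 3, 2, 0, 0, 5, 1]
k = [1, 2, -1]
print(reflect_invariant_feature(x, k) == reflect_invariant_feature(x[::-1], k))
```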

preprint, 2026, arXiv

Meta-Backscatter: Long-Distance Battery-Free Metamaterial-Backscatter Sensing and Communication

Battery-free Internet of Things (BF-IoT) enabled by backscatter communication is a rapidly evolving technology offering advantages of low cost, ultra-low power consumption, and robustness. However, the practical deployment of BF-IoT is significantly constrained by the limited communication range of common backscatter tags, which typically operate with a range of merely a few meters due to inherent round-trip path loss. Meta-backscatter systems that utilize metamaterial tags present a promising solution, retaining the inherent advantages of BF-IoT while breaking the critical communication range barrier. By leveraging densely paved sub-wavelength units to concentrate the reflected signal power, metamaterial tags enable a significant communication range extension over existing BF-IoT tags that employ omni-directional antennas. In this paper, we synthesize the principles and paradigms of metamaterial sensing to establish a unified design framework and a forward-looking research roadmap. Specifically, we first provide an overview of backscatter communication, encompassing its development history, working principles, and tag classification. We then introduce the design methodology for both metamaterial tags and their compatible transceivers. Moreover, we present the implementation of a meta-backscatter system prototype and report the experimental results based on it. Finally, we conclude by highlighting key challenges and outlining potential avenues for future research.

preprint, 2026, arXiv

Quantum tunnelling-integrated optoplasmonic nanotrap enables conductance visualisation of individual proteins

Biological electron transfer (ET) relies on quantum mechanical tunnelling through a dynamically folded protein. Yet, the spatiotemporal coupling between structural fluctuations and electron flux remains poorly understood, largely due to limitations in existing experimental techniques, such as ensemble averaging and non-physiological operating conditions. Here, we introduce a quantum tunnelling-integrated optoplasmonic nanotrap (QTOP-trap), an optoelectronic platform that combines plasmonic optical trapping with real-time quantum tunnelling measurements. This label-free approach enables single-molecule resolution of protein conductance in physiological electrolytes, achieving sub-3 nm spatial precision and 10-μs temporal resolution. By synchronising optoelectronic measurements, QTOP-trap resolves protein-specific conductance signatures and directly correlates tertiary structure dynamics with conductance using a "protein switch" strategy. This methodology establishes a universal framework for dissecting non-equilibrium ET mechanisms in individual conformational-active proteins, with broad implications for bioenergetics research and biomimetic quantum device design.

preprint, 2026, arXiv

ROAD: Adaptive Data Mixing for Offline-to-Online Reinforcement Learning via Bi-Level Optimization

Offline-to-online reinforcement learning harnesses the stability of offline pretraining and the flexibility of online fine-tuning. A key challenge lies in the non-stationary distribution shift between offline datasets and the evolving online policy. Common approaches often rely on static mixing ratios or heuristic-based replay strategies, which lack adaptability to different environments and varying training dynamics, resulting in a suboptimal tradeoff between stability and asymptotic performance. In this work, we propose Reinforcement Learning with Optimized Adaptive Data-mixing (ROAD), a dynamic plug-and-play framework that automates the data replay process. We identify a fundamental objective misalignment in existing approaches. To tackle this, we formulate the data selection problem as a bi-level optimization process, interpreting the data mixing strategy as a meta-decision governing the policy performance (outer-level) during online fine-tuning, while the conventional Q-learning updates operate at the inner level. To make it tractable, we propose a practical algorithm using a multi-armed bandit mechanism. This is guided by a surrogate objective approximating the bi-level gradient, which simultaneously maintains offline priors and prevents value overestimation. Our empirical results demonstrate that this approach consistently outperforms existing data replay methods across various datasets, eliminating the need for manual, context-specific adjustments while achieving superior stability and asymptotic performance.
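
The abstract does not specify which bandit algorithm or surrogate objective ROAD uses. As a rough illustration of treating the mixing ratio as a bandit arm, the sketch below uses standard UCB1 as a stand-in, with a made-up deterministic reward (`toy_reward`) in place of any real fine-tuning signal; the candidate ratios are likewise hypothetical.

```python
import math

RATIOS = [0.0, 0.25, 0.5, 0.75, 1.0]  # hypothetical offline-data fractions (the arms)


def toy_reward(ratio: float) -> float:
    """Made-up stand-in for a fine-tuning signal; it peaks at ratio 0.25."""
    return 1.0 - abs(ratio - 0.25)


def run_ucb1(rounds: int = 2000) -> list[int]:
    counts = [0] * len(RATIOS)    # pulls per arm
    totals = [0.0] * len(RATIOS)  # summed reward per arm
    for t in range(1, rounds + 1):
        if t <= len(RATIOS):
            arm = t - 1  # pull every arm once before using the UCB index
        else:
            arm = max(
                range(len(RATIOS)),
                key=lambda i: totals[i] / counts[i]
                + math.sqrt(2 * math.log(t) / counts[i]),
            )
        counts[arm] += 1
        totals[arm] += toy_reward(RATIOS[arm])
    return counts


counts = run_ucb1()
best_ratio = RATIOS[counts.index(max(counts))]
print(best_ratio, counts)
```

In a real training loop the reward would come from the learner itself, and it would drift as the policy changes, which is the non-stationarity the abstract highlights.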

preprint, 2023, arXiv

Towards Exascale Computation for Turbomachinery Flows

A state-of-the-art large eddy simulation code has been developed to solve compressible flows in turbomachinery. The code has been engineered with a high degree of scalability, enabling it to effectively leverage the many-core architecture of the new Sunway system. A consistent performance of 115.8 DP-PFLOPs has been achieved on a high-pressure turbine cascade consisting of over 1.69 billion mesh elements and 865 billion degrees of freedom (DOFs). By leveraging a high-order unstructured solver and its portability to large heterogeneous parallel systems, we have progressed towards solving the grand challenge problem outlined by NASA, which involves a time-dependent simulation of a complete engine, incorporating all the aerodynamic and heat transfer components.

preprint, 2022, arXiv

ApolloRL: a Reinforcement Learning Platform for Autonomous Driving

We introduce ApolloRL, an open platform for research in reinforcement learning for autonomous driving. The platform provides a complete closed-loop pipeline with training, simulation, and evaluation components. It comes with 300 hours of real-world data in driving scenarios and popular baselines such as Proximal Policy Optimization (PPO) and Soft Actor-Critic (SAC) agents. We elaborate in this paper on the architecture and the environment defined in the platform. In addition, we discuss the performance of the baseline agents in the ApolloRL environment.

preprint, 2022, arXiv

BinGo: Pinpointing Concurrency Bugs in Go via Binary Analysis

Golang (Go for short) has become popular for building concurrent programs in distributed systems. Among its distinctive features, Go employs lightweight goroutines to support a high degree of parallelism in user space. Moreover, Go leverages channels to enable explicit communication among threads. However, recent studies show that concurrency bugs are not uncommon in Go applications. Pinpointing these concurrency bugs in real Go applications is both important and challenging. Existing approaches are mostly based on compiler-aided static or dynamic analysis, which has two limitations. First, existing approaches require the availability and recompilation of the source code, which works well in testing but not in production environments where source code is unavailable for applications or external libraries. Second, existing approaches work on pure Go code bases only, not on programs that mix Go with other languages. To address these limitations, we develop BinGo, the first tool to identify concurrency bugs in Go applications via dynamic binary analysis. BinGo correlates binary execution with Go semantics and employs novel bug detection algorithms. BinGo is an end-to-end tool that is ready for deployment in the production environment with no modification to source code, compilers, or runtimes in the Go ecosystem. Our experiments show that BinGo has a high coverage of concurrency bugs with no false positives. We are able to use BinGo to identify concurrency bugs in real applications with moderate overhead.

preprint, 2022, arXiv

Deep Learning-based Occluded Person Re-identification: A Survey

Occluded person re-identification (Re-ID) aims at addressing the occlusion problem when retrieving the person of interest across multiple cameras. With the promotion of deep learning technology and the increasing demand for intelligent video surveillance, the frequent occlusion in real-world applications has made occluded person Re-ID draw considerable interest from researchers. A large number of occluded person Re-ID methods have been proposed while there are few surveys that focus on occlusion. To fill this gap and help boost future research, this paper provides a systematic survey of occluded person Re-ID. Through an in-depth analysis of the occlusion in person Re-ID, most existing methods are found to only consider part of the problems brought by occlusion. Therefore, we review occlusion-related person Re-ID methods from the perspective of issues and solutions. We summarize four issues caused by occlusion in person Re-ID, i.e., position misalignment, scale misalignment, noisy information, and missing information. The occlusion-related methods addressing different issues are then categorized and introduced accordingly. After that, we summarize and compare the performance of recent occluded person Re-ID methods on four popular datasets: Partial-ReID, Partial-iLIDS, Occluded-ReID, and Occluded-DukeMTMC. Finally, we provide insights on promising future research directions.

preprint, 2022, arXiv

Functional varying index coefficient model for dynamic gene-environment interactions

Rooted in genetics, human complex diseases are largely influenced by environmental factors. Existing literature has shown the power of integrative gene-environment interaction analysis by considering the joint effect of environmental mixtures on disease risk. In this work, we propose a functional varying index coefficient model for longitudinal measurements of a phenotypic trait together with multiple environmental variables, and assess how the genetic effects on a longitudinal disease trait are nonlinearly modified by a mixture of environmental influences. We derive an estimation procedure for the nonparametric functional varying index coefficients under the quadratic inference function and penalized spline framework. Theoretical results such as estimation consistency and asymptotic normality of the estimates are established. In addition, we propose a hypothesis testing procedure to assess the significance of the nonparametric index coefficient function. We evaluate the performance of our estimation and testing procedure through Monte Carlo simulation studies. The proposed method is illustrated by application to a real data set from a pain sensitivity study in which SNP effects are nonlinearly modulated by the combination of dosage levels and other environmental variables to affect patients' blood pressure and heart rate.

preprint, 2022, arXiv

Mars Entry Trajectory Planning with Range Discretization and Successive Convexification

This paper develops a sequential convex programming approach for Mars entry trajectory planning by range discretization. To improve the accuracy of numerical integration, the range of entry trajectory is selected as the independent variable rather than time or energy. A dilation factor is employed to normalize the entry dynamics and integration interval of the performance index so that the difficult free-final-time programming problem can be converted to a fixed-final-range optimization problem. The bank angle rate with respect to the range is introduced as the new control input in order to decouple the control from the state and facilitate convexification of constraints on the bank angle and its rate. The nonlinear bank angle rate constraint is further relaxed into a linear one via inequality relaxation. Moreover, the nonconvex minimum-time performance index is convexified by regarding flight time as a state variable. Then, the Mars entry trajectory planning problem can be formulated into the framework of convex programming after linearization. By range discretization and successive convexification, the reformulated Mars entry trajectory planning problem is transcribed into a series of convex optimization sub-problems that can be sequentially solved using the convex programming algorithm. The virtual control and adaptive trust-region techniques are employed to improve the accuracy, robustness, and computation efficiency of the algorithm. Numerical simulations with comparative studies are presented to demonstrate the convergence performance and efficiency of the proposed algorithm.

preprint, 2022, arXiv

Multispectral large-area X-ray imaging enabled by stacked multilayer scintillators

Conventional energy-integration black-and-white X-ray imaging lacks spectral information about the X-ray photons. Although X-ray spectra (energies) can be distinguished by the photon-counting technique, typically with CdZnTe detectors, this technique is very challenging to apply to large-area flat-panel X-ray imaging (FPXI). Herein, we design multilayer stacked scintillators with different X-ray absorption capabilities and scintillation spectra. In this scenario, the X-ray energy can be discriminated by detecting the emission spectrum of each scintillator, so multispectral X-ray imaging can be easily obtained with a color or multispectral visible-light camera in a single X-ray shot. To verify this idea, stacked multilayer scintillators based on several emerging metal halides were fabricated by a cost-effective and scalable solution process, and proof-of-concept multi-energy FPXI was experimentally demonstrated. The dual-energy X-ray image of a bone-muscle model clearly showed details that were invisible in conventional energy-integration FPXI. By stacking four layers of specifically designed multilayer scintillators with appropriate thicknesses, a prototype FPXI with four energy channels was realized, proving its extendibility to multispectral or even hyperspectral X-ray imaging. This study provides a facile and effective strategy to realize energy-resolved flat-panel X-ray imaging.

preprint, 2022, arXiv

OJXPerf: Featherlight Object Replica Detection for Java Programs

Memory bloat is an important source of inefficiency in complex production software, especially in software written in managed languages such as Java. Prior approaches to this problem have focused on identifying objects that outlive their life span. Few studies have, however, looked into whether and to what extent myriad objects of the same type are identical. A quantitative assessment of identical objects with code-level attribution can assist developers in refactoring code to eliminate object bloat, and favor reuse of existing object(s). The result is reduced memory pressure, reduced allocation and garbage collection, enhanced data locality, and reduced re-computation, all of which result in superior performance. We develop OJXPerf, a lightweight sampling-based profiler, which probabilistically identifies identical objects. OJXPerf employs hardware performance monitoring units (PMU) in conjunction with hardware debug registers to sample and compare field values of different objects of the same type allocated at the same calling context but potentially accessed at different program points. The result is a lightweight measurement, a combination of object allocation contexts and usage contexts ordered by duplication frequency. This class of duplicated objects is relatively easier to optimize. OJXPerf incurs 9% runtime and 6% memory overheads on average. We empirically show the benefit of OJXPerf by using its profiles to instruct us to optimize a number of Java programs, including well-known benchmarks and real-world applications. The results show a noticeable reduction in memory usage (up to 11%) and a significant speedup (up to 25%).

preprint, 2022, arXiv

Rapid Elastic Architecture Search under Specialized Classes and Resource Constraints

In many real-world applications, we often need to handle various deployment scenarios, where the resource constraint and the superclass of interest corresponding to a group of classes are dynamically specified. How to efficiently deploy deep models for diverse deployment scenarios is a new challenge. Previous NAS approaches seek to design architectures for all classes simultaneously, which may not be optimal for some individual superclasses. A straightforward solution is to search an architecture from scratch for each deployment scenario, which however is computation-intensive and impractical. To address this, we present a novel and general framework, called Elastic Architecture Search (EAS), permitting instant specializations at runtime for diverse superclasses with various resource constraints. To this end, we first propose to effectively train an over-parameterized network via a superclass dropout strategy during training. In this way, the resulting model is robust to the subsequent superclasses dropping at inference time. Based on the well-trained over-parameterized network, we then propose an efficient architecture generator to obtain promising architectures within a single forward pass. Experiments on three image classification datasets show that EAS is able to find more compact networks with better performance while remarkably being orders of magnitude faster than state-of-the-art NAS methods, e.g., outperforming OFA (once-for-all) by 1.3% on Top-1 accuracy at a budget around 361M #MAdds on ImageNet-10. More critically, EAS is able to find compact architectures within 0.1 second for 50 deployment scenarios.

preprint, 2022, arXiv

Temperature Field Inversion of Heat-Source Systems via Physics-Informed Neural Networks

Temperature field inversion of heat-source systems (TFI-HSS) with limited observations is essential for monitoring system health. Although some methods, such as interpolation, have been proposed to solve TFI-HSS, these methods ignore correlations between data constraints and physics constraints, resulting in low precision. In this work, we develop a physics-informed neural network-based temperature field inversion (PINN-TFI) method to solve the TFI-HSS task and a coefficient matrix condition number based position selection of observations (CMCN-PSO) method to select optimal positions for noisy observations. For the TFI-HSS task, the PINN-TFI method encodes constraint terms into the loss function, so the task is transformed into an optimization problem of minimizing the loss function. In addition, we have found that noisy observations significantly affect the reconstruction performance of the PINN-TFI method. To alleviate the effect of noisy observations, the CMCN-PSO method is proposed to find optimal positions, where the condition number of observations is used to evaluate positions. The results demonstrate that the PINN-TFI method can significantly improve prediction precision and that the CMCN-PSO method can find good positions to acquire a more robust temperature field.

preprint, 2022, arXiv

The OARF Benchmark Suite: Characterization and Implications for Federated Learning Systems

This paper presents and characterizes an Open Application Repository for Federated Learning (OARF), a benchmark suite for federated machine learning systems. Previously available benchmarks for federated learning have focused mainly on synthetic datasets and use a limited number of applications. OARF mimics more realistic application scenarios, using publicly available datasets as different data silos in image, text and structured data. Our characterization shows that the benchmark suite is diverse in data size, distribution, feature distribution and learning task complexity. The extensive evaluations with reference implementations reveal future research opportunities for important aspects of federated learning systems. We have developed reference implementations and evaluated the important aspects of federated learning, including model accuracy, communication cost, throughput and convergence time. Through these evaluations, we made some interesting findings, for example that federated learning can effectively increase end-to-end throughput.

preprint, 2022, arXiv

Transformers Meet Visual Learning Understanding: A Comprehensive Review

The dynamic attention mechanism and global modeling ability give the Transformer strong feature learning ability. In recent years, the Transformer has become comparable to CNN-based methods in computer vision. This review investigates the current research progress of the Transformer in image and video applications and gives a comprehensive overview of the Transformer in visual learning understanding. First, the attention mechanism, which plays an essential part in the Transformer, is reviewed. Second, the visual Transformer model and the principle of each module are introduced. Third, the existing Transformer-based models are investigated, and their performance is compared in visual learning understanding applications. Three image tasks and two video tasks of computer vision are investigated. The former include image classification, object detection, and image segmentation. The latter comprise object tracking and video classification. It is significant to compare different models' performance in various tasks on several public benchmark data sets. Finally, ten general problems are summarized, and the development prospects of the visual Transformer are given.

preprint, 2022, arXiv

Weighted Simultaneous Algebra Reconstruction Technique (wSART) for Additive Light Field Synthesis

We apply an iterative weighting scheme for additive light field synthesis. Unlike previous work optimizing additive light field evenly over viewpoints, we constrain the optimization to deliver a reconstructed light field of high image quality for viewpoints of large weight.

preprint, 2021, arXiv

A novel meta-learning initialization method for physics-informed neural networks

Physics-informed neural networks (PINNs) have been widely used to solve various scientific computing problems. However, large training costs limit PINNs for some real-time applications. Although some works have been proposed to improve the training efficiency of PINNs, few consider the influence of initialization. To this end, we propose a New Reptile initialization based Physics-Informed Neural Network (NRPINN). The original Reptile algorithm is a meta-learning initialization method based on labeled data. PINNs can be trained with less labeled data or even without any labeled data by adding partial differential equations (PDEs) as a penalty term into the loss function. Inspired by this idea, we propose the new Reptile initialization to sample more tasks from the parameterized PDEs and adapt the penalty term of the loss. The new Reptile initialization can acquire initialization parameters from related tasks by supervised, unsupervised, and semi-supervised learning. Then, PINNs with initialization parameters can efficiently solve PDEs. Besides, the new Reptile initialization can also be used for the variants of PINNs. Finally, we demonstrate and verify the NRPINN considering both forward problems, including solving Poisson, Burgers, and Schrödinger equations, as well as inverse problems, where unknown parameters in the PDEs are estimated. Experimental results show that the NRPINN training is much faster and achieves higher accuracy than PINNs with other initialization methods.
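
For readers unfamiliar with the original Reptile update that this work builds on, here is a minimal sketch on a toy problem. It is not NRPINN: the PDE-based loss is replaced by a one-parameter quadratic loss `(w - target)**2`, tasks are visited round-robin instead of being sampled from parameterized PDEs, and all names and hyperparameters are hypothetical.

```python
def inner_sgd(w: float, target: float, lr: float = 0.1, steps: int = 5) -> float:
    """A few gradient steps on the toy task loss (w - target)**2."""
    for _ in range(steps):
        w -= lr * 2.0 * (w - target)  # gradient of (w - target)**2 is 2(w - target)
    return w


def reptile_init(targets, meta_lr: float = 0.1, iterations: int = 300) -> float:
    theta = 0.0  # shared initialization being meta-learned
    for it in range(iterations):
        target = targets[it % len(targets)]  # pick a task (round-robin here)
        phi = inner_sgd(theta, target)       # adapt to that task from theta
        theta += meta_lr * (phi - theta)     # Reptile step toward the adapted weights
    return theta


theta0 = reptile_init([1.0, 2.0, 3.0])
print(round(theta0, 3))
```

The learned initialization settles near the middle of the task targets, so a few inner steps reach any single task quickly. Faster convergence from a good initialization is the effect the abstract reports for PINN training.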

preprint, 2021, arXiv

NumaPerf: Predictive and Full NUMA Profiling

Achieving optimal performance for parallel applications on the NUMA architecture is extremely challenging, which necessitates the assistance of profiling tools. However, existing NUMA-profiling tools share some similar shortcomings, such as portability, effectiveness, and helpfulness issues. This paper proposes a novel profiling tool, NumaPerf, that overcomes these issues. NumaPerf aims to identify potential performance issues for any NUMA architecture, instead of only on the current hardware. To achieve this, NumaPerf focuses on memory sharing patterns between threads, instead of real remote accesses. NumaPerf further detects potential thread migrations and load imbalance issues that could significantly affect the performance but are omitted by existing profilers. NumaPerf also separates cache coherence issues that may require different fix strategies. Based on our extensive evaluation, NumaPerf is able to identify more performance issues than any existing tool, while fixing these bugs leads to up to 5.94x performance speedup.

preprint, 2020, arXiv

An entanglement-based quantum network based on symmetric dispersive optics quantum key distribution

Quantum key distribution (QKD) is a crucial technology for information security in the future. Developing simple and efficient ways to establish QKD among multiple users is important for extending the applications of QKD in communication networks. Herein, we proposed a scheme of symmetric dispersive optics QKD (DO-QKD) and demonstrated an entanglement-based quantum network based on it. In the experiment, a broadband entangled photon pair source was shared by end users via wavelength and space division multiplexing. The wide spectrum of generated entangled photon pairs was divided into 16 combinations of frequency-conjugate channels. Photon pairs in each channel combination supported a fully-connected subnet with 8 users via a passive beam splitter. The results showed that an entanglement-based QKD network of over 100 users could be supported by one entangled photon pair source in this architecture. It has great potential for applications in local quantum networks with large numbers of users.
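
One way to read the numbers in this abstract is as simple multiplication, sketched below. It assumes that each of the 16 channel combinations serves its own, disjoint 8-user subnet; the abstract does not state this explicitly, so treat the total as an illustration and not as the paper's own calculation.

```python
from math import comb

CHANNEL_COMBINATIONS = 16  # frequency-conjugate channel combinations (from the abstract)
USERS_PER_SUBNET = 8       # users in each fully-connected subnet (from the abstract)

# Assumption: subnets do not share users, so the counts simply multiply.
total_users = CHANNEL_COMBINATIONS * USERS_PER_SUBNET

# A fully-connected subnet needs one pairwise link per unordered user pair.
links_per_subnet = comb(USERS_PER_SUBNET, 2)

print(total_users, links_per_subnet)  # 128 28
```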

preprint, 2020, arXiv

Feature Super-Resolution Based Facial Expression Recognition for Multi-scale Low-Resolution Faces

Facial expression recognition (FER) on low-resolution images is necessary for applications such as group expression recognition in crowd scenarios (stations, classrooms, etc.). Classifying a small facial image into the right expression category is still a challenging task. The main cause of this problem is the loss of discriminative features due to reduced resolution. Super-resolution methods are often used to enhance low-resolution images, but their performance on the FER task is limited on images of very low resolution. In this work, inspired by feature super-resolution methods for object detection, we propose a novel generative adversarial network-based feature-level super-resolution method for robust facial expression recognition (FSR-FER). In particular, a pre-trained FER model is employed as the feature extractor, and a generator network G and a discriminator network D are trained with features extracted from images of low resolution and original high resolution. The generator network G tries to transform features of low-resolution images into more discriminative ones by making them closer to those of the corresponding high-resolution images. For better classification performance, we also propose an effective classification-aware loss re-weighting strategy based on the classification probability calculated by a fixed FER model to make our model focus more on samples that are easily misclassified. Experimental results on the Real-World Affective Faces (RAF) Database demonstrate that our method achieves satisfying results on various down-sampling factors with a single model and performs better on low-resolution images than methods that apply image super-resolution and expression recognition separately.

preprint, 2020, arXiv

MultiResolution Attention Extractor for Small Object Detection

Small objects are difficult to detect because of their low resolution and small size. The existing small object detection methods mainly focus on data preprocessing or narrowing the differences between large and small objects. Inspired by the "attention" mechanism of human vision, we exploit two feature extraction methods to mine the most useful information of small objects. Both methods are based on multiresolution feature extraction. We initially design and explore the soft attention method, but we find that its convergence speed is slow. Then we present the second method, an attention-based feature interaction method, called a MultiResolution Attention Extractor (MRAE), showing significant improvement as a generic feature extractor in small object detection. After each building block in the vanilla feature extractor, we append a small network to generate attention weights followed by a weighted-sum operation to get the final attention maps. Our attention-based feature extractor achieves 2.0 times the AP of the "hard" attention counterpart (plain architecture) on the COCO small object detection benchmark, proving that MRAE can capture useful location and contextual information through adaptive learning.
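
The "attention weights followed by a weighted-sum operation" step can be sketched as follows. This is a simplified illustration, not MRAE itself: it assumes one scalar score per resolution level, softmax normalization of those scores, and feature vectors already brought to a common length. The names `softmax` and `fuse` are hypothetical.

```python
import math


def softmax(scores):
    """Numerically stable softmax over a list of scalar scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def fuse(features, scores):
    """Attention-weighted sum of equal-length feature vectors, one per resolution."""
    weights = softmax(scores)
    dim = len(features[0])
    return [sum(w * f[d] for w, f in zip(weights, features)) for d in range(dim)]


features = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]  # toy features from three resolutions
fused = fuse(features, scores=[0.0, 0.0, 0.0])   # equal scores give a plain average
print(fused)
```

In the paper the scores would come from the small appended network; here they are passed in directly.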

preprint2020arXiv

ScalAna: Automating Scaling Loss Detection with Graph Analysis

Scaling a parallel program to modern supercomputers is challenging due to inter-process communication, Amdahl's law, and resource contention. Performance analysis tools for finding such scaling bottlenecks are based on either profiling or tracing. Profiling incurs low overheads but does not capture the detailed dependencies needed for root-cause analysis. Tracing collects all information at prohibitive overheads. In this work, we design ScalAna, which uses static analysis techniques to achieve the best of both worlds: it enables the analyzability of traces at a cost similar to profiling. ScalAna first leverages static compiler techniques to build a Program Structure Graph, which records the main computation and communication patterns as well as the program's control structures. At runtime, we adopt lightweight techniques to collect performance data according to the graph structure and generate a Program Performance Graph. With this graph, we propose a novel approach, called backtracking root cause detection, which can automatically and efficiently detect the root cause of scaling loss. We evaluate ScalAna with real applications. Results show that our approach can effectively locate the root cause of scaling loss for real applications and incurs 1.73% overhead on average for up to 2,048 processes. We achieve up to 11.11% performance improvement by fixing the root causes detected by ScalAna on 2,048 processes.

preprint2020arXiv

Variational Inference-Based Dropout in Recurrent Neural Networks for Slot Filling in Spoken Language Understanding

This paper proposes to generalize the variational recurrent neural network (RNN) with variational inference (VI)-based dropout regularization employed for the long short-term memory (LSTM) cells to more advanced RNN architectures like gated recurrent unit (GRU) and bi-directional LSTM/GRU. The new variational RNNs are employed for slot filling, which is an intriguing but challenging task in spoken language understanding. The experiments on the ATIS dataset suggest that the variational RNNs with the VI-based dropout regularization can significantly improve the naive dropout regularization RNNs-based baseline systems in terms of F-measure. Particularly, the variational RNN with bi-directional LSTM/GRU obtains the best F-measure score.

preprint2019arXiv

SLOAM: Semantic Lidar Odometry and Mapping for Forest Inventory

This paper describes an end-to-end pipeline for tree diameter estimation based on semantic segmentation and lidar odometry and mapping. Accurate mapping of this type of environment is challenging since the ground and the trees are surrounded by leaves, thorns and vines, and the sensor typically experiences extreme motion. We propose a semantic feature based pose optimization that simultaneously refines the tree models while estimating the robot pose. The pipeline utilizes a custom virtual reality tool for labeling 3D scans that is used to train a semantic segmentation network. The masked point cloud is used to compute a trellis graph that identifies individual instances and extracts relevant features that are used by the SLAM module. We show that traditional lidar and image based methods fail in the forest environment on both Unmanned Aerial Vehicle (UAV) and hand-carry systems, while our method is more robust, scalable, and automatically generates tree diameter estimations.

preprint2016arXiv

An Internal Observability Estimate for Stochastic Hyperbolic Equations

This paper is addressed to establishing an internal observability estimate for some linear stochastic hyperbolic equations. The key is to establish a new global Carleman estimate for forward stochastic hyperbolic equations in the $L^2$-space. Different from the deterministic case, a delicate analysis of the adaptedness for some stochastic processes is required in the stochastic setting.

preprint2016arXiv

Correctness of Hierarchical MCS Locks with Timeout

This manuscript serves as a correctness proof of the Hierarchical MCS locks with Timeout (HMCS-T) described in our paper titled "An Efficient Abortable-locking Protocol for Multi-level NUMA Systems" appearing in the proceedings of the 22nd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming. HMCS-T is a very involved protocol. The system is stateful; the values of prior acquisition efforts affect the subsequent acquisition efforts. Also, the status of successors, predecessors, ancestors, and descendants affects the steps followed by the protocol. The ability to make the protocol fully non-blocking leads to modifications to the "next" field, which causes deviation from the original MCS lock protocol both in acquisition and release. At several places, unconditional field updates are replaced with SWAP or CAS operations. We follow a multi-step approach to prove the correctness of HMCS-T. To demonstrate the correctness of the HMCS-T lock, we use the Spin model checker. Model checking causes a combinatorial explosion even to simulate a handful of threads. First, we understand the minimal, sufficient configurations necessary to prove safety properties of a single level of lock in the tree. We construct HMCS-T locks that represent these configurations. We model check these configurations, which proves the correctness of components of an HMCS-T lock. Finally, building upon these facts, we argue logically for the correctness of HMCS-T<n>.

preprint2016arXiv

Finite Codimensional Controllability for Evolution Equations

Motivated by infinite-dimensional optimal control problems with endpoint state constraints, in this Note, we introduce the notion of finite codimensional exact controllability for evolution equations. It is shown that this new controllability is equivalent to the finite codimensionality condition used in the literature to guarantee Pontryagin's maximum principle. As examples, LQ problems with fixed endpoint state constraints for a wave and a heat equation are analyzed, respectively.

preprint2015arXiv

A weighted identity for stochastic partial differential operators and its applications

In this paper, a pointwise weighted identity for some stochastic partial differential operators (with complex principal parts) is established. This identity presents a unified approach in studying the controllability, observability and inverse problems for some deterministic/stochastic partial differential equations. Based on this identity, one can deduce all the known Carleman estimates and observability results, for some deterministic partial differential equations, stochastic heat equations, stochastic Schrödinger equations and stochastic transport equations. Meanwhile, as its new application, we study an inverse problem for linear stochastic complex Ginzburg-Landau equations.

preprint2015arXiv

Anomalous High-Energy Waterfall-Like Electronic Structure in 5d Transition Metal Oxide Sr2IrO4 with a Strong Spin-Orbit Coupling

The layered 5d transition metal oxides like Sr2IrO4 have attracted significant interest recently due to a number of exotic and new phenomena induced by the interplay between the spin-orbit coupling, bandwidth W and on-site Coulomb correlation U. In contrast to a metallic behavior expected from the Mott-Hubbard model due to more spatially extended 5d orbitals and moderate U, an insulating ground state has been observed in Sr2IrO4. Such an insulating behavior can be understood by an effective J_eff=1/2 Mott insulator model by incorporating both electron correlation and strong spin-orbital coupling, although its validity remains under debate at present. In particular, Sr2IrO4 exhibits a number of similarities to the high temperature cuprate superconductors in the crystal structure, electronic structure, magnetic structure, and even possible high temperature superconductivity. Here we report a new observation of the anomalous high energy electronic structure in Sr2IrO4. By taking high-resolution angle-resolved photoemission measurements on Sr2IrO4 over a wide energy range, we have revealed that the high energy electronic structures show unusual nearly-vertical bands that extend over a large energy range. Such anomalous high energy behaviors resemble the high energy waterfall features observed in the cuprate superconductors, adding one more important similarity between these two systems. While strong electron correlation plays an important role in producing high energy waterfall features in the cuprate superconductors, the revelation of the high energy anomalies in Sr2IrO4 points to a novel route in generating exotic electronic excitations from the strong spin-orbit coupling and a moderate electron correlation.

preprint2015arXiv

Atomically thin spherical shell-shaped superscatterers based on Bohr model

Graphene monolayers can be used for atomically thin three-dimensional shell-shaped superscatterer designs. Due to the excitation of the first-order resonance of transverse magnetic (TM) graphene plasmons, the scattering cross section of the bare subwavelength dielectric particle is enhanced significantly by five orders of magnitude. The superscattering phenomenon can be intuitively understood and interpreted with the Bohr model. Besides, based on the analysis of the Bohr model, it is shown that, contrary to the TM case, superscattering is hard to achieve by exciting the resonance of transverse electric (TE) graphene plasmons due to their poor field confinement.

preprint2015arXiv

Common Electronic Origin of Superconductivity in (Li,Fe)OHFeSe Bulk Superconductor and Single-Layer FeSe/SrTiO3 Films

The mechanism of high temperature superconductivity in the iron-based superconductors remains an outstanding issue in condensed matter physics. The electronic structure, in particular the Fermi surface topology, is considered to play an essential role in dictating the superconductivity. Recent revelation of distinct electronic structure and possible high temperature superconductivity with a transition temperature Tc above 65 K in the single-layer FeSe films grown on the SrTiO3 substrate provides key information on the roles of Fermi surface topology and interface in inducing or enhancing superconductivity. Here we report high resolution angle-resolved photoemission measurement on the electronic structure and superconducting gap of a novel FeSe-based superconductor, (Li0.84Fe0.16)OHFe0.98Se, with a Tc at 41 K. We find that this single-phase bulk superconductor shows remarkably similar electronic behaviors to that of the superconducting single-layer FeSe/SrTiO3 film in terms of Fermi surface topology, band structure and nearly isotropic superconducting gap without nodes. These observations provide significant insights in understanding high temperature superconductivity in the single-layer FeSe/SrTiO3 film in particular, and the mechanism of superconductivity in the iron-based superconductors in general.

preprint2015arXiv

Concealing arbitrary objects remotely with multi-folded transformation optics

An invisibility cloak that can hide an arbitrary object external to the cloak itself has not been devised before. In this Letter, we introduce a novel way to design a remote cloaking device that makes any object located at a certain distance invisible. This is accomplished using multi-folded transformation optics to remotely generate a hidden region around the object that no field can penetrate and that does not disturb the far-field scattering electromagnetic field. As a result, any object in the hidden region can stay in position or move freely within that region and remain invisible. Our idea is further extended in order to design a remote illusion optics that can transform any arbitrary object into another one. Unlike other cloaking methods, this method would require no knowledge of the details of the object itself. The proposed multi-folded transformation optics will be crucial in the design of remote devices in a variety of contexts.

preprint2015arXiv

Electronic Structure and Superconductivity of FeSe-Related Superconductors

The FeSe superconductor and its related systems have attracted much attention in the iron-based superconductors owing to their simple crystal structure and peculiar electronic and physical properties. The bulk FeSe superconductor has a superconducting transition temperature (Tc) of ~8 K; it can be dramatically enhanced to 37 K at high pressure. On the other hand, its cousin system, FeTe, possesses a unique antiferromagnetic ground state but is non-superconducting. Substitution of Se by Te in the FeSe superconductor results in an enhancement of Tc up to 14.5 K and superconductivity can persist over a large composition range in the Fe(Se,Te) system. Intercalation of the FeSe superconductor leads to the discovery of the AxFe2-ySe2 (A=K, Cs and Tl) system that exhibits a Tc higher than 30 K and a unique electronic structure of the superconducting phase. The latest report of possible high temperature superconductivity in the single-layer FeSe/SrTiO3 films with a Tc above 65 K has generated much excitement in the community. This pioneering work opens a door for interface superconductivity to explore for high Tc superconductors. The distinct electronic structure and superconducting gap, layer-dependent behavior and insulator-superconductor transition of the FeSe/SrTiO3 films provide critical information in understanding the superconductivity mechanism of the iron-based superconductors. In this paper, we present a brief review on the investigation of the electronic structure and superconductivity of the FeSe superconductor and related systems, with a particular focus on the FeSe films.

preprint2015arXiv

Electronic Structure of Transition Metal Dichalcogenides PdTe2 and Cu0.05PdTe2 Superconductors Obtained by Angle-Resolved Photoemission Spectroscopy

The layered transition metal chalcogenides have been a fertile land in solid state physics for many decades. Various MX2-type transition metal dichalcogenides, such as WTe2, IrTe2, and MoS2, have triggered great attention recently, either for the discovery of novel phenomena or some extreme or exotic physical properties, or for their potential applications. PdTe2 is a superconductor in the class of transition metal dichalcogenides, and superconductivity is enhanced in its Cu-intercalated form, Cu0.05PdTe2. It is important to study the electronic structures of PdTe2 and its intercalated form in order to explore for new phenomena and physical properties and understand the related superconductivity enhancement mechanism. Here we report systematic high resolution angle-resolved photoemission (ARPES) studies on PdTe2 and Cu0.05PdTe2 single crystals, combined with the band structure calculations. We present for the first time in detail the complex multi-band Fermi surface topology and densely-arranged band structure of these compounds. By carefully examining the electronic structures of the two systems, we find that Cu-intercalation in PdTe2 results in electron-doping, which causes the band structure to shift downwards by nearly 16 meV in Cu0.05PdTe2. Our results lay a foundation for further exploration and investigation on PdTe2 and related superconductors.

preprint2015arXiv

On the equivalence between local and global existence of complete Kähler metrics with plurisubharmonic potentials

As in classical potential theory, it was conjectured that there is an equivalence between locally and globally pluripolar sets, and likewise for complete pluripolar sets; this is Problem I of Lelong, which was solved by Josefson, Bedford-Taylor and Colţoiu. In this article, we consider complements of complete Kähler domains as the generalization of closed complete pluripolar sets and prove that there exists an equivalence between local and global existence of these sets.

preprint2014arXiv

Dichotomy of Electronic Structure and Superconductivity between Single-Layer and Double-Layer FeSe/SrTiO3 Films

The latest discovery of possible high temperature superconductivity in the single-layer FeSe film grown on a SrTiO3 substrate, together with the observation of its unique electronic structure and nodeless superconducting gap, has generated much attention. Initial work also found that, while the single-layer FeSe/SrTiO3 film exhibits a clear signature of superconductivity, the double-layer FeSe/SrTiO3 film shows an insulating behavior. Such a dramatic difference between the single-layer and double-layer FeSe/SrTiO3 films is surprising and the underlying origin remains unclear. Here we report our comparative study between the single-layer and double-layer FeSe/SrTiO3 films by performing a systematic angle-resolved photoemission study on the samples annealed in vacuum. We find that, like the single-layer FeSe/SrTiO3 film, the as-prepared double-layer FeSe/SrTiO3 film is insulating and possibly magnetic, thus establishing a universal existence of the magnetic phase in the FeSe/SrTiO3 films. In particular, the double-layer FeSe/SrTiO3 film shows a quite different doping behavior from the single-layer film in that it is hard to get doped and remains in the insulating state under an extensive annealing condition. The difference originates from the much reduced doping efficiency in the bottom FeSe layer of the double-layer FeSe/SrTiO3 film from the FeSe-SrTiO3 interface. These observations provide key insights in understanding the origin of superconductivity and the doping mechanism in the FeSe/SrTiO3 films. The property disparity between the single-layer and double-layer FeSe/SrTiO3 films may facilitate the fabrication of electronic devices by making superconducting and insulating components on the same substrate under the same condition.

preprint2014arXiv

Electronic Evidence of an Insulator-Superconductor Transition in Single-Layer FeSe/SrTiO3 Films

In high temperature cuprate superconductors, it is now generally agreed that the parent compound is a Mott insulator and superconductivity is realized by doping the antiferromagnetic Mott insulator. In the iron-based superconductors, however, the parent compound is mostly an antiferromagnetic metal, raising a debate on whether an appropriate starting point should go with an itinerant picture or a localized picture. It has been proposed theoretically that the parent compound of the iron-based superconductors may be on the verge of a Mott insulator, but so far no clear experimental evidence of a doping-induced Mott transition has been available. Here we report electronic evidence of an insulator-superconductor transition observed in the single-layer FeSe films grown on the SrTiO3 substrate. By taking angle-resolved photoemission measurements on the electronic structure and energy gap, we have identified a clear evolution from an insulator to a superconductor with increasing doping. This observation represents the first example of an insulator-superconductor transition via doping observed in the iron-based superconductors. It indicates that the parent compound of the iron-based superconductors is in proximity to a Mott insulator and strong electron correlation should be considered in describing the iron-based superconductors.

preprint2014arXiv

Orbital-Selective Spin Texture and its Manipulation in a Topological Insulator

Topological insulators represent a new quantum state of matter that is insulating in the bulk but metallic on the edge or surface. In the Dirac surface state, it is well-established that the electron spin is locked with the crystal momentum. Here we report a new phenomenon of the spin texture locking with the orbital texture in a topological insulator Bi2Se3. We observe light-polarization-dependent spin texture of both the upper and lower Dirac cones that constitutes strong evidence of the orbital-dependent spin texture in Bi2Se3. The different spin texture detected in variable polarization geometry is the manifestation of the spin-orbital texture in the initial state combined with the photoemission matrix element effects. Our observations provide a new orbital degree of freedom and a new way of light manipulation in controlling the spin structure of the topological insulators that are important for their future applications in spin-related technologies.

preprint2014arXiv

Strong Anisotropy of Dirac Cone in SrMnBi2 and CaMnBi2 Revealed by Angle-Resolved Photoemission Spectroscopy

The Dirac materials, such as graphene and three-dimensional topological insulators, have attracted much attention because they exhibit novel quantum phenomena with their low energy electrons governed by the relativistic Dirac equations. One particular interest is to generate Dirac cone anisotropy so that the electrons can propagate differently from one direction to the other, creating an additional tunability for new properties and applications. While various theoretical approaches have been proposed to make the isotropic Dirac cones of graphene into anisotropic ones, it has not yet been met with success. There are also some theoretical predictions and/or experimental indications of anisotropic Dirac cone in novel topological insulators and AMnBi2 (A=Sr and Ca) but more experimental investigations are needed. Here we report systematic high resolution angle-resolved photoemission measurements that have provided direct evidence on the existence of strongly anisotropic Dirac cones in SrMnBi2 and CaMnBi2. Distinct behaviors of the Dirac cones between SrMnBi2 and CaMnBi2 are also observed. These results have provided important information on the strong anisotropy of the Dirac cones in AMnBi2 system that can be governed by the spin-orbital coupling and the local environment surrounding the Bi square net.

preprint2014arXiv

Weak Electron-Phonon Coupling and Unusual Electron Scattering of Topological Surface States in Sb(111) by Laser-Based Angle-Resolved Photoemission Spectroscopy

High resolution laser-based angle-resolved photoemission measurements have been carried out on Sb(111) single crystal. Two kinds of Fermi surface sheets are observed that are derived from the topological surface states: one small hexagonal electron-like Fermi pocket around the Γ point and the other six elongated lobes of hole-like Fermi pockets around the electron pocket. Clear Rashba-type band splitting due to the strong spin-orbit coupling is observed that is anisotropic in the momentum space. Our super-high-resolution ARPES measurements reveal no obvious kink in the surface band dispersions indicating a weak electron-phonon interaction in the surface states. In particular, the electron scattering rate for these topological surface states is nearly a constant over a large energy window near the Fermi level that is unusual in terms of the conventional picture.

preprint2013arXiv

Electrical Tuning of Surface Plasmon Polariton Propagation in Graphene-Nanowire Hybrid Structure

We demonstrate a dynamic surface plasmonic modulation of graphene-nanowire hybrid structures in the visible light range, which was previously thought to be a tough task for graphene-based field effect transistor modulators. A static modulation depth as high as 0.07 dB/μm has been achieved experimentally. Careful simulation indicates that the strongly focused electromagnetic field and dramatically enhanced electric field at the interface between a silver NW and a graphene sheet play key roles in bringing the optical response of the device to the visible range. Furthermore, the modulation behaviors near the Dirac point of monolayer graphene and the singularity of gap-induced bilayer graphene are investigated.

preprint2013arXiv

Fermi Surface and Band Structure of (Ca,La)FeAs2 Superconductor from Angle-Resolved Photoemission Spectroscopy

The (Ca,R)FeAs2 (R = La, Pr, etc.) superconductors with a signature of superconductivity transition above 40 K possess a new kind of block layers that consist of zig-zag As chains. In this paper, we report the electronic structure of the new (Ca,La)FeAs2 superconductor investigated by both band structure calculations and high resolution angle-resolved photoemission spectroscopy measurements. Band structure calculations indicate that there are four hole-like bands around the zone center Γ(0,0) and two electron-like bands near the zone corner M(π,π) in CaFeAs2. In our angle-resolved photoemission measurements on (Ca0.9La0.1)FeAs2, we have observed three hole-like bands around the Γ point and one electron-like Fermi surface near the M(π,π) point. These results provide important information to compare and contrast with the electronic structure of other iron-based compounds in understanding the superconductivity mechanism in the iron-based superconductors.

preprint2013arXiv

Tunable Dirac Fermion Dynamics in Topological Insulators

Three-dimensional topological insulators are characterized by insulating bulk state and metallic surface state involving Dirac fermions that behave as massless relativistic particles. These Dirac fermions are responsible for achieving a number of novel and exotic quantum phenomena in the topological insulators and for their potential applications in spintronics and quantum computations. It is thus essential to understand the electron dynamics of the Dirac fermions, i.e., how they interact with other electrons, phonons and disorders. Here we report super-high resolution angle-resolved photoemission studies on the Dirac fermion dynamics in the prototypical Bi2(Te,Se)3 topological insulators. We have directly revealed signatures of the electron-phonon coupling in these topological insulators and found that the electron-disorder interaction is the dominant factor in the scattering process. The Dirac fermion dynamics in Bi2(Te3-xSex) topological insulators can be tuned by varying the composition, x, or by controlling the charge carriers. Our findings provide crucial information in understanding the electron dynamics of the Dirac fermions in topological insulators and in engineering their surface state for fundamental studies and potential applications.

preprint2012arXiv

Electronic Origin of High Temperature Superconductivity in Single-Layer FeSe Superconductor

The latest discovery of high temperature superconductivity signature in single-layer FeSe is significant because it is possible to break the superconducting critical temperature ceiling (maximum Tc~55 K) that has been stagnant since the discovery of Fe-based superconductivity in 2008. It also takes the superconductivity community by surprise because such a high Tc is unexpected in the FeSe system, with the bulk FeSe exhibiting a Tc at only 8 K at ambient pressure which can be enhanced to 38 K under high pressure. The Tc is still unusually high even considering the newly-discovered intercalated FeSe system AxFe2-ySe2 (A=K, Cs, Rb and Tl) with a Tc at 32 K at ambient pressure and possible Tc near 48 K under high pressure. Particularly interesting is that such a high temperature superconductivity occurs in a single-layer FeSe system that is considered as a key building block of the Fe-based superconductors. Understanding the origin of high temperature superconductivity in such a strictly two-dimensional FeSe system is crucial to understanding the superconductivity mechanism in Fe-based superconductors in particular, and providing key insights on how to achieve high temperature superconductivity in general. Here we report distinct electronic structure associated with the single-layer FeSe superconductor. Its Fermi surface topology is different from other Fe-based superconductors; it consists only of electron pockets near the zone corner without indication of any Fermi surface around the zone center. Our observation of a large and nearly isotropic superconducting gap in this strictly two-dimensional system rules out the existence of nodes in the superconducting gap. These results have provided an unambiguous case that such a unique electronic structure is favorable for realizing high temperature superconductivity.

preprint2012arXiv

Phase Diagram and High Temperature Superconductivity at 65 K in Tuning Carrier Concentration of Single-Layer FeSe Films

Superconductivity in the cuprate superconductors and the Fe-based superconductors is realized by doping the parent compound with charge carriers, or by application of high pressure, to suppress the antiferromagnetic state. Such a rich phase diagram is important in understanding superconductivity mechanism and other physics in the Cu- and Fe-based high temperature superconductors. In this paper, we report a phase diagram in the single-layer FeSe films grown on SrTiO3 substrate by an annealing procedure to tune the charge carrier concentration over a wide range. A dramatic change of the band structure and Fermi surface is observed, with two distinct phases identified that are competing during the annealing process. Superconductivity with a record high transition temperature (Tc) at ~65 K is realized by optimizing the annealing process. The wide tunability of the system across different phases, and its high-Tc, make the single-layer FeSe film ideal not only to investigate the superconductivity physics and mechanism, but also to study novel quantum phenomena and for potential applications.

preprint2011arXiv

Automatic Performance Debugging of SPMD-style Parallel Programs

The single program, multiple data (SPMD) programming model is widely used for both high performance computing and Cloud computing. In this paper, we design and implement an innovative system, AutoAnalyzer, that automates the process of debugging performance problems of SPMD-style parallel programs, including data collection, performance behavior analysis, locating bottlenecks, and uncovering their root causes. AutoAnalyzer is unique in terms of two features: first, without any a priori knowledge, it automatically locates bottlenecks and uncovers their root causes for performance optimization; second, it is lightweight in terms of the size of performance data to be collected and analyzed. Our contributions are three-fold: first, we propose two effective clustering algorithms to investigate the existence of performance bottlenecks that cause process behavior dissimilarity or code region behavior disparity, respectively; meanwhile, we present two searching algorithms to locate bottlenecks; second, on the basis of rough set theory, we propose an innovative approach to automatically uncovering root causes of bottlenecks; third, on cluster systems with two different configurations, we use two production applications, written in Fortran 77, and one open source code, MPIBZIP2 (http://compression.ca/mpibzip2/), written in C++, to verify the effectiveness and correctness of our methods. For the three applications, we also propose an experimental approach to investigating the effects of different metrics on locating bottlenecks.

preprint2011arXiv

Common Fermi Surface Topology and Nodeless Superconducting Gap in K0.68Fe1.79Se2 and (Tl0.45K0.34)Fe1.84Se2 Superconductors Revealed from Angle-Resolved Photoemission Spectroscopy

We carried out high resolution angle-resolved photoemission measurements on the electronic structure and superconducting gap of K0.68Fe1.79Se2 (Tc = 32 K) and (Tl0.45K0.34)Fe1.84Se2 (Tc = 28 K) superconductors. In addition to the electron-like Fermi surface near M(π,π), two electron-like Fermi pockets are revealed around the zone center Γ(0,0) in K0.68Fe1.79Se2. This observation makes the Fermi surface topology of K0.68Fe1.79Se2 consistent with that of (Tl,Rb)xFe2-ySe2 and (Tl,K)xFe2-ySe2 compounds. A nearly isotropic superconducting gap (Δ) is observed along the electron-like Fermi pocket near the M point in K0.68Fe1.79Se2 (Δ ~ 9 meV) and (Tl0.45K0.34)Fe1.84Se2 (Δ ~ 8 meV). The establishment of a universal picture on the Fermi surface topology and superconducting gap in the AxFe2-ySe2 (A = K, Tl, Cs, Rb, etc.) superconductors will provide important information in understanding the superconductivity mechanism of the iron-based superconductors.

preprint2011arXiv

The Theory of Stochastic Pseudo-differential Operators and Its Applications, I

The purpose of this paper is to establish the theory of stochastic pseudo-differential operators and give its applications in stochastic partial differential equations. First, we introduce some concepts on stochastic pseudo-differential operators and prove their fundamental properties. Also, we present the boundedness theory, invertibility of stochastic elliptic operators and the Gårding inequality. Moreover, as an application of the theory of stochastic pseudo-differential operators, we give a Calderón-type uniqueness theorem on the Cauchy problem of stochastic partial differential equations. The proof of the uniqueness theorem is based on a new Carleman-type estimate, which is adapted to the stochastic setting.

preprint2010arXiv

Automatic Performance Debugging of SPMD Parallel Programs

Automatic performance debugging of parallel applications usually involves two steps: automatic detection of performance bottlenecks and uncovering their root causes for performance optimization. Previous work fails to resolve this challenging issue in several ways: first, several previous efforts automate analysis processes, but present the results in a confined way that only identifies performance problems with a priori knowledge; second, several tools take exploratory or confirmatory data analysis to automatically discover relevant performance data relationships. However, these efforts do not focus on locating performance bottlenecks or uncovering their root causes. In this paper, we design and implement an innovative system, AutoAnalyzer, to automatically debug the performance problems of single program, multiple data (SPMD) parallel programs. Our system is unique in terms of two dimensions: first, without any a priori knowledge, we automatically locate bottlenecks and uncover their root causes for performance optimization; second, our method is lightweight in terms of the size of collected and analyzed performance data. Our contribution is three-fold. First, we propose a set of simple performance metrics to represent the behavior of different processes of parallel programs, and present two effective clustering and searching algorithms to locate bottlenecks. Second, we propose to use the rough set algorithm to automatically uncover the root causes of bottlenecks. Third, we design and implement the AutoAnalyzer system, and use two production applications to verify the effectiveness and correctness of our methods. According to the analysis results of AutoAnalyzer, we optimize two parallel programs, with performance improvements of at least 20% and at most 170%.
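
The idea of clustering per-process metrics to detect behavior dissimilarity can be illustrated with a minimal sketch. This is not the AutoAnalyzer algorithm, which the abstract does not specify; it assumes a simple greedy distance-threshold clustering, and the function names, the threshold and the timing values are all hypothetical.

```python
import math


def cluster_processes(vectors, threshold):
    """Greedy single-pass clustering (an assumed stand-in technique).

    A process joins the first cluster whose representative (the cluster's
    first member) lies within `threshold` in Euclidean distance; otherwise
    it starts a new cluster. Returns a list of lists of process ranks.
    """
    clusters = []
    for rank, vec in enumerate(vectors):
        for members in clusters:
            representative = vectors[members[0]]
            if math.dist(vec, representative) <= threshold:
                members.append(rank)
                break
        else:
            # No existing cluster is close enough: open a new one.
            clusters.append([rank])
    return clusters


def has_dissimilarity(vectors, threshold):
    """In an SPMD program all processes are expected to behave alike,
    so more than one cluster hints at a possible bottleneck."""
    return len(cluster_processes(vectors, threshold)) > 1


# Hypothetical per-process timings (seconds) for three code regions.
timings = [
    [1.0, 2.0, 0.5],
    [1.1, 2.1, 0.5],
    [1.0, 5.0, 0.5],  # rank 2 spends far longer in the second region
    [0.9, 2.0, 0.6],
]
clusters = cluster_processes(timings, threshold=0.5)
print(clusters)
```

With these made-up timings, rank 2 ends up in a cluster of its own, which is the kind of signal a subsequent search step could use to narrow down the responsible code region.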

preprint2010arXiv

Calderon-Type Uniqueness Theorem for Stochastic Partial Differential Equations

In this Note, we present a Calderón-type uniqueness theorem on the Cauchy problem of stochastic partial differential equations. To this aim, we introduce the concept of stochastic pseudo-differential operators, and establish their boundedness and other fundamental properties. The proof of our uniqueness theorem is based on a new Carleman-type estimate.

preprint2010arXiv

Weak Maximum Principle for Strongly Coupled Elliptic Differential Systems

A classical counterexample due to E. De Giorgi shows that the weak maximum principle does not remain true for general linear elliptic differential systems. Since then, there have been some efforts to establish the weak maximum principle for special elliptic differential systems, but the existing works address only the cases of weakly coupled systems, almost-diagonal systems, or systems coupled only in various lower order terms. In this paper, by contrast, we present maximum modulus estimates for weak solutions to two classes of coupled linear elliptic differential systems with different principal parts, under considerably mild and physically reasonable assumptions. The systems under consideration are strongly coupled in the second order terms and other lower order terms, without restrictions on the size of ratios of the different principal part coefficients, or on the number of equations and space variables.

54 published items

A General Framework for Multimodal LLM-Based Multimedia Understanding in Large-Scale Recommendation Systems

Intelligent Nano-Fingerprinting: An Efficient and Precise Approach for Liquid Biopsy

Learning Geometric Invariance for Gait Recognition

Meta-Backscatter: Long-Distance Battery-Free Metamaterial-Backscatter Sensing and Communication

Quantum tunnelling-integrated optoplasmonic nanotrap enables conductance visualisation of individual proteins

ROAD: Adaptive Data Mixing for Offline-to-Online Reinforcement Learning via Bi-Level Optimization

Towards Exascale Computation for Turbomachinery Flows

ApolloRL: a Reinforcement Learning Platform for Autonomous Driving

BinGo: Pinpointing Concurrency Bugs in Go via Binary Analysis

Deep Learning-based Occluded Person Re-identification: A Survey

Functional varying index coefficient model for dynamic gene-environment interactions

Mars Entry Trajectory Planning with Range Discretization and Successive Convexification

Multispectral large-area X-ray imaging enabled by stacked multilayer scintillators

OJXPerf: Featherlight Object Replica Detection for Java Programs

Rapid Elastic Architecture Search under Specialized Classes and Resource Constraints

Temperature Field Inversion of Heat-Source Systems via Physics-Informed Neural Networks

The OARF Benchmark Suite: Characterization and Implications for Federated Learning Systems

Transformers Meet Visual Learning Understanding: A Comprehensive Review

Weighted Simultaneous Algebra Reconstruction Technique (wSART) for Additive Light Field Synthesis

A novel meta-learning initialization method for physics-informed neural networks

NumaPerf: Predictive and Full NUMA Profiling

An entanglement-based quantum network based on symmetric dispersive optics quantum key distribution

Feature Super-Resolution Based Facial Expression Recognition for Multi-scale Low-Resolution Faces

MultiResolution Attention Extractor for Small Object Detection

ScalAna: Automating Scaling Loss Detection with Graph Analysis

Variational Inference-Based Dropout in Recurrent Neural Networks for Slot Filling in Spoken Language Understanding

SLOAM: Semantic Lidar Odometry and Mapping for Forest Inventory

An Internal Observability Estimate for Stochastic Hyperbolic Equations

Correctness of Hierarchical MCS Locks with Timeout

Finite Codimensional Controllability for Evolution Equations

A weighted identity for stochastic partial differential operators and its applications

Anomalous High-Energy Waterfall-Like Electronic Structure in 5d Transition Metal Oxide Sr2IrO4 with a Strong Spin-Orbit Coupling

Atomically thin spherical shell-shaped superscatterers based on Bohr model

Common Electronic Origin of Superconductivity in (Li,Fe)OHFeSe Bulk Superconductor and Single-Layer FeSe/SrTiO3 Films

Concealing arbitrary objects remotely with multi-folded transformation optics

Electronic Structure and Superconductivity of FeSe-Related Superconductors

Electronic Structure of Transition Metal Dichalcogenides PdTe2 and Cu0.05PdTe2 Superconductors Obtained by Angle-Resolved Photoemission Spectroscopy

On the equivalence between local and global existence of complete Kähler metrics with plurisubharmonic potentials

Dichotomy of Electronic Structure and Superconductivity between Single-Layer and Double-Layer FeSe/SrTiO3 Films

Electronic Evidence of an Insulator-Superconductor Transition in Single-Layer FeSe/SrTiO3 Films

Orbital-Selective Spin Texture and its Manipulation in a Topological Insulator

Strong Anisotropy of Dirac Cone in SrMnBi2 and CaMnBi2 Revealed by Angle-Resolved Photoemission Spectroscopy

Weak Electron-Phonon Coupling and Unusual Electron Scattering of Topological Surface States in Sb(111) by Laser-Based Angle-Resolved Photoemission Spectroscopy

Electrical Tuning of Surface Plasmon Polariton Propagation in Graphene-Nanowire Hybrid Structure

Fermi Surface and Band Structure of (Ca,La)FeAs2 Superconductor from Angle-Resolved Photoemission Spectroscopy

Tunable Dirac Fermion Dynamics in Topological Insulators

Electronic Origin of High Temperature Superconductivity in Single-Layer FeSe Superconductor

Phase Diagram and High Temperature Superconductivity at 65 K in Tuning Carrier Concentration of Single-Layer FeSe Films

Automatic Performance Debugging of SPMD-style Parallel Programs

Common Fermi Surface Topology and Nodeless Superconducting Gap in K0.68Fe1.79Se2 and (Tl0.45K0.34)Fe1.84Se2 Superconductors Revealed from Angle-Resolved Photoemission Spectroscopy

The Theory of Stochastic Pseudo-differential Operators and Its Applications, I

Automatic Performance Debugging of SPMD Parallel Programs

Calderon-Type Uniqueness Theorem for Stochastic Partial Differential Equations

Weak Maximum Principle for Strongly Coupled Elliptic Differential Systems