Source author record

Xin Sun

Xin Sun appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

41works

20topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

KBQA-R1: Reinforcing Large Language Models for Knowledge Base Question Answering

Knowledge Base Question Answering (KBQA) challenges models to bridge the gap between natural language and strict knowledge graph schemas by generating executable logical forms. While Large Language Models (LLMs) have advanced this field, current approaches often struggle with a dichotomy of failure: they either generate hallucinated queries without verifying schema existence or exhibit rigid, template-based reasoning that mimics synthesized traces without true comprehension of the environment. To address these limitations, we present \textbf{KBQA-R1}, a framework that shifts the paradigm from text imitation to interaction optimization via Reinforcement Learning. Treating KBQA as a multi-turn decision process, our model learns to navigate the knowledge base using a list of actions, leveraging Group Relative Policy Optimization (GRPO) to refine its strategies based on concrete execution feedback rather than static supervision. Furthermore, we introduce \textbf{Referenced Rejection Sampling (RRS)}, a data synthesis method that resolves cold-start challenges by strictly aligning reasoning traces with ground-truth action sequences. Extensive experiments on WebQSP, GrailQA, and GraphQuestions demonstrate that KBQA-R1 achieves state-of-the-art performance, effectively grounding LLM reasoning in verifiable execution.

preprint2026arXiv

Predict the Retrieval! Test time adaptation for Retrieval Augmented Generation

Retrieval-Augmented Generation (RAG) has emerged as a powerful approach for enhancing large language models' question-answering capabilities through the integration of external knowledge. However, when adapting RAG systems to specialized domains, challenges arise from distribution shifts, resulting in suboptimal generalization performance. In this work, we propose TTARAG, a test-time adaptation method that dynamically updates the language model's parameters during inference to improve RAG system performance in specialized domains. Our method introduces a simple yet effective approach where the model learns to predict retrieved content, enabling automatic parameter adjustment to the target domain. Through extensive experiments across six specialized domains, we demonstrate that TTARAG achieves substantial performance improvements over baseline RAG systems. Code available at https://github.com/sunxin000/TTARAG.

preprint2026arXiv

Revisiting Reinforcement Learning with Verifiable Rewards from a Contrastive Perspective

RLVR has become a widely adopted paradigm for improving LLMs' reasoning capabilities, and GRPO is one of its most representative algorithms. In this paper, we first show that GRPO admits an equivalent discriminative reformulation as a weighted positive-negative score difference. Under this view, GRPO increases sequence-level scores of verified positive rollouts and decreases those of negative rollouts, where the scores are averages of clipped token-level importance sampling ratios. This reformulation reveals two structural limitations of GRPO: likelihood-misaligned scoring, where clipped ratio-based surrogate scores are optimized instead of generation likelihoods, and score-insensitive credit assignment, where rollout-level credit is assigned without accounting for relative score gaps between positive and negative rollouts in the same group. To address these limitations, we propose ConSPO, a framework for Contrastive Sequence-level Policy Optimization in RLVR. ConSPO replaces GRPO's clipped ratio-based scores with length-normalized sequence log-probabilities, aligning the optimized rollout scores with the likelihoods used in autoregressive generation. It then optimizes a group-wise InfoNCE-style objective that contrasts each positive rollout against negative distractors from the same group, enabling credit assignment to depend on their relative scores. This contrastive formulation amplifies updates for poorly separated positives while concentrating suppressive updates on high-scoring negatives. Moreover, ConSPO introduces a curriculum-scheduled margin, guiding optimization from coarse positive-negative ordering in early training toward stronger separation in later stages. Extensive evaluations across diverse backbone models, parameter scales, and training datasets show that ConSPO consistently outperforms several strong RLVR baselines on challenging mathematical reasoning benchmarks.

preprint2023arXiv

Baxter permuton and Liouville quantum gravity

The Baxter permuton is a random probability measure on the unit square which describes the scaling limit of uniform Baxter permutations. We find an explict formula for the expectation of the Baxter permuton, i.e.\ the density of its intensity measure. This answers a question of Dokos and Pak (2014). We also prove that all pattern densities of the Baxter permuton are strictly positive, distinguishing it from other permutons arising as scaling limits of pattern-avoiding permutations. Our proofs rely on a recent connection between the Baxter permuton and Liouville quantum gravity (LQG) coupled with the Schramm-Loewner evolution (SLE). The method works equally well for a two-parameter generalization of the Baxter permuton recently introduced by the first author, except that the density is not as explicit. This new family of permutons, called \emph{skew Brownian permuton}, describes the scaling limit of a number of random constrained permutations. We finally observe that in the LQG/SLE framework, the expected proportion of inversions in a skew Brownian permuton equals $\frac{π-2θ}{2π}$ where $θ$ is the so-called imaginary geometry angle between a certain pair of SLE curves.

preprint2022arXiv

A Comprehensive Survey on Aerial Mobile Edge Computing: Challenges, State-of-the-Art, and Future Directions

Driven by the visions of Internet of Things (IoT), there is an ever-increasing demand for computation resources of IoT users to support diverse applications. Mobile edge computing (MEC) has been deemed a promising solution to settle the conflict between the resource-hungry mobile applications and the resource-constrained IoT users. On the other hand, in order to provide ubiquitous and reliable connectivity in wireless networks, unmanned aerial vehicles (UAVs) can be leveraged as efficient aerial platforms by exploiting their inherent attributes, such as the on-demand deployment, high cruising altitude, and controllable maneuverability in three-dimensional (3D) space. Thus, the UAV-enabled aerial MEC is believed as a win-win solution to facilitate cost-effective and energy-saving communication and computation services in various environments. In this paper, we provide a comprehensive survey on the UAV-enabled aerial MEC. Firstly, the related advantages and research challenges for aerial MEC are discussed. Then, we provide a comprehensive review of the recent research advances, which is categorized by different domains, including the joint optimization of UAV trajectory, computation offloading and resource allocation, UAV deployment, task scheduling and load balancing, interplay between aerial MEC and other technologies, as well as the machine-learning (ML)-driven optimization. Finally, some important research directions deserved more efforts in future work are summarized.

preprint2022arXiv

A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language Model

Synthetic data construction of Grammatical Error Correction (GEC) for non-English languages relies heavily on human-designed and language-specific rules, which produce limited error-corrected patterns. In this paper, we propose a generic and language-independent strategy for multilingual GEC, which can train a GEC system effectively for a new non-English language with only two easy-to-access resources: 1) a pretrained cross-lingual language model (PXLM) and 2) parallel translation data between English and the language. Our approach creates diverse parallel GEC data without any language-specific operations by taking the non-autoregressive translation generated by PXLM and the gold translation as error-corrected sentence pairs. Then, we reuse PXLM to initialize the GEC model and pretrain it with the synthetic data generated by itself, which yields further improvement. We evaluate our approach on three public benchmarks of GEC in different languages. It achieves the state-of-the-art results on the NLPCC 2018 Task 2 dataset (Chinese) and obtains competitive performance on Falko-Merlin (German) and RULEC-GEC (Russian). Further analysis demonstrates that our data construction method is complementary to rule-based approaches.

preprint2022arXiv

Distributed Quantum Vote Based on Quantum Logical Operators, a New Battlefield of the Second Quantum Revolution

We designed two rules of binary quantum computed vote: Quantum Logical Veto (QLV) and Quantum Logical Nomination (QLN). The conjunction and disjunction from quantum computational logic are used to define QLV and QLN, respectively. Compared to classical vote, quantum computed vote is fairer, more democratic and has stronger expressive power. Since the advantage of quantum computed vote is neither the speed of computing nor the security of communication, we believe it opens a new battlefield in the second quantum revolution. Compared to other rules of quantum computed vote, QLV and QLN have better scalability. Both QLV and QLN can be implemented by the current technology and the difficulty of implementation does not grow with the increase of the number of voters.

preprint2022arXiv

FZZ formula of boundary Liouville CFT via conformal welding

Liouville Conformal Field Theory (LCFT) on the disk describes the conformal factor of the quantum disk, which is the natural random surface in Liouville quantum gravity with disk topology. Fateev, Zamolodchikov and Zamolodchikov (2000) proposed an explicit expression, the so-called the FZZ formula, for the one-point bulk structure constant for LCFT on the disk. In this paper we give a proof of the FZZ formula in the probabilistic framework of LCFT, which represents the first step towards rigorously solving boundary LCFT using conformal bootstrap. In contrast to previous works, our proof is based on conformal welding of quantum disks and the mating-of-trees theory for Liouville quantum gravity. As a byproduct of our proof, we also obtain the exact value of the variance for the Brownian motion in the mating-of-trees theory. Our paper is an essential part of an ongoing program proving integrability results for Schramm-Loewner evolutions, LCFT, and in the mating-of-trees theory.

preprint2022arXiv

Instant Reality: Gaze-Contingent Perceptual Optimization for 3D Virtual Reality Streaming

Media streaming has been adopted for a variety of applications such as entertainment, visualization, and design. Unlike video/audio streaming where the content is usually consumed sequentially, 3D applications such as gaming require streaming 3D assets to facilitate client-side interactions such as object manipulation and viewpoint movement. Compared to audio and video streaming, 3D streaming often requires larger data sizes and yet lower latency to ensure sufficient rendering quality, resolution, and latency for perceptual comfort. Thus, streaming 3D assets can be even more challenging than streaming audios/videos, and existing solutions often suffer from long loading time or limited quality. To address this critical and timely issue, we propose a perceptually-optimized progressive 3D streaming method for spatial quality and temporal consistency in immersive interactions. Based on the human visual mechanisms in the frequency domain, our model selects and schedules the streaming dataset for optimal spatial-temporal quality. We also train a neural network for our model to accelerate this decision process for real-time client-server applications. We evaluate our method via subjective studies and objective analysis under varying network conditions (from 3G to 5G) and client devices (HMD and traditional displays), and demonstrate better visual quality and temporal consistency than alternative solutions.

preprint2022arXiv

Integrability of SLE via conformal welding of random surfaces

We demonstrate how to obtain integrable results for the Schramm-Loewner evolution (SLE) from Liouville conformal field theory (LCFT) and the mating-of-trees framework for Liouville quantum gravity (LQG). In particular, we prove an exact formula for the law of a conformal derivative of a classical variant of SLE called $\mathrm{SLE}_κ(ρ_-;ρ_+)$. Our proof is built on two connections between SLE, LCFT, and mating-of-trees. Firstly, LCFT and mating-of-trees provide equivalent but complementary methods to describe natural random surfaces in LQG. Using a novel tool that we call the uniform embedding of an LQG surface, we extend earlier equivalence results by allowing fewer marked points and more generic singularities. Secondly, the conformal welding of these random surfaces produces SLE curves as their interfaces. In particular, we rely on the conformal welding results proved in our companion paper [AHS20]. Our paper is an essential part of a program proving integrability results for SLE, LCFT, and mating-of-trees based on these two connections.

preprint2022arXiv

Joint Optimization of Resource Allocation, Phase Shift and UAV Trajectory for Energy-Efficient RIS-Assisted UAV-Enabled MEC Systems

The unmanned aerial vehicle (UAV) enabled mobile edge computing (MEC) has been deemed a promising paradigm to provide ubiquitous communication and computing services for the Internet of Things (IoT). Besides, by intelligently reflecting the received signals, the reconfigurable intelligent surface (RIS) can significantly improve the propagation environment and further enhance the service quality of the UAV-enabled MEC. Motivated by this vision, in this paper, we consider both the amount of completed task bits and the energy consumption to maximize the energy efficiency of the RIS-assisted UAV-enabled MEC systems, where the bit allocation, transmit power, phase shift, and UAV trajectory are jointly optimized by an iterative algorithm with a double-loop structure based on the Dinkelbach's method and block coordinate decent (BCD) technique. Simulation results demonstrate that: 1) with the deployment of RIS, our proposed algorithm can achieve higher energy efficiency than baseline schemes while satisfying the task tolerance latency; 2) the energy efficiency first increases and then decreases with the increase of the mission period and the total amount of task-input bits of IoT devices; 3) when the CPU cycles required for computing 1-bit of task-input data becomes larger, more task bits will be offloaded to the UAV while the energy efficiency will be decreased.

preprint2022arXiv

Joint Resource Allocation and Configuration Design for STAR-RIS-Enhanced Wireless-Powered MEC

In this paper, a novel concept called simultaneously transmitting and reflecting RIS (STAR-RIS) is introduced into the wireless-powered mobile edge computing (MEC) systems to improve the efficiency of energy transfer and task offloading. Compared with traditional reflecting-only RIS, STAR-RIS extends the half-space coverage to full-space coverage by simultaneously transmitting and reflecting incident signals, and also provides new degrees-of-freedom (DoFs) for manipulating signal propagation. We aim to maximize the total computation rate of all users, where the energy transfer time, transmit power and CPU frequencies of users, and the configuration design of STAR-RIS are jointly optimized. Considering the characteristics of STAR-RIS, three operating protocols, namely energy splitting (ES), mode switching (MS), and time splitting (TS) are studied, respectively. For the ES protocol, based on the penalty method, successive convex approximation (SCA), and the linear search method, an iterative algorithm is proposed to solve the formulated non-convex problem. Then, the proposed algorithm for ES protocol is extended to solve the MS and TS problems. Simulation results illustrate that the STAR-RIS outperforms traditional reflecting/transmitting-only RIS. More importantly, the TS protocol can achieve the largest computation rate among the three operating protocols of STAR-RIS.

preprint2022arXiv

Lossless Acceleration for Seq2seq Generation with Aggressive Decoding

We study lossless acceleration for seq2seq generation with a novel decoding algorithm -- Aggressive Decoding. Unlike the previous efforts (e.g., non-autoregressive decoding) speeding up seq2seq generation at the cost of quality loss, our approach aims to yield the identical (or better) generation compared with autoregressive decoding but in a significant speedup, achieved by innovative cooperation of aggressive decoding and verification that are both efficient due to parallel computing. We propose two Aggressive Decoding paradigms for 2 kinds of seq2seq tasks: 1) For the seq2seq tasks whose inputs and outputs are highly similar (e.g., Grammatical Error Correction), we propose Input-guided Aggressive Decoding (IAD) that aggressively copies from the input sentence as drafted decoded tokens to verify in parallel; 2) For other general seq2seq tasks (e.g., Machine Translation), we propose Generalized Aggressive Decoding (GAD) that first employs an additional non-autoregressive decoding model for aggressive decoding and then verifies in parallel in the autoregressive manner. We test Aggressive Decoding on the most popular 6-layer Transformer model on GPU in multiple seq2seq tasks: 1) For IAD, we show that it can introduce a 7x-9x speedup for the Transformer in Grammatical Error Correction and Text Simplification tasks with the identical results as greedy decoding; 2) For GAD, we observe a 3x-5x speedup with the identical or even better quality in two important seq2seq tasks: Machine Translation and Abstractive Summarization. Moreover, Aggressive Decoding can benefit even more from stronger computing devices that are better at parallel computing. Given the lossless quality as well as significant and promising speedup, we believe Aggressive Decoding may potentially evolve into a de facto standard for efficient and lossless seq2seq generation in the near future.

preprint2022arXiv

Mating of trees for critical Liouville quantum gravity

In a groundbreaking work, Duplantier, Miller and Sheffield showed that subcritical Liouville quantum gravity (LQG) coupled with Schramm-Loewner evolutions (SLE) can be described by the mating of two continuum random trees. In this paper, we consider the counterpart of their result for critical LQG and SLE, i.e., for the case when $γ^2=κ=16/κ=4$. We prove that as one sends $κ\downarrow 4$ in the subcritical setting, the space-filling SLE$_κ$ in a disk degenerates to the CLE$_4$ exploration introduced by Werner and Wu, along with a collection of i.i.d.\ coin tosses indexed by the branch points of the exploration. Furthermore, in the $κ=16/γ^2\downarrow 4$ limit, the pair of continuum random trees collapse into a single continuum random tree, and we observe that upon applying an appropriate affine transform to the encoding Brownian motions before taking the limit, we get convergence to a pair of independent Brownian motions $(A,B)$. The Brownian motion $A$ encodes the LQG distance from the CLE loops to the boundary of the disk, while the Brownian motion $B$ encodes the boundary lengths of the CLE$_4$ loops. In contrast to the subcritical setting, $(A,B)$ does not determine the CLE-decorated LQG surface.

preprint2022arXiv

Maximizing Modular plus Non-monotone Submodular Functions

The research problem in this work is the relaxation of maximizing non-negative submodular plus modular with the entire real number domain as its value range over a family of down-closed sets. We seek a feasible point $\mathbf{x}^*$ in the polytope of the given constraint such that $\mathbf{x}^*\in\arg\max_{\mathbf{x}\in\mathcal{P}\subseteq[0,1]^n}F(\mathbf{x})+L(\mathbf{x})$, where $F$, $L$ denote the extensions of the underlying submodular function $f$ and modular function $\ell$. We provide an approximation algorithm named \textsc{Measured Continuous Greedy with Adaptive Weights}, which yields a guarantee $F(\mathbf{x})+L(\mathbf{x})\geq \left(1/e-\mathcal{O}(ε)\right)\cdot f(OPT)+\left(\frac{β-e}{e(β-1)}-\mathcal{O}(ε)\right)\cdot\ell(OPT)$ under the assumption that the ratio of non-negative part within $\ell(OPT)$ to the absolute value of its negative part is demonstrated by a parameter $β\in[0, \infty]$, where $OPT$ is the optimal integral solution for the discrete problem. It is obvious that the factor of $\ell(OPT)$ is $1$ when $β=0$, which means the negative part is completely dominant at this time; otherwise the factor is closed to $1/e$ whe $β\rightarrow\infty$. Our work first breaks the restriction on the specific value range of the modular function without assuming non-positivity or non-negativity as previous results and quantifies the relative variation of the approximation guarantee for optimal solutions with arbitrary structure. Moreover, we also give an analysis for the inapproximability of the problem we consider. We show a hardness result that there exists no polynomial algorithm whose output $S$ satisfies $f(S)+\ell(S)\geq0.478\cdot f(OPT)+\ell(OPT)$.

preprint2022arXiv

Relational Triple Extraction: One Step is Enough

Extracting relational triples from unstructured text is an essential task in natural language processing and knowledge graph construction. Existing approaches usually contain two fundamental steps: (1) finding the boundary positions of head and tail entities; (2) concatenating specific tokens to form triples. However, nearly all previous methods suffer from the problem of error accumulation, i.e., the boundary recognition error of each entity in step (1) will be accumulated into the final combined triples. To solve the problem, in this paper, we introduce a fresh perspective to revisit the triple extraction task, and propose a simple but effective model, named DirectRel. Specifically, the proposed model first generates candidate entities through enumerating token sequences in a sentence, and then transforms the triple extraction task into a linking problem on a "head $\rightarrow$ tail" bipartite graph. By doing so, all triples can be directly extracted in only one step. Extensive experimental results on two widely used datasets demonstrate that the proposed model performs better than the state-of-the-art baselines.

preprint2022arXiv

RiCS: A 2D Self-Occlusion Map for Harmonizing Volumetric Objects

There have been remarkable successes in computer vision with deep learning. While such breakthroughs show robust performance, there have still been many challenges in learning in-depth knowledge, like occlusion or predicting physical interactions. Although some recent works show the potential of 3D data in serving such context, it is unclear how we efficiently provide 3D input to the 2D models due to the misalignment in dimensionality between 2D and 3D. To leverage the successes of 2D models in predicting self-occlusions, we design Ray-marching in Camera Space (RiCS), a new method to represent the self-occlusions of foreground objects in 3D into a 2D self-occlusion map. We test the effectiveness of our representation on the human image harmonization task by predicting shading that is coherent with a given background image. Our experiments demonstrate that our representation map not only allows us to enhance the image quality but also to model temporally coherent complex shadow effects compared with the simulation-to-real and harmonization methods, both quantitatively and qualitatively. We further show that we can significantly improve the performance of human parts segmentation networks trained on existing synthetic datasets by enhancing the harmonization quality with our method.

preprint2021arXiv

Conditional Gaussian Distribution Learning for Open Set Recognition

Deep neural networks have achieved state-of-the-art performance in a wide range of recognition/classification tasks. However, when applying deep learning to real-world applications, there are still multiple challenges. A typical challenge is that unknown samples may be fed into the system during the testing phase and traditional deep neural networks will wrongly recognize the unknown sample as one of the known classes. Open set recognition is a potential solution to overcome this problem, where the open set classifier should have the ability to reject unknown samples as well as maintain high classification accuracy on known classes. The variational auto-encoder (VAE) is a popular model to detect unknowns, but it cannot provide discriminative representations for known classification. In this paper, we propose a novel method, Conditional Gaussian Distribution Learning (CGDL), for open set recognition. In addition to detecting unknown samples, this method can also classify known samples by forcing different latent features to approximate different Gaussian models. Meanwhile, to avoid information hidden in the input vanishing in the middle layers, we also adopt the probabilistic ladder architecture to extract high-level abstract features. Experiments on several standard image datasets reveal that the proposed method significantly outperforms the baseline method and achieves new state-of-the-art results.

preprint2021arXiv

MIMO Assisted Networks Relying on Intelligent Reflective Surfaces

Intelligent reflective surfaces (IRSs) are invoked for improving both spectral efficiency (SE) and energy efficiency (EE). Specifically, an IRS-aided multiple-input multiple-output network is considered, where the performance of randomly roaming users is analyzed by utilizing stochastic geometry tools. As such, to distinguish the superposed signals at each user, the passive beamforming weight at the IRSs and detection weight vectors at the users are jointly designed. As a benefit, by adopting a zero-forcing-based design, the intra-cell interference imposed by the IRS can be suppressed. In order to evaluate the performance of the proposed network, we first derive the approximated channel statistics in the high signal-to-noise-ratio (SNR) regime. Then, we derive the closed-form expressions both for the outage probability and for the ergodic rate of users. Both the high-SNR slopes of ergodic rate and the diversity orders of outage probability are derived for gleaning further insights. The network's SE and EE are also derived. Our numerical results are provided to confirm that: i) the high-SNR slope of the proposed network is one; ii) the SE and EE can be significantly enhanced by increasing the number of IRS elements.

preprint2021arXiv

Open Set Recognition with Conditional Probabilistic Generative Models

Deep neural networks have made breakthroughs in a wide range of visual understanding tasks. A typical challenge that hinders their real-world applications is that unknown samples may be fed into the system during the testing phase, but traditional deep neural networks will wrongly recognize these unknown samples as one of the known classes. Open set recognition (OSR) is a potential solution to overcome this problem, where the open set classifier should have the flexibility to reject unknown samples and meanwhile maintain high classification accuracy in known classes. Probabilistic generative models, such as Variational Autoencoders (VAE) and Adversarial Autoencoders (AAE), are popular methods to detect unknowns, but they cannot provide discriminative representations for known classification. In this paper, we propose a novel framework, called Conditional Probabilistic Generative Models (CPGM), for open set recognition. The core insight of our work is to add discriminative information into the probabilistic generative models, such that the proposed models can not only detect unknown samples but also classify known classes by forcing different latent features to approximate conditional Gaussian distributions. We discuss many model variants and provide comprehensive experiments to study their characteristics. Experiment results on multiple benchmark datasets reveal that the proposed method significantly outperforms the baselines and achieves new state-of-the-art performance.

preprint2020arXiv

A mating-of-trees approach for graph distances in random planar maps

We introduce a general technique for proving estimates for certain random planar maps which belong to the $γ$-Liouville quantum gravity (LQG) universality class for $γ\in (0,2)$. The family of random planar maps we consider are those which can be encoded by a two-dimensional random walk with i.i.d.\ increments via a mating-of-trees bijection, and includes the uniform infinite planar triangulation (UIPT; $γ=\sqrt{8/3}$); and planar maps weighted by the number of different spanning trees ($γ=\sqrt 2$), bipolar orientations ($γ=\sqrt{4/3}$), or Schnyder woods ($γ=1$) that can be put on the map. Using our technique, we prove estimates for graph distances in the above family of random planar maps. In particular, we obtain non-trivial upper and lower bounds for the cardinality of a graph distance ball consistent with the Watabiki (1993) prediction for the Hausdorff dimension of $γ$-LQG and we establish the existence of an exponent for certain distances in the map. The basic idea of our approach is to compare a given random planar map $M$ to a mated-CRT map---a random planar map constructed from a correlated two-dimensional Brownian motion---using a strong coupling (Zaitsev, 1998) of the encoding walk for $M$ and the Brownian motion used to construct the mated-CRT map. This allows us to deduce estimates for graph distances in $M$ from the estimates for graph distances in the mated-CRT map which we proved (using continuum theory) in a previous work. In the special case when $γ=\sqrt{8/3}$, we instead deduce estimates for the $\sqrt{8/3}$-mated-CRT map from known results for the UIPT. The arguments of this paper do not directly use SLE/LQG, and can be read without any knowledge of these objects.

preprint2020arXiv

Bit Commitment for Lottery and Auction on Quantum Blockchain

This paper propose a protocol for lottery and a protocol for auction on quantum Blockchain. Our protocol of lottery satisfies randomness, unpredictability, unforgeability, verifiability, decentralization and unconditional security. Our protocol of auction satisfies bid privacy, posterior privacy, bids' binding, decentralization and unconditional security. Except quantum Block-chain, the main technique involved in both protocols is quantum bit commitment. Since both quantum blockchain and quantum bit commitment can be realized by the current technology, our protocols are practically feasible.

preprint2020arXiv

Equivalence of Liouville measure and Gaussian free field

Given an instance $h$ of the Gaussian free field on a planar domain $D$ and a constant $γ\in (0,2)$, one can use various regularization procedures to make sense of the Liouville quantum gravity area measure $μ:= e^{γh(z)} dz.$ It is known that the field $h$ a.s. determines the measure $μ_h$. We show that the converse is true: namely, $h$ is measurably determined by $μ_h$. More generally, given a random closed fractal subset $\mathcal A$ endowed with a Frostman measure $σ$ whose support is $\mathcal A$ (independent of $h$), a Gaussian multiplicative chaos measure $μ_{σ,h}$ can be constructed. We give a mild condition on $(\mathcal A,σ)$ under which $μ_{σ,h}$ determines $h$ restricted to $\mathcal A$, in the sense that it determines its harmonic extension off $\mathcal A$. Our condition is satisfied by the occupation measures of planar Brownian motion and SLE curves under natural parametrizations. Along the way we obtain general positive moment bounds for Gaussian multiplicative chaos. Contrary to previous results, this does not require any assumption on the underlying measure $σ$ such as scale invariance, and hence may be of independent interest.

preprint2020arXiv

Few-shot Learning for Domain-specific Fine-grained Image Classification

Learning to recognize novel visual categories from a few examples is a challenging task for machines in real-world industrial applications. In contrast, humans have the ability to discriminate even similar objects with little supervision. This paper attempts to address the few shot fine-grained image classification problem. We propose a feature fusion model to explore discriminative features by focusing on key regions. The model utilizes the focus area location mechanism to discover the perceptually similar regions among objects. High-order integration is employed to capture the interaction information among intra-parts. We also design a Center Neighbor Loss to form robust embedding space distributions. Furthermore, we build a typical fine-grained and few-shot learning dataset miniPPlankton from the real-world application in the area of marine ecological environments. Extensive experiments are carried out to validate the performance of our method. The results demonstrate that our model achieves competitive performance compared with state-of-the-art models. Our work is a valuable complement to the model domain-specific industrial applications.

preprint2020arXiv

High-Order Paired-ASPP Networks for Semantic Segmenation

Current semantic segmentation models only exploit first-order statistics, while rarely exploring high-order statistics. However, common first-order statistics are insufficient to support a solid unanimous representation. In this paper, we propose High-Order Paired-ASPP Network to exploit high-order statistics from various feature levels. The network first introduces a High-Order Representation module to extract the contextual high-order information from all stages of the backbone. They can provide more semantic clues and discriminative information than the first-order ones. Besides, a Paired-ASPP module is proposed to embed high-order statistics of the early stages into the last stage. It can further preserve the boundary-related and spatial context in the low-level features for final prediction. Our experiments show that the high-order statistics significantly boost the performance on confusing objects. Our method achieves competitive performance without bells and whistles on three benchmarks, i.e, Cityscapes, ADE20K and Pascal-Context with the mIoU of 81.6%, 45.3% and 52.9%.

preprint2020arXiv

Induced graphs of uniform spanning forests

Given a subgraph $H$ of a graph $G$, the induced graph of $H$ is the largest subgraph of $G$ whose vertex set is the same as that of $H$. Our paper concerns the induced graphs of the components of $\operatorname{WSF}(G)$, the wired spanning forest on $G$, and, to a lesser extent, $\operatorname{FSF}(G)$, the free uniform spanning forest. We show that the induced graph of each component of $\operatorname{WSF}(\mathbb Z^d$) is almost surely recurrent when $d\ge 8$. Moreover, the effective resistance between two points on the ray of the tree to infinity within a component grows linearly when $d\ge9$. For any vertex-transitive graph $G$, we establish the following resampling property: Given a vertex $o$ in $G$, let $\mathcal T_o$ be the component of $\operatorname{WSF}(G)$ containing $o$ and $\overline{\mathcal{T}_o}$ be its induced graph. Conditioned on $\overline{\mathcal{T}_o}$, the tree $\mathcal T_o$ is distributed as $\operatorname{WSF}(\overline{\mathcal{T}_o})$. For any graph $G$, we also show that if $\mathcal T_o$ is the component of $\operatorname{FSF}(G)$ containing $o$ and $\overline{\mathcal{T}_o}$ is its induced graph, then conditioned on $\overline{\mathcal{T}_o}$, the tree $\mathcal T_o$ is distributed as $\operatorname{FSF}(\overline{\mathcal{T}_o})$.

preprint2020arXiv

Learning Relation Ties with a Force-Directed Graph in Distant Supervised Relation Extraction

Relation ties, defined as the correlation and mutual exclusion between different relations, are critical for distant supervised relation extraction. Existing approaches model this property by greedily learning local dependencies. However, they are essentially limited by failing to capture the global topology structure of relation ties. As a result, they may easily fall into a locally optimal solution. To solve this problem, in this paper, we propose a novel force-directed graph based relation extraction model to comprehensively learn relation ties. Specifically, we first build a graph according to the global co-occurrence of relations. Then, we borrow the idea of Coulomb's Law from physics and introduce the concept of attractive force and repulsive force to this graph to learn correlation and mutual exclusion between relations. Finally, the obtained relation representations are applied as an inter-dependent relation classifier. Experimental results on a large scale benchmark dataset demonstrate that our model is capable of modeling global relation ties and significantly outperforms other baselines. Furthermore, the proposed force-directed graph can be used as a module to augment existing relation extraction systems and improve their performance.

preprint2020arXiv

LibriVoxDeEn: A Corpus for German-to-English Speech Translation and German Speech Recognition

We present a corpus of sentence-aligned triples of German audio, German text, and English translation, based on German audiobooks. The speech translation data consist of 110 hours of audio material aligned to over 50k parallel sentences. An even larger dataset comprising 547 hours of German speech aligned to German text is available for speech recognition. The audio data is read speech and thus low in disfluencies. The quality of audio and sentence alignments has been checked by a manual evaluation, showing that speech alignment quality is in general very high. The sentence alignment quality is comparable to well-used parallel translation data and can be adjusted by cutoffs on the automatic alignment score. To our knowledge, this corpus is to date the largest resource for German speech recognition and for end-to-end German-to-English speech translation.

preprint2020arXiv

MIMO-NOMA Networks Relying on Reconfigurable Intelligent Surface: A Signal Cancellation Based Design

Reconfigurable intelligent surface (RIS) technique stands as a promising signal enhancement or signal cancellation technique for next generation networks. We design a novel passive beamforming weight at RISs in a multiple-input multiple-output (MIMO) non-orthogonal multiple access (NOMA) network for simultaneously serving paired users, where a signal cancellation based (SCB) design is employed. In order to implement the proposed SCB design, we first evaluate the minimal required number of RISs in both the diffuse scattering and anomalous reflector scenarios. Then, new channel statistics are derived for characterizing the effective channel gains. In order to evaluate the network's performance, we derive the closed-form expressions both for the outage probability (OP) and for the ergodic rate (ER). The diversity orders as well as the high-signal-to-noise (SNR) slopes are derived for engineering insights. The network's performance of a finite resolution design has been evaluated. Our analytical results demonstrate that: i) the inter-cluster interference can be eliminated with the aid of large number of RIS elements; ii) the line-of-sight of the BS-RIS and RIS-user links are required for the diffuse scattering scenario, whereas the LoS links are not compulsory for the anomalous reflector scenario.

preprint2020arXiv

NOMA Enhanced Terrestrial and Aerial IoT Networks with Partial CSI

This article investigates a non-orthogonal multiple access (NOMA) enhanced Internet of Things (IoT) network. In order to provide connectivity, a novel cluster strategy is proposed, where multiple devices can be served simultaneously. Two potential scenarios are investigated: 1) NOMA enhanced terrestrial IoT networks and 2) NOMA enhanced aerial IoT networks. We utilize stochastic geometry tools to model the spatial randomness of both terrestrial and aerial devices. New channel statistics are derived for both terrestrial and aerial devices. The exact and the asymptotic expressions in terms of coverage probability are derived. In order to obtain further engineering insights, short-packet communication scenarios are investigated. From our analysis, we show that the performance of NOMA enhanced IoT networks is capable of outperforming OMA enhanced IoT networks. Moreover, based on simulation results, there exists an optimal value of the transmit power that maximizes the coverage probability.

preprint2020arXiv

Weak LQG metrics and Liouville first passage percolation

For $γ\in (0,2)$, we define a weak $γ$-Liouville quantum gravity (LQG) metric to be a function $h\mapsto D_h$ which takes in an instance of the planar Gaussian free field (GFF) and outputs a metric on the plane satisfying a certain list of natural axioms. We show that these axioms are satisfied for any subsequential limits of Liouville first passage percolation. Such subsequential limits were proven to exist by Ding-Dubédat-Dunlap-Falconet (2019). It is also known that these axioms are satisfied for the $\sqrt{8/3}$-LQG metric constructed by Miller and Sheffield (2013-2016). For any weak $γ$-LQG metric, we obtain moment bounds for diameters of sets as well as point-to-point, set-to-set, and point-to-set distances. We also show that any such metric is locally bi-Hölder continuous with respect to the Euclidean metric and compute the optimal Hölder exponents in both directions. Finally, we show that LQG geodesics cannot spend a long time near a straight line or the boundary of a metric ball. These results are used in subsequent work by Gwynne and Miller which proves that the weak $γ$-LQG metric is unique for each $γ\in (0,2)$, which in turn gives the uniqueness of the subsequential limit of Liouville first passage percolation. However, most of our results are new even in the special case when $γ=\sqrt{8/3}$.

preprint2016arXiv

Brownian motion correlation in the peanosphere for $κ> 8$

The peanosphere (or "mating of trees") construction of Duplantier, Miller, and Sheffield encodes certain types of $γ$-Liouville quantum gravity (LQG) surfaces ($γ\in (0,2)$) decorated with an independent SLE$_κ$ ($κ= 16/γ^2 > 4$) in terms of a correlated two-dimensional Brownian motion and provides a framework for showing that random planar maps decorated with statistical physics models converge to LQG decorated with an SLE. Previously, the correlation for the Brownian motion was only explicitly identified as $-\cos(4π/κ)$ for $κ\in (4,8]$ and unknown for $κ> 8$. The main result of this work is that this formula holds for all $κ> 4$. This supplies the missing ingredient for proving convergence results of the aforementioned type for $κ> 8$. Our proof is based on the calculation of a certain tail exponent for SLE$_κ$ on a quantum wedge and then matching it with an exponent which is well-known for Brownian motion.

preprint2016arXiv

Fractional Gaussian fields: a survey

We discuss a family of random fields indexed by a parameter $s\in \mathbb{R}$ which we call the fractional Gaussian fields, given by \[ \mathrm{FGF}_s(\mathbb{R}^d)=(-Δ)^{-s/2} W, \] where $W$ is a white noise on $\mathbb{R}^d$ and $(-Δ)^{-s/2}$ is the fractional Laplacian. These fields can also be parameterized by their Hurst parameter $H = s-d/2$. In one dimension, examples of $\mathrm{FGF}_s$ processes include Brownian motion ($s = 1$) and fractional Brownian motion ($1/2 < s < 3/2$). Examples in arbitrary dimension include white noise ($s = 0$), the Gaussian free field ($s = 1$), the bi-Laplacian Gaussian field ($s = 2$), the log-correlated Gaussian field ($s = d/2$), Lévy's Brownian motion ($s = d/2 + 1/2$), and multidimensional fractional Brownian motion ($d/2 < s < d/2 + 1$). These fields have applications to statistical physics, early-universe cosmology, finance, quantum field theory, image processing, and other disciplines. We present an overview of fractional Gaussian fields including covariance formulas, Gibbs properties, spherical coordinate decompositions, restrictions to linear subspaces, local set theorems, and other basic results. We also define a discrete fractional Gaussian field and explain how the $\mathrm{FGF}_s$ with $s \in (0,1)$ can be understood as a long range Gaussian free field in which the potential theory of Brownian motion is replaced by that of an isotropic $2s$-stable Lévy process.

preprint2016arXiv

Spinodal Decomposition-Enabled Halide Perovskite Double Heterostructure with Reduced Fröhlich Electron-Phonon Coupling

Epitaxial III-V semiconductor heterostructures are key components in modern microelectronics, electro-optics and optoelectronics. With superior semiconducting properties, halide perovskite materials are rising as promising candidates for coherent heterostructure devices. In this report, spinodal decomposition is proposed and experimentally implemented to produce epitaxial double heterostructures in halide perovskite system. Pristine epitaxial mixed halide perovskites rods and films were synthesized via Van der Waals epitaxy by chemical vapor deposition method. At room temperature, photon was applied as a knob to regulate the kinetics of spinodal decomposition and classic coarsening. By this approach, halide perovskite double heterostructures were created carrying epitaxial interfaces and outstanding optical properties. Reduced Fröhlich electron-phonon coupling was discovered in coherent halide double heterostructure, which is hypothetically attributed to the classic phonon confinement effect widely existing in III-V double heterostructures. The ability to develop coherent double heterostructures in halide perovskites paves an avenue to exploring halide perovskite-based quantum wells and superlattices for high-performance and low-cost optoelectronics, electro-optics and microelectronics.

preprint2015arXiv

On Fluctuations for Random Band Toeplitz Matrices

In this paper we study two one-parameter families of random band Toeplitz matrices: \[ A_n(t)=\frac{1}{\sqrt{b_n}}\Big(a_{i-j}δ_{|i-j|\le[b_nt]}\Big)_{i,j=1}^n \quad\text{and}\quad B_n(t)=\frac{1}{\sqrt{b_n}}\Big(a_{i-j}(t)δ_{|i-j|\le b_n}\Big)_{i,j=1}^n \] where 1. $a_0=0$, $\{a_1,a_2,..\}$ in $A_n(t)$ are independent random variables and $a_{-i}=a_i$ 2. $a_0(t)=0$, $\{a_1(t),a_2(t),...\}$ in $B_n(t)$ are independent copies of the standard Brownian motion at time $t$ and $a_{-i}(t)=a_i(t)$. As $t$ varies, the empirical measures $μ(A_n(t))$ and $μ(B_n(t))$ are measure valued stochastic processes. The purpose of this paper is to study the fluctuations of $μ(A_n(t))$ and $μ(B_n(t))$ as $n$ goes to $\infty$. Given a monomial $f(x)=x^p$ with $p\ge2$, the corresponding rescaled fluctuations of $μ(A_n(t))$ and $μ(B_n(t))$ are \[\sqrt{b_n}\Big(\int f(x)dμ(A_n(t))-E[\int f(x)dμ(A_n(t))]\Big)=\frac{\sqrt{b_n}}{n}\Big(\text{tr}(A_n(t)^p)-E[\text{tr}(A_n(t)^p)]\Big), \quad(1)\] \[\sqrt{b_n}\Big(\int f(x)dμ(B_n(t))-E[\int f(x)dμ(B_n(t))]\Big)=\frac{\sqrt{b_n}}{n}\Big(\text{tr}(B_n(t)^p)-E[\text{tr}(B_n(t)^p)]\Big) \quad(2)\] respectively. We will prove that (1) and (2) converge to centered Gaussian families $\{Z_p(t)\}$ and $\{W_p(t)\}$ respectively. The covariance structure $E[Z_p(t_1)Z_q(t_2)]$ and $E[W_p(t_1)W_q(t_2)]$ are obtained for all $p,q\ge 2$; $t_1,t_2\ge 0,$ and are both homogeneous polynomials of $t_1$ and $t_2$ for fixed $p,q$. In particular, $Z_2(t)$ is the Brownian motion and $Z_3(t)$ is the same as $W_2(t)$ up to a constant. The main method of this paper is the moment method.

preprint2015arXiv

Sandpiles and unicycles on random planar maps

We consider the abelian sandpile model and the uniform spanning unicycle on random planar maps. We show that the sandpile density converges to 5/2 as the maps get large. For the spanning unicycle, we show that the length and area of the cycle converges to the hitting time and location of a simple random walk in the first quadrant. The calculations use the "hamburger-cheeseburger" construction of Fortuin--Kasteleyn random cluster configurations on random planar maps.

preprint2015arXiv

Scaling limits for the critical Fortuin-Kastelyn model on a random planar map III: finite volume case

We prove scaling limit results for the finite-volume version of the inventory accumulation model of Sheffield (2011), which encodes a random planar map decorated by a collection of loops sampled from the critical Fortuin-Kasteleyn (FK) model. In particular, we prove that the random walk associated with the finite-volume version of this model converges in the scaling limit to a correlated Brownian motion $\dot Z$ conditioned to stay in the first quadrant for two units of time and satisfy $\dot Z(2) = 0$. We also show that the times which describe complementary connected components of FK loops in the discrete model converge to the $π/2$-cone times of $\dot Z$. Combined with recent results of Duplantier, Miller, and Sheffield, our results imply that many interesting functionals of the FK loops on a finite-volume FK planar map (e.g. their boundary lengths and areas) converge in the scaling limit to the corresponding "quantum" functionals of the CLE$_κ$ loops on a $4/\sqrtκ$-Liouville quantum gravity sphere for $κ\in (4,8)$. Our results are finite-volume analogues of the scaling limit theorems for the infinite-volume version of the inventory accumulation model proven by Sheffield (2011) and Gwynne, Mao, and Sun (2015).

preprint2014arXiv

Ergodicity of the Airy line ensemble

In this paper, we establish the ergodicity of the Airy line ensemble. This shows that it is the only candidate for Conjecture 3.2 in [3], regarding the classification of ergodic line ensembles satisfying a certain Brownian Gibbs property after a parabolic shift.

preprint2013arXiv

Uniform Spanning Forests and the bi-Laplacian Gaussian field

We construct a natural discrete random field on $\mathbb{Z}^{d}$, $d\geq 5$ that converges weakly to the bi-Laplacian Gaussian field in the scaling limit. The construction is based on assigning i.i.d. Bernoulli random variables on each component of the uniform spanning forest, thus defines an associated random function. To our knowledge, this is the first natural discrete model (besides the discrete bi-Laplacian Gaussian field) that converges to the bi-Laplacian Gaussian field.

preprint2011arXiv

A note on eigenvalues of random block Toeplitz matrices with slowly growing bandwidth

This paper can be thought of as a remark of \cite{llw}, where the authors studied the eigenvalue distribution $μ_{X_N}$ of random block Toeplitz band matrices with given block order $m$. In this note we will give explicit density functions of $\lim\limits_{N\to\infty}μ_{X_N}$ when the bandwidth grows slowly. In fact, these densities are exactly the normalized one-point correlation functions of $m\times m$ Gaussian unitary ensemble (GUE for short). The series $\{\lim\limits_{N\to\infty}μ_{X_N}|m\in\mathbb{N}\}$ can be seen as a transition from the standard normal distribution to semicircle distribution. We also show a similar relationship between GOE and block Toeplitz band matrices with symmetric blocks.

preprint2010arXiv

Fluctuation of Eigenvalues for Random Toeplitz and Related Matrices

Consider random symmetric Toeplitz matrices $T_{n}=(a_{i-j})_{i,j=1}^{n}$ with matrix entries $a_{j}, j=0,1,2,...,$ being independent real random variables such that \be \mathbb{E}[a_{j}]=0, \ \ \mathbb{E}[|a_{j}|^{2}]=1 \ \ \textrm{for}\,\ \ j=0,1,2,...,\ee (homogeneity of 4-th moments) \be{κ=\mathbb{E}[|a_{j}|^{4}],}\ee \noindent and further (uniform boundedness)\be\sup\limits_{j\geq 0} \mathbb{E}[|a_{j}|^{k}]=C_{k}<\iy\ \ \ \textrm{for} \ \ \ k\geq 3.\ee Under the assumption of $a_{0}\equiv 0$, we prove a central limit theorem for linear statistics of eigenvalues for a fixed polynomial with degree $\geq 2$. Without the assumption, the CLT can be easily modified to a possibly non-normal limit law. In a special case where $a_{j}$'s are Gaussian, the result has been obtained by Chatterjee for some test functions. Our derivation is based on a simple trace formula for Toeplitz matrices and fine combinatorial analysis. Our method can apply to other related random matrix models, including Hankel matrices and product of several Toeplitz matrices in a flavor of free probability theory etc. Since Toeplitz matrices are quite different from the Wigner and Wishart matrices, our results enrich this topic.

Xin Sun

What is connected

Connect this record

See the researcher in context

Building this map preview

41 published item(s)

KBQA-R1: Reinforcing Large Language Models for Knowledge Base Question Answering

Predict the Retrieval! Test time adaptation for Retrieval Augmented Generation

Revisiting Reinforcement Learning with Verifiable Rewards from a Contrastive Perspective

Baxter permuton and Liouville quantum gravity

A Comprehensive Survey on Aerial Mobile Edge Computing: Challenges, State-of-the-Art, and Future Directions

A Unified Strategy for Multilingual Grammatical Error Correction with Pre-trained Cross-Lingual Language Model

Distributed Quantum Vote Based on Quantum Logical Operators, a New Battlefield of the Second Quantum Revolution

FZZ formula of boundary Liouville CFT via conformal welding

Instant Reality: Gaze-Contingent Perceptual Optimization for 3D Virtual Reality Streaming

Integrability of SLE via conformal welding of random surfaces

Joint Optimization of Resource Allocation, Phase Shift and UAV Trajectory for Energy-Efficient RIS-Assisted UAV-Enabled MEC Systems

Joint Resource Allocation and Configuration Design for STAR-RIS-Enhanced Wireless-Powered MEC

Lossless Acceleration for Seq2seq Generation with Aggressive Decoding

Mating of trees for critical Liouville quantum gravity

Maximizing Modular plus Non-monotone Submodular Functions

Relational Triple Extraction: One Step is Enough

RiCS: A 2D Self-Occlusion Map for Harmonizing Volumetric Objects

Conditional Gaussian Distribution Learning for Open Set Recognition

MIMO Assisted Networks Relying on Intelligent Reflective Surfaces

Open Set Recognition with Conditional Probabilistic Generative Models

A mating-of-trees approach for graph distances in random planar maps

Bit Commitment for Lottery and Auction on Quantum Blockchain

Equivalence of Liouville measure and Gaussian free field

Few-shot Learning for Domain-specific Fine-grained Image Classification

High-Order Paired-ASPP Networks for Semantic Segmenation

Induced graphs of uniform spanning forests

Learning Relation Ties with a Force-Directed Graph in Distant Supervised Relation Extraction

LibriVoxDeEn: A Corpus for German-to-English Speech Translation and German Speech Recognition

MIMO-NOMA Networks Relying on Reconfigurable Intelligent Surface: A Signal Cancellation Based Design

NOMA Enhanced Terrestrial and Aerial IoT Networks with Partial CSI

Weak LQG metrics and Liouville first passage percolation

Brownian motion correlation in the peanosphere for $κ> 8$

Fractional Gaussian fields: a survey

Spinodal Decomposition-Enabled Halide Perovskite Double Heterostructure with Reduced Fröhlich Electron-Phonon Coupling

On Fluctuations for Random Band Toeplitz Matrices

Sandpiles and unicycles on random planar maps

Scaling limits for the critical Fortuin-Kastelyn model on a random planar map III: finite volume case

Ergodicity of the Airy line ensemble

Uniform Spanning Forests and the bi-Laplacian Gaussian field

A note on eigenvalues of random block Toeplitz matrices with slowly growing bandwidth

Fluctuation of Eigenvalues for Random Toeplitz and Related Matrices