Source author record

Xuan Liu

Xuan Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

25works

33topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

ControBench: An Interaction-Aware Benchmark for Controversial Discourse Analysis on Social Networks

Understanding how people argue across ideological divides online is important for studying political polarization, misinformation, and content moderation. Existing datasets capture only part of this problem: some preserve text but ignore interaction structure, some model structure without rich semantics, and others represent conversations without stable user-level ideological identity. We introduce ControBench, a benchmark for controversial discourse analysis that combines heterogeneous social interaction graphs with rich textual semantics. Built from Reddit discussions on three topics, Trump, abortion, and religion, ControBench contains 7,370 users, 1,783 posts, and 26,525 interactions. The graph contains user and post nodes connected by semantically enriched edges; in particular, user-comment-user edges encode both a reply and the parent comment that it responds to, preserving local argumentative context. User labels are derived from self-declared Reddit flairs, providing a scalable proxy for ideological identity without manual annotation. The resulting datasets exhibit low or negative adjusted homophily (Trump: -0.77, Abortion: 0.06, Religion: 0.04), reflecting the cross-cutting structure of real-world debate. We evaluate graph neural networks, pretrained language models, and large language models on ControBench and observe distinct performance patterns across topics and model families, especially when ideological boundaries are ambiguous. These results position ControBench as a challenging and realistic benchmark for controversial discourse analysis.

preprint2024arXiv

A Multi-Modal Contrastive Diffusion Model for Therapeutic Peptide Generation

Therapeutic peptides represent a unique class of pharmaceutical agents crucial for the treatment of human diseases. Recently, deep generative models have exhibited remarkable potential for generating therapeutic peptides, but they only utilize sequence or structure information alone, which hinders the performance in generation. In this study, we propose a Multi-Modal Contrastive Diffusion model (MMCD), fusing both sequence and structure modalities in a diffusion framework to co-generate novel peptide sequences and structures. Specifically, MMCD constructs the sequence-modal and structure-modal diffusion models, respectively, and devises a multi-modal contrastive learning strategy with intercontrastive and intra-contrastive in each diffusion timestep, aiming to capture the consistency between two modalities and boost model performance. The inter-contrastive aligns sequences and structures of peptides by maximizing the agreement of their embeddings, while the intra-contrastive differentiates therapeutic and non-therapeutic peptides by maximizing the disagreement of their sequence/structure embeddings simultaneously. The extensive experiments demonstrate that MMCD performs better than other state-of-theart deep generative methods in generating therapeutic peptides across various metrics, including antimicrobial/anticancer score, diversity, and peptide-docking.

preprint2024arXiv

Unsupervised CT Metal Artifact Reduction by Plugging Diffusion Priors in Dual Domains

During the process of computed tomography (CT), metallic implants often cause disruptive artifacts in the reconstructed images, impeding accurate diagnosis. Several supervised deep learning-based approaches have been proposed for reducing metal artifacts (MAR). However, these methods heavily rely on training with simulated data, as obtaining paired metal artifact CT and clean CT data in clinical settings is challenging. This limitation can lead to decreased performance when applying these methods in clinical practice. Existing unsupervised MAR methods, whether based on learning or not, typically operate within a single domain, either in the image domain or the sinogram domain. In this paper, we propose an unsupervised MAR method based on the diffusion model, a generative model with a high capacity to represent data distributions. Specifically, we first train a diffusion model using CT images without metal artifacts. Subsequently, we iteratively utilize the priors embedded within the pre-trained diffusion model in both the sinogram and image domains to restore the degraded portions caused by metal artifacts. This dual-domain processing empowers our approach to outperform existing unsupervised MAR methods, including another MAR method based on the diffusion model, which we have qualitatively and quantitatively validated using synthetic datasets. Moreover, our method demonstrates superior visual results compared to both supervised and unsupervised methods on clinical datasets.

preprint2022arXiv

Modified scattering for the one-dimensional Schrödinger equation with a subcritical dissipative nonlinearity

We study the asymptotic behavior in time of solutions to the one dimensional nonlinear Schrödinger equation with a subcritical dissipative nonlinearity $λ|u|^αu$, where $0<α<2$, and $λ$ is a complex constant satisfying $\text{Im} λ>\frac{α|\text{Re} λ|}{2\sqrt{ α+1}}$. For arbitrary large initial data, we present the uniform time decay estimates when $4/3<α<2$, and the large time asymptotics of the solution when $\frac{7+\sqrt{145}}{12}<α<2$. The proof is based on the vector fields method and a semiclassical analysis method.

preprint2022arXiv

Optimization-Derived Learning with Essential Convergence Analysis of Training and Hyper-training

Recently, Optimization-Derived Learning (ODL) has attracted attention from learning and vision areas, which designs learning models from the perspective of optimization. However, previous ODL approaches regard the training and hyper-training procedures as two separated stages, meaning that the hyper-training variables have to be fixed during the training process, and thus it is also impossible to simultaneously obtain the convergence of training and hyper-training variables. In this work, we design a Generalized Krasnoselskii-Mann (GKM) scheme based on fixed-point iterations as our fundamental ODL module, which unifies existing ODL methods as special cases. Under the GKM scheme, a Bilevel Meta Optimization (BMO) algorithmic framework is constructed to solve the optimal training and hyper-training variables together. We rigorously prove the essential joint convergence of the fixed-point iteration for training and the process of optimizing hyper-parameters for hyper-training, both on the approximation quality, and on the stationary analysis. Experiments demonstrate the efficiency of BMO with competitive performance on sparse coding and real-world applications such as image deconvolution and rain streak removal.

preprint2022arXiv

Revisiting GANs by Best-Response Constraint: Perspective, Methodology, and Application

In past years, the minimax type single-level optimization formulation and its variations have been widely utilized to address Generative Adversarial Networks (GANs). Unfortunately, it has been proved that these alternating learning strategies cannot exactly reveal the intrinsic relationship between the generator and discriminator, thus easily result in a series of issues, including mode collapse, vanishing gradients and oscillations in the training phase, etc. In this work, by investigating the fundamental mechanism of GANs from the perspective of hierarchical optimization, we propose Best-Response Constraint (BRC), a general learning framework, that can explicitly formulate the potential dependency of the generator on the discriminator. Rather than adopting these existing time-consuming bilevel iterations, we design an implicit gradient scheme with outer-product Hessian approximation as our fast solution strategy. \emph{Noteworthy, we demonstrate that even with different motivations and formulations, a variety of existing GANs ALL can be uniformly improved by our flexible BRC methodology.} Extensive quantitative and qualitative experimental results verify the effectiveness, flexibility and stability of our proposed framework.

preprint2022arXiv

Towards Extremely Fast Bilevel Optimization with Self-governed Convergence Guarantees

Gradient methods have become mainstream techniques for Bi-Level Optimization (BLO) in learning and vision fields. The validity of existing works heavily relies on solving a series of approximation subproblems with extraordinarily high accuracy. Unfortunately, to achieve the approximation accuracy requires executing a large quantity of time-consuming iterations and computational burden is naturally caused. This paper is thus devoted to address this critical computational issue. In particular, we propose a single-level formulation to uniformly understand existing explicit and implicit Gradient-based BLOs (GBLOs). This together with our designed counter-example can clearly illustrate the fundamental numerical and theoretical issues of GBLOs and their naive accelerations. By introducing the dual multipliers as a new variable, we then establish Bilevel Alternating Gradient with Dual Correction (BAGDC), a general framework, which significantly accelerates different categories of existing methods by taking specific settings. A striking feature of our convergence result is that, compared to those original unaccelerated GBLO versions, the fast BAGDC admits a unified non-asymptotic convergence theory towards stationarity. A variety of numerical experiments have also been conducted to demonstrate the superiority of the proposed algorithmic framework.

preprint2021arXiv

Blockchain for Decentralized Multi-Drone to Combat COVID-19

Currently, drones represent a promising technology for combating Coronavirus disease 2019 (COVID-19) due to the transport of goods, medical supplies to a given target location in the quarantine areas experiencing an epidemic outbreak. Drone missions will increasingly rely on drone collaboration, which requires the drones to reduce communication complexity and be controlled in a decentralized fashion. Blockchain technology becomes a must in industrial applications because it provides decentralized data, accessibility, immutability, and irreversibility. Therefore, Blockchain makes data public for all drones and enables drones to log information concerning world states, time, location, resources, delivery data, and drone relation to all neighbors drones. This paper introduces decentralized independent multi-drones to accomplish the task collaboratively. Improving blockchain with a consensus algorithm can improve network partitioning and scalability in order to combat COVID-19. The multi-drones task is to combat COVID-19 via monitoring and detecting, social distancing, sanitization, data analysis, delivering goods and medical supplies, and announcement while avoiding collisions with one another. We discuss End to End (E2E) delivery application of combination blockchain and multi-drone in combating COVID-19 and beyond future pandemics. Furthermore, the challenges and opportunities of our proposed framework are highlighted.

preprint2021arXiv

Learning to shortcut and shortlist order fulfillment deciding

With the increase of order fulfillment options and business objectives taken into consideration in the deciding process, order fulfillment deciding is becoming more and more complex. For example, with the advent of ship from store retailers now have many more fulfillment nodes to consider, and it is now common to take into account many and varied business goals in making fulfillment decisions. With increasing complexity, efficiency of the deciding process can become a real concern. Finding the optimal fulfillment assignments among all possible ones may be too costly to do for every order especially during peak times. In this work, we explore the possibility of exploiting regularity in the fulfillment decision process to reduce the burden on the deciding system. By using data mining we aim to find patterns in past fulfillment decisions that can be used to efficiently predict most likely assignments for future decisions. Essentially, those assignments that can be predicted with high confidence can be used to shortcut, or bypass, the expensive deciding process, or else a set of most likely assignments can be used for shortlisting -- sending a much smaller set of candidates for consideration by the fulfillment deciding system.

preprint2021arXiv

Local well-posedness and finite time blowup for fourth-order Schrödinger equation with complex coefficient

We consider the fourth-order Schrödinger equation $$ i\partial_tu+Δ^2 u+μΔu+λ|u|^αu=0, $$ where $α>0,μ=\pm1$ or $0$ and $λ\in\mathbb{C}$. Firstly, we prove local well-posedness in $H^4\left(\R^N\right)$ in both $H^4$ subcritical and critical case: $α>0$, $(N-8)α\leq8$. Then, for any given compact set $K\subset\mathbb{R}^N$, we construct $H^4(\R^N)$ solutions that are defined on $(-T, 0)$ for some $T>0$, and blow up exactly on $K$ at $t=0$.

preprint2020arXiv

A Dielectric Metasurface Optical Chip for the Generation of Cold Atoms

Compact and robust cold atom sources are increasingly important for quantum research, especially for transferring cutting-edge quantum science into practical applications. In this letter, we report on a novel scheme that utilizes a metasurface optical chip to replace the conventional bulky optical elements used to produce a cold atomic ensemble with a single incident laser beam, which is split by the metasurface into multiple beams of the desired polarization states. Atom numbers $~10^7$ and temperatures (about 35 $μ$K) of relevance to quantum sensing are achieved in a compact and robust fashion. Our work highlights the substantial progress towards fully integrated cold atom quantum devices by exploiting metasurface optical chips, which may have great potential in quantum sensing, quantum computing and other areas.

preprint2020arXiv

Defending Against Adversarial Attacks in Transmission- and Distribution-level PMU Data

Phasor measurement units (PMUs) provide high-fidelity data that improve situation awareness of electric power grid operations. PMU datastreams inform wide-area state estimation, monitor area control error, and facilitate event detection in real time. As PMU data become more available and increasingly reliable, these devices are found in new roles within control systems, such as remedial action schemes and early warning detection systems. As with other cyber physical systems, maintaining data integrity and security pose a significant challenge for power system operators. In this paper, we present a comprehensive analysis of multiple machine learning techniques to detect malicious data injection within PMU data streams. The two datasets used in this study come from two PMU networks: an inter-university, research-grade distribution network spanning three institutions in the U.S. Pacific Northwest, and a utility transmission network from the Bonneville Power Administration. We implement the detection algorithms with TensorFlow, an open-source software library for machine learning, and the results demonstrate potential for distributing the training workload and achieving higher performance, while maintaining effectiveness in the detection of spoofed data.

preprint2020arXiv

Estimation of the Galaxy Quenching Rate in the Illustris Simulation

Quenching is a key topic in exploring the formation and evolution of galaxies. In this work, we study the quenching rate, i.e., the variation in the fraction of quenched galaxies per unit time, of the Illustris-1 simulation. By building the quenched fraction function $f(m,ρ, t)$ of each snapshot in the simulation, we derive an accurate form of quenching rate as $\Re_q=df(m,ρ,t)/dt$. According to the analytic expression of the quenching rate $\Re_q$, we split it into four components: mass quenching, environmental quenching, intrinsic mass quenching and intrinsic environmental quenching. The precise value and evolutions can be given via the formula of $\Re_q$. With this method, we analyze the Illustris-1 simulation. We find that quenched galaxies concentrate around $M_*\simeq10^{11}h^{-1}M_\odot$ and $δ+1\simeq10^{3.5}$ at earlier times, and that the quenching galaxy population slowly shifts to lower stellar mass and lower overdensity regions with time. We also find that mass quenching dominates the quenching process in this simulation, in agreement with some previous analytical models. Intrinsic quenching is the second most important component. Environmental quenching is very weak, because it is possible that the pre- or postprocessing of environments disguises environmental quenching as intrinsic quenching. We find that our method roughly predict the actual quenching rate. It could well predict the actual amount of galaxies quenched by intrinsic quenching. However, it overestimates the amount of mass quenching galaxies and underestimates the amount of environmental quenching. We suggest that the reason is the nonlinearity of the environmental overdensity change and mass growth of the galaxy.

preprint2020arXiv

Large-Scale Beamforming for Massive MIMO via Randomized Sketching

Massive MIMO system yields significant improvements in spectral and energy efficiency for future wireless communication systems. The regularized zero-forcing (RZF) beamforming is able to provide good performance with the capability of achieving numerical stability and robustness to the channel uncertainty. However, in massive MIMO systems, the matrix inversion operation in RZF beamforming becomes computationally expensive. To address this computational issue, we shall propose a novel randomized sketching based RZF beamforming approach with low computational complexity. This is achieved by solving a linear system via randomized sketching based on the preconditioned Richard iteration, which guarantees high quality approximations to the optimal solution. We theoretically prove that the sequence of approximations obtained iteratively converges to the exact RZF beamforming matrix linearly fast as the number of iterations increases. Also, it turns out that the system sum-rate for such sequence of approximations converges to the exact one at a linear convergence rate. Our simulation results verify our theoretical findings.

preprint2020arXiv

Learning to Locomote with Deep Neural-Network and CPG-based Control in a Soft Snake Robot

In this paper, we present a new locomotion control method for soft robot snakes. Inspired by biological snakes, our control architecture is composed of two key modules: A deep reinforcement learning (RL) module for achieving adaptive goal-tracking behaviors with changing goals, and a central pattern generator (CPG) system with Matsuoka oscillators for generating stable and diverse locomotion patterns. The two modules are interconnected into a closed-loop system: The RL module, analogizing the locomotion region located in the midbrain of vertebrate animals, regulates the input to the CPG system given state feedback from the robot. The output of the CPG system is then translated into pressure inputs to pneumatic actuators of the soft snake robot. Based on the fact that the oscillation frequency and wave amplitude of the Matsuoka oscillator can be independently controlled under different time scales, we further adapt the option-critic framework to improve the learning performance measured by optimality and data efficiency. The performance of the proposed controller is experimentally validated with both simulated and real soft snake robots.

preprint2020arXiv

Snapshot Interferometric 3D Imaging by Compressive Sensing and Deep Learning

We demonstrate single-shot compressive three-dimensional (3D) $(x, y, z)$ imaging based on interference coding. The depth dimension of the object is encoded into the interferometric spectra of the light field, resulting a $(x, y, λ)$ datacube which is subsequently measured by a single-shot spectrometer. By implementing a compression ratio up to $400$, we are able to reconstruct $1G$ voxels from a 2D measurement. Both an optimization based compressive sensing algorithm and a deep learning network are developed for 3D reconstruction from a single 2D coded measurement. Due to the fast acquisition speed, our approach is able to capture volumetric activities at native camera frame rates, enabling 4D (volumetric-temporal) visualization of dynamic scenes.

preprint2020arXiv

The Multi-Symplectic Lanczos Algorithm and Its Applications to Color Image Processing

Low-rank approximations of original samples are playing more and more an important role in many recently proposed mathematical models from data science. A natural and initial requirement is that these representations inherit original structures or properties. With this aim, we propose a new multi-symplectic method based on the Lanzcos bidiagonalization to compute the partial singular triplets of JRS-symmetric matrices. These singular triplets can be used to reconstruct optimal low-rank approximations while preserving the intrinsic multi-symmetry. The augmented Ritz and harmonic Ritz vectors are used to perform implicit restarting to obtain a satisfactory bidiagonal matrix for calculating the $k$ largest or smallest singular triplets, respectively. We also apply the new multi-symplectic Lanczos algorithms to color face recognition and color video compressing and reconstruction. Numerical experiments indicate their superiority over the state-of-the-art algorithms.

preprint2016arXiv

A Decomposition Based Approach for Solving a General Bilevel Linear Programming

Bilevel optimization has been widely used in decision-making process. However, there still lacks an efficient algorithm to determine an optimal solution of a bilevel optimization problem, especially for a large-size problem. To bridge the gap, this paper proposes an efficient decomposition algorithm for a general bilevel linear programming(GBLP). The simulation results on large-size testing system demonstrate its correctness and efficiency.

preprint2016arXiv

On the maximum of a type of random processes

We consider a type of random processes which satisfies the conditional increment condition and obtain an estimate for the tail probability and a Doob-type inequality of the maximum of the process. The main result is that, for processes satisfying the conditional increment condition, the tail probability decay of its maximum behaves in the same way as its marginals.

preprint2015arXiv

Impacts of Bad Data on the PMU based Line Outage Detection

Power systems become more prone to cyber-attacks due to the high integration of information technologies. In this paper, we demonstrate that the outages of some lines can be masked by injecting false data into a set of measurements. The success of the topology attack can be guaranteed by making that: 1)the injected false data obeys KCL and KVL to avoid being detected by the bad data detection program in the state estimation; 2)the residual is increased such that the line outage cannot be detected by PMU data. A quadratic programming problem is set up to determine the optimal attack vector that can maximize the residual of the outaged line. The IEEE 39-bus system is used to demonstrate the masking scheme.

preprint2013arXiv

A generalized theory of preferential linking

There are diverse mechanisms driving the evolution of social networks. A key open question dealing with understanding their evolution is: How various preferential linking mechanisms produce networks with different features? In this paper we first empirically study preferential linking phenomena in an evolving online social network, find and validate the linear preference. We propose an analyzable model which captures the real growth process of the network and reveals the underlying mechanism dominating its evolution. Furthermore based on preferential linking we propose a generalized model reproducing the evolution of online social networks, present unified analytical results describing network characteristics for 27 preference scenarios, and explore the relation between preferential linking mechanism and network features. We find that within the framework of preferential linking analytical degree distributions can only be the combinations of finite kinds of functions which are related to rational, logarithmic and inverse tangent functions, and extremely complex network structure will emerge even for very simple sublinear preferential linking. This work not only provides a verifiable origin for the emergence of various network characteristics in social networks, but bridges the micro individuals' behaviors and the global organization of social networks.

preprint2013arXiv

LED-Induced Fluorescence System for Tea Classification and Quality Assessment

A fluorescence system is developed by using several light emitting diodes (LEDs) with different wavelengths as excitation light sources. The fluorescence detection head consists of multi LED light sources and a multimode fiber for fluorescence collection, where the LEDs and the corresponding filters can be easily chosen to get appropriate excitation wavelengths for different applications. By analyzing fluorescence spectra with the principal component analysis method, the system is utilized in the classification of four types of green tea beverages and two types of black tea beverages. Qualities of the Xihu Longjing tea leaves of different grades, as well as the corresponding liquid tea samples, are studied to further investigate the ability and application of the system in the evaluation of classification/quality of tea and other foods.

preprint2012arXiv

CDAS: A Crowdsourcing Data Analytics System

Some complex problems, such as image tagging and natural language processing, are very challenging for computers, where even state-of-the-art technology is yet able to provide satisfactory accuracy. Therefore, rather than relying solely on developing new and better algorithms to handle such tasks, we look to the crowdsourcing solution -- employing human participation -- to make good the shortfall in current technology. Crowdsourcing is a good supplement to many computer tasks. A complex job may be divided into computer-oriented tasks and human-oriented tasks, which are then assigned to machines and humans respectively. To leverage the power of crowdsourcing, we design and implement a Crowdsourcing Data Analytics System, CDAS. CDAS is a framework designed to support the deployment of various crowdsourcing applications. The core part of CDAS is a quality-sensitive answering model, which guides the crowdsourcing engine to process and monitor the human tasks. In this paper, we introduce the principles of our quality-sensitive model. To satisfy user required accuracy, the model guides the crowdsourcing query engine for the design and processing of the corresponding crowdsourcing jobs. It provides an estimated accuracy for each generated result based on the human workers' historical performances. When verifying the quality of the result, the model employs an online strategy to reduce waiting time. To show the effectiveness of the model, we implement and deploy two analytics jobs on CDAS, a twitter sentiment analytics job and an image tagging job. We use real Twitter and Flickr data as our queries respectively. We compare our approaches with state-of-the-art classification and image annotation techniques. The results show that the human-assisted methods can indeed achieve a much higher accuracy. By embedding the quality-sensitive model into crowdsourcing query engine, we effectiv...[truncated].

preprint2010arXiv

Efficient delay-tolerant particle filtering

This paper proposes a novel framework for delay-tolerant particle filtering that is computationally efficient and has limited memory requirements. Within this framework the informativeness of a delayed (out-of-sequence) measurement (OOSM) is estimated using a lightweight procedure and uninformative measurements are immediately discarded. The framework requires the identification of a threshold that separates informative from uninformative; this threshold selection task is formulated as a constrained optimization problem, where the goal is to minimize tracking error whilst controlling the computational requirements. We develop an algorithm that provides an approximate solution for the optimization problem. Simulation experiments provide an example where the proposed framework processes less than 40% of all OOSMs with only a small reduction in tracking accuracy.

preprint2010arXiv

Maximal graded subalgebras of Witt and Special superalgebras

This paper has been withdrawn by the author due to that the main results and approaches are closedly parallel to the ones in Lie algebra case.

Xuan Liu

What is connected

Connect this record

See the researcher in context

Building this map preview

25 published item(s)

ControBench: An Interaction-Aware Benchmark for Controversial Discourse Analysis on Social Networks

A Multi-Modal Contrastive Diffusion Model for Therapeutic Peptide Generation

Unsupervised CT Metal Artifact Reduction by Plugging Diffusion Priors in Dual Domains

Modified scattering for the one-dimensional Schrödinger equation with a subcritical dissipative nonlinearity

Optimization-Derived Learning with Essential Convergence Analysis of Training and Hyper-training

Revisiting GANs by Best-Response Constraint: Perspective, Methodology, and Application

Towards Extremely Fast Bilevel Optimization with Self-governed Convergence Guarantees

Blockchain for Decentralized Multi-Drone to Combat COVID-19

Learning to shortcut and shortlist order fulfillment deciding

Local well-posedness and finite time blowup for fourth-order Schrödinger equation with complex coefficient

A Dielectric Metasurface Optical Chip for the Generation of Cold Atoms

Defending Against Adversarial Attacks in Transmission- and Distribution-level PMU Data

Estimation of the Galaxy Quenching Rate in the Illustris Simulation

Large-Scale Beamforming for Massive MIMO via Randomized Sketching

Learning to Locomote with Deep Neural-Network and CPG-based Control in a Soft Snake Robot

Snapshot Interferometric 3D Imaging by Compressive Sensing and Deep Learning

The Multi-Symplectic Lanczos Algorithm and Its Applications to Color Image Processing

A Decomposition Based Approach for Solving a General Bilevel Linear Programming

On the maximum of a type of random processes

Impacts of Bad Data on the PMU based Line Outage Detection

A generalized theory of preferential linking

LED-Induced Fluorescence System for Tea Classification and Quality Assessment

CDAS: A Crowdsourcing Data Analytics System

Efficient delay-tolerant particle filtering

Maximal graded subalgebras of Witt and Special superalgebras