Source author record

Xiaoyu Chen

Xiaoyu Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Catalog footprint

What is connected

39 works

24 topics

4 close collaborators

preprint · 2026 · arXiv

Intelligent Elastic Feature Fading: Enabling Model Retrain-Free Feature Efficiency Rollouts at Scale

Large-scale ranking systems depend on thousands of features derived from user behavior across multiple time horizons. Changing feature coverage typically requires model retraining, resulting in long iteration cycles (3--6 months), substantial GPU resource consumption, and limited rollout throughput. We introduce Intelligent Elastic Feature Fading (IEFF), a production infrastructure system that enables retrain-free feature efficiency rollouts by elastically controlling feature coverage and distribution at serving time. IEFF supports incremental feature coverage adjustments while models adapt through recurring training, eliminating dependencies on explicit retraining cycles. The system incorporates strict safety guardrails, reversibility mechanisms, and comprehensive monitoring to ensure stability at scale. Across multiple production use cases, IEFF accelerates efficiency-related rollouts by 5$\times$, eliminates retraining-related GPU overhead, and enables faster capacity recycling. Extensive offline and online experiments demonstrate that gradual feature fading prevents 50--55\% of online performance degradation compared to abrupt feature removal, while maintaining stable model behavior. These results establish elastic, system-level feature fading as a practical and scalable approach for managing feature efficiency in modern industrial ranking systems.
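To make the idea of gradual fading concrete, the following is a minimal sketch of how serving-time coverage for one feature might be lowered step by step. It is not the IEFF implementation: the linear schedule, the hash-based bucketing, and the names `coverage_at` and `feature_enabled` are all assumptions made for illustration.

```python
import hashlib


def coverage_at(step: int, total_steps: int, start: float = 1.0, end: float = 0.0) -> float:
    """Hypothetical linear fade: coverage moves from `start` to `end` over `total_steps`."""
    if total_steps <= 0:
        return end
    frac = min(max(step / total_steps, 0.0), 1.0)
    return start + (end - start) * frac


def feature_enabled(request_id: str, feature: str, coverage: float) -> bool:
    """Deterministically decide whether a request still receives the feature.

    Each (feature, request) pair maps to a fixed bucket in [0, 1), so lowering
    the coverage only ever removes requests and raising it restores them,
    which keeps the fade reversible.
    """
    digest = hashlib.sha256(f"{feature}:{request_id}".encode()).digest()
    bucket = int.from_bytes(digest[:4], "big") / 2**32
    return bucket < coverage


# Fade a made-up feature over 10 steps and count how many of 1000 requests keep it.
request_ids = [f"req-{i}" for i in range(1000)]
served_per_step = [
    sum(feature_enabled(r, "clicks_90d", coverage_at(s, 10)) for r in request_ids)
    for s in range(11)
]
```

Because the bucket of a request never changes, the set of requests served at a lower coverage is always a subset of the set served at a higher one, so the count in `served_per_step` can only shrink as the fade progresses.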

preprint · 2026 · arXiv

Rapid Mixing at the Uniqueness Threshold

Over the past decades, a fascinating computational phase transition has been identified in sampling from Gibbs distributions. However, the computational complexity at the critical point remains poorly understood, as previous algorithmic and hardness results all required a constant slack from this threshold. In this paper, we resolve this open question at the critical phase transition threshold, thus completing the picture of the computational phase transition. We show that for the hardcore model on graphs with maximum degree $Δ\ge 3$ at the uniqueness threshold $λ= λ_c(Δ)$, the mixing time of Glauber dynamics is upper bounded by a polynomial in $n$, but is not nearly linear in the worst case. For the Ising model (either antiferromagnetic or ferromagnetic), we establish similar results. For the Ising model on graphs with maximum degree $Δ\ge 3$ at the critical temperature $β$ where $|β| = β_c(Δ)$, with the tree-uniqueness threshold $β_c(Δ)$, we show that the mixing time of Glauber dynamics is upper bounded by $\tilde{O}\left(n^{3 + O(1/Δ)}\right)$ and lower bounded by $Ω\left(n^{3/2}\right)$ in the worst case. For the Ising model specified by a critical interaction matrix $J$ with $\left \lVert J \right \rVert_2=1$, we obtain an upper bound $\tilde{O}(n^{3/2})$ for the mixing time, matching the lower bound $Ω\left(n^{3/2}\right)$ on the complete graph up to a logarithmic factor. Our mixing time upper bounds are derived from a new interpretation and analysis of the localization scheme method introduced by Chen and Eldan (2022), applied to the field dynamics for the hardcore model and the proximal sampler for the Ising model. As key steps in both our upper and lower bounds, we establish sub-linear upper and lower bounds for spectral independence at the critical point for worst-case instances.
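For readers unfamiliar with the chain being analyzed, here is a minimal sketch of heat-bath Glauber dynamics for the hardcore model, run at the critical fugacity. The abstract does not spell out $λ_c(Δ)$; the code uses the standard tree-uniqueness formula $λ_c(Δ) = (Δ-1)^{Δ-1}/(Δ-2)^{Δ}$. The function names and the example graph are illustrative choices, and the sketch does not reproduce the paper's analysis.

```python
import random


def critical_fugacity(max_degree: int) -> float:
    """Standard tree-uniqueness threshold (Delta-1)^(Delta-1) / (Delta-2)^Delta, for Delta >= 3."""
    d = max_degree
    return (d - 1) ** (d - 1) / (d - 2) ** d


def glauber_step(adj, occupied, lam, rng):
    """One heat-bath update: resample a uniformly random vertex given its neighbors."""
    v = rng.randrange(len(adj))
    occupied.discard(v)
    # v can be occupied only if none of its neighbors is occupied;
    # in that case it is occupied with probability lam / (1 + lam).
    if not any(u in occupied for u in adj[v]) and rng.random() < lam / (1.0 + lam):
        occupied.add(v)


def is_independent(adj, occupied):
    """Check that no two occupied vertices are adjacent."""
    return all(u not in occupied for v in occupied for u in adj[v])


# Example graph: the 3-regular complete bipartite graph K_{3,3} (an arbitrary choice).
adj = [[3, 4, 5], [3, 4, 5], [3, 4, 5], [0, 1, 2], [0, 1, 2], [0, 1, 2]]
rng = random.Random(0)
lam = critical_fugacity(3)
state = set()  # start from the empty independent set
for _ in range(1000):
    glauber_step(adj, state, lam, rng)
```

Every update preserves the independent-set constraint, so `state` is a valid hardcore configuration after any number of steps; the mixing time asks how many such steps are needed before its distribution is close to the Gibbs distribution.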

preprint · 2026 · arXiv

UAM: A Dual-Stream Perspective on Forgetting in VLA Training

Vision--language--action (VLA) models are typically built by fine-tuning a pretrained vision--language model (VLM) on action data. However, we show that this standard recipe systematically erodes the VLM's multimodal competence, a side effect we call the embodiment tax. But do VLAs have to forget? Inspired by the two-stream organization of biological vision, we trace this degradation to a structural bottleneck: current VLAs ask a single encoder to support both language-grounded semantics and control-relevant visual features, whereas biological vision separates recognition and visuomotor control into distinct pathways. Building on this view, we propose the Unified Action Model (UAM), which adds a parallel Dorsal Expert, an analog of the brain's dorsal pathway. To make the Dorsal Expert an effective second pathway and reduce the control-learning burden on the VLM, we initialize it from a pretrained generative model and train it with a mid-level reasoning objective that predicts visual dynamics. This design allows us to train the whole VLA end-to-end on action data alone: with no parameter freezing, no gradient stopping, and no auxiliary VL co-training, UAM retains over $95\%$ of the underlying VLM's multimodal capability and at the same time achieves the highest average success rate among baselines on a variety of manipulation tasks that probe out-of-distribution generalization, including unseen objects, novel object--target compositions, and instruction variation. Together, these results suggest that semantic preservation in VLAs can emerge from architectural separation itself, rather than being enforced by frozen weights or auxiliary data replay, and that this preserved semantic capability can naturally transfer from VLMs to semantic generalization in actions.

preprint · 2026 · arXiv

VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models

Vision-Language-Action (VLA) models, which integrate pretrained large Vision-Language Models (VLMs) into their policy backbone, are gaining significant attention for their promising generalization capabilities. This paper revisits a fundamental yet seldom systematically studied question: how do VLM choice and competence translate to downstream VLA policy performance? We introduce VLM4VLA, a minimal adaptation pipeline that converts general-purpose VLMs into VLA policies using only a small set of new learnable parameters for fair and efficient comparison. Despite its simplicity, VLM4VLA proves surprisingly competitive with more sophisticated network designs. Through extensive empirical studies on various downstream tasks across three benchmarks, we find that while VLM initialization offers a consistent benefit over training from scratch, a VLM's general capabilities are poor predictors of its downstream task performance. This challenges common assumptions, indicating that standard VLM competence is necessary but insufficient for effective embodied control. We further investigate the impact of specific embodied capabilities by fine-tuning VLMs on seven auxiliary embodied tasks (e.g., embodied QA, visual pointing, depth estimation). Contrary to intuition, improving a VLM's performance on specific embodied skills does not guarantee better downstream control performance. Finally, modality-level ablations identify the visual module in the VLM, rather than the language component, as the primary performance bottleneck. We demonstrate that injecting control-relevant supervision into the vision encoder of the VLM yields consistent gains, even when the encoder remains frozen during downstream fine-tuning. This isolates a persistent domain gap between current VLM pretraining objectives and the requirements of embodied action-planning.

preprint · 2022 · arXiv

Flow-based Recurrent Belief State Learning for POMDPs

The Partially Observable Markov Decision Process (POMDP) provides a principled and generic framework for modeling real-world sequential decision-making processes, yet it remains unsolved, especially for high-dimensional continuous spaces and unknown models. The main challenge lies in how to accurately obtain the belief state, which is the probability distribution over the unobservable environment states given historical information. Accurately calculating this belief state is a precondition for obtaining an optimal policy of POMDPs. Recent advances in deep learning techniques show great potential to learn good belief states. However, existing methods can only learn approximate distributions with limited flexibility. In this paper, we introduce the FlOw-based Recurrent BElief State model (FORBES), which incorporates normalizing flows into variational inference to learn general continuous belief states for POMDPs. Furthermore, we show that the learned belief states can be plugged into downstream RL algorithms to improve performance. In experiments, we show that our method successfully captures complex belief states that enable multi-modal predictions as well as high-quality reconstructions, and results on challenging visual-motor control tasks show that our method achieves superior performance and sample efficiency.

preprint · 2022 · arXiv

Generalized Radiograph Representation Learning via Cross-supervision between Images and Free-text Radiology Reports

Pre-training lays the foundation for recent successes in radiograph analysis supported by deep learning. It learns transferable image representations by conducting large-scale fully-supervised or self-supervised learning on a source domain. However, supervised pre-training requires a complex and labor-intensive two-stage human-assisted annotation process, while self-supervised learning cannot compete with the supervised paradigm. To tackle these issues, we propose a cross-supervised methodology named REviewing FreE-text Reports for Supervision (REFERS), which acquires free supervision signals from the original radiology reports accompanying the radiographs. The proposed approach employs a vision transformer and is designed to learn joint representations from multiple views within every patient study. REFERS outperforms its transfer learning and self-supervised learning counterparts on 4 well-known X-ray datasets under extremely limited supervision. Moreover, REFERS even surpasses methods based on a source domain of radiographs with human-assisted structured labels. Thus REFERS has the potential to replace canonical pre-training methodologies.

preprint · 2022 · arXiv

Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation

We study human-in-the-loop reinforcement learning (RL) with trajectory preferences, where instead of receiving a numeric reward at each step, the agent only receives preferences over trajectory pairs from a human overseer. The goal of the agent is to learn the optimal policy which is most preferred by the human overseer. Despite the empirical successes, the theoretical understanding of preference-based RL (PbRL) is limited to the tabular case. In this paper, we propose the first optimistic model-based algorithm for PbRL with general function approximation, which estimates the model using value-targeted regression and calculates the exploratory policies by solving an optimistic planning problem. Our algorithm achieves a regret of $\tilde{O} (\operatorname{poly}(d H) \sqrt{K} )$, where $d$ is the complexity measure of the transition and preference model depending on the Eluder dimension and log-covering numbers, $H$ is the planning horizon, $K$ is the number of episodes, and $\tilde O(\cdot)$ omits logarithmic terms. Our lower bound indicates that our algorithm is near-optimal when specialized to the linear setting. Furthermore, we extend the PbRL problem by formulating a novel problem called RL with $n$-wise comparisons, and provide the first sample-efficient algorithm for this new setting. To the best of our knowledge, this is the first theoretical result for PbRL with (general) function approximation.

preprint · 2022 · arXiv

Irreducible Modules of Reductive Groups with Borel-stable Line

Let $p$ be a prime number and $\Bbbk=\bar{\mathbb{F}}_p$, the algebraic closure of the finite field $\mathbb{F}_p$ of $p$ elements. Let ${\bf G}$ be a connected reductive group defined over $\mathbb{F}_p$ and ${\bf B}$ be a Borel subgroup of ${\bf G}$ (not necessarily defined over $\mathbb{F}_p$). We show that for each (one-dimensional) character $θ$ of ${\bf B}$ (not necessarily rational), there is a unique (up to isomorphism) irreducible $\Bbbk{\bf G}$-module $\mathbb{L}(θ)$ containing $θ$ as a $\Bbbk{\bf B}$-submodule, and moreover, $\mathbb{L}(θ)$ is isomorphic to a parabolic induction from a finite-dimensional irreducible $\Bbbk{\bf L}$-module for some Levi subgroup ${\bf L}$ of ${\bf G}$. Thus, we have classified and constructed all (abstract) irreducible $\Bbbk{\bf G}$-modules with a ${\bf B}$-stable line (i.e. a one-dimensional $\Bbbk{\bf B}$-submodule). As a byproduct, we give a new proof of a result of Borel and Tits on the classification of finite-dimensional irreducible $\Bbbk{\bf G}$-modules.

preprint · 2022 · arXiv

LDC-VAE: A Latent Distribution Consistency Approach to Variational AutoEncoders

Variational autoencoders (VAEs), an important class of generative models, have received a lot of research interest and found many successful applications. However, it is always a challenge to achieve consistency between the learned latent distribution and the prior latent distribution when optimizing the evidence lower bound (ELBO), and this inconsistency ultimately leads to unsatisfactory performance in data generation. In this paper, we propose a latent distribution consistency approach to avoid such substantial inconsistency between the posterior and prior latent distributions in ELBO optimization. We name our method the latent distribution consistency VAE (LDC-VAE). We achieve this by assuming that the real posterior distribution in latent space takes a Gibbs form, and approximating it with our encoder. However, there is no analytical solution for approximating such a Gibbs posterior, and traditional approximation methods, such as iterative sampling-based MCMC, are time consuming. To address this problem, we use Stein Variational Gradient Descent (SVGD) to approximate the Gibbs posterior. Meanwhile, we use SVGD to train a sampler net which can obtain efficient samples from the Gibbs posterior. Comparative studies on popular image generation datasets show that our method achieves comparable or even better performance than several powerful improvements of VAEs.

preprint · 2022 · arXiv

Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver

Although model-based reinforcement learning (RL) approaches are considered more sample efficient, existing algorithms usually rely on a sophisticated planning algorithm that is tightly coupled with the model-learning procedure. Hence the learned models may not be reusable with more specialized planners. In this paper we address this issue and provide approaches to learn an RL model efficiently without the guidance of a reward signal. In particular, we take a plug-in solver approach, where we focus on learning a model in the exploration phase and demand that any planning algorithm on the learned model can give a near-optimal policy. Specifically, we focus on the linear mixture MDP setting, where the probability transition matrix is an (unknown) convex combination of a set of existing models. We show that, by establishing a novel exploration algorithm, the plug-in approach learns a model by taking $\tilde{O}(d^2H^3/ε^2)$ interactions with the environment and any $ε$-optimal planner on the model gives an $O(ε)$-optimal policy on the original model. This sample complexity matches lower bounds for non-plug-in approaches and is statistically optimal. We achieve this result by leveraging a careful maximum total-variance bound using the Bernstein inequality and properties specific to linear mixture MDPs.

preprint · 2022 · arXiv

Optimal mixing for two-state anti-ferromagnetic spin systems

We prove an optimal $Ω(n^{-1})$ lower bound for the modified log-Sobolev (MLS) constant of the Glauber dynamics for anti-ferromagnetic two-spin systems with $n$ vertices in the tree uniqueness regime. Specifically, this optimal MLS bound holds for the following classes of two-spin systems in the tree uniqueness regime: (i) all strictly anti-ferromagnetic two-spin systems (where both edge parameters $β,γ<1$), which cover the hardcore models and the anti-ferromagnetic Ising models; (ii) general anti-ferromagnetic two-spin systems on regular graphs. Consequently, an optimal $O(n\log n)$ mixing time holds for these anti-ferromagnetic two-spin systems when the uniqueness condition is satisfied. These MLS and mixing time bounds hold for any bounded or unbounded maximum degree, and the constant factors in the bounds depend only on the gap to the uniqueness threshold. We prove this by showing a boosting theorem for the MLS constant for distributions satisfying certain spectral independence and marginal stability properties.

preprint · 2022 · arXiv

Towards better understanding and better generalization of few-shot classification in histology images with contrastive learning

Few-shot learning has been an established topic in natural images for years, but little work has addressed histology images, where it is of high clinical value because well-labeled datasets and rare abnormal samples are expensive to collect. Here, we facilitate the study of few-shot learning in histology images by setting up three cross-domain tasks that simulate real clinical problems. To enable label-efficient learning and better generalizability, we propose to incorporate contrastive learning (CL) with latent augmentation (LA) to build a few-shot system. CL learns useful representations without manual labels, while LA transfers semantic variations of the base dataset in an unsupervised way. These two components fully exploit unlabeled training data and can scale gracefully to other label-hungry problems. In experiments, we find i) models learned by CL generalize better than supervised learning for histology images in unseen classes, and ii) LA brings consistent gains over baselines. Prior studies of self-supervised learning mainly focus on ImageNet-like images, which only present a dominant object in their centers. Recent attention has been paid to images with multi-objects and multi-textures. Histology images are a natural choice for such a study. We show the superiority of CL over supervised learning in terms of generalization for such data and provide our empirical understanding of this observation. The findings in this work could contribute to understanding how the model generalizes in the context of both representation learning and histological image analysis. Code is available.

preprint · 2022 · arXiv

Understanding Domain Randomization for Sim-to-real Transfer

Reinforcement learning encounters many challenges when applied directly in the real world. Sim-to-real transfer is widely used to transfer the knowledge learned from simulation to the real world. Domain randomization -- one of the most popular algorithms for sim-to-real transfer -- has been demonstrated to be effective in various tasks in robotics and autonomous driving. Despite its empirical successes, theoretical understanding of why this simple algorithm works is limited. In this paper, we propose a theoretical framework for sim-to-real transfers, in which the simulator is modeled as a set of MDPs with tunable parameters (corresponding to unknown physical parameters such as friction). We provide sharp bounds on the sim-to-real gap -- the difference between the value of the policy returned by domain randomization and the value of an optimal policy for the real world. We prove that sim-to-real transfer can succeed under mild conditions without any real-world training samples. Our theory also highlights the importance of using memory (i.e., history-dependent policies) in domain randomization. Our proof is based on novel techniques that reduce the problem of bounding the sim-to-real gap to the problem of designing efficient learning algorithms for infinite-horizon MDPs, which we believe are of independent interest.

preprint · 2022 · arXiv

WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition

In this paper, we present WenetSpeech, a multi-domain Mandarin corpus consisting of 10000+ hours of high-quality labeled speech, 2400+ hours of weakly labeled speech, and about 10000 hours of unlabeled speech, with 22400+ hours in total. We collect the data from YouTube and Podcast, which cover a variety of speaking styles, scenarios, domains, topics, and noisy conditions. An optical character recognition (OCR) based method is introduced to generate the audio/text segmentation candidates for the YouTube data from its corresponding video captions, while a high-quality ASR transcription system is used to generate audio/text pair candidates for the Podcast data. Then we propose a novel end-to-end label error detection approach to further validate and filter the candidates. We also provide three manually labelled high-quality test sets along with WenetSpeech for evaluation: Dev, for cross-validation purposes in training; Test_Net, collected from the Internet for matched tests; and Test_Meeting, recorded from real meetings for more challenging mismatched tests. Baseline systems trained with WenetSpeech are provided for three popular speech recognition toolkits, namely Kaldi, ESPnet, and WeNet, and recognition results on the three test sets are also provided as benchmarks. To the best of our knowledge, WenetSpeech is currently the largest open-sourced Mandarin speech corpus with transcriptions, which benefits research on production-level speech recognition.

preprint · 2021 · arXiv

Distinct Properties of Vortex Bound States Driven by Temperature

We investigate the behavior of vortex bound states in the quantum limit by self-consistently solving the Bogoliubov-de Gennes equation. We find that the energies of the vortex bound states deviate from the analytical result $E_μ=μΔ^2/E_F$ with the half-integer angular momentum $μ$ in the extreme quantum limit. Specifically, the energy ratio for the first three orders is closer to $1:2:3$ than to $1:3:5$ at extremely low temperature. The local density of states reveals a Friedel-like behavior associated with that of the pair potential in the extreme quantum limit, which is smoothed out by thermal effects above a certain temperature even when the quantum limit condition, namely $T/T_c<Δ/E_F$, is still satisfied. Our studies show that the vortex bound states can exhibit very distinct features in different temperature regimes, which provides a comprehensive understanding and should stimulate more experimental efforts for verification.

preprint · 2021 · arXiv

Modeling Method for the Coupling Relations of Microgrid Cyber-Physical Systems Driven by Hybrid Spatiotemporal Events

The essence of the microgrid cyber-physical system (CPS) lies in the cyclical conversion of information flow and energy flow. Most of the existing coupling models are modeled with static networks and interface structures, in which the closed-loop data flow characteristic is not fully considered. It is difficult for these models to accurately describe spatiotemporal deduction processes, such as microgrid CPS attack identification, risk propagation, safety assessment, defense control, and cascading failure. To address this problem, a modeling method for the coupling relations of microgrid CPS driven by hybrid spatiotemporal events is proposed in the present work. First, according to the topological correlation and coupling logic of the microgrid CPS, the cyclical conversion mechanism of information flow and energy flow is analyzed, and a microgrid CPS architecture with multi-agents as the core is constructed. Next, the spatiotemporal evolution characteristic of the CPS is described by hybrid automata, and the task coordination mechanism of the multi-agent CPS terminal is designed. On this basis, a discrete-continuous correlation and terminal structure characteristic representation method of the CPS based on heterogeneous multi-groups are then proposed. Finally, four spatiotemporal events, namely state perception, network communication, intelligent decision-making, and action control, are defined. Considering the constraints of the temporal conversion of information flow and energy flow, a microgrid CPS coupling model is established, the effectiveness of which is verified by simulating false data injection attack (FDIA) scenarios.

preprint · 2021 · arXiv

Near-optimal Representation Learning for Linear Bandits and Linear RL

This paper studies representation learning for multi-task linear bandits and multi-task episodic RL with linear value function approximation. We first consider the setting where we play $M$ linear bandits with dimension $d$ concurrently, and these bandits share a common $k$-dimensional linear representation so that $k\ll d$ and $k \ll M$. We propose a sample-efficient algorithm, MTLR-OFUL, which leverages the shared representation to achieve $\tilde{O}(M\sqrt{dkT} + d\sqrt{kMT} )$ regret, with $T$ being the number of total steps. Our regret significantly improves upon the baseline $\tilde{O}(Md\sqrt{T})$ achieved by solving each task independently. We further develop a lower bound that shows our regret is near-optimal when $d > M$. Furthermore, we extend the algorithm and analysis to multi-task episodic RL with linear value function approximation under low inherent Bellman error (Zanette et al., 2020). To the best of our knowledge, this is the first theoretical result that characterizes the benefits of multi-task representation learning for exploration in RL with function approximation.
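As a rough numerical illustration of the two rates quoted in this abstract, the sketch below evaluates $M\sqrt{dkT} + d\sqrt{kMT}$ and $Md\sqrt{T}$ with constants and logarithmic factors dropped. The parameter values and function names are made up for illustration; the numbers compare the orders of the bounds only and say nothing about actual regret.

```python
import math


def shared_representation_rate(M: int, d: int, k: int, T: int) -> float:
    """Order of the MTLR-OFUL bound, M*sqrt(d*k*T) + d*sqrt(k*M*T), without constants or logs."""
    return M * math.sqrt(d * k * T) + d * math.sqrt(k * M * T)


def independent_rate(M: int, d: int, T: int) -> float:
    """Order of the baseline bound, M*d*sqrt(T), from solving each task separately."""
    return M * d * math.sqrt(T)


# Illustrative regime with k much smaller than both d and M (values are arbitrary).
M, d, k, T = 100, 50, 2, 10_000
shared = shared_representation_rate(M, d, k, T)
baseline = independent_rate(M, d, T)
ratio = shared / baseline
```

Dividing the two expressions gives $\sqrt{k/d} + \sqrt{k/M}$, which is why the improvement grows as $k$ becomes small relative to both $d$ and $M$.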

preprint · 2021 · arXiv

U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition

The unified streaming and non-streaming two-pass (U2) end-to-end model for speech recognition has shown great performance in terms of streaming capability, accuracy, real-time factor (RTF), and latency. In this paper, we present U2++, an enhanced version of U2 that further improves accuracy. The core idea of U2++ is to use the forward and backward information of the labeling sequences at the same time during training to learn richer information, and to combine the forward and backward predictions at decoding to give more accurate recognition results. We also propose a new data augmentation method called SpecSub to help the U2++ model be more accurate and robust. Our experiments show that, compared with U2, U2++ shows faster convergence in training, better robustness to the decoding method, and a consistent 5\% - 8\% word error rate reduction. On AISHELL-1, U2++ achieves a 4.63\% character error rate (CER) with a non-streaming setup and 5.05\% with a streaming setup with 320ms latency. To the best of our knowledge, 5.05\% is the best published streaming result on the AISHELL-1 test set.

preprint · 2020 · arXiv

(Locally) Differentially Private Combinatorial Semi-Bandits

In this paper, we study Combinatorial Semi-Bandits (CSB), an extension of classic Multi-Armed Bandits (MAB), under the Differential Privacy (DP) and the stronger Local Differential Privacy (LDP) settings. Since the server receives more information from users in CSB, it usually causes additional dependence on the dimension of data, which is a notorious side-effect for privacy preserving learning. However, for CSB under two common smoothness assumptions (Kveton et al., 2015; Chen et al., 2016), we show it is possible to remove this side-effect. In detail, for $B_{\infty}$-bounded smooth CSB under either $\varepsilon$-LDP or $\varepsilon$-DP, we prove the optimal regret bound is $Θ(\frac{mB^2_{\infty}\ln T}{Δε^2})$ or $\tilde{Θ}(\frac{mB^2_{\infty}\ln T}{Δε})$ respectively, where $T$ is the time period, $Δ$ is the gap of rewards and $m$ is the number of base arms, by proposing novel algorithms and matching lower bounds. For $B_1$-bounded smooth CSB under $\varepsilon$-DP, we also prove the optimal regret bound is $\tilde{Θ}(\frac{mKB^2_1\ln T}{Δε})$ with both an upper bound and a lower bound, where $K$ is the maximum number of feedback in each round. All of the above results nearly match the corresponding non-private optimal rates, which implies there is no additional price for (locally) differentially private CSB in the above common settings.

preprint · 2020 · arXiv

Multivariate Regression of Mixed Responses for Evaluation of Visualization Designs

Information visualization significantly enhances human perception by graphically representing complex data sets. The variety of visualization designs makes it challenging to efficiently evaluate all possible designs catering to users' preferences and characteristics. Most existing evaluation methods perform user studies to obtain multivariate qualitative responses from users via questionnaires and interviews. However, these methods cannot support online evaluation of designs as they are often time-consuming. A statistical model is desired to predict users' preferences on visualization designs based on non-interference measurements (i.e., wearable sensor signals). In this work, we propose a multivariate regression of mixed responses (MRMR) to facilitate quantitative evaluation of visualization designs. The proposed MRMR method is able to provide accurate model prediction with meaningful variable selection. A simulation study and a user study evaluating visualization designs with 14 effective participants are conducted to illustrate the merits of the proposed model.

preprint · 2020 · arXiv

Noisy Agents: Self-supervised Exploration by Predicting Auditory Events

Humans integrate multiple sensory modalities (e.g. visual and audio) to build a causal understanding of the physical world. In this work, we propose a novel type of intrinsic motivation for Reinforcement Learning (RL) that encourages the agent to understand the causal effect of its actions through auditory event prediction. First, we allow the agent to collect a small amount of acoustic data and use K-means to discover underlying auditory event clusters. We then train a neural network to predict the auditory events and use the prediction errors as intrinsic rewards to guide RL exploration. Experimental results on Atari games show that our new intrinsic motivation significantly outperforms several state-of-the-art baselines. We further visualize our noisy agents' behavior in a physics environment and demonstrate that our newly designed intrinsic reward leads to the emergence of physical interaction behaviors (e.g. contact with objects).
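The reward signal described in this abstract can be sketched in a few lines. The sketch below assumes the auditory event clusters have already been found (for example by K-means) and uses cross-entropy as the prediction error; the centroids, the 2-D features, the choice of cross-entropy, and the function names are all assumptions for illustration, not details taken from the paper.

```python
import math


def nearest_cluster(feature, centroids):
    """Label an acoustic feature with the index of its closest centroid (squared Euclidean distance)."""
    return min(
        range(len(centroids)),
        key=lambda i: sum((f - c) ** 2 for f, c in zip(feature, centroids[i])),
    )


def intrinsic_reward(predicted_probs, event_label):
    """Prediction error for the observed event, here the cross-entropy -log p(event)."""
    return -math.log(max(predicted_probs[event_label], 1e-12))


# Two made-up auditory event clusters in a 2-D feature space.
centroids = [[0.0, 0.0], [1.0, 1.0]]
observed_feature = [0.9, 0.8]
label = nearest_cluster(observed_feature, centroids)

# A predictor that did not expect this event yields a larger reward
# than one that assigned it high probability.
reward_when_surprised = intrinsic_reward([0.9, 0.1], label)
reward_when_expected = intrinsic_reward([0.1, 0.9], label)
```

In a full agent the predicted probabilities would come from a neural network conditioned on the current state and action, and the intrinsic reward would be added to, or used in place of, the environment reward during exploration.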

preprint · 2019 · arXiv

Dynamic 3-D measurement based on fringe-to-fringe transformation using deep learning

Fringe projection profilometry (FPP) has become increasingly important in dynamic 3-D shape measurement. In FPP, it is necessary to retrieve the phase of the measured object before shape profiling. However, traditional phase retrieval techniques often require a large number of fringes, which may generate motion-induced error for dynamic objects. In this paper, a novel phase retrieval technique based on deep learning is proposed, which uses an end-to-end deep convolutional neural network to transform one or two fringes into the fringes required for phase retrieval. When the object's surface lies within a restricted depth range, the presented network requires only a single fringe as input; for an unrestricted depth range, it requires two fringes. The proposed phase retrieval technique is first theoretically analyzed, and then numerically and experimentally verified for its applicability to dynamic 3-D measurement.

preprint · 2016 · arXiv

Convex PBW-type Lyndon Bases and Restricted Two-Parameter Quantum Group of Type $F_4$

We determine convex PBW-type Lyndon bases for two-parameter quantum groups $U_{r,s}(F_4)$ with detailed commutation relations. We construct a finite-dimensional Hopf algebra $\mathfrak u_{r,s}(F_4)$ as a quotient of $U_{r,s}(F_4)$ by a Hopf ideal generated by certain central elements; it is pointed and, under a certain condition, has a Drinfel'd double structure. All Hopf isomorphisms of $\mathfrak u_{r,s}(F_4)$ are determined; these are important for seeking possible new pointed objects of low order with $(\ell, 210)\ne 1$. Finally, necessary and sufficient conditions for $\mathfrak u_{r,s}(F_4)$ to be a ribbon Hopf algebra are singled out by describing the left and right integrals.

preprint2016arXiv

On High-Order Capacity Statistics of Spectrum Aggregation Systems over $κ$-$μ$ and $κ$-$μ$ shadowed Fading Channels

The frequency scarcity imposed by the fast-growing demand for mobile data services calls for spectrum aggregation systems. The higher-order statistics (HOS) of the channel capacity are a suitable metric of system performance. While prior works have improved our knowledge of the HOS characterization of spectrum aggregation systems, an analytical framework encompassing generalized fading models of interest is not yet available. In this paper, we pursue a detailed HOS analysis of $κ$-$μ$ and $κ$-$μ$ shadowed fading channels by deriving novel and exact expressions. Furthermore, simplified HOS expressions for the asymptotically low and high signal-to-noise regimes are derived. Several important statistical measures, such as amount of fading, amount of dispersion, reliability, skewness, and kurtosis, are obtained from the HOS results. More importantly, the implications of system and fading parameters for channel selection in spectrum aggregation systems are investigated. Finally, all derived expressions are validated via Monte-Carlo simulations.
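
A Monte-Carlo check of the kind mentioned in the abstract can be sketched as follows. This is not the paper's code: it assumes the usual physical model of $κ$-$μ$ fading for integer $μ$ (a sum of $μ$ clusters, each with a dominant component plus Gaussian scatter), a single channel rather than an aggregated one, and "amount of dispersion" defined as variance over mean. The names `kappa_mu_snr` and `capacity_statistics` are hypothetical.

```python
import math
import random

def kappa_mu_snr(kappa, mu, mean_snr, rng):
    """Draw one instantaneous SNR under kappa-mu fading (integer mu only).

    Assumed model: mu clusters, each with Gaussian in-phase and quadrature
    scattered parts plus a dominant component; power is normalised to mean 1.
    """
    sigma = math.sqrt(1.0 / (2.0 * mu * (1.0 + kappa)))  # scattered std per component
    p = math.sqrt(kappa / ((1.0 + kappa) * mu))          # dominant amplitude per cluster
    power = 0.0
    for _ in range(mu):
        x = rng.gauss(p, sigma)
        y = rng.gauss(0.0, sigma)
        power += x * x + y * y
    return mean_snr * power

def capacity_statistics(capacities):
    """Sample mean, amount of dispersion, skewness and kurtosis of capacity."""
    n = len(capacities)
    mean = sum(capacities) / n
    m2, m3, m4 = (sum((c - mean) ** k for c in capacities) / n for k in (2, 3, 4))
    return {
        "mean": mean,
        "amount_of_dispersion": m2 / mean,  # assumed definition: variance / mean
        "skewness": m3 / m2 ** 1.5,
        "kurtosis": m4 / m2 ** 2,
    }

rng = random.Random(0)
snrs = [kappa_mu_snr(kappa=1.0, mu=2, mean_snr=10.0, rng=rng) for _ in range(20000)]
stats = capacity_statistics([math.log2(1.0 + g) for g in snrs])
```

Closed-form HOS expressions such as those derived in the paper would be compared against the entries of `stats` for matching values of $κ$, $μ$ and average SNR.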

preprint2015arXiv

On a problem from the Kourovka Notebook

In this manuscript, a solution to Problem 18.91(b) in the Kourovka Notebook is given by proving the following theorem. Let $P$ be a Sylow $p$-subgroup of a group $G$ with $|P| = p^n$. Suppose that there is an integer $k$ such that $1 < k < n$ and every subgroup of $P$ of order $p^k$ is $S$-propermutable in $G$, and also, in the case that $p=2$, $k = 1$ and $P$ is non-abelian, every cyclic subgroup of $P$ of order $4$ is $S$-propermutable in $G$. Then $G$ is $p$-nilpotent.

preprint2015arXiv

On finite groups with some primary subgroups satisfying partial $S$-$Π$-property

A $p$-subgroup $H$ of a finite group $G$ is said to satisfy partial $S$-$Π$-property in $G$ if $G$ has a chief series $Γ_{G}: 1=G_{0}<G_{1}<\cdots<G_{n}=G$ such that for every $G$-chief factor $G_{i}/G_{i-1}$ $(1\leqslant i\leqslant n)$ of $Γ_{G}$, either $(H\cap G_{i})G_{i-1}/G_{i-1}$ is a Sylow $p$-subgroup of $G_{i}/G_{i-1}$ or $|G/G_{i-1}: N_{G/G_{i-1}}((H\cap G_{i})G_{i-1}/G_{i-1})|$ is a $p$-number. In this paper, we mainly investigate the structure of finite groups with some primary subgroups satisfying partial $S$-$Π$-property.

preprint2015arXiv

On supersolubility of finite groups admitting a Frobenius group of automorphisms with fixed-point-free kernel

Assume that a finite group $G$ admits a Frobenius group of automorphisms $FH$ with kernel $F$ and complement $H$ such that $C_{G}(F)=1$. In this paper, we investigate this situation and prove that if $C_G(H)$ is supersoluble and $C_{G'}(H)$ is nilpotent, then $G$ is supersoluble. Also, we show that $G$ is a Sylow tower group of a certain type if $C_{G}(H)$ is a Sylow tower group of the same type.

preprint2015arXiv

On the converse of Hall's theorem

In this paper, we mainly investigate the converse of a well-known theorem proved by P. Hall, and present detailed characterizations under the various assumptions of the existence of some families of Hall subgroups. In particular, we prove that if $p\neq 3$ and a finite group $G$ has a Hall $\{p,q\}$-subgroup for every prime $q\neq p$, then $G$ is $p$-soluble.

preprint2014arXiv

Automated Generation of Geometric Theorems from Images of Diagrams

We propose an approach to generate geometric theorems from electronic images of diagrams automatically. The approach makes use of techniques of Hough transform to recognize geometric objects and their labels and of numeric verification to mine basic geometric relations. Candidate propositions are generated from the retrieved information by using six strategies and geometric theorems are obtained from the candidates via algebraic computation. Experiments with a preliminary implementation illustrate the effectiveness and efficiency of the proposed approach for generating nontrivial theorems from images of diagrams. This work demonstrates the feasibility of automated discovery of profound geometric knowledge from simple image data and has potential applications in geometric knowledge management and education.
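
The abstract names the Hough transform only in general terms. A minimal sketch of the classical line-detection variant is given below, assuming edge pixels have already been extracted from the diagram; the function `hough_lines`, the one-degree angular resolution and the sample pixels are illustrative assumptions, and label recognition is not covered.

```python
import math
from collections import Counter

def hough_lines(points, theta_steps=180):
    """Vote in (theta index, rho) space; rho = x*cos(theta) + y*sin(theta)."""
    votes = Counter()
    for x, y in points:
        for t in range(theta_steps):
            theta = math.pi * t / theta_steps
            rho = round(x * math.cos(theta) + y * math.sin(theta))
            votes[(t, rho)] += 1
    return votes

# Hypothetical edge pixels lying on the vertical line x = 2.
pixels = [(2, 0), (2, 40), (2, 80), (2, 120)]
votes = hough_lines(pixels)
(best_theta, best_rho), count = votes.most_common(1)[0]
```

The accumulator cell with the most votes identifies the line shared by the pixels; in a theorem-generation setting, the detected lines and circles would then feed the numeric verification step that mines basic geometric relations.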

preprint2014arXiv

Finite groups in which SS-permutability is a transitive relation

A subgroup $H$ of a finite group $G$ is said to be SS-permutable in $G$ if $H$ has a supplement $K$ in $G$ such that $H$ permutes with every Sylow subgroup of $K$. A finite group $G$ is called an SST-group if SS-permutability is a transitive relation on the set of all subgroups of $G$. The structure of SST-groups is investigated in this paper.

preprint2014arXiv

On $Π$-supplemented subgroups of a finite group

A subgroup $H$ of a finite group $G$ is said to satisfy $Π$-property in $G$ if for every chief factor $L/K$ of $G$, $|G/K:N_{G/K}(HK/K\cap L/K)|$ is a $π(HK/K\cap L/K)$-number. A subgroup $H$ of $G$ is said to be $Π$-supplemented in $G$ if there exists a subgroup $T$ of $G$ such that $G=HT$ and $H\cap T\leq I\leq H$, where $I$ satisfies $Π$-property in $G$. In this paper, we investigate the structure of a finite group $G$ under the assumption that some primary subgroups of $G$ are $Π$-supplemented in $G$. Our main result improves a large number of earlier results.

preprint2014arXiv

On HC-subgroups of a finite group

A subgroup $H$ of a finite group $G$ is said to be an $\mathscr{H}C$-subgroup of $G$ if there exists a normal subgroup $T$ of $G$ such that $G=HT$ and $H^g \cap N_T(H)\leq H$ for all $g\in G$. In this paper, we investigate the structure of a finite group $G$ under the assumption that certain subgroups of $G$ of arbitrary prime power order are $\mathscr{H}C$-subgroups of $G$.

preprint2014arXiv

On partial $Π$-property of subgroups of finite groups

Let $H$ be a subgroup of a finite group $G$. We say that $H$ satisfies partial $Π$-property in $G$ if there exists a chief series $\mathit{Γ}_G:1=G_0<G_1<\cdots<G_n=G$ of $G$ such that for every $G$-chief factor $G_i/G_{i-1}$ ($1\leq i\leq n$) of $\mathit{Γ}_G$, $|G/G_{i-1}:N_{G/G_{i-1}}(HG_{i-1}/G_{i-1}\cap G_i/G_{i-1})|$ is a $π(HG_{i-1}/G_{i-1}\cap G_i/G_{i-1})$-number. Our main results are listed here: Theorem A. Let $\mathfrak{F}$ be a solubly saturated formation containing $\mathfrak{U}$ and $E$ a normal subgroup of $G$ with $G/E\in \mathfrak{F}$. Let $X\unlhd G$ such that $F_p^*(E)\leq X\leq E$. Suppose that for any Sylow $p$-subgroup $P$ of $X$, every maximal subgroup of $P$ satisfies partial $Π$-property in $G$. Then one of the following holds: (1) $G\in \mathfrak{G}_{p'}\mathfrak{F}$. (2) $X/O_{p'}(X)$ is a quasisimple group with Sylow $p$-subgroups of order $p$. In particular, if $X=F_p^*(E)$, then $X/O_{p'}(X)$ is a simple group. Theorem B. Let $\mathfrak{F}$ be a solubly saturated formation containing $\mathfrak{U}$ and $E$ a normal subgroup of $G$ with $G/E\in \mathfrak{F}$. Suppose that for any Sylow $p$-subgroup $P$ of $F_p^*(E)$, every cyclic subgroup of $P$ of prime order or order 4 (when $P$ is not quaternion-free) satisfies partial $Π$-property in $G$. Then $G\in \mathfrak{G}_{p'}\mathfrak{F}$.

preprint2014arXiv

On the $π\mathfrak{F}$-norm and the $\mathfrak{H}$-$\mathfrak{F}$-norm of a finite group

Let $\mathfrak{H}$ be a Fitting class and $\mathfrak{F}$ a formation. We call a subgroup $\mathcal{N}_{\mathfrak{H},\mathfrak{F}}(G)$ of a finite group $G$ the $\mathfrak{H}$-$\mathfrak{F}$-norm of $G$ if $\mathcal{N}_{\mathfrak{H},\mathfrak{F}}(G)$ is the intersection of the normalizers of the products of the $\mathfrak{F}$-residuals of all subgroups of $G$ and the $\mathfrak{H}$-radical of $G$. Let $π$ denote a set of primes and let $\mathfrak{G}_π$ denote the class of all finite $π$-groups. We call the subgroup $\mathcal{N}_{\mathfrak{G}_π,\mathfrak{F}}(G)$ of $G$ the $π\mathfrak{F}$-norm of $G$. A normal subgroup $N$ of $G$ is called $π\mathfrak{F}$-hypercentral in $G$ if either $N=1$ or $N>1$ and every $G$-chief factor below $N$ of order divisible by at least one prime in $π$ is $\mathfrak{F}$-central in $G$. Let $Z_{π\mathfrak{F}}(G)$ denote the $π\mathfrak{F}$-hypercentre of $G$, that is, the product of all $π\mathfrak{F}$-hypercentral normal subgroups of $G$. In this paper, we study the properties of the $\mathfrak{H}$-$\mathfrak{F}$-norm, especially of the $π\mathfrak{F}$-norm of a finite group $G$. In particular, we investigate the relationship between the $π'\mathfrak{F}$-norm and the $π\mathfrak{F}$-hypercentre of $G$.

preprint2014arXiv

On weakly $\frak{F}_{s}$-quasinormal subgroups of finite groups

Let $\mathfrak{F}$ be a formation and $G$ a finite group. A subgroup $H$ of $G$ is said to be weakly $\mathfrak{F}_{s}$-quasinormal in $G$ if $G$ has an $S$-quasinormal subgroup $T$ such that $HT$ is $S$-quasinormal in $G$ and $(H\cap T)H_{G}/H_{G}\leq Z_{\mathfrak{F}}(G/H_{G})$, where $Z_{\mathfrak{F}}(G/H_{G})$ denotes the $\mathfrak{F}$-hypercenter of $G/H_{G}$. In this paper, we study the structure of finite groups by using the concept of weakly $\mathfrak{F}_{s}$-quasinormal subgroups.

preprint2014arXiv

The influence of $\mathfrak{F_{\mathrm s}}$-quasinormality of subgroups on the structure of finite groups

Let $\frak{F}$ be a class of finite groups. A subgroup $H$ of a finite group $G$ is said to be $\mathfrak{F_{\mathrm s}}$-quasinormal in $G$ if there exists a normal subgroup $T$ of $G$ such that $HT$ is $s$-permutable in $G$ and $(H\cap T)H_G/H_G$ is contained in the $\frak{F}$-hypercenter $Z_\infty^\mathfrak{F}(G/H_G)$ of $G/H_G$. In this paper, we further investigate the influence of $\mathfrak{F_{\mathrm s}}$-quasinormality of some subgroups on the structure of finite groups. New characterizations of some classes of finite groups are obtained.

preprint2014arXiv

The Spaces of Data, Information, and Knowledge

We study the data space $D$ of any given data set $X$ and explain how functions and relations are defined over $D$. From $D$ and for a specific domain $Δ$ we construct the information space $I$ of $X$ by interpreting variables, functions, and explicit relations over $D$ in $Δ$ and by including other relations that $D$ implies under the interpretation in $Δ$. Then from $I$ we build up the knowledge space $K$ of $X$ as the product of two spaces $K_T$ and $K_P$, where $K_T$ is obtained from $I$ by using the induction principle to generalize propositional relations to quantified relations, the deduction principle to generate new relations, and standard mechanisms to validate relations and $K_P$ is the space of specifications of methods with operational instructions which are valid in $K_T$. Through our construction of the three topological spaces the following key observation is made clear: the retrieval of information from the given data set for $Δ$ consists essentially in mining domain objects and relations, and the discovery of knowledge from the retrieved information consists essentially in applying the induction and deduction principles to generate propositions, synthesizing and modeling the information to generate specifications of methods with operational instructions, and validating the propositions and specifications. Based on this observation, efficient approaches may be designed to discover profound knowledge automatically from simple data, as demonstrated by the result of our study in the case of geometry.

preprint2013arXiv

On weakly S-embedded subgroups and weakly $τ$-embedded subgroups

Let $G$ be a finite group. A subgroup $H$ of $G$ is said to be weakly S-embedded in $G$ if there exists $K\unlhd G$ such that $HK$ is S-quasinormal in $G$ and $H\cap K\leq H_{seG}$, where $H_{seG}$ is the subgroup generated by all those subgroups of $H$ which are S-quasinormally embedded in $G$. We say that $H$ is weakly $τ$-embedded in $G$ if there exists $K\unlhd G$ such that $HK$ is S-quasinormal in $G$ and $H\cap K\leq H_{τG}$, where $H_{τG}$ is the subgroup generated by all those subgroups of $H$ which are $τ$-quasinormal in $G$. In this paper, we study the properties of the weakly S-embedded subgroups and the weakly $τ$-embedded subgroups, and use them to determine the structure of finite groups.

preprint2010arXiv

Electronic Geometry Textbook: A Geometric Textbook Knowledge Management System

Electronic Geometry Textbook is a knowledge management system that manages geometric textbook knowledge to enable users to construct and share dynamic geometry textbooks interactively and efficiently. Based on a knowledge base organizing and storing the knowledge represented in specific languages, the system implements interfaces for maintaining the data representing that knowledge as well as relations among those data, for automatically generating readable documents for viewing or printing, and for automatically discovering the relations among knowledge data. An interface has been developed for users to create geometry textbooks with automatic checking, in real time, of the consistency of the structure of each resulting textbook. By integrating an external geometric theorem prover and an external dynamic geometry software package, the system offers the facilities for automatically proving theorems and generating dynamic figures in the created textbooks. This paper provides a comprehensive account of the current version of Electronic Geometry Textbook.

39 published item(s)

Intelligent Elastic Feature Fading: Enabling Model Retrain-Free Feature Efficiency Rollouts at Scale

Rapid Mixing at the Uniqueness Threshold

UAM: A Dual-Stream Perspective on Forgetting in VLA Training

VLM4VLA: Revisiting Vision-Language-Models in Vision-Language-Action Models

Flow-based Recurrent Belief State Learning for POMDPs

Generalized Radiograph Representation Learning via Cross-supervision between Images and Free-text Radiology Reports

Human-in-the-loop: Provably Efficient Preference-based Reinforcement Learning with General Function Approximation

Irreducible Modules of Reductive Groups with Borel-stable Line

LDC-VAE: A Latent Distribution Consistency Approach to Variational AutoEncoders

Near-Optimal Reward-Free Exploration for Linear Mixture MDPs with Plug-in Solver

Optimal mixing for two-state anti-ferromagnetic spin systems

Towards better understanding and better generalization of few-shot classification in histology images with contrastive learning

Understanding Domain Randomization for Sim-to-real Transfer

WenetSpeech: A 10000+ Hours Multi-domain Mandarin Corpus for Speech Recognition

Distinct Properties of Vortex Bound States Driven by Temperature

Modeling Method for the Coupling Relations of Microgrid Cyber-Physical Systems Driven by Hybrid Spatiotemporal Events

Near-optimal Representation Learning for Linear Bandits and Linear RL

U2++: Unified Two-pass Bidirectional End-to-end Model for Speech Recognition

(Locally) Differentially Private Combinatorial Semi-Bandits

Multivariate Regression of Mixed Responses for Evaluation of Visualization Designs

Noisy Agents: Self-supervised Exploration by Predicting Auditory Events

Dynamic 3-D measurement based on fringe-to-fringe transformation using deep learning

Convex PBW-type Lyndon Bases and Restricted Two-Parameter Quantum Group of Type $F_4$

On High-Order Capacity Statistics of Spectrum Aggregation Systems over $κ$-$μ$ and $κ$-$μ$ shadowed Fading Channels

On a problem from the Kourovka Notebook

On finite groups with some primary subgroups satisfying partial $S$-$Π$-property

On supersolubility of finite groups admitting a Frobenius group of automorphisms with fixed-point-free kernel

On the converse of Hall's theorem

Automated Generation of Geometric Theorems from Images of Diagrams

Finite groups in which SS-permutability is a transitive relation

On $Π$-supplemented subgroups of a finite group

On HC-subgroups of a finite group

On partial $Π$-property of subgroups of finite groups

On the $π\mathfrak{F}$-norm and the $\mathfrak{H}$-$\mathfrak{F}$-norm of a finite group

On weakly $\frak{F}_{s}$-quasinormal subgroups of finite groups

The influence of $\mathfrak{F_{\mathrm s}}$-quasinormality of subgroups on the structure of finite groups

The Spaces of Data, Information, and Knowledge

On weakly S-embedded subgroups and weakly $τ$-embedded subgroups

Electronic Geometry Textbook: A Geometric Textbook Knowledge Management System