Source author record

Yifan Chen

Yifan Chen appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.


Catalog footprint

What is connected

39 works

36 topics

4 close collaborators


preprint, 2026, arXiv

Accumulation of Sub-Sampling Matrices with Applications to Statistical Computation

With appropriately chosen sampling probabilities, sampling-based random projection can be used to implement large-scale statistical methods, substantially reducing computational cost while maintaining low statistical error. However, computing optimal sampling probabilities is often itself expensive, and in practice one typically resorts to suboptimal schemes. This generally leads to increased time and space costs, as more subsamples are required and the resulting projection matrices become larger, thereby making the inference procedure more computationally demanding. In this paper, we extend the framework of sampling-based random projection and propose a new projection method, accumulative sub-sampling. By carefully accumulating multiple such projections, accumulative sub-sampling improves statistical efficiency while controlling the effective matrix size throughout the statistical computation. On the theoretical side, we quantify how the quality of the subsampling scheme affects the error in approximating matrix products and positive semidefinite matrices, and show how the proposed accumulation strategy mitigates this effect. Moreover, we apply our method to statistical models involving intensive matrix operations, such as eigendecomposition in spectral clustering and matrix inversion in kernel ridge regression, and demonstrate that reducing the effective matrix size leads to substantial computational savings. Numerical experiments across a range of problems further show that our approach consistently improves computational efficiency compared to existing random projection baselines under suboptimal sampling schemes.
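As a rough illustration of the sampling-based random projection that this abstract builds on, the Python sketch below estimates a Gram matrix A^T A from a few rows sampled with replacement and rescaled by their sampling probabilities. It shows only the generic single-sketch baseline, not the paper's accumulative sub-sampling. The function names and the squared-row-norm sampling scheme are assumptions chosen for illustration.

```python
import random


def norm_squared_probs(rows):
    """Sampling probabilities proportional to squared row norms (illustrative choice)."""
    sq = [sum(x * x for x in r) for r in rows]
    total = sum(sq)
    return [v / total for v in sq]


def exact_gram(rows):
    """Exact Gram matrix A^T A, where A has the given rows."""
    d = len(rows[0])
    return [[sum(r[a] * r[b] for r in rows) for b in range(d)] for a in range(d)]


def sampled_gram(rows, probs, s, rng):
    """Unbiased estimate of A^T A from s rows drawn with replacement.

    Each sampled row i contributes r_i r_i^T / (s * p_i), so the expectation
    over the draws equals the exact Gram matrix.
    """
    d = len(rows[0])
    est = [[0.0] * d for _ in range(d)]
    for i in rng.choices(range(len(rows)), weights=probs, k=s):
        w = 1.0 / (s * probs[i])
        r = rows[i]
        for a in range(d):
            for b in range(d):
                est[a][b] += w * (r[a] * r[b])
    return est


def frobenius_error(m1, m2):
    """Frobenius norm of the difference between two equally sized matrices."""
    return sum((x - y) ** 2 for r1, r2 in zip(m1, m2) for x, y in zip(r1, r2)) ** 0.5


if __name__ == "__main__":
    rng = random.Random(0)
    # Hypothetical data: 200 rows in 3 dimensions.
    rows = [[rng.gauss(0, 1) for _ in range(3)] for _ in range(200)]
    probs = norm_squared_probs(rows)
    for s in (10, 50, 200):
        est = sampled_gram(rows, probs, s, rng)
        print(s, frobenius_error(est, exact_gram(rows)))
```

With a poorly chosen `probs`, more samples `s` are typically needed for the same error, which is the cost that the abstract attributes to suboptimal sampling schemes.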

preprint, 2026, arXiv

AxiomOcean: Forecasting the Three-Dimensional Structure of the Upper Ocean

Short-term ocean forecast skill depends strongly on the three-dimensional structure of the upper ocean, which governs stratification, subsurface heat storage, and the response of the ocean to atmospheric forcing. However, AI ocean forecasting models often fail to preserve this vertical structure, resulting in over-smoothed subsurface features and weak physical consistency under strong forcing. Here, we present AxiomOcean, a global AI ocean forecasting model that explicitly represents vertical hierarchy and cross-layer dependence within the water column. By combining a fully three-dimensional encoder-backbone-decoder architecture with surface atmospheric forcing, AxiomOcean jointly predicts upper-ocean temperature, salinity, and three-dimensional currents at global 1/12° resolution down to 643 m depth. In 10-day forecasts, AxiomOcean outperforms an advanced AI comparison model across variables and lead times, reducing day-1 RMSE by approximately 20 to 35% while maintaining higher anomaly correlation. The gain is not achieved through excessive smoothing: AxiomOcean better preserves eddy kinetic energy and the variance of temperature and salinity. Its advantage also extends through the water column and remains evident across the equatorial Pacific, Kuroshio Extension, and Southern Ocean, yielding a more realistic reconstruction of upper-ocean heat content. These results show that explicitly preserving upper-ocean three-dimensional structure can improve both forecast accuracy and physical fidelity in AI ocean prediction.

preprint, 2026, arXiv

Cavity Multimodes as an Array for High-Frequency Gravitational Waves

Microwave cavities operated in the presence of a background magnetic field provide a promising avenue for detecting high-frequency gravitational waves (HFGWs). We demonstrate for the first time that the distinct antenna patterns of multiple electromagnetic modes within a single cavity enable localization and reconstruction of key properties of an incoming HFGW signal, including its polarization ratio and frequency drift rate. Using a 9-cell cavity commonly employed in particle accelerators as a representative example, we analyze the time-domain response of 18 nearly degenerate modes, which can be sequentially excited by a frequency-drifting signal. The sensitivity is further enhanced by the number of available modes, in close analogy to the scaling achieved by a network of independent detectors, enabling sensitivity to astrophysically plausible binary sources.

preprint, 2026, arXiv

CCL-D: A High-Precision Diagnostic System for Slow and Hang Anomalies in Large-Scale Model Training

As training scales grow, collective communication libraries (CCL) increasingly face anomalies arising from complex interactions among hardware, software, and environmental factors. These anomalies typically manifest as slow/hang communication, the most frequent and time-consuming category to diagnose. However, traditional diagnostic methods remain inaccurate and inefficient, frequently requiring hours or even days for root cause analysis. To address this, we propose CCL-D, a high-precision diagnostic system designed to detect and locate slow/hang anomalies in large-scale distributed training. CCL-D integrates a rank-level real-time probe with an intelligent decision analyzer. The probe measures cross-layer anomaly metrics using a lightweight distributed tracing framework to monitor communication traffic. The analyzer performs automated anomaly detection and root-cause location, precisely identifying the faulty GPU rank. Deployed on a 4,000-GPU cluster over one year, CCL-D achieved near-complete coverage of known slow/hang anomalies and pinpointed affected ranks within 6 minutes, substantially outperforming existing solutions.

preprint, 2026, arXiv

How You Begin is How You Reason: Driving Exploration in RLVR via Prefix-Tuned Priors

Reinforcement learning with verifiable rewards (RLVR) has recently thrived in large language model (LLM) reasoning tasks. However, reward sparsity and the long reasoning horizon make effective exploration challenging. In practice, this challenge manifests as the entropy collapse phenomenon, where RLVR improves single-rollout accuracy but fails to expand coverage of successful reasoning trajectories. Passive exploration techniques such as entropy regularization tend to neglect generation quality, resulting in noisy rollouts. In response to this issue, we propose an Information-Maximizing Augmented eXploration (IMAX) framework to train a pool of soft prefixes that reshapes the base model's prior over reasoning trajectories. Rather than relying on RL to incentivize exploration on top of the base model, each prefix acts as a trainable control knob that induces a distinct rollout distribution from the same backbone model. To encourage discovery of diverse and task-relevant reasoning behaviors, we derive an Information Maximization (InfoMax) reward to complement the verifiable rewards for RL training. IMAX is in general algorithm-agnostic and can be seamlessly integrated into existing RLVR pipelines. Experimental results show that across three backbone scales, IMAX consistently improves reasoning performance over standard RLVR, with gains of up to 11.60% in Pass@4 and 10.57% in Avg@4.
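The abstract reports Pass@4 and Avg@4 without defining them. As a hedged sketch, the Python below computes the widely used unbiased pass@k estimator, 1 - C(n-c, k) / C(n, k), and a plain mean accuracy over rollouts. Both definitions are assumptions about what the metrics mean here, and the function names are hypothetical.

```python
from math import comb


def pass_at_k(n, c, k):
    """Unbiased pass@k estimate from n rollouts of which c are correct.

    Equals the probability that a random size-k subset of the n rollouts
    contains at least one correct rollout.
    """
    if not 0 <= c <= n:
        raise ValueError("c must satisfy 0 <= c <= n")
    if not 1 <= k <= n:
        raise ValueError("k must satisfy 1 <= k <= n")
    # comb(n - c, k) is 0 when fewer than k rollouts are incorrect.
    return 1.0 - comb(n - c, k) / comb(n, k)


def avg_at_k(correct_flags):
    """Mean accuracy over the rollouts of one problem (assumed meaning of Avg@k)."""
    if not correct_flags:
        raise ValueError("need at least one rollout")
    return sum(1 for flag in correct_flags if flag) / len(correct_flags)


if __name__ == "__main__":
    # Hypothetical example: 8 rollouts for one problem, 2 of them correct.
    print(pass_at_k(n=8, c=2, k=4))
    print(avg_at_k([True, False, False, True]))
```

Under these definitions, a policy whose rollouts are all identical can raise the average accuracy without raising pass@k, which is one way to read the coverage problem that the abstract describes.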

preprint, 2026, arXiv

IG-Diff: Complex Night Scene Restoration with Illumination-Guided Diffusion Model

In nighttime circumstances, it is challenging for individuals and machines to perceive their surroundings. While prevailing image restoration methods adeptly handle singular forms of degradation, they falter when confronted with intricate nocturnal scenes, such as the concurrent presence of weather and low-light conditions. Compounding this challenge, the lack of paired data that encapsulates the coexistence of low-light situations and other forms of degradation hinders the development of a comprehensive end-to-end solution. In this work, we contribute complex nighttime scene datasets that simulate both illumination degradation and other forms of deterioration. To address the complexity of night degradation, we propose an integration of an illumination-guided module embedded in the diffusion model to guide the illumination restoration process. Our model can preserve texture fidelity while contending with the adversities posed by various degradations in low-light scenarios.

preprint, 2026, arXiv

MMCL-Bench: Multimodal Context Learning from Visual Rules, Procedures, and Evidence

We introduce MMCL-Bench, a benchmark for multimodal context learning: learning task-local rules, procedures, and empirical patterns from visual or mixed-modality teaching context and applying them to new visual instances. Unlike text-only context learning or standard multimodal question answering, this setting requires models to recover and localize relevant evidence from images, screenshots, manuals, videos, and frame sequences before they can reason over the learned context. MMCL-Bench contains 102 tasks spanning three categories: rule system application, procedural task execution, and empirical discovery and induction. We evaluate frontier multimodal models with strict rubric-based scoring and find that current systems remain far from robust multimodal context learning, with even the strongest model solving fewer than one-third of tasks under strict evaluation. Diagnostic ablations and error analysis show that failures arise throughout the context-to-answer pipeline, including context anchoring, visual evidence extraction, context reasoning, and response construction. MMCL-Bench thus highlights multimodal context learning as an important unsolved capability bottleneck for current multimodal models.

preprint, 2025, arXiv

New affine invariant ensemble samplers and their dimensional scaling

We introduce new affine invariant ensemble Markov chain Monte Carlo (MCMC) samplers that are easy to construct and improve upon existing methods, especially for high-dimensional problems. We first propose a simple derivative-free side move sampler that improves upon popular samplers in the emcee package by generating more effective proposal directions. We then develop a class of derivative-based affine invariant ensemble Hamiltonian Monte Carlo (HMC) samplers based on antisymmetric preconditioning using complementary ensembles, which outperform standard, non-affine-invariant HMC when sampling highly anisotropic distributions. We provide asymptotic scaling analysis for high-dimensional Gaussian targets to further elucidate the properties of these affine invariant ensemble samplers. In particular, with derivative information, the affine invariant ensemble HMC can scale much better with dimension compared to derivative-free ensemble samplers.
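For context on the baseline that this abstract compares against, the Python sketch below implements one sweep of the Goodman-Weare stretch move, the affine invariant ensemble move that emcee uses by default. It is not the side move or the ensemble HMC proposed in the paper. The serial update order, the function name, and the example target are assumptions made for illustration.

```python
import math
import random


def stretch_move_sweep(walkers, log_prob, rng, a=2.0):
    """One serial sweep of the stretch move over an ensemble of walkers.

    walkers  : list of positions (lists of floats), at least two walkers
    log_prob : function returning the log target density at a position
    a        : stretch scale parameter, must be greater than 1
    """
    n = len(walkers)
    if n < 2:
        raise ValueError("need at least two walkers")
    dim = len(walkers[0])
    out = [list(w) for w in walkers]
    for k in range(n):
        # Pick a complementary walker j != k uniformly at random.
        j = rng.randrange(n - 1)
        if j >= k:
            j += 1
        # Draw z with density proportional to 1/sqrt(z) on [1/a, a].
        z = ((a - 1.0) * rng.random() + 1.0) ** 2 / a
        proposal = [out[j][i] + z * (out[k][i] - out[j][i]) for i in range(dim)]
        # Acceptance ratio z^(dim - 1) * pi(proposal) / pi(current), in log form.
        log_ratio = (dim - 1) * math.log(z) + log_prob(proposal) - log_prob(out[k])
        # 1 - random() lies in (0, 1], so the logarithm is always defined.
        if math.log(1.0 - rng.random()) < log_ratio:
            out[k] = proposal
    return out


if __name__ == "__main__":
    rng = random.Random(0)

    def log_gauss(x):
        # Hypothetical anisotropic Gaussian target: unit scale in x[0], scale 10 in x[1].
        return -0.5 * (x[0] ** 2 + (x[1] / 10.0) ** 2)

    ensemble = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(20)]
    for _ in range(500):
        ensemble = stretch_move_sweep(ensemble, log_gauss, rng)
    print(ensemble[0])
```

Because each proposal lies on the line through two walkers, the move behaves identically under any invertible affine change of coordinates, which is the property the abstract refers to as affine invariance.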

preprint, 2022, arXiv

A FAIR and AI-ready Higgs boson decay dataset

To enable the reusability of massive scientific datasets by humans and machines, researchers aim to adhere to the principles of findability, accessibility, interoperability, and reusability (FAIR) for data and artificial intelligence (AI) models. This article provides a domain-agnostic, step-by-step assessment guide to evaluate whether or not a given dataset meets these principles. We demonstrate how to use this guide to evaluate the FAIRness of an open simulated dataset produced by the CMS Collaboration at the CERN Large Hadron Collider. This dataset consists of Higgs boson decays and quark and gluon background, and is available through the CERN Open Data Portal. We use additional available tools to assess the FAIRness of this dataset, and incorporate feedback from members of the FAIR community to validate our results. This article is accompanied by a Jupyter notebook to visualize and explore this dataset. This study marks the first in a planned series of articles that will guide scientists in the creation of FAIR AI models and datasets in high energy particle physics.

preprint, 2022, arXiv

Axion Haloscope Array With $\mathcal{PT}$ Symmetry

We generalize the recently proposed $\mathcal{PT}$-symmetric axion haloscope to a larger array with more $\mathcal{PT}$-symmetric structures. By broadening the response bandwidth of the signal without increasing the readout noise, the optimized scan rate of the axion haloscope is significantly enhanced, as is the signal power. Furthermore, we show that the robustness of the detector against variations of the array coupling is strongest when a binary tree structure is introduced, which contains a largely enhanced $\mathcal{PT}$ symmetry. The multiple allowed probing sensors can further increase the scan rate by a factor of the number of sensors due to the correlation of the signals. This type of array can strongly boost the search for an axion compared to single-mode resonant detection. The enhancement to the scan rate becomes most manifest when applied to the proposed detection using a superconducting radio-frequency cavity with an ac magnetic field, where most of the parameter space of the QCD axion above kHz can be probed.

preprint, 2022, arXiv

Chiral Ball and Its Omnidirectional Circularly-Polarized Radiation

Chiral structures have been reported to radiate circularly polarized electromagnetic waves (CPs) in a specific direction. Here we report a class of torus knot radiators that is not only chiral but also three-dimensionally (3-D) rotationally symmetric about the X, Y and Z axes. Because of this exotic chirality and symmetry, the knot radiator presented is able to demonstrate omnidirectional circularly polarized radiation, which has never been reported in any known structure.

preprint, 2022, arXiv

Combining deep learning and crowdsourcing geo-images to predict housing quality in rural China

Housing quality is an essential proxy for regional wealth, security and health. Understanding the distribution of housing quality is crucial for unveiling rural development status and providing political proposals. However, present rural housing quality data depends heavily on top-down, time-consuming surveys at the national or provincial level and fails to unpack housing quality at the village level. To fill the gap between accurately depicting rural housing quality conditions and deficient data, we collect massive rural images and invite users to assess their housing quality at scale. Furthermore, a deep learning framework is proposed to automatically and efficiently predict housing quality based on crowdsourced rural images.

preprint, 2022, arXiv

Contact tracing Inspired Efficient Computation by Energy Tracing

Inspired by the epidemic contact tracing technique, we propose a method to efficiently solve electromagnetics problems by tracing the energy distribution. The computational domain is adaptively decomposed, and the available computational resources are focused on the energy-active (infected) domains and their adjacent (exposed) domains, while avoiding unnecessary computation on energy-null (unexposed) domains. As an example, we employ this method to solve several optics problems. The proposed method shows high efficiency while maintaining good accuracy. The energy tracing method is based on the causality principle, and is therefore potentially transferable to other areas of computational physics and associated algorithms.

preprint, 2022, arXiv

Discovery of post-mass-transfer helium-burning red giants using asteroseismology

A star expands to become a red giant when it has fused all the hydrogen in its core into helium. If the star is in a binary system, its envelope can overflow onto its companion or be ejected into space, leaving a hot core and potentially forming a subdwarf-B star. However, most red giants that have partially transferred envelopes in this way remain cool on the surface and are almost indistinguishable from those that have not. Among $\sim$7000 helium-burning red giants observed by NASA's Kepler mission, we use asteroseismology to identify two classes of stars that must have undergone dramatic mass loss, presumably due to stripping in binary interactions. The first class comprises about 7 underluminous stars with smaller helium-burning cores than their single-star counterparts. Theoretical models show that these small cores imply the stars had much larger masses when ascending the red giant branch. The second class consists of 32 red giants with masses down to 0.5 M$_\odot$, whose implied ages would exceed the age of the universe had no mass loss occurred. The numbers are consistent with binary statistics, and our results open up new possibilities to study the evolution of post-mass-transfer binary systems.

preprint, 2022, arXiv

Dissecting axion and dark photon with a network of vector sensors

We develop formalisms for a network of vector sensors, sensitive to certain spatial components of the signals, to identify the properties of a light axion or a dark photon background. These bosonic fields contribute to vector-like signals in the detectors, including effective magnetic fields triggering the spin precession, effective electric currents in a shielded room, and forces on the matter. The interplay between a pair of vector sensors and a baseline that separates them can potentially uncover rich information of the bosons, including angular distribution, polarization modes, source localization, and macroscopic circular polarization. Using such a network, one can identify the microscopic nature of a potential signal, such as distinguishing between the axion-fermion coupling and the dipole couplings with the dark photon.

preprint, 2022, arXiv

Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization

For online speaker diarization, samples arrive incrementally, and the overall distribution of the samples is invisible. Moreover, in most existing clustering-based methods, the training objective of the embedding extractor is not designed specifically for clustering. To improve online speaker diarization performance, we propose a unified online clustering framework, which enables interaction between embedding extractors and clustering algorithms. Specifically, the framework consists of two highly coupled parts: clustering-guided recurrent training (CGRT) and truncated beam searching clustering (TBSC). The CGRT introduces the clustering algorithm into the training process of embedding extractors, which provides not only cluster-aware information for the embedding extractor, but also crucial parameters for the subsequent clustering process. With these parameters, which contain preliminary information about the metric space, the TBSC penalizes the probability score of each cluster in order to output more accurate clustering results in an online fashion with low latency. With the above innovations, our proposed online clustering system achieves 14.48% DER with collar 0.25 at 2.5s latency on AISHELL-4, while the DER of offline agglomerative hierarchical clustering is 14.57%.

preprint, 2022, arXiv

Microwave Chirality Imaging for the Early Diagnosis of Neurological Degenerative Diseases

We propose a system to visualize the chirality of proteins in the brain, which would be helpful for diagnosing early neurological degenerative diseases in vivo. These neurological degenerative diseases often occur along with certain marker proteins. Through nanoparticle instillation and metamaterial techniques, the chiral effect of the marker proteins is assumed to be manifest in the microwave regime. Therefore, by detecting the cross-polarized transmission, we can detect the chirality that rotates the microwave polarization angle. We developed a numerical method to simulate the electromagnetic response of chiral (bi-isotropic) material. A numerical experiment was then conducted with a numerical head phantom. A map of cross-polarized transmission magnitude can be obtained by sweeping the antenna pair. The imaging results match well with the distribution of chiral materials. This suggests that the proposed method would be capable of in vivo imaging of neurological degenerative disease using microwaves.

preprint, 2022, arXiv

Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational (RAMC) Speech Dataset

This paper introduces a high-quality rich annotated Mandarin conversational (RAMC) speech dataset called MagicData-RAMC. The MagicData-RAMC corpus contains 180 hours of conversational speech data recorded from native speakers of Mandarin Chinese over mobile phones with a sampling rate of 16 kHz. The dialogs in MagicData-RAMC are classified into 15 diversified domains and tagged with topic labels, ranging from science and technology to ordinary life. Accurate transcription and precise speaker voice activity timestamps are manually labeled for each sample. Speakers' detailed information is also provided. As a Mandarin speech dataset designed for dialog scenarios with high quality and rich annotations, MagicData-RAMC enriches the data diversity in the Mandarin speech community and allows extensive research on a series of speech-related tasks, including automatic speech recognition, speaker diarization, topic detection, keyword search, text-to-speech, etc. We also conduct several relevant tasks and provide experimental results to help evaluate the dataset.

preprint, 2022, arXiv

Some new results about $q$-trinomial coefficients

In this paper, we present several new congruences on the $q$-trinomial coefficients introduced by Andrews and Baxter. A new congruence on sums of central $q$-binomial coefficients is also established.

preprint, 2022, arXiv

Spatio-temporal Gait Feature with Global Distance Alignment

Gait recognition is an important recognition technology, because gait is not easy to camouflage and subjects can be recognized without their cooperation. However, many existing methods are inadequate in preserving both temporal information and fine-grained information, thus reducing their discrimination. This problem is more serious when subjects with similar walking postures are identified. In this paper, we try to enhance the discrimination of spatio-temporal gait features from two aspects: effective extraction of spatio-temporal gait features and reasonable refinement of extracted features. Our proposed method consists of Spatio-temporal Feature Extraction (SFE) and Global Distance Alignment (GDA). SFE uses Temporal Feature Fusion (TFF) and Fine-grained Feature Extraction (FFE) to effectively extract the spatio-temporal features from raw silhouettes. GDA uses a large amount of unlabeled real-life gait data as a benchmark to refine the extracted spatio-temporal features. GDA gives the extracted features low inter-class similarity and high intra-class similarity, thus enhancing their discrimination. Extensive experiments on mini-OUMVLP and CASIA-B show that our method achieves better results than some state-of-the-art methods.

preprint, 2022, arXiv

Stringent axion constraints with Event Horizon Telescope polarimetric measurements of M87$^\star$

The hitherto unprecedented angular resolution of the Event Horizon Telescope (EHT) has created exciting opportunities in the search for new physics. Recently, the linear polarization of radiation emitted near the supermassive black hole M87$^\star$ was measured on four separate days, precisely enabling tests of the existence of a dense axion cloud produced by a spinning black hole. The presence of an axion cloud leads to a frequency-independent oscillation in the electric vector position angle (EVPA) of this linear polarization. For a nearly face-on M87$^\star$, this oscillation in the EVPA appears as a propagating wave along the photon ring. In this paper, we leverage the azimuthal distribution of EVPA measured by the EHT to study the axion-photon coupling. We propose a novel differential analysis procedure to reduce the astrophysical background, and derive stringent constraints on the existence of axions in the previously unexplored mass window $\sim (10^{-21}-10^{-20})$~eV.

preprint, 2022, arXiv

Superradiant evolution of the shadow and photon ring of Sgr A$^\star$

Ultralight bosons can affect the dynamics of spinning black holes (BHs) via superradiant instability, which can lead to a time evolution of the supermassive BH shadow. We study prospects for witnessing the superradiance-induced BH shadow evolution, considering ultralight vector and tensor fields. We introduce two observables sensitive to the shadow time-evolution: the shadow drift, and the variation in the azimuthal angle lapse associated to the photon ring autocorrelation. The two observables are shown to be highly complementary, depending on the observer's inclination angle. Focusing on the supermassive object Sgr A$^\star$ we show that both observables can vary appreciably over human timescales of a few years in the presence of superradiant instability, leading to signatures which are well within the reach of the Event Horizon Telescope for realistic observation times (but benefiting significantly from extended observation periods), and paving the way towards probing ultralight bosons in the $\sim 10^{-17}\,{\rm eV}$ mass range.

preprint, 2022, arXiv

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

The conversation scenario is one of the most important and most challenging scenarios for speech processing technologies because people in conversation respond to each other in a casual style. Detecting the speech activities of each person in a conversation is vital to downstream tasks such as natural language processing and machine translation. The technology for detecting "who speaks when" is referred to as speaker diarization (SD). Traditionally, diarization error rate (DER) has been used as the standard evaluation metric for SD systems. However, DER fails to give enough importance to short conversational phrases, which are short but important at the semantic level. Also, a carefully and accurately manually annotated testing dataset suitable for evaluating conversational SD technologies is still unavailable in the speech community. In this paper, we design and describe the Conversational Short-phrase Speaker Diarization (CSSD) task, which consists of training and testing datasets, an evaluation metric and baselines. On the dataset side, in addition to the previously open-sourced 180-hour conversational MagicData-RAMC dataset, we prepare a separate 20-hour conversational speech test dataset with carefully and manually verified speaker timestamp annotations for the CSSD task. On the metric side, we design the new conversational DER (CDER) evaluation metric, which calculates SD accuracy at the utterance level. On the baseline side, we adopt a commonly used method, the Variational Bayes HMM x-vector system, as the baseline of the CSSD task. Our evaluation metric is publicly available at https://github.com/SpeechClub/CDER_Metric.
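To make concrete why a duration-weighted metric can underweight short phrases, the Python sketch below computes the conventional DER as the sum of false alarm, missed speech and speaker confusion time divided by total reference speech time. It does not reproduce CDER, whose exact definition is given in the linked repository rather than in this abstract. The function name and the example durations are hypothetical.

```python
def diarization_error_rate(false_alarm, missed, confusion, total_speech):
    """Conventional duration-based DER.

    All arguments are durations in seconds:
      false_alarm  - non-speech labeled as speech
      missed       - reference speech that was not detected
      confusion    - speech attributed to the wrong speaker
      total_speech - total reference speech time (must be positive)
    """
    if total_speech <= 0:
        raise ValueError("total_speech must be positive")
    if min(false_alarm, missed, confusion) < 0:
        raise ValueError("error durations must be non-negative")
    return (false_alarm + missed + confusion) / total_speech


if __name__ == "__main__":
    # Hypothetical case: a 0.4 s reply is missed entirely in 60 s of speech.
    print(diarization_error_rate(0.0, 0.4, 0.0, 60.0))
    # Hypothetical case: 6 s of assorted errors in the same 60 s.
    print(diarization_error_rate(1.0, 2.0, 3.0, 60.0))
```

In the first case a whole utterance is lost, yet it moves the score by well under one percent because errors are weighted by duration. An utterance-level metric counts that utterance as one error regardless of its length.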

preprint, 2021, arXiv

Fast Statistical Leverage Score Approximation in Kernel Ridge Regression

Nyström approximation is a fast randomized method that rapidly solves kernel ridge regression (KRR) problems through sub-sampling the n-by-n empirical kernel matrix appearing in the objective function. However, the performance of such a sub-sampling method heavily relies on correctly estimating the statistical leverage scores for forming the sampling distribution, which can be as costly as solving the original KRR. In this work, we propose a linear time (modulo poly-log terms) algorithm to accurately approximate the statistical leverage scores in the stationary-kernel-based KRR with theoretical guarantees. Particularly, by analyzing the first-order condition of the KRR objective, we derive an analytic formula, which depends on both the input distribution and the spectral density of stationary kernels, for capturing the non-uniformity of the statistical leverage scores. Numerical experiments demonstrate that with the same prediction accuracy our method is orders of magnitude more efficient than existing methods in selecting the representative sub-samples in the Nyström approximation.
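For readers unfamiliar with the quantities involved, the block below writes out the standard kernel ridge regression solution and the usual ridge leverage score definition. The notation (kernel matrix $K$, regularization $\lambda$, sample size $n$) and the $n\lambda$ normalization are assumptions following common convention and may differ from the paper's. The analytic approximation formula derived in the paper is not reproduced here.

```latex
% Kernel matrix K \in \mathbb{R}^{n \times n} with K_{ij} = k(x_i, x_j).
% KRR objective: \min_{f \in \mathcal{H}} \frac{1}{n} \sum_{i=1}^{n} (y_i - f(x_i))^2 + \lambda \|f\|_{\mathcal{H}}^2
\hat{\alpha} = (K + n\lambda I)^{-1} y,
\qquad
\hat{f}(x) = \sum_{i=1}^{n} \hat{\alpha}_i \, k(x, x_i)

% Statistical (ridge) leverage score of the i-th sample and the induced sampling distribution
\ell_i(\lambda) = \left[ K (K + n\lambda I)^{-1} \right]_{ii},
\qquad
p_i = \frac{\ell_i(\lambda)}{\sum_{j=1}^{n} \ell_j(\lambda)}
```

Computing every $\ell_i(\lambda)$ exactly requires inverting an $n \times n$ matrix, which is why the abstract describes exact leverage score estimation as potentially as costly as solving the original KRR problem.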

preprint, 2020, arXiv

Deep Attention Aware Feature Learning for Person Re-Identification

Visual attention has proven to be effective in improving the performance of person re-identification. Most existing methods apply visual attention heuristically by learning an additional attention map to re-weight the feature maps for person re-identification. However, methods of this kind inevitably increase model complexity and inference time. In this paper, we propose to incorporate attention learning as additional objectives in a person ReID network without changing the original structure, thus maintaining the same inference time and model size. Two kinds of attention are considered, making the learned feature maps aware of the person and of related body parts, respectively. Globally, a holistic attention branch (HAB) makes the feature maps obtained by the backbone focus on persons so as to alleviate the influence of background. Locally, a partial attention branch (PAB) decouples the extracted features into several groups that are separately responsible for different body parts (i.e., keypoints), thus increasing robustness to pose variation and partial occlusion. These two kinds of attention are universal and can be incorporated into existing ReID networks. We have tested their performance on two typical networks (TriNet and Bag of Tricks) and observed significant performance improvement on five widely used datasets.

preprint, 2020, arXiv

Function Approximation via The Subsampled Poincaré Inequality

Function approximation and recovery via some sampled data have long been studied in a wide array of applied mathematics and statistics fields. Analytic tools, such as the Poincaré inequality, have been handy for estimating the approximation errors in different scales. The purpose of this paper is to study a generalized Poincaré inequality, where the measurement function is of subsampled type, with a small but non-zero lengthscale that will be made precise. Our analysis identifies this inequality as a basic tool for function recovery problems. We discuss and demonstrate the optimality of the inequality concerning the subsampled lengthscale, connecting it to existing results in the literature. In application to function approximation problems, the approximation accuracy using different basis functions and under different regularity assumptions is established by using the subsampled Poincaré inequality. We observe that the error bound blows up as the subsampled lengthscale approaches zero, due to the fact that the underlying function is not regular enough to have well-defined pointwise values. A weighted version of the Poincaré inequality is proposed to address this problem; its optimality is also discussed.

preprint, 2020, arXiv

Logarithmic loop corrections, moduli stabilisation and de Sitter vacua in string theory

We study string loop corrections to the gravity kinetic terms in type IIB compactifications on Calabi-Yau threefolds or their orbifold limits, in the presence of $D7$-branes and orientifold planes. We show that they exhibit in general a logarithmic behaviour in the large volume limit transverse to the $D7$-branes, induced by a localised four-dimensional Einstein-Hilbert action that appears at a lower order in the closed string sector, found in the past. Here, we compute the coefficient of the logarithmic corrections and use them to provide an explicit realisation of a mechanism for Kähler moduli stabilisation that we have proposed recently, which does not rely on non-perturbative effects and leads to de Sitter vacua. Our result avoids no-go theorems of perturbative stabilisation due to runaway potentials, in a way similar to the Coleman-Weinberg mechanism, and provides a counterexample to one of the swampland conjectures concerning de Sitter vacua in quantum gravity, once string loop effects are taken into account; it thus paves the way for embedding the Standard Model of particle physics and cosmology in string theory.

preprint, 2020, arXiv

One-to-one Mapping for Unpaired Image-to-image Translation

Recently, image-to-image translation has attracted significant interest in the literature, starting from the successful use of the generative adversarial network (GAN), to the introduction of cyclic constraint, to extensions to multiple domains. However, in existing approaches, there is no guarantee that the mapping between two image domains is unique or one-to-one. Here we propose a self-inverse network learning approach for unpaired image-to-image translation. Building on top of CycleGAN, we learn a self-inverse function by simply augmenting the training samples by swapping inputs and outputs during training and with separated cycle consistency loss for each mapping direction. The outcome of such learning is a proven one-to-one mapping function. Our extensive experiments on a variety of datasets, including cross-modal medical image synthesis, object transfiguration, and semantic labeling, consistently demonstrate clear improvement over the CycleGAN method both qualitatively and quantitatively. In particular, our proposed method reaches the state-of-the-art result on the Cityscapes benchmark dataset for the label-to-photo unpaired directional image translation.

preprint, 2020, arXiv

Probing Axions with Event Horizon Telescope Polarimetric Measurements

With high spatial resolution, polarimetric imaging of a supermassive black hole, like M87$^\star$ or Sgr A$^\star$, by the Event Horizon Telescope can be used to probe the existence of ultralight bosonic particles, such as axions. Such particles can accumulate around a rotating black hole through the superradiance mechanism, forming an axion cloud. When linearly polarized photons are emitted from an accretion disk near the horizon, their position angles oscillate due to the birefringent effect when traveling through the axion background. In particular, the observations of supermassive black holes M87$^\star$ (Sgr A$^\star$) can probe the dimensionless axion-photon coupling $c = 2\pi g_{a\gamma} f_a$ for axions with mass around $O(10^{-20})$~eV ($O(10^{-17})$~eV) and decay constant $f_a < O(10^{16})$ GeV, which is complementary to other axion measurements.

preprint, 2020, arXiv

Towards Question-based Recommender Systems

Conversational and question-based recommender systems have gained increasing attention in recent years, with users enabled to converse with the system and better control recommendations. Nevertheless, research in the field is still limited, compared to traditional recommender systems. In this work, we propose a novel Question-based recommendation method, Qrec, to assist users to find items interactively, by answering automatically constructed and algorithmically chosen questions. Previous conversational recommender systems ask users to express their preferences over items or item facets. Our model, instead, asks users to express their preferences over descriptive item features. The model is first trained offline by a novel matrix factorization algorithm, and then iteratively updates the user and item latent factors online by a closed-form solution based on the user answers. Meanwhile, our model infers the underlying user belief and preferences over items to learn an optimal question-asking strategy by using Generalized Binary Search, so as to ask a sequence of questions to the user. Our experimental results demonstrate that our proposed matrix factorization model outperforms the traditional Probabilistic Matrix Factorization model. Further, our proposed Qrec model can greatly improve the performance of state-of-the-art baselines, and it is also effective in the case of cold-start user and item recommendations.

preprint, 2016, arXiv

Content-Based Top-N Recommendation using Heterogeneous Relations

Top-$N$ recommender systems have been extensively studied. However, the sparsity of user-item activities has not been well resolved. While many hybrid systems were proposed to address the cold-start problem, the profile information has not been sufficiently leveraged. Furthermore, the heterogeneity of profiles between users and items intensifies the challenge. In this paper, we propose a content-based top-$N$ recommender system by learning the global term weights in profiles. To achieve this, we bring in PathSim, which measures node similarity well under heterogeneous relations (between users and items). Starting from the original TF-IDF value, the global term weights gradually converge, and eventually reflect both profile and activity information. To facilitate training, the derivative is reformulated into matrix form, which can easily be parallelized. We conduct extensive experiments, which demonstrate the superiority of the proposed method.

preprint, 2016, arXiv

Dynamic Virtual Machine Management via Approximate Markov Decision Process

Efficient virtual machine (VM) management can dramatically reduce energy consumption in data centers. Existing VM management algorithms fall into two categories based on whether the VMs' resource demands are assumed to be static or dynamic. The former category fails to maximize resource utilization because it cannot adapt to the dynamic nature of VMs' resource demands. Most approaches in the latter category are heuristic and lack theoretical performance guarantees. In this work, we formulate dynamic VM management as a large-scale Markov Decision Process (MDP) problem and derive an optimal solution. Our analysis of real-world data traces supports our choice of the modeling approach. However, solving the large-scale MDP problem suffers from the curse of dimensionality. Therefore, we further exploit the special structure of the problem and propose an approximate MDP-based dynamic VM management method, called MadVM. We prove the convergence of MadVM and analyze the bound of its approximation error. Moreover, MadVM can be implemented in a distributed system, which should suit the needs of real data centers. Extensive simulations based on two real-world workload traces show that MadVM achieves significant performance gains over two existing baseline approaches in power consumption, resource shortage and the number of VM migrations. Specifically, the more intensely the resource demands fluctuate, the more MadVM outperforms.

preprint, 2015, arXiv

Single-Step Tunable Group Delay Phaser for Real-Time Spectrum Sniffing

This paper presents a single-step tunable group delay phaser for spectrum sniffing. This device may be seen as a "time filter", where frequencies are suppressed by time separation rather than by spectral attenuation. Compared to its multiple-step counterpart, this phaser features higher processing resolution, greater simplicity, lower loss and better channel equalization, due to the smaller and channel-independent group delay swing. A three-channel example is provided for illustration.

preprint, 2014, arXiv

Audio Splicing Detection and Localization Using Environmental Signature

Audio splicing is one of the most common manipulation techniques in the area of audio forensics. In this paper, the magnitudes of acoustic channel impulse response and ambient noise are proposed as the environmental signature. Specifically, the spliced audio segments are detected according to the magnitude correlation between the query frames and reference frames via a statically optimal threshold. The detection accuracy is further refined by comparing the adjacent frames. The effectiveness of the proposed method is tested on two data sets. One is generated from the TIMIT database, and the other is recorded in four acoustic environments using commercial-grade microphones. Experimental results show that the proposed method not only detects the presence of spliced frames, but also localizes the forgery segments with near-perfect accuracy. Comparison results illustrate that the identification accuracy of the proposed scheme is higher than that of previous schemes. In addition, experimental results show that the proposed scheme is robust to the MP3 compression attack, in which respect it is also superior to previous works.

preprint, 2014, arXiv

Commuting involutions on surfaces of general type with p_g=0 and K^2=7

The aim of this article is to classify the pairs (S, G), where S is a smooth minimal surface of general type with p_g=0 and K^2=7, G is a subgroup of the automorphism group of S and G is isomorphic to the group $\mathbb{Z}_2^2$. The Inoue surfaces with K^2=7, which are finite Galois $\mathbb{Z}_2^2$-covers of the 4-nodal cubic surface, are the first examples of such pairs. More recently, the author constructed a new family of such pairs. They are finite Galois $\mathbb{Z}_2^2$-covers of certain 6-nodal Del Pezzo surfaces of degree one. We prove that the base of the Kuranishi family of deformations of a surface in this family is smooth. We show that, in the Gieseker moduli space of canonical models of surfaces of general type, the subset corresponding to the surfaces in this family is an irreducible connected component, normal, unirational of dimension 3.

preprint, 2014, arXiv

Compact Reflection-Type Phaser Using Quarter-Wavelength Transmission Line Resonators

A compact reflection-type phaser composed of quarter-wavelength transmission line resonators interconnected by alternating K- and J-inverters is proposed. A design method is also presented. To validate this method, a 4th-order example is designed and fabricated. The proposed phaser is shown to exhibit the benefits of smaller size, easier fabrication and suppressed even-order harmonics compared with previously reported half-wavelength phasers.

preprint, 2014, arXiv

Notes on automorphisms of surfaces of general type with $p_g=0$ and $K^2=7$

Let $S$ be a smooth minimal complex surface of general type with $p_g=0$ and $K^2=7$. We prove that any involution on $S$ is in the center of the automorphism group of $S$. As an application, we show that the automorphism group of an Inoue surface with $K^2=7$ is isomorphic to $\mathbb{Z}_2^2$ or $\mathbb{Z}_2 \times \mathbb{Z}_4$. We construct a $2$-dimensional family of Inoue surfaces with automorphism groups isomorphic to $\mathbb{Z}_2 \times \mathbb{Z}_4$.

preprint, 2012, arXiv

A New Family of Surfaces with K^2=7 and p_g=0

We construct a new family of minimal smooth surfaces of general type with K^2=7 and p_g=0. We show that for a surface in this family, its canonical divisor is ample and its bicanonical morphism is birational. We prove that these surfaces satisfy Bloch's conjecture.

preprint, 2010, arXiv

Double-Directional Information Azimuth Spectrum and Relay Network Tomography for a Decentralized Wireless Relay Network

A novel channel representation for a two-hop decentralized wireless relay network (DWRN) is proposed, where the relays operate in a completely distributive fashion. The modeling paradigm applies an analogous approach to the description method for a double-directional multipath propagation channel, and takes into account the finite system spatial resolution and the extended relay listening/transmitting time. Specifically, the double-directional information azimuth spectrum (IAS) is formulated to provide a compact representation of information flows in a DWRN. The proposed channel representation is then analyzed from a geometrically-based statistical modeling perspective. Finally, we look into the problem of relay network tomography (RNT), which solves an inverse problem to infer the internal structure of a DWRN by using the instantaneous double-directional IAS recorded at multiple measuring nodes exterior to the relay region.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Source provenance

Where this author record came from

arxiv, confidence 95%

external id: arxiv:2505.02987:author:1:yifan-chen

Imported May 21, 2026. Synced May 21, 2026.

arxiv, confidence 95%

external id: arxiv:2605.10455:author:2:yifan-chen

Imported May 20, 2026. Synced May 21, 2026.

arxiv, confidence 95%

external id: arxiv:2605.04478:author:11:yifan-chen

Imported May 20, 2026. Synced May 20, 2026.

arxiv, confidence 95%

external id: arxiv:2605.08817:author:3:yifan-chen

Imported May 20, 2026. Synced May 20, 2026.

arxiv, confidence 95%

external id: arxiv:2605.12703:author:1:yifan-chen

Imported May 20, 2026. Synced May 20, 2026.

arxiv, confidence 95%

external id: arxiv:2605.14337:author:1:yifan-chen

Imported May 20, 2026. Synced May 20, 2026.

5 works

Jing Shu

Researcher

3 works

Gaofeng Cheng

Researcher

3 works

Pengyuan Zhang

Researcher

3 works

Rui Wang

Researcher

39 published items

Accumulation of Sub-Sampling Matrices with Applications to Statistical Computation

AxiomOcean: Forecasting the Three-Dimensional Structure of the Upper Ocean

Cavity Multimodes as an Array for High-Frequency Gravitational Waves

CCL-D: A High-Precision Diagnostic System for Slow and Hang Anomalies in Large-Scale Model Training

How You Begin is How You Reason: Driving Exploration in RLVR via Prefix-Tuned Priors

IG-Diff: Complex Night Scene Restoration with Illumination-Guided Diffusion Model

MMCL-Bench: Multimodal Context Learning from Visual Rules, Procedures, and Evidence

New affine invariant ensemble samplers and their dimensional scaling

A FAIR and AI-ready Higgs boson decay dataset

Axion Haloscope Array With $\mathcal{PT}$ Symmetry

Chiral Ball and Its Omnidirectional Circularly-Polarized Radiation

Combining deep learning and crowdsourcing geo-images to predict housing quality in rural China

Contact tracing Inspired Efficient Computation by Energy Tracing

Discovery of post-mass-transfer helium-burning red giants using asteroseismology

Dissecting axion and dark photon with a network of vector sensors

Interrelate Training and Searching: A Unified Online Clustering Framework for Speaker Diarization

Microwave Chirality Imaging for the Early Diagnosis of Neurological Degenerative Diseases

Open Source MagicData-RAMC: A Rich Annotated Mandarin Conversational(RAMC) Speech Dataset

Some new results about $q$-trinomial coefficients

Spatio-temporal Gait Feature with Global Distance Alignment

Stringent axion constraints with Event Horizon Telescope polarimetric measurements of M87$^\star$

Superradiant evolution of the shadow and photon ring of Sgr A$^\star$

The Conversational Short-phrase Speaker Diarization (CSSD) Task: Dataset, Evaluation Metric and Baselines

Fast Statistical Leverage Score Approximation in Kernel Ridge Regression

Deep Attention Aware Feature Learning for Person Re-Identification

Function Approximation via The Subsampled Poincaré Inequality

Logarithmic loop corrections, moduli stabilisation and de Sitter vacua in string theory

One-to-one Mapping for Unpaired Image-to-image Translation

Probing Axions with Event Horizon Telescope Polarimetric Measurements

Towards Question-based Recommender Systems

Content-Based Top-N Recommendation using Heterogeneous Relations

Dynamic Virtual Machine Management via Approximate Markov Decision Process

Single-Step Tunable Group Delay Phaser for Real-Time Spectrum Sniffing

Audio Splicing Detection and Localization Using Environmental Signature

Commuting involutions on surfaces of general type with p_g=0 and K^2=7

Compact Reflection-Type Phaser Using Quarter-Wavelength Transmission Line Resonators

Notes on automorphisms of surfaces of general type with $p_g=0$ and $K^2=7$

A New Family of Surfaces with K^2=7 and p_g=0

Double-Directional Information Azimuth Spectrum and Relay Network Tomography for a Decentralized Wireless Relay Network