Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
96works
0followers
44topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

96 published item(s)

preprint2026arXiv

An Enigmatic PeVatron in an Area around HII Region G35.6$-$0.5

Identifying Galactic PeVatrons (PeV particle accelerators) from the ultra-high-energy (UHE, >100 TeV) $γ$-ray sources plays a crucial role in revealing the origin of Galactic cosmic rays. The UHE source 1LHAASO J1857+0203u is suggested to be associated with HESS J1858+020, which may be attributed to the possible PeVatron candidate supernova remnant (SNR) G35.6$-$0.4 or HII region G35.6$-$0.5. We perform detailed analysis on the very-high-energy and UHE $γ$-ray emissions towards this region with data from the Large High Altitude Air Shower Observatory (LHAASO). 1LHAASO J1857+0203u is detected with a significance of 11.6$σ$ above 100 TeV, indicating the presence of a PeVatron. It has an extension of $\sim 0.18^\circ$ with a power-law (PL) spectral index of $\sim$2.5 in 1-25 TeV and a point-like emission with a PL spectral index of $\sim$3.2 above 25 TeV. Using the archival CO and HI data, we identify some molecular and atomic clouds that may be associated with the TeV $γ$-ray emissions. Our modelling indicates that the TeV $γ$-ray emissions are unlikely to arise from the clouds illuminated by the protons that escaped from SNR G35.6$-$0.4. In the scenario that HII region G35.6$-$0.5 could accelerate particles to the UHE band, the observed GeV-TeV $γ$-ray emission could be well explained by a hadronic model with a PL spectral index of $\sim$2.0 and cutoff energy of $\sim$450 TeV. However, an evolved pulsar wind nebula origin cannot be ruled out.

preprint2026arXiv

An Ultrahigh-energy $γ$-ray Bubble Powered by a Super PeVatron

We report the detection of a $γ$-ray bubble spanning at least 100$\rm deg^2$ in ultra high energy (UHE) up to a few PeV in the direction of the star-forming region Cygnus X, implying the presence Super PeVatron(s) accelerating protons to at least 10 PeV. A log-parabola form with the photon index $Γ(E) = (2.71 \pm 0.02) + (0.11 \pm 0.02) \times \log_{10} (E/10 \ {\rm TeV})$ is found fitting the gamma-ray energy spectrum of the bubble well. UHE sources, `hot spots' correlated with very massive molecular clouds, and a quasi-spherical amorphous $γ$-ray emitter with a sharp central brightening are observed in the bubble. In the core of $\sim 0.5^{\circ}$, spatially associating with a region containing massive OB association (Cygnus OB2) and a microquasar (Cygnus X-3), as well as previously reported multi-TeV sources, an enhanced concentration of UHE $γ$-rays are observed with 2 photons at energies above 1 PeV. The general feature of the bubble, the morphology and the energy spectrum, are reasonably reproduced by the assumption of a particle accelerator in the core, continuously injecting protons into the ambient medium.

preprint2026arXiv

AnyMS: Bottom-up Attention Decoupling for Layout-guided and Training-free Multi-subject Customization

Multi-subject customization aims to synthesize multiple user-specified subjects into a coherent image. To address issues such as subjects missing or conflicts, recent works incorporate layout guidance to provide explicit spatial constraints. However, existing methods still struggle to balance three critical objectives: text alignment, subject identity preservation, and layout control, while the reliance on additional training further limits their scalability and efficiency. In this paper, we present AnyMS, a novel training-free framework for layout-guided multi-subject customization. AnyMS leverages three input conditions: text prompt, subject images, and layout constraints, and introduces a bottom-up dual-level attention decoupling mechanism to harmonize their integration during generation. Specifically, global decoupling separates cross-attention between textual and visual conditions to ensure text alignment. Local decoupling confines each subject's attention to its designated area, which prevents subject conflicts and thus guarantees identity preservation and layout control. Moreover, AnyMS employs pre-trained image adapters to extract subject-specific features aligned with the diffusion model, removing the need for subject learning or adapter tuning. Extensive experiments demonstrate that AnyMS achieves state-of-the-art performance, supporting complex compositions and scaling to a larger number of subjects.

preprint2026arXiv

Capability-Aware Early-Stage Research Idea Evaluation

Predicting the outcomes of research ideas at their conceptual stage (i.e. before significant resources are committed) holds great potential for optimizing scientific resource allocation and research planning. While existing methods rely heavily on finished manuscripts or peer reviews, we propose a novel capability-aware framework that predicts paper acceptance and ratings using only author information and research ideas, without requiring full text or experimental results. Our approach integrates author information, (inferred) capability presentation, and research ideas through a three-way transformer architecture with flexible fusion mechanisms. We also introduce a two-stage architecture for learning the capability representation given the author information and idea. Experiments show that our method significantly outperform the single-way models by finetuning bert-base and bert-large, and the capability predicting significantly increase the predictive accuracy of the final model. The proposed method can be applied in both early-stage research outcome prediction and scientific resource allocation.

preprint2026arXiv

Constraints on heavy decaying dark matter from 570 days of LHAASO observations

The Kilometer Square Array~(KM2A) of the Large High Altitude Air Shower Observatory (LHAASO) aims at surveying the northern gamma-ray sky at energies above 10 TeV with unprecedented sensitivity. Gamma-ray observations have long been one of the most powerful tools for dark matter searches, as e.g., high-energy gamma-rays could be produced by the decays of heavy dark matter particles. In this letter, we present the first dark matter analysis with LHAASO-KM2A, using the first 340~days of data from 1/2-KM2A and 230~days of data from 3/4-KM2A. Several regions of interest are used to search for a signal and account for the residual cosmic-ray background after gamma/hadron separation. We find no excess of dark matter signals, and thus place some of the strongest gamma-ray constraints on the lifetime of heavy dark matter particles with mass between 10^5 and 10^9~GeV. Our results with LHAASO are robust, and have important implications for dark matter interpretations of the diffuse astrophysical high-energy neutrino emission.

preprint2026arXiv

Direct Product Flow Matching: Decoupling Radial and Angular Dynamics for Few-Shot Adaptation

Recent flow matching (FM) methods improve the few-shot adaptation of vision-language models, by modeling cross-modal alignment as a continuous multi-step flow. In this paper, we argue that existing FM methods are inherently constrained by incompatible geometric priors on pre-trained cross-modal features, resulting in suboptimal adaptation performance. We first analyze these methods from a polar decomposition perspective (i.e., radial and angular sub-manifolds). Under this new geometric view, we identify three overlooked limitations in them: 1) Angular dynamics distortion: The radial-angular coupling induces non-uniform speed on the angular sub-manifold, leading to regression training difficulty and extra truncation errors. 2) Radial dynamics neglect: Feature normalization discards modality confidence, failing to distinguish out-of-distribution and in-distribution data, and abandoning crucial radial dynamics. 3) Context-agnostic unconditional flow: Dataset-specific information loss during pre-trained cross-modal feature extraction remains unrecovered. To resolve these issues, we propose warped product flow matching (WP-FM), a unified Riemannian framework that reformulates alignment on a warped product manifold. Within this framework, we derive direct product flow matching (DP-FM) by introducing a constant-warping metric, which yields a decoupled cylindrical manifold (i.e., direct product manifold). DP-FM enables independent radial evolution and constant-speed angular geodesic transport, effectively eliminating angular dynamics distortion while preserving radial consistency. Meanwhile, we incorporate classifier-free guidance by conditioning the flow on the pre-trained VLMs' hidden states to inject missing dataset-specific information. Extensive results across 11 benchmarks have demonstrated that DP-FM achieves a new state-of-the-art for multi-step few-shot adaptation.

preprint2026arXiv

Discovery of a new $γ$-ray source LHAASO J0341+5258 with emission up to 200TeV

We report the discovery of a new unidentified extended $γ$-ray source in the Galactic plane named LHAASO J0341+5258 with a pre-trial significance of 8.2 standard deviations above 25 TeV. The best fit position is R.A.$=55.34^{\circ}\pm0.11^{\circ}$ and Dec$=52.97^{\circ}\pm0.07^{\circ}$. The angular size of LHAASO J0341+5258 is $0.29^\circ \pm 0.06^\circ_{stat} \pm0.02^\circ_{sys}$. The flux above 25 TeV is about $20\%$ of the flux of Crab Nebula. Although a power-law fit of the spectrum from 10 TeV to 200 TeV with the photon index $α=2.98 \pm 0.19_{stat} \pm 0.02_{sys}$ is not excluded, the LHAASO data together with the flux upper limit at 10 GeV set by the Fermi LAT observation, indicate a noticeable steepening of an initially hard power-law spectrum %($α\leq 1.75$) spectrum with a cutoff at $\approx 50$ TeV. We briefly discuss the origin of UHE gamma-rays. The lack of an energetic pulsar and a young SNR inside or in the vicinity of LHAASO J0341+5258 challenge, but do not exclude both the leptonic and hadronic scenarios of gamma-ray production.

preprint2026arXiv

Discovery of the Ultra-high energy gamma-ray source LHAASO J2108+5157

We report the discovery of a UHE gamma-ray source, LHAASO J2108+5157, by analyzing the LHAASO-KM2A data of 308.33 live days. Significant excess of gamma-ray induced showers is observed in both energy bands of 25-100 TeV and $\gt$100 TeV with 9.5 sigma and 8.5 sigma, respectively. This source is not significantly favored as an extensive source with the angular extension smaller than the point-spread function of KM2A. The measured energy spectrum from 20 to 200 TeV can be approximately described by a power-law function with an index of -2.83$\pm$ 0.18stat. A harder spectrum is demanded at lower energies considering the flux upper limit set by Fermi-LAT observations. The position of the gamma-ray emission is correlated with a giant molecular cloud, which favors a hadronic origin. No obvious counterparts have been found, deeper multiwavelength observations will help to shed new light on this intriguing UHE source.

preprint2026arXiv

Disentangling Intent from Role: Adversarial Self-Play for Persona-Invariant Safety Alignment

The growing capabilities of large language models (LLMs) have driven their widespread deployment across diverse domains, even in potentially high-risk scenarios. Despite advances in safety alignment techniques, current models remain vulnerable to emerging persona-based jailbreak attacks. Existing research on persona-based jailbreak has primarily focused on attack iterations, yet it lacks systemic and mechanistic constraints on the defense side. To address this challenge, we propose Persona-Invariant Alignment (PIA), an adversarial self-play framework that achieves co-evolution through Persona Lineage Evolution (PLE) on the attack side and Persona-Invariant Consistency Learning (PICL) on the defense side. Theoretically, PICL is grounded in the structural separation hypothesis, using a unilateral KL-divergence constraint to enable the structural decoupling of safety decisions from persona context, thereby maintaining safe behavior under persona-based jailbreak attacks. Experimental results demonstrate that PLE efficiently explores high-risk persona spaces by leveraging lineage-based credit propagation. Meanwhile, the PICL defense method significantly reduces the Attack Success Rate (ASR) while preserving the model's general capability, thereby validating the superiority and robustness of this alignment paradigm. Codes are available at https://github.com/JiajiaLi-1130/PIA.

preprint2026arXiv

DVMap: Fine-Grained Pluralistic Value Alignment via High-Consensus Demographic-Value Mapping

Current Large Language Models (LLMs) typically rely on coarse-grained national labels for pluralistic value alignment. However, such macro-level supervision often obscures intra-country value heterogeneity, yielding a loose alignment. We argue that resolving this limitation requires shifting from national labels to multi-dimensional demographic constraints, which can identify groups with predictable, high-consensus value preference. To this end, we propose DVMap (High-Consensus Demographic-Value Mapping), a framework for fine-grained pluralistic value alignment. In this framework, we first present a demographic archetype extraction strategy to construct a high-quality value alignment corpus of 56,152 samples from the World Values Survey (WVS) by strictly retaining respondents with consistent value preferences under identical demographics. Over this corpus, we introduce a Structured Chain-of-Thought (CoT) mechanism that explicitly guides LLMs to reason about demographic-value correlations. Subsequently, we employ Group Relative Policy Optimization (GRPO) to achieve adaptive anchoring of value distributions. To rigorously evaluate generalization, we further establish a triple-generalization benchmark (spanning cross-demographic, cross-country, and cross-value) comprising 21,553 samples. Experimental results demonstrate that DVMap effectively learns the manifold mapping from demographics to values, exhibiting strong generalization and robustness. On cross-demographic tests, Qwen3-8B-DVMap achieves 48.6% accuracy, surpassing the advanced open-source LLM DeepSeek-v3.2 (45.1%). The source code and dataset are available at https://github.com/EnlightenedAI/DVMap.

preprint2026arXiv

Evidence for particle acceleration approaching PeV energies in the W51 complex

The $γ$-ray emission from the W51 complex is widely acknowledged to be attributed to the interaction between the cosmic rays (CRs) accelerated by the shock of supernova remnant (SNR) W51C and the dense molecular clouds in the adjacent star-forming region, W51B. However, the maximum acceleration capability of W51C for CRs remains elusive. Based on observations conducted with the Large High Altitude Air Shower Observatory (LHAASO), we report a significant detection of $γ$ rays emanating from the W51 complex, with energies from 2 TeV to 200 TeV. The LHAASO measurements, for the first time, extend the $γ$-ray emission from the W51 complex beyond 100 TeV and reveal a significant spectrum bending at tens of TeV. By combining the ``$π^0$-decay bump" featured data from Fermi-LAT, the broadband $γ$-ray spectrum of the W51 region can be well-characterized by a simple pp-collision model. The observed spectral bending feature suggests an exponential cutoff at $\sim400$~TeV or a power-law break at $\sim200$~TeV in the CR proton spectrum, most likely providing the first evidence of SNRs serving as CR accelerators approaching the PeV regime. Additionally, two young star clusters within W51B could also be theoretically viable to produce the most energetic $γ$ rays observed by LHAASO. Our findings strongly support the presence of extreme CR accelerators within the W51 complex and provide new insights into the origin of Galactic CRs.

preprint2026arXiv

Exploring Lorentz Invariance Violation from Ultra-high-energy Gamma Rays Observed by LHAASO

Recently the LHAASO Collaboration published the detection of 12 ultra-high-energy gamma-ray sources above 100 TeV, with the highest energy photon reaching 1.4 PeV. The first detection of PeV gamma rays from astrophysical sources may provide a very sensitive probe of the effect of the Lorentz invariance violation (LIV), which results in decay of high-energy gamma rays in the superluminal scenario and hence a sharp cutoff of the energy spectrum. Two highest energy sources are studied in this work. No signature of the existence of LIV is found in their energy spectra, and the lower limits on the LIV energy scale are derived. Our results show that the first-order LIV energy scale should be higher than about 10^5 times the Planck scale M_{pl} and that the second-order LIV scale is >10^{-3}M_{pl}. Both limits improve by at least one order of magnitude the previous results.

preprint2026arXiv

Exposing and Mitigating Temporal Attack in Deepfake Video Detection

While spatiotemporal deepfake detectors achieve high AUC, our experiments reveal their susceptibility to evasion attacks. These models tend to overfit on fragile temporal spectrum cues, rather than learning robust semantic causality. To mitigate this vulnerability, we propose SpInShield, a temporal spectral-invariant defense framework explicitly designed to decouple semantic motion from manipulatable spectral artifacts. We propose a learnable spectral adversary that dynamically synthesizes severe spectral deformations, simulating extreme attack scenarios. By employing a shortcut suppression optimization strategy, SpInShield compels the encoder to extract reliable forensic cues while purging unstable spectral statistics from the latent space. Experiments show that SpInShield obtains competitive performance on widely used datasets and outperforms the strongest baseline by 21.30 percentage points in AUC under simulated amplitude spectral attacks.

preprint2026arXiv

Fine-Grained Generalization via Structuralizing Concept and Feature Space into Commonality, Specificity and Confounding

Fine-Grained Domain Generalization (FGDG) presents greater challenges than conventional domain generalization due to the subtle inter-class differences and relatively pronounced intra-class variations inherent in fine-grained recognition tasks. Under domain shifts, the model becomes overly sensitive to fine-grained cues, leading to the suppression of critical features and a significant drop in performance. Cognitive studies suggest that humans classify objects by leveraging both common and specific attributes, enabling accurate differentiation between fine-grained categories. However, current deep learning models have yet to incorporate this mechanism effectively. Inspired by this mechanism, we propose Concept-Feature Structuralized Generalization (CFSG). This model explicitly disentangles both the concept and feature spaces into three structured components: common, specific, and confounding segments. To mitigate the adverse effects of varying degrees of distribution shift, we introduce an adaptive mechanism that dynamically adjusts the proportions of common, specific, and confounding components. In the final prediction, explicit weights are assigned to each pair of components. Extensive experiments on three single-source benchmark datasets demonstrate that CFSG achieves an average performance improvement of 9.87% over baseline models and outperforms existing state-of-the-art methods by an average of 3.08%. Additionally, explainability analysis validates that CFSG effectively integrates multi-granularity structured knowledge and confirms that feature structuralization facilitates the emergence of concept structuralization.

preprint2026arXiv

FlashEvolve: Accelerating Agent Self-Evolution with Asynchronous Stage Orchestration

LLM-based evolution has emerged as a promising way to improve agents by refining non-parametric artifacts, but its wall-clock cost remains a major bottleneck. We identify that this cost comes from synchronized stage execution and imbalance inside each LLM-heavy stage. We present FlashEvolve, an efficient framework that replaces synchronized execution with asynchronous workers and queues, allowing different stages and steps to overlap. To handle data staleness introduced by asynchrony, FlashEvolve tracks artifact versions and applies different policies to update, discard, or patch stale artifacts. Unlike weight-space staleness in asynchronous RL, language-space staleness is inspectable and repairable: a stale artifact is not just delayed work, but readable evidence that the LLM can reflect on, revise, and turn into useful evolution signal. FlashEvolve further improves throughput and token efficiency with speculative stage completion and adaptive workflow control. On GEPA workloads, FlashEvolve improves proposal throughput by $3.5\times$ on local vLLM and $4.9\times$ on API serving over synchronous GEPA. The same design also applies to ACE and Meta-Harness.

preprint2026arXiv

Inverse Knowledge Search over Verifiable Reasoning: Synthesizing a Scientific Encyclopedia from a Long Chains-of-Thought Knowledge Base

Most scientific materials compress reasoning, presenting conclusions while omitting the derivational chains that justify them. This compression hinders verification by lacking explicit, step-wise justifications and inhibits cross-domain links by collapsing the very pathways that establish the logical and causal connections between concepts. We introduce a scalable framework that decompresses scientific reasoning, constructing a verifiable Long Chain-of-Thought (LCoT) knowledge base and projecting it into an emergent encyclopedia, SciencePedia. Our pipeline operationalizes an endpoint-driven, reductionist strategy: a Socratic agent, guided by a curriculum of around 200 courses, generates approximately 3 million first-principles questions. To ensure high fidelity, multiple independent solver models generate LCoTs, which are then rigorously filtered by prompt sanitization and cross-model answer consensus, retaining only those with verifiable endpoints. This verified corpus powers the Brainstorm Search Engine, which performs inverse knowledge search -- retrieving diverse, first-principles derivations that culminate in a target concept. This engine, in turn, feeds the Plato synthesizer, which narrates these verified chains into coherent articles. The initial SciencePedia comprises approximately 200,000 fine-grained entries spanning mathematics, physics, chemistry, biology, engineering, and computation. In evaluations across six disciplines, Plato-synthesized articles (conditioned on retrieved LCoTs) exhibit substantially higher knowledge-point density and significantly lower factual error rates than an equally-prompted baseline without retrieval (as judged by an external LLM). Built on this verifiable LCoT knowledge base, this reasoning-centric approach enables trustworthy, cross-domain scientific synthesis at scale and establishes the foundation for an ever-expanding encyclopedia.

preprint2026arXiv

Leveraging Latent Visual Reasoning in Silence

Latent visual reasoning involves visual evidence more directly in multimodal reasoning by inserting continuous latent tokens before textual generation. However, the necessity of these latent tokens at inference remains ambiguous. We show that replacing latent tokens with random noise or removing them completely causes little performance degradation across spatial reasoning benchmarks. Reinforcement learning further diminishes the latent generation behavior after post-training. These observations raise a central question: Is latent visual reasoning still meaningful? We argue that its value should be measured by how effectively latent tokens guide learning, rather than whether they persist as an inference-time format. Our analysis shows that latent reasoning is unevenly favorable across question types, yet hard task-level routing for applying latent generation is brittle. Motivated by these findings, we propose an attention-based reward that encourages generated latent tokens to interact with later text tokens during RL. This reward promotes latent utilization when the latent mode is activated while preserving the flexibility to use pure-text reasoning. Experiments show that our method improves performance across perception and visual reasoning benchmarks, even when latent tokens are rarely generated after post-training. Our results highlight that, without explicit expression at inference, latent visual reasoning can shape better visual grounding and more accurate textual reasoning in silence. Our code and trained models are publicly available at \href{https://github.com/ddydyd32/silent-lvr/tree/master}{GitHub} and \href{https://huggingface.co/collections/cornuHGF/silent-lvr}{Hugging Face}.

preprint2026arXiv

LLMRouterBench: A Massive Benchmark and Unified Framework for LLM Routing

Large language model (LLM) routing assigns each query to the most suitable model from an ensemble. We introduce LLMRouterBench, a large-scale benchmark and unified framework for LLM routing. It comprises over 400K instances from 21 datasets and 33 models. Moreover, it provides comprehensive metrics for both performance-oriented routing and performance-cost trade-off routing, and integrates 10 representative routing baselines. Using LLMRouterBench, we systematically re-evaluate the field. While confirming strong model complementarity-the central premise of LLM routing-we find that many routing methods exhibit similar performance under unified evaluation, and several recent approaches, including commercial routers, fail to reliably outperform a simple baseline. Meanwhile, a substantial gap remains to the Oracle, driven primarily by persistent model-recall failures. We further show that backbone embedding models have limited impact, that larger ensembles exhibit diminishing returns compared to careful model curation, and that the benchmark also enables latency-aware analysis. All code and data are available at https://github.com/ynulihao/LLMRouterBench.

preprint2026arXiv

Misclassification Rate and Privacy-Utility Trade-offs in Graph Convolutional Networks via Subsampling Stability

We study differential privacy (DP) in Graph Convolutional Networks (GCNs) through the framework of \textit{subsampling stability}. We derive upper bounds on the misclassification rate that depend explicitly on the subsampling probability $p_s$. Furthermore, we characterize the \textit{privacy--utility trade-off} by identifying feasible ranges of $p_s$; if $p_s$ is too large, the stability-based privacy condition becomes difficult to satisfy, yielding vacuous guarantees, whereas if it is too small, accuracy deteriorates. Our results provide the first rigorous theoretical framework for understanding subsampling stability in GCNs under DP.

preprint2026arXiv

PanoWorld: Towards Spatial Supersensing in 360$^\circ$ Panorama World

Multimodal large laboratory models (MLLMs) still struggle with spatial understanding under the dominant perspective-image paradigm, which inherits the narrow field of view of human-like perception. For navigation, robotic search, and 3D scene understanding, 360-degree panoramic sensing offers a form of supersensing by capturing the entire surrounding environment at once. However, existing MLLM pipelines typically decompose panoramas into multiple perspective views, leaving the spherical structure of equirectangular projection (ERP) largely implicit. In this paper, we study pano-native understanding, which requires an MLLM to reason over an ERP panorama as a continuous, observer-centered space. To this end, we first define the key abilities for pano-native understanding, including semantic anchoring, spherical localization, reference-frame transformation, and depth-aware 3D spatial reasoning. We then build a large-scale metadata construction pipeline that converts mixed-source ERP panoramas into geometry-aware, language-grounded, and depth-aware supervision, and instantiate these signals as capability-aligned instruction tuning data. On the model side, we introduce PanoWorld with Spherical Spatial Cross-Attention, which injects spherical geometry into the visual stream. We further construct PanoSpace-Bench, a diagnostic benchmark for evaluating ERP-native spatial reasoning. Experiments show that PanoWorld substantially outperforms both proprietary and open-source baselines on PanoSpace-Bench, H* Bench, and R2R-CE Val-Unseen benchmarks. These results demonstrate that robust panoramic reasoning requires dedicated pano-native supervision and geometry-aware model adaptation. All source code and proposed data will be publicly released.

preprint2026arXiv

Provable Fairness Repair for Deep Neural Networks

Deep neural networks (DNNs) are suffering from ethical issues such as individual discrimination. In response, extensive NN repair techniques have been developed to adjust models and mitigate such undesired behaviors. However, existing fairness repair methods are typically data-centric, which often lack provable guarantees and generalization to unseen samples. To overcome these limitations, we propose ProF, a novel fairness repair framework with provable guarantees. The key intuition of ProF is to leverage interval bound propagation (a widely used NN verification technique) to soundly capture model outputs over the whole set $S(\mathbf{x})$ around a biased sample $\mathbf{x}$. The derived bounds are utilized to guide fairness repair which encourages the model to produce consistent outputs on $S(\mathbf{x})$. Specifically, we integrate fairness constraints and model modifications into a unified constraint-solving formulation, which can be transformed to a Mixed-Integer Linear Programming (MILP) problem solvable by off-the-shelf solvers. The solution to the MILP problem effectively induces a repaired model with guaranteed fairness over the whole set $S(\mathbf{x})$. We evaluate ProF on four widely used benchmark datasets and demonstrate that it achieves provable fairness repair, with generalization of up to 95.93\% on full datasets and 93.16\% on the entire input space. Notably, ProF can be easily configured to support multiple sensitive attributes and more practical fairness definitions, while providing provable repair guarantees and delivering around 90\% fairness improvement. Our code is available at https://github.com/nninjn/ProF.

preprint2026arXiv

RealCam: Real-Time Novel-View Video Generation with Interactive Camera Control

Camera-controlled video-to-video (V2V) generation enables dynamic viewpoint synthesis from monocular footage, holding immense potential for interactive filmmaking and live broadcasting. However, existing implicit synthesis methods fundamentally rely on non-causal, full-sequence processing and rigid prefix-style temporal concatenation. This architectural paradigm mandates bidirectional attention, resulting in prohibitive computational latency, quadratic complexity scaling, and inherent incompatibility with real-time streaming or variable-length inputs. To overcome these limitations, we introduce \texttt{RealCam}, a novel autoregressive framework for interactive, real-time camera-controlled V2V generation. We first design a high-fidelity teacher model grounded in a \textbf{Cross-frame In-context Learning} paradigm. By interleaving source and target frames into synchronized contextual pairs, our design inherently enables length-agnostic generalization and naturally facilitates causal adaptation, breaking the rigid prefix bottleneck. We then distill this teacher into a few-step causal student via Self-Forcing with Distribution Matching Distillation, enabling efficient, on-the-fly streaming synthesis. Furthermore, to mitigate severe loop inconsistency in closed-loop trajectories, we propose \textbf{Loop-Closed Data Augmentation (LoopAug)}, a novel paradigm that synthesizes globally consistent loop sequences from existing multiview datasets. Extensive experiments demonstrate that \texttt{RealCam} achieves state-of-the-art visual fidelity and temporal consistency while enabling truly interactive camera control with orders-of-magnitude faster inference than existing paradigms. Our project page is at https://xyc-fly.github.io/RealCam/.

preprint2026arXiv

Sub-doppler trace-gas photoacoustic spectroscopy

Molecules are emerging as new benchmark for metrology and fundamental physics research, driving the demand for spectroscopic techniques combining high sensitivity and resolution. Photoacoustic spectroscopy has proven to combine high sensitivity with appealing features like compactness, wavelength-independent and background-free detection. To date, photoacoustic sensing has mostly been focused on high-pressure applied trace-gas analysis, while accessing the low-pressure regime has been considered not compatible with efficient acoustic wave propagation. However, sensing gas samples at low pressure is the key to get access to high-resolution spectroscopy. Here, we demonstrate that sub-Doppler saturation spectroscopy can be performed on low-pressure trace gases in a cavity-enhanced photoacoustic sensor with mW-level mid-infrared radiation. Moreover, we show that the same setup can be operated at higher pressure, enabling trace-gas detection with 5 parts-per-billion sensitivity with a laser power as low as 35 microwatts. This allows to extend the unique advantages of the photoacoustic technique to metrology and fundamental physics and provides the mid-infrared with a cost-effective, flexible tool combining high sensitivity and resolution.

preprint2026arXiv

Symmetry-engineered and electrically tunable in-plane anomalous Hall effect in oxide heterostructures

The family of Hall effects has long served as a premier probe of how symmetry, magnetic order, and topology intertwine in solids. Recently, the in-plane anomalous Hall effect (IP-AHE), a transverse Hall response driven by in-plane magnetization, has emerged as a distinct member of this family, offering innovative spintronic functionalities and illuminating intricate interplay between mirror-symmetry breaking and in-plane magnetic order. However, practical routes to deterministically and reversibly control IP-AHE remain limited. Here, we establish a symmetry-engineered IP-AHE platform, CaRuO3/La2/3Ca1/3MnO3/CaRuO3 heterostructure on NdGaO3(110), that turns strict mirror-symmetry breaking constraints into effective tuning knobs. IP-AHE in these epitaxial trilayers unambiguously couples to the CaRuO3-buffer-induced mirror-symmetry breaking and faithfully reproduces the ferromagnetic hysteresis. Ionic liquid gating further enables reversible reconfigurations of the symmetry breaking, thereby achieving electrical modulation and ON/OFF switching of IP-AHE. This highly tunable IP-AHE platform opens pathways for exploring nontrivial magnetic order and developing programmable Hall functionalities in planar geometries.

preprint2026arXiv

WinDeskGround: A Benchmark for Robust GUI Grounding in Complex Multi-Window Desktop Environments

Multimodal Large Language Models (MLLMs) have revolutionized GUI automation, yet their efficacy is largely established on idealized, single-layer interfaces. This paper identifies a critical reliability gap: state-of-the-art agents face distinct robustness challenges in real-world desktop environments characterized by multi-window stacking, occlusion, and visual clutter. To address this, we introduce WinDeskGround, a novel benchmark and synthesis framework tailored for evaluating GUI grounding robustness. Unlike static datasets, our framework parametrically generates complex desktop scenarios by controlling window occlusion, layout density, and semantic similarity, thereby simulating the distribution shifts of authentic workflows. We construct a diverse meta-dataset of 1,356 high-fidelity instruction-target pairs and conduct comprehensive evaluations of five leading MLLMs. Our results demonstrate that while top-tier agents excel in simplified settings, their accuracy declines under partial occlusion. WinDeskGround provides a valuable benchmark to facilitate the assessment and advancement of GUI agent robustness in realistic environments. The code is available at https://github.com/ZZZhr-1/WinDeskGround.

preprint2025arXiv

Impact of helium ion irradiation on the thermal properties of superconducting nanowire single-photon detectors

SNSPDs are indispensable for applications ranging from quantum information processing to deep-space optical communications, owing to their high detection efficiency, low dark counts, and excellent timing resolution. However, further improving the intrinsic detection efficiency (IDE) remains crucial for optimizing SNSPD performance. Ion irradiation has recently emerged as a powerful post-fabrication method to enhance SNSPD characteristics. Here, we studied the effects of He-ion irradiation on the thermal properties of NbN SNSPDs. We systematically examine the evolution of thermal boundary conductance as a function of ion fluence (0-1.1E17 ions/cm2), observing a 57% decrease from 127 to 54 W/m^2K^4 with increasing fluence, followed by saturation at approximately 9E16 ions/cm2. At this fluence, the minimum hotspot relaxation time measurements indicate a 41% increase, rising from 17 to 24 ps,while the electron-phonon interaction time decreases, with the magnitude of change depending on temperature and sample batch.TEM reveals defect formation at the NbN/SiO2 interface (6-8 nm) and He-bubble formation within the SiO2 layer (30-260 nm), contributing to the extended thermal relaxation time. These irradiation-induced modifications play a key role in enhancing the IDE of the treated devices. We further demonstrate a post-irradiation SNSPD showing a saturated IDE plateau at 2000 nm from 2.7 K to 28 mK, enabled by thermal modifications and a weakly wavelength-dependent avalanche-assisted mechanism. Our findings highlight ion irradiation as a valuable tool for thermal tailoring in SNSPDs and advance the understanding of detection physics and defect engineering in superconducting optoelectronics.

preprint2024arXiv

DeWave: Discrete EEG Waves Encoding for Brain Dynamics to Text Translation

The translation of brain dynamics into natural language is pivotal for brain-computer interfaces (BCIs). With the swift advancement of large language models, such as ChatGPT, the need to bridge the gap between the brain and languages becomes increasingly pressing. Current methods, however, require eye-tracking fixations or event markers to segment brain dynamics into word-level features, which can restrict the practical application of these systems. To tackle these issues, we introduce a novel framework, DeWave, that integrates discrete encoding sequences into open-vocabulary EEG-to-text translation tasks. DeWave uses a quantized variational encoder to derive discrete codex encoding and align it with pre-trained language models. This discrete codex representation brings forth two advantages: 1) it realizes translation on raw waves without marker by introducing text-EEG contrastive alignment training, and 2) it alleviates the interference caused by individual differences in EEG waves through an invariant discrete codex with or without markers. Our model surpasses the previous baseline (40.1 and 31.7) by 3.06% and 6.34%, respectively, achieving 41.35 BLEU-1 and 33.71 Rouge-F on the ZuCo Dataset. This work is the first to facilitate the translation of entire EEG signal periods without word-level order markers (e.g., eye fixations), scoring 20.5 BLEU-1 and 29.5 Rouge-1 on the ZuCo Dataset.

preprint2023arXiv

Distributionally Robust Optimization under Mean-Covariance Ambiguity Set and Half-Space Support for Bivariate Problems

In this paper, we study a bivariate distributionally robust optimization problem with mean-covariance ambiguity set and half-space support. Under a conventional type of objective function widely adopted in inventory management, option pricing, and portfolio selection, we obtain closed-form tight bounds of the inner problem in six different cases. Through a primal-dual approach, we identify the optimal distributions in each case. As an application in inventory control, we first derive the optimal order quantity and the corresponding worst-case distribution, extending the existing results in the literature. Moreover, we show that under the distributionally robust setting, a centralized inventory system does not necessarily reduce the optimal total inventory, which contradicts conventional wisdom. Furthermore, we identify two effects, a conventional pooling effect, and a novel shifting effect, the combination of which determines the benefit of incorporating the covariance information in the ambiguity set. Finally, we demonstrate through numerical experiments the importance of keeping the covariance information in the ambiguity set instead of compressing the information into one dimension.

preprint2023arXiv

Improving photon number resolvability of a superconducting nanowire detector array using a level comparator circuit

Photon number resolving (PNR) capability is very important in many optical applications, including quantum information processing, fluorescence detection, and few-photon-level ranging and imaging. Superconducting nanowire single-photon detectors (SNSPDs) with a multipixel interleaved architecture give the array an excellent spatial PNR capability. However, the signal-to-noise ratio (SNR) of the photon number resolution (SNRPNR) of the array will be degraded with increasing the element number due to the electronic noise in the readout circuit, which limits the PNR resolution as well as the maximum PNR number. In this study, a 16-element interleaved SNSPD array was fabricated, and the PNR capability of the array was investigated and analyzed. By introducing a level comparator circuit (LCC), the SNRPNR of the detector array was improved over a factor of four. In addition, we performed a statistical analysis of the photon number on this SNSPD array with LCC, showing that the LCC method effectively enhances the PNR resolution. Besides, the system timing jitter of the detector was reduced from 90 ps to 72 ps due to the improved electrical SNR.

preprint2022arXiv

A Benchmark for Federated Hetero-Task Learning

To investigate the heterogeneity in federated learning in real-world scenarios, we generalize the classic federated learning to federated hetero-task learning, which emphasizes the inconsistency across the participants in federated learning in terms of both data distribution and learning tasks. We also present B-FHTL, a federated hetero-task learning benchmark consisting of simulation dataset, FL protocols and a unified evaluation mechanism. B-FHTL dataset contains three well-designed federated learning tasks with increasing heterogeneity. Each task simulates the clients with different non-IID data and learning tasks. To ensure fair comparison among different FL algorithms, B-FHTL builds in a full suite of FL protocols by providing high-level APIs to avoid privacy leakage, and presets most common evaluation metrics spanning across different learning tasks, such as regression, classification, text generation and etc. Furthermore, we compare the FL algorithms in fields of federated multi-task learning, federated personalization and federated meta learning within B-FHTL, and highlight the influence of heterogeneity and difficulties of federated hetero-task learning. Our benchmark, including the federated dataset, protocols, the evaluation mechanism and the preliminary experiment, is open-sourced at https://github.com/alibaba/FederatedScope/tree/master/benchmark/B-FHTL

preprint2022arXiv

A Unified Framework for Generalized Moment Problems: a Novel Primal-Dual Approach

Generalized moment problems optimize functional expectation over a class of distributions with generalized moment constraints, i.e., the function in the moment can be any measurable function. These problems have recently attracted growing interest due to their great flexibility in representing nonstandard moment constraints, such as geometry-mean constraints, entropy constraints, and exponential-type moment constraints. Despite the increasing research interest, analytical solutions are mostly missing for these problems, and researchers have to settle for nontight bounds or numerical approaches that are either suboptimal or only applicable to some special cases. In addition, the techniques used to develop closed-form solutions to the standard moment problems are tailored for specific problem structures. In this paper, we propose a framework that provides a unified treatment for any moment problem. The key ingredient of the framework is a novel primal-dual optimality condition. This optimality condition enables us to reduce the original infinite dimensional problem to a nonlinear equation system with a finite number of variables. In solving three specific moment problems, the framework demonstrates a clear path for identifying the analytical solution if one is available, otherwise, it produces semi-analytical solutions that lead to efficient numerical algorithms. Finally, through numerical experiments, we provide further evidence regarding the performance of the resulting algorithms by solving a moment problem and a distributionally robust newsvendor problem.

preprint2022arXiv

Adversarial Attacks and Defense Methods for Power Quality Recognition

Vulnerability of various machine learning methods to adversarial examples has been recently explored in the literature. Power systems which use these vulnerable methods face a huge threat against adversarial examples. To this end, we first propose a signal-specific method and a universal signal-agnostic method to attack power systems using generated adversarial examples. Black-box attacks based on transferable characteristics and the above two methods are also proposed and evaluated. We then adopt adversarial training to defend systems against adversarial attacks. Experimental analyses demonstrate that our signal-specific attack method provides less perturbation compared to the FGSM (Fast Gradient Sign Method), and our signal-agnostic attack method can generate perturbations fooling most natural signals with high probability. What's more, the attack method based on the universal signal-agnostic algorithm has a higher transfer rate of black-box attacks than the attack method based on the signal-specific algorithm. In addition, the results show that the proposed adversarial training improves robustness of power systems to adversarial examples.

preprint2022arXiv

Balancing Stability and Plasticity through Advanced Null Space in Continual Learning

Continual learning is a learning paradigm that learns tasks sequentially with resources constraints, in which the key challenge is stability-plasticity dilemma, i.e., it is uneasy to simultaneously have the stability to prevent catastrophic forgetting of old tasks and the plasticity to learn new tasks well. In this paper, we propose a new continual learning approach, Advanced Null Space (AdNS), to balance the stability and plasticity without storing any old data of previous tasks. Specifically, to obtain better stability, AdNS makes use of low-rank approximation to obtain a novel null space and projects the gradient onto the null space to prevent the interference on the past tasks. To control the generation of the null space, we introduce a non-uniform constraint strength to further reduce forgetting. Furthermore, we present a simple but effective method, intra-task distillation, to improve the performance of the current task. Finally, we theoretically find that null space plays a key role in plasticity and stability, respectively. Experimental results show that the proposed method can achieve better performance compared to state-of-the-art continual learning approaches.

preprint2022arXiv

Bootstrapping a User-Centered Task-Oriented Dialogue System

We present TacoBot, a task-oriented dialogue system built for the inaugural Alexa Prize TaskBot Challenge, which assists users in completing multi-step cooking and home improvement tasks. TacoBot is designed with a user-centered principle and aspires to deliver a collaborative and accessible dialogue experience. Towards that end, it is equipped with accurate language understanding, flexible dialogue management, and engaging response generation. Furthermore, TacoBot is backed by a strong search engine and an automated end-to-end test suite. In bootstrapping the development of TacoBot, we explore a series of data augmentation strategies to train advanced neural language processing models and continuously improve the dialogue experience with collected real conversations. At the end of the semifinals, TacoBot achieved an average rating of 3.55/5.0.

preprint2022arXiv

Coherence boosting: When your pretrained language model is not paying enough attention

Long-range semantic coherence remains a challenge in automatic language generation and understanding. We demonstrate that large language models have insufficiently learned the effect of distant words on next-token prediction. We present coherence boosting, an inference procedure that increases a LM's focus on a long context. We show the benefits of coherence boosting with pretrained models by distributional analyses of generated ordinary text and dialog responses. It is also found that coherence boosting with state-of-the-art models for various zero-shot NLP tasks yields performance gains with no additional training.

preprint2022arXiv

Euler-Maruyama scheme for SDEs with Dini continuous coefficients

In this paper, we show the convergence rate of Euler-Maruyama scheme for non-degenerate SDEs with Dini continuous coefficients, by the aid of the regularity of the solution to the associated Kolmogorov equation. We obtain the same conclusions using a simple and clever way to simplify the proof and weaken the conditions in \cite{BHY} by the properties of Dini continuous and Taylor expansion.

preprint2022arXiv

Explicit Image Caption Editing

Given an image and a reference caption, the image caption editing task aims to correct the misalignment errors and generate a refined caption. However, all existing caption editing works are implicit models, ie, they directly produce the refined captions without explicit connections to the reference captions. In this paper, we introduce a new task: Explicit Caption Editing (ECE). ECE models explicitly generate a sequence of edit operations, and this edit operation sequence can translate the reference caption into a refined one. Compared to the implicit editing, ECE has multiple advantages: 1) Explainable: it can trace the whole editing path. 2) Editing Efficient: it only needs to modify a few words. 3) Human-like: it resembles the way that humans perform caption editing, and tries to keep original sentence structures. To solve this new task, we propose the first ECE model: TIger. TIger is a non-autoregressive transformer-based model, consisting of three modules: Tagger_del, Tagger_add, and Inserter. Specifically, Tagger_del decides whether each word should be preserved or not, Tagger_add decides where to add new words, and Inserter predicts the specific word for adding. To further facilitate ECE research, we propose two new ECE benchmarks by re-organizing two existing datasets, dubbed COCO-EE and Flickr30K-EE, respectively. Extensive ablations on both two benchmarks have demonstrated the effectiveness of TIger.

preprint2022arXiv

Extreme ratio between spectral and Frobenius norms of nonnegative tensors

One of the fundamental problems in multilinear algebra, the minimum ratio between the spectral and Frobenius norms of tensors, has received considerable attention in recent years. While most values are unknown for real and complex tensors, the asymptotic order of magnitude and tight lower bounds have been established. However, little is known about nonnegative tensors. In this paper, we present an almost complete picture of the ratio for nonnegative tensors. In particular, we provide a tight lower bound that can be achieved by a wide class of nonnegative tensors under a simple necessary and sufficient condition, which helps to characterize the extreme tensors and obtain results such as the asymptotic order of magnitude. We show that the ratio for symmetric tensors is no more than that for general tensors multiplied by a constant depending only on the order of tensors, hence determining the asymptotic order of magnitude for real, complex, and nonnegative symmetric tensors. We also find that the ratio is in general different to the minimum ratio between the Frobenius and nuclear norms for nonnegative tensors, a sharp contrast to the case for real tensors and complex tensors.

preprint2022arXiv

FederatedScope-GNN: Towards a Unified, Comprehensive and Efficient Package for Federated Graph Learning

The incredible development of federated learning (FL) has benefited various tasks in the domains of computer vision and natural language processing, and the existing frameworks such as TFF and FATE has made the deployment easy in real-world applications. However, federated graph learning (FGL), even though graph data are prevalent, has not been well supported due to its unique characteristics and requirements. The lack of FGL-related framework increases the efforts for accomplishing reproducible research and deploying in real-world applications. Motivated by such strong demand, in this paper, we first discuss the challenges in creating an easy-to-use FGL package and accordingly present our implemented package FederatedScope-GNN (FS-G), which provides (1) a unified view for modularizing and expressing FGL algorithms; (2) comprehensive DataZoo and ModelZoo for out-of-the-box FGL capability; (3) an efficient model auto-tuning component; and (4) off-the-shelf privacy attack and defense abilities. We validate the effectiveness of FS-G by conducting extensive experiments, which simultaneously gains many valuable insights about FGL for the community. Moreover, we employ FS-G to serve the FGL application in real-world E-commerce scenarios, where the attained improvements indicate great potential business benefits. We publicly release FS-G, as submodules of FederatedScope, at https://github.com/alibaba/FederatedScope to promote FGL's research and enable broad applications that would otherwise be infeasible due to the lack of a dedicated package.

preprint2022arXiv

FedHPO-B: A Benchmark Suite for Federated Hyperparameter Optimization

Hyperparameter optimization (HPO) is crucial for machine learning algorithms to achieve satisfactory performance, whose progress has been boosted by related benchmarks. Nonetheless, existing efforts in benchmarking all focus on HPO for traditional centralized learning while ignoring federated learning (FL), a promising paradigm for collaboratively learning models from dispersed data. In this paper, we first identify some uniqueness of HPO for FL algorithms from various aspects. Due to this uniqueness, existing HPO benchmarks no longer satisfy the need to compare HPO methods in the FL setting. To facilitate the research of HPO in the FL setting, we propose and implement a benchmark suite FedHPO-B that incorporates comprehensive FL tasks, enables efficient function evaluations, and eases continuing extensions. We also conduct extensive experiments based on FedHPO-B to benchmark a few HPO methods. We open-source FedHPO-B at https://github.com/alibaba/FederatedScope/tree/master/benchmark/FedHPOB.

preprint2022arXiv

Graph-Based Similarity of Neural Network Representations

Understanding the black-box representations in Deep Neural Networks (DNN) is an essential problem in deep learning. In this work, we propose Graph-Based Similarity (GBS) to measure the similarity of layer features. Contrary to previous works that compute the similarity directly on the feature maps, GBS measures the correlation based on the graph constructed with hidden layer outputs. By treating each input sample as a node and the corresponding layer output similarity as edges, we construct the graph of DNN representations for each layer. The similarity between graphs of layers identifies the correspondences between representations of models trained in different datasets and initializations. We demonstrate and prove the invariance property of GBS, including invariance to orthogonal transformation and invariance to isotropic scaling, and compare GBS with CKA. GBS shows state-of-the-art performance in reflecting the similarity and provides insights on explaining the adversarial sample behavior on the hidden layer space.

preprint2022arXiv

HGKT: Introducing Hierarchical Exercise Graph for Knowledge Tracing

Knowledge tracing (KT) which aims at predicting learner's knowledge mastery plays an important role in the computer-aided educational system. In recent years, many deep learning models have been applied to tackle the KT task, which have shown promising results. However, limitations still exist. Most existing methods simplify the exercising records as knowledge sequences, which fail to explore rich information that existed in exercises. Besides, the existing diagnosis results of knowledge tracing are not convincing enough since they neglect prior relations between exercises. To solve the above problems, we propose a hierarchical graph knowledge tracing model called HGKT to explore the latent hierarchical relations between exercises. Specifically, we introduce the concept of problem schema to construct a hierarchical exercise graph that could model the exercise learning dependencies. Moreover, we employ two attention mechanisms to highlight the important historical states of learners. In the testing stage, we present a K&S diagnosis matrix that could trace the transition of mastery of knowledge and problem schema, which can be more easily applied to different applications. Extensive experiments show the effectiveness and interpretability of our proposed models.

preprint2022arXiv

HyAR: Addressing Discrete-Continuous Action Reinforcement Learning via Hybrid Action Representation

Discrete-continuous hybrid action space is a natural setting in many practical problems, such as robot control and game AI. However, most previous Reinforcement Learning (RL) works only demonstrate the success in controlling with either discrete or continuous action space, while seldom take into account the hybrid action space. One naive way to address hybrid action RL is to convert the hybrid action space into a unified homogeneous action space by discretization or continualization, so that conventional RL algorithms can be applied. However, this ignores the underlying structure of hybrid action space and also induces the scalability issue and additional approximation difficulties, thus leading to degenerated results. In this paper, we propose Hybrid Action Representation (HyAR) to learn a compact and decodable latent representation space for the original hybrid action space. HyAR constructs the latent space and embeds the dependence between discrete action and continuous parameter via an embedding table and conditional Variantional Auto-Encoder (VAE). To further improve the effectiveness, the action representation is trained to be semantically smooth through unsupervised environmental dynamics prediction. Finally, the agent then learns its policy with conventional DRL algorithms in the learned representation space and interacts with the environment by decoding the hybrid action embeddings to the original action space. We evaluate HyAR in a variety of environments with discrete-continuous action space. The results demonstrate the superiority of HyAR when compared with previous baselines, especially for high-dimensional action spaces.

preprint2022arXiv

Inferring Network Structures via Signal Lasso

Inferring the connectivity structure of networked systems from data is an extremely important task in many areas of science. Most of real-world networks exhibit sparsely connected topologies, with links between nodes that in some cases may be even associated to a binary state (0 or 1, denoting respectively the absence or the existence of a connection). Such un-weighted topologies are elusive to classical reconstruction methods such as Lasso or Compressed Sensing techniques. We here introduce a novel approach called signal Lasso, where the estimation of the signal parameter is subjected to 0 or 1 values. The theoretical properties and algorithm of proposed method are studied in detail. Applications of the method are illustrated to an evolutionary game and synchronization dynamics in several synthetic and empirical networks, where we show that the novel strategy is reliable and robust, and outperform the classical approaches in terms of accuracy and mean square errors.

preprint2022arXiv

Modern Question Answering Datasets and Benchmarks: A Survey

Question Answering (QA) is one of the most important natural language processing (NLP) tasks. It aims using NLP technologies to generate a corresponding answer to a given question based on the massive unstructured corpus. With the development of deep learning, more and more challenging QA datasets are being proposed, and lots of new methods for solving them are also emerging. In this paper, we investigate influential QA datasets that have been released in the era of deep learning. Specifically, we begin with introducing two of the most common QA tasks - textual question answer and visual question answering - separately, covering the most representative datasets, and then give some current challenges of QA research.

preprint2022arXiv

MR-SVS: Singing Voice Synthesis with Multi-Reference Encoder

Multi-speaker singing voice synthesis is to generate the singing voice sung by different speakers. To generalize to new speakers, previous zero-shot singing adaptation methods obtain the timbre of the target speaker with a fixed-size embedding from single reference audio. However, they face several challenges: 1) the fixed-size speaker embedding is not powerful enough to capture full details of the target timbre; 2) single reference audio does not contain sufficient timbre information of the target speaker; 3) the pitch inconsistency between different speakers also leads to a degradation in the generated voice. In this paper, we propose a new model called MR-SVS to tackle these problems. Specifically, we employ both a multi-reference encoder and a fixed-size encoder to encode the timbre of the target speaker from multiple reference audios. The Multi-reference encoder can capture more details and variations of the target timbre. Besides, we propose a well-designed pitch shift method to address the pitch inconsistency problem. Experiments indicate that our method outperforms the baseline method both in naturalness and similarity.

preprint2022arXiv

N24News: A New Dataset for Multimodal News Classification

Current news datasets merely focus on text features on the news and rarely leverage the feature of images, excluding numerous essential features for news classification. In this paper, we propose a new dataset, N24News, which is generated from New York Times with 24 categories and contains both text and image information in each news. We use a multitask multimodal method and the experimental results show multimodal news classification performs better than text-only news classification. Depending on the length of the text, the classification accuracy can be increased by up to 8.11%. Our research reveals the relationship between the performance of a multimodal classifier and its sub-classifiers, and also the possible improvements when applying multimodal in news classification. N24News is shown to have great potential to prompt the multimodal news studies.

preprint2022arXiv

NSSIA: A New Self-Sovereign Identity Scheme with Accountability

Self-Sovereign Identity (SSI) is a new distributed method for identity management, commonly used to address the problem that users are lack of control over their identities. However, the excessive pursuit of self-sovereignty in the most existing SSI schemes hinders sanctions against attackers. To deal with the malicious behavior, a few SSI schemes introduce accountability mechanisms, but they sacrifice users' privacy. What's more, the digital identities (static strings or updatable chains) in the existing SSI schemes are as inputs to a third-party executable program (mobile app, smart contract, etc.) to achieve identity reading, storing and proving, users' self-sovereignty are weakened. To solve the above problems, we present a new self-sovereign identity scheme to strike a balance between privacy and accountability and get rid of the dependence on the third-party program. In our scheme, one and only individual-specific executable code is generated as a digital avatar-i for each human to interact with others in cyberspace without a third-party program, in which the embedding of biometrics enhances uniqueness and user control over their identity. In addition, a joint accountability mechanism, which is based on the shamir (t, n) threshold algorithm and a consortium blockchain, is designed to restrict the power of each regulatory authority and protect users' privacy. Finally, we analyze the security, SSI properties and conduct detailed experiments in term of the cost of computation, storage and blockchain gas. The analysis results indicate that our scheme resists the known attacks and fulfills all the six SSI properties. Compared with the state-of-the-art schemes, the extensive experiment results show that the cost is larger in server storage, blockchain storage and blockchain gas, but is still low enough for practical situations.

preprint2022arXiv

Online Continual Learning with Contrastive Vision Transformer

Online continual learning (online CL) studies the problem of learning sequential tasks from an online data stream without task boundaries, aiming to adapt to new data while alleviating catastrophic forgetting on the past tasks. This paper proposes a framework Contrastive Vision Transformer (CVT), which designs a focal contrastive learning strategy based on a transformer architecture, to achieve a better stability-plasticity trade-off for online CL. Specifically, we design a new external attention mechanism for online CL that implicitly captures previous tasks' information. Besides, CVT contains learnable focuses for each class, which could accumulate the knowledge of previous classes to alleviate forgetting. Based on the learnable focuses, we design a focal contrastive loss to rebalance contrastive learning between new and past classes and consolidate previously learned representations. Moreover, CVT contains a dual-classifier structure for decoupling learning current classes and balancing all observed classes. The extensive experimental results show that our approach achieves state-of-the-art performance with even fewer parameters on online CL benchmarks and effectively alleviates the catastrophic forgetting.

preprint2022arXiv

PAnDR: Fast Adaptation to New Environments from Offline Experiences via Decoupling Policy and Environment Representations

Deep Reinforcement Learning (DRL) has been a promising solution to many complex decision-making problems. Nevertheless, the notorious weakness in generalization among environments prevent widespread application of DRL agents in real-world scenarios. Although advances have been made recently, most prior works assume sufficient online interaction on training environments, which can be costly in practical cases. To this end, we focus on an offline-training-online-adaptation setting, in which the agent first learns from offline experiences collected in environments with different dynamics and then performs online policy adaptation in environments with new dynamics. In this paper, we propose Policy Adaptation with Decoupled Representations (PAnDR) for fast policy adaptation. In offline training phase, the environment representation and policy representation are learned through contrastive learning and policy recovery, respectively. The representations are further refined by mutual information optimization to make them more decoupled and complete. With learned representations, a Policy-Dynamics Value Function (PDVF) [Raileanu et al., 2020] network is trained to approximate the values for different combinations of policies and environments from offline experiences. In online adaptation phase, with the environment context inferred from few experiences collected in new environments, the policy is optimized by gradient ascent with respect to the PDVF. Our experiments show that PAnDR outperforms existing algorithms in several representative policy adaptation problems.

preprint2022arXiv

PrEF: Percolation-based Evolutionary Framework for the diffusion-source-localization problem in large networks

We assume that the state of a number of nodes in a network could be investigated if necessary, and study what configuration of those nodes could facilitate a better solution for the diffusion-source-localization (DSL) problem. In particular, we formulate a candidate set which contains the diffusion source for sure, and propose the method, Percolation-based Evolutionary Framework (PrEF), to minimize such set. Hence one could further conduct more intensive investigation on only a few nodes to target the source. To achieve that, we first demonstrate that there are some similarities between the DSL problem and the network immunization problem. We find that the minimization of the candidate set is equivalent to the minimization of the order parameter if we view the observer set as the removal node set. Hence, PrEF is developed based on the network percolation and evolutionary algorithm. The effectiveness of the proposed method is validated on both model and empirical networks in regard to varied circumstances. Our results show that the developed approach could achieve a much smaller candidate set compared to the state of the art in almost all cases. Meanwhile, our approach is also more stable, i.e., it has similar performance irrespective of varied infection probabilities, diffusion models, and outbreak ranges. More importantly, our approach might provide a new framework to tackle the DSL problem in extreme large networks.

preprint2022arXiv

Social physics

Recent decades have seen a rise in the use of physics methods to study different societal phenomena. This development has been due to physicists venturing outside of their traditional domains of interest, but also due to scientists from other disciplines taking from physics the methods that have proven so successful throughout the 19th and the 20th century. Here we dub this field 'social physics' and pay our respect to intellectual mavericks who nurtured it to maturity. We do so by reviewing the current state of the art. Starting with a set of topics that are at the heart of modern human societies, we review research dedicated to urban development and traffic, the functioning of financial markets, cooperation as the basis for our evolutionary success, the structure of social networks, and the integration of intelligent machines into these networks. We then shift our attention to a set of topics that explore potential threats to society. These include criminal behaviour, large-scale migrations, epidemics, environmental challenges, and climate change. We end the coverage of each topic with promising directions for future research. Based on this, we conclude that the future for social physics is bright. Physicists studying societal phenomena are no longer a curiosity, but rather a force to be reckoned with. Notwithstanding, it remains of the utmost importance that we continue to foster constructive dialogue and mutual respect at the interfaces of different scientific disciplines.

preprint2022arXiv

Study of Electroweak Phase Transition in Exotic Higgs Decays at the CEPC

A strong first-order electroweak phase transition (EWPT) can be induced by light new physics weakly coupled to the Higgs. This study focuses on a scenario in which the first-order EWPT is driven by a light scalar $s$ with a mass between 15-60 GeV. A search for exotic decays of the Higgs boson into a pair of spin-zero particles, $h \to ss$, where the $s$-boson decays into $b$-quarks promptly is presented. The search is performed in events where the Higgs boson is produced in association with a $Z$ boson, giving rise to a signature of two charged leptons (electrons or muons) and multiple jets from $b$-quark decays. The analysis is considering a scenario of analysing 5000 fb$^{-1}$ $e^+ e^-$ collision data at $\sqrt{s} = 240 $ GeV from the Circular Electron Positron Collider (CEPC). This study with $4b$ final state conclusively tests the expected sensitivity of probing the light scalars in the CEPC experiment. The sensitivity reach is significantly larger than that can be achieved at the LHC.

preprint2022arXiv

Towards Fine-grained Causal Reasoning and QA

Understanding causality is key to the success of NLP applications, especially in high-stakes domains. Causality comes in various perspectives such as enable and prevent that, despite their importance, have been largely ignored in the literature. This paper introduces a novel fine-grained causal reasoning dataset and presents a series of novel predictive tasks in NLP, such as causality detection, event causality extraction, and Causal QA. Our dataset contains human annotations of 25K cause-effect event pairs and 24K question-answering pairs within multi-sentence samples, where each can have multiple causal relationships. Through extensive experiments and analysis, we show that the complex relations in our dataset bring unique challenges to state-of-the-art methods across all three tasks and highlight potential research opportunities, especially in developing "causal-thinking" methods.

preprint2022arXiv

Understanding the Dynamics of DNNs Using Graph Modularity

There are good arguments to support the claim that deep neural networks (DNNs) capture better feature representations than the previous hand-crafted feature engineering, which leads to a significant performance improvement. In this paper, we move a tiny step towards understanding the dynamics of feature representations over layers. Specifically, we model the process of class separation of intermediate representations in pre-trained DNNs as the evolution of communities in dynamic graphs. Then, we introduce modularity, a generic metric in graph theory, to quantify the evolution of communities. In the preliminary experiment, we find that modularity roughly tends to increase as the layer goes deeper and the degradation and plateau arise when the model complexity is great relative to the dataset. Through an asymptotic analysis, we prove that modularity can be broadly used for different applications. For example, modularity provides new insights to quantify the difference between feature representations. More crucially, we demonstrate that the degradation and plateau in modularity curves represent redundant layers in DNNs and can be pruned with minimal impact on performance, which provides theoretical guidance for layer pruning. Our code is available at https://github.com/yaolu-zjut/Dynamic-Graphs-Construction.

preprint2021arXiv

A deep learning method for solving high-order nonlinear soliton equation

We propose effective scheme of deep learning method for high-order nonlinear soliton equation and compare the activation function for high-order soliton equation. The neural network approximates the solution of the equation under the conditions of differential operator, initial condition and boundary condition. We apply this method to high-order nonlinear soliton equation, and verify its efficiency by solving the fourth-order Boussinesq equation and the fifth-order Korteweg de Vries equation. The results show that deep learning method can solve the high-order nonlinear soliton equation and reveal the interaction between solitons.

preprint2021arXiv

AdaBERT: Task-Adaptive BERT Compression with Differentiable Neural Architecture Search

Large pre-trained language models such as BERT have shown their effectiveness in various natural language processing tasks. However, the huge parameter size makes them difficult to be deployed in real-time applications that require quick inference with limited resources. Existing methods compress BERT into small models while such compression is task-independent, i.e., the same compressed BERT for all different downstream tasks. Motivated by the necessity and benefits of task-oriented BERT compression, we propose a novel compression method, AdaBERT, that leverages differentiable Neural Architecture Search to automatically compress BERT into task-adaptive small models for specific tasks. We incorporate a task-oriented knowledge distillation loss to provide search hints and an efficiency-aware loss as search constraints, which enables a good trade-off between efficiency and effectiveness for task-adaptive BERT compression. We evaluate AdaBERT on several NLP tasks, and the results demonstrate that those task-adaptive compressed models are 12.7x to 29.3x faster than BERT in inference time and 11.5x to 17.0x smaller in terms of parameter size, while comparable performance is maintained.

preprint2021arXiv

Completion Time Minimization in Wireless-Powered UAV-Assisted Data Collection System (Full Version)

In unmanned aerial vehicle (UAV)-assisted data collection system, UAVs can be deployed to charge ground terminals (GTs) via wireless power transfer (WPT) and collect data from them via wireless information transmission (WIT). In this paper, we aim to minimize the time required by a UAV via jointly optimizing the trajectory of the UAV and the transmission scheduling for all the GTs. This problem is formulated as a mixed integer nonlinear programming (MINLP) which are difficult to address in general. To this end, we develop an iterative algorithm based on binary search and successive convex optimization (SCO) to solve it. The simulation shows that our proposed solution outperforms the benchmark algorithms.

preprint2021arXiv

Entanglement of two quantum memories via fibers over dozens of kilometres

Quantum internet will enable a number of revolutionary applications. It relies on entanglement of remote quantum memories over long distances. Despite enormous progresses so far, the maximal physical separation achieved between two nodes is 1.3 km, and challenges for long distance remain. Here we make a significant step forward by entangling two atomic ensembles in one lab via photon transmission through metropolitan-scale fibers. We use cavity enhancement to create bright atom-photon entanglement, and harness quantum frequency conversion to shift the atomic wavelength to telecom. We realize entanglement over 22 km field-deployed fibers via two-photon interference, and entanglement over 50 km coiled fibers via single-photon interference. Our experiment can be extended to physically separated nodes with similar distance as a functional segment for atomic quantum networks, thus paving the way towards establishing atomic entanglement over many nodes and over much longer distance.

preprint2021arXiv

Equilibrium and Socially optimal of a double-sided queueing system with two-mass point matching time

We study a passenger-taxi double-ended queue with impatient passengers and two-point matching time in this paper. The system considered in this paper is different from those considered in the existing literature, which fully considers the matching time between passengers and taxis, and the taxi capacity of the system. The objective is to get the equilibrium joining strategy and the socially optimal strategy under two information levels. For the practical consideration of the airport terminal scenario, two different information levels are considered. The theoretical results show that the passenger utility function in the partially observable case is monotonic. For the complex form of social welfare function of the partially observable case, we use a split derivation. The equilibrium strategy and socially optimal strategy of the observable case are threshold-type. Furthermore, some representative numerical scenarios are used to visualize the theoretical results. The numerical scenarios illustrate the influence of parameters on the equilibrium strategy and socially optimal strategy under two information levels. Finally, the optimal social welfare for the two information levels with the same parameters are compared.

preprint2021arXiv

Experimental Side-Channel-Free Quantum Key Distribution

Quantum key distribution can provide unconditionally secure key exchange for remote users in theory. In practice, however, in most quantum key distribution systems, quantum hackers might steal the secure keys by listening to the side channels in the source, such as the photon frequency spectrum, emission time, propagation direction, spatial angular momentum, and so on. It is hard to prevent such kinds of attacks because side channels may exist in any of the encoding space whether the designers take care of or not. Here we report an experimental realization of a side-channel-free quantum key distribution protocol which is not only measurement-device-independent, but also immune to all side-channel attacks in the source. We achieve a secure key rate of 4.80e-7 per pulse through 50 km fiber spools.

preprint2021arXiv

High-performance quantum entanglement generation via cascaded second-order nonlinear processes

In this paper, we demonstrate the generation of high-performance entangled photon-pairs in different degrees of freedom from a single piece of fiber pigtailed periodically poled LiNbO$_3$ (PPLN) waveguide. We utilize cascaded second-order nonlinear optical processes, i.e. second-harmonic generation (SHG) and spontaneous parametric down conversion (SPDC), to generate photon-pairs. Previously, the performance of the photon pairs is contaminated by Raman noise photons from the fiber pigtails. Here by integrating the PPLN waveguide with noise rejecting filters, we obtain a coincidence-to-accidental ratio (CAR) higher than 52,600 with photon-pair generation and detection rate of 52.3 kHz and 3.5 kHz, respectively. Energy-time, frequency-bin and time-bin entanglement is prepared by coherently superposing correlated two-photon states in these degrees of freedom, respectively. The energy-time entangled two-photon states achieve the maximum value of CHSH-Bell inequality of S=2.708$\pm$0.024 with a two-photon interference visibility of 95.74$\pm$0.86%. The frequency-bin entangled two-photon states achieve fidelity of 97.56$\pm$1.79% with a spatial quantum beating visibility of 96.85$\pm$2.46%. The time-bin entangled two-photon states achieve the maximum value of CHSH-Bell inequality of S=2.595$\pm$0.037 and quantum tomographic fidelity of 89.07$\pm$4.35%. Our results provide a potential candidate for quantum light source in quantum photonics.

preprint2021arXiv

Inhibition of Current-spike Formation Based on Longitudinal Phase Space Manipulation for High-Repetition-Rate X-ray FEL

The formation of a double-horn current profile is a challenging issue in the achievement of electron bunch with high peak current, especially for high-repetition-rate X-ray free-electron lasers (XFELs) where more nonlinear and intricate beam dynamics propagation is involved. Here, we propose to correct the nonlinear beam longitudinal phase space distortion in the photoinjector section with a dual-mode buncher. In combination with the evolutionary many-objective beam dynamics optimization, this method is shown to be effective in manipulating the longitudinal phase space at the injector exit. Furthermore, the formation of the current spike is avoided after the multi-stage charge density modulation and electron bunch with a peak current of 1.6 kA is achieved for 100-pC bunch charge.Start-to-end simulations based on the Shanghai high-repetition-rate XFEL and extreme light facility demonstrate that the proposed scheme can increase the FEL pulse energy by more than 3 times in a cascading operation of echo-enabled harmonic generation and high-gain harmonic generation. Moreover, this method can also be used for longitudinal phase space shaping of electron beams operating at a high repetition rate to meet the specific demands of different researches.

preprint2021arXiv

Method and Dataset Entity Mining in Scientific Literature: A CNN + Bi-LSTM Model with Self-attention

Literature analysis facilitates researchers to acquire a good understanding of the development of science and technology. The traditional literature analysis focuses largely on the literature metadata such as topics, authors, abstracts, keywords, references, etc., and little attention was paid to the main content of papers. In many scientific domains such as science, computing, engineering, etc., the methods and datasets involved in the scientific papers published in those domains carry important information and are quite useful for domain analysis as well as algorithm and dataset recommendation. In this paper, we propose a novel entity recognition model, called MDER, which is able to effectively extract the method and dataset entities from the main textual content of scientific papers. The model utilizes rule embedding and adopts a parallel structure of CNN and Bi-LSTM with the self-attention mechanism. We evaluate the proposed model on datasets which are constructed from the published papers of four research areas in computer science, i.e., NLP, CV, Data Mining and AI. The experimental results demonstrate that our model performs well in all the four areas and it features a good learning capacity for cross-area learning and recognition. We also conduct experiments to evaluate the effectiveness of different building modules within our model which indicate that the importance of different building modules in collectively contributing to the good entity recognition performance as a whole. The data augmentation experiments on our model demonstrated that data augmentation positively contributes to model training, making our model much more robust in dealing with the scenarios where only small number of training samples are available. We finally apply our model on PAKDD papers published from 2009-2019 to mine insightful results from scientific papers published in a longer time span.

preprint2021arXiv

Observation of a symmetry-protected topological time crystal with superconducting qubits

We report the observation of a symmetry-protected topological time crystal, which is implemented with an array of programmable superconducting qubits. Unlike the time crystals reported in previous experiments, where spontaneous breaking of the discrete time translational symmetry occurs for local observables throughout the whole system, the topological time crystal observed in our experiment breaks the time translational symmetry only at the boundaries and has trivial dynamics in the bulk. More concretely, we observe robust long-lived temporal correlations and sub-harmonic temporal response for the edge spins up to 40 driving cycles. We demonstrate that the sub-harmonic response is independent of whether the initial states are random product states or symmetry-protected topological states, and experimentally map out the phase boundary between the time crystalline and thermal phases. Our work paves the way to exploring peculiar non-equilibrium phases of matter emerged from the interplay between topology and localization as well as periodic driving, with current noisy intermediate-scale quantum processors.

preprint2021arXiv

Prolonging Valley Polarization Lifetime through Gate-Controlled Exciton-to-Trion Conversion in Monolayer Molybdenum Ditelluride

Monolayer 2D semiconductors provide an attractive option for valleytronics due to the valley-addressability by helicity-specific light beam. But the short valley lifetime for excitons have hindered potential valleytronic applications. In this paper, we demonstrate a strategy for prolonging the valley lifetime by converting excitons to trions through effective gate control and by taking advantage of much longer valley lifetime for trions than for excitons. In continuous-wave experiments, we found the valley polarization increases as gate voltage is tuned away from the charge neutrality, with the degree of valley polarization increased from near zero to 38 % for excitons and to 33 % for trions. This is the first successful observation of valley-polarization in MoTe2 without a magnetic field. In pump-probe experiments, we found that the intervalley scattering process of excitons is significantly suppressed as gate voltage is tuned away from charge neutrality, with scattering time from 0.85 ps to ~ 2.17 ps. In contrast, the intervalley scattering rate for trions increases due to increased availability of partner charges for trion spin flipping, with scattering time from 1.39 ns down to ~100 ps away from charge neutrality. Interestingly, our results show that, despite the accelerated intervalley scattering, the trion polarization degree increases due to polarized trion generation from the exciton-to-trion conversion overtaking the intervalley trion scatterings. Importantly, the efficient exciton-to-trion conversion changed the dominant depolarization mechanisms. As a result, the valley lifetime is dramatically improved by 1000 times from excitons to trions at the charge neutrality. Our results shed new light into the depolarization dynamics and the interplay of various depolarization channels for excitons and trions and provide an effective strategy for prolonging the valley polarization.

preprint2021arXiv

Quantum key distribution over 658 km fiber with distributed vibration sensing

Twin-field quantum key distribution (TF-QKD) promises ultra-long secure key distribution which surpasses the rate distance limit and can reduce the number of the trusted nodes in long-haul quantum network. Tremendous efforts have been made towards implementation of TF-QKD, among which, the secure key with finite size analysis can distribute more than 500 km in the lab and in the field. Here, we demonstrate the sending-or-not-sending TF-QKD experimentally, achieving a secure key distribution with finite size analysis over 658 km ultra-low-loss optical fiber, improve the secure distance record by around 100 km. Meanwhile, in a TF-QKD system, any phase fluctuation due to temperature variation and ambient variation during the channel must be recorded and compensated, and all these phase information can then be utilized to sense the channel vibration perturbations. With our QKD system, we recovered the external vibrational perturbations on the fiber generated by an artificial vibroseis and successfully located the perturbation position with a resolution better than 1 km. Our results not only set a new distance record of QKD, but also demonstrate that the redundant information of TF-QKD can be used for remote sensing of the channel vibration, which can find applications in earthquake detection and landslide monitoring besides secure communication.

preprint2021arXiv

Self-Amplification of Coherent Energy Modulation in Seeded Free-Electron Lasers

The spectroscopic techniques for time-resolved fine analysis of matter require coherent X-ray radiation with femtosecond duration and high average brightness. Seeded free-electron lasers (FELs), which use the frequency up-conversion of an external seed laser to improve temporal coherence, are ideal for providing fully coherent soft X-ray pulses. However, it is difficult to operate seeded FELs at a high repetition rate due to the limitations of present state-of-the-art laser systems. Here, we report the novel self-modulation method for enhancing laser-induced energy modulation, thereby significantly reducing the requirement of an external laser system. Driven by this scheme, we experimentally realize high harmonic generation in a seeded FEL using an unprecedentedly small energy modulation. An electron beam with a laser-induced energy modulation as small as 1.8 times the slice energy spread is used for lasing at the 7th harmonic of a 266-nm seed laser in a single-stage high-gain harmonic generation (HGHG) setup and the 30th harmonic of the seed laser in a two-stage HGHG setup. The results mark a major step towards a high-repetition-rate, fully coherent X-ray FEL.

preprint2021arXiv

Synthesizing five-body interaction in a superconducting quantum circuit

Synthesizing many-body interaction Hamiltonian is a central task in quantum simulation. However, it is challenging to synthesize interactions including more than two spins. Borrowing tools from quantum optics, we synthesize five-body spin-exchange interaction in a superconducting quantum circuit by simultaneously exciting four independent qubits with time-energy correlated photon quadruples generated from a qudit. During the dynamic evolution of the five-body interaction, a Greenberger-Horne-Zeilinger state is generated in a single step with fidelity estimated to be $0.685$. We compare the influence of noise on the three-, four- and five-body interaction as a step toward answering the question on the quantum origin of chiral molecules. We also demonstrate a many-body Mach-Zehnder interferometer which potentially has a Heisenberg-limit sensitivity. This study paves a way for quantum simulation involving many-body interactions and high excited states of quantum circuits.

preprint2020arXiv

A 16-channel fiber array-coupled superconducting single-photon detector array with average system detection efficiency over 60% at telecom wavelength

We report a compact, scalable, and high-performance superconducting nanowire single-photon detector (SNSPD) array by using a multichannel optical fiber array-coupled configuration. For single pixels with an active area of 18 um in diameter and illuminated at the telecom wavelength of 1550 nm, we achieved a pixel yield of 13/16 on one chip, an average system detection efficiency of 69% at a dark count rate of 160 cps, a minimum timing jitter of 74 ps, and a maximum count rate of ~40 Mcps. The optical crosstalk coefficient between adjacent channels is better than -60 dB. The performance of the fiber array-coupled detectors is comparable with a standalone detector coupled to a single fiber. Our method is promising for the development of scalable, high-performance, and high-yield SNSPDs.

preprint2020arXiv

A novel route to cyclic dominance in voluntary social dilemmas

Cooperation is the backbone of modern human societies, making it a priority to understand how successful cooperation-sustaining mechanisms operate. Cyclic dominance, a non-transitive setup comprising at least three strategies wherein the first strategy overrules the second which overrules the third which, in turn, overrules the first strategy, is known to maintain bio-diversity, drive competition between bacterial strains, and preserve cooperation in social dilemmas. Here, we present a novel route to cyclic dominance in voluntary social dilemmas by adding to the traditional mix of cooperators, defectors, and loners, a fourth player type, risk-averse hedgers, who enact tit-for-tat upon paying a hedging cost to avoid being exploited. When this cost is sufficiently small, cooperators, defectors, and hedgers enter a loop of cyclic dominance that preserves cooperation even under the most adverse conditions. In contrast, when the hedging cost is large, hedgers disappear, consequently reverting to the traditional interplay of cooperators, defectors, and loners. In the interim region of hedging costs, complex evolutionary dynamics ensues, prompting transitions between states with two, three, or four competing strategies. Our results thus reveal that voluntary participation is but one pathway to sustained cooperation via cyclic dominance.

preprint2020arXiv

A Semi-supervised Graph Attentive Network for Financial Fraud Detection

With the rapid growth of financial services, fraud detection has been a very important problem to guarantee a healthy environment for both users and providers. Conventional solutions for fraud detection mainly use some rule-based methods or distract some features manually to perform prediction. However, in financial services, users have rich interactions and they themselves always show multifaceted information. These data form a large multiview network, which is not fully exploited by conventional methods. Additionally, among the network, only very few of the users are labelled, which also poses a great challenge for only utilizing labeled data to achieve a satisfied performance on fraud detection. To address the problem, we expand the labeled data through their social relations to get the unlabeled data and propose a semi-supervised attentive graph neural network, namedSemiGNN to utilize the multi-view labeled and unlabeled data for fraud detection. Moreover, we propose a hierarchical attention mechanism to better correlate different neighbors and different views. Simultaneously, the attention mechanism can make the model interpretable and tell what are the important factors for the fraud and why the users are predicted as fraud. Experimentally, we conduct the prediction task on the users of Alipay, one of the largest third-party online and offline cashless payment platform serving more than 4 hundreds of million users in China. By utilizing the social relations and the user attributes, our method can achieve a better accuracy compared with the state-of-the-art methods on two tasks. Moreover, the interpretable results also give interesting intuitions regarding the tasks.

preprint2020arXiv

A3: An Automatic Topology-Aware Malfunction Detection and Fixation System in Data Center Networks

Link failures and cable miswirings are not uncommon in building data center networks, which prevents the existing automatic address configuration methods from functioning correctly. However, accurately detecting such malfunctions is not an easy task because there could be no observable node degree changes. Fixing or correcting such malfunctions is even harder as almost no work can provide accurate fixation suggestions now. To solve the problems, we design and implement A3, an automatic topology-aware malfunction detection and fixation system. A3 innovatively formulates the problem of finding minimal fixation to the problem of computing minimum graph difference (NP-hard) and solves it in O(k^6) and O(k^3) for any less than k/2 and k/4 undirected link malfunctions for FatTree, respectively. Our evaluation demonstrates that for less than k/2 undirected link malfunctions, A3 is 100% accurate for malfunction detection and provides the minimum fixation result. For greater or equal to k/2 undirected link malfunctions, A3 still has accuracy of about 100% and provides the near optimal fixation result.

preprint2020arXiv

An entanglement-based quantum network based on symmetric dispersive optics quantum key distribution

Quantum key distribution (QKD) is a crucial technology for information security in the future. Developing simple and efficient ways to establish QKD among multiple users are important to extend the applications of QKD in communication networks. Herein, we proposed a scheme of symmetric dispersive optics QKD (DO-QKD) and demonstrated an entanglement-based quantum network based on it. In the experiment, a broadband entanglement photon pair source was shared by end users via wavelength and space division multiplexing. The wide spectrum of generated entangled photon pairs was divided into 16 combinations of frequency-conjugate channels. Photon pairs in each channel combination supported a fully-connected subnet with 8 users by a passive beam splitter. Eventually, it showed that an entanglement-based QKD network over 100 users could be supported by one entangled photon pair source in this architecture. It has great potential on applications of local quantum networks with large user number.

preprint2020arXiv

Centralized Coordination of Connected Vehicles at Intersections using Graphical Mixed Integer Optimization

This paper proposes a centralized multi-vehicle coordination scheme serving unsignalized intersections. The whole process consists of three stages: a) target velocity optimization: formulate the collision-free vehicle coordination as a Mixed Integer Linear Programming (MILP) problem, with each incoming lane representing an independent variable; b) dynamic vehicle selection: build a directed graph with result of the optimization, and reserve only some of the vehicle nodes to coordinate by applying a subset extraction algorithm; c) synchronous velocity profile planning: bridge the gap between current speed and optimal velocity in a synchronous manner. The problem size is essentially bounded by number of lanes instead of vehicles. Thus the optimization process is realtime with guaranteed solution quality. Simulation has verified efficiency and real-time performance of the scheme.

preprint2020arXiv

Coherence Concurrence for X States

We study the properties of coherence concurrence and present a physical explanation analogous to the coherence of assistance. We give an optimal pure state decomposition which attains the coherence concurrence for qubit states. We prove the additivity of coherence concurrence under direct sum operations in another way. Using these results, we calculate analytically the coherence concurrence for X states and show its optimal decompositions. Moreover, we show that the coherence concurrence is exactly twice the convex roof extended negativity of the Schmidt correlated states, thus establishing a direct relation between coherence concurrence and quantum entanglement.

preprint2020arXiv

Effect of dispersion on indistinguishability between single-photon wave-packets

With propagating through a dispersive medium, the temporal-spectral profile of laser pulses should be inevitably modified. Although such dispersion effect has been well studied in classical optics, its effect on a single-photon wave-packet, i.e., the matter wave of a single-photon, has not yet been entirely revealed. In this paper, we investigate the effect of dispersion on indistinguishability of single-photon wave-packets through the Hong-Ou-Mandel (HOM) interference. By dispersively manipulating two indistinguishable single-photon wave-packets before interfering with each other, we observe that the difference of the second-order dispersion between two optical paths of the HOM interferometer can be mapped to the interference curve, indicating that (1) with the same amount of dispersion effect in both paths, the HOM interference curve must be only determined by the intrinsic indistinguishability between the wave-packets, i.e., dispersion cancellation due to the indistinguishability between Feynman paths; (2) unbalanced dispersion effect in two paths cannot be cancelled and will broaden the interference curve thus providing a way to measure the second-order dispersion coefficient. Our results suggest a more comprehensive understanding of the single-photon wave-packet and pave ways to explore further applications of the HOM interference.

preprint2020arXiv

Equilibrium customer and socially optimal balking strategies in a constant retrial queue with multiple vacations and $N$-policy

In this paper, equilibrium strategies and optimal balking strategies of customers in a constant retrial queue with multiple vacations and the $N$-policy under two information levels, respectively, are investigated. We assume that there is no waiting area in front of the server and an arriving customer is served immediately if the server is idle; otherwise (the server is either busy or on a vacation) it has to leave the system to join a virtual retrial orbit waiting for retrials according to the FCFS rules. After a service completion, if the system is not empty, the server becomes idle, available for serving the next customer, either a new arrival or a retried customer from the virtual retrial orbit; otherwise (if the system is empty), the server starts a vacation. Upon the completion of a vacation, the server is reactivated only if it finds at least $N$ customers in the virtual orbit; otherwise, the server continues another vacation. We study this model at two levels of information, respectively. For each level of information, we obtain both equilibrium and optimal balking strategies of customers, and make corresponding numerical comparisons. Through Particle Swarm Optimization (PSO) algorithm, we explore the impact of parameters on the equilibrium and social optimal thresholds, and obtain the trend in changes, as a function of system parameters, for the optimal social welfare, which provides guiding significance for social planners. Finally, by comparing the social welfare under two information levels, we find that whether the system information should be disclosed to customers depends on how to maintain the growth of social welfare.

preprint2020arXiv

Exit rights open complex pathways to cooperation

We study the evolutionary dynamics of the prisoner's dilemma game in which cooperators and defectors interact with another actor type called exiters. Rather than being exploited by defectors, exiters exit the game in favour of a small payoff. We find that this simple extension of the game allows cooperation to flourish in well-mixed populations when iterations or reputation are added. In networked populations, however, the exit option is less conducive to cooperation. Instead, it enables the coexistence of cooperators, defectors, and exiters through cyclic dominance. Other outcomes are also possible as the exit payoff increases or the network structure changes, including network-wide oscillations in actor abundances that may cause the extinction of exiters and the domination of defectors, although game parameters should favour exiting. The complex dynamics that emerges in the wake of a simple option to exit the game implies that nuances matter even if our analyses are restricted to incentives for rational behaviour.

preprint2020arXiv

Graph Embedding on Biomedical Networks: Methods, Applications, and Evaluations

Graph embedding learning that aims to automatically learn low-dimensional node representations, has drawn increasing attention in recent years. To date, most recent graph embedding methods are evaluated on social and information networks and are not comprehensively studied on biomedical networks under systematic experiments and analyses. On the other hand, for a variety of biomedical network analysis tasks, traditional techniques such as matrix factorization (which can be seen as a type of graph embedding methods) have shown promising results, and hence there is a need to systematically evaluate the more recent graph embedding methods (e.g. random walk-based and neural network-based) in terms of their usability and potential to further the state-of-the-art. We select 11 representative graph embedding methods and conduct a systematic comparison on 3 important biomedical link prediction tasks: drug-disease association (DDA) prediction, drug-drug interaction (DDI) prediction, protein-protein interaction (PPI) prediction; and 2 node classification tasks: medical term semantic type classification, protein function prediction. Our experimental results demonstrate that the recent graph embedding methods achieve promising results and deserve more attention in the future biomedical graph analysis. Compared with three state-of-the-art methods for DDAs, DDIs and protein function predictions, the recent graph embedding methods achieve competitive performance without using any biological features and the learned embeddings can be treated as complementary representations for the biological features. By summarizing the experimental results, we provide general guidelines for properly selecting graph embedding methods and setting their hyper-parameters for different biomedical tasks.

preprint2020arXiv

High Voltage Ride Through Strategy For DFIG Considering HVDC System Converter Blocking

This paper presents a P-Q coordination based high voltage ride through (HVRT) control strategy for double fed induction generators (DFIGs) based on a combined Q-V control and P-V de-loading control. The active/reactive power injection effect of DFIG on transient overvoltage is firstly analyzed and the reactive power capacity evaluation of DFIG considering its de-loading operation is then conducted. In the proposed strategy, the reactive power limit of DFIG can be flexibly extended during the transient process in coordination with its active power adjustment, as a result the transient overvoltage caused by DC bipolar block can be effectively suppressed. Moreover, key outer loop parameters such as Q-V control coefficient and de-loading coefficient can be determined based on the point of common coupling (PCC) voltage level and the available DFIG power capacity. Finally, case studies based on MATLAB/Simulink simulation are used to verify the effectiveness of the proposed control strategy

preprint2020arXiv

Multiple Flat Projections for Cross-manifold Clustering

Cross-manifold clustering is a hard topic and many traditional clustering methods fail because of the cross-manifold structures. In this paper, we propose a Multiple Flat Projections Clustering (MFPC) to deal with cross-manifold clustering problems. In our MFPC, the given samples are projected into multiple subspaces to discover the global structures of the implicit manifolds. Thus, the cross-manifold clusters are distinguished from the various projections. Further, our MFPC is extended to nonlinear manifold clustering via kernel tricks to deal with more complex cross-manifold clustering. A series of non-convex matrix optimization problems in MFPC are solved by a proposed recursive algorithm. The synthetic tests show that our MFPC works on the cross-manifold structures well. Moreover, experimental results on the benchmark datasets show the excellent performance of our MFPC compared with some state-of-the-art clustering methods.

preprint2020arXiv

Nonsaturating magnetoresistance, anomalous Hall effect, and magnetic quantum oscillations in ferromagnetic semimetal PrAlSi

We report a comprehensive investigation of the structural, magnetic, transport and thermodynamic properties of a single crystal PrAlSi, in comparison to its nonmagnetic analogue LaAlSi. PrAlSi exhibits a ferromagnetic transition at $T_C$ = 17.8 K which, however, is followed by two weak phase transitions at lower temperatures. Based on the combined dc and ac magnetic susceptibility measurements, we propose the two reentrant magnetic phases below $T_C$ to be spin glasses or ferromagnetic cluster glasses. When the magnetic glassy states are suppressed by small field, several remarkable features appear. These include a linear, nonsaturating magnetoresistance as a function of field that is reminiscent of a topological or charge-compensated semimetal, and a large anomalous Hall conductivity amounting to $\sim$2000 $Ω^{-1}$cm$^{-1}$. Specific-heat measurements indicate a non-Kramers doublet ground state and a relatively low crystal electric field splitting of the Pr$^{3+}$ multiplets of less than 100 K. Shubnikov-de Hass oscillations are absent in LaAlSi, whereas they are clearly observed below about 25 K in PrAlSi, with an unusual temperature dependence of the dominating oscillation frequency $F$. It increases from $F$ = 18 T at 25 K to $F$ = 33 T at 2 K, hinting at an emerging Fermi pocket upon cooling into the ordered phase. These results suggest that PrAlSi is a new system where a small Fermi pocket of likely relativistic fermions is strongly coupled to magnetism. Whether hybridization between $f$ and conduction band is also involved remains an intriguing open problem.

preprint2020arXiv

Observation of Two-Vertex Four-Dimensional Spin Foam Amplitudes with a 10-qubit Superconducting Quantum Processor

Quantum computers are an increasingly hopeful means for understanding large quantum many-body systems bearing high computational complexity. Such systems exhibit complex evolutions of quantum states, and are prevailing in fundamental physics, e.g., quantum gravity. Computing the transition amplitudes between different quantum states by quantum computers is one of the promising ways to solve such computational complexity problems. In this work, we apply a 10-qubit superconducting quantum processor, where the all-to-all circuit connectivity enables a many-body entangling gate that is highly efficient for state generation, to studying the transition amplitudes in loop quantum gravity. With the device metrics such as qubit coherence, control accuracy, and integration level being continuously improved, superconducting quantum processors are expected to outperform their classical counterparts in handling many-body dynamics and may lead to a deeper understanding of quantum gravity.

preprint2020arXiv

Precise tuning of the superconducting properties of Mn-doped Al films for transition edge sensors by ion-implantation

Magnetic impurities in metallic superconductors are important for both fundamental and applied sciences. In this study, we focused on dilute Mn-doped aluminum (AlMn) films, which are common superconducting materials used to make transition edge sensors (TES). We developed a multi-energy ion-implantation technique to make AlMn films. Compared with frequently used sputtering techniques, ion-implantation provides more precise and reliable control of the Mn doping concentration in the AlMn films.The ion implantation also enables us to quantitatively analyze the superconducting transition temperature curve as a function of the Mn doping concentration. We found that Mn dopants act as magnetic impurities and suppression of superconductivity is counteracted by the antiferromagnetic Ruderman Kittel Kasuya Yosida interaction among Mn dopants. The RKKY interaction can be tuned through defect engineering in the ion-implantation process and through post-implantation annealing.

preprint2020arXiv

Rationalizing Medical Relation Prediction from Corpus-level Statistics

Nowadays, the interpretability of machine learning models is becoming increasingly important, especially in the medical domain. Aiming to shed some light on how to rationalize medical relation prediction, we present a new interpretable framework inspired by existing theories on how human memory works, e.g., theories of recall and recognition. Given the corpus-level statistics, i.e., a global co-occurrence graph of a clinical text corpus, to predict the relations between two entities, we first recall rich contexts associated with the target entities, and then recognize relational interactions between these contexts to form model rationales, which will contribute to the final prediction. We conduct experiments on a real-world public clinical dataset and show that our framework can not only achieve competitive predictive performance against a comprehensive list of neural baseline models, but also present rationales to justify its prediction. We further collaborate with medical experts deeply to verify the usefulness of our model rationales for clinical decision making.

preprint2020arXiv

Scalable evaluation of quantum-circuit error loss using Clifford sampling

A major challenge in developing quantum computing technologies is to accomplish high precision tasks by utilizing multiplex optimization approaches, on both the physical system and algorithm levels. Loss functions assessing the overall performance of quantum circuits can provide the foundation for many optimization techniques. In this paper, we use the quadratic error loss and the final-state fidelity loss to characterize quantum circuits. We find that the distribution of computation error is approximately Gaussian, which in turn justifies the quadratic error loss. It is shown that these loss functions can be efficiently evaluated in a scalable way by sampling from Clifford-dominated circuits. We demonstrate the results by numerically simulating ten-qubit noisy quantum circuits with various error models as well as executing four-qubit circuits with up to ten layers of two-qubit gates on a superconducting quantum processor. Our results pave the way towards the optimization-based quantum device and algorithm design in the intermediate-scale quantum regime.

preprint2020arXiv

Schauder's estimates for nonlocal equations with singular Lévy measures

In this paper, we establish Schauder's estimates for the following non-local equations in \mR^d : $$ \partial_tu=\mathscr L^{(α)}_{κ,σ} u+b\cdot\nabla u+f,\ u(0)=0, $$ where $α\in(1/2,2)$ and $ b:\mathbb R_+\times\mathbb R^d\to\mathbb R$ is an unbounded local $β$-order Hölder function in $ x $ uniformly in $ t $, and $\mathscr L^{(α)}_{κ,σ}$ is a non-local $α$-stable-like operator with form: \begin{align*} {\mathscr L}^{(α)}_{κ,σ}u(t,x):=\int_{\mathbb R^d}\Big(u(t,x+σ(t,x)z)-u(t,x)-σ(t,x)z^{(α)}\cdot\nabla u(t,x)\Big)κ(t,x,z)ν^{(α)}(\mathord{\rm d} z), \end{align*} where $z^{(α)}=z\mathbf{1}_{α\in(1,2)}+z\mathbf{1}_{|z|\leq 1}\mathbf{1}_{α=1}$, $ κ:\mathbb R_+\times\mathbb R^{2d}\to\mathbb R_+ $ is bounded from above and below, $ σ:\mathbb R_+\times\mathbb R^{d}\to \mathbb R^d\otimes \mathbb R^d$ is a $ γ$-order Hölder continuous function in $ x $ uniformly in $ t $, and $ ν^{(α)} $ is a singular non-degenerate $ α$-stable Lévy measure.

preprint2020arXiv

Texture Classification using Block Intensity and Gradient Difference (BIGD) Descriptor

In this paper, we present an efficient and distinctive local descriptor, namely block intensity and gradient difference (BIGD). In an image patch, we randomly sample multi-scale block pairs and utilize the intensity and gradient differences of pairwise blocks to construct the local BIGD descriptor. The random sampling strategy and the multi-scale framework help BIGD descriptors capture the distinctive patterns of patches at different orientations and spatial granularity levels. We use vectors of locally aggregated descriptors (VLAD) or improved Fisher vector (IFV) to encode local BIGD descriptors into a full image descriptor, which is then fed into a linear support vector machine (SVM) classifier for texture classification. We compare the proposed descriptor with typical and state-of-the-art ones by evaluating their classification performance on five public texture data sets including Brodatz, CUReT, KTH-TIPS, and KTH-TIPS-2a and -2b. Experimental results show that the proposed BIGD descriptor with stronger discriminative power yields 0.12% ~ 6.43% higher classification accuracy than the state-of-the-art texture descriptor, dense microblock difference (DMD).

preprint2020arXiv

Tree++: Truncated Tree Based Graph Kernels

Graph-structured data arise ubiquitously in many application domains. A fundamental problem is to quantify their similarities. Graph kernels are often used for this purpose, which decompose graphs into substructures and compare these substructures. However, most of the existing graph kernels do not have the property of scale-adaptivity, i.e., they cannot compare graphs at multiple levels of granularities. Many real-world graphs such as molecules exhibit structure at varying levels of granularities. To tackle this problem, we propose a new graph kernel called Tree++ in this paper. At the heart of Tree++ is a graph kernel called the path-pattern graph kernel. The path-pattern graph kernel first builds a truncated BFS tree rooted at each vertex and then uses paths from the root to every vertex in the truncated BFS tree as features to represent graphs. The path-pattern graph kernel can only capture graph similarity at fine granularities. In order to capture graph similarity at coarse granularities, we incorporate a new concept called super path into it. The super path contains truncated BFS trees rooted at the vertices in a path. Our evaluation on a variety of real-world graphs demonstrates that Tree++ achieves the best classification accuracy compared with previous graph kernels.

preprint2019arXiv

An Experimentally Verified Approach to non-Entanglement-Breaking Channel Certification

Ensuring the non-entanglement-breaking (non-EB) property of quantum channels is crucial for the effective distribution and storage of quantum states. However, a practical method for direct and accurate certification of the non-EB feature is highly desirable. Here, we propose and verify a realistic source based measurement device independent certification of non-EB channels. Our method is resilient to repercussions on the certification from experimental conditions, such as multiphotons and imperfect state preparation, and can be implemented with information incomplete set. We achieve good agreement between experimental outcomes and theoretical predictions, which is validated by the expected results of the ideal semi-quantum signaling game, and accurately certify the non-EB channels. Furthermore, our approach is highly robust to effects from noise. Therefore, the proposed approach can be expected to play a significant role in the design and evaluation of realistic quantum channels.

preprint2019arXiv

Generation and controllable switching of superradiant and subradiant states in a 10-qubit superconducting circuit

Superradiance and subradiance concerning enhanced and inhibited collective radiation of an ensemble of atoms have been a central topic in quantum optics. However, precise generation and control of these states remain challenging. Here we deterministically generate up to 10-qubit superradiant and 8-qubit subradiant states, each containing a single excitation, in a superconducting quantum circuit with multiple qubits interconnected by a cavity resonator. The $\sqrt{N}$-scaling enhancement of the coupling strength between the superradiant states and the cavity is validated. By applying appropriate phase gate on each qubit, we are able to switch the single collective excitation between superradiant and subradiant states. While the subradiant states containing a single excitation are forbidden from emitting photons, we demonstrate that they can still absorb photons from the resonator. However, for even number of qubits, a singlet state with half of the qubits being excited can neither emit nor absorb photons, which is verified with 4 qubits. This study is a step forward in coherent control of collective radiation and has promising applications in quantum information processing.

preprint2019arXiv

High-speed measurement-device-independent quantum key distribution with integrated silicon photonics

Measurement-device-independent quantum key distribution (MDI-QKD) removes all detector side channels and enables secure QKD with an untrusted relay. It is suitable for building a star-type quantum access network, where the complicated and expensive measurement devices are placed in the central untrusted relay and each user requires only a low-cost transmitter, such as an integrated photonic chip. Here, we experimentally demonstrate a 1.25 GHz silicon photonic chip-based MDI-QKD system using polarization encoding. The photonic chip transmitters integrate the necessary encoding components for a standard QKD source. We implement random modulations of polarization states and decoy intensities, and demonstrate a finite-key secret rate of 31 bps over 36 dB channel loss (or 180 km standard fiber). This key rate is higher than state-of-the-art MDI-QKD experiments. The results show that silicon photonic chip-based MDI-QKD, benefiting from miniaturization, low-cost manufacture and compatibility with CMOS microelectronics, is a promising solution for future quantum secure networks.

preprint2019arXiv

Misinformation spreading on correlated multiplex networks

The numerous expanding online social networks offer fast channels for misinformation spreading, which could have a serious impact on socioeconomic systems. Researchers across multiple areas have paid attention to this issue with a view of addressing it. However, no systematical theoretical study has been performed to date on observing misinformation spreading on correlated multiplex networks. In this study, we propose a multiplex network-based misinformation spreading model, considering the fact that each individual can obtain misinformation from multiple platforms. Subsequently, we develop a heterogeneous edge-base compartmental theory to comprehend the spreading dynamics of our proposed model. In addition, we establish an analytical method based on stability analysis to obtain the misinformation outbreak threshold. On the basis of these theories, we finally analyze the influence of different dynamical and structural parameters on the misinformation spreading dynamics. Results show that the misinformation outbreak size $R(\infty)$ grows continuously with the effective transmission probability $β$ once $β$ exceeds a certain value, that is, the outbreak threshold $β_c$. A large average degrees, strong degree heterogeneity, or positive inter-layer correlation will reduce $β_c$, accelerating the outbreak of misinformation. Besides, increasing the degree heterogeneity or a more positive inter-layer correlation will both enlarge (reduce) $R(\infty)$ for small (large) values of $β$. Our systematic theoretical analysis results agree well with the numerical simulation results. Our proposed model and accurate theoretical analysis will serve as a useful framework to understand and predict the spreading dynamics of misinformation on multiplex networks, and thereby pave the way to address this serious issue.

preprint2019arXiv

Sending-or-Not-Sending with Independent Lasers: Secure Twin-Field Quantum Key Distribution Over 509 km

Twin field quantum key distribution promises high key rates at long distance to beat the rate distance limit. Here, applying the sending or not sending TF QKD protocol, we experimentally demonstrate a secure key distribution breaking the absolute key rate limit of repeaterless QKD over 509 km, 408 km ultra-low loss optical fibre and 350 km standard optical fibre. Two independent lasers are used as the source with remote frequency locking technique over 500 km fiber distance; Practical optical fibers are used as the optical path with appropriate noise filtering; And finite key effects are considered in the key rate analysis. The secure key rates obtained at different distances are more than 5 times higher than the conditional limit of repeaterless QKD, a bound value assuming the same detection loss in the comparison. The achieved secure key rate is also higher than that a traditional QKD protocol running with a perfect repeaterless QKD device and even if an infinite number of sent pulses. Our result shows that the protocol and technologies applied in this experiment enable TF QKD to achieve high secure key rate at long distribution distance, and hence practically useful for field implementation of intercity QKD.