Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
50works
0followers
37topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

50 published item(s)

preprint2026arXiv

A Hybrid Tucker-LSTM Tensor Network Model for SOC Prediction in Electric Vehicles

Accurate state of charge estimation is critical for the success of electric vehicle battery management strategies, but it is well known that conventional estimators suffer from two fundamental shortcomings: cumulative errors that grow over time and reliance on simplified battery models that do not reflect real world dynamics. Therefore, this paper presents a novel hybrid approach combining Tucker tensor decomposition with LSTM networks, using full - lifecycle EV field data for SOC prediction. The inputs are charge status, mileage, voltage, current, cell differentials, and temporal features. Tucker decomposition is skillfully used to reduce dimensionality while maintaining the temporal structure, hence allowing a direct, fair comparison with standard LSTM. The result is unequivocal: Tucker - LSTM outperforms the baseline on all metrics, with MSE dropping 70.5\% (from 21.07 to 6.22 ), MAE improving 48.7\% (from 3.37\% to 1.73\%), RMSE falling from 4.59\% to 2.49\%, and $R^2$ rising from 0.918 to 0.976. Since the experimental results demonstrably demonstrate that tensor decomposition compresses high-dimensional battery data very well without loss of predictive fidelity, this paper naturally opens up a new direction for tensor-based analytics in electric vehicle battery management.

preprint2026arXiv

AGoQ: Activation and Gradient Quantization for Memory-Efficient Distributed Training of LLMs

Quantization is a key method for reducing the GPU memory requirement of training large language models (LLMs). Yet, current approaches are ineffective for 4-bit activations and 8-bit gradients, which would easily cause slow convergence or accuracy loss. To address this, we introduce AGoQ, incorporating two new techniques: 1) a layer-aware activation quantization algorithm that allocates appropriate bit-widths for activations of various layers based on their types and pipeline stages to achieve near 4-bit activation storage, and 2) a gradient quantization algorithm that reduces memory usage and shortens communication time by employing 8-bit gradient storage and precision-preserving 8-bit All-Reduce communication. We conduct extensive experiments using different sizes of LLMs on two GPU clusters (up to 64 GPUs), and the experimental results show that our AGoQ reduces the memory by up to 52\% and achieves up to 1.34$\times$ improvement of training speed compared to state-of-the-art training systems Megatron-LM (w/ or w/o ZeRO), COAT and DeepSpeed with 8B to 32B LLaMA models, while achieving convergence loss on pretraining and comparable accuracy on downstream tasks with LLaMA architectures.

preprint2026arXiv

Are Rationales Necessary and Sufficient? Tuning LLMs for Explainable Misinformation Detection

The rapid spread of misinformation on social media platforms has become a formidable challenge. To mitigate its proliferation, Misinformation Detection (MD) has emerged as a critical research topic. Traditional MD approaches based on small models typically perform binary classification through a black-box process. Recently, the rise of Large Language Models (LLMs) has enabled explainable MD, where models generate rationales that explain their decisions, thereby enhancing transparency. Existing explainable MD methods primarily focus on crafting sophisticated prompts to elicit rationales from off-the-shelf LLMs. In this work, we propose a pipeline to fine-tune a dedicated LLM specifically for explainable MD. Our pipeline begins by collecting large-scale fact-checked articles, and then uses multiple strong LLMs to produce veracity predictions and rationales. To ensure high-quality training data, we leverage a filtering strategy that selects only the correct instances for fine-tuning. While this pipeline is intuitive and prevalent, our experiments reveal that naive filtering based solely on label correctness is insufficient in practice and suffers from two critical limitations: (1) Coarse-grained labels cause insufficient rationales: Rationales filtered solely based on binary labels are insufficient to adequately support their decisions; (2) Over-verification behavior causes unnecessary rationales: Stronger LLMs tend to exhibit over-verification behavior, producing excessively verbose and unnecessary rationales. To address these issues, we introduce LONSREX, a novel data synthesis pipeline to Locate Necessary and Sufficient Rationales for Explainable MD. Specifically, we propose a metric that quantifies the contribution of each verification step to the final prediction, thereby evaluating its necessity and sufficiency. Experimental results demonstrate the effectiveness of LONSREX.

preprint2026arXiv

Backtracking When It Strays: Mitigating Dual Exposure Biases in LLM Reasoning Distillation

Large language models (LLMs) have achieved remarkable success in complex reasoning tasks via long chain-of-thought (CoT), yet their immense computational overhead hinders real-world deployment. LLM reasoning distillation addresses this by transferring reasoning capabilities from formidable teacher models to compact student models. However, existing distillation paradigms face a fundamental dilemma. Typical off-policy distillation strictly utilizes teacher-generated golden trajectories, suffering from an exposure bias due to the mismatch between training distributions and student-generated inference contexts, which leads to error cascades in long CoT reasoning. To address this, on-policy distillation allows students to explore their own trajectories, but we demonstrate that it inherently introduces a reciprocal reversed exposure bias: the teacher model also struggles to provide positive guidance when conditioned on student-generated sub-optimal contexts. To resolve this dual exposure biases problem, we propose Monitoring Trajectories and Backtracking when it strays (MOTAB), a new LLM reasoning distillation pipeline. Specifically, MOTAB dynamically monitors the student's on-policy generation against an adaptive safety boundary. When the generation strays and exceeds this threshold, MOTAB backtracks to the last safe state and leverages teacher intervention to correct the course. This approach inherently tolerates minor student errors to mitigate exposure bias, while preventing sub-optimal contexts to circumvent reversed exposure bias. Extensive experiments on the LIMO-v2 and AceReason datasets demonstrate that MOTAB effectively alleviates the dual exposure biases, yielding a roughly 3% average performance improvement in reasoning tasks.

preprint2026arXiv

Decomposing the Basic Abilities of Large Language Models: Mitigating Cross-Task Interference in Multi-Task Instruct-Tuning

Recently, the prominent performance of large language models (LLMs) has been largely driven by multi-task instruct-tuning. Unfortunately, this training paradigm suffers from a key issue, named cross-task interference, due to conflicting gradients over shared parameters among different tasks. Some previous methods mitigate this issue by isolating task-specific parameters, e.g., task-specific neuron selection and mixture-of-experts. In this paper, we empirically reveal that the cross-task interference still exists for the existing solutions because of many parameters also shared by different tasks, and accordingly, we propose a novel solution, namely Basic Abilities Decomposition for multi-task Instruct-Tuning (BADIT). Specifically, we empirically find that certain parameters are consistently co-activated, and that co-activated parameters naturally organize into base groups. This motivates us to analogize that LLMs encode several orthogonal basic abilities, and that any task can be represented as a linear combination of these abilities. Accordingly, we propose BADIT that decomposes LLM parameters into orthogonal high-singular-value LoRA experts representing basic abilities, and dynamically enforces their orthogonality during training via spherical clustering of rank-1 components. We conduct extensive experiments on the SuperNI benchmark with 6 LLMs, and empirical results demonstrate that BADIT can outperform SOTA methods and mitigate the degree of cross-task interference.

preprint2026arXiv

Distribution-Aligned Sequence Distillation for Superior Long-CoT Reasoning

In this report, we introduce DASD-4B-Thinking, a lightweight yet highly capable, fully open-source reasoning model. It achieves SOTA performance among open-source models of comparable scale across challenging benchmarks in mathematics, scientific reasoning, and code generation -- even outperforming several larger models. We begin by critically reexamining a widely adopted distillation paradigm in the community: SFT on teacher-generated responses, also known as sequence-level distillation. Although a series of recent works following this scheme have demonstrated remarkable efficiency and strong empirical performance, they are primarily grounded in the SFT perspective. Consequently, these approaches focus predominantly on designing heuristic rules for SFT data filtering, while largely overlooking the core principle of distillation itself -- enabling the student model to learn the teacher's full output distribution so as to inherit its generalization capability. Specifically, we identify three critical limitations in current practice: i) Inadequate representation of the teacher's sequence-level distribution; ii) Misalignment between the teacher's output distribution and the student's learning capacity; and iii) Exposure bias arising from teacher-forced training versus autoregressive inference. In summary, these shortcomings reflect a systemic absence of explicit teacher-student interaction throughout the distillation process, leaving the essence of distillation underexploited. To address these issues, we propose several methodological innovations that collectively form an enhanced sequence-level distillation training pipeline. Remarkably, DASD-4B-Thinking obtains competitive results using only 448K training samples -- an order of magnitude fewer than those employed by most existing open-source efforts. To support community research, we publicly release our models and the training dataset.

preprint2026arXiv

ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking

Parking is a critical task for autonomous driving systems (ADS), with unique challenges in crowded parking slots and GPS-denied environments. However, existing works focus on 2D parking slot perception, mapping, and localization, 3D reconstruction remains underexplored, which is crucial for capturing complex spatial geometry in parking scenarios. Naively improving the visual quality of reconstructed parking scenes does not directly benefit autonomous parking, as the key entry point for parking is the slots perception module. To address these limitations, we curate the first benchmark named ParkRecon3D, specifically designed for parking scene reconstruction. It includes sensor data from four surround-view fisheye cameras with calibrated extrinsics and dense parking slot annotations. We then propose ParkGaussian, the first framework that integrates 3D Gaussian Splatting (3DGS) for parking scene reconstruction. To further improve the alignment between reconstruction and downstream parking slot detection, we introduce a slot-aware reconstruction strategy that leverages existing parking perception methods to enhance the synthesis quality of slot regions. Experiments on ParkRecon3D demonstrate that ParkGaussian achieves state-of-the-art reconstruction quality and better preserves perception consistency for downstream tasks. The code and dataset will be released at: https://github.com/wm-research/ParkGaussian

preprint2026arXiv

Pixel-Perfect Visual Geometry Estimation

Recovering clean and accurate geometry from images is essential for robotics and augmented reality. However, existing geometry foundation models still suffer severely from flying pixels and the loss of fine details. In this paper, we present pixel-perfect visual geometry models that can predict high-quality, flying-pixel-free point clouds by leveraging generative modeling in the pixel space. We first introduce Pixel-Perfect Depth (PPD), a monocular depth foundation model built upon pixel-space diffusion transformers (DiT). To address the high computational complexity associated with pixel-space diffusion, we propose two key designs: 1) Semantics-Prompted DiT, which incorporates semantic representations from vision foundation models to prompt the diffusion process, preserving global semantics while enhancing fine-grained visual details; and 2) Cascade DiT architecture that progressively increases the number of image tokens, improving both efficiency and accuracy. To further extend PPD to video (PPVD), we introduce a new Semantics-Consistent DiT, which extracts temporally consistent semantics from a multi-view geometry foundation model. We then perform reference-guided token propagation within the DiT to maintain temporal coherence with minimal computational and memory overhead. Our models achieve the best performance among all generative monocular and video depth estimation models and produce significantly cleaner point clouds than all other models.

preprint2026arXiv

PointForward: Feedforward Driving Reconstruction through Point-Aligned Representations

High-fidelity reconstruction of driving scenes is crucial for autonomous driving. While recent feedforward 3D Gaussian Splatting (3DGS) methods enable fast reconstruction, their per-pixel Gaussian prediction paradigm often suffers from multi-view inconsistency and layering artifacts. Moreover, existing methods often model dynamic instances via dense flow prediction, which lacks explicit cross-view correspondence and instance-level consistency. In this paper, we propose PointForward, a feedforward driving reconstruction framework through point-aligned representations. Unlike pixel-aligned methods, we initialize sparse 3D queries in world space and aggregate multi-view image information via spatial-temporal fusion onto these queries, enforcing explicit cross-view consistency in a single feedforward pass. To handle scene dynamics, we introduce scene graphs that explicitly organize moving instances during reconstruction. By leveraging 3D bounding boxes, our method enables instance-level motion propagation and temporally consistent dynamic representations. Extensive experiments demonstrate that PointForward achieves state-of-the-art performance on large-scale driving benchmarks. The code will be available upon the publication of the paper.

preprint2026arXiv

Prefix Teach, Suffix Fade: Local Teachability Collapse in Strong-to-Weak On-Policy Distillation

On-policy distillation (OPD) trains a student model on its own rollouts using dense feedback from a stronger teacher. Prior literature suggests that, provided teacher feedback is available, supervising the full sequence of response tokens should monotonically improve performance. However, we demonstrate that this assumption sometimes fails to hold in strong-to-weak OPD settings. While later segments of a generated trajectory may still exhibit a non-zero teacher-student advantage, they frequently lack the local contrast that makes dense feedback effective for prioritizing student learning. We term this failure mode local teachability collapse. The resulting principle is straightforward: supervision should concentrate on trajectory regions where the teacher's feedback remains discriminative, rather than uniformly covering the entire response. We operationalize this principle through a trajectory-specific release rule. This rule measures the teacher's margin over the student's top-$K$ candidate set, aggregates this margin across NLTK-tokenized sentence segments, and truncates dense OPD supervision upon detecting a BIC-style downward change point. Experimental results across strong-to-weak distillation tasks using the Qwen3 model family indicate that this release rule consistently outperforms standard full-trajectory OPD across five in-domain benchmarks at various student scales. Furthermore, compared to baseline distillation methods, our approach better preserves model capabilities on out-of-domain task. These results suggest that effective strong-to-weak OPD requires evaluating not only the availability of teacher guidance but also its local utility, ensuring that the generated feedback remains teachable.

preprint2026arXiv

R$^3$L: Reasoning 3D Layouts from Relative Spatial Relations

Relative spatial relations provide a compact representation of spatial structure and are fundamental to relative spatial reasoning in 3D layout generation. Recent works leverage Multimodal Large Language Models (MLLMs) to infer such relations, but the inferred relations are often unreliable and are typically handled with post-hoc heuristics. In this paper, we propose R$^3$L, a general framework that improves the reliability and consistency of relative spatial reasoning for 3D layout generation. Our key motivation is that multi-hop reasoning requires repeated reference-frame transformations, which accumulate errors in inferred relations and lead to semantic and metric drift. To mitigate this, we propose invariant spatial decomposition to break coupled relation chains, and consistent spatial imagination to promote self-consistency through an imagine-and-revise loop. We further introduce supportive spatial optimization to ease pose optimization via global-to-local coordinate re-parameterization. Extensive experiments across diverse scene types and instructions demonstrate that R$^3$L produces more physically feasible and semantically consistent layouts. Notably, our analysis shows that resolving frame-induced inconsistencies is crucial for reliable multi-hop relative spatial reasoning. The code is available at https://github.com/Neal2020GitHub/R3L.

preprint2026arXiv

VILTA: A VLM-in-the-Loop Adversary for Enhancing Driving Policy Robustness

The safe deployment of autonomous driving (AD) systems is fundamentally hindered by the long-tail problem, where rare yet critical driving scenarios are severely underrepresented in real-world data. Existing solutions including safety-critical scenario generation and closed-loop learning often rely on rule-based heuristics, resampling methods and generative models learned from offline datasets, limiting their ability to produce diverse and novel challenges. While recent works leverage Vision Language Models (VLMs) to produce scene descriptions that guide a separate, downstream model in generating hazardous trajectories for agents, such two-stage framework constrains the generative potential of VLMs, as the diversity of the final trajectories is ultimately limited by the generalization ceiling of the downstream algorithm. To overcome these limitations, we introduce VILTA (VLM-In-the-Loop Trajectory Adversary), a novel framework that integrates a VLM into the closed-loop training of AD agents. Unlike prior works, VILTA actively participates in the training loop by comprehending the dynamic driving environment and strategically generating challenging scenarios through direct, fine-grained editing of surrounding agents' future trajectories. This direct-editing approach fully leverages the VLM's powerful generalization capabilities to create a diverse curriculum of plausible yet challenging scenarios that extend beyond the scope of traditional methods. We demonstrate that our approach substantially enhances the safety and robustness of the resulting AD policy, particularly in its ability to navigate critical long-tail events.

preprint2026arXiv

Xiaomi EV World Model: A Joint World Model Integrating Reconstruction and Generation for Autonomous Driving

This report presents a unified technical system addressing the two core capabilities of world models for autonomous driving: world representation and world generation. For world representation, we propose WorldRec, a feed-forward reconstruction architecture driven by sparse scene queries. WorldRec initializes structured queries in 3D space, leveraging them to aggregate cross-view, cross-temporal features, thereby naturally enforcing spatial consistency across frames and yielding compact yet high-fidelity 3D Gaussian scene representations. For world generation, we propose WorldGen, a two-stage training framework of bidirectional pretraining followed by causal fine-tuning through three progressive stages (Teacher Forcing, ODE distillation, and DMD), enabling high-quality online causal video generation in as few as 4 denoising steps. Building on both modules, we further introduce the JWM, which deeply integrates WorldRec and WorldGen to achieve synergistic gains in generation stability, cross-frame consistency, and visual fidelity, providing a solid foundation for closed-loop simulation, data synthesis, and end-to-end training in autonomous driving.

preprint2025arXiv

Mirage: One-Step Video Diffusion for Photorealistic and Coherent Asset Editing in Driving Scenes

Vision-centric autonomous driving systems rely on diverse and scalable training data to achieve robust performance. While video object editing offers a promising path for data augmentation, existing methods often struggle to maintain both high visual fidelity and temporal coherence. In this work, we propose \textbf{Mirage}, a one-step video diffusion model for photorealistic and coherent asset editing in driving scenes. Mirage builds upon a text-to-video diffusion prior to ensure temporal consistency across frames. However, 3D causal variational autoencoders often suffer from degraded spatial fidelity due to compression, and directly passing 3D encoder features to decoder layers breaks temporal causality. To address this, we inject temporally agnostic latents from a pretrained 2D encoder into the 3D decoder to restore detail while preserving causal structures. Furthermore, because scene objects and inserted assets are optimized under different objectives, their Gaussians exhibit a distribution mismatch that leads to pose misalignment. To mitigate this, we introduce a two-stage data alignment strategy combining coarse 3D alignment and fine 2D refinement, thereby improving alignment and providing cleaner supervision. Extensive experiments demonstrate that Mirage achieves high realism and temporal consistency across diverse editing scenarios. Beyond asset editing, Mirage can also generalize to other video-to-video translation tasks, serving as a reliable baseline for future research. Our code is available at https://github.com/wm-research/mirage.

preprint2022arXiv

A Contrastive Cross-Channel Data Augmentation Framework for Aspect-based Sentiment Analysis

Aspect-based sentiment analysis (ABSA) is a fine-grained sentiment analysis task, which focuses on detecting the sentiment polarity towards the aspect in a sentence. However, it is always sensitive to the multi-aspect challenge, where features of multiple aspects in a sentence will affect each other. To mitigate this issue, we design a novel training framework, called Contrastive Cross-Channel Data Augmentation (C3 DA), which leverages an in-domain generator to construct more multi-aspect samples and then boosts the robustness of ABSA models via contrastive learning on these generated data. In practice, given a generative pretrained language model and some limited ABSA labeled data, we first employ some parameter-efficient approaches to perform the in-domain fine-tuning. Then, the obtained in-domain generator is used to generate the synthetic sentences from two channels, i.e., Aspect Augmentation Channel and Polarity Augmentation Channel, which generate the sentence condition on a given aspect and polarity respectively. Specifically, our C3 DA performs the sentence generation in a cross-channel manner to obtain more sentences, and proposes an Entropy-Minimization Filter to filter low-quality generated samples. Extensive experiments show that our C3 DA can outperform those baselines without any augmentations by about 1% on accuracy and Macro- F1. Code and data are released in https://github.com/wangbing1416/C3DA.

preprint2022arXiv

AutoPlace: Robust Place Recognition with Single-chip Automotive Radar

This paper presents a novel place recognition approach to autonomous vehicles by using low-cost, single-chip automotive radar. Aimed at improving recognition robustness and fully exploiting the rich information provided by this emerging automotive radar, our approach follows a principled pipeline that comprises (1) dynamic points removal from instant Doppler measurement, (2) spatial-temporal feature embedding on radar point clouds, and (3) retrieved candidates refinement from Radar Cross Section measurement. Extensive experimental results on the public nuScenes dataset demonstrate that existing visual/LiDAR/spinning radar place recognition approaches are less suitable for single-chip automotive radar. In contrast, our purpose-built approach for automotive radar consistently outperforms a variety of baseline methods via a comprehensive set of metrics, providing insights into the efficacy when used in a realistic system.

preprint2022arXiv

Benchmarking the Robustness of LiDAR-Camera Fusion for 3D Object Detection

There are two critical sensors for 3D perception in autonomous driving, the camera and the LiDAR. The camera provides rich semantic information such as color, texture, and the LiDAR reflects the 3D shape and locations of surrounding objects. People discover that fusing these two modalities can significantly boost the performance of 3D perception models as each modality has complementary information to the other. However, we observe that current datasets are captured from expensive vehicles that are explicitly designed for data collection purposes, and cannot truly reflect the realistic data distribution due to various reasons. To this end, we collect a series of real-world cases with noisy data distribution, and systematically formulate a robustness benchmark toolkit, that simulates these cases on any clean autonomous driving datasets. We showcase the effectiveness of our toolkit by establishing the robustness benchmark on two widely-adopted autonomous driving datasets, nuScenes and Waymo, then, to the best of our knowledge, holistically benchmark the state-of-the-art fusion methods for the first time. We observe that: i) most fusion methods, when solely developed on these data, tend to fail inevitably when there is a disruption to the LiDAR input; ii) the improvement of the camera input is significantly inferior to the LiDAR one. We further propose an efficient robust training strategy to improve the robustness of the current fusion method. The benchmark and code are available at https://github.com/kcyu2014/lidar-camera-robust-benchmark

preprint2022arXiv

Directional transport of active particles confined in 3D smooth corrugated channel

The transport phenomenon of active particles confined in 3D(three dimensional) corrugated confined channel with Gaussian noises is investigated. Large noise intensity perpendicular to the symmetry axis is good for the diffusion and current along the axis. The generalized resonance transport phenomenon appears with increasing noise intensity parallel to the symmetry axis. Large noise intensity parallel to axis can suppress the diffusion. The diffusion coefficient has a maximum with increasing polar angle noise intensity. There exits an optimal value of parameter f that result in maximum movement speed. Large f is good for the diffusion. Transport reverse phenomenon appears with increasing channel parameter ε and Δ. Too large or too small values of ε and Δ can suppress the diffusion.

preprint2022arXiv

Exploring Inter-Channel Correlation for Diversity-preserved KnowledgeDistillation

Knowledge Distillation has shown very promising abil-ity in transferring learned representation from the largermodel (teacher) to the smaller one (student).Despitemany efforts, prior methods ignore the important role ofretaining inter-channel correlation of features, leading tothe lack of capturing intrinsic distribution of the featurespace and sufficient diversity properties of features in theteacher network.To solve the issue, we propose thenovel Inter-Channel Correlation for Knowledge Distillation(ICKD), with which the diversity and homology of the fea-ture space of the student network can align with that ofthe teacher network. The correlation between these twochannels is interpreted as diversity if they are irrelevantto each other, otherwise homology. Then the student isrequired to mimic the correlation within its own embed-ding space. In addition, we introduce the grid-level inter-channel correlation, making it capable of dense predictiontasks. Extensive experiments on two vision tasks, includ-ing ImageNet classification and Pascal VOC segmentation,demonstrate the superiority of our ICKD, which consis-tently outperforms many existing methods, advancing thestate-of-the-art in the fields of Knowledge Distillation. Toour knowledge, we are the first method based on knowl-edge distillation boosts ResNet18 beyond 72% Top-1 ac-curacy on ImageNet classification. Code is available at:https://github.com/ADLab-AutoDrive/ICKD.

preprint2022arXiv

Formation and Immediate Deformation of a Small Filament Through Intermittent Magnetic Interactions

It is generally believed that filament formation involves a process of the accumulation of magnetic energy. However, in this paper we discuss the idea that filaments will not erupt and will only deform when the stored magnetic energy is released gradually. Combining high-quality observations from Solar Dynamics Observatory and other instruments, we present the formation and immediate deformation of a small filament (F1) in the active region (AR) 12760 on 28-30 April 2020. Before the filament formation, three successive dipoles quickly emerged with separation motions in the center of AR 12760. Due to the magnetic interaction between magnetic dipoles and pre-existing positive polarities, coronal brightenings consequently appeared in the overlying atmosphere. Subsequently, because of the continuous cancellation of magnetic flux that happened around the adjacent ends of F1 and another nearby filament (F2), the magnetic reconections occurred intermittently occurred between F1 and F2. Finally, F1 lessened in the shear, and F2 became shorter. All the results show that the formation of F1 was closely associated with intermittent interactions between the sequence of emerging dipoles and pre-existing magnetic polarities, and the immediate deformation of F1 was intimately related to intermittent interactions between F1 and F2. We also suggest that the intermittent magnetic interactions driven by the continuous magnetic activities (magnetic-flux emergence, cancellation, and convergence) play an important role in the formation and deformation of filaments.

preprint2022arXiv

Learning Selective Sensor Fusion for States Estimation

Autonomous vehicles and mobile robotic systems are typically equipped with multiple sensors to provide redundancy. By integrating the observations from different sensors, these mobile agents are able to perceive the environment and estimate system states, e.g. locations and orientations. Although deep learning approaches for multimodal odometry estimation and localization have gained traction, they rarely focus on the issue of robust sensor fusion - a necessary consideration to deal with noisy or incomplete sensor observations in the real world. Moreover, current deep odometry models suffer from a lack of interpretability. To this extent, we propose SelectFusion, an end-to-end selective sensor fusion module which can be applied to useful pairs of sensor modalities such as monocular images and inertial measurements, depth images and LIDAR point clouds. Our model is a uniform framework that is not restricted to specific modality or task. During prediction, the network is able to assess the reliability of the latent features from different sensor modalities and estimate trajectory both at scale and global pose. In particular, we propose two fusion modules - a deterministic soft fusion and a stochastic hard fusion, and offer a comprehensive study of the new strategies compared to trivial direct fusion. We extensively evaluate all fusion strategies in both public datasets and on progressively degraded datasets that present synthetic occlusions, noisy and missing data and time misalignment between sensors, and we investigate the effectiveness of the different fusion strategies in attending the most reliable features, which in itself, provides insights into the operation of the various models.

preprint2022arXiv

No Pain, Big Gain: Classify Dynamic Point Cloud Sequences with Static Models by Fitting Feature-level Space-time Surfaces

Scene flow is a powerful tool for capturing the motion field of 3D point clouds. However, it is difficult to directly apply flow-based models to dynamic point cloud classification since the unstructured points make it hard or even impossible to efficiently and effectively trace point-wise correspondences. To capture 3D motions without explicitly tracking correspondences, we propose a kinematics-inspired neural network (Kinet) by generalizing the kinematic concept of ST-surfaces to the feature space. By unrolling the normal solver of ST-surfaces in the feature space, Kinet implicitly encodes feature-level dynamics and gains advantages from the use of mature backbones for static point cloud processing. With only minor changes in network structures and low computing overhead, it is painless to jointly train and deploy our framework with a given static model. Experiments on NvGesture, SHREC'17, MSRAction-3D, and NTU-RGBD demonstrate its efficacy in performance, efficiency in both the number of parameters and computational complexity, as well as its versatility to various static backbones. Noticeably, Kinet achieves the accuracy of 93.27% on MSRAction-3D with only 3.20M parameters and 10.35G FLOPS.

preprint2022arXiv

Numerical Study on Droplet Evaporation and Propagation Stability in Normal-temperature Two-phase Rotating Detonation System

A numerical study is carried out on the droplet-laden two-phase rotating detonation wave (RDW) of kerosene/oxygen-enriched air at normal temperature. Two types of combustors without and with the inlet mixing section (IMS) are constructed to illustrate the effect of IMS on the combustion characteristics of two-phase RDW. The important role of the preheating zone in the IMS after the back-propagation shock on the droplet evaporation is analyzed. The parameter sensitivity of RDW propagation stability to the average droplet diameter d0 is further discussed. Results show that the droplets mainly evaporate after the detonation front in the combustor without IMS, and the reaction heat release is completed in a short distance, which propels continuous propagation of the detonation wave. When d0 gradually increases, the droplet evaporation distance increases, and the coupling between the incident shock and reaction is continuously weakened, finally resulting in the detonation quenching. In the combustor with IMS, a preheating zone is induced close to the contact surface by the back-propagation shock of the RDW. A large number of droplets evaporate in this zone, and generate sufficient mixture of fuel vapor and oxidizer in front of detonation wave to maintain the detonation propagation. Priority to the combustor without IMS, the droplet evaporation relies less on the inlet high-temperature airflow with the assistance of preheating zone, and thus the wave propagation stability can be enhanced and the RDW can sustain for a wider range of d0. The present analysis provides a new understanding of two-phase rotating detonation systems.

preprint2022arXiv

On the pressure gain of stable flow systems with variable cross-section area

For the clarification of total pressure gain performance of rotating detonation propulsion systems, the extended Hugoniot curve is proposed and discussed for the stable flow systems with variable cross-section area (SFSVA). The dimensionless pressure integral (θ) along the system walls is found to have critical impact on the pressure gain of the SFSVA with given inlet Mach number (M0) and heat release (q). The key to obtain positive pressure gain of a SFSVA is to achieve the matching θ, M0 and q.

preprint2022arXiv

RangeUDF: Semantic Surface Reconstruction from 3D Point Clouds

We present RangeUDF, a new implicit representation based framework to recover the geometry and semantics of continuous 3D scene surfaces from point clouds. Unlike occupancy fields or signed distance fields which can only model closed 3D surfaces, our approach is not restricted to any type of topology. Being different from the existing unsigned distance fields, our framework does not suffer from any surface ambiguity. In addition, our RangeUDF can jointly estimate precise semantics for continuous surfaces. The key to our approach is a range-aware unsigned distance function together with a surface-oriented semantic segmentation module. Extensive experiments show that RangeUDF clearly surpasses state-of-the-art approaches for surface reconstruction on four point cloud datasets. Moreover, RangeUDF demonstrates superior generalization capability across multiple unseen datasets, which is nearly impossible for all existing approaches.

preprint2022arXiv

Sparse Attentive Memory Network for Click-through Rate Prediction with Long Sequences

Sequential recommendation predicts users' next behaviors with their historical interactions. Recommending with longer sequences improves recommendation accuracy and increases the degree of personalization. As sequences get longer, existing works have not yet addressed the following two main challenges. Firstly, modeling long-range intra-sequence dependency is difficult with increasing sequence lengths. Secondly, it requires efficient memory and computational speeds. In this paper, we propose a Sparse Attentive Memory (SAM) network for long sequential user behavior modeling. SAM supports efficient training and real-time inference for user behavior sequences with lengths on the scale of thousands. In SAM, we model the target item as the query and the long sequence as the knowledge database, where the former continuously elicits relevant information from the latter. SAM simultaneously models target-sequence dependencies and long-range intra-sequence dependencies with O(L) complexity and O(1) number of sequential updates, which can only be achieved by the self-attention mechanism with O(L^2) complexity. Extensive empirical results demonstrate that our proposed solution is effective not only in long user behavior modeling but also on short sequences modeling. Implemented on sequences of length 1000, SAM is successfully deployed on one of the largest international E-commerce platforms. This inference time is within 30ms, with a substantial 7.30% click-through rate improvement for the online A/B test. To the best of our knowledge, it is the first end-to-end long user sequence modeling framework that models intra-sequence and target-sequence dependencies with the aforementioned degree of efficiency and successfully deployed on a large-scale real-time industrial recommender system.

preprint2022arXiv

Strain effects on topological and valley properties of Janus monolayer $\mathrm{VSiGeN_4}$

Strain is an effective method to tune the electronic properties of two-dimension (2D) materials, and can induce novel phase transition. Recently, 2D $\mathrm{MA_2Z_4}$ family materials are of interest because of their emerging topological, magnetic and superconducting properties. Here, we investigate the impact of strain effects ($a/a_0$:0.96$\sim$1.04) on the physical properties of Janus monolayer $\mathrm{VSiGeN_4}$ as a derivative of $\mathrm{VSi_2N_4}$ or $\mathrm{VGe_2N_4}$, which possesses dynamical, mechanical and thermal stabilities. For out-of-plane magnetic anisotropy, with increasing strain, $\mathrm{VSiGeN_4}$ undergoes transition between ferrovalley semiconductor (FVS), half-valley-metal (HVM), valley-polarized quantum anomalous Hall insulator (VQAHI), HVM and FVS. These imply twice topological phase transitions, which are related with sign-reversible Berry curvature and band inversion between $d_{xy}$+$d_{x^2-y^2}$ and $d_{z^2}$ orbitals for K or -K valley. The band inversion also leads to transformation of valley splitting strength between valence and conduction bands. However, for in-plane magnetic anisotropy, no special quantum anomalous Hall (QAH) states and valley polarization exist within the considered strain range. The actual magnetic anisotropy energy (MAE) shows no special QAH and HVM states in monolayer $\mathrm{VSiGeN_4}$. Fortunately, these can be easily achieved by external magnetic field, which adjusts the easy magnetization axis of $\mathrm{VSiGeN_4}$ from in-plane one to out-of-plane one. Our findings shed light on how strain can be employed to engineer the electronic states of $\mathrm{VSiGeN_4}$, which may open new perspectives for multifunctional quantum devices in valleytronics and spintronics.

preprint2022arXiv

Strain-driven valley states and phase transitions in Janus VSiGeN4 monolayer

The interplay between topology and valley degree of freedom has attracted much interest because it can realize new phenomena and applications. Here, based on first-principles calculations, we demonstrate intrinsically valley-polarized quantum anomalous Hall effect in monolayer ferrovalley material: Janus VSiGeN4, of which the edge states are chiral-spin-valley locking. Besides, a small tensile or compressive strain can drive phase transition in the material from valley-polarized quantum anomalous Hall state to half-valley-metal state. With the increase of the strain, the material turns into ferrovalley semiconductor with valley anomalous Hall effect. The origin of phase transition is sequent band inversion of V d orbital at K valley. Moreover, we find that phase transition causes the sign reversal of Berry curvature and induces different polarized light absorption in different valley states. Our work provides an ideal material platform for practical applications and experimental exploration of the interplay between topology, spintronics, and valleytronics.

preprint2022arXiv

Twin extreme ultraviolet waves in the solar corona

Solar extreme ultraviolet (EUV) waves are spectacular propagating disturbances with EUV enhancements in annular shapes in the solar corona. These EUV waves carry critical information about the coronal magnetised plasma that can shed light on the elusive physical parameters (e.g. the magnetic field strength) by global solar coronal magneto-seismology. EUV waves are closely associated with a wide range of solar atmospheric eruptions, from violent flares and coronal mass ejections (CMEs) to less energetic plasma jets or mini-filament eruptions. However, the physical nature and driving mechanism of EUV waves is still controversial. Here, we report the unique discovery of twin EUV waves (TEWs) that were formed in a single eruption with observations from two different perspectives. In all earlier studies, a single eruption was associated at most with a single EUV wave. The newly found TEWs urge to re-visit our theoretical understanding about the underlying formation mechanism(s) of coronal EUV waves. Two distinct scenarios of TEWs were found. In the first scenario, the two waves were separately associated with a filament eruption and a precursor jet, while in another scenario the two waves were successively associated with a filament eruption. Hence, we label these distinguished scenarios as "fraternal TEWs" and "identical TEWs", respectively. Further, we also suggest that impulsive lateral expansions of two distinct groups of coronal loops are critical to the formation of TEWs in a single eruption.

preprint2021arXiv

Photonic Floquet time crystals

The public and scientists constantly have different perspectives. While on a time crystal, they stand in line and ask: What is a time crystal? Show me a material that is spontaneously crystalline in time? This study synthesizes a photonic material of Floquet time crystals and experimentally observes its indicative period-2T beating. We explicitly reconstruct a discrete time-crystalline ground state and reveal using an appropriately-designed photonic Floquet simulator the rigid period-doubling as a signature of the spontaneous breakage of the discrete time-translational symmetry. Unlike the result of the exquisite many-body interaction, the photonic time crystal is derived from a single-particle topological phase that can be extensively accessed by many pertinent nonequilibrium and periodically-driven platforms. Our observation will drive theoretical and technological interests toward condensed matter physics and topological photonics, and demystify time crystals for the non-scientific public.

preprint2020arXiv

A Survey on Deep Learning for Localization and Mapping: Towards the Age of Spatial Machine Intelligence

Deep learning based localization and mapping has recently attracted significant attention. Instead of creating hand-designed algorithms through exploitation of physical models or geometric theories, deep learning based solutions provide an alternative to solve the problem in a data-driven way. Benefiting from ever-increasing volumes of data and computational power, these methods are fast evolving into a new area that offers accurate and robust systems to track motion and estimate scenes and their structure for real-world applications. In this work, we provide a comprehensive survey, and propose a new taxonomy for localization and mapping using deep learning. We also discuss the limitations of current models, and indicate possible future directions. A wide range of topics are covered, from learning odometry estimation, mapping, to global localization and simultaneous localization and mapping (SLAM). We revisit the problem of perceiving self-motion and scene understanding with on-board sensors, and show how to solve it by integrating these modules into a prospective spatial machine intelligence system (SMIS). It is our hope that this work can connect emerging works from robotics, computer vision and machine learning communities, and serve as a guide for future researchers to apply deep learning to tackle localization and mapping problems.

preprint2020arXiv

An Extreme Ultraviolet Wave Associated with A Solar Filament Activation

Extreme ultraviolet (EUV) waves are impressive coronal propagating disturbances. They are closely associated with various eruptions, and can used for the global coronal seismology and the acceleration of solar energetic particles. Hence, the study of EUV waves plays an important role in solar eruptions and Space Weather. Here we present an EUV wave associated with a filament activation that did not evolve into any eruption. Due to the continuous magnetic flux emergence and cancellation around its one end, the filament rose with untwisting motion, and the filament mass flowed towards another end along the rising fields. Intriguingly, following the filament activation, an EUV wave formed with a fast constant speed ($\sim$500 km s$^{-1}$) ahead of the mass flow, and the overlying coronal loops expanded both in lateral and radial directions. Excluding the possibility of a remote flare and an absent coronal mass ejection, we suggest that the EUV wave was only closely associated with the filament activation. Furthermore, their intimate spacial and temporal relationship indicates that the EUV wave was likely directly triggered by the lateral expansion of overlying loops. We propose that the EUV wave can be interpreted as linear fast-mode wave, and the most vital key for the successful generation of the EUV wave is the impulsive early-phase lateral expansion of overlying loops that was driven by the activated filament mass flow without any eruption.

preprint2020arXiv

Efficient Routing for Quantum Key Distribution Networks

As quantum key distribution becomes increasingly practical, questions of how to effectively employ it in large-scale networks and over large distances becomes increasingly important. To that end, in this work, we model the performance of the E91 entanglement based QKD protocol when operating in a network consisting of both quantum repeaters and trusted nodes. We propose a number of routing protocols for this network and compare their performance under different usage scenarios. Through our modeling, we investigate optimal placement and number of trusted nodes versus repeaters depending on device performance (e.g., quality of the repeater's measurement devices). Along the way we discover interesting lessons determining what are the important physical aspects to improve for upcoming quantum networks in order to improve secure communication rates.

preprint2020arXiv

Monolithic photonic chips for multi-channel frequency mixers and single photon detectors

Lithium niobate photonic chip could realize diverse optical engineering for various applications benefiting from its excellent optical performances. In this letter, we demonstrate monolithic photonic chips for multi-channel sum-frequency conversion based on reverse-proton-exchange periodically poled lithium niobate waveguides, with the different channels showing uniform and excellent conversion efficiencies. To obtain a robust device and provide a convenient interface for applications, the integrated chip is fiber coupled with two fiber arrays. The packaged chip then forms the core of a multi-channel up conversion single photon detector. In each channel the input signal interacts with a 1950-nm single frequency pump laser and the sum frequency output is spectrally filtered and detected by a silicon avalanche photodiode. Average detection efficiency (DE) of 23.2 % and noise count rate (NCR) of 557 counts per second (cps) are achieved, with a standard deviation of 2.73 % and 48 cps over the 30 channels, as well as optical isolation (OI) between nearby channels of more than 71 dB, which are excellent for the extensive applications of monolithic photonic chips in fields including deep space laser communication, high-rate quantum key distribution and single-photon imaging.

preprint2020arXiv

Non-destructive testing and evaluation of composite materials/structures: A state-of-the-art review

Composite materials/structures are advancing in product efficiency, cost-effectiveness and the development of superior specific properties. There are increasing demands in their applications to load-carrying structures in aerospace, wind turbines, transportation, and medical equipment, etc. Thus robust and reliable non-destructive testing (NDT) of composites is essential to reduce safety concerns and maintenance costs. There have been various NDT methods built upon different principles for quality assurance during the whole lifecycle of a composite product. This paper reviews the most established NDT techniques for detection and evaluation of defects/damage evolution in composites. These include acoustic emission, ultrasonic testing, infrared thermography, terahertz testing, shearography, digital image correlation, as well as X-ray and neutron imaging. For each NDT technique, we cover a brief historical background, principles, standard practices, equipment and facilities used for composite research. We also compare and discuss their benefits and limitations, and further summarise their capabilities and applications to composite structures. Each NDT technique has its own potential and rarely achieves a full-scale diagnosis of structural integrity. Future development of NDT techniques for composites will be directed towards intelligent and automated inspection systems with high accuracy and efficient data processing capabilities.

preprint2020arXiv

Observation of pseudogap in SnSe2 atomic layers grown on graphite

Superconducting metal dichalcogenides (MDCs) present several similarities to the other layered superconductors like cuprates. The superconductivity in atomically thin MDCs has been demonstrated by recent experiments, however, the investigation of the superconductivity intertwined with other orders are scarce. Investigating the pseudogap in atomic layers of MDCs may help to understand the superconducting mechanism for these true two-dimensional (2D) superconducting systems. Herein we report a pseudogap opening in the tunneling spectra of thin layers of SnSe2 epitaxially grown on highly oriented pyrolytic graphite (HOPG) with scanning tunneling microscopy/spectroscopy (STM/STS). A significant V-shaped pseudogap was observed to open near the Fermi level (EF) in the STS. And at elevated temperatures, the gap gradually evolves to a shallow dip. Our experimental observations provide direct evidence of a pseudogap state in the electron-doped SnSe2 atomic layers on the HOPG surface, which may stimulate further exploration of the mechanism of superconductivity at 2D limit in MDCs.

preprint2020arXiv

On the regular-convexity of Ricci shrinker limit spaces

In this paper, we study the structure of the pointed-Gromov-Hausdorff limits of sequences of Ricci shrinkers. We define a regular-singular decomposition following the work of Cheeger-Colding for manifolds with a uniform Ricci curvature lower bound, and prove that the regular part of any Ricci shrinker limit space is convex, inspired by Colding-Naber's original idea of parabolic smoothing of the distance functions.

preprint2020arXiv

Planar Turán Number of intersecting triangles

The planar Turán number of a given graph $H$, denoted by $ex_{\mathcal{P}}(n,H)$, is the maximum number of edges over all planar graphs on $n$ vertices that do not contain a copy of $H$ as a subgraph. Let $H_k$ be a friendship graph, which is obtained from $k$ triangles by sharing a common vertex. In this paper, we obtain sharp bounds of $ex_{\mathcal{P}}(n,H_k)$ and $ex_{\mathcal{P}}(n,K_1+P_{k+1})$ for $k\ge2$, which improves the results of Lan and Shi in Electron. J. Combin. 26 (2) (2019), \#P2.11.

preprint2020arXiv

Random walks in time-varying networks with memory

Random walks process on networks plays a fundamental role in understanding the importance of nodes and the similarity of them, which has been widely applied in PageRank, information retrieval, and community detection, etc. Individual's memory has been proved to be important to affect network evolution and dynamical processes unfolding on the network. In this manuscript, we study the random-walk process on extended activity driven network model by taking account of individual's memory. We analyze how individual's memory affects random-walk process unfolding on the network when the timescales of the processes of the random walk and the network evolution are comparable. Under the constraints of long-time evolution, we derive analytical solutions for the distribution of stationary state Wa and the mean first-passage time (MFPT) of the random-walk process. We find that, compared with the memoryless activity-driven model, individual's memory enhances the fluctuation of degree distribution, which reduces the capability of gathering walkers for nodes, especially with large activity and delays the mean first-passage time. The results on real networks also support the theoretical analysis with artificial networks.

preprint2020arXiv

Relational Deep Reinforcement Learning for Routing in Wireless Networks

While routing in wireless networks has been studied extensively, existing protocols are typically designed for a specific set of network conditions and so cannot accommodate any drastic changes in those conditions. For instance, protocols designed for connected networks cannot be easily applied to disconnected networks. In this paper, we develop a distributed routing strategy based on deep reinforcement learning that generalizes to diverse traffic patterns, congestion levels, network connectivity, and link dynamics. We make the following key innovations in our design: (i) the use of relational features as inputs to the deep neural network approximating the decision space, which enables our algorithm to generalize to diverse network conditions, (ii) the use of packet-centric decisions to transform the routing problem into an episodic task by viewing packets, rather than wireless devices, as reinforcement learning agents, which provides a natural way to propagate and model rewards accurately during learning, and (iii) the use of extended-time actions to model the time spent by a packet waiting in a queue, which reduces the amount of training data needed and allows the learning algorithm to converge more quickly. We evaluate our routing algorithm using a packet-level simulator and show that the policy our algorithm learns during training is able to generalize to larger and more congested networks, different topologies, and diverse link dynamics. Our algorithm outperforms shortest path and backpressure routing with respect to packets delivered and delay per packet.

preprint2020arXiv

Representations and fusion rules for the orbifold vertex operator algebras $L_{\widehat{\frak{sl}_2}}(k,0)^{\mathbb{Z}_3}$

For the cyclic group $\mathbb{Z}_3$ and positive integer $k$, we study the representations of the orbifold vertex operator algebra $L_{\widehat{\mathfrak{sl}_2}}(k,0)^{\mathbb{Z}_3}$. All the irreducible modules for $L_{\widehat{\mathfrak{sl}_2}}(k,0)^{\mathbb{Z}_3}$ are classified and constructed explicitly. Quantum dimensions and fusion rules for the orbifold vertex operator algebra $L_{\widehat{\mathfrak{sl}_2}}(k,0)^{\mathbb{Z}_3}$ are completely determined.

preprint2020arXiv

Ricci flow smoothing for locally collapsing manifolds

We show that for certain locally collapsing initial data with Ricci curvature bounded below, one could start the Ricci flow for a definite period of time. This provides a Ricci flow smoothing tool, with which we find topological conditions that detect the collapsing infranil fiber bundles over controlled Riemannian orbifolds among those locally collapsing regions with Ricci curvature bounded below. In the appendix, we also provide a local distance distortion estimate for certain Ricci flows with collapsing initial data.

preprint2020arXiv

See Through Smoke: Robust Indoor Mapping with Low-cost mmWave Radar

This paper presents the design, implementation and evaluation of milliMap, a single-chip millimetre wave (mmWave) radar based indoor mapping system targetted towards low-visibility environments to assist in emergency response. A unique feature of milliMap is that it only leverages a low-cost, off-the-shelf mmWave radar, but can reconstruct a dense grid map with accuracy comparable to lidar, as well as providing semantic annotations of objects on the map. milliMap makes two key technical contributions. First, it autonomously overcomes the sparsity and multi-path noise of mmWave signals by combining cross-modal supervision from a co-located lidar during training and the strong geometric priors of indoor spaces. Second, it takes the spectral response of mmWave reflections as features to robustly identify different types of objects e.g. doors, walls etc. Extensive experiments in different indoor environments show that milliMap can achieve a map reconstruction error less than 0.2m and classify key semantics with an accuracy around 90%, whilst operating through dense smoke.

preprint2020arXiv

The initiation of a solar streamer blowout coronal mass ejection arising from the streamer flank

Streamer blowout (SBO) coronal mass ejections (CMEs) represent a particular class of CMEs that are characterized by a gradual swelling of the overlying streamer and a slow CME containing a flux-rope structure. SBO CMEs arising from the streamer flank fall into a special category of SBO CMEs involving three lower arches under the higher streamer arcade. However, the initiation mechanism for this special category of SBO CMEs remains elusive, due to the observational limitations. Here we report critical observations of a SBO CME associated with the eruption of a polar crown filament that originated from the streamer flank. The filament slowly rose toward the solar equator with the writhing motion, and underwent a sudden acceleration before its eruption. Interestingly, during the rising, the filament fields experienced gradual external reconnections, which is evidenced by the dip-shaped bottom of the enveloping flux-rope structure changing from a smooth concave, the slow inflows ($\sim$1.8 km s$^{-1}$) from both the filament fields and the coronal loops beneath, and the persistent brightenings around the interface between the filament fields and the coronal loops beneath. The newly formed lower loops at the filament source and the Y-shaped structure in the stretched tail fields indicate the internal reconnections for the filament eruption. The clear signatures of the external and internal reconnections shed light on the initiation mechanisms of SBO CMEs.

preprint2020arXiv

The maximum number of s-cliques in connected graphs and its application to spectral moment

Extremal problems concerning the number of complete subgraphs have a long story in extremal graph theory. Let $k_s(G)$ be the number of $s$-cliques in a graph $G$ and $m={{r_m}\choose s}+t_m$, where $0\le t_m\leq r_m$. Edrős showed that $k_s(G)\le {{r_m}\choose s}+{{t_m}\choose{s-1}}$ over all graphs of size $m$ and order $n\geq r_m+1$. %Clearly, $K_{r_m}^{t_m}\cup (n-r_m-1)K_1$ is an extremal graph, where $K_{r_m}^{t_m}$ is the graph by joining a new vertex to $t_m$ vertices of $K_{r_m}$. It is natural to consider an improvement in connected situation: what is the maximum number of $s$-cliques over all connected graphs of size $m$ and order $n$? In this paper, the sharp upper bound of $k_s(G)$ is obtained and extremal graphs are completely characterized. The technique and the bound are different from those in general case. As an application, this result can be used to solve a question on spectral moment.

preprint2020arXiv

Transport of Finite Size Self-Propelled Particles Confined in a 2D Zigzag Channel with Gaussian Colored Noise

The directional transport of finite size self-propelled Brownian particles confined in a 2D zigzag channel with colored noise is investigated. The noises(noise parallel to x-axis and y-axis), the asymmetry parameter Δk, the ratio f(ratio of the particle radius and the bottleneck half width), the selfpropelled speed v0 have joint effect on the particles. The average velocity of self-propelled particles is significantly different from passive particles. The average velocity exhibits complicated behavior with increasing self-propelled speed v0

preprint2019arXiv

ATCSpeech: a multilingual pilot-controller speech corpus from real Air Traffic Control environment

Automatic Speech Recognition (ASR) is greatly developed in recent years, which expedites many applications on other fields. For the ASR research, speech corpus is always an essential foundation, especially for the vertical industry, such as Air Traffic Control (ATC). There are some speech corpora for common applications, public or paid. However, for the ATC, it is difficult to collect raw speeches from real systems due to safety issues. More importantly, for a supervised learning task like ASR, annotating the transcription is a more laborious work, which hugely restricts the prospect of ASR application. In this paper, a multilingual speech corpus (ATCSpeech) from real ATC systems, including accented Mandarin Chinese and English, is built and released to encourage the non-commercial ASR research in ATC domain. The corpus is detailly introduced from the perspective of data amount, speaker gender and role, speech quality and other attributions. In addition, the performance of our baseline ASR models is also reported. A community edition for our speech database can be applied and used under a special contrast. To our best knowledge, this is the first work that aims at building a real and multilingual ASR corpus for the air traffic related research.

preprint2019arXiv

Representations of the orbifold VOAS $L_{\hat{\frak{sl}_2}}(k,0)^K$ and the commutant VOAS $C_{{L_{\hat{\mathfrak{so}_m}}(1,0)}^{\otimes 3}}({L_{\hat{\mathfrak{so}_m}}(3,0)})$

For the Klein group $K$, $k\in\mathbb{Z}_{\geqslant 1}$ and $m\in\mathbb{Z}_{\geqslant 4}$, we study the representations of the orbifold vertex operator algebra $L_{\hat{\mathfrak{sl}_2}}(k,0)^{K}$ and the commutant vertex operator algebra of $L_{\hat{\mathfrak{so}_m}}(3,0)$ in $L_{\hat{\mathfrak{so}_m}}(1,0)^{\otimes 3}$ which can be realized as the orbifold vertex operator subalgebra $L_{\hat{\mathfrak{sl}_2}}(2m,0)^{K}$ or its extension. All the irreducible modules for $L_{\hat{\mathfrak{sl}_2}}(k,0)^{K}$ and $C_{{L_{\hat{\mathfrak{so}_m}}(1,0)}^{\otimes 3}}({L_{\hat{\mathfrak{so}_m}}(3,0)})$ are classified and constructed explicitly.