Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
41works
0followers
32topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

41 published item(s)

preprint2026arXiv

DiffST: Spatiotemporal-Aware Diffusion for Real-World Space-Time Video Super-Resolution

Diffusion-based models have shown strong performance in video super-resolution (VSR) and video frame interpolation (VFI). However, their role in the coupled space-time video super-resolution (STVSR) setting remains limited. Existing diffusion-based STVSR approaches suffer from two issues: (1) low inference efficiency and (2) insufficient utilization of spatiotemporal information. These limitations impede deployment. To address these issues, we introduce DiffST, an efficient spatiotemporal-aware video diffusion framework for real-world STVSR. To improve efficiency, we adapt a pre-trained diffusion model for one-step sampling and process the entire video directly rather than operating on individual frames. Furthermore, to enhance spatiotemporal information utilization, we introduce cross-frame context aggregation (CFCA) and video representation guidance (VRG). The CFCA module aggregates information across multiple keyframes to produce intermediate frames. The VRG module extracts video-level global features to guide the diffusion process. Extensive experiments show that DiffST obtains leading results on real-world STVSR tasks. It also maintains high inference efficiency, running about 17$\times$ faster than previous diffusion-based STVSR methods. Code is available at: https://github.com/zhengchen1999/DiffST.

preprint2026arXiv

EpiGraph: Building Generalists for Evidence-Intensive Epilepsy Reasoning in the Wild

Epilepsy diagnosis and treatment require evidence-intensive reasoning across heterogeneous clinical knowledge, including biosignal patterns, genetic mechanisms, pharmacogenomics, treatment strategies, and patient outcomes. In this work, we present \textsc{EpiGraph}, a large-scale epilepsy knowledge graph and benchmark for evaluating knowledge-augmented clinical reasoning. \textsc{EpiGraph} integrates 48,166 peer-reviewed papers and seven clinical resources into a heterogeneous graph containing 24,324 entities and 32,009 evidence-grounded triplets across five clinical layers. Built upon this graph, \textsc{EpiBench} defines five clinically motivated tasks spanning clinical decision-making, EEG report generation, pharmacogenomic precision medicine, treatment recommendation, and deep research planning. We evaluate six LLMs under both standard and Graph-RAG settings. Results show that integrating \textsc{EpiGraph} consistently improves performance across all tasks, with the largest gains observed in pharmacogenomic reasoning (+30--41\%). Our findings demonstrate that structured epilepsy knowledge substantially enhances evidence-grounded clinical reasoning and provides a practical benchmark framework for evaluating knowledge-augmented LLMs in real-world neurological settings. Our code is available at: https://github.com/LabRAI/EEG-KG.

preprint2026arXiv

LLM as Clinical Graph Structure Refiner: Enhancing Representation Learning in EEG Seizure Diagnosis

Electroencephalogram (EEG) signals are vital for automated seizure detection, but their inherent noise makes robust representation learning challenging. Existing graph construction methods, whether correlation-based or learning-based, often generate redundant or irrelevant edges due to the noisy nature of EEG data. This significantly impairs the quality of graph representation and limits downstream task performance. Motivated by the remarkable reasoning and contextual understanding capabilities of large language models (LLMs), we explore the idea of using LLMs as graph edge refiners. Specifically, we propose a two-stage framework: we first verify that LLM-based edge refinement can effectively identify and remove redundant connections, leading to significant improvements in seizure detection accuracy and more meaningful graph structures. Building on this insight, we further develop a robust solution where the initial graph is constructed using a Transformer-based edge predictor and multilayer perceptron, assigning probability scores to potential edges and applying a threshold to determine their existence. The LLM then acts as an edge set refiner, making informed decisions based on both textual and statistical features of node pairs to validate the remaining connections. Extensive experiments on TUSZ dataset demonstrate that our LLM-refined graph learning framework not only enhances task performance but also yields cleaner and more interpretable graph representations.

preprint2026arXiv

PRISM: Prior Rectification and Uncertainty-Aware Structure Modeling for Diffusion-Based Text Image Super-Resolution

Text image super-resolution (Text-SR) requires more than visually plausible detail synthesis: slight errors in stroke topology may alter character identity and break readability. Existing methods improve text fidelity with stronger recognition-based or generative priors, yet they still face two unresolved challenges under severe degradation: the text condition extracted from low-quality inputs can itself be unreliable, and a plausible global prior does not fully determine fine-grained stroke boundaries. We present PRISM, a single-step diffusion-based Text-SR framework that addresses these two challenges through Flow-Matching Prior Rectification (FMPR) and a Structure-guided Uncertainty-aware Residual Encoder (SURE). FMPR constructs a privileged training-time prior from paired low-quality/high-quality latents and learns a flow matching that transports degraded embeddings toward this restoration-oriented prior space, yielding more accurate and reliable global text guidance. SURE further predicts uncertainty-aware structural residuals to selectively absorb reliable local boundary evidence while suppressing ambiguous stroke cues. Together, these components enable explicit global prior rectification and local structure refinement within a single diffusion restoration pass. Experiments on both synthetic and real-world benchmarks show that PRISM achieves state-of-the-art performance with millisecond-level inference. Our dataset and code will be available at https://github.com/faithxuz/PRISM.

preprint2026arXiv

SkillSmith: Compiling Agent Skills into Boundary-Guided Runtime Interfaces

Recently, skills have been widely adopted in large language model (LLM)-based agent systems across various domains. In existing frameworks, skills are typically injected into the agent reasoning loop as contextual guidance once matched to a runtime task, enabling specialized task-solving capabilities. We find that this execution paradigm introduces two major sources of redundancy: irrelevant context injection and repeated skill-specific reasoning and planning. To this end, we propose SkillSmith, a boundary-first compiler-runtime framework that compiles skill packages offline into minimal executable interfaces. By extracting fine-grained operational boundaries from skills, SkillSmith enables agents to dynamically access and execute only the relevant components at runtime, thereby minimizing unnecessary context injection and redundant reasoning overhead. In the evaluation on SkillsBench benchmark, SkillSmith reduces solve-stage token usage by 57.44%, thinking iterations by 42.99%, solve time by 50.57% (2.02x faster), and token-proportional monetary cost by 57.44% compared with using raw-skills. Moreover, compiled artifacts produced by a stronger model can be reused by a smaller or more efficient runtime model, improving task accuracy in cases where raw skill interpretation fails. The source code and data are available at https://github.com/AetherHeart-AI/Aeloon.

preprint2025arXiv

DeepEvidence: Empowering Biomedical Discovery with Deep Knowledge Graph Research

Biomedical knowledge graphs (KGs) encode vast, heterogeneous information spanning literature, genes, pathways, drugs, diseases, and clinical trials, but leveraging them collectively for scientific discovery remains difficult. Their structural differences, continual evolution, and limited cross-resource alignment require substantial manual integration, limiting the depth and scale of knowledge exploration. We introduce DeepEvidence, an AI-agent framework designed to perform Deep Research across various heterogeneous biomedical KGs. Unlike generic Deep Research systems that rely primarily on internet-scale text, DeepEvidence incorporates specialized knowledge-graph tooling and coordinated exploration strategies to systematically bridge heterogeneous resources. At its core is an orchestrator that directs two complementary agents: Breadth-First ReSearch (BFRS) for broad, multi-graph entity search, and Depth-First ReSearch (DFRS) for multi-hop, evidence-focused reasoning. An internal, incrementally built evidence graph provides a structured record of retrieved entities, relations, and supporting evidence. To operate at scale, DeepEvidence includes unified interfaces for querying diverse biomedical APIs and an execution sandbox that enables programmatic data retrieval, extraction, and analysis. Across established deep-reasoning benchmarks and four key stages of the biomedical discovery lifecycle: drug discovery, pre-clinical experimentation, clinical trial development, and evidence-based medicine, DeepEvidence demonstrates substantial gains in systematic exploration and evidence synthesis. These results highlight the potential of knowledge-graph-driven Deep Research to accelerate biomedical discovery.

preprint2025arXiv

Degrees of freedom of quadratic scalar-nonmetricity theory

We study the number of degrees of freedom (DOFs) in quadratic scalar-nonmetricity (QSN) theory, whose Lagrangian is the linear combination of five quadratic nonmetricity invariants with coefficients depending on a dynamical scalar field. Working in the coincident gauge, we perform the Arnowitt-Deser-Misner decomposition and classify QSN models into distinct cases according to the numbers of their primary constraints. For cases that are physically viable in the sense that both a consistent cosmological background and tensor gravitational waves exist, we count the number of degrees of freedom based on two approaches. First we investigate the linear cosmological perturbations around an FLRW background. Then we perform a Dirac-Bergmann Hamiltonian constraint analysis to count the number of DOFs at the nonperturbative level. We focus on three representative cases. In case II, both the perturbative and nonperturbative approaches yield the same result, which indicates that the theory propagates 10 degrees of freedom. In contrast, in cases V and VI, the Hamiltonian analysis yields 8 degrees of freedom, while only 6 and 5 modes are visible at linear order in perturbations, respectively. This indicates that additional modes are strongly coupled on cosmological backgrounds.

preprint2023arXiv

Automated Sleep Staging via Parallel Frequency-Cut Attention

This paper proposes a novel framework for automatically capturing the time-frequency nature of electroencephalogram (EEG) signals of human sleep based on the authoritative sleep medicine guidance. The framework consists of two parts: the first part extracts informative features by partitioning the input EEG spectrograms into a sequence of time-frequency patches. The second part is constituted by an attention-based architecture to efficiently search for the correlation between partitioned time-frequency patches and defining factors of sleep stages in parallel. The proposed pipeline is validated on the Sleep Heart Health Study dataset with new state-of-the-art results for the stages wake, N2, and N3, obtaining respective F1 scores of 0.93, 0.88, and 0.87, with only EEG signals used. The proposed method also has a high inter-rater reliability of 0.80 kappa. We also visualize the correspondence between sleep staging decisions and features extracted by the proposed method, providing strong interpretability for our model.

preprint2022arXiv

$q$-Munchausen Reinforcement Learning

The recently successful Munchausen Reinforcement Learning (M-RL) features implicit Kullback-Leibler (KL) regularization by augmenting the reward function with logarithm of the current stochastic policy. Though significant improvement has been shown with the Boltzmann softmax policy, when the Tsallis sparsemax policy is considered, the augmentation leads to a flat learning curve for almost every problem considered. We show that it is due to the mismatch between the conventional logarithm and the non-logarithmic (generalized) nature of Tsallis entropy. Drawing inspiration from the Tsallis statistics literature, we propose to correct the mismatch of M-RL with the help of $q$-logarithm/exponential functions. The proposed formulation leads to implicit Tsallis KL regularization under the maximum Tsallis entropy framework. We show such formulation of M-RL again achieves superior performance on benchmark problems and sheds light on more general M-RL with various entropic indices $q$.

preprint2022arXiv

A phase transition driven by subtle distortion without broken symmetry on spin, charge and lattice in Layered LnCu4-δP2(Ln=Eu, Sr)

In the scenario of Landau phase transition theory in condensed matter physics, any thermal dynamic phase transition must be subject to some kind of broken symmetries, that are relative to its spin, charge, orbital and lattice. Here we report a rare phase transition at Tp ~120 K or 140 K in layered materials LnCu4-δP2 (Ln=Eu, Sr) driven by a subtle structural-distortion without any broken symmetry on charge, spin and lattice. The variations of the lattice parameters, (ΔLc/Lc) ~ 0.013% or 0.062%, verified by thermal expansion, is much less than that for a typical crystalline phase transition (~0.5-1%), but the significant anomaly in heat capacity provides clear evidence of its intrinsic nature of thermodynamic transition.

preprint2022arXiv

Adaptive Spike-Like Representation of EEG Signals for Sleep Stages Scoring

Recently there has seen promising results on automatic stage scoring by extracting spatio-temporal features from electroencephalogram (EEG). Such methods entail laborious manual feature engineering and domain knowledge. In this study, we propose an adaptive scheme to probabilistically encode, filter and accumulate the input signals and weight the resultant features by the half-Gaussian probabilities of signal intensities. The adaptive representations are subsequently fed into a transformer model to automatically mine the relevance between features and corresponding stages. Extensive experiments on the largest public dataset against state-of-the-art methods validate the effectiveness of our proposed method and reveal promising future directions.

preprint2022arXiv

CALI: Coarse-to-Fine ALIgnments Based Unsupervised Domain Adaptation of Traversability Prediction for Deployable Autonomous Navigation

Traversability prediction is a fundamental perception capability for autonomous navigation. The diversity of data in different domains imposes significant gaps to the prediction performance of the perception model. In this work, we make efforts to reduce the gaps by proposing a novel coarse-to-fine unsupervised domain adaptation (UDA) model - CALI. Our aim is to transfer the perception model with high data efficiency, eliminate the prohibitively expensive data labeling, and improve the generalization capability during the adaptation from easy-to-obtain source domains to various challenging target domains. We prove that a combination of a coarse alignment and a fine alignment can be beneficial to each other and further design a first-coarse-then-fine alignment process. This proposed work bridges theoretical analyses and algorithm designs, leading to an efficient UDA model with easy and stable training. We show the advantages of our proposed model over multiple baselines in several challenging domain adaptation setups. To further validate the effectiveness of our model, we then combine our perception model with a visual planner to build a navigation system and show the high reliability of our model in complex natural environments where no labeled data is available.

preprint2022arXiv

Cancer Subtyping by Improved Transcriptomic Features Using Vector Quantized Variational Autoencoder

Defining and separating cancer subtypes is essential for facilitating personalized therapy modality and prognosis of patients. The definition of subtypes has been constantly recalibrated as a result of our deepened understanding. During this recalibration, researchers often rely on clustering of cancer data to provide an intuitive visual reference that could reveal the intrinsic characteristics of subtypes. The data being clustered are often omics data such as transcriptomics that have strong correlations to the underlying biological mechanism. However, while existing studies have shown promising results, they suffer from issues associated with omics data: sample scarcity and high dimensionality. As such, existing methods often impose unrealistic assumptions to extract useful features from the data while avoiding overfitting to spurious correlations. In this paper, we propose to leverage a recent strong generative model, Vector Quantized Variational AutoEncoder (VQ-VAE), to tackle the data issues and extract informative latent features that are crucial to the quality of subsequent clustering by retaining only information relevant to reconstructing the input. VQ-VAE does not impose strict assumptions and hence its latent features are better representations of the input, capable of yielding superior clustering performance with any mainstream clustering method. Extensive experiments and medical analysis on multiple datasets comprising 10 distinct cancers demonstrate the VQ-VAE clustering results can significantly and robustly improve prognosis over prevalent subtyping systems.

preprint2022arXiv

Cancer Subtyping via Embedded Unsupervised Learning on Transcriptomics Data

Cancer is one of the deadliest diseases worldwide. Accurate diagnosis and classification of cancer subtypes are indispensable for effective clinical treatment. Promising results on automatic cancer subtyping systems have been published recently with the emergence of various deep learning methods. However, such automatic systems often overfit the data due to the high dimensionality and scarcity. In this paper, we propose to investigate automatic subtyping from an unsupervised learning perspective by directly constructing the underlying data distribution itself, hence sufficient data can be generated to alleviate the issue of overfitting. Specifically, we bypass the strong Gaussianity assumption that typically exists but fails in the unsupervised learning subtyping literature due to small-sized samples by vector quantization. Our proposed method better captures the latent space features and models the cancer subtype manifestation on a molecular basis, as demonstrated by the extensive experimental results.

preprint2022arXiv

Enforcing KL Regularization in General Tsallis Entropy Reinforcement Learning via Advantage Learning

Maximum Tsallis entropy (MTE) framework in reinforcement learning has gained popularity recently by virtue of its flexible modeling choices including the widely used Shannon entropy and sparse entropy. However, non-Shannon entropies suffer from approximation error and subsequent underperformance either due to its sensitivity or the lack of closed-form policy expression. To improve the tradeoff between flexibility and empirical performance, we propose to strengthen their error-robustness by enforcing implicit Kullback-Leibler (KL) regularization in MTE motivated by Munchausen DQN (MDQN). We do so by drawing connection between MDQN and advantage learning, by which MDQN is shown to fail on generalizing to the MTE framework. The proposed method Tsallis Advantage Learning (TAL) is verified on extensive experiments to not only significantly improve upon Tsallis-DQN for various non-closed-form Tsallis entropies, but also exhibits comparable performance to state-of-the-art maximum Shannon entropy algorithms.

preprint2022arXiv

Multi-Target Active Object Tracking with Monte Carlo Tree Search and Target Motion Modeling

In this work, we are dedicated to multi-target active object tracking (AOT), where there are multiple targets as well as multiple cameras in the environment. The goal is maximize the overall target coverage of all cameras. Previous work makes a strong assumption that each camera is fixed in a location and only allowed to rotate, which limits its application. In this work, we relax the setting by allowing all cameras to both move along the boundary lines and rotate. In our setting, the action space becomes much larger, which leads to much higher computational complexity to identify the optimal action. To this end, we propose to leverage the action selection from multi-agent reinforcement learning (MARL) network to prune the search tree of Monte Carlo Tree Search (MCTS) method, so as to find the optimal action more efficiently. Besides, we model the motion of the targets to predict the future position of the targets, which makes a better estimation of the future environment state in the MCTS process. We establish a multi-target 2D environment to simulate the sports games, and experimental results demonstrate that our method can effectively improve the target coverage.

preprint2022arXiv

Multi-Tier Platform for Cognizing Massive Electroencephalogram

An end-to-end platform assembling multiple tiers is built for precisely cognizing brain activities. Being fed massive electroencephalogram (EEG) data, the time-frequency spectrograms are conventionally projected into the episode-wise feature matrices (seen as tier-1). A spiking neural network (SNN) based tier is designed to distill the principle information in terms of spike-streams from the rare features, which maintains the temporal implication in the nature of EEGs. The proposed tier-3 transposes time- and space-domain of spike patterns from the SNN; and feeds the transposed pattern-matrices into an artificial neural network (ANN, Transformer specifically) known as tier-4, where a special spanning topology is proposed to match the two-dimensional input form. In this manner, cognition such as classification is conducted with high accuracy. For proof-of-concept, the sleep stage scoring problem is demonstrated by introducing multiple EEG datasets with the largest comprising 42,560 hours recorded from 5,793 subjects. From experiment results, our platform achieves the general cognition overall accuracy of 87% by leveraging sole EEG, which is 2% superior to the state-of-the-art. Moreover, our developed multi-tier methodology offers visible and graphical interpretations of the temporal characteristics of EEG by identifying the critical episodes, which is demanded in neurodynamics but hardly appears in conventional cognition scenarios.

preprint2022arXiv

Nonlinear Optimal Guidance for Fixed-Time Impact on a Stationary Target

This paper is concerned with devising the nonlinear optimal guidance for intercepting a stationary target with a fixed impact time. According to Pontryagin's Maximum Principle (PMP), some optimality conditions for the solutions of the nonlinear optimal interception problem are established, and the structure of the corresponding optimal control is presented. By employing the optimality conditions, we formulate a parameterized system so that its solution space is the same as that of the nonlinear optimal interception problem. As a consequence, a simple propagation of the parameterized system, without using any optimization method, is sufficient to generate enough sampled data for the mapping from current state and time-to-go to the optimal guidance command. By virtue of the universal approximation theorem, a feedforward neural network, trained by the generated data, is able to represent the mapping from current state and time-to-go to the optimal guidance command. Therefore, the trained network eventually can generate fixed-impact-time nonlinear optimal guidance within a constant time. Finally, the developed nonlinear optimal guidance is exemplified and studied through simulations, showing that the nonlinear optimal guidance law performs better than existing interception guidance laws.

preprint2022arXiv

On-Demand AoI Minimization in Resource-Constrained Cache-Enabled IoT Networks with Energy Harvesting Sensors

We consider a resource-constrained IoT network, where multiple users make on-demand requests to a cache-enabled edge node to send status updates about various random processes, each monitored by an energy harvesting sensor. The edge node serves users' requests by deciding whether to command the corresponding sensor to send a fresh status update or retrieve the most recently received measurement from the cache. Our objective is to find the best actions of the edge node to minimize the average age of information (AoI) of the received measurements upon request, i.e., average on-demand AoI, subject to per-slot transmission and energy constraints. First, we derive a Markov decision process model and propose an iterative algorithm that obtains an optimal policy. Then, we develop an asymptotically optimal low-complexity algorithm -- termed relax-then-truncate -- and prove that it is optimal as the number of sensors goes to infinity. Simulation results illustrate that the proposed relax-then-truncate approach significantly reduces the average on-demand AoI compared to a request-aware greedy (myopic) policy and also depict that it performs close to the optimal solution even for moderate numbers of sensors.

preprint2022arXiv

Optimal MIMO Combining for Blind Federated Edge Learning with Gradient Sparsification

We provide the optimal receive combining strategy for federated learning in multiple-input multiple-output (MIMO) systems. Our proposed algorithm allows the clients to perform individual gradient sparsification which greatly improves performance in scenarios with heterogeneous (non i.i.d.) training data. The proposed method beats the benchmark by a wide margin.

preprint2022arXiv

Restricted mean survival time regression model with time-dependent covariates

In clinical or epidemiological follow-up studies, methods based on time scale indicators such as the restricted mean survival time (RMST) have been developed to some extent. Compared with traditional hazard rate indicator system methods, the RMST is easier to interpret and does not require the proportional hazard assumption. To date, regression models based on the RMST are indirect or direct models of the RMST and baseline covariates. However, time-dependent covariates are becoming increasingly common in follow-up studies. Based on the inverse probability of censoring weighting (IPCW) method, we developed a regression model of the RMST and time-dependent covariates. Through Monte Carlo simulation, we verified the estimation performance of the regression parameters of the proposed model. Compared with the time-dependent Cox model and the fixed (baseline) covariate RMST model, the time-dependent RMST model has a better prediction ability. Finally, an example of heart transplantation was used to verify the above conclusions.

preprint2022arXiv

Robust Beamforming Design for IRS-Aided URLLC in D2D Networks

Intelligent reflecting surface (IRS) and device-to-device (D2D) communication are two promising technologies for improving transmission reliability between transceivers in communication systems. In this paper, we consider the design of reliable communication between the access point (AP) and actuators for a downlink multiuser multiple-input single-output (MISO) system in the industrial IoT (IIoT) scenario. We propose a two-stage protocol combining IRS with D2D communication so that all actuators can successfully receive the message from AP within a given delay. The superiority of the protocol is that the communication reliability between AP and actuators is doubly augmented by the IRS-aided first-stage transmission and the second-stage D2D transmission. A joint optimization problem of active and passive beamforming is formulated, which aims to maximize the number of actuators with successful decoding. We study the joint beamforming problem for cases where the channel state information (CSI) is perfect and imperfect. For each case, we develop efficient algorithms that include convergence and complexity analysis. Simulation results demonstrate the necessity and role of IRS with a well-optimized reflection matrix, and the D2D network in promoting reliable communication. Moreover, the proposed protocol can enable reliable communication even in the presence of stringent latency requirements and CSI estimation errors.

preprint2022arXiv

StructNeRF: Neural Radiance Fields for Indoor Scenes with Structural Hints

Neural Radiance Fields (NeRF) achieve photo-realistic view synthesis with densely captured input images. However, the geometry of NeRF is extremely under-constrained given sparse views, resulting in significant degradation of novel view synthesis quality. Inspired by self-supervised depth estimation methods, we propose StructNeRF, a solution to novel view synthesis for indoor scenes with sparse inputs. StructNeRF leverages the structural hints naturally embedded in multi-view inputs to handle the unconstrained geometry issue in NeRF. Specifically, it tackles the texture and non-texture regions respectively: a patch-based multi-view consistent photometric loss is proposed to constrain the geometry of textured regions; for non-textured ones, we explicitly restrict them to be 3D consistent planes. Through the dense self-supervised depth constraints, our method improves both the geometry and the view synthesis performance of NeRF without any additional training on external data. Extensive experiments on several real-world datasets demonstrate that StructNeRF surpasses state-of-the-art methods for indoor scenes with sparse inputs both quantitatively and qualitatively.

preprint2022arXiv

Treating Interference as Noise in Cell-Free Massive MIMO Networks

How to manage the interference introduced by the enormous wireless devices is a crucial issue to address in the prospective sixth-generation (6G) communications. The treating interference as noise (TIN) optimality conditions are commonly used for interference management and thus attract significant interest in existing wireless systems. Cell-free massive multiple-input multiple-output (CF mMIMO) is a promising technology in 6G that exhibits high system throughput and excellent interference management by exploiting a large number of access points (APs) to serve the users collaboratively. In this paper, we take the first step on studying TIN in CF mMIMO systems from a stochastic geometry perspective by investigating the probability that the TIN conditions hold with spatially distributed network nodes. We propose a novel analytical framework for TIN in a CF mMIMO system with both Binomial Point Process (BPP) and Poisson Point Process (PPP) approximations. We derive the probability that the TIN conditions hold in close form using the PPP approximation. Numerical results validate our derived expressions and illustrate the impact of various system parameters on the probability that the TIN conditions hold.

preprint2021arXiv

Atomic-scale investigation of the irradiation-resistant effect of symmetric tilt grain boundaries of Fe-Ni-Cr alloy

In this paper, the Fe-20Ni-25Cr alloy that is used for fuel cladding or pressure vessels with various grain boundaries (GBs) was investigated by employing molecular dynamics simulations. The bi-crystals comprised of Σ3(111), Σ3(112), Σ9(114), Σ11(113), Σ19(116), and Σ17(223) types GBs were considered to systematically examine the interplay between irradiation defects, irradiation microstructure evolution under stress, and irradiation mechanical properties with irradiation intensity, coincidence site lattice parameter, tilt angle, and GB thickness. It is found that irradiated vacancies and interstitials are annihilated by competitive GB absorption and recombination. Bias absorption of interstitials is observed for most bi-crystals except Σ3(111) and Σ11(113) at 15 keV incident energy, and results in abundant residual vacancies clusters in grain interior. In addition, different GBs exhibit quite diverse irradiation defect sink ability, and the number of residual vacancies is inversely related to the GB thickness, where Σ3(111) and Σ11(113) GBs with narrow GB thickness are weak in defect absorption and the others are strong. Furthermore, uniaxial tensile simulations perpendicular to the GB reveal that all of the mechanical performance of bi-crystals deteriorates after irradiation, which originates from dislocation propagation facilitated by irradiation defect clusters. In particular, regardless of whether the irradiation is applied, the maximum tensile strain, toughness, and Youngs modulus are monotonically correlated with GB tilt angle, while the ultimate tensile strength is stable for larger GB CSL parameter. Finally, on the basis of the evolution of the irradiation defects, microstructures, and mechanical performances, we proposed guidelines of rational design of irradiation-resistant Fe-Ni-Cr alloy.

preprint2021arXiv

Carbon Free High Loading Silicon Anodes Enabled by Sulfide Solid Electrolytes for Robust All Solid-State Batteries

The development of silicon anodes to replace conventional graphite in efforts to increase energy densities of lithium-ion batteries has been largely impeded by poor interfacial stability against liquid electrolytes. Here, stable operation of 99.9 weight% micro-Si (uSi) anode is enabled by utilizing the interface passivating properties of sulfide based solid-electrolytes. Bulk to surface characterization, as well as quantification of interfacial components showed that such an approach eliminates continuous interfacial growth and irreversible lithium losses. In uSi || layered-oxide full cells, high current densities at room temperature (5 mA cm 2), wide operating temperature (-20°C to 80°C) and high loadings (>11 mAh cm-2) were demonstrated for both charge and discharge operations. The promising battery performance can be attributed to both the desirable interfacial property between uSi and sulfide electrolytes, as well as the unique chemo-mechanical behavior of the Li-Si alloys.

preprint2021arXiv

Consensus-Based Distributed Computation of Link-Based Network Metrics

Average consensus algorithms have wide applications in distributed computing systems where all the nodes agree on the average value of their initial states by only exchanging information with their local neighbors. In this letter, we look into link-based network metrics which are polynomial functions of pair-wise node attributes defined over the links in a network. Different from node-based average consensus, such link-based metrics depend on both the distribution of node attributes and the underlying network topology. We propose a general algorithm using the weighted average consensus protocol for the distributed computation of link-based network metrics and provide the convergence conditions and convergence rate analysis.

preprint2021arXiv

Elongation of Curvature-Bounded Path

The paper is concerned with elongating the shortest curvature-bounded path between two oriented points to an expected length. The elongation of curvature-bounded paths to an expected length is fundamentally important to plan missions for nonholonomic-constrained vehicles in many practical applications, such as coordinating multiple nonholonomic-constrained vehicles to reach a destination simultaneously or performing a mission with a strict time window. In the paper, the explicit conditions for the existence of curvature-bounded paths joining two oriented points with an expected length are established by applying the properties of the reachability set of curvature-bounded paths. These existence conditions are numerically verifiable, allowing readily checking the existence of curvature-bounded paths between two prescribed oriented points with a desired length. In addition, once the existence conditions are met, elongation strategies are provided in the paper to get curvature-bounded paths with expected lengths. Finally, some examples of minimum-time path planning for multiple fixed-wing aerial vehicles to cooperatively achieve a triangle-shaped flight formation are presented, illustrating and verifying the developments of the paper.

preprint2021arXiv

Optimizing Information Freshness in a Multiple Access Channel with Heterogeneous Devices

In this work, we study age-optimal scheduling with stability constraints in a multiple access channel with two heterogeneous source nodes transmitting to a common destination. The first node is connected to a power grid and it has randomly arriving data packets. Another energy harvesting (EH) sensor monitors a stochastic process and sends status updates to the destination. We formulate an optimization problem that aims at minimizing the average age of information (AoI) of the EH node subject to the queue stability condition of the grid-connected node. First, we consider a Probabilistic Random Access (PRA) policy where both nodes make independent transmission decisions based on some fixed probability distributions. We show that with this policy, the average AoI is equal to the average peak AoI, if the EH node only sends freshly generated samples. In addition, we derive the optimal solution in closed form, which reveals some interesting properties of the considered system. Furthermore, we consider a Drift-Plus-Penalty (DPP) policy and develop AoI-optimal and peak-AoI-optimal scheduling algorithms using the Lyapunov optimization theory. Simulation results show that the DPP policy outperforms the PRA policy in various scenarios, especially when the destination node has low multi-packet reception capabilities.

preprint2021arXiv

Solving the linear transport equation by a deep neural network approach

In this paper, we study the linear transport model by adopting the deep learning method, in particular the deep neural network (DNN) approach. While the interest of using DNN to study partial differential equations is arising, here we adapt it to study kinetic models, in particular the linear transport model. Moreover, theoretical analysis on the convergence of the neural network and its approximated solution towards the analytic solution is shown. We demonstrate the accuracy and effectiveness of the proposed DNN method in the numerical experiments.

preprint2021arXiv

Time-Optimal Guidance for Intercepting Moving Targets with Impact-Angle Constraints

The minimum-time path for intercepting a moving target with a prescribed impact angle is studied in the paper. The candidate paths from Pontryagin's maximum principle are analyzed, so that each candidate is related to a zero of a real-valued function. It is found that the real-valued functions or their first-order derivatives can be converted to polynomials of at most fourth degree. As a result, each canidate path can be computed within a constant time by embedding a standard polynomial solver into the typical bisection method. The control strategy along the shortest candidate eventually gives rise to the time-optimal guidance law. Finally, the developments of the paper is illustrated and verified by three numerical examples.

preprint2020arXiv

An Ontology-driven Treatment Article Retrieval System for Precision Oncology

This paper presents an ontology-driven treatment article retrieval system developed and experimented using the data and ground truths provided by the TREC 2017 precision medicine track. The key aspects of our system include: meaningful integration of various disease, gene, and drug name ontologies, training of a novel perceptron model for article relevance labeling, a ranking module that considers additional factors such as journal impact and article publication year, and comprehensive query matching rules. Experimental results demonstrate that our proposed system considerably outperforms the results of the best participating system of the TREC 2017 precision medicine challenge.

preprint2020arXiv

Highly flexible electromagnetic interference shielding films based on ultrathin Ni/Ag composites on paper substrates

Highly flexible electromagnetic interference (EMI) shielding material with excellent shielding performance is of great significance to practical applications in next-generation flexible devices. However, most EMI materials suffer from insufficient flexibility and complicated preparation methods. In this study, we propose a new scheme to fabricate a magnetic Ni particle/Ag matrix composite ultrathin film on a paper surface. For a ~2 micro meter thick film on paper, the EMI shielding effectiveness (SE) was found to be 46.2 dB at 8.1 GHz after bending 200,000 times over a radius of ~2 mm. The sheet resistance (Rsq) remained lower than 2.30 Ohm after bending 200,000 times. Contrary to the change in Rsq, the EMI SE of the film generally increased as the weight ratio of Ag to Ni increased, in accordance with the principle that EMI SE is positively related with an increase in electrical conductivity. Desirable EMI shielding ability, ultrahigh flexibility, and simple processing provide this material with excellent application prospects.

preprint2020arXiv

Hybrid Ranking Network for Text-to-SQL

In this paper, we study how to leverage pre-trained language models in Text-to-SQL. We argue that previous approaches under utilize the base language models by concatenating all columns together with the NL question and feeding them into the base language model in the encoding stage. We propose a neat approach called Hybrid Ranking Network (HydraNet) which breaks down the problem into column-wise ranking and decoding and finally assembles the column-wise outputs into a SQL query by straightforward rules. In this approach, the encoder is given a NL question and one individual column, which perfectly aligns with the original tasks BERT/RoBERTa is trained on, and hence we avoid any ad-hoc pooling or additional encoding layers which are necessary in prior approaches. Experiments on the WikiSQL dataset show that the proposed approach is very effective, achieving the top place on the leaderboard.

preprint2020arXiv

Neural Stochastic Block Model & Scalable Community-Based Graph Learning

This paper proposes a novel scalable community-based neural framework for graph learning. The framework learns the graph topology through the task of community detection and link prediction by optimizing with our proposed joint SBM loss function, which results from a non-trivial adaptation of the likelihood function of the classic Stochastic Block Model (SBM). Compared with SBM, our framework is flexible, naturally allows soft labels and digestion of complex node attributes. The main goal is efficient valuation of complex graph data, therefore our design carefully aims at accommodating large data, and ensures there is a single forward pass for efficient evaluation. For large graph, it remains an open problem of how to efficiently leverage its underlying structure for various graph learning tasks. Previously it can be heavy work. With our community-based framework, this becomes less difficult and allows the task models to basically plug-in-and-play and perform joint training. We currently look into two particular applications, the graph alignment and the anomalous correlation detection, and discuss how to make use of our framework to tackle both problems. Extensive experiments are conducted to demonstrate the effectiveness of our approach. We also contributed tweaks of classic techniques which we find helpful for performance and scalability. For example, 1) the GAT+, an improved design of GAT (Graph Attention Network), the scaled-cosine similarity, and a unified implementation of the convolution/attention based and the random-walk based neural graph models.

preprint2020arXiv

On Dubins Paths to a Circle

This paper is concerned with characterizing the shortest path of a Dubins vehicle from a position with a prescribed heading angle to a target circle with the final heading tangential to the target circle. Such a shortest path is of significant importance as it is usually required in real-world scenarios, such as taking a snapshot of a forbidden region or loitering above a ground sensor to collect data by a fixed-wing unmanned aerial vehicle in a minimum time. By applying Pontryagin's maximum principle, some geometric properties for the shortest path are established {without considering any assumption on the relationship between the minimum turning radius and radius of the target circle}, showing that the shortest path must lie in a sufficient family of 12 types. By employing those geometric properties, the analytical solution to each type is devised so that the length of each type can be computed in a constant time. In addition, some properties depending on problem parameters are found so that the shortest path can be computed without checking all the 12 types. Finally, some numerical simulations are presented, illustrating and validating the developments of the paper.

preprint2020arXiv

Pre-Training for Query Rewriting in A Spoken Language Understanding System

Query rewriting (QR) is an increasingly important technique to reduce customer friction caused by errors in a spoken language understanding pipeline, where the errors originate from various sources such as speech recognition errors, language understanding errors or entity resolution errors. In this work, we first propose a neural-retrieval based approach for query rewriting. Then, inspired by the wide success of pre-trained contextual language embeddings, and also as a way to compensate for insufficient QR training data, we propose a language-modeling (LM) based approach to pre-train query embeddings on historical user conversation data with a voice assistant. In addition, we propose to use the NLU hypotheses generated by the language understanding system to augment the pre-training. Our experiments show pre-training provides rich prior information and help the QR task achieve strong performance. We also show joint pre-training with NLU hypotheses has further benefit. Finally, after pre-training, we find a small set of rewrite pairs is enough to fine-tune the QR model to outperform a strong baseline by full training on all QR training data.

preprint2020arXiv

TADOC: Text Analytics Directly on Compression

This article provides a comprehensive description of Text Analytics Directly on Compression (TADOC), which enables direct document analytics on compressed textual data. The article explains the concept of TADOC and the challenges to its effective realizations. Additionally, a series of guidelines and technical solutions that effectively address those challenges, including the adoption of a hierarchical compression method and a set of novel algorithms and data structure designs, are presented. Experiments on six data analytics tasks of various complexities show that TADOC can save 90.8% storage space and 87.9% memory usage, while halving data processing times.

preprint2020arXiv

Theoretical analysis on droplet vaporization at elevated temperatures and pressures

Droplet vaporization has a great impact on combustion of liquid fuels and might greatly affects the engine performance. Physical understanding and theoretical interpretation of the droplet vaporization behavior are decisive to constitute appropriate models for spray combustion. In this work, we proposed a fully transient formulation for the droplet vaporization at wide ranges of temperature and pressure. The governing equations are solved analytically by means of suitable coordinate transformations. The droplet surface temperature and mass fraction of fuel vapor are determined by the matching conditions at the liquid-gas interface. Accordingly, a theoretical model, characterizing the time change of droplet size during its vaporization, is proposed in terms of the unsteadiness factor and droplet vaporization lifetime. The theoretical model successfully reveals the conventional recognition of droplet vaporization characteristics, and its predicted droplet size history agrees well with experimental results reported in the literature. Besides, based on the present theoretical analysis, the experimentally observed dual effect of pressure upon droplet vaporization is analyzed with help of an explicit formula for enthalpy of vaporization indicating its temperature-dependence.

preprint2020arXiv

Three-dimensional topological semimetal phase in layered TaNiTe5 probed by de Haas-van Alphen effect

Layered three-dimensional (3D) topological semimetals have attracted intensively attention due to the exotic phenomena and abundantly tunable properties. Here we report the experimental evidence for the 3D topological semimetal phase in layered material TaNiTe5 single crystals through quantum oscillations. Strong quantum oscillations have been observed with diamagnetism background in TaNiTe5. By analyzing the de Haas-van Alphen oscillations, multi-periodic oscillations were extracted, in content with magnetotransport measurements. Moreover, nontrivial "π" Berry phase with 3D Fermi surface is identified, indicating the topologically nontrivial feature in TaNiTe5. Additionally, we demonstrated the thin-layer of TaNiTe5 crystals is highly feasible by the mechanical exfoliation, which offers a platform to explore exotic properties in low dimensional topological semimetal and paves the way for potential applications in nanodevices.

preprint2019arXiv

A fast implicit solver for semiconductor models in one space dimension

Several different approaches are proposed for solving fully implicit discretizations of a simplified Boltzmann-Poisson system with a linear relaxation-type collision kernel. This system models the evolution of free electrons in semiconductor devices under a low-density assumption. At each implicit time step, the discretized system is formulated as a fixed-point problem, which can then be solved with a variety of methods. A key algorithmic component in all the approaches considered here is a recently developed sweeping algorithm for Vlasov-Poisson systems. A synthetic acceleration scheme has been implemented to accelerate the convergence of iterative solvers by using the solution to a drift-diffusion equation as a preconditioner. The performance of four iterative solvers and their accelerated variants has been compared on problems modeling semiconductor devices with various electron mean-free-path.