Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
19works
0followers
24topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

19 published item(s)

preprint2026arXiv

Position: Assistive Agents Need Accessibility Alignment

Assistive agents for Blind and Visually Impaired (BVI) users require accessibility alignment as a first-class design objective. Despite rapid progress in agentic AI, most systems are designed and evaluated under assumptions of sighted interaction, low-cost verification, and tolerable trial-and-error, leading to systematic failures in assistive scenarios that cannot be resolved by model scaling or post-hoc interface adaptations alone. Drawing on an analysis of 778 assistance task instances from prior work, we show that current agentic AI remain prone to failure in assistive scenarios due to mismatches between sighted-user design assumptions and the verification, risk, and interaction constraints faced by BVI users. We argue that accessibility should be treated as an alignment problem rather than a peripheral usability concern. To this end, we introduce accessibility alignment and propose a lifecycle-oriented design pipeline for accessibility-aligned assistive agents, spanning user research, system design, deployment and post-deployment iteration. We conclude that BVI-centered assistive tasks provide a critical stress test for agentic AI and motivate a broader shift toward inclusive agent design.

preprint2023arXiv

DCS-RISR: Dynamic Channel Splitting for Efficient Real-world Image Super-Resolution

Real-world image super-resolution (RISR) has received increased focus for improving the quality of SR images under unknown complex degradation. Existing methods rely on the heavy SR models to enhance low-resolution (LR) images of different degradation levels, which significantly restricts their practical deployments on resource-limited devices. In this paper, we propose a novel Dynamic Channel Splitting scheme for efficient Real-world Image Super-Resolution, termed DCS-RISR. Specifically, we first introduce the light degradation prediction network to regress the degradation vector to simulate the real-world degradations, upon which the channel splitting vector is generated as the input for an efficient SR model. Then, a learnable octave convolution block is proposed to adaptively decide the channel splitting scale for low- and high-frequency features at each block, reducing computation overhead and memory cost by offering the large scale to low-frequency features and the small scale to the high ones. To further improve the RISR performance, Non-local regularization is employed to supplement the knowledge of patches from LR and HR subspace with free-computation inference. Extensive experiments demonstrate the effectiveness of DCS-RISR on different benchmark datasets. Our DCS-RISR not only achieves the best trade-off between computation/parameter and PSNR/SSIM metric, and also effectively handles real-world images with different degradation levels.

preprint2022arXiv

Adaptive Graph Diffusion Networks

Graph Neural Networks (GNNs) have received much attention in the graph deep learning domain. However, recent research empirically and theoretically shows that deep GNNs suffer from over-fitting and over-smoothing problems. The usual solutions either cannot solve extensive runtime of deep GNNs or restrict graph convolution in the same feature space. We propose the Adaptive Graph Diffusion Networks (AGDNs) which perform multi-layer generalized graph diffusion in different feature spaces with moderate complexity and runtime. Standard graph diffusion methods combine large and dense powers of the transition matrix with predefined weighting coefficients. Instead, AGDNs combine smaller multi-hop node representations with learnable and generalized weighting coefficients. We propose two scalable mechanisms of weighting coefficients to capture multi-hop information: Hop-wise Attention (HA) and Hop-wise Convolution (HC). We evaluate AGDNs on diverse, challenging Open Graph Benchmark (OGB) datasets with semi-supervised node classification and link prediction tasks. Until the date of submission (Aug 26, 2022), AGDNs achieve top-1 performance on the ogbn-arxiv, ogbn-proteins and ogbl-ddi datasets and top-3 performance on the ogbl-citation2 dataset. On the similar Tesla V100 GPU cards, AGDNs outperform Reversible GNNs (RevGNNs) with 13% complexity and 1% training runtime of RevGNNs on the ogbn-proteins dataset. AGDNs also achieve comparable performance to SEAL with 36% training and 0.2% inference runtime of SEAL on the ogbl-citation2 dataset.

preprint2022arXiv

Deep Machine Learning Reconstructing Lattice Topology with Strong Thermal Fluctuations

Applying artificial intelligence to scientific problems (namely AI for science) is currently under hot debate. However, the scientific problems differ much from the conventional ones with images, texts, and etc., where new challenges emerges with the unbalanced scientific data and complicated effects from the physical setups. In this work, we demonstrate the validity of the deep convolutional neural network (CNN) on reconstructing the lattice topology (i.e., spin connectivities) in the presence of strong thermal fluctuations and unbalanced data. Taking the kinetic Ising model with Glauber dynamics as an example, the CNN maps the time-dependent local magnetic momenta (a single-node feature) evolved from a specific initial configuration (dubbed as an evolution instance) to the probabilities of the presences of the possible couplings. Our scheme distinguishes from the previous ones that might require the knowledge on the node dynamics, the responses from perturbations, or the evaluations of statistic quantities such as correlations or transfer entropy from many evolution instances. The fine tuning avoids the "barren plateau" caused by the strong thermal fluctuations at high temperatures. Accurate reconstructions can be made where the thermal fluctuations dominate over the correlations and consequently the statistic methods in general fail. Meanwhile, we unveil the generalization of CNN on dealing with the instances evolved from the unlearnt initial spin configurations and those with the unlearnt lattices. We raise an open question on the learning with unbalanced data in the nearly "double-exponentially" large sample space.

preprint2022arXiv

Design, Uncertainty Analysis and Measurement of a Silicon-based Platelet THz Corrugated Horn

Platelets corrugated horn is a promising technology for their scalability to a large corrugated horn array. In this paper, we present the design, fabrication, measurement and uncertainty analysis of a wideband 170-320 GHz platelet corrugated horn that features with low sidelobe across the band (<-30 dB). We also propose an accurate and universal method to analyze the axial misalignment of the platelets for the first time. It is based on the mode matching (MM) method with a closed-form solution to off-axis circular waveguide discontinuities obtained by using Graf addition theorem for the Bessel functions. The uncertainties introduced in the fabrication have been quantitatively analyzed using the Monte Carlo method. The analysis shows the cross-polarization of the corrugated horn degrades significantly with the axial misalignment. It well explains the discrepancy between the designed and the measured cross-polarization of platelets corrugated horn fabricated in THz band. The method can be used to determine the fabrication tolerance needed for other THz corrugated horns and evaluate the impact of the corrugated horn for astronomical observations.

preprint2022arXiv

Functional Tensor Network Solving Many-body Schrödinger Equation

Schrödinger equation belongs to the most fundamental differential equations in quantum physics. However, the exact solutions are extremely rare, and many analytical methods are applicable only to the cases with small perturbations or weak correlations. Solving the many-body Schrödinger equation in the continuous spaces with the presence of strong correlations is an extremely important and challenging issue. In this work, we propose the functional tensor network (FTN) approach to solve the many-body Schrödinger equation. Provided the orthonormal functional bases, we represent the coefficients of the many-body wave-function as tensor network. The observables, such as energy, can be calculated simply by tensor contractions. Simulating the ground state becomes solving a minimization problem defined by the tensor network. An efficient gradient-decent algorithm based on the automatically differentiable tensors is proposed. We here take matrix product state (MPS) as an example, whose complexity scales only linearly with the system size. We apply our approach to solve the ground state of coupled harmonic oscillators, and achieve high accuracy by comparing with the exact solutions. Reliable results are also given with the presence of three-body interactions, where the system cannot be decoupled to isolated oscillators. Our approach is simple and with well-controlled error, superior to the highly-nonlinear neural-network solvers. Our work extends the applications of tensor network from quantum lattice models to the systems in the continuous space. FTN can be used as a general solver of the differential equations with many variables. The MPS exemplified here can be generalized to, e.g., the fermionic tensor networks, to solve the electronic Schrödinger equation.

preprint2022arXiv

Hardness prediction of age-hardening aluminum alloy based on ensemble learning

With the rapid development of artificial intelligence, the combination of material database and machine learning has driven the progress of material informatics. Because aluminum alloy is widely used in many fields, so it is significant to predict the properties of aluminum alloy. In this thesis, the data of Al-Cu-Mg-X (X: Zn, Zr, etc.) alloy are used to input the composition, aging conditions (time and temperature) and predict its hardness. An ensemble learning solution based on automatic machine learning and an attention mechanism introduced into the secondary learner of deep neural network are proposed respectively. The experimental results show that selecting the correct secondary learner can further improve the prediction accuracy of the model. This manuscript introduces the attention mechanism to improve the secondary learner based on deep neural network, and obtains a fusion model with better performance. The R-Square of the best model is 0.9697 and the MAE is 3.4518HV.

preprint2022arXiv

Investigation of Optical Coupling in Microwave Kinetic Inductance Detectors using Superconducting Reflective Plates

To improve the optical coupling in Microwave Kinetic Inductance Detectors (MKIDs), we investigate the use of a reflective plate beneath the meandered absorber. We designed, fabricated and characterized high-Q factors TiN-based MKIDs on sapphire operating at optical wavelengths with a Au/Nb reflective thin bilayer below the meander. The reflector is set at a quarter-wave distance from the meander using a transparent Al$_2$O$_3$ dielectric layer to reach the peak photon absorption. We expect the plate to recover undetected photons by reflecting them back onto the absorber.

preprint2022arXiv

Spatio-Temporal Latent Graph Structure Learning for Traffic Forecasting

Accurate traffic forecasting, the foundation of intelligent transportation systems (ITS), has never been more significant than nowadays due to the prosperity of smart cities and urban computing. Recently, Graph Neural Network truly outperforms the traditional methods. Nevertheless, the most conventional GNN-based model works well while given a pre-defined graph structure. And the existing methods of defining the graph structures focus purely on spatial dependencies and ignore the temporal correlation. Besides, the semantics of the static pre-defined graph adjacency applied during the whole training progress is always incomplete, thus overlooking the latent topologies that may fine-tune the model. To tackle these challenges, we propose a new traffic forecasting framework -- Spatio-Temporal Latent Graph Structure Learning networks (ST-LGSL). More specifically, the model employs a graph generator based on Multilayer perceptron and K-Nearest Neighbor, which learns the latent graph topological information from the entire data considering both spatial and temporal dynamics. Furthermore, with the initialization of MLP-kNN based on ground-truth adjacency matrix and similarity metric in kNN, ST-LGSL aggregates the topologies focusing on geography and node similarity. Additionally, the generated graphs act as the input of the Spatio-temporal prediction module combined with the Diffusion Graph Convolutions and Gated Temporal Convolutions Networks. Experimental results on two benchmarking datasets in real world demonstrate that ST-LGSL outperforms various types of state-of-art baselines.

preprint2021arXiv

Image-to-image Translation via Hierarchical Style Disentanglement

Recently, image-to-image translation has made significant progress in achieving both multi-label (\ie, translation conditioned on different labels) and multi-style (\ie, generation with diverse styles) tasks. However, due to the unexplored independence and exclusiveness in the labels, existing endeavors are defeated by involving uncontrolled manipulations to the translation results. In this paper, we propose Hierarchical Style Disentanglement (HiSD) to address this issue. Specifically, we organize the labels into a hierarchical tree structure, in which independent tags, exclusive attributes, and disentangled styles are allocated from top to bottom. Correspondingly, a new translation process is designed to adapt the above structure, in which the styles are identified for controllable translations. Both qualitative and quantitative results on the CelebA-HQ dataset verify the ability of the proposed HiSD. We hope our method will serve as a solid baseline and provide fresh insights with the hierarchically organized annotations for future research in image-to-image translation. The code has been released at https://github.com/imlixinyang/HiSD.

preprint2021arXiv

Performance Analysis and Improvement on DSRC Application for V2V Communication

In this paper, we focus on the performance of vehicle-to-vehicle (V2V) communication adopting the Dedicated Short Range Communication (DSRC) application in periodic broadcast mode. An analytical model is studied and a fixed point method is used to analyze the packet delivery ratio (PDR) and mean delay based on the IEEE 802.11p standard in a fully connected network under the assumption of perfect PHY performance. With the characteristics of V2V communication, we develop the Semi-persistent Contention Density Control (SpCDC) scheme to improve the DSRC performance. We use Monte Carlo simulation to verify the results obtained by the analytical model. The simulation results show that the packet delivery ratio in SpCDC scheme increases more than 10% compared with IEEE 802.11p in heavy vehicle load scenarios. Meanwhile, the mean reception delay decreases more than 50%, which provides more reliable road safety.

preprint2020arXiv

Energy Self-Sustainability in Full-Spectrum 6G

Full-spectrum ranging from sub 6 GHz to THz and visible light will be exploited in 6G in order to reach unprecedented key-performance-indicators (KPIs). However, extraordinary amount of energy will be consumed by network infrastructure, while functions of massively deployed Internet of Everything (IoE) devices are limited by embedded batteries. Therefore, energy self-sustainable 6G is proposed in this article. First of all, it may achieve network-wide energy efficiency by exploiting cell-free and airborne access networks as well as by implementing intelligent holographic environments. Secondly, by exploiting radio-frequency/visible light signals for providing on-demand wireless power transfer (WPT) and for enabling passive backscatter communication, ``zero-energy&#39;&#39; devices may become a reality. Furthermore, IoE devices actively adapt their transceivers for better performance to a dynamic environment. This article aims to provide a first glance at primary designing principles of energy self-sustainable 6G.

preprint2020arXiv

Gap Probabilities in the Laguerre Unitary Ensemble and Discrete Painlevé Equations

In this paper we study a certain recurrence relation, that can be used to generate ladder operators for the Laguerre Unitary ensemble, from the point of view of Sakai&#39;s geometric theory of Painlevé equations. On one hand, this gives us one more detailed example of the appearance of discrete Painlevé equations in the theory of orthogonal polynomials. On the other hand, it serves as a good illustration of the effectiveness of a recently proposed procedure on how to reduce such recurrences to some canonical discrete Painlevé equations.

preprint2020arXiv

H-AP Deployment for Joint Wireless Information and Energy Transfer in Smart Cities

With the wireless communication being more various in the future, it&#39;s becoming challenging to prolong the lifetime of many battery powered devices, since frequently replacing their batteries is a cumbersome job. An hybrid access point (H-AP) is capable of simultaneously operating wireless information transfer (WIT) and wireless energy transfer (WET) by exploiting the radio frequency (RF) signals. By jointly considering both the mobility of the user and the popularity of the sites, we focus on the design of the H-AP&#39;s deployment scheme. Specifically, a mobility model of the grid based city streets is exploited for characterising the users&#39; movements. Based on this mobility model, the impact of the deployment site&#39;s popularity on the WIT and WET efficiencies is firstly analysed. Then, an H-AP deployment scheme for striking a balance between the WIT and the WET efficiencies is proposed, which is regarded as the B-deployment scheme. The simulation results demonstrate that the B-deployment scheme is more flexible for satisfying diverse requirement of the WIT and WET efficiencies.

preprint2020arXiv

Interpretable Machine Learning Model for Early Prediction of Mortality in Elderly Patients with Multiple Organ Dysfunction Syndrome (MODS): a Multicenter Retrospective Study and Cross Validation

Background: Elderly patients with MODS have high risk of death and poor prognosis. The performance of current scoring systems assessing the severity of MODS and its mortality remains unsatisfactory. This study aims to develop an interpretable and generalizable model for early mortality prediction in elderly patients with MODS. Methods: The MIMIC-III, eICU-CRD and PLAGH-S databases were employed for model generation and evaluation. We used the eXtreme Gradient Boosting model with the SHapley Additive exPlanations method to conduct early and interpretable predictions of patients&#39; hospital outcome. Three types of data source combinations and five typical evaluation indexes were adopted to develop a generalizable model. Findings: The interpretable model, with optimal performance developed by using MIMIC-III and eICU-CRD datasets, was separately validated in MIMIC-III, eICU-CRD and PLAGH-S datasets (no overlapping with training set). The performances of the model in predicting hospital mortality as validated by the three datasets were: AUC of 0.858, sensitivity of 0.834 and specificity of 0.705; AUC of 0.849, sensitivity of 0.763 and specificity of 0.784; and AUC of 0.838, sensitivity of 0.882 and specificity of 0.691, respectively. Comparisons of AUC between this model and baseline models with MIMIC-III dataset validation showed superior performances of this model; In addition, comparisons in AUC between this model and commonly used clinical scores showed significantly better performance of this model. Interpretation: The interpretable machine learning model developed in this study using fused datasets with large sample sizes was robust and generalizable. This model outperformed the baseline models and several clinical scores for early prediction of mortality in elderly ICU patients. The interpretative nature of this model provided clinicians with the ranking of mortality risk features.

preprint2020arXiv

Joint Interleaver and Modulation Design For Multi-User SWIPT-NOMA

Radio frequency (RF) signals can be relied upon for conventional wireless information transfer (WIT) and for challenging wireless power transfer (WPT), which triggers the significant research interest in the topic of simultaneous wireless information and power transfer (SWIPT). By further exploiting the advanced non-orthogonal-multiple-access (NOMA) technique, we are capable of improving the spectrum efficiency of the resource-limited SWIPT system. In our SWIPT system, a hybrid access point (H-AP) superimposes the modulated symbols destined to multiple WIT users by exploiting the power-domain NOMA, while WPT users are capable of harvesting the energy carried by the superposition symbols. In order to maximise the amount of energy transferred to the WPT users, we propose a joint design of the energy interleaver and the constellation rotation based modulator in the symbol-block level by constructively superimposing the symbols destined to the WIT users in the power domain. Furthermore, a transmit power allocation scheme is proposed to guarantee the symbol-error-ratio (SER) of all the WIT users. By considering the sensitivity of practical energy harvesters, the simulation results demonstrate that our scheme is capable of substantially increasing the WPT performance without any remarkable degradation of the WIT performance.

preprint2020arXiv

The Steiner $k$-eccentricity on trees

We study the Steiner $k$-eccentricity on trees, which generalizes the previous one in the paper [X.~Li, G.~Yu, S.~Klavžar, On the average Steiner 3-eccentricity of trees, arXiv:2005.10319, 2020]. To support the algorithm, we achieve much stronger properties for the Steiner $k$-ecc tree than that in the previous paper. Based on this, a linear time algorithm is devised to calculate the Steiner $k$-eccentricity of a vertex in a tree. On the other hand, the lower and upper bounds of the average Steiner $k$-eccentricity index of a tree on order $n$ are established based on a novel technique which is quite different from that in the previous paper but much easier to follow.

preprint2019arXiv

Localization driven superradiant instability

The prominent Dicke superradiant phase arises from coupling an ensemble of atoms to cavity optical field when external optical pumping exceeds a threshold strength. Here we report a prediction of the superrandiant instability driven by Anderson localization, realized with a hybrid system of Dicke and Aubry-Andre (DAA) model for bosons trapped in a one-dimensional (1D) quasiperiodic optical lattice and coupled to a cavity. Our central finding is that for bosons condensed in localized phase given by the DAA model, the resonant superradiant scattering is induced, for which the critical optical pumping of superradiant phase transition approaches zero, giving an instability driven by Anderson localization. The superradiant phase for the DAA model with or without a mobility edge is investigated, showing that the localization driven superradiant instability is in sharp contrast to the superradiance as widely observed for Bose condensate in extended states, and should be insensitive to temperature of the system. This study unveils an insightful effect of localization on the Dicke superradiance, and is well accessible based on the current experiments.

preprint2019arXiv

Photonic hooks from Janus microcylinders

Recently, a type of curved light beams, photonic hooks (PHs), was theoretically predicted and experimentally observed. The production of photonic hook (PH) is due to the breaking of structural symmetry of a plane-wave illuminated microparticle. Herein, we presented and implemented a new approach, of utilizing the symmetry-broken of the microparticles in material composition, for the generation of PHs from Janus microcylinders. Finite element method based numerical simulation and energy flow diagram represented theoretical analysis were used to investigate the field distribution characteristics and formation mechanism of the PHs. The full width at half-maximum (FWHM) of the PH (~0.29$λ$) is smaller than the FWHM of the photonic nanojet (~0.35$λ$) formed from a circular microcylinder with the same geometric radius. By changing the refractive index contrasts between upper and lower half-cylinders, or rotating the Janus microcylinder relative to the central axis, the shape profiles of the PHs can be efficiently modulated. The tunability of the PHs through simple stretching or compression operations, for the Janus microcylinder constituted by one solid inorganic half-cylinder and the other flexible polymer half-cylinder, was studied and discussed as well.