Researcher profile

Lei Xiao

Lei Xiao contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
15works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

15 published item(s)

preprint2026arXiv

FEDIN: Frequency-Enhanced Deep Interest Network for Click-Through Rate Prediction

Sequential recommendation models often struggle to capture latent periodic patterns in user interests, primarily due to the noise inherent in time-domain behavioral data. While frequency-domain analysis offers a global perspective to address this, existing approaches typically treat user sequences in isolation, overlooking the crucial context of the target item. In this work, we present a novel empirical observation: user attention scores exhibit distinct spectral entropy distributions when conditioned on positive versus negative target items. Specifically, true user interests manifest as highly concentrated spectral patterns with lower entropy in the frequency domain, whereas irrelevant behaviors appear as high-entropy noise. Leveraging this insight, we propose the Frequency-Enhanced Deep Interest Network (FEDIN). FEDIN introduces a frequency-domain branch that utilizes a target-aware spectrum filtering mechanism to isolate these periodic interest signals. Extensive experiments on three public datasets demonstrate that FEDIN consistently outperforms state-of-the-art sequential recommendation baselines, demonstrating superior robustness against noise. We have released our code at: https://github.com/otokoneko/FEDIN.

preprint2023arXiv

ESGReveal: An LLM-based approach for extracting structured data from ESG reports

ESGReveal is an innovative method proposed for efficiently extracting and analyzing Environmental, Social, and Governance (ESG) data from corporate reports, catering to the critical need for reliable ESG information retrieval. This approach utilizes Large Language Models (LLM) enhanced with Retrieval Augmented Generation (RAG) techniques. The ESGReveal system includes an ESG metadata module for targeted queries, a preprocessing module for assembling databases, and an LLM agent for data extraction. Its efficacy was appraised using ESG reports from 166 companies across various sectors listed on the Hong Kong Stock Exchange in 2022, ensuring comprehensive industry and market capitalization representation. Utilizing ESGReveal unearthed significant insights into ESG reporting with GPT-4, demonstrating an accuracy of 76.9% in data extraction and 83.7% in disclosure analysis, which is an improvement over baseline models. This highlights the framework's capacity to refine ESG data analysis precision. Moreover, it revealed a demand for reinforced ESG disclosures, with environmental and social data disclosures standing at 69.5% and 57.2%, respectively, suggesting a pursuit for more corporate transparency. While current iterations of ESGReveal do not process pictorial information, a functionality intended for future enhancement, the study calls for continued research to further develop and compare the analytical capabilities of various LLMs. In summary, ESGReveal is a stride forward in ESG data processing, offering stakeholders a sophisticated tool to better evaluate and advance corporate sustainability efforts. Its evolution is promising in promoting transparency in corporate reporting and aligning with broader sustainable development aims.

preprint2022arXiv

Experimental realization of a quantum image classifier via tensor-network-based machine learning

Quantum machine learning aspires to overcome intractability that currently limits its applicability to practical problems. However, quantum machine learning itself is limited by low effective dimensions achievable in state-of-the-art experiments. Here we demonstrate highly successful classifications of real-life images using photonic qubits, combining a quantum tensor-network representation of hand-written digits and entanglement-based optimization. Specifically, we focus on binary classification for hand-written zeroes and ones, whose features are cast into the tensor-network representation, further reduced by optimization based on entanglement entropy and encoded into two-qubit photonic states. We then demonstrate image classification with a high success rate exceeding 98%, through successive gate operations and projective measurements. Although we work with photons, our approach is amenable to other physical realizations such as nitrogen-vacancy centers, nuclear spins and trapped ions, and our scheme can be scaled to efficient multi-qubit encodings of features in the tensor-product representation, thereby setting the stage for quantum-enhanced multi-class classification.

preprint2022arXiv

Mixture of Virtual-Kernel Experts for Multi-Objective User Profile Modeling

In industrial applications like online advertising and recommendation systems, diverse and accurate user profiles can greatly help improve personalization. Deep learning is widely applied to mine expressive tags to users from their historical interactions in the system, e.g., click, conversion action in the advertising chain. The usual approach is to take a certain action as the objective, and introduce multiple independent Two-Tower models to predict the possibility of users' action on tags (known as CTR or CVR prediction). The predicted users' high probably attractive tags are to represent their preferences. However, the single-action models cannot learn complementarily and support effective training on data-sparse actions. Besides, limited by the lack of information fusion between the two towers, the model learns insufficiently to represent users' preferences on various tag \textbf{topics} well. This paper introduces a novel multi-task model called Mixture of Virtual-Kernel Experts (MVKE) to learn user preferences on various actions and topics unitedly. In MVKE, we propose a concept of Virtual-Kernel Expert, which focuses on modeling one particular facet of the user's preferences, and all of them learn coordinately. Besides, the gate-based structure used in MVKE builds an information fusion bridge between two towers, improving the model's capability and maintaining high efficiency. We apply the model in Tencent Advertising System, where both online and offline evaluations show that our method has a significant improvement compared with the existing ones and brings about an obvious lift to actual advertising revenue.

preprint2022arXiv

NeuralPassthrough: Learned Real-Time View Synthesis for VR

Virtual reality (VR) headsets provide an immersive, stereoscopic visual experience, but at the cost of blocking users from directly observing their physical environment. Passthrough techniques are intended to address this limitation by leveraging outward-facing cameras to reconstruct the images that would otherwise be seen by the user without the headset. This is inherently a real-time view synthesis challenge, since passthrough cameras cannot be physically co-located with the eyes. Existing passthrough techniques suffer from distracting reconstruction artifacts, largely due to the lack of accurate depth information (especially for near-field and disoccluded objects), and also exhibit limited image quality (e.g., being low resolution and monochromatic). In this paper, we propose the first learned passthrough method and assess its performance using a custom VR headset that contains a stereo pair of RGB cameras. Through both simulations and experiments, we demonstrate that our learned passthrough method delivers superior image quality compared to state-of-the-art methods, while meeting strict VR requirements for real-time, perspective-correct stereoscopic view synthesis over a wide field of view for desktop-connected headsets.

preprint2022arXiv

SNeRF: Stylized Neural Implicit Representations for 3D Scenes

This paper presents a stylized novel view synthesis method. Applying state-of-the-art stylization methods to novel views frame by frame often causes jittering artifacts due to the lack of cross-view consistency. Therefore, this paper investigates 3D scene stylization that provides a strong inductive bias for consistent novel view synthesis. Specifically, we adopt the emerging neural radiance fields (NeRF) as our choice of 3D scene representation for their capability to render high-quality novel views for a variety of scenes. However, as rendering a novel view from a NeRF requires a large number of samples, training a stylized NeRF requires a large amount of GPU memory that goes beyond an off-the-shelf GPU capacity. We introduce a new training method to address this problem by alternating the NeRF and stylization optimization steps. Such a method enables us to make full use of our hardware memory capacity to both generate images at higher resolution and adopt more expressive image style transfer methods. Our experiments show that our method produces stylized NeRFs for a wide range of content, including indoor, outdoor and dynamic scenes, and synthesizes high-quality novel views with cross-view consistency.

preprint2021arXiv

Detecting non-Bloch topological invariants in quantum dynamics

Non-Bloch topological invariants preserve the bulk-boundary correspondence in non-Hermitian topological systems, and are a key concept in the contemporary study of non-Hermitian topology. Here we report the dynamic detection of non-Bloch topological invariants in single-photon quantum walks, revealed through the biorthogonal chiral displacement, and crosschecked with the dynamic spin textures in the generalized quasimomentum-time domain following a quantum quench. Both detection schemes are robust against symmetry-preserving disorders, and yield consistent results with theoretical predictions. Our experiments are performed far away from any boundaries, and therefore underline non-Bloch topological invariants as intrinsic properties of the system that persist in the thermodynamic limit. Our work sheds new light on the experimental investigation of non-Hermitian topology.

preprint2021arXiv

Quantum information dynamics in a high-dimensional parity-time-symmetric system

Non-Hermitian systems with parity-time ($\mathcal{PT}$) symmetry give rise to exceptional points (EPs) with exceptional properties that arise due to the coalescence of eigenvectors. Such systems have been extensively explored in the classical domain, where second or higher order EPs have been proposed or realized. In contrast, quantum information studies of $\mathcal{PT}$-symmetric systems have been confined to systems with a two-dimensional Hilbert space. Here by using a single-photon interferometry setup, we simulate quantum dynamics of a four-dimensional $\mathcal{PT}$-symmetric system across a fourth-order exceptional point. By tracking the coherent, non-unitary evolution of the density matrix of the system in $\mathcal{PT}$-symmetry unbroken and broken regions, we observe the entropy dynamics for both the entire system, and the gain and loss subsystems. Our setup is scalable to the higher-dimensional $\mathcal{PT}$-symmetric systems, and our results point towards the rich dynamics and critical properties.

preprint2020arXiv

A feature-supervised generative adversarial network for environmental monitoring during hazy days

The adverse haze weather condition has brought considerable difficulties in vision-based environmental applications. While, until now, most of the existing environmental monitoring studies are under ordinary conditions, and the studies of complex haze weather conditions have been ignored. Thence, this paper proposes a feature-supervised learning network based on generative adversarial networks (GAN) for environmental monitoring during hazy days. Its main idea is to train the model under the supervision of feature maps from the ground truth. Four key technical contributions are made in the paper. First, pairs of hazy and clean images are used as inputs to supervise the encoding process and obtain high-quality feature maps. Second, the basic GAN formulation is modified by introducing perception loss, style loss, and feature regularization loss to generate better results. Third, multi-scale images are applied as the input to enhance the performance of discriminator. Finally, a hazy remote sensing dataset is created for testing our dehazing method and environmental detection. Extensive experimental results show that the proposed method has achieved better performance than current state-of-the-art methods on both synthetic datasets and real-world remote sensing images.

preprint2020arXiv

Conserved quantities in parity-time symmetric systems

Conserved quantities such as energy or the electric charge of a closed system, or the Runge-Lenz vector in Kepler dynamics are determined by its global, local, or accidental symmetries. They were instrumental to advances such as the prediction of neutrinos in the (inverse) beta decay process and the development of self-consistent approximate methods for isolated or thermal many-body systems. In contrast, little is known about conservation laws and their consequences in open systems. Recently, a special class of these systems, called parity-time ($\mathcal{PT}$) symmetric systems, has been intensely explored for their remarkable properties that are absent in their closed counterparts. A complete characterization and observation of conserved quantities in these systems and their consequences is still lacking. Here we present a complete set of conserved observables for a broad class of $\mathcal{PT}$-symmetric Hamiltonians and experimentally demonstrate their properties using single-photon linear optical circuit. By simulating the dynamics of a four-site system across a fourth-order exceptional point, we measure its four conserved quantities and demonstrate their consequences. Our results spell out non-local conservation laws in non-unitary dynamics and provide key elements that will underpin self-consistent analysis of non-Hermitian quantum many-body systems that are forthcoming.

preprint2020arXiv

Experimental demonstration of one-sided device-independent self-testing of any pure two-qubit entangled state

We demonstrate one-sided device-independent self-testing of any pure entangled two-qubit state based on a fine-grained steering inequality. The maximum violation of a fine-grained steering inequality can be used to witness certain steerable correlations, which certify all pure two-qubit entangled states. Our experimental results identify which particular pure two-qubit entangled state has been self-tested and which measurement operators are used on the untrusted side. Furthermore, we analytically derive the robustness bound of our protocol, enabling our subsequent experimental verification of robustness through state tomography. Finally, we ensure that the requisite no-signalling constraints are maintained in the experiment.

preprint2020arXiv

Experimental entropic test of state-independent contextuality via single photons

Recently, an inequality satisfied by non-contextual hidden-variable models and violated by quantum mechanics for all states of a four-level system has been derived based on information-theoretic distance approach to non-classical correlations. In this work, we experimentally demonstrate violation of this inequality with single photons. Our experiment offers a method to study a distinction between quantum and classical correlations from an information-theoretic perspective.

preprint2020arXiv

Field-aware Calibration: A Simple and Empirically Strong Method for Reliable Probabilistic Predictions

It is often observed that the probabilistic predictions given by a machine learning model can disagree with averaged actual outcomes on specific subsets of data, which is also known as the issue of miscalibration. It is responsible for the unreliability of practical machine learning systems. For example, in online advertising, an ad can receive a click-through rate prediction of 0.1 over some population of users where its actual click rate is 0.15. In such cases, the probabilistic predictions have to be fixed before the system can be deployed. In this paper, we first introduce a new evaluation metric named field-level calibration error that measures the bias in predictions over the sensitive input field that the decision-maker concerns. We show that existing post-hoc calibration methods have limited improvements in the new field-level metric and other non-calibration metrics such as the AUC score. To this end, we propose Neural Calibration, a simple yet powerful post-hoc calibration method that learns to calibrate by making full use of the field-aware information over the validation set. We present extensive experiments on five large-scale datasets. The results showed that Neural Calibration significantly improves against uncalibrated predictions in common metrics such as the negative log-likelihood, Brier score and AUC, as well as the proposed field-level calibration error.

preprint2019arXiv

Direct Experimental Test of Forward and Reverse Uncertainty Relations

The canonical Robertson-Schrödinger uncertainty relation provides a loose bound for the product of variances of two non-commuting observables. Recently, several tight forward and reverse uncertainty relations have been proved which go beyond the traditional uncertainty relations. Here we experimentally test multifold of state-dependent uncertainty relations for the product as well as the sum of variances of two incompatible observables for photonic qutrits. These uncertainty relations are independent of any optimization and still tighter than the Robertson-Schrödinger uncertainty relation and other ones found in current literatures. For the first time, we also test the state-dependent reverse uncertainty relations for the sum of variances of two incompatible observables, which implies another unique feature of preparation uncertainty in quantum mechanics. Our experimental results not only foster insight into a fundamental limitation on preparation uncertainty but also may contribute to the study of upper limit of precession measurements for incompatible observables.

preprint2019arXiv

Observation of non-Hermitian bulk-boundary correspondence in quantum dynamics

Bulk-boundary correspondence, a central principle in topological matter relating bulk topological invariants to edge states, breaks down in a generic class of non-Hermitian systems that have so far eluded experimental effort. Here we theoretically predict and experimentally observe non-Hermitian bulk-boundary correspondence, a fundamental generalization of the conventional bulk-boundary correspondence, in discrete-time non-unitary quantum-walk dynamics of single photons. We experimentally demonstrate photon localizations near boundaries even in the absence of topological edge states, thus confirming the non-Hermitian skin effect. Facilitated by our experimental scheme of edge-state reconstruction, we directly measure topological edge states, which match excellently with non-Bloch topological invariants calculated from localized bulk-state wave functions. Our work unequivocally establishes the non-Hermitian bulk-boundary correspondence as a general principle underlying non-Hermitian topological systems, and paves the way for a complete understanding of topological matter in open systems.