Researcher profile

Siqi Li

Siqi Li contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

Rethinking Event-Based Object Dtection through Representation-Level Temporal Aggregation and Model-Level Hypergraph Reasoning

Event cameras provide microsecond-level temporal resolution, low latency, and high dynamic range, offering potential for perception under fast motion and challenging illumination conditions. However, existing Event-based Object Detection (EOD) methods face limitations at both the representation and model levels: prior event representations usually encode temporal information indirectly through redundant structures, while detection models struggle to explicitly aggregate fragmented event responses into coherent high-order object features. To address these limitations, we present Event Dual Temporal-Relational Aggregation Detector (Ev-DTAD), a unified EOD framework that integrates representation-level temporal encoding with model-level temporal-hypergraph reasoning. Specifically, we introduce Hierarchical Temporal Aggregation (HTA), a compact three-channel pseudo-RGB representation that explicitly embeds temporal information across intra- and inter-window events. To further enhance detection under sparse and fragmented event responses, we propose Frequency-aware Hypergraph Temporal Fusion (FHTF), which refines multi-scale event features through temporal evolution modeling and high-order relational reasoning. Extensive experiments on Gen1 (+0.8 mAP and 1.7$\times$ faster), 1Mpx/Gen4 (+0.5 mAP and 1.6$\times$ faster), and eTraM (+3.0 mAP and 2.0$\times$ faster) demonstrate that Ev-DTAD achieves a competitive accuracy-efficiency trade-off, validating the complementarity between compact temporal representation and temporal-hypergraph feature reasoning.

preprint2022arXiv

Bounding Kolmogorov distances through Wasserstein and related integral probability metrics

We establish general upper bounds on the Kolmogorov distance between two probability distributions in terms of the distance between these distributions as measured with respect to the Wasserstein or smooth Wasserstein metrics. These bounds generalise existing results from the literature. To illustrate the broad applicability of our general bounds, we apply them to extract Kolmogorov distance bounds from multivariate normal, beta and variance-gamma approximations that have been established in the Stein's method literature.

preprint2022arXiv

Deep Kernel Representation for Image Reconstruction in PET

Image reconstruction for positron emission tomography (PET) is challenging because of the ill-conditioned tomographic problem and low counting statistics. Kernel methods address this challenge by using kernel representation to incorporate image prior information in the forward model of iterative PET image reconstruction. Existing kernel methods construct the kernels commonly using an empirical process, which may lead to unsatisfactory performance. In this paper, we describe the equivalence between the kernel representation and a trainable neural network model. A deep kernel method is then proposed by exploiting a deep neural network to enable automated learning of an improved kernel model and is directly applicable to single subjects in dynamic PET. The training process utilizes available image prior data to form a set of robust kernels in an optimized way rather than empirically. The results from computer simulations and a real patient dataset demonstrate that the proposed deep kernel method can outperform the existing kernel method and neural network method for dynamic PET image reconstruction.

preprint2021arXiv

Attosecond Coherent Electron Motion in Auger-Meitner Decay

In quantum systems, coherent superpositions of electronic states evolve on ultrafast timescales (few femtosecond to attosecond, 1 as = 0.001 fs = 10^{-18} s), leading to a time dependent charge density. Here we exploit the first attosecond soft x-ray pulses produced by an x-ray free-electron laser to induce a coherent core-hole excitation in nitric oxide. Using an additional circularly polarized infrared laser pulse we create a clock to time-resolve the electron dynamics, and demonstrate control of the coherent electron motion by tuning the photon energy of the x-ray pulse. Core-excited states offer a fundamental test bed for studying coherent electron dynamics in highly excited and strongly correlated matter.

preprint2020arXiv

Attention-based Multi-modal Fusion Network for Semantic Scene Completion

This paper presents an end-to-end 3D convolutional network named attention-based multi-modal fusion network (AMFNet) for the semantic scene completion (SSC) task of inferring the occupancy and semantic labels of a volumetric 3D scene from single-view RGB-D images. Compared with previous methods which use only the semantic features extracted from RGB-D images, the proposed AMFNet learns to perform effective 3D scene completion and semantic segmentation simultaneously via leveraging the experience of inferring 2D semantic segmentation from RGB-D images as well as the reliable depth cues in spatial dimension. It is achieved by employing a multi-modal fusion architecture boosted from 2D semantic segmentation and a 3D semantic completion network empowered by residual attention blocks. We validate our method on both the synthetic SUNCG-RGBD dataset and the real NYUv2 dataset and the results show that our method respectively achieves the gains of 2.5% and 2.6% on the synthetic SUNCG-RGBD dataset and the real NYUv2 dataset against the state-of-the-art method.

preprint2020arXiv

Low-Dose CT Image Denoising Using Parallel-Clone Networks

Deep neural networks have a great potential to improve image denoising in low-dose computed tomography (LDCT). Popular ways to increase the network capacity include adding more layers or repeating a modularized clone model in a sequence. In such sequential architectures, the noisy input image and end output image are commonly used only once in the training model, which however limits the overall learning performance. In this paper, we propose a parallel-clone neural network method that utilizes a modularized network model and exploits the benefit of parallel input, parallel-output loss, and clone-toclone feature transfer. The proposed model keeps a similar or less number of unknown network weights as compared to conventional models but can accelerate the learning process significantly. The method was evaluated using the Mayo LDCT dataset and compared with existing deep learning models. The results show that the use of parallel input, parallel-output loss, and clone-to-clone feature transfer all can contribute to an accelerated convergence of deep learning and lead to improved image quality in testing. The parallel-clone network has been demonstrated promising for LDCT image denoising.

preprint2019arXiv

Attosecond Transient Absorption Spooktroscopy: a ghost imaging approach to ultrafast absorption spectroscopy

The recent demonstration of isolated attosecond pulses from an X-ray free-electron laser (XFEL) opens the possibility for probing ultrafast electron dynamics at X-ray wavelengths. An established experimental method for probing ultrafast dynamics is X-ray transient absorption spectroscopy, where the X-ray absorption spectrum is measured by scanning the central photon energy and recording the resultant photoproducts. The spectral bandwidth inherent to attosecond pulses is wide compared to the resonant features typically probed, which generally precludes the application of this technique in the attosecond regime. In this paper we propose and demonstrate a new technique to conduct transient absorption spectroscopy with broad bandwidth attosecond pulses with the aid of ghost imaging, recovering sub-bandwidth resolution in photoproduct-based absorption measurements.

preprint2019arXiv

Tunable Isolated Attosecond X-ray Pulses with Gigawatt Peak Power from a Free-Electron Laser

The quantum mechanical motion of electrons in molecules and solids occurs on the sub-femtosecond timescale. Consequently, the study of ultrafast electronic phenomena requires the generation of laser pulses shorter than 1 fs and of sufficient intensity to interact with their target with high probability. Probing these dynamics with atomic-site specificity requires the extension of sub-femtosecond pulses to the soft X-ray spectral region. Here we report the generation of isolated GW-scale soft X-ray attosecond pulses with an X-ray free-electron laser. Our source has a pulse energy that is six orders of magnitude larger than any other source of isolated attosecond pulses in the soft X-ray spectral region, with a peak power in the tens of gigawatts. This unique combination of high intensity, high photon energy and short pulse duration enables the investigation of electron dynamics with X-ray non-linear spectroscopy and single-particle imaging.