Researcher profile

Shi Guo

Shi Guo contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2026arXiv

AsyncEvGS: Asynchronous Event-Assisted Gaussian Splatting for Handheld Motion-Blurred Scenes

3D reconstruction methods such as 3D Gaussian Splatting (3DGS) and Neural Radiance Fields (NeRF) achieve impressive photorealism but fail when input images suffer from severe motion blur. While event cameras provide high-temporal-resolution motion cues, existing event-assisted approaches rely on low-resolution sensors and strict synchronization, limiting their practicality for handheld 3D capture on common devices, such as smartphones. We introduce a flexible, high-resolution asynchronous RGB-Event dual-camera system and a corresponding reconstruction framework. Our approach first reconstructs sharp images from the event data and then employs a cross-domain pose estimation module based on the Visual Geometry Transformer (VGGT) to obtain robust initialization for 3DGS. During optimization, we employ a structure-driven event loss and view-specific consistency regularizers to mitigate the ill-posed behavior of traditional event losses and deblurring losses, ensuring both stable and high-fidelity reconstruction. We further contribute AsyncEv-Deblur, a new high-resolution RGB-Event dataset captured with our asynchronous system. Experiments demonstrate that our method achieves state-of-the-art performance on both our challenging dataset and existing benchmarks, substantially improving reconstruction robustness under severe motion blur. Project page: https://openimaginglab.github.io/AsyncEvGS/

preprint2022arXiv

A Differentiable Two-stage Alignment Scheme for Burst Image Reconstruction with Large Shift

Denoising and demosaicking are two essential steps to reconstruct a clean full-color image from the raw data. Recently, joint denoising and demosaicking (JDD) for burst images, namely JDD-B, has attracted much attention by using multiple raw images captured in a short time to reconstruct a single high-quality image. One key challenge of JDD-B lies in the robust alignment of image frames. State-of-the-art alignment methods in feature domain cannot effectively utilize the temporal information of burst images, where large shifts commonly exist due to camera and object motion. In addition, the higher resolution (e.g., 4K) of modern imaging devices results in larger displacement between frames. To address these challenges, we design a differentiable two-stage alignment scheme sequentially in patch and pixel level for effective JDD-B. The input burst images are firstly aligned in the patch level by using a differentiable progressive block matching method, which can estimate the offset between distant frames with small computational cost. Then we perform implicit pixel-wise alignment in full-resolution feature domain to refine the alignment results. The two stages are jointly trained in an end-to-end manner. Extensive experiments demonstrate the significant improvement of our method over existing JDD-B methods. Codes are available at https://github.com/GuoShi28/2StageAlign.

preprint2022arXiv

Distinctive g-factor of moire-confined excitons in van der Waals heterostructures

We investigated experimentally the valley Zeeman splitting of excitonic peaks in the photoluminescence (PL) spectra of high-quality hBN/WS2/MoSe2/hBN heterostructures at near-zero twist angles under perpendicular magnetic fields up to 20 T. We identify two neutral exciton peaks in the PL spectra: the lower energy one exhibits a reduced g-factor relative to that of the higher energy peak, and much lower than the recently reported values for interlayer excitons in other van der Waals (vdW) heterostructures. We provide evidence that such a discernible g-factor stems from the spatial confinement of the exciton in the potential landscape created by the moire pattern, due tolattice mismatch and/or inter-layer twist in heterobilayers. This renders magneto-PL an important tool to reach deeper understanding of the effect of moire patterns on excitonic confinement in vdW heterostructures.

preprint2022arXiv

Efficient Long-Range Attention Network for Image Super-resolution

Recently, transformer-based methods have demonstrated impressive results in various vision tasks, including image super-resolution (SR), by exploiting the self-attention (SA) for feature extraction. However, the computation of SA in most existing transformer based models is very expensive, while some employed operations may be redundant for the SR task. This limits the range of SA computation and consequently the SR performance. In this work, we propose an efficient long-range attention network (ELAN) for image SR. Specifically, we first employ shift convolution (shift-conv) to effectively extract the image local structural information while maintaining the same level of complexity as 1x1 convolution, then propose a group-wise multi-scale self-attention (GMSA) module, which calculates SA on non-overlapped groups of features using different window sizes to exploit the long-range image dependency. A highly efficient long-range attention block (ELAB) is then built by simply cascading two shift-conv with a GMSA module, which is further accelerated by using a shared attention mechanism. Without bells and whistles, our ELAN follows a fairly simple design by sequentially cascading the ELABs. Extensive experiments demonstrate that ELAN obtains even better results against the transformer-based SR models but with significantly less complexity. The source code can be found at https://github.com/xindongzhang/ELAN.

preprint2022arXiv

Image-Adaptive YOLO for Object Detection in Adverse Weather Conditions

Though deep learning-based object detection methods have achieved promising results on the conventional datasets, it is still challenging to locate objects from the low-quality images captured in adverse weather conditions. The existing methods either have difficulties in balancing the tasks of image enhancement and object detection, or often ignore the latent information beneficial for detection. To alleviate this problem, we propose a novel Image-Adaptive YOLO (IA-YOLO) framework, where each image can be adaptively enhanced for better detection performance. Specifically, a differentiable image processing (DIP) module is presented to take into account the adverse weather conditions for YOLO detector, whose parameters are predicted by a small convolutional neural net-work (CNN-PP). We learn CNN-PP and YOLOv3 jointly in an end-to-end fashion, which ensures that CNN-PP can learn an appropriate DIP to enhance the image for detection in a weakly supervised manner. Our proposed IA-YOLO approach can adaptively process images in both normal and adverse weather conditions. The experimental results are very encouraging, demonstrating the effectiveness of our proposed IA-YOLO method in both foggy and low-light scenarios.

preprint2021arXiv

Excited State Spectroscopy of Boron Vacancy Defects in Hexagonal Boron Nitride using Time-Resolved Optically Detected Magnetic Resonance

We report optically detected magnetic resonance (ODMR) measurements of an ensemble of spin-1 negatively charged boron vacancies in hexagonal boron nitride. The photoluminescence decay rates are spin-dependent, with inter-system crossing rates of $1.02~\mathrm{ns^{-1}}$ and $2.03~\mathrm{ns^{-1}}$ for the $m_s=0$ and $m_s=\pm 1$ states, respectively. Time-gating the photoluminescence enhances the ODMR contrast by discriminating between different decay rates. This is particularly effective for detecting the spin of the optically excited state, where a zero-field splitting of $\vert D_{ES}\vert=2.09~\mathrm{GHz}$ is measured. The magnetic field dependence of the time-gated photoluminescence exhibits dips corresponding to the Ground (GSLAC) and excited-state (ESLAC) anti-crossings. Additional dips corresponding to anti-crossings with nearby spin-1/2 parasitic impurities are also observed. The ESLAC dip is sensitive to the angle of the external magnetic field. Comparison to a model suggests that the anti-crossings are mediated by the interaction with nuclear spins, and allow an estimate of the ratio of the spin-dependent relaxation rates from the singlet back into the triplet ground state of $κ_0/κ_1=0.34$. This work provides important spectroscopic signatures of the boron vacancy, and information on the spin pumping and read-out dynamics.