Researcher profile

Dahai Yu

Dahai Yu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2026arXiv

XStreamVGGT: Extremely Memory-Efficient Streaming Vision Geometry Grounded Transformer with KV Cache Compression

Learning-based 3D visual geometry models have benefited substantially from large-scale transformers. Among these, StreamVGGT leverages frame-wise causal attention for strong streaming reconstruction, but suffers from unbounded KV cache growth, leading to escalating memory consumption and inference latency as input frames accumulate. We propose XStreamVGGT, a tuning-free approach that systematically compresses the KV cache through joint pruning and quantization, enabling extremely memory-efficient streaming inference. Specifically, redundant KVs originating from multi-view inputs are pruned through efficient token importance identification, enabling a fixed memory budget. Leveraging the unique distribution of KV tensors, we incorporate KV quantization to further reduce memory consumption. Extensive evaluations show that XStreamVGGT achieves mostly negligible performance degradation while substantially reducing memory usage by 4.42$\times$ and accelerating inference by 5.48$\times$, enabling scalable and practical streaming 3D applications. The code is available at https://github.com/ywh187/XStreamVGGT/.

preprint2021arXiv

Estimation of transmitted wavefronts at defocused positions in a broad bandwidth range

Wavefront aberrations can reflect the imaging quality of high-performance optical systems better than geometric aberrations. Although laser interferometers have emerged as the main tool for measurement of transmitted wavefronts, their application is greatly limited, as they are typically designed for operation at specific wavelengths. In a previous study, we proposed a method for determining the wavefront transmitted by an optical system at any wavelength in a certain band. Although this method works well for most monochromatic systems, where the image plane is at the focal point for the transmission wavelength, for general multi-color systems, it is more practical to measure the wavefront at the defocused image plane. Hence, in this paper, we have developed a complete method for determining transmitted wavefronts in a broad bandwidth at any defocused position, enabling wavefront measurements for multi-color systems. Here, we assume that in small ranges, the Zernike coefficients have a linear relationship with position, such that Zernike coefficients at defocused positions can be derived from measurements performed at the focal point. We conducted experiments to verify these assumptions, validating the new method. The experimental setup has been improved so that it can handle multi-color systems, and a detailed experimental process is summarized. With this technique, application of broadband transmission wavefront measurement can be extended to most general optical systems, which is of great significance for characterization of achromatic and apochromatic optical lenses.

preprint2020arXiv

UDC 2020 Challenge on Image Restoration of Under-Display Camera: Methods and Results

This paper is the report of the first Under-Display Camera (UDC) image restoration challenge in conjunction with the RLQ workshop at ECCV 2020. The challenge is based on a newly-collected database of Under-Display Camera. The challenge tracks correspond to two types of display: a 4k Transparent OLED (T-OLED) and a phone Pentile OLED (P-OLED). Along with about 150 teams registered the challenge, eight and nine teams submitted the results during the testing phase for each track. The results in the paper are state-of-the-art restoration performance of Under-Display Camera Restoration. Datasets and paper are available at https://yzhouas.github.io/projects/UDC/udc.html.