Researcher profile

Yuchen He

Yuchen He contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
11works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

11 published item(s)

preprint2026arXiv

PRIMED: Adaptive Modality Suppression for Referring Audio-Visual Segmentation via Biased Competition

Referring Audio-Visual Segmentation (Ref-AVS) seeks to localize and segment target objects in video frames based on visual, auditory, and textual referring cues. The task is challenging because the relevance of different modalities varies across referring expressions and scenes, while existing methods typically treat multimodal cues as homogeneous inputs for fusion, prompting, or reasoning, making them vulnerable to irrelevant or misleading modalities. To address this problem, we propose PRIMED, inspired by the biased competition theory in cognitive neuroscience, which explicitly models both visual perception and language-driven prior modulation, and enables more accurate Ref-AVS by adaptive modality suppression. Specifically, a Modality Prior Decoder first estimates whether the referring expression relies primarily on audio, vision, or their joint interaction, generating a modality prior to adaptively guide high-level attention. A Token Distiller further extracts compact global visual tokens from high-level features and shares them across Competition-aware Cross-modal Fusion modules to provide hierarchical global context. Additionally, we introduce a Spatial-Aware Semantic Alignment loss to further enhance foreground-background discrimination through contrastive learning. Extensive experiments on the Ref-AVS benchmark demonstrate that PRIMED achieves state-of-the-art overall performance.

preprint2022arXiv

Extreme wave excitation from localized phase-shift perturbations

The modulation instability is a focusing mechanism responsible for the formation of strong wave localizations not only on the water surface, but also in a variety of nonlinear dispersive media. Such dynamics is initiated from the injection of side-bands, which translate into an amplitude modulation of the wave field. The nonlinear stage of unstable wave evolution can be described by exact solutions of the nonlinear Schrödinger equation (NLSE). In that case, the amplitude modulation of such coherent extreme wave structures is connected to a particular phase-shift seed in the carrier wave. In this letter, we show that phase-shifts localization applied to the background, excluding any amplitude modulation excitation, can indeed trigger extreme events. Such rogue waves can be for instance generated by considering the parametrization of fundamental breathers and thus, by seeding only the local phase-shift information to the regular carrier wave. Our wave tank experiments show an excellent agreement with the expected NLSE hydrodynamics and confirm that even though delayed in their evolution, breather-type extreme waves can be generated from a purely regular wave train. Such novel focusing mechanism awaits experimental confirmation in other nonlinear media, such optics, plasma, and Bose-Einstein condensates.

preprint2022arXiv

Galilean-transformed solitons and supercontinuum generation in dispersive media

The Galilean transformation is a universal operation connecting the coordinates of a dynamical system, which move relative to each other with a constant speed. In the context of exact solutions of the universal nonlinear Schrödinger equation (NLSE), inducing a Galilean velocity (GV) to the pulse involves a frequency shift to satisfy the symmetry of the wave equation. As such, the Galilean transformation has been deemed to be not applicable to wave groups in nonlinear dispersive media. In this paper, we demonstrate that in a wave tank generated Galilean transformed envelope and Peregrine solitons show clear variations from their respective pure dynamics on the water surface. The type of deviations depends on the sign of the GV and can be captured by the modified NLSE or the Euler equations. Moreover, we show that positive Galilean-translated envelope soliton pulses exhibit self-modulation. While designated GS and wave steepness values expedite multi-soliton dynamics, the strong focusing of such higher-order coherent waves inevitably lead to the generation of supercontinua as a result of soliton fission. We anticipate that kindred experimental and numerical studies might be implemented in other dispersive wave guides governed by nonlinearity.

preprint2021arXiv

Generative-Adversarial-Networks-based Ghost Recognition

Nowadays, target recognition technique plays an important role in many fields. However, the current target image information based methods suffer from the influence of image quality and the time cost of image reconstruction. In this paper, we propose a novel imaging-free target recognition method combining ghost imaging (GI) and generative adversarial networks (GAN). Based on the mechanism of GI, a set of random speckles sequence is employed to illuminate target, and a bucket detector without resolution is utilized to receive echo signal. The bucket signal sequence formed after continuous detections is constructed into a bucket signal array, which is regarded as the sample of GAN. Then, conditional GAN is used to map bucket signal array and target category. In practical application, the speckles sequence in training step is employed to illuminate target, and the bucket signal array is input GAN for recognition. The proposed method can improve the problems caused by conventional recognition methods that based on target image information, and provide a certain turbulence-free ability. Extensive experiments show that the proposed method achieves promising performance.

preprint2021arXiv

Ghost Imaging Based on Recurrent Neural Network

Benefit from the promising features of second-order correlation, ghost imaging (GI) has received extensive attentions in recent years. Simultaneously, GI is affected by the poor trade-off between sampling rate and imaging quality. The traditional image reconstruction method in GI is to accumulate the action result of each speckle and the corresponding bucket signal. We found that the image reconstruction process of GI is very similar to the Recurrent Neural Network (RNN), which is one of the deep learning algorithm. In this paper, we proposed a novel method that effectively implements GI on the RNN architecture, called GI-RNN. The state of each layer in RNN is determined by the output of the previous layer and the input of this layer, and the output of the network is the sum of all previous states. Therefore, we take the speckle of each illumination and the corresponding bucket signal as the input of each layer, and the output of the network is the sum of all previous speckle and bucket signal, which is the image of the target. The testing results show that the proposed method can achieve image reconstruction at a very low sampling rate (0.38$\%$). Moreover, we compare GI-RNN with traditional GI algorithm and compressed sensing algorithm. The results of different targets show that GI-RNN is 6.61 dB higher than compressed sensing algorithm and 12.58 dB higher than traditional GI algorithm on average. In our view, the proposed method makes an important step to applications of GI.

preprint2021arXiv

Nth-order nonlinear intensity fluctuation amplifier

Stronger light intensity fluctuations are pursued by related applications such as optical resolution, image enhancement, and beam positioning. In this paper, an Nth-order light intensity fluctuation amplifier is proposed, which was demonstrated by a four-wave mixing process with different statistical distribution coupling lights. Firstly, its amplification mechanism is revealed both theoretically and experimentally. The ratio $R$ of statistical distributions and the degree of second-order coherence ${g^{(2)}}(0)$ of beams are used to characterize the affected modulations and the increased light intensity fluctuations through the four-wave mixing process. The results show that the amplification of light intensity fluctuations is caused by not only the fluctuating light fields of incident coupling beams, but also the fluctuating nonlinear coefficient of interaction. At last, we highlight the potentiality of applying such amplifier to other N-order nonlinear optical effects.

preprint2021arXiv

Ptychography Intensity Interferometry Imaging for Dynamic Distant Object

As a promising lensless imaging method for distance objects, intensity interferometry imaging (III) had been suffering from the unreliable phase retrieval process, hindering the development of III for decades. Recently, the introduction of the ptychographic detection in III overcame this challenge, and a method called ptychographic III (PIII) was proposed. We here experimentally demonstrate that PIII can image a dynamic distance object. A reasonable image for the moving object can be retrieved with only two speckle patterns for each probe, and only 10 to 20 iterations are needed. Meanwhile, PIII exhibits robust to the inaccurate information of the probe. Furthermore, PIII successfully recovers the image through a fog obfuscating the imaging light path, under which a conventional camera relying on lenses fails to provide a recognizable image.

preprint2020arXiv

Curvature Regularized Surface Reconstruction from Point Cloud

We propose a variational functional and fast algorithms to reconstruct implicit surface from point cloud data with a curvature constraint. The minimizing functional balances the distance function from the point cloud and the mean curvature term. Only the point location is used, without any local normal or curvature estimation at each point. With the added curvature constraint, the computation becomes particularly challenging. To enhance the computational efficiency, we solve the problem by a novel operator splitting scheme. It replaces the original high-order PDEs by a decoupled PDE system, which is solved by a semi-implicit method. We also discuss approach using an augmented Lagrangian method. The proposed method shows robustness against noise, and recovers concave features and sharp corners better compared to models without curvature constraint. Numerical experiments in two and three dimensional data sets, noisy and sparse data are presented to validate the model.

preprint2020arXiv

Silhouette Vectorization by Affine Scale-space

Silhouettes or 2D planar shapes are extremely important in human communication, which involves many logos, graphics symbols and fonts in vector form. Many more shapes can be extracted from image by binarization or segmentation, thus in raster form that requires a vectorization. There is a need for disposing of a mathematically well defined and justified shape vectorization process, which in addition provides a minimal set of control points with geometric meaning. In this paper we propose a silhouette vectorization method which extracts the outline of a 2D shape from a raster binary image, and converts it to a combination of cubic Bézier polygons and perfect circles. Starting from the boundary curvature extrema computed at sub-pixel level, we identify a set of control points based on the affine scale-space induced by the outline. These control points capture similarity invariant geometric features of the given silhouette and give precise locations of the shape's corners.of the given silhouette. Then, piecewise Bézier cubics are computed by least-square fitting combined with an adaptive splitting to guarantee a predefined accuracy. When there are no curvature extrema identified, either the outline is recognized as a circle using the isoperimetric inequality, or a pair of the most distant outline points are chosen to initiate the fitting. Given their construction, most of our control points are geometrically stable under affine transformations. By comparing with other feature detectors, we show that our method can be used as a reliable feature point detector for silhouettes. Compared to state-of-the-art image vectorization software, our algorithm demonstrates superior reduction on the number of control points, while maintaining high accuracy.

preprint2020arXiv

Two-photon superbunching effect of broadband chaotic stationary light at femtosecond timescale based on cascaded Michelson interferometer

It is challenging for observing superbunching effect with true chaotic light, here we propose and demonstrate a method to achieve superbunching effect of the degree of second-order coherence is 2.42 with broadband stationary chaotic light based on a cascaded Michelson interferometer (CMI), exceeding the theoretical upper limit of 2 for the two-photon bunching effect of chaotic light. The superbunching correlation peak is measured with an ultrafast two-photon absorption detector which the full width at half maximum reaches about 95 fs. Two-photon superbunching theory in a CMI is developed to interpret the effect and is in agreement with experimental results. The theory also predicts that the degree of second-order coherence can be much greater than $2$ if chaotic light propagates $N$ times in a CMI. Finally, a new type of weak signals detection setup which employs broadband chaotic light circulating in a CMI is proposed. Theoretically, it can increase the detection sensitivity of weak signals 79 times after the chaotic light circulating 100 times in the CMI.

preprint2019arXiv

Bifrequency 3D Ghost Imaging with Haar Wavelet Transform

Recently, ghost imaging has been attracting attentions because its mechanism would lead to many applications inaccessible to conventional imaging methods. However, it is challenging for high contrast and high resolution imaging, due to its low signal-to-noise ratio (SNR) and the demand of high sampling rate in detection. To circumvent these challenges, we here propose a ghost imaging scheme that exploits Haar wavelets as illuminating patterns with a bi-frequency light projecting system and frequency-selecting single-pixel detectors. This method provides a theoretically 100% image contrast and high detection SNR, which reduces the requirement of high dynamic range of detectors, enabling high resolution ghost imaging. Moreover, it can highly reduce the sampling rate (far below Nyquist limit) for a sparse object by adaptively abandoning unnecessary patterns during the measurement. These characteristics are experimentally verified with a resolution of 512 times 512 and a sampling rate lower than 5%. A high-resolution (1000 times 1000 times 1000) 3D reconstruction of an object is also achieved from multi-angle images.