Source author record

Ziwei Li

Ziwei Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

physics.ao-ph physics.optics astro-ph.CO Machine Learning physics.geo-ph

Catalog footprint

What is connected

6works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Gradient Starvation in Binary-Reward GRPO: Why Group-Mean Centering Fails and Why the Simplest Fix Works

Group Relative Policy Optimization (GRPO) is a standard algorithm for reinforcement learning from verifiable rewards, but its group-mean-centered advantage can fail under binary rewards. The failure mode is gradient starvation: when every response in a group is correct or every response is wrong, the centered advantage is exactly zero and the policy receives no learning signal. We prove that the true degeneracy rate always exceeds the i.i.d. Bernoulli prediction by Jensen's inequality, and observe a 0.69 degeneracy rate at group size four in logged Qwen3.5-9B GSM8K training. We then show that the fixed-reference Sign advantage, $A=2r-1$, performs pass@$G$ failure descent by increasing the probability that at least one sample in the group succeeds. On the full GSM8K test set across seven seeds, Sign reaches 73.8% accuracy versus 28.4% for standard normalized group-mean DrGRPO at group size four, a 45.4 point gain with $p<0.0001$. The effect is directionally consistent on Llama-3.1-8B and positive but underpowered on a MATH-500 transfer check. Pass@$k$ analysis indicates that the main benefit is search compression rather than large capacity expansion, aligning the empirical gains with recent RLVR ceiling observations.

preprint2022arXiv

Study of Efficient Photonic Chromatic Dispersion Equalization Using MZI-Based Coherent Optical Matrix Multiplication

We propose and study an efficient photonic CDE method using MZI-based coherent optical matrix multiplication. It improves the compensation performance by about 60% when the tap-length is limited, and only 50% taps of the theoretical value is needed for photonic CDE with 1-dB penalty.

preprint2021arXiv

Tropical precipitation clusters as islands on a rough water-vapor topography

Tropical precipitation clusters exhibit power-law frequency distributions in area and volume (integrated precipitation), implying a lack of characteristic scale in tropical convective organization. However, it remains unknown what gives rise to the power laws and how the power-law exponents for area and volume are related to one another. Here, we explore the perspective that precipitation clusters are islands above a convective threshold on a rough column-water-vapor (CWV) topography. This perspective is supported by the agreement between the precipitation clusters and CWV islands in their frequency distributions as well as fractal dimensions. Power laws exist for CWV islands at different thresholds through the CWV topography, suggesting that the existence of power-laws is not specifically related to local precipitation dynamics, but is rather a general feature of CWV islands. Furthermore, the frequency distributions and fractal dimensions of the clusters can be reproduced when the CWV field is modeled to be self-affine with a roughness exponent of 0.3. Self-affine scaling theory relates the statistics of precipitation clusters to the roughness exponent; it also relates the power-law slopes for area and volume without involving the roughness exponent. Thus, the perspective of precipitation clusters as CWV islands provides a useful framework to consider many statistical properties of the precipitation clusters, particularly given that CWV is well-observed over a wide range of length scales in the tropics. However, the statistics of CWV islands at the convective threshold imply a smaller roughness than is inferred from the power spectrum of the bulk CWV field, and further work is needed to understand the scaling of the CWV field.

preprint2020arXiv

Deep Learning for Strong Lensing Search: Tests of the Convolutional Neural Networks and New Candidates from KiDS DR3

Convolutional Neutral Networks have been successfully applied in searching for strong lensing systems, leading to discoveries of new candidates from large surveys. On the other hand, systematic investigations about their robustness are still lacking. In this paper, we first construct a neutral network, and apply it to $r$-band images of Luminous Red Galaxies (LRGs) of the Kilo Degree Survey (KiDS) Data Release 3 to search for strong lensing systems. We build two sets of training samples, one fully from simulations, and the other one using the LRG stamps from KiDS observations as the foreground lens images. With the former training sample, we find 48 high probability candidates after human-inspection, and among them, 27 are newly identified. Using the latter training set, about 67\% of the aforementioned 48 candidates are also found, and there are 11 more new strong lensing candidates identified. We then carry out tests on the robustness of the network performance with respect to the variation of PSF. With the testing samples constructed using PSF in the range of 0.4 to 2 times of the median PSF of the training sample, we find that our network performs rather stable, and the degradation is small. We also investigate how the volume of the training set can affect our network performance by varying it from 0.1 millions to 0.8 millions. The output results are rather stable showing that within the considered range, our network performance is not very sensitive to the volume size.

preprint2020arXiv

Response of Vertical Velocities in Extratropical Precipitation Extremes to Climate Change

Precipitation extremes intensify in most regions in climate-model projections. Changes in vertical velocities contribute to the changes in intensity of precipitation extremes but remain poorly understood. Here, we find that mid-tropospheric vertical velocities in extratropical precipitation extremes strengthen overall in simulations of 21st-century climate change. For each extreme event, we solve the quasi-geostrophic omega equation to decompose this strengthening into different physical contributions. We first consider a dry decomposition in which latent heating is treated as an external forcing of upward motion. Much of the positive contribution to upward motion from increased latent heating is offset by negative contributions from increases in dry static stability and changes in the horizontal length scale of vertical velocities. However, taking changes in latent heating as given is a limitation when the aim is to understand changes in precipitation, since latent heating and precipitation are closely linked. Therefore, we also perform a moist decomposition of the changes in vertical velocities in which latent heating is represented through a moist static stability. In the moist decomposition, changes in moist static stability play a key role and contributions from other factors such as changes in the depth of the upward motion increase in importance. While both dry and moist decompositions are self-consistent, the moist dynamical perspective has greater potential to give insights into the causes of the dynamical contributions to changes in precipitation extremes in different regions.

preprint2015arXiv

Multispectral imaging using a single bucket detector

Current multispectral imagers suffer from low photon efficiency and limited spectrum range. These limitations are partially due to the technological limitations from array sensors (CCD or CMOS), and also caused by separative measurement of the entries/slices of a spatial-spectral data cube. Besides, they are mostly expensive and bulky. To address above issues, this paper proposes to image the 3D multispectral data with a single bucket detector in a multiplexing way. Under the single pixel imaging scheme, we project spatial-spectral modulated illumination onto the target scene to encode the scene's 3D information into a 1D measurement sequence. Conventional spatial modulation is used to resolve the scene's spatial information. To avoid increasing requisite acquisition time for 2D to 3D extension of the latent data, we conduct spectral modulation in a frequency-division multiplexing manner in the speed gap between slow spatial light modulation and fast detector response. Then the sequential reconstruction falls into a simple Fourier decomposition and standard compressive sensing problem. A proof-of-concept setup is built to capture the multispectral data (64 pixels $\times$ 64 pixels $\times$ 10 wavelength bands) in the visible wavelength range (450nm-650nm) with acquisition time being 1 minute. The imaging scheme is of high flexibility for different spectrum ranges and resolutions. It holds great potentials for various low light and airborne applications, and can be easily manufactured production-volume portable multispectral imagers.