Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
14works
0followers
17topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

14 published item(s)

preprint2026arXiv

Does the radio-active phase of XTE~J1810$-$197 recur following the same evolutionary pattern?

Magnetars are the most strongly magnetized compact objects known in the Universe and are regarded as one of the primary engines powering a variety of enigmatic, high-energy transients. However, our understanding of magnetars remains highly limited, constrained by observational sample size and radiative variability. XTE~J1810$-$197, which re-entered a radio-active phase in 2018, is one of only six known radio-pulsating magnetars. Leveraging the distinctive capability for simultaneous dual-frequency observations, we utilized the Shanghai Tianma Radio Telescope (TMRT) to monitor this magnetar continuously at both 2.25 and 8.60~GHz, capturing its entire evolution from radio activation to quenching. This enabled precise characterization of the evolution in its integrated profile, spin frequency, flux density, and spectral index ($α$, defined by $S \propto f^α$). The first time derivative of its spin frequency $\dotν$ passed through four distinct phases -- rapid decrease, violent oscillation, steady decline, and stable recovery -- before returning to its pre-outburst value concomitant with the cessation of radio emission. Remarkably, both the amplitudes and the characteristic time-scales of these $\dotν$ variations match those observed during the previous outburst that began in 2003, providing the first demonstration that post-outburst rotational evolution and radiative behavior in a magnetar are repeatable. A twisted-magnetosphere model can qualitatively account for this repeatability as well as for the progressive narrowing and abrupt disappearance of the radio pulse radiation, thereby receiving strong observational support.

preprint2026arXiv

ELDOR: A Dataset and Benchmark for Illegal Gold Mining in the Amazon Rainforest

Illegal gold mining in the Amazon rainforest causes deforestation, water contamination, and long-term ecosystem disruption, yet remains difficult to monitor at fine spatial scales. Satellite imagery supports large-scale observation, but often misses small mining-related structures and subtle land-cover transitions, especially under frequent cloud cover. We introduce ELDOR, a large-scale UAV benchmark for monitoring environmental and landscape disturbance from illegal gold mining in the rainforest. ELDOR contains manually annotated orthomosaic imagery covering over 2,500 hectares, with pixel-level semantic labels for both mining-related activities and surrounding ecological structures. With this unified annotation source, we establish four benchmark tasks: semantic segmentation, segmentation-derived recognition, direct multi-label classification, and class-presence recognition with vision-language models. Across these tasks, we compare generic and remote-sensing-specific segmentation models, vision foundation model-related segmentation methods, direct multi-label classification methods, and vision-language models under a controlled closed-set protocol. Results show that current methods still struggle with rare small-scale mining structures and fine-grained recovery classes, suggesting the need for context-aware and multimodal modeling. To support domain analysis and practical use, we further build an interactive explorer for domain experts that provides a unified interface for data exploration and model inference.

preprint2026arXiv

Feature-based Inversion of 2.5D Controlled Source Electromagnetic Data using Generative Priors

In this study, we investigate feature-based 2.5D controlled source marine electromagnetic (mCSEM) data inversion using generative priors. Two-and-half dimensional modeling using finite difference method (FDM) is adopted to compute the response of horizontal electric dipole (HED) excitation. Rather than using a neural network to approximate the entire inverse mapping in a black-box manner, we adopt a plug-andplay strategy in which a variational autoencoder (VAE) is used solely to learn prior information on conductivity distributions. During the inversion process, the conductivity model is iteratively updated using the Gauss Newton method, while the model space is constrained by projections onto the learned VAE decoder. This framework preserves explicit control over data misfit and enables flexible adaptation to different survey configurations. Numerical and field experiments demonstrate that the proposed approach effectively incorporates prior information, improves reconstruction accuracy, and exhibits good generalization performance.

preprint2026arXiv

FilmSceneDesigner: Chaining Set Design for Procedural Film Scene Generation

Film set design plays a pivotal role in cinematic storytelling and shaping the visual atmosphere. However, the traditional process depends on expert-driven manual modeling, which is labor-intensive and time-consuming. To address this issue, we introduce FilmSceneDesigner, an automated scene generation system that emulates professional film set design workflow. Given a natural language description, including scene type, historical period, and style, we design an agent-based chaining framework to generate structured parameters aligned with film set design workflow, guided by prompt strategies that ensure parameter accuracy and coherence. On the other hand, we propose a procedural generation pipeline which executes a series of dedicated functions with the structured parameters for floorplan and structure generation, material assignment, door and window placement, and object retrieval and layout, ultimately constructing a complete film scene from scratch. Moreover, to enhance cinematic realism and asset diversity, we construct SetDepot-Pro, a curated dataset of 6,862 film-specific 3D assets and 733 materials. Experimental results and human evaluations demonstrate that our system produces structurally sound scenes with strong cinematic fidelity, supporting downstream tasks such as virtual previs, construction drawing and mood board creation.

preprint2026arXiv

GeM-VG: Towards Generalized Multi-image Visual Grounding with Multimodal Large Language Models

Multimodal Large Language Models (MLLMs) have demonstrated impressive progress in single-image grounding and general multi-image understanding. Recently, some methods begin to address multi-image grounding. However, they are constrained by single-target localization and limited types of practical tasks, due to the lack of unified modeling for generalized grounding tasks. Therefore, we propose GeM-VG, an MLLM capable of Generalized Multi-image Visual Grounding. To support this, we systematically categorize and organize existing multi-image grounding tasks according to their reliance of cross-image cues and reasoning, and introduce the MG-Data-240K dataset, addressing the limitations of existing datasets regarding target quantity and image relation. To tackle the challenges of robustly handling diverse multi-image grounding tasks, we further propose a hybrid reinforcement finetuning strategy that integrates chain-of-thought (CoT) reasoning and direct answering, considering their complementary strengths. This strategy adopts an R1-like algorithm guided by a carefully designed rule-based reward, effectively enhancing the model's overall perception and reasoning capabilities. Extensive experiments demonstrate the superior generalized grounding capabilities of our model. For multi-image grounding, it outperforms the previous leading MLLMs by 2.0% and 9.7% on MIG-Bench and MC-Bench, respectively. In single-image grounding, it achieves a 9.1% improvement over the base model on ODINW. Furthermore, our model retains strong capabilities in general multi-image understanding.

preprint2026arXiv

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

We present GLM-4.1V-Thinking, GLM-4.5V, and GLM-4.6V, a family of vision-language models (VLMs) designed to advance general-purpose multimodal understanding and reasoning. In this report, we share our key findings in the development of the reasoning-centric training framework. We first develop a capable vision foundation model with significant potential through large-scale pre-training, which arguably sets the upper bound for the final performance. We then propose Reinforcement Learning with Curriculum Sampling (RLCS) to unlock the full potential of the model, leading to comprehensive capability enhancement across a diverse range of tasks, including STEM problem solving, video understanding, content recognition, coding, grounding, GUI-based agents, and long document interpretation. In a comprehensive evaluation across 42 public benchmarks, GLM-4.5V achieves state-of-the-art performance on nearly all tasks among open-source models of similar size, and demonstrates competitive or even superior results compared to closed-source models such as Gemini-2.5-Flash on challenging tasks including Coding and GUI Agents. Meanwhile, the smaller GLM-4.1V-9B-Thinking remains highly competitive-achieving superior results to the much larger Qwen2.5-VL-72B on 29 benchmarks. We open-source both GLM-4.1V-9B-Thinking and GLM-4.5V. We further introduce the GLM-4.6V series, open-source multimodal models with native tool use and a 128K context window. A brief overview is available at https://z.ai/blog/glm-4.6v. Code, models and more information are released at https://github.com/zai-org/GLM-V.

preprint2026arXiv

High-Performance KV$_3$Sb$_5$/WSe$_2$ van der Waals Photodetectors

Kagome metals AV$_3$Sb$_5$ (A = K, Rb, Cs) have recently emerged as a promising platform for exploring correlated and topological quantum states, yet their potential for optoelectronic applications remains largely unexplored. Here, we report high-performance photodetectors based on van der Waals KV$_3$Sb$_5$/WSe$_2$ heterojunctions. A high-quality Schottky interface readily forms between KV$_3$Sb$_5$ and WSe$_2$, enabling efficient separation and transport of photoinduced carriers. Under 520 nm illumination, the device achieves an open-circuit voltage up to 0.6 V, a responsivity of 809 mA/W, and a fast response time of 18.3 us. This work demonstrates the promising optoelectronic applications of Kagome metals and highlights the potential of KV$_3$Sb$_5$-based van der Waals heterostructures for high-performance photodetection.

preprint2026arXiv

Impact of Pressure and Apical Oxygen Vacancies on Superconductivity in La$_3$Ni$_2$O$_7$

The bilayer nickelate La$_3$Ni$_2$O$_7$ under pressure has recently emerged as a promising system for high-$T_c$ superconductivity. In this work, we investigate the fate of the superconducting properties in La$_3$Ni$_2$O$_{7}$ under pressure, focusing on the effects of structural deformation and apical oxygen vacancies. Employing a low-energy effective $t$-$J_{\parallel}$-$J_{\perp}$ model for the $3d_{x^2-y^2}$ orbitals within the slave-boson mean-field approach, we demonstrate that the pairing strength is significantly enhanced in the high-pressure tetragonal $I4/mmm$ phase compared to the ambient pressure orthorhombic $Amam$ phase. Furthermore, by simulating random configurations of apical oxygen vacancies, we show that oxygen vacancies suppress both pairing strength and superfluid density. These results underscore the critical role of pressure and oxygen stoichiometry in tuning the SC of La$_3$Ni$_2$O$_7$, providing key insights into optimizing its high-$T_c$ behavior.

preprint2026arXiv

Mass conservation, positivity and energy identical-relation preserving scheme for the Navier-Stokes equations with variable density

In this paper, we consider a mass conservation, positivity and energy identical-relation preserving scheme for the Navier-Stokes equations with variable density. Utilizing the square transformation, we first ensure the positivity of the numerical fluid density, which is form-invariant and regardless of the discrete scheme. Then, by proposing a new recovery technique to eliminate the numerical dissipation of the energy and to balance the loss of the mass when approximating the reformation form, we preserve the original energy identical-relation and mass conservation of the proposed scheme. To the best of our knowledge, this is the first work that can preserve the original energy identical-relation for the Navier-Stokes equations with variable density. Moreover, the error estimates of the considered scheme are derived. Finally, we show some numerical examples to verify the correctness and efficiency.

preprint2026arXiv

SAHA: Supervised Autonomous HArvester for selective forest thinning

Forestry plays a vital role in our society, creating significant ecological, economic, and recreational value. Efficient forest management involves labor-intensive and complex operations. One essential task for maintaining forest health and productivity is selective thinning, which requires skilled operators to remove specific trees to create optimal growing conditions for the remaining ones. In this work, we present a solution based on a small-scale robotic harvester (SAHA) designed for executing this task with supervised autonomy. We build on a 4.5-ton harvester platform and implement key hardware modifications for perception and automatic control. We implement learning- and model-based approaches for precise control of hydraulic actuators, accurate navigation through cluttered environments, robust state estimation, and reliable semantic estimation of terrain traversability. Integrating state-of-the-art techniques in perception, planning, and control, our robotic harvester can autonomously navigate forest environments and reach targeted trees for selective thinning. We present experimental results from extensive field trials over kilometer-long autonomous missions in northern European forests, demonstrating the harvester's ability to operate in real forests. We analyze the performance and provide the lessons learned for advancing robotic forest management.

preprint2026arXiv

STAR-PólyaMath: Multi-Agent Reasoning under Persistent Meta-Strategic Supervision

Frontier AI models and multi-agent systems have led to significant improvements in mathematical reasoning. However, for problems requiring extended, long-horizon reasoning, existing systems continue to suffer from fundamental reliability issues: hallucination accumulation, memory fragmentation, and imbalanced reasoning-tool trade-offs. In this paper, we introduce STAR-PólyaMath, a multi-agent framework that systematically addresses these challenges through meta-level supervision and structured Reasoner-Verifier interaction. STAR-PólyaMath is structured as an orchestrated state machine with nested challenge-step-replan loops, governed by a reasoning-free Python orchestrator that separates control from inference and bounds error propagation through trace-back and re-planning. Our key innovation is a persistent Meta-Strategist that maintains cross-attempt memory and exercises meta-level control by issuing high-level strategic guidance or mandatory directives, so the system can escape unproductive loops rather than stagnate or over-rely on tools. STAR-PólyaMath achieves state-of-the-art results on all eight top-tier competition benchmarks: AIME 2025-2026, MathArena Apex Shortlist, MathArena Apex 2025, Putnam 2025, IMO 2025, HMMT February 2026, and USAMO 2026. It obtains perfect scores on AIMEs, Putnam, and HMMT, and shows its largest margin on Apex 2025, scoring 93.75% compared with 80.21% by the strongest baseline GPT-5.5. Ablation studies show that the gains arise from the framework's orchestration rather than from model-level diversity since removing key components or substituting in mixed backbones consistently weakens performance. Code is available at https://github.com/Julius-Woo/STAR-PolyaMath.

preprint2026arXiv

Strong-coupling study of the pairing mechanism in pressurized La$_3$Ni$_2$O$_7$

Recently, the bilayer perovskite nickelate La$_3$Ni$_2$O$_7$ has been reported to exhibit high-temperature superconductivity near $80$ K under a moderate pressure of about $14$GPa. To investigate the underlying pairing mechanism and symmetry in this complex system, we propose and analyze a mixed spin-$1$ and spin-$\frac{1}{2}$ bilayer $t$-$J$ model in the strong coupling regime. This model explicitly incorporates the crucial role of strong Hund's coupling, which favors the formation of local spin-triplet states from the two onsite $E_g$ orbital electrons at half-filling. We further investigate the model using both slave-particle mean-field theory and the density matrix renormalization group method. Our simulation results reveal that the dominate pairing channel is the interlayer one in the $3d_{x^2-y^2}$ orbital. The Hund's coupling is shown to enhance superconductivity within a reasonable physical range. Moreover, electron doping strengthens superconductivity by increasing carrier density; in contrast, hole doping weakens superconductivity. These findings offer critical insights into the unconventional superconductivity of pressurized La$_3$Ni$_2$O$_7$ and underline the important role of orbital-selective behavior and Hund's rule.

preprint2025arXiv

Arithmetic spectral transition for the unitary almost Mathieu operator

We study the unitary almost Mathieu operator (UAMO), a one-dimensional quasi-periodic unitary operator arising from a two-dimensional discrete-time quantum walk on $\mathbb Z^2$ in a homogeneous magnetic field. In the positive Lyapunov exponent regime $0\le λ_1<λ_2\le 1$, we establish an arithmetic localization statement governed by the frequency exponent $β(ω)$. More precisely, for every irrational $ω$ with $β(ω)<L$, where $L>0$ denotes the Lyapunov exponent, and every non-resonant phase $θ$, we prove Anderson localization, i.e. pure point spectrum with exponentially decaying eigenfunctions. This extends our previous arithmetic localization result for Diophantine frequencies (for which $β(ω)=0$) to a sharp threshold in frequency.

preprint2024arXiv

Superconductivity at Pd/Bi$_2$Se$_3$ Interfaces Due to Self-Formed PdBiSe Interlayers

Understanding the physical and chemical processes at the interface of metals and topological insulators is crucial for developing the next generation of topological quantum devices. Here we report the discovery of robust superconductivity in Pd/Bi$_2$Se$_3$ bilayers fabricated by sputtering Pd on the surface of Bi$_2$Se$_3$. Through transmission electron microscopy measurements, we identify that the observed interfacial superconductivity originates from the diffusion of Pd into Bi$_2$Se$_3$. In the diffusion region, Pd chemically reacts with Bi$_2$Se$_3$ and forms a layer of PdBiSe, a known su-perconductor with a bulk transition temperature of 1.5 K. Our work provides a method for in-troducing superconductivity into Bi$_2$Se$_3$, laying the foundation for developing sophisticated Bi$_2$Se$_3$-based topological devices.