Source author record

Haozhe Wang

Haozhe Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mtrl-sci Artificial Intelligence physics.app-ph physics.optics Computer Vision cond-mat.mes-hall cond-mat.str-el Machine Learning Networking and Internet Architecture physics.data-an physics.ins-det physics.med-ph

Catalog footprint

What is connected

13works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Bad Seeing or Bad Thinking? Rewarding Perception for Vision-Language Reasoning

Achieving robust perception-reasoning synergy is a central goal for advanced Vision-Language Models (VLMs). Recent advancements have pursued this goal via architectural designs or agentic workflows. However, these approaches are often limited by static textual reasoning or complicated by the significant compute and engineering burden of external agentic complexity. Worse, this heavy investment does not yield proportional gains, often witnessing a "seesaw effect" on perception and reasoning. This motivates a fundamental rethinking of the true bottleneck. In this paper, we argue that the root cause of this trade-off is an ambiguity in modality credit assignment: when a VLM fails, is it due to flawed perception ("bad seeing") or flawed logic ("bad thinking")? To resolve this, we introduce a reinforcement learning framework that improves perception-reasoning synergy by reliably rewarding the perception fidelity. We explicitly decompose the generation process into interleaved perception and reasoning steps. This decoupling enables targeted supervision on perception. Crucially, we introduce Perception Verification (PV), leveraging a "blindfolded reasoning" proxy to reward perceptual fidelity independently of reasoning outcomes. Furthermore, to scale training across free-form VL tasks, we propose Structured Verbal Verification, which replaces high-variance LLM judging with structured algorithmic execution. These techniques are integrated into a Modality-Aware Credit Assignment (MoCA) mechanism, which routes rewards to the specific source of error -- either bad seeing or bad thinking -- enabling a single VLM to achieve simultaneous performance gains across a wide task spectrum.

preprint2026arXiv

Forbidden second harmonics in centrosymmetric bilayer crystals

Optical spectroscopy based on second-order nonlinearity is a critical technique for characterizing two-dimensional (2D) crystals as well as bioimaging and quantum optics. It is generally believed that second-harmonic generation (SHG) in centrosymmetric crystals, such as graphene and other bilayer 2D crystals, is negligible without externally breaking the inversion symmetry. Here, we show that with a new homodyne detection technique, we can apparently circumvent this symmetry-imposed constraint and observe robust SHG in pristine centrosymmetric crystals, without any symmetry-breaking field. With its exceptional sensitivity, we resolve polarization-resolved SHG in bilayer hexagonal boron nitride (h-BN), bilayer 2H-WSe$_2$, and remarkably, Bernal-stacked bilayer graphene, allowing us to unambiguously identify the crystallographic orientation in these crystals via SHG for the first time. We also demonstrate that the new technique can be used to non-invasively detect uniaxial strain and optical geometric phase in these crystals. The observed SHG in our experiments is attributed to second-order nonlinearity in the quadrupole channel, which is controlled by the presence of the $C_2$ symmetry instead of the inversion symmetry. Our new technique expands the capability of nonlinear optical spectroscopy to encompass a large class of centrosymmetric materials that could never be measured before, and can be used for quantum sensing of moiré materials and twisted epitaxial films.

preprint2026arXiv

Frustrated Magnetism in FeGe$_3$O$_4$ with a Chiral Trillium Network

The discovery of new magnetic ground states in geometrically frustrated lattices remains a central challenge in materials science. Here, we report the synthesis, structural characterization, and frustrated magnetic properties of FeGe$_3$O$_4$, a newly identified compound that crystallizes in the noncentrosymmetric cubic space group $P2_13$. In this structure, Fe atoms form an intricate double-trillium lattice with nearest-neighbor Fe--Fe distances of $\sim$4.2~Å, while Ge$^{2+}$ ions mediate magnetic interactions through Fe-Ge-Fe pathways. Field-dependent magnetization at 2~K shows a pronounced nonlinearity, reaching a maximum moment of 2.55(3)~$μ_\mathrm{B}$/Fe$^{2+}$ at 70~kOe without evidence of saturation. Magnetic susceptibility, heat capacity, and neutron scattering collectively reveal the onset of short-range magnetic interactions near 5~K, with no long-range ordering detected down to 0.06~K. Specific heat measurements demonstrate strong frustration: only $\sim$34\% of the expected magnetic entropy is recovered at 2.4~K. Taken together, these results establish FeGe$_3$O$_4$ as a rare example of a geometrically frustrated trillium-lattice magnet, offering a promising platform for exploring exotic quantum magnetic phenomena.

preprint2026arXiv

Starve to Perceive: Taming Lazy Perception in VLMs with Constrained Visual Bandwidth

Vision-Language Models (VLMs) deployed as situated agents in high-resolution visual environments require active perception -- the ability to dynamically decide where to look through operations like zooming, cropping, and panning. However, current training paradigms produce models that mimic the surface form of such operations without functionally depending on their outputs, a phenomenon we term lazy perception. We trace this to a fundamental learning asymmetry: when coarse global views combined with language priors suffice for moderate accuracy, the model has no incentive to learn harder multi-step visual search. If a model can succeed without actively looking, it will never learn to look. This motivates Starve to Perceive, a training paradigm that constrains visual bandwidth -- restricting each observation to a tight token budget so that no single view suffices for task completion, making active perception the only viable strategy. Despite requiring no auxiliary losses, reward shaping, or architectural changes -- serving as a minimal, plug-in modification to standard post-training pipelines -- models trained under perceptual starvation achieve substantial gains of 5% average relative improvement across diverse benchmarks.

preprint2024arXiv

Insulator to Metal Transition, Spin-Phonon Coupling, and Potential Magnetic Transition Observed in Quantum Spin Liquid Candidate LiYbSe$_2$ under High Pressure

Metallization of quantum spin liquid (QSL) materials has long been considered as a potential route to achieve unconventional superconductivity. Here we report our endeavor in this direction by pressurizing a three-dimensional QSL candidate, LiYbSe$_2$, with a previously unreported pyrochlore structure. High-pressure X-ray diffraction and Raman studies up to 50 GPa reveal no appreciable changes of structural symmetry or distortion in this pressure range. This compound is so insulating that its resistance decreases below 10$^5$ $Ω$ only at pressures above 25 GPa in the corresponding temperature range accompanying the gradual reduction of band gap upon compression. Interestingly, an insulator-to-metal transition takes place in LiYbSe$_2$ at about 68 GPa and the metallic behavior remains up to 123.5 GPa, the highest pressure reached in the present study. A possible sign of magnetic or other phase transition was observed in LiYbSe$_2$. The insulator-to-metal transition in LiYbSe$_2$ under high pressure makes it an ideal system to study the pressure effects on QSL candidates of spin-1/2 Yb$^{3+}$ system in different lattice patterns.

preprint2023arXiv

A Novel Estimation Method for Temperature of Magnetic Nanoparticles Dominated by Brownian Relaxation Based on Magnetic Particle Spectroscopy

This paper presents a novel method for estimating the temperature of magnetic nanoparticles (MNPs) based on AC magnetization harmonics of MNPs dominated by Brownian relaxation. The difference in the AC magnetization response and magnetization harmonic between the Fokker-Planck equation and the Langevin function was analyzed, and we studied the relationship between the magnetization harmonic and the key factors, such as Brownian relaxation time, temperature, magnetic field strength, core size and hydrodynamic size of MNPs, excitation frequency, and so on. We proposed a compensation function for AC magnetization harmonic with consideration of the key factors and the difference between the Fokker-Planck equation and the Langevin function. Then a temperature estimation model based on the compensation function and the Langevin function was established. By employing the least squares algorithm, the temperature was successfully calculated. The experimental results show that the temperature error is less than 0.035 K in the temperature range from 310 K to 320 K. The temperature estimation model is expected to improve the performance of the magnetic nanoparticle thermometer and be applied to magnetic nanoparticle-mediated hyperthermia.

preprint2023arXiv

Synergistic Photon Management and Strain-Induced Band Gap Engineering of Two-Dimensional MoS2 Using Semimetal Composite Nanostructures

2D MoS2 attracts increasing attention for its application in flexible electronics and photonic devices. For 2D material optoelectronic devices, light absorption of the molecularly thin 2D absorber would be one of the key limiting factors in device efficiency, and conventional photon management techniques are not necessarily compatible with them. In this paper, we show two semimetal composite nanostructures for synergistic photon management and strain-induced band gap engineering of 2D MoS2: (1) pseudo-periodic Sn nanodots, (2) conductive SnOx (x<1) core-shell nanoneedle structures. Without sophisticated nanolithography, both nanostructures are self-assembled from physical vapor deposition. 2D MoS2 achieves up to >15x enhancement in absorption at λ=650-950 nm under Sn nanodots, and 20-30x at λ=700-900 nm under SnOx (x<1) nanoneedles, both spanning from visible to near infrared regime. Enhanced absorption in MoS2 results from strong near field enhancement and reduced MoS2 band gap due to the tensile strain induced by the Sn nanostructures, as confirmed by Raman and photoluminescence spectroscopy. Especially, we demonstrate that up to 3.5% biaxial tensile strain is introduced to 2D MoS2 using conductive nanoneedle-structured SnOx (x<1), which reduces the band gap by ~0.35 eV to further enhance light absorption at longer wavelengths. To the best of our knowledge, this is the first demonstration of a synergistic triple-functional photon management, stressor, and conductive electrode layer on 2D MoS2. Such synergistic photon management and band gap engineering approach for extended spectral response can be further applied to other 2D materials for future 2D photonic devices.

preprint2022arXiv

Learning Context-aware Task Reasoning for Efficient Meta-reinforcement Learning

Despite recent success of deep network-based Reinforcement Learning (RL), it remains elusive to achieve human-level efficiency in learning novel tasks. While previous efforts attempt to address this challenge using meta-learning strategies, they typically suffer from sampling inefficiency with on-policy RL algorithms or meta-overfitting with off-policy learning. In this work, we propose a novel meta-RL strategy to address those limitations. In particular, we decompose the meta-RL problem into three sub-tasks, task-exploration, task-inference and task-fulfillment, instantiated with two deep network agents and a task encoder. During meta-training, our method learns a task-conditioned actor network for task-fulfillment, an explorer network with a self-supervised reward shaping that encourages task-informative experiences in task-exploration, and a context-aware graph-based task encoder for task inference. We validate our approach with extensive experiments on several public benchmarks and the results show that our algorithm effectively performs exploration for task inference, improves sample efficiency during both training and testing, and mitigates the meta-overfitting problem.

preprint2022arXiv

Multiplication of freestanding semiconductor membranes from a single wafer by advanced remote epitaxy

Freestanding single-crystalline membranes are an important building block for functional electronics. Especially, compounds semiconductor membranes such as III-N and III-V offer great opportunities for optoelectronics, high-power electronics, and high-speed computing. Despite huge efforts to produce such membranes by detaching epitaxial layers from donor wafers, however, it is still challenging to harvest epitaxial layers using practical processes. Here, we demonstrate a method to grow and harvest multiple epitaxial membranes with extremely high throughput at the wafer scale. For this, 2D materials are directly formed on III-N and III-V substrates in epitaxy systems, which enables an advanced remote epitaxy scheme comprised of multiple alternating layers of 2D materials and epitaxial layers that can be formed by a single epitaxy run. Each epilayer in the multi-stack structure is then harvested by layer-by-layer peeling, producing multiple freestanding membranes with unprecedented throughput from a single wafer. Because 2D materials allow peeling at the interface without damaging the epilayer or the substrate, wafers can be reused for subsequent membrane production. Therefore, this work represents a meaningful step toward high-throughput and low-cost production of single-crystal membranes that can be heterointegrated.

preprint2022arXiv

ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement Learning

Real-Time Bidding (RTB) is an important mechanism in modern online advertising systems. Advertisers employ bidding strategies in RTB to optimize their advertising effects subject to various financial requirements, especially the return-on-investment (ROI) constraint. ROIs change non-monotonically during the sequential bidding process, and often induce a see-saw effect between constraint satisfaction and objective optimization. While some existing approaches show promising results in static or mildly changing ad markets, they fail to generalize to highly dynamic ad markets with ROI constraints, due to their inability to adaptively balance constraints and objectives amidst non-stationarity and partial observability. In this work, we specialize in ROI-Constrained Bidding in non-stationary markets. Based on a Partially Observable Constrained Markov Decision Process, our method exploits an indicator-augmented reward function free of extra trade-off parameters and develops a Curriculum-Guided Bayesian Reinforcement Learning (CBRL) framework to adaptively control the constraint-objective trade-off in non-stationary ad markets. Extensive experiments on a large-scale industrial dataset with two problem settings reveal that CBRL generalizes well in both in-distribution and out-of-distribution data regimes, and enjoys superior learning efficiency and stability.

preprint2020arXiv

Deep-Learning-Enabled Fast Optical Identification and Characterization of Two-Dimensional Materials

Advanced microscopy and/or spectroscopy tools play indispensable role in nanoscience and nanotechnology research, as it provides rich information about the growth mechanism, chemical compositions, crystallography, and other important physical and chemical properties. However, the interpretation of imaging data heavily relies on the "intuition" of experienced researchers. As a result, many of the deep graphical features obtained through these tools are often unused because of difficulties in processing the data and finding the correlations. Such challenges can be well addressed by deep learning. In this work, we use the optical characterization of two-dimensional (2D) materials as a case study, and demonstrate a neural-network-based algorithm for the material and thickness identification of exfoliated 2D materials with high prediction accuracy and real-time processing capability. Further analysis shows that the trained network can extract deep graphical features such as contrast, color, edges, shapes, segment sizes and their distributions, based on which we develop an ensemble approach topredict the most relevant physical properties of 2D materials. Finally, a transfer learning technique is applied to adapt the pretrained network to other applications such as identifying layer numbers of a new 2D material, or materials produced by a different synthetic approach. Our artificial-intelligence-based material characterization approach is a powerful tool that would speed up the preparation, initial characterization of 2D materials and other nanomaterials and potentially accelerate new material discoveries.

preprint2020arXiv

Multi-level Electro-thermal Switching of Optical Phase-Change Materials Using Graphene

Reconfigurable photonic systems featuring minimal power consumption are crucial for integrated optical devices in real-world technology. Current active devices available in foundries, however, use volatile methods to modulate light, requiring a constant supply of power and significant form factors. Essential aspects to overcoming these issues are the development of nonvolatile optical reconfiguration techniques which are compatible with on-chip integration with different photonic platforms and do not disrupt their optical performances. In this paper, a solution is demonstrated using an optoelectronic framework for nonvolatile tunable photonics that employs undoped-graphene microheaters to thermally and reversibly switch the optical phase-change material Ge$_2$Sb$_2$Se$_4$Te$_1$ (GSST). An in-situ Raman spectroscopy method is utilized to demonstrate, in real-time, reversible switching between four different levels of crystallinity. Moreover, a 3D computational model is developed to precisely interpret the switching characteristics, and to quantify the impact of current saturation on power dissipation, thermal diffusion, and switching speed. This model is used to inform the design of nonvolatile active photonic devices; namely, broadband Si$_3$N$_4$ integrated photonic circuits with small form-factor modulators and reconfigurable metasurfaces displaying 2$π$ phase coverage through neural-network-designed GSST meta-atoms. This framework will enable scalable, low-loss nonvolatile applications across a diverse range of photonics platforms.

preprint2011arXiv

A Localization Method for the Internet of Things

Many localization algorithms and systems have been developed by means of wireless sensor networks for both indoor and outdoor environments. To achieve higher localization accuracy, extra hardware equipments are utilized by most of the existing localization solutions, which increase the cost and considerably limit the location-based applications. The Internet of Things (IOT) integrates many technologies, such as Internet, Zigbee, Bluetooth, infrared, WiFi, GPRS, 3G, etc, which can enable different ways to obtain the location information of various objects. Location-based service is a primary service of the IOT, while localization accuracy is a key issue. In this paper, a higher accuracy localization scheme is proposed which can effectively satisfy diverse requirements for many indoor and outdoor location services. The proposed scheme composes of two phases: 1) partition phase, in which the target region is split into small grids; 2) localization refinement phase, in which a higher accuracy of localization can be obtained by applying an algorithm designed in the paper. A trial system is set up to verify correctness of the proposed scheme and furthermore to illustrate its feasibility and availability. The experimental results show that the proposed scheme can improve the localization accuracy.

Haozhe Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

Bad Seeing or Bad Thinking? Rewarding Perception for Vision-Language Reasoning

Forbidden second harmonics in centrosymmetric bilayer crystals

Frustrated Magnetism in FeGe$_3$O$_4$ with a Chiral Trillium Network

Starve to Perceive: Taming Lazy Perception in VLMs with Constrained Visual Bandwidth

Insulator to Metal Transition, Spin-Phonon Coupling, and Potential Magnetic Transition Observed in Quantum Spin Liquid Candidate LiYbSe$_2$ under High Pressure

A Novel Estimation Method for Temperature of Magnetic Nanoparticles Dominated by Brownian Relaxation Based on Magnetic Particle Spectroscopy

Synergistic Photon Management and Strain-Induced Band Gap Engineering of Two-Dimensional MoS2 Using Semimetal Composite Nanostructures

Learning Context-aware Task Reasoning for Efficient Meta-reinforcement Learning

Multiplication of freestanding semiconductor membranes from a single wafer by advanced remote epitaxy

ROI-Constrained Bidding via Curriculum-Guided Bayesian Reinforcement Learning

Deep-Learning-Enabled Fast Optical Identification and Characterization of Two-Dimensional Materials

Multi-level Electro-thermal Switching of Optical Phase-Change Materials Using Graphene

A Localization Method for the Internet of Things