Source author record

Zhonghua Wu

Zhonghua Wu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision cond-mat.mtrl-sci cond-mat.str-el cond-mat.supr-con Artificial Intelligence Biological Physics cond-mat.soft eess.IV Machine Learning

Catalog footprint

What is connected

11works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

$ξ$-DPO: Direct Preference Optimization via Ratio Reward Margin

Reference-free preference optimization has emerged as an efficient alternative to reinforcement learning from human feedback, with Simple Preference Optimization(SimPO) demonstrating strong performance by eliminating the explicit reference model through a simple objective. However, the joint tuning of the hyperparameters $β$ and $γ$ in SimPO remains a central challenge. We argue that this difficulty arises because the margin formulation in SimPO is not easily interpretable across datasets with different reward gap structures. To better understand this issue, we conduct a comprehensive analysis of SimPO and find that $β$ implicitly controls sample filtering, while the effect of $γ$ depends on the reward gap structure of the dataset. Motivated by these observations, we propose $ξ$-DPO: Direct preference optimization via ratio reward margin. We first reformulate the preference objective through an equivalent transformation, changing the optimization target from maximizing the likelihood of reward gaps to minimizing the distance between reward gaps and optimal margins. Then, we redefine the reward in a ratio form between the chosen and rejected, which effectively cancels the effect of $β$ and yields a bounded and interpretable margin. This margin is called the ratio reward margin and is denoted by $ξ$. Unlike the margin $γ$ in SimPO, $ξ$ explicitly represents the desired relative separation between chosen and rejected responses and can be determined from the initial reward gap distribution, avoiding repeated trial-and-error tuning. ....

preprint2026arXiv

Zoom-IQA: Image Quality Assessment with Reliable Region-Aware Reasoning

Image Quality Assessment (IQA) is a long-standing problem in computer vision. Previous methods typically focus on predicting numerical scores without explanation or providing low-level descriptions lacking precise scores. Recent reasoning-based vision language models (VLMs) have shown strong potential for IQA by jointly generating quality descriptions and scores. However, existing VLM-based IQA methods often suffer from unreliable reasoning due to their limited capability of integrating visual and textual cues. In this work, we introduce Zoom-IQA, a VLM-based IQA model to explicitly emulate key cognitive behaviors: uncertainty awareness, region reasoning, and iterative refinement. Specifically, we present a two-stage training pipeline: 1) supervised fine-tuning (SFT) on our Grounded-Rationale-IQA (GR-IQA) dataset to teach the model to ground its assessments in key regions, and 2) reinforcement learning (RL) for dynamic policy exploration, stabilized by our KL-Coverage regularizer to prevent reasoning and scoring diversity collapse, with a Progressive Re-sampling Strategy for mitigating annotation bias. Extensive experiments show that Zoom-IQA achieves improved robustness, explainability, and generalization. The application to downstream tasks, such as image restoration, further demonstrates the effectiveness of Zoom-IQA.

preprint2022arXiv

Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation

Weakly supervised point cloud segmentation, i.e. semantically segmenting a point cloud with only a few labeled points in the whole 3D scene, is highly desirable due to the heavy burden of collecting abundant dense annotations for the model training. However, existing methods remain challenging to accurately segment 3D point clouds since limited annotated data may lead to insufficient guidance for label propagation to unlabeled data. Considering the smoothness-based methods have achieved promising progress, in this paper, we advocate applying the consistency constraint under various perturbations to effectively regularize unlabeled 3D points. Specifically, we propose a novel DAT (\textbf{D}ual \textbf{A}daptive \textbf{T}ransformations) model for weakly supervised point cloud segmentation, where the dual adaptive transformations are performed via an adversarial strategy at both point-level and region-level, aiming at enforcing the local and structural smoothness constraints on 3D point clouds. We evaluate our proposed DAT model with two popular backbones on the large-scale S3DIS and ScanNet-V2 datasets. Extensive experiments demonstrate that our model can effectively leverage the unlabeled 3D points and achieve significant performance gains on both datasets, setting new state-of-the-art performance for weakly supervised point cloud segmentation.

preprint2022arXiv

Exploring Smoothness and Class-Separation for Semi-supervised Medical Image Segmentation

Semi-supervised segmentation remains challenging in medical imaging since the amount of annotated medical data is often scarce and there are many blurred pixels near the adhesive edges or in the low-contrast regions. To address the issues, we advocate to firstly constrain the consistency of pixels with and without strong perturbations to apply a sufficient smoothness constraint and further encourage the class-level separation to exploit the low-entropy regularization for the model training. Particularly, in this paper, we propose the SS-Net for semi-supervised medical image segmentation tasks, via exploring the pixel-level smoothness and inter-class separation at the same time. The pixel-level smoothness forces the model to generate invariant results under adversarial perturbations. Meanwhile, the inter-class separation encourages individual class features should approach their corresponding high-quality prototypes, in order to make each class distribution compact and separate different classes. We evaluated our SS-Net against five recent methods on the public LA and ACDC datasets. Extensive experimental results under two semi-supervised settings demonstrate the superiority of our proposed SS-Net model, achieving new state-of-the-art (SOTA) performance on both datasets. The code is available at https://github.com/ycwu1997/SS-Net.

preprint2022arXiv

Long-tailed Recognition by Learning from Latent Categories

In this work, we address the challenging task of long-tailed image recognition. Previous long-tailed recognition methods commonly focus on the data augmentation or re-balancing strategy of the tail classes to give more attention to tail classes during the model training. However, due to the limited training images for tail classes, the diversity of tail class images is still restricted, which results in poor feature representations. In this work, we hypothesize that common latent features among the head and tail classes can be used to give better feature representation. Motivated by this, we introduce a Latent Categories based long-tail Recognition (LCReg) method. Specifically, we propose to learn a set of class-agnostic latent features shared among the head and tail classes. Then, we implicitly enrich the training sample diversity via applying semantic data augmentation to the latent features. Extensive experiments on five long-tailed image recognition datasets demonstrate that our proposed LCReg is able to significantly outperform previous methods and achieve state-of-the-art results.

preprint2020arXiv

Exploring Bottom-up and Top-down Cues with Attentive Learning for Webly Supervised Object Detection

Fully supervised object detection has achieved great success in recent years. However, abundant bounding boxes annotations are needed for training a detector for novel classes. To reduce the human labeling effort, we propose a novel webly supervised object detection (WebSOD) method for novel classes which only requires the web images without further annotations. Our proposed method combines bottom-up and top-down cues for novel class detection. Within our approach, we introduce a bottom-up mechanism based on the well-trained fully supervised object detector (i.e. Faster RCNN) as an object region estimator for web images by recognizing the common objectiveness shared by base and novel classes. With the estimated regions on the web images, we then utilize the top-down attention cues as the guidance for region classification. Furthermore, we propose a residual feature refinement (RFR) block to tackle the domain mismatch between web domain and the target domain. We demonstrate our proposed method on PASCAL VOC dataset with three different novel/base splits. Without any target-domain novel-class images and annotations, our proposed webly supervised object detection model is able to achieve promising performance for novel classes. Moreover, we also conduct transfer learning experiments on large scale ILSVRC 2013 detection dataset and achieve state-of-the-art performance.

preprint2015arXiv

Application of Mythen Detector In-situ XRD Study on The Thermal Expansion Behavior of Metal Indium

A Mythen detector has been equipped at the beamline 4B9A of Beijing Synchrotron Radiation Facility, which can be used for in-situ real-time measurement of X-ray diffraction (XRD) full profiles. In this paper, the thermal expansion behavior of metal indium has been studied by using the in-situ XRD technique with the Mythen detector. The indium film was heated from 30 to 160 °C with a heating rate of 2 °C/min. The in-situ XRD full-profiles were collected with a rate of one profile per 10 seconds. Rietveld refinement was used to extract the structural parameters. The results demonstrate that the thermal expansion of metal indium is nonlinear especially when the sample temperature was close to its melting point (156.5 °C). The expansion of a-axis and the contraction of c-axis of the tetragonal unit cell of metallic indium can be well described by biquadratic and cubic polynomials, respectively. The tetragonal unit cell presents a tendency to become cubic one with the increase of temperature but without detectable phase change. This study is not only beneficial to the application of metal indium, but also exhibits the capacity of in-situ time-resolved XRD experiments at the X-ray diffraction station of BSRF.

preprint2015arXiv

Gapless quantum spin liquid ground state in the two-dimensional spin-1/2 triangular antiferromagnet YbMgGaO$_4$

Quantum spin liquid (QSL) is a novel state of matter which refuses the conventional spin freezing even at 0 K. Experimentally searching for the structurally perfect candidates is a big challenge in condensed matter physics. Here we report the successful synthesis of a new spin-1/2 triangular antiferromagnet YbMgGaO$_4$ with R$\bar{3}$m symmetry. The compound with an ideal two-dimensional and spatial isotropic magnetic triangular-lattice has no site-mixing magnetic defects and no antisymmetric Dzyaloshinsky-Moriya (DM) interactions. No spin freezing down to 60 mK (despite $Θ$$_w$ $\sim$ -4 K), the low-T power-law temperature dependence of heat capacity and nonzero susceptibility suggest that YbMgGaO$_4$ is a promising gapless ($\leq$ $|$$Θ$$_w$$|$/100) QSL candidate. The residual spin entropy, which is accurately determined with a non-magnetic reference LuMgGaO$_4$, approaches zero ($<$ 0.6 \%). This indicates that the possible QSL ground state (GS) of the frustrated spin system has been experimentally achieved at the lowest measurement temperatures.

preprint2013arXiv

Re-entrance of Gapless Quantum Spin Liquids Observed in a Newly Synthesized Spin-1/2 Kagome Antiferromagnet $ZnCu_{3}(OH)_{6}SO_{4}$

Quantum spin liquid (QSL) is a novel state of matter with exotic excitations and was theoretically predicted to be realized most possibly in an S=1/2 kagome antiferromagnet. Experimentally searching for the candidate materials is a big challenge in condensed matter physics and only two such candidates were reported so far. Here we report the successful synthesis of a new spin-1/2 kagome antiferromagnet ZnCu3(OH)6SO4. No magnetic ordering is observed down to 50 mK, despite a moderately high Weiss temperature of θW ~ -79 K. It strongly suggests that the material is a new QSL candidate. Most interestingly, the magnetic specific heat clearly exhibits linear behaviors in two low-temperature regions. Both behaviors exactly correspond to two temperature-independent susceptibilities. These consistently reveal a novel re-entrance phenomenon of gapless QSL state at the lowest temperatures. The findings provide new insights into QSL ground and excited states and will inspire new theoretical and experimental studies.

preprint2013arXiv

Transition-metal distribution in kagome antiferromagnet CoCu3(OH)6Cl2 revealed by resonant x-ray diffraction

The distribution of chemically similar transition-metal ions is a fundamental issue in the study of herbertsmithite-type kagome antiferromagnets. Using synchrotron radiation, we have performed resonant powder x-ray diffractions on newly synthesized CoCu3(OH)6Cl2, which provide an exact distribution of transition-metal ions in the frustrated antiferromagnet. Both magnetic susceptibility and specific heat measurements are quantitatively consistent with the occupation fractions determined by resonant x-ray diffraction. The distribution of transition-metal ions and residual magnetic entropy suggest a novel low temperature (T < 4 K) magnetism, where the interlayer triangular spins undergo a spin-glass freezing while the kagome spins still keep highly frustrated.

preprint2012arXiv

Hierarchical structure and biomineralization in cricket tooth

Cricket is a truculent insect with stiff and sharp teeth as a fighting weapon. The structure and possible biomineralization of the cricket teeth are always interested. Synchrotron radiation X-ray fluorescence, X-ray diffraction and small angle X-ray scattering techniques were used to probe the element distribution, possible crystalline structures and size distribution of scatterers in cricket teeth. Scanning electron microscope was used to observe the nanoscaled structure. The results demonstrate that Zn is the main heavy element in cricket teeth. The surface of the cricket teeth has a crystalline compound like ZnFe2(AsO4)2(OH)2(H2O)4. While, the interior of the teeth has a crystalline compound like ZnCl2, which is from the biomineralization. The ZnCl2-like biomineral forms nanoscaled microfibrils and their axial direction points at the top of tooth cusp. The microfibrils aggregate random into intermediate filaments, forming a hierarchical structure. A sketch map of the cricket tooth cusp was proposed and a detailed discussion was given in this paper.

Zhonghua Wu

What is connected

Connect this record

See the researcher in context

Building this map preview

11 published item(s)

$ξ$-DPO: Direct Preference Optimization via Ratio Reward Margin

Zoom-IQA: Image Quality Assessment with Reliable Region-Aware Reasoning

Dual Adaptive Transformations for Weakly Supervised Point Cloud Segmentation

Exploring Smoothness and Class-Separation for Semi-supervised Medical Image Segmentation

Long-tailed Recognition by Learning from Latent Categories

Exploring Bottom-up and Top-down Cues with Attentive Learning for Webly Supervised Object Detection

Application of Mythen Detector In-situ XRD Study on The Thermal Expansion Behavior of Metal Indium

Gapless quantum spin liquid ground state in the two-dimensional spin-1/2 triangular antiferromagnet YbMgGaO$_4$

Re-entrance of Gapless Quantum Spin Liquids Observed in a Newly Synthesized Spin-1/2 Kagome Antiferromagnet $ZnCu_{3}(OH)_{6}SO_{4}$

Transition-metal distribution in kagome antiferromagnet CoCu3(OH)6Cl2 revealed by resonant x-ray diffraction

Hierarchical structure and biomineralization in cricket tooth