Researcher profile

Haiyang Zhang

Haiyang Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
15works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

15 published item(s)

preprint2026arXiv

Following the TRACE: A Structured Path to Empathetic Response Generation with Multi-Agent Models

Empathetic response generation is a crucial task for creating more human-like and supportive conversational agents. However, existing methods face a core trade-off between the analytical depth of specialized models and the generative fluency of Large Language Models (LLMs). To address this, we propose TRACE, Task-decomposed Reasoning for Affective Communication and Empathy, a novel framework that models empathy as a structured cognitive process by decomposing the task into a pipeline for analysis and synthesis. By building a comprehensive understanding before generation, TRACE unites deep analysis with expressive generation. Experimental results show that our framework significantly outperforms strong baselines in both automatic and LLM-based evaluations, confirming that our structured decomposition is a promising paradigm for creating more capable and interpretable empathetic agents. Our code is available at https://anonymous.4open.science/r/TRACE-18EF/README.md.

preprint2025arXiv

RDSA: A Robust Deep Graph Clustering Framework via Dual Soft Assignment

Graph clustering is an essential aspect of network analysis that involves grouping nodes into separate clusters. Recent developments in deep learning have resulted in graph clustering, which has proven effective in many applications. Nonetheless, these methods often encounter difficulties when dealing with real-world graphs, particularly in the presence of noisy edges. Additionally, many denoising graph clustering methods tend to suffer from lower performance, training instability, and challenges in scaling to large datasets compared to non-denoised models. To tackle these issues, we introduce a new framework called the Robust Deep Graph Clustering Framework via Dual Soft Assignment (RDSA). RDSA consists of three key components: (i) a node embedding module that effectively integrates the graph's topological features and node attributes; (ii) a structure-based soft assignment module that improves graph modularity by utilizing an affinity matrix for node assignments; and (iii) a node-based soft assignment module that identifies community landmarks and refines node assignments to enhance the model's robustness. We assess RDSA on various real-world datasets, demonstrating its superior performance relative to existing state-of-the-art methods. Our findings indicate that RDSA provides robust clustering across different graph types, excelling in clustering effectiveness and robustness, including adaptability to noise, stability, and scalability.

preprint2024arXiv

Building confidence in state-of-the-art ab initio calculations of the density virial coefficients B and C of helium-4: Part 2. Direct evaluation by high accuracy experimental data using RIGT

In our previous work [1], using indirect evaluation methods we concluded that the uncertainties of the second and the third density virial coefficient, B and C, of helium-4 at 5 K calculated by various authors had been overestimated. To check the reliability of these values and appraisal of uncertainties from ab initio calculations still further, a refractive-index gas thermometry method was developed to determine simultaneously thermodynamic temperatures and density virial coefficients. Using this technique, high accuracy experimental values of B and C of helium-4 and new values of T-T90 were obtained for the range 5 K to 25 K. A direct comparison with the ab initio calculation density virial coefficients was made. Results support the conclusion of our previous work, i.e., the ab initio calculation uncertainties u(B) [J. Chem. Phys. 136, 224303 (2012)] and u(C) [J. Chem. Phys. 134, 134106 (2011)] of helium-4 were overestimated by a factor of severalfold.

preprint2024arXiv

Building confidence in state-of-the-art ab initio calculations of the density virial coefficients of B and C of helium-4: Part 1. Indirect evaluation methods using the results of SPRIGT

In this work, we propose indirect evaluation methods to check the accuracy and the uncertainty of ab initio calculated virial coefficients. To this end, we have used single-pressure refractive-index gas thermometry (SPRIGT) to estimate the impact of the second density virial coefficient B of helium-4 on temperature measurements between 5 K to 25 K. Our results, in good agreement with values of B obtained from recent ab initio calculations by Czachorowski et al. [Phys. Rev. A 102, 042810 (2020)], suggest uncertainties u(B) estimated previously by other authors were too conservative. Concerning the value of the third density virial coefficient C of helium-4, calculated ab initio by Garberoglio et al. [J. Chem. Phys. 134, 134106 (2011)], our results suggest their uncertainty u(C) is between 1.5 and 10.2 times too high.

preprint2022arXiv

BiTAT: Neural Network Binarization with Task-dependent Aggregated Transformation

Neural network quantization aims to transform high-precision weights and activations of a given neural network into low-precision weights/activations for reduced memory usage and computation, while preserving the performance of the original model. However, extreme quantization (1-bit weight/1-bit activations) of compactly-designed backbone architectures (e.g., MobileNets) often used for edge-device deployments results in severe performance degeneration. This paper proposes a novel Quantization-Aware Training (QAT) method that can effectively alleviate performance degeneration even with extreme quantization by focusing on the inter-weight dependencies, between the weights within each layer and across consecutive layers. To minimize the quantization impact of each weight on others, we perform an orthonormal transformation of the weights at each layer by training an input-dependent correlation matrix and importance vector, such that each weight is disentangled from the others. Then, we quantize the weights based on their importance to minimize the loss of the information from the original weights/activations. We further perform progressive layer-wise quantization from the bottom layer to the top, so that quantization at each layer reflects the quantized distributions of weights and activations at previous layers. We validate the effectiveness of our method on various benchmark datasets against strong neural quantization baselines, demonstrating that it alleviates the performance degeneration on ImageNet and successfully preserves the full-precision model performance on CIFAR-100 with compact backbone networks.

preprint2022arXiv

Channel Estimation with Hybrid Reconfigurable Intelligent Metasurfaces

Reconfigurable Intelligent Surfaces (RISs) are envisioned to play a key role in future wireless communications, enabling programmable radio propagation environments. They are usually considered as almost passive planar structures that operate as adjustable reflectors, giving rise to a multitude of implementation challenges, including the inherent difficulty in estimating the underlying wireless channels. In this paper, we focus on the recently conceived concept of Hybrid Reconfigurable Intelligent Surfaces (HRISs), which do not solely reflect the impinging waveform in a controllable fashion, but are also capable of sensing and processing an adjustable portion of it. We first present implementation details for this metasurface architecture and propose a convenient mathematical model for characterizing its dual operation. As an indicative application of HRISs in wireless communications, we formulate the individual channel estimation problem for the uplink of a multi-user HRIS-empowered communication system. Considering first a noise-free setting, we theoretically quantify the advantage of HRISs in notably reducing the amount of pilots needed for channel estimation, as compared to the case of purely reflective RISs. We then present closed-form expressions for the MSE performance in estimating the individual channels at the HRISs and the base station for the noisy model. Based on these derivations, we propose an automatic differentiation-based first-order optimization approach to efficiently determine the HRIS phase and power splitting configurations for minimizing the weighted sum-MSE performance. Our numerical evaluations demonstrate that HRISs do not only enable the estimation of the individual channels in HRIS-empowered communication systems, but also improve the ability to recover the cascaded channel, as compared to existing methods using passive and reflective RISs.

preprint2022arXiv

Channel Estimation with Simultaneous Reflecting and Sensing Reconfigurable Intelligent Metasurfaces

Reconfigurable Intelligent Surfaces (RISs) are envisioned to play a key role in future wireless communications, enabling programmable radio propagation environments. They are usually considered as nearly passive planar structures that operate as adjustable reflectors, giving rise to a multitude of implementation challenges, including an inherent difficulty in estimating the underlying wireless channels. In this paper, we propose the concept of Hybrid RISs (HRISs), which do not solely reflect the impinging waveform in a controllable fashion, but are also capable of sensing and processing a portion of it via some active reception elements. We first present implementation details for this novel metasurface architecture and propose a simple model for its operation, when considered for wireless communications. As an indicative application of HRISs, we formulate and solve the individual channels identification problem for the uplink of multi-user HRIS-empowered systems. Our numerical results showcase that, in the high signal-to-noise regime, HRISs enable individual channel estimation with notably reduced amounts of pilots, compared to those needed when using a purely reflective RIS that can only estimate the cascaded channel.

preprint2022arXiv

Jointly Learned Symbol Detection and Signal Reflection in RIS-Aided Multi-user MIMO Systems

Reconfigurable Intelligent Surfaces (RISs) are regarded as a key technology for future wireless communications, enabling programmable radio propagation environments. However, the passive reflecting feature of RISs induces notable challenges on channel estimation, making coherent symbol detection a challenging task. In this paper, we consider the uplink of RIS-aided multi-user Multiple-Input Multiple-Output (MIMO) systems and propose a Machine Learning (ML) approach to jointly design the multi-antenna receiver and configure the RIS reflection coefficients, which does not require explicit full knowledge of the channel input-output relationship. Our approach devises a ML-based receiver, while the configurations of the RIS reflection patterns affecting the underlying propagation channel are treated as hyperparameters. Based on this system design formulation, we propose a Bayesian ML framework for optimizing the RIS hyperparameters, according to which the transmitted pilots are directly used to jointly tune the RIS and the multi-antenna receiver. Our simulation results demonstrate the capability of the proposed approach to provide reliable communications in non-linear channel conditions corrupted by Gaussian noise.

preprint2022arXiv

Physics Embedded Machine Learning for Electromagnetic Data Imaging

Electromagnetic (EM) imaging is widely applied in sensing for security, biomedicine, geophysics, and various industries. It is an ill-posed inverse problem whose solution is usually computationally expensive. Machine learning (ML) techniques and especially deep learning (DL) show potential in fast and accurate imaging. However, the high performance of purely data-driven approaches relies on constructing a training set that is statistically consistent with practical scenarios, which is often not possible in EM imaging tasks. Consequently, generalizability becomes a major concern. On the other hand, physical principles underlie EM phenomena and provide baselines for current imaging techniques. To benefit from prior knowledge in big data and the theoretical constraint of physical laws, physics embedded ML methods for EM imaging have become the focus of a large body of recent work. This article surveys various schemes to incorporate physics in learning-based EM imaging. We first introduce background on EM imaging and basic formulations of the inverse problem. We then focus on three types of strategies combining physics and ML for linear and nonlinear imaging and discuss their advantages and limitations. Finally, we conclude with open challenges and possible ways forward in this fast-developing field. Our aim is to facilitate the study of intelligent EM imaging methods that will be efficient, interpretable and controllable.

preprint2021arXiv

UserReg: A Simple but Strong Model for Rating Prediction

Collaborative filtering (CF) has achieved great success in the field of recommender systems. In recent years, many novel CF models, particularly those based on deep learning or graph techniques, have been proposed for a variety of recommendation tasks, such as rating prediction and item ranking. These newly published models usually demonstrate their performance in comparison to baselines or existing models in terms of accuracy improvements. However, others have pointed out that many newly proposed models are not as strong as expected and are outperformed by very simple baselines. This paper proposes a simple linear model based on Matrix Factorization (MF), called UserReg, which regularizes users' latent representations with explicit feedback information for rating prediction. We compare the effectiveness of UserReg with three linear CF models that are widely-used as baselines, and with a set of recently proposed complex models that are based on deep learning or graph techniques. Experimental results show that UserReg achieves overall better performance than the fine-tuned baselines considered and is highly competitive when compared with other recently proposed models. We conclude that UserReg can be used as a strong baseline for future CF research.

preprint2020arXiv

Active suppression of temperature oscillation from a pulse-tube cryocooler in a cryogen-free cryostat: Part 1. Simulation modeling from thermal response characteristics

A cryogen-free cryostat cooled using a 4 K commercial GM or pulse tube cryocooler (PTC) displays temperature oscillations caused by the intrinsic working principle of the regenerative cryocooler. To dampen such oscillations usually requires either a large heat capacity or a large thermal resistance. To understand this phenomenon better and suppress it more effectively, both the step response characteristic and the intrinsic oscillation characteristic of cryostat have been used to obtain the complete transfer functions of a simulation model. The latter is used to test and optimize traditional PID feedback control. The results showed this approach has almost no effect on the temperature oscillation amplitude. Based on this simulation model, a novel active method was proposed and tested numerically. Simulation results predict the method should suppress the amplitude of the original temperature oscillation by a factor of two.

preprint2020arXiv

Active suppression of temperature oscillation from a pulse-tube cryocooler in a cryogen-free cryostat: Part 2. Experimental realization

A cryogen-free cryostat cooled by a closed cycle cryocooler is compact, can provide uninterrupted long-term operation (up to ten thousand hours) and is suited to temperatures from 3 K to 300 K. Its intrinsic temperature oscillation, however, limits its application in experiments requiring high thermal stability at low temperature (below 77 K). Passive suppression methods are effective but all suffer from drawbacks. We describe a novel, active suppression scheme more efficient than traditional proportional-integral (PI) control. The experimental results show that it can reduce the standard deviation of the temperature oscillation by a further 30% compared with PI feedback. To the best of our knowledge, this is the first time such active suppression of temperature oscillations has been implemented with the cryogen-free cryostat. The results also show, however, that an unwanted lower frequency thermal noise will be generated, which appears to be the limit of the method. Nevertheless, the approach could be used to improve the temperature stability in all cryogen-free cryostats.

preprint2020arXiv

Prediction and realization of a temperature control limit at low temperatures in SPRIGT

On May 20th 2019, the World Metrology Day, the Bureau International des Poids et Mesures announced a major revision to the four more SI units. The base unit, the kelvin, is defined by fixing the value of Boltzmann constant as indicated in Mise en pratique for the definition of the kelvin in the SI. To realize the new kelvin, a novel practical realization technique of single-pressure refractive-index gas thermometry (SPRIGT) has been jointly developed by the TIPC-CAS in China and the LNE-Cnam in France. To carry out accurate SPRIGT, experimental methods have been implemented and micro-kelvin level temperature control limits have been predicted and achieved at 5 K to 26 K. The resonator temperature stability can be maintained to within better than 8 μK of its set point with an integration time 33.6 s over 180 h. Besides, solutions for further improving the stability were also demonstrated, which can be a reference for temperature metrology field worldwide and other fields where high-stability temperature is required. The present work should also provide a solid foundation for international data comparison of thermodynamic temperature at low temperatures, and will promote realizations of the new kelvin and the spread of high-accuracy, low-temperature metrology.

preprint2020arXiv

Realization of ppm level pressure stability for primary thermometry using a primary piston gauge

To achieve an uncertainty of 0.25 mK in single-pressure refractive-index gas thermometry (SPRIGT), the relative pressure variation of He-4 gas in the range 30 kPa to 90 kPa, should not exceed 4 ppm (k=1). To this end, a novel pressure control system has been developed. It consists of two main parts: a piston gauge to control the pressure, and a home-made gas compensation system to supplement the micro-leak of the piston gauge. In addition, to maintain the piston at constant height, a servo loop is used that automatically determines in real time the amount of extra gas required. At room temperature, the standard deviations of the stabilized pressure are 3.0 mPa at 30 kPa, 4.5 mPa at 60 kPa and 2 mPa at 90 kPa. For the temperature region 5 K-25 K used for SPRIGT in the present work, the relative pressure stability is better than 0.16 ppm i.e. 25 times better than required. Moreover, the same pressure stabilization system is readily transposable to other primary gas thermometers.

preprint2020arXiv

Resonance frequency measurement with accuracy and stability at the 10-12 level in a copper microwave cavity below 26 K by experimental optimization

Single pressure refractive index gas thermometry (SPRIGT) is a novel primary thermometry, jointly developed by TIPC of CAS in China and LNE-Cnam in France. To realize a competitive uncertainty of 0.25 mK for thermodynamic temperature measurements, high-stability and low-uncertainty of microwave resonance frequency measurements better than 2 ppb should be achieved. This article describes how to realize high-stability and low-uncertainty of resonance frequency measurements in a copper microwave cavity by experimental optimization methods based on Allan analysis of variance. In this manner, 10-12 level accuracy and stability of microwave resonance frequency measurements were realized with an integration time of 3 hours, which is nearly 20 times better than those without optimization in our previous work (Sci. Bull 2019; 64: 286-288). It has potential applications in gas metrology and other research fields, where high-stability and low-uncertainty microwave measurements are necessary. Besides, microwave measurements were carried out isobarically at pressures of (30, 60, 90, and 120) kPa over the temperature range of (5 to 26) K, with good microwave mode consistency for the determined thermodynamic temperatures. These will provide strong support for the success of the implementation of SPRIGT in China.