Researcher profile

Hongbin Zhang

Hongbin Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
19works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

19 published item(s)

preprint2026arXiv

Exploring the Translation Mechanism of Large Language Models

While large language models (LLMs) demonstrate remarkable success in multilingual translation, their internal core translation mechanisms, even at the fundamental word level, remain insufficiently understood. To address this critical gap, this work introduces a systematic framework for interpreting the mechanism behind LLM translation from the perspective of computational components. This paper first proposes subspace-intervened path patching for precise, fine-grained causal analysis, enabling the detection of components crucial to translation tasks and subsequently characterizing their behavioral patterns in human-interpretable terms. Comprehensive experiments reveal that translation is predominantly driven by a sparse subset of components: specialized attention heads serve critical roles in extracting source language, translation indicators, and positional features, which are then integrated and processed by specific multi-layer perceptrons (MLPs) into intermediary English-centric latent representations before ultimately yielding the final translation. The significance of these findings is underscored by the empirical demonstration that targeted fine-tuning a minimal parameter subset ($<5\%$) enhances translation performance while preserving general capabilities. This result further indicates that these crucial components generalize effectively to sentence-level translation and are instrumental in elucidating more intricate translation tasks.

preprint2022arXiv

A study on rare-earth Laves phases for magnetocaloric liquefaction of hydrogen

We are witnessing a great transition towards a society powered by renewable energies to meet the ever-stringent climate target. Hydrogen, as an energy carrier, will play a key role in building a climate-neutral society. Although liquid hydrogen is essential for hydrogen storage and transportation, liquefying hydrogen is costly with the conventional methods based on Joule-Thomas effect. As an emerging technology which is potentially more efficient, magnetocaloric hydrogen liquefaction is a &#34;game-changer&#34;. In this work, we have investigated the rare-earth-based Laves phases ${\rm R}Al_2$ and ${\rm R}Ni_2$ for magnetocaloric hydrogen liquefaction. We have noticed an unaddressed feature that the magnetocaloric effect of second-order magnetocaloric materials can become &#34;giant&#34; near the hydrogen boiling point. This feature indicates strong correlations, down to the boiling point of hydrogen, among the three important quantities of the magnetocaloric effect: the maximum magnetic entropy change $ΔS_{m}^{max}$, the maximum adiabatic temperature change $ΔT_{ad}^{max}$, and the Curie temperature $T_C$. Via a comprehensive literature review, we interpret the correlations for a rare-earth intermetallic series as two trends: (1) $ΔS_{m}^{max}$ increases with decreasing $T_C$; (2) $ΔT_{ad}^{max}$ decreases near room temperature with decreasing $T_C$ but increases at cryogenic temperatures. Moreover, we have developed a mean-field approach to describe these two trends theoretically. The dependence of $ΔS_{m}^{max}$ and $ΔT_{ad}^{max}$ on $T_C$ revealed in this work helps us quickly anticipate the magnetocaloric performance of rare-earth-based compounds, guiding material design and accelerating the discoveries of magnetocaloric materials for hydrogen liquefaction.

preprint2022arXiv

High-throughput screening of Half-antiperovskites with a stacked kagome lattice

Half-antiperovskites (HAPs) are a class of materials consisting of stacked kagome lattices and thus host exotic magnetic and electronic states. We perform high-throughput calculations based on density functional theory (DFT) and atomistic spin dynamics (ASD) simulations to predict stable magnetic HAPs M$_3$X$_2$Z$_2$ (M = Cr, Mn, Fe, Co, and Ni; X is one of the elements from Li to Bi except noble gases and 4$f$ rare-earth metals; Z = S, Se, and Te), with both thermodynamical and mechanical stabilities evaluated. Additionally, the magnetic ground states are obtained by utilizing DFT calculations combined with the ASD simulations. The existing spin frustration in an AFM kagome lattice manifests as competing behavior of the in-plane FM and AFM couplings. For a total number of 930 HAP compositions considered, we have found 23 compounds that are stabilized at non-collinear antiferromagnetic (AFM) state and 11 compounds that possess ferromagnetic (FM) order.

preprint2022arXiv

Inferring random change point from left-censored longitudinal data by segmented mechanistic nonlinear models, with application in HIV surveillance study

The primary goal of public health efforts to control HIV epidemics is to diagnose and treat people with HIV infection as soon as possible after seroconversion. The timing of initiation of antiretroviral therapy (ART) treatment after HIV diagnosis is, therefore, a critical population-level indicator that can be used to measure the effectiveness of public health programs and policies at local and national levels. However, population-based data on ART initiation are unavailable because ART initiation and prescription are typically measured indirectly by public health departments (e.g., with viral suppression as a proxy). In this paper, we present a random change-point model to infer the time of ART initiation utilizing routinely reported individual-level HIV viral load from an HIV surveillance system. To deal with the left-censoring and the nonlinear trajectory of viral load data, we formulate a flexible segmented nonlinear mixed effects model and propose a Stochastic version of EM (StEM) algorithm, coupled with a Gibbs sampler for the inference. We apply the method to a random subset of HIV surveillance data to infer the timing of ART initiation since diagnosis and to gain additional insights into the viral load dynamics. Simulation studies are also performed to evaluate the properties of the proposed method.

preprint2022arXiv

Machine learning-enabled high-entropy alloy discovery

High-entropy alloys are solid solutions of multiple principal elements, capable of reaching composition and feature regimes inaccessible for dilute materials. Discovering those with valuable properties, however, relies on serendipity, as thermodynamic alloy design rules alone often fail in high-dimensional composition spaces. Here, we propose an active-learning strategy to accelerate the design of novel high-entropy Invar alloys in a practically infinite compositional space, based on very sparse data. Our approach works as a closed-loop, integrating machine learning with density-functional theory, thermodynamic calculations, and experiments. After processing and characterizing 17 new alloys (out of millions of possible compositions), we identified 2 high-entropy Invar alloys with extremely low thermal expansion coefficients around 2*10-6 K-1 at 300 K. Our study thus opens a new pathway for the fast and automated discovery of high-entropy alloys with optimal thermal, magnetic and electrical properties.

preprint2022arXiv

MNL-Bandits under Inventory and Limited Switches Constraints

Optimizing the assortment of products to display to customers is a key to increasing revenue for both offline and online retailers. To trade-off between exploring customers&#39; preference and exploiting customers&#39; choices learned from data, in this paper, by adopting the Multi-Nomial Logit (MNL) choice model to capture customers&#39; choices over products, we study the problem of optimizing assortments over a planning horizon $T$ for maximizing the profit of the retailer. To make the problem setting more practical, we consider both the inventory constraint and the limited switches constraint, where the retailer cannot use up the resource inventory before time $T$ and is forbidden to switch the assortment shown to customers too many times. Such a setting suits the case when an online retailer wants to dynamically optimize the assortment selection for a population of customers. We develop an efficient UCB-like algorithm to optimize the assortments while learning customers&#39; choices from data. We prove that our algorithm can achieve a sub-linear regret bound $\tilde{O}\left(T^{1-α/2}\right)$ if $O(T^α)$ switches are allowed. %, and our regret bound is optimal with respect to $T$. Extensive numerical experiments show that our algorithm outperforms baselines and the gap between our algorithm&#39;s performance and the theoretical upper bound is small.

preprint2022arXiv

Quantitative Evaluation of Common Cause Failures in High Safety-significant Safety-related Digital Instrumentation and Control Systems in Nuclear Power Plants

Digital instrumentation and control (DIC) systems at nuclear power plants (NPPs) have many advantages over analog systems. They are proven to be more reliable, cheaper, and easier to maintain given obsolescence of analog components. However, they also pose new engineering and technical challenges, such as possibility of common cause failures (CCFs) unique to digital systems. This paper proposes a Platform for Risk Assessment of DIC (PRADIC) that is developed by Idaho National Laboratory (INL). A methodology for evaluation of software CCFs in high safety-significant safety-related DIC systems of NPPs was developed as part of the framework. The framework integrates three stages of a typical risk assessment, qualitative hazard analysis and quantitative reliability and consequence analyses. The quantified risks compared with respective acceptance criteria provide valuable insights for system architecture alternatives allowing design optimization in terms of risk reduction and cost savings. A comprehensive case study performed to demonstrate the framework capabilities is documented in this paper. Results show that the PRADIC is a powerful tool capable to identify potential digital-based CCFs, estimate their probabilities, and evaluate their impacts on system and plant safety.

preprint2022arXiv

Systems-theoretic Hazard Analysis of Digital Human-System Interface Relevant to Reactor Trip

Human-system interface is one of the key advanced design features applied to modern digital instrumentation and control systems of nuclear power plants. The conventional design is based on a compact workstation-based system within the control room. The compact workstation provides both a strategic operating environment while also a convenient display for plant status information necessary to the operator. The control environment is further enhanced through display panels, visual and auditory alarms, and procedure systems. However, just like the legacy control, the HSI should incorporate diversity to demonstrate sufficient defense-in-depth protection against common cause failures of the safety system. Furthermore, the vulnerability of the HSI is affected by a plethora of factors, such as human error, cyberattacks, software common cause failures, etc., that complicate the design and analysis. Therefore, this work aims to identify and evaluate existing system vulnerabilities to support the licensing, deployment and operation of HSI designs, especially the functions that are relevant to a reactor trip. We performed a systematic hazard analysis to investigate potential vulnerabilities within the HSI design using the novel redundancy-guided systems-theoretic hazard analysis. This method was developed and demonstrated by Idaho National Laboratory under a project initiated by the Risk-Informed Systems Analysis Pathway of the U.S. Department of Energy&#39;s Light Water Reactor Sustainability Program. The goal of the project is to develop a strong technical basis for risk assessment strategies to support effective, reliable, and licensable digital instrumentation and control technologies.

preprint2022arXiv

The synergistic modulation of electronic and geometry structures leads to ultra-low thermal conductivity of graphene-like borides (g-B3X5, X=N, P, As)

The design of novel devices with specific technical interests through modulating structural properties and bonding characteristics promotes the vigorous development of materials informatics. Herein, we propose a synergy strategy of component reconstruction by combining geometric configuration and bonding characteristics. With the synergy strategy, we designed a novel two-dimensional (2D) graphene-like borides, e.g. g-B3N5, which possesses counter-intuitive ultra-low thermal conductivity of 21.08 W/mK despite the small atomic mass. The ultra-low thermal conductivity is attributed to the synergy effect of electronics and geometry on thermal transport due to the combining reconstruction of g-BN and nitrogene. With the synergy effect, the dominant acoustic branches are strongly softened, and the scattering absorption and Umklapp process are simultaneously suppressed. Thus, the thermal conductivity is significantly lowered. To verify the component reconstruction strategy, we further constructed g-B3P5 and g-B3As5, and uncovered the ultra-low thermal conductivity of 2.50 and 1.85 W/mK, respectively. The synergy effect and the designed ultra-low thermal conductivity materials with lightweight atomic mass cater to the demand for light development of momentum machinery and heat protection, such as aerospace vehicles, high-speed rail, automobiles.

preprint2022arXiv

Thermodynamical and topological properties of metastable Fe3Sn

Combining experimental data, first-principles calculations, and Calphad assessment, thermodynamic and topological transport properties of the Fe-Sn system were investigated. Density functional theory (DFT) calculations were performed to evaluate the intermetallics&#39; finite-temperature heat capacity (Cp). A consistent thermodynamic assessment of the Fe-Sn phase diagram was achieved by using the experimental and DFT results, together with all available data from previous publications. Hence, the metastable phase Fe3Sn was firstly introduced into the current metastable phase diagram, and corrected phase locations of Fe5Sn3 and Fe3Sn2 under the newly measured corrected temperature ranges. Furthermore, the anomalous Hall conductivity and anomalous Nernst conductivity of Fe3Sn were calculated, with magnetization directions and doping considered as perturbations to tune such transport properties. It was observed that the enhanced anomalous Hall and Nernst conductivities originate from the combination of nodal lines and small gap areas that can be tuned by doping Mn at Fe sites and varying magnetization direction

preprint2021arXiv

Novel Two-Dimensional Layered MSi$_2$N$_4$ (M = Mo, W): New Promising Thermal Management Materials

With the miniaturization and integration of nanoelectronic devices, efficient heat removal becomes a key factor affecting the reliable operation of the nanoelectronic device. With the high intrinsic thermal conductivity, good mechanical flexibility, and precisely controlled growth, two-dimensional (2D) materials are widely accepted as ideal candidates for thermal management materials. In this work, by solving the phonon Boltzmann transport equation (BTE) based on first-principles calculations, we comprehensively investigated the thermal conductivity of novel 2D layered MSi$_2$N$_4$ (M = Mo, W). Our results point to competitive thermal conductivities (162 W/mK) of monolayer MoSi$_2$N$_4$, which is around two times larger than that of WSi$_2$N$_4$ and seven times larger than that of silicene despite their similar non-planar structures. It is revealed that the high thermal conductivity arises mainly from its large group velocity and low anharmonicity. Our result suggests that MoSi$_2$N$_4$ could be a potential candidate for 2D thermal management materials.

preprint2021arXiv

Spin Hall Conductivity and Anomalous Hall Conductivity in Full Heusler compounds

The spin Hall conductivity (SHC) and anomalous Hall conductivity (AHC) in more than 120 full Heusler compounds are calculated using density functional theory in a high-throughtput way. The electronic structures are mapped to the Wannier basis and the linear response theory is used to get the conductivity. Our results show that the mechanism under the SHC or AHC cannot be simply related to the valence electron numbers or atomic weights, is related to the very details of the electronic structure, which can only be obtained by calculations. A high throughput calculation is efficient to screen out the desired materials. According to our present results, Cu2CoSn, as well as Co2MnAl and Co2MnGa are candidates in spintronic materials regarding to their high SHC and AHC values, which can benefit the spin-torque-driven nanodevices.

preprint2020arXiv

A Redundancy-Guided Approach for the Hazard Analysis of Digital Instrumentation and Control Systems in Advanced Nuclear Power Plants

Digital instrumentation and control (I&C) upgrades are a vital research area for nuclear industry. Despite their performance benefits, deployment of digital I&C in nuclear power plants (NPPs) has been limited. Digital I&C systems exhibit complex failure modes including common cause failures (CCFs) which can be difficult to identify. This paper describes the development of a redundancy-guided application of the Systems-Theoretic Process Analysis (STPA) and Fault Tree Analysis (FTA) for the hazard analysis of digital I&C in advanced NPPs. The resulting Redundancy-guided System-theoretic Hazard Analysis (RESHA) is applied for the case study of a representative state-of-the-art digital reactor trip system. The analysis qualitatively and systematically identifies the most critical CCFs and other hazards of digital I&C systems. Ultimately, RESHA can help researchers make informed decisions for how, and to what degree, defensive measures such as redundancy, diversity, and defense-in-depth can be used to mitigate or eliminate the potential hazards of digital I&C systems.

preprint2020arXiv

Deep Learning Interfacial Momentum Closures in Coarse-Mesh CFD Two-Phase Flow Simulation Using Validation Data

Multiphase flow phenomena have been widely observed in the industrial applications, yet it remains a challenging unsolved problem. Three-dimensional computational fluid dynamics (CFD) approaches resolve of the flow fields on finer spatial and temporal scales, which can complement dedicated experimental study. However, closures must be introduced to reflect the underlying physics in multiphase flow. Among them, the interfacial forces, including drag, lift, turbulent-dispersion and wall-lubrication forces, play an important role in bubble distribution and migration in liquid-vapor two-phase flows. Development of those closures traditionally rely on the experimental data and analytical derivation with simplified assumptions that usually cannot deliver a universal solution across a wide range of flow conditions. In this paper, a data-driven approach, named as feature-similarity measurement (FSM), is developed and applied to improve the simulation capability of two-phase flow with coarse-mesh CFD approach. Interfacial momentum transfer in adiabatic bubbly flow serves as the focus of the present study. Both a mature and a simplified set of interfacial closures are taken as the low-fidelity data. Validation data (including relevant experimental data and validated fine-mesh CFD simulations results) are adopted as high-fidelity data. Qualitative and quantitative analysis are performed in this paper. These reveal that FSM can substantially improve the prediction of the coarse-mesh CFD model, regardless of the choice of interfacial closures, and it provides scalability and consistency across discontinuous flow regimes. It demonstrates that data-driven methods can aid the multiphase flow modeling by exploring the connections between local physical features and simulation errors.

preprint2020arXiv

Effect of N, C and B interstitials on the structural and magnetic properties of alloys with Cu$_3$Au-structure

High-throughput density functional calculations are used to investigate the effect of interstitial B, C and N atoms on 21 alloys reported to crystallize in the cubic Cu$_3$Au structure. It is shown that the interstitials can have a significant impact on the magneto-crystalline anisotropy energy (MAE), the thermodynamic stability and the magnetic ground state structure, making these alloys interesting for hard magnetic, magnetocaloric and other applications. For 29 alloy/interstitial combinations the formation of stable alloys with interstitial concentrations above 5\% is expected. In Ni$_3$Mn interstitial N induces a tetragonal distortion with substantial uniaxial MAE for realistic N concentrations. Mn$_3X$N$_x$ ($X$=Rh, Ir, Pt and Sb) are identified as alloys with strong magneto-crystalline anisotropy. For Mn$_3$Ir we find a strong enhancement of the MAE upon N alloying in the most stable collinear ferrimagnetic state as well as in the non-collinear magnetic ground state. Mn$_3$Ir and Mn$_3$IrN show also interesting topological transport properties. The effect of N concentration and strain on the magnetic properties are discussed. Further, the huge impact of N on the MAE of Mn$_3$Ir and a possible impact of interstitial N on amorphous Mn$_3$Ir, a material that is indispensable in today&#39;s data storage devices, are discussed at hand of the electronic structure. For Mn$_3$Sb, non-collinear, ferrimagnetic and ferromagnetic states are very close in energy, making this material potentially interesting for magnetocaloric applications. For the investigated Mn alloys and competing phases, the determination of the magnetic ground state is essential for a reliable prediction of the phase stability.

preprint2020arXiv

High-throughput Design of Magnetic Materials

Materials design based on density functional theory (DFT) calculations is an emergent field of great potential to accelerate the development and employment of novel materials. Magnetic materials play an essential role in green energy applications as they provide efficient ways of harvesting, converting, and utilizing energy. In this review, after a brief introduction to the major functionalities of magnetic materials, we demonstrated the fundamental properties which can be tackled via high-throughput DFT calculations, with a particular focus on the current challenges and feasible solutions. Successful case studies are summarized on several classes of magnetic materials, followed by bird-view perspectives for the future.

preprint2020arXiv

Using Deep Learning to Explore Local Physical Similarity for Global-scale Bridging in Thermal-hydraulic Simulation

Current system thermal-hydraulic codes have limited credibility in simulating real plant conditions, especially when the geometry and boundary conditions are extrapolated beyond the range of test facilities. This paper proposes a data-driven approach, Feature Similarity Measurement FFSM), to establish a technical basis to overcome these difficulties by exploring local patterns using machine learning. The underlying local patterns in multiscale data are represented by a set of physical features that embody the information from a physical system of interest, empirical correlations, and the effect of mesh size. After performing a limited number of high-fidelity numerical simulations and a sufficient amount of fast-running coarse-mesh simulations, an error database is built, and deep learning is applied to construct and explore the relationship between the local physical features and simulation errors. Case studies based on mixed convection have been designed for demonstrating the capability of data-driven models in bridging global scale gaps.

preprint2019arXiv

Piezospintronic effect in antiperovskite Mn$_3$GaN

Based on first-principles calculations, we investigated the topological transport properties of Mn$_3$GaN with coplanar noncollinear magnetic structures. The intrinsic anomalous Hall conductivity (IAHC) displays a significant dependence with respect to the in-plane magnetization direction between the $Γ_{5g}$ and $Γ_{4g}$ magnetic configurations, where large anomalous Nernst effect (ANE) can be induced by tailoring the magnetization direction. Moreover, we observed strong piezospintronic effect in Mn$_3$GaN, where large IAHC can be induced by moderate epitaxial strain. Symmetry analysis reveals that for both cases, the nonzero IAHC is originated from the spin-orbit coupling instead of the noncollinear magnetic configurations

preprint2019arXiv

Tunning Spin Hall conductivities in GeTe by Ferroelectric Polarization

Controlling charge-spin current conversion by electric fields is crucial in spintronic devices, which can be realized in diatom ferroelectric semiconductor GeTe where it is established that ferroelectricity can change the spin texture. We demonstrated that the spin Hall conductivity (SHC) can be further tuned by ferroelectricity based on the density functional theory calculations. The spin texture variation driven by the electric fields was elucidated from the symmetry point of view, highlighting the interlocked spin and orbital degrees of freedom. We observed that the origin of SHC can be attributed to the Rashba effect and the intrinsic spin-orbit coupling. The magnitude of one component of SHC σ_xy^z can reach as large as 100 {\hbar}/e/(Ωcm) in the vicinity of the band edge, which is promising for engineering spintronic devices. Our work on tunable spin transport properties via the ferroelectric polarization brings novel assets into the field of spintronics.