Source author record

Zhenyu Zhao

Zhenyu Zhao appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Machine Learning, physics.ins-det, Applications, Numerical Analysis, Artificial Intelligence, Computation, Computation and Language, Computer Vision, cond-mat.mtrl-sci, Cryptography and Security, cs.CY, eess.SY, Human-Computer Interaction, math.NA, math.OC, Methodology, Multiagent Systems, Networking and Internet Architecture, Systems and Control

Catalog footprint

What is connected

14 works

19 topics

4 close collaborators

preprint · 2026 · arXiv

Diffusion Attention Expert Model for Predicting and Semi-automatic Localizing STAS in Lung Cancer Histopathological Images

Accurate intraoperative and postoperative diagnosis of spread through air spaces (STAS) is essential for guiding surgical decisions and postoperative management in lung cancer. However, histopathological assessment is labor-intensive and is prone to missed or incorrect diagnoses. We propose a Diffusion Attention Expert Model (DAEM) to detect STAS in frozen sections (FSs) and paraffin sections (PSs). Its diffusion attention expert module leverages full attention aggregation to learn multi-scale features from histopathological images, while a dual-branch architecture strengthens multi-scale feature representation. On an internal dataset, DAEM achieves AUCs of 0.8946 for FSs and 0.9112 for PSs. Validation on external multi-center datasets from eight institutions demonstrates strong generalizability and interpretability. Using tumor microenvironment (TME) features in PSs, we further enable semi-automatic measurement of STAS location and its distance from the primary tumor. Several quantitative TME metrics are identified as potential biomarkers for STAS, including micropapillary-type STAS. Overall, DAEM offers a clinically actionable framework for STAS assessment by enabling accurate and interpretable detection on FSs and PSs, supporting postoperative risk stratification through quantitative TME-based analysis.

preprint · 2026 · arXiv

Fourier Extension Based on Weighted Generalized Inverse

This paper introduces a weighted generalized inverse framework for Fourier extensions, designed to suppress spurious oscillations in the extended region while maintaining high approximation accuracy on the original interval. By formulating the Fourier extension problem as a compact operator equation, we propose a weighted best-approximation solution that incorporates a priori smoothness information through suitable weight operators on the Fourier coefficients. This leads to a regularization scheme based on the generalized truncated singular value decomposition (GTSVD). Under algebraic and exponential smoothness assumptions, convergence analysis demonstrates optimal $L^2$ accuracy and improved stability for derivatives. Compared with classical Fourier extension using standard TSVD, the proposed method effectively controls high-frequency components and yields smoother extensions. A practical discretization using uniform sampling is developed, along with an adaptive design of weight functions. Numerical experiments confirm that the method significantly improves derivative approximations and reduces oscillations in the extended domain without compromising accuracy on the original interval.
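For orientation, the classical TSVD baseline that the abstract compares against can be written as below. This is a sketch of the standard textbook formulation, assuming a function $f$ on $[-1,1]$ extended to period $2T$ with $T>1$. The symbols $T$, $N$, $A$, $b$, and $\tau$ are notation chosen here and may differ from the paper's; the weighted (GTSVD) variant the paper proposes is not reproduced.

```latex
% Fourier extension: approximate f on [-1,1] by a series that is periodic on [-T,T], T > 1
f(x) \approx \sum_{k=-N}^{N} c_k \, e^{i k \pi x / T}, \qquad x \in [-1, 1].

% Sampling at points x_1, ..., x_M in [-1,1] gives an ill-conditioned linear system
A c = b, \qquad A_{mk} = e^{i k \pi x_m / T}, \qquad b_m = f(x_m).

% With the SVD A = U \Sigma V^{*}, truncated SVD keeps only singular values above \tau
c_{\tau} = \sum_{\sigma_j > \tau} \frac{u_j^{*} b}{\sigma_j} \, v_j .
```

Coefficient directions with small singular values barely change the fit on $[-1,1]$ but can be large in the extension region, which is consistent with the oscillations the abstract says the weighted approach is designed to suppress.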

preprint · 2026 · arXiv

Multipath Routing for Multi-Hop UAV Networks

Multi-hop uncrewed aerial vehicle (UAV) networks are a promising way to extend terrestrial network coverage. Existing multi-hop UAV networks employ a single routing path by selecting the next-hop forwarding node in a hop-by-hop manner, which leads to local congestion and increases traffic delays. In this paper, a novel traffic-adaptive multipath routing method is proposed for multi-hop UAV networks, which enables each UAV to dynamically split and forward traffic flows across multiple next-hop neighbors, thus meeting latency requirements of diverse traffic flows in dynamic mobile environments. An on-time packet delivery ratio maximization problem is formulated to determine the traffic splitting ratios at each hop. This sequential decision-making problem is modeled as a decentralized partially observable Markov decision process (Dec-POMDP). To solve this Dec-POMDP, a novel multi-agent deep reinforcement learning (MADRL) algorithm, termed Independent Proximal Policy Optimization with Dirichlet Modeling (IPPO-DM), is developed. Specifically, the IPPO serves as the core optimization framework, where the Dirichlet distribution is leveraged to parameterize a continuous stochastic policy network on the probability simplex, inherently ensuring feasible traffic splitting ratios. Simulation results demonstrate that IPPO-DM outperforms benchmark schemes in terms of both delivery latency guarantee and packet loss performance.
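The reason a Dirichlet policy yields feasible splitting ratios can be illustrated with a small sketch. It assumes a policy network has already produced positive concentration parameters, one per next-hop neighbor; the function name `sample_split_ratios` and the example values are hypothetical and not taken from the paper.

```python
import random

def sample_split_ratios(concentrations, rng):
    """Draw one Dirichlet sample by normalizing independent Gamma draws.

    Every component is non-negative and the components sum to 1, so the
    result can be read directly as traffic fractions per next-hop neighbor.
    """
    draws = [rng.gammavariate(alpha, 1.0) for alpha in concentrations]
    total = sum(draws)
    return [d / total for d in draws]

# Hypothetical concentration parameters for three next-hop neighbors.
rng = random.Random(0)
ratios = sample_split_ratios([2.0, 1.0, 0.5], rng)
print(ratios, sum(ratios))
```

Because every sample already lies on the probability simplex, no clipping or renormalization step is needed after the action is drawn.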

preprint · 2026 · arXiv

Shorthand for Thought: Compressing LLM Reasoning via Entropy-Guided Supertokens

Reasoning in Large Language Models incurs significant inference-time compute, yet the token-level information structure of reasoning traces remains underexplored. We observe that reasoning tokens split into two functional types: low-entropy structural tokens (recurring phrases that scaffold the reasoning process) and higher-entropy organic tokens (problem-specific content that drives toward a solution). This asymmetry motivates a simple, model-agnostic compression pipeline: apply cross-word BPE merges on a model's own reasoning traces to derive supertokens that capture frequent structural patterns, then teach the model to adopt them via supervised fine-tuning. Across three model families and five mathematical reasoning benchmarks, our approach shortens reasoning traces by 8.1% on average with no statistically significant accuracy loss on any model-benchmark pair. Beyond compression, supertokens act as interpretable reasoning-move annotations (backtracking, verification, strategy shifts), exposing the model's high-level strategy at a glance. Analyzing transitions between structural categories reveals systematic differences between correct and incorrect traces: correct traces show productive recovery (backtracking followed by strategy shifts and verification), while incorrect traces are dominated by confusion cycles (repeated hedging and unresolved contradictions). These diagnostic signals suggest applications in reward shaping and early stopping for RL-based reasoning training.
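The cross-word merge step can be sketched in a few lines. This is a toy illustration only: it merges whitespace-split words as a stand-in for the model's real tokenizer, and the function names, the `_` joiner, and the example traces are all hypothetical rather than the paper's pipeline.

```python
from collections import Counter

def merge_pair(tokens, pair, merged):
    """Replace each adjacent occurrence of `pair` with the single token `merged`."""
    out, i = [], 0
    while i < len(tokens):
        if i + 1 < len(tokens) and (tokens[i], tokens[i + 1]) == pair:
            out.append(merged)
            i += 2
        else:
            out.append(tokens[i])
            i += 1
    return out

def learn_supertokens(traces, num_merges):
    """Greedy BPE-style merging of the most frequent adjacent token pair."""
    corpus = [trace.split() for trace in traces]
    merges = []
    for _ in range(num_merges):
        counts = Counter()
        for tokens in corpus:
            counts.update(zip(tokens, tokens[1:]))
        if not counts:
            break
        pair, freq = counts.most_common(1)[0]
        if freq < 2:  # nothing recurs any more, so stop merging
            break
        merged = pair[0] + "_" + pair[1]
        merges.append(merged)
        corpus = [merge_pair(tokens, pair, merged) for tokens in corpus]
    return merges, corpus

# Made-up traces sharing a recurring structural phrase.
traces = [
    "wait let me check the sum",
    "wait let me check the sign",
    "so the answer is 4",
]
merges, corpus = learn_supertokens(traces, num_merges=3)
print(merges)
print(corpus)
```

In this toy corpus the recurring phrase collapses into fewer tokens while the problem-specific words are left untouched, which mirrors the structural versus organic distinction drawn in the abstract.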

preprint · 2024 · arXiv

Electrochemical Removal of HF from Carbonate-based $LiPF_6$-containing Li-ion Battery Electrolytes

Due to the hydrolytic instability of $LiPF_6$ in carbonate-based solvents, HF is a typical impurity in Li-ion battery electrolytes. HF significantly influences the performance of Li-ion batteries, for example by impacting the formation of the solid electrolyte interphase at the anode and by affecting transition metal dissolution at the cathode. Additionally, HF complicates studying fundamental interfacial electrochemistry of Li-ion battery electrolytes, such as direct anion reduction, because it is electrocatalytically relatively unstable, resulting in LiF passivation layers. Methods to selectively remove ppm levels of HF from $LiPF_6$-containing carbonate-based electrolytes are limited. We introduce and benchmark a simple yet efficient electrochemical in situ method to selectively remove ppm amounts of HF from $LiPF_6$-containing carbonate-based electrolytes. The basic idea is the application of a suitable potential to a high surface-area metallic electrode upon which only HF reacts (electrocatalytically) while all other electrolyte components are unaffected under the respective conditions.

preprint · 2022 · arXiv

A Survey of Impedance Measurement Methods in Power Electronics

Impedance is one of the vital parameters that provides useful information for many power electronics related applications. A lot of impedance measurement methods in power electronics have been reported. However, a comprehensive investigation among these methods in terms of their characteristics, advantages, and limitations has not been found in the literature. In order to bridge this gap, a survey of the impedance measurement methods is conducted in this paper. These methods are introduced, discussed, and then classified into different categories depending on the measurement modes, principles, and instruments. Moreover, recommendations for the future research on the impedance measurement are also presented.

preprint · 2022 · arXiv

Effective Anomaly Detection in Smart Home by Integrating Event Time Intervals

Smart home IoT systems and devices are susceptible to attacks and malfunctions. As a result, users' concerns about their security and safety issues arise along with the prevalence of smart home deployments. In a smart home, various anomalies (such as fire or flooding) could happen, due to cyber attacks, device malfunctions, or human mistakes. These concerns motivate researchers to propose various anomaly detection approaches. Existing works on smart home anomaly detection focus on checking the sequence of IoT devices' events but leave out the temporal information of events. This limitation prevents them from detecting anomalies that cause delays rather than missing or injected events. To fill this gap, in this paper, we propose a novel anomaly detection method that takes the inter-event intervals into consideration. We propose an innovative metric to quantify the temporal similarity between two event sequences. We design a mechanism to learn the temporal patterns of event sequences of common daily activities. Delay-caused anomalies are detected by comparing the sequence with the learned patterns. We collect device events from a real-world testbed for training and testing. The experiment results show that our proposed method achieves accuracies of 93%, 88%, and 89% for three daily activities.

preprint · 2022 · arXiv

Feature Selection Methods for Uplift Modeling and Heterogeneous Treatment Effect

Uplift modeling is a causal learning technique that estimates subgroup-level treatment effects. It is commonly used in industry and elsewhere for tasks such as targeting ads. In a typical setting, uplift models can take thousands of features as inputs, which is costly and results in problems such as overfitting and poor model interpretability. Consequently, there is a need to select a subset of the most important features for modeling. However, traditional methods for doing feature selection are not fit for the task because they are designed for standard machine learning models whose target is importantly different from uplift models. To address this, we introduce a set of feature selection methods explicitly designed for uplift modeling, drawing inspiration from statistics and information theory. We conduct empirical evaluations on the proposed methods on publicly available datasets, demonstrating the advantages of the proposed methods compared to traditional feature selection. We make the proposed methods publicly available as a part of the CausalML open-source package.

preprint · 2022 · arXiv

In-Circuit Impedance Measurement Setups of Inductive Coupling Approach: A Review

In-circuit impedance measurement provides useful information for many EMC applications. The inductive coupling approach is a promising in-circuit impedance measurement method due to its non-contact characteristics and simple on-site implementation. Many measurement setups of this approach were reported. However, a comprehensive survey and comparison of these setups have not been found in the literature. This paper reviews these setups in terms of their characteristics, limitations, and applications. In addition, recommendations for future research are also presented.

preprint · 2022 · arXiv

Inductively Coupled In-Circuit Impedance Measurement and Its EMC Applications

In-circuit impedance provides key information for many EMC applications. The inductive coupling approach is a promising method for in-circuit impedance measurement because its measurement setups have no direct electrical contact with the energized system under test, thus greatly simplifying the on-site implementation. This paper presents and summarizes the latest research on the inductive coupling approach and its EMC applications. First of all, three common measurement setups for this approach and their respective pros and cons are discussed. Subsequently, their EMC applications are introduced. Finally, recommendations for future research are listed.

preprint · 2022 · arXiv

Inform Product Change through Experimentation with Data-Driven Behavioral Segmentation

Online controlled experimentation is widely adopted for evaluating new features in the rapid development cycle for web products and mobile applications. Measurement of the overall experiment sample is a common practice to quantify the overall treatment effect. In order to understand why the treatment effect occurs in a certain way, segmentation becomes a valuable approach to a finer analysis of experiment results. This paper introduces a framework for creating and utilizing user behavioral segments in online experimentation. By using the data of user engagement with individual product components as input, this method defines segments that are closely related to the features being evaluated in the product development cycle. With a real-world example, we demonstrate that the analysis with such behavioral segments offered deep, actionable insights that successfully informed product decision-making.

preprint · 2020 · arXiv

CausalML: Python Package for Causal Machine Learning

CausalML is a Python implementation of algorithms related to causal inference and machine learning. Algorithms combining causal inference and machine learning have been a trending topic in recent years. This package tries to bridge the gap between theoretical work on methodology and practical applications by making a collection of methods in this field available in Python. This paper introduces the key concepts, scope, and use cases of this package.

preprint · 2020 · arXiv

Uplift Modeling for Multiple Treatments with Cost Optimization

Uplift modeling is an emerging machine learning approach for estimating the treatment effect at an individual or subgroup level. It can be used for optimizing the performance of interventions such as marketing campaigns and product designs. Uplift modeling can be used to estimate which users are likely to benefit from a treatment and then prioritize delivering or promoting the preferred experience to those users. An important but so far neglected use case for uplift modeling is an experiment with multiple treatment groups that have different costs, for example when different communication channels and promotion types are tested simultaneously. In this paper, we extend standard uplift models to support multiple treatment groups with different costs. We evaluate the performance of the proposed models using both synthetic and real data. We also describe a production implementation of the approach.
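One simple way to fold treatment cost into the choice among several treatments is sketched below. The net-value rule used here (uplift times conversion value, minus a per-user cost) is an assumption made for illustration and is not necessarily the paper's formulation; `best_treatment` and all numbers are hypothetical.

```python
def best_treatment(p_control, p_treated, conversion_value, costs):
    """Pick the treatment with the highest expected net value for one user.

    p_control: predicted conversion probability without any treatment.
    p_treated: dict of treatment name -> predicted conversion probability.
    costs:     dict of treatment name -> cost of applying that treatment.
    Falls back to "control" when no treatment has positive net value.
    """
    best_name, best_net = "control", 0.0
    for name, p in p_treated.items():
        uplift = p - p_control
        net = uplift * conversion_value - costs[name]
        if net > best_net:
            best_name, best_net = name, net
    return best_name, best_net

# Hypothetical predictions for a single user.
choice, net = best_treatment(
    p_control=0.10,
    p_treated={"email": 0.12, "coupon": 0.18},
    conversion_value=50.0,
    costs={"email": 0.1, "coupon": 5.0},
)
print(choice, net)
```

Here the coupon has the larger uplift but its cost outweighs the gain, so the cheaper email treatment is selected. That is the kind of trade-off a cost-aware setup has to resolve.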

preprint · 2015 · arXiv

Relaxed Majorization-Minimization for Non-smooth and Non-convex Optimization

We propose a new majorization-minimization (MM) method for non-smooth and non-convex programs, which is general enough to include the existing MM methods. Besides the local majorization condition, we only require that the difference between the directional derivatives of the objective function and its surrogate function vanishes when the number of iterations approaches infinity, which is a very weak condition. So our method can use a surrogate function that directly approximates the non-smooth objective function. In comparison, all the existing MM methods construct the surrogate function by approximating the smooth component of the objective function. We apply our relaxed MM methods to the robust matrix factorization (RMF) problem with different regularizations, where our locally majorant algorithm shows advantages over the state-of-the-art approaches for RMF. This is the first algorithm for RMF ensuring, without extra assumptions, that any limit point of the iterates is a stationary point.
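For readers unfamiliar with MM, the basic iteration can be shown on a one-dimensional non-smooth problem. The sketch below is classical MM with a quadratic surrogate, not the relaxed method proposed in the paper: it minimizes the sum of absolute deviations, whose minimizer is the median. The function names and the `eps` safeguard are choices made here.

```python
def l1_objective(x, points):
    """Non-smooth objective: sum of absolute deviations from x."""
    return sum(abs(x - a) for a in points)

def mm_median(points, x0, iterations=100, eps=1e-12):
    """Minimize sum |x - a_i| by repeatedly minimizing a quadratic majorizer.

    At the current iterate x_k, each |x - a_i| is bounded above by
    (x - a_i)^2 / (2 |x_k - a_i|) + |x_k - a_i| / 2, with equality at x = x_k.
    Minimizing the sum of these bounds gives a weighted average of the points.
    eps only guards against division by zero when x_k lands on a data point.
    """
    x = x0
    for _ in range(iterations):
        weights = [1.0 / max(abs(x - a), eps) for a in points]
        x = sum(w * a for w, a in zip(weights, points)) / sum(weights)
    return x

points = [1.0, 2.0, 3.0, 4.0, 100.0]
x0 = sum(points) / len(points)  # start from the mean, which the outlier distorts
x_star = mm_median(points, x0)
print(x_star, l1_objective(x_star, points))
```

Each surrogate touches the objective at the current iterate and lies above it elsewhere, so every step can only decrease the objective. The iterates move from the mean toward the median of the data.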
