Researcher profile

Zhihao Tao

Zhihao Tao contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
2topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2026arXiv

A Causal Information-Flow Framework for Unbiased Learning-to-Rank

In web search and recommendation systems, user clicks are widely used to train ranking models. However, click data is heavily biased, i.e., users tend to click higher-ranked items (position bias), choose only what was shown to them (selection bias), and trust top results more (trust bias). Without explicitly modeling these biases, the true relevance of ranked items cannot be correctly learned from clicks. Existing Unbiased Learning-to-Rank (ULTR) methods mainly correct position bias and rely on propensity estimation, but they cannot measure remaining bias, provide risk guarantees, or jointly handle multiple bias sources. To overcome these challenges, this paper introduces a novel causal learning-based ranking framework that extends ULTR by combining Structural Causal Models (SCMs) with information-theoretic tools. SCMs specify how clicks are generated and help identify the true relevance signal from click data, while conditional mutual information, measures how much bias leaks into the learned relevance estimates. We use this leakage measure to define a rigorous notion of disentanglement and include it as a regularizer during model training to reduce bias. In addition, we incorporate a causal inference estimator, i.e., doubly robust estimator, to ensure more reliable risk estimation. Experiments on standard Learning-to-Rank benchmarks show that our method consistently reduces measured bias leakage and improves ranking performance, especially in realistic scenarios where multiple biases-such as position and trust bias-interact strongly.

preprint2025arXiv

Time-Modulated Intelligent Reflecting Surfaces for Integrated Sensing, Communication and Security: A Generative AI Design Framework

We propose a novel approach to achieve physical layer security for integrated sensing and communication (ISAC) systems operating in the presence of targets that may be eavesdroppers. The system is aided by a time-modulated intelligent reflecting surface (TM-IRS), which is configured to preserve the integrity of the transmitted data at one or more legitimate communication users (CUs) while making them appear scrambled in all other directions. The TM-IRS design leverages a generative flow network (GFlowNet) framework to learn a stochastic policy that samples high-performing TM-IRS configurations from a vast discrete parameter space. Specifically, we begin by formulating the achievable sum rate for the legitimate CUs and the beampattern gain toward the target direction, based on which we construct reward functions for GFlowNets that jointly capture both communication and sensing performance. The TM-IRS design is modeled as a deterministic Markov decision process (MDP), where each terminal state corresponds to a complete configuration of TM-IRS parameters. GFlowNets, parametrized by deep neural networks are employed to learn a stochastic policy that samples TM-IRS parameter sets with probability proportional to their associated reward. Experimental results demonstrate the effectiveness of the proposed GFlowNet-based method in integrating sensing, communication and security simultaneously, and also exhibit significant sampling efficiency as compared to the exhaustive combinatorial search and enhanced robustness against the existing benchmarks of physical layer security.