Researcher profile

Xiaoling Hu

Xiaoling Hu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
13works
0followers
8topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

13 published item(s)

preprint2026arXiv

ORCE: Order-Aware Alignment of Verbalized Confidence in Large Language Models

Large language models (LLMs) often produce answers with high certainty even when they are incorrect, making reliable confidence estimation essential for deployment in real-world scenarios. Verbalized confidence, where models explicitly state their confidence in natural language, provides a flexible and user-facing uncertainty signal that can be applied even when token logits are unavailable. However, existing verbalized-confidence methods often optimize answer generation and confidence generation jointly, which can cause confidence-alignment objectives to interfere with answer accuracy. In this work, we propose a decoupled and order-aware framework for verbalized confidence calibration. Our method first generates an answer and then estimates confidence conditioned on the fixed question--answer pair, allowing confidence optimization without directly perturbing the answer-generation process. To align confidence with correctness likelihood, we construct a sampling-based surrogate from multiple model completions and optimize rank-based reinforcement learning objectives that encourage responses with higher estimated correctness likelihood to receive higher verbalized confidence. Experiments on reasoning and knowledge-intensive benchmarks show that our method improves calibration and failure prediction performance while largely preserving answer accuracy. These results demonstrate that verbalized confidence can be more reliably aligned by decoupling confidence estimation from answer generation and optimizing the relative ordering of confidence across responses.

preprint2022arXiv

A Manifold View of Adversarial Risk

The adversarial risk of a machine learning model has been widely studied. Most previous works assume that the data lies in the whole ambient space. We propose to take a new angle and take the manifold assumption into consideration. Assuming data lies in a manifold, we investigate two new types of adversarial risk, the normal adversarial risk due to perturbation along normal direction, and the in-manifold adversarial risk due to perturbation within the manifold. We prove that the classic adversarial risk can be bounded from both sides using the normal and in-manifold adversarial risks. We also show with a surprisingly pessimistic case that the standard adversarial risk can be nonzero even when both normal and in-manifold risks are zero. We finalize the paper with empirical studies supporting our theoretical results. Our results suggest the possibility of improving the robustness of a classifier by only focusing on the normal adversarial risk.

preprint2022arXiv

A Topology-Attention ConvLSTM Network and Its Application to EM Images

Structural accuracy of segmentation is important for finescale structures in biomedical images. We propose a novel TopologyAttention ConvLSTM Network (TACNet) for 3D image segmentation in order to achieve high structural accuracy for 3D segmentation tasks. Specifically, we propose a Spatial Topology-Attention (STA) module to process a 3D image as a stack of 2D image slices and adopt ConvLSTM to leverage contextual structure information from adjacent slices. In order to effectively transfer topology-critical information across slices, we propose an Iterative-Topology Attention (ITA) module that provides a more stable topology-critical map for segmentation. Quantitative and qualitative results show that our proposed method outperforms various baselines in terms of topology-aware evaluation metrics.

preprint2022arXiv

Deep Statistic Shape Model for Myocardium Segmentation

Accurate segmentation and motion estimation of myocardium have always been important in clinic field, which essentially contribute to the downstream diagnosis. However, existing methods cannot always guarantee the shape integrity for myocardium segmentation. In addition, motion estimation requires point correspondence on the myocardium region across different frames. In this paper, we propose a novel end-to-end deep statistic shape model to focus on myocardium segmentation with both shape integrity and boundary correspondence preserving. Specifically, myocardium shapes are represented by a fixed number of points, whose variations are extracted by Principal Component Analysis (PCA). Deep neural network is used to predict the transformation parameters (both affine and deformation), which are then used to warp the mean point cloud to the image domain. Furthermore, a differentiable rendering layer is introduced to incorporate mask supervision into the framework to learn more accurate point clouds. In this way, the proposed method is able to consistently produce anatomically reasonable segmentation mask without post processing. Additionally, the predicted point cloud guarantees boundary correspondence for sequential images, which contributes to the downstream tasks, such as the motion estimation of myocardium. We conduct several experiments to demonstrate the effectiveness of the proposed method on several benchmark datasets.

preprint2022arXiv

IRS-Aided Non-Orthogonal ISAC Systems: Performance Analysis and Beamforming Design

Intelligent reflecting surface (IRS) has shown its effectiveness in facilitating orthogonal time-division integrated sensing and communications (TD-ISAC), in which the sensing task and the communication task occupy orthogonal time-frequency resources, while the role of IRS in the more interesting scenarios of non-orthogonal ISAC (NO-ISAC) systems has so far remained unclear. In this paper, we consider an IRS-aided NO-ISAC system, where a distributed IRS is deployed to assist concurrent communication and location sensing for a blind-zone user, occupying non-orthogonal/overlapped time-frequency resources. We first propose a modified Cramer-Rao lower bound (CRLB) to characterize the performances of both communication and location sensing in a unified manner. We further derive the closed-form expressions of the modified CRLB in our considered NO-ISAC system, enabling us to identify the fundamental trade-off between the communication and location sensing performances. In addition, by exploiting the modified CRLB, we propose a joint active and passive beamforming design algorithm that achieves a good communication and location sensing trade-off. Through numerical results, we demonstrate the superiority of the IRS-aided NO-ISAC systems over the IRS-aided TD-ISAC systems, in terms of both communication and localization performances. Besides, it is shown that the IRS-aided NO-ISAC system with random communication signals can achieve comparable localization performance to the IRS-aided localization system with dedicated positioning reference signals. Moreover, we investigate the trade-off between communication performance and localization performance and show how the performance of the NO-ISAC system can be significantly boosted by increasing the number of the IRS elements.

preprint2022arXiv

IRS-Based Integrated Location Sensing and Communication for mmWave SIMO Systems

In this paper, we establish an integrated sensing and communication (ISAC) system based on a distributed semi-passive intelligent reflecting surface (IRS), which allows location sensing and data transmission to be carried out simultaneously, sharing the same frequency and time resources. The detailed working process of the proposed IRS-based ISAC system is designed, including the transmission protocol, location sensing and beamforming optimization. Specifically, each coherence block consists of two periods, the ISAC period with two time blocks and the pure communication (PC) period. During each time block of the ISAC period, data transmission and user positioning are carried out simultaneously. The estimated user location in the first time block will be used for beamforming design in the second time block. During the PC period, only data transmission is conducted, by invoking the user location estimated in the second time block of the ISAC period for beamforming design. {\color{black}Simulation results show that a millimeter-level positioning accuracy can be achieved by the proposed location sensing scheme, demonstrating the advantage of the proposed IRS-based ISAC framework. Besides, the proposed two beamforming schemes based on the estimated location information achieve similar performance to the benchmark schemes assuming perfect channel state information (CSI), which verifies the effectiveness of beamforming design using sensed location information.

preprint2022arXiv

Learning Topological Interactions for Multi-Class Medical Image Segmentation

Deep learning methods have achieved impressive performance for multi-class medical image segmentation. However, they are limited in their ability to encode topological interactions among different classes (e.g., containment and exclusion). These constraints naturally arise in biomedical images and can be crucial in improving segmentation quality. In this paper, we introduce a novel topological interaction module to encode the topological interactions into a deep neural network. The implementation is completely convolution-based and thus can be very efficient. This empowers us to incorporate the constraints into end-to-end training and enrich the feature representation of neural networks. The efficacy of the proposed method is validated on different types of interactions. We also demonstrate the generalizability of the method on both proprietary and public challenge datasets, in both 2D and 3D settings, as well as across different modalities such as CT and Ultrasound. Code is available at: https://github.com/TopoXLab/TopoInteraction

preprint2022arXiv

Trigger Hunting with a Topological Prior for Trojan Detection

Despite their success and popularity, deep neural networks (DNNs) are vulnerable when facing backdoor attacks. This impedes their wider adoption, especially in mission critical applications. This paper tackles the problem of Trojan detection, namely, identifying Trojaned models -- models trained with poisoned data. One popular approach is reverse engineering, i.e., recovering the triggers on a clean image by manipulating the model's prediction. One major challenge of reverse engineering approach is the enormous search space of triggers. To this end, we propose innovative priors such as diversity and topological simplicity to not only increase the chances of finding the appropriate triggers but also improve the quality of the found triggers. Moreover, by encouraging a diverse set of trigger candidates, our method can perform effectively in cases with unknown target labels. We demonstrate that these priors can significantly improve the quality of the recovered triggers, resulting in substantially improved Trojan detection accuracy as validated on both synthetic and publicly available TrojAI benchmarks.

preprint2021arXiv

Angle-Domain Intelligent Reflecting Surface Systems: Design and Analysis

This paper considers an angle-domain intelligent reflecting surface (IRS) system. We derive maximum likelihood (ML) estimators for the effective angles from the base station (BS) to the user and the effective angles of propagation from the IRS to the user. It is demonstrated that the accuracy of the estimated angles improves with the number of BS antennas. Also, deploying the IRS closer to the BS increases the accuracy of the estimated angle from the IRS to the user. Then, based on the estimated angles, we propose a joint optimization of BS beamforming and IRS beamforming, which achieves similar performance to two benchmark algorithms based on full CSI and the multiple signal classification (MUSIC) method respectively. Simulation results show that the optimized BS beam becomes more focused towards the IRS direction as the number of reflecting elements increases. Furthermore, we derive a closed-form approximation, upper bound and lower bound for the achievable rate. The analytical findings indicate that the achievable rate can be improved by increasing the number of BS antennas or reflecting elements. Specifically, the BS-user link and the BS-IRS-user link can obtain power gains of order $N$ and $NM^2$, respectively, where $N$ is the antenna number and $M$ is the number of reflecting elements.

preprint2020arXiv

Location Information Aided Multiple Intelligent Reflecting Surface Systems

This paper proposes a novel location information aided multiple intelligent reflecting surface (IRS) systems. Assuming imperfect user location information, the effective angles from the IRS to the users are estimated, which is then used to design the transmit beam and IRS beam. Furthermore, closed-form expressions for the achievable rate are derived. The analytical findings indicate that the achievable rate can be improved by increasing the number of base station (BS) antennas or reflecting elements. Specifically, a power gain of order $N M^2$ is achieved, where $N$ is the antenna number and $M$ is the number of reflecting elements. Moreover, with a large number of reflecting elements, the individual signal to interference plus noise ratio (SINR) is proportional to $M$, while becomes proportional to $M^2$ as non-line-of-sight (NLOS) paths vanish. Also, it has been shown that high location uncertainty would significantly degrade the achievable rate. Besides, IRSs should be deployed at distinct directions (relative to the BS) and be far away from each other to reduce the interference from multiple IRSs. Finally, an optimal power allocation scheme has been proposed to improve the system performance.

preprint2020arXiv

Programmable Metasurface Based Multicast Systems: Design and Analysis

This paper considers a multi-antenna multicast system with programmable metasurface (PMS) based transmitter. Taking into account of the finite-resolution phase shifts of PMSs, a novel beam training approach is proposed, which achieves comparable performance as the exhaustive beam searching method but with much lower time overhead. Then, a closed-form expression for the achievable multicast rate is presented, which is valid for arbitrary system configurations. In addition, for certain asymptotic scenario, simple approximated expressions for the multicase rate are derived. Closed-form solutions are obtained for the optimal power allocation scheme, and it is shown that equal power allocation is optimal when the pilot power or the number of reflecting elements is sufficiently large. However, it is desirable to allocate more power to weaker users when there are a large number of RF chains. The analytical findings indicate that, with large pilot power, the multicast rate is determined by the weakest user. Also, increasing the number of radio frequency (RF) chains or reflecting elements can significantly improve the multicast rate, and as the phase shift number becomes larger, the multicast rate improves first and gradually converges to a limit. Moreover, increasing the number of users would significantly degrade the multicast rate, but this rate loss can be compensated by implementing a large number of reflecting elements.

preprint2020arXiv

Robust Design for IRS-Aided Communication Systems with User Location Uncertainty

In this paper, we propose a robust design framework for IRS-aided communication systems in the presence of user location uncertainty. By jointly designing the transmit beamforming vector at the BS and phase shifts at the IRS, we aim to minimize the transmit power subject to the worse-case quality of service (QoS) constraint, i.e., ensuring the user rate is above a threshold for all possible user location error realizations. With unit-modulus, this problem is not convex. The location uncertainty in the QoS constraint further increases the difficulty of solving this problem. By utilizing techniques of Taylor expansion, S-Procedure and semidefinite relaxation (SDP), we transform this problem into a sequence of semidefinite programming (SDP) sub-problems. Simulation results show that the proposed robust algorithm substantially outperforms the non-robust algorithm proposed in the literature, in terms of probability of reaching the required QoS target.

preprint2020arXiv

Statistical CSI based Design for Intelligent Reflecting Surface Assisted MISO Systems

This paper considers an intelligent reflecting surface (IRS) aided multiple-input single-output communication system, where statistical channel state information (CSI) is exploited for transmit beamforming and IRS beamforming. A tight upper bound is derived for the ergodic capacity of the system. Based on which, the joint optimization of transmit beam and IRS beam are studied. Depending on whether a line-of-sight path exists between the access point and user, two different cases, namely, Rician fading and Rayleigh fading, are separately treated. Specifically, for the Rician fading case, an iterative algorithm is proposed, which is guaranteed to converge. For the Rayleigh fading case, closed-form designs are obtained for the transmit beam and IRS beam. Simulation results show the proposed beamforming scheme achieves similar performance as the benchmark algorithm requiring instantaneous CSI.