Researcher profile

Yuhong Liu

Yuhong Liu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

UltraLogic: Enhancing LLM Reasoning through Large-Scale Data Synthesis and Bipolar Float Reward

While Large Language Models (LLMs) have demonstrated significant potential in natural language processing , complex general-purpose reasoning requiring multi-step logic, planning, and verification remains a critical bottleneck. Although Reinforcement Learning with Verifiable Rewards (RLVR) has succeeded in specific domains , the field lacks large-scale, high-quality, and difficulty-calibrated data for general reasoning. To address this, we propose UltraLogic, a framework that decouples the logical core of a problem from its natural language expression through a Code-based Solving methodology to automate high-quality data production. The framework comprises hundreds of unique task types and an automated calibration pipeline across ten difficulty levels. Furthermore, to mitigate binary reward sparsity and the Non-negative Reward Trap, we introduce the Bipolar Float Reward (BFR) mechanism, utilizing graded penalties to effectively distinguish perfect responses from those with logical flaws. Our experiments demonstrate that task diversity is the primary driver for reasoning enhancement , and that BFR, combined with a difficulty matching strategy, significantly improves training efficiency, guiding models toward global logical optima.

preprint2022arXiv

Defending against Co-residence Attack in Energy-Efficient Cloud: An Optimization based Real-time Secure VM Allocation Strategy

Resource sharing among users serves as the foundation of cloud computing, which, however, may also cause vulnerabilities to diverse co-residence attacks launched by malicious virtual machines (VM) residing in the same physical server with the victim VMs. In this paper, we aim to defend against such co-residence attacks through a secure, workload-balanced, and energy-efficient VM allocation strategy. Specifically, we model the problem as an optimization problem by quantifying and minimizing three key factors: (1) the security risks, (2) the power consumption and (3) the unbalanced workloads among different physical servers. Furthermore, this work considers a realistic environmental setting by assuming a random number of VMs from different users arriving at random timings, which requires the optimization solution to be continuously evolving. As the optimization problem is NP-hard, we propose to first cluster VMs in time windows, and further adopt the Ant Colony Optimization (ACO) algorithm to identify the optimal allocation strategy for each time window. Comprehensive experimental results based on real world cloud traces validates the effectiveness of the proposed scheme.

preprint2022arXiv

Interference-Limited Ultra-Reliable and Low-Latency Communications: Graph Neural Networks or Stochastic Geometry?

In this paper, we aim to improve the Quality-of-Service (QoS) of Ultra-Reliability and Low-Latency Communications (URLLC) in interference-limited wireless networks. To obtain time diversity within the channel coherence time, we first put forward a random repetition scheme that randomizes the interference power. Then, we optimize the number of reserved slots and the number of repetitions for each packet to minimize the QoS violation probability, defined as the percentage of users that cannot achieve URLLC. We build a cascaded Random Edge Graph Neural Network (REGNN) to represent the repetition scheme and develop a model-free unsupervised learning method to train it. We analyze the QoS violation probability using stochastic geometry in a symmetric scenario and apply a model-based Exhaustive Search (ES) method to find the optimal solution. Simulation results show that in the symmetric scenario, the QoS violation probabilities achieved by the model-free learning method and the model-based ES method are nearly the same. In more general scenarios, the cascaded REGNN generalizes very well in wireless networks with different scales, network topologies, cell densities, and frequency reuse factors. It outperforms the model-based ES method in the presence of the model mismatch.

preprint2022arXiv

Secure IoT Routing: Selective Forwarding Attacks and Trust-based Defenses in RPL Network

IPv6 Routing Protocol for Low Power and Lossy Networks (RPL) is an essential routing protocol to enable communications for IoT networks with low power devices. RPL uses an objective function and routing constraints to find an optimized routing path for each node in the network. However, recent research has shown that topological attacks, such as selective forwarding attacks, pose great challenges to the secure routing of IoT networks. Many conventional secure routing solutions, on the other hand, are computationally heavy to be directly applied in resource-constrained IoT networks. There is an urgent need to develop lightweight secure routing solutions for IoT networks. In this paper, we first design and implement a series of advanced selective forwarding attacks from the attack perspective, which can flexibly select the type and percentage of forwarding packets in an energy efficient way, and even bad-mouth other innocent nodes in the network. Experiment results show that the proposed attacks can maximize the attack consequences (i.e. number of dropped packets) while maintaining undetected. Moreover, we propose a lightweight trust-based defense solution to detect and eliminate malicious selective forwarding nodes from the network. The results show that the proposed defense solution can achieve high detection accuracy with very limited extra energy usage (i.e. 3.4%).

preprint2020arXiv

Accessible precisions for estimating two conjugate parameters using Gaussian probes

We analyse the precision limits for simultaneous estimation of a pair of conjugate parameters in a displacement channel using Gaussian probes. Having a set of squeezed states as an initial resource, we compute the Holevo Cramér-Rao bound to investigate the best achievable estimation precisions if only passive linear operations are allowed to be performed on the resource prior to probing the channel. The analysis reveals the optimal measurement scheme and allows us to quantify the best precision for one parameter when the precision of the second conjugate parameter is fixed. To estimate the conjugate parameter pair with equal precision, our analysis shows that the optimal probe is obtained by combining two squeezed states with orthogonal squeezing quadratures on a 50:50 beam splitter. If different importance are attached to each parameter, then the optimal mixing ratio is no longer 50:50. Instead it follows a simple function of the available squeezing and the relative importance between the two parameters.

preprint2020arXiv

Direct temporal mode measurement for the characterization of temporally multiplexed high dimensional quantum entanglement in continuous variables

Field-orthogonal temporal mode analysis of optical fields is recently developed for a new framework of quantum information science. But so far, the exact profiles of the temporal modes are not known, which makes it difficult to achieve mode selection and de-multiplexing. Here, we report a novel method that measures directly the exact form of the temporal modes. This in turn enables us to make mode-orthogonal homodyne detection with mode-matched local oscillators. We apply the method to a pulse-pumped, specially engineered fiber parametric amplifier and demonstrate temporally multiplexed multi-dimensional quantum entanglement of continuous variables in telecom wavelength. The temporal mode characterization technique can be generalized to other pulse-excited systems to find their eigen modes for multiplexing in temporal domain.

preprint2020arXiv

Quantum state engineering by nonlinear quantum interference

Multi-photon quantum interference is the underlying principle for optical quantum information processing protocols. Indistinguishability is the key to quantum interference. Therefore, the success of many protocols in optical quantum information processing relies on the availability of photon states with a well-defined spatial and temporal mode. Photons in single spatial mode can be obtained from nonlinear processes in single-mode waveguides. For the temporal mode, the common approach is to engineer the nonlinear processes. But it is complicated because the spectral properties and the nonlinear interaction are often intertwined through phase matching condition. In this paper, we study a different approach which is based on an SU(1,1) nonlinear interferometer with a pulsed pump and a controllable linear spectral phase shift for precise engineering. We systematically analyze the important figures of merit such as modal purity and heralding efficiency to investigate the feasibility of this approach. Specifically, we analyze in detail the requirement on the spectral phase engineering to optimize the figures of merit and apply numerical simulations to a fiber system. Both modal purity and efficiency are improved simultaneously. Furthermore, a novel multi-stage nonlinear interferometer is proposed and shown to achieve more precise state engineering for near-ideal single-mode operation and near-unity efficiency. We also extend the study to the case of high gain in the four-wave mixing process for the spectral engineering of quantum entanglement in continuous variables. Our investigation provides a new approach for precisely tailoring the spectral property of quantum light sources, especially, photon pairs can be engineered to simultaneously possess the features of high purity, high collection efficiency, high brightness, and high flexibility in wavelength and bandwidth selection.

preprint2019arXiv

Measuring the continuous variable quantum entanglement with a parametric amplifier assisted homodyne detection

Traditional method for measuring continuous-variable quantum entanglement relies on balanced homodyne detections, which are sensitive to vacuum quantum noise coupled in through losses resulted from many factors such as detector's quantum efficiency and mode mismatching between detected field and local oscillator. In this paper, we propose and analyze a new measurement method, which is realized by assisting the balanced homodyne detections with a high gain phase sensitive parametric amplifier. The employment of the high gain parametric amplifier helps to tackle the vacuum quantum noise originated from detection losses. Moreover, because the high gain parametric amplifier can couple two fields of different types in a phase sensitive manner, the proposed scheme can be used to reveal quantum entanglement between two fields of different types by using only one balanced homodyne detection. Furthermore, detailed analysis shows that in the multi-mode case, the proposed scheme is also advantageous over the traditional method. Such a new measurement method should find wide applications in quantum information and quantum metrology involving measurement of continuous variables.