Researcher profile

Hongbo Zhu

Hongbo Zhu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
18works
0followers
14topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

18 published item(s)

preprint2026arXiv

FastStair: Learning to Run Up Stairs with Humanoid Robots

Running up stairs is effortless for humans but remains extremely challenging for humanoid robots due to the simultaneous requirements of high agility and strict stability. Model-free reinforcement learning (RL) can generate dynamic locomotion, yet implicit stability rewards and heavy reliance on task-specific reward shaping tend to result in unsafe behaviors, especially on stairs; conversely, model-based foothold planners encode contact feasibility and stability structure, but enforcing their hard constraints often induces conservative motion that limits speed. We present FastStair, a planner-guided, multi-stage learning framework that reconciles these complementary strengths to achieve fast and stable stair ascent. FastStair integrates a parallel model-based foothold planner into the RL training loop to bias exploration toward dynamically feasible contacts and to pretrain a safety-focused base policy. To mitigate planner-induced conservatism and the discrepancy between low- and high-speed action distributions, the base policy was fine-tuned into speed-specialized experts and then integrated via Low-Rank Adaptation (LoRA) to enable smooth operation across the full commanded-speed range. We deploy the resulting controller on the Oli humanoid robot, achieving stable stair ascent at commanded speeds up to 1.65 m/s and traversing a 33-step spiral staircase (17 cm rise per step) in 12 s, demonstrating robust high-speed performance on long staircases. Notably, the proposed approach served as the champion solution in the Canton Tower Robot Run Up Competition.

preprint2022arXiv

Bond-Selective Intensity Diffraction Tomography

Recovering molecular information remains a grand challenge in the widely used holographic and computational imaging technologies. To address this challenge, we developed a computational mid-infrared photothermal microscope, termed Bond-selective Intensity Diffraction Tomography (BS-IDT). Based on a low-cost brightfield microscope with an add-on pulsed light source, BS-IDT recovers both infrared spectra and bond-selective 3D refractive index maps from intensity-only measurements. High-fidelity infrared fingerprint spectra extraction is validated. Volumetric chemical imaging of biological cells is demonstrated at a speed of ~20 seconds per volume, with a lateral and axial resolution of ~350 nm and ~1.1 micron, respectively. BS-IDT's application potential is investigated by chemically quantifying lipids stored in cancer cells and volumetric chemical imaging on Caenorhabditis elegans with a large field of view (~100 micron X 100 micron).

preprint2022arXiv

Energy Efficiency and Delay Tradeoff in an MEC-Enabled Mobile IoT Network

Mobile Edge Computing (MEC) has recently emerged as a promising technology in the 5G era. It is deemed an effective paradigm to support computation-intensive and delay critical applications even at energy-constrained and computation-limited Internet of Things (IoT) devices. To effectively exploit the performance benefits enabled by MEC, it is imperative to jointly allocate radio and computational resources by considering non-stationary computation demands, user mobility, and wireless fading channels. This paper aims to study the tradeoff between energy efficiency (EE) and service delay for multi-user multi-server MEC-enabled IoT systems when provisioning offloading services in a user mobility scenario. Particularly, we formulate a stochastic optimization problem with the objective of minimizing the long-term average network EE with the constraints of the task queue stability, peak transmit power, maximum CPU-cycle frequency, and maximum user number. To tackle the problem, we propose an online offloading and resource allocation algorithm by transforming the original problem into several individual subproblems in each time slot based on Lyapunov optimization theory, which are then solved by convex decomposition and submodular methods. Theoretical analysis proves that the proposed algorithm can achieve a $[O(1/V), O(V)]$ tradeoff between EE and service delay. Simulation results verify the theoretical analysis and demonstrate our proposed algorithm can offer much better EE-delay performance in task offloading challenges, compared to several baselines.

preprint2022arXiv

Polytopic Planar Region Characterization of Rough Terrains for Legged Locomotion

This paper studies the problem of constructing polytopic representations of planar regions from depth camera readings. This problem is of great importance for terrain mapping in complicated environment and has great potentials in legged locomotion applications. To address the polytopic planar region characterization problem, we propose a two-stage solution scheme. At the first stage, the planar regions embedded within a sequence of depth images are extracted individually first and then merged to establish a terrain map containing only planar regions in a selected frame. To simplify the representations of the planar regions that are applicable to foothold planning for legged robots, we further approximate the extracted planar regions via low-dimensional polytopes at the second stage. With the polytopic representation, the proposed approach achieves a great balance between accuracy and simplicity. Experimental validations with RGB-D cameras are conducted to demonstrate the performance of the proposed scheme. The proposed scheme successfully characterizes the planar regions via polytopes with acceptable accuracy. More importantly, the run time of the overall perception scheme is less than 10ms (i.e., > 100Hz) throughout the tests, which strongly illustrates the advantages of our approach developed in this paper.

preprint2022arXiv

Reynolds number effects on the bistable flows over a wavy circular cylinder

The wake of wavy cylinder has been shown to exhibit bistability. Depending on the initial condition, the final state of the wake can either develop into a steady flow (state I), or periodic shedding (state II). In this paper, we perform direct numerical simulations to reveal the Reynolds number effects on these two wake states. With increasing Reynolds number, the steady vortical structures in state I wake sways back and forth in the spanwise direction, resulting in low-frequency fluctuations in drag forces, but not in lift. For state II, the increase in Reynolds number is associated with the emergence of another spectral peak in the lift coefficient. The secondary frequency is associated with highly three-dimensional vortical structures in the wake. For both states, the wakes transition to turblent flows at higher Reynolds numbers, with the development of small-scale vortices. We further study the streamwise gust flows over the wavy cylinder. The time-varying inflow velocity results in a wide range of instantaneous Reynolds number spanning from the absolutely unstable flow regime to the bistable regime. Depending on the period of the inflow velocity variation, the wake perturbations grown at the absolutely unstable flow regime can be damped out in state I wake, or grow large enough to trigger the transition state II, resulting in loss of flow control efficacy. The above analyses reveal novel flow physics of the bistable states at unexplored Reynolds numbers, and showcase the complex transition behavior between the two states in unsteady flows. The insights gained from this study improve the understanding of the wake dynamics of the wavy cylinder.

preprint2020arXiv

Model-Driven Beamforming Neural Networks

Beamforming is evidently a core technology in recent generations of mobile communication networks. Nevertheless, an iterative process is typically required to optimize the parameters, making it ill-placed for real-time implementation due to high complexity and computational delay. Heuristic solutions such as zero-forcing (ZF) are simpler but at the expense of performance loss. Alternatively, deep learning (DL) is well understood to be a generalizing technique that can deliver promising results for a wide range of applications at much lower complexity if it is sufficiently trained. As a consequence, DL may present itself as an attractive solution to beamforming. To exploit DL, this article introduces general data- and model-driven beamforming neural networks (BNNs), presents various possible learning strategies, and also discusses complexity reduction for the DL-based BNNs. We also offer enhancement methods such as training-set augmentation and transfer learning in order to improve the generality of BNNs, accompanied by computer simulation results and testbed results showing the performance of such BNN solutions.

preprint2020arXiv

Multi-Armed Bandit Based Client Scheduling for Federated Learning

By exploiting the computing power and local data of distributed clients, federated learning (FL) features ubiquitous properties such as reduction of communication overhead and preserving data privacy. In each communication round of FL, the clients update local models based on their own data and upload their local updates via wireless channels. However, latency caused by hundreds to thousands of communication rounds remains a bottleneck in FL. To minimize the training latency, this work provides a multi-armed bandit-based framework for online client scheduling (CS) in FL without knowing wireless channel state information and statistical characteristics of clients. Firstly, we propose a CS algorithm based on the upper confidence bound policy (CS-UCB) for ideal scenarios where local datasets of clients are independent and identically distributed (i.i.d.) and balanced. An upper bound of the expected performance regret of the proposed CS-UCB algorithm is provided, which indicates that the regret grows logarithmically over communication rounds. Then, to address non-ideal scenarios with non-i.i.d. and unbalanced properties of local datasets and varying availability of clients, we further propose a CS algorithm based on the UCB policy and virtual queue technique (CS-UCB-Q). An upper bound is also derived, which shows that the expected performance regret of the proposed CS-UCB-Q algorithm can have a sub-linear growth over communication rounds under certain conditions. Besides, the convergence performance of FL training is also analyzed. Finally, simulation results validate the efficiency of the proposed algorithms.

preprint2020arXiv

The ABC130 barrel module prototyping programme for the ATLAS strip tracker

For the Phase-II Upgrade of the ATLAS Detector, its Inner Detector, consisting of silicon pixel, silicon strip and transition radiation sub-detectors, will be replaced with an all new 100 % silicon tracker, composed of a pixel tracker at inner radii and a strip tracker at outer radii. The future ATLAS strip tracker will include 11,000 silicon sensor modules in the central region (barrel) and 7,000 modules in the forward region (end-caps), which are foreseen to be constructed over a period of 3.5 years. The construction of each module consists of a series of assembly and quality control steps, which were engineered to be identical for all production sites. In order to develop the tooling and procedures for assembly and testing of these modules, two series of major prototyping programs were conducted: an early program using readout chips designed using a 250 nm fabrication process (ABCN-25) and a subsequent program using a follow-up chip set made using 130 nm processing (ABC130 and HCC130 chips). This second generation of readout chips was used for an extensive prototyping program that produced around 100 barrel-type modules and contributed significantly to the development of the final module layout. This paper gives an overview of the components used in ABC130 barrel modules, their assembly procedure and findings resulting from their tests.

preprint2019arXiv

Deep Learning Cell Imaging through Anderson Localizing Optical Fibre

We demonstrate a deep-learning-based fibre imaging system which can transfer real-time artifact-free cell images through a meter-long Anderson localizing optical fibre. The cell samples are illuminated by an incoherent LED light source. A deep convolutional neural network is applied to the image reconstruction process. The network training uses data generated by a set-up with straight fibre at room temperature (~20 °C) but can be utilized directly for high fidelity reconstruction of cell images that are transported through fibre with a few degrees bend and/or fibre with segments heated up to 50 °C. In addition, cell images located several millimeters away from the bare fibre end can be transported and recovered successfully without the assistance of any distal optics. We further evidence that the trained neural network is able to reconstruct the images of cells which are never used in the training process and feature very different morphology.

preprint2016arXiv

Design and Implementation of a TDD-Based 128-Antenna Massive MIMO Prototyping System

Spurred by the dramatic mobile IP growth and the emerging Internet of Things (IoT) and cloud-based applications, wireless networking is witnessing a paradigm shift. By fully exploiting the spatial degrees of freedom, the massive multipleinput- multiple-output (MIMO) technology promises significant gains in both data rates and link reliability. This paper presents a time-division duplex (TDD)-based 128-antenna massive MIMO prototyping system designed to operate on a 20 MHz bandwidth. Up to twelve single-antenna users can be served by the designed system at the same time. System model is provided and link-level simulation corresponding to our practical TDDbased massive MIMO prototyping system is conducted to validate our design and performance of the algorithms. Based on the system hardware design demonstrated in this paper, both uplink real-time video and downlink data transmissions are realized, and the experiment results show that 268.8 Mbps rate was achieved for eight single-antenna users using QPSK modulation. The maximum spectral efficiency of the designed system will be 80.64 bit/s/Hz by twelve single-antenna users with 256-QAM modulation.

preprint2016arXiv

Full duplex communication using visible light

In this work, we propose, fabricate and characterize a full duplex communication system using visible light on a single chip. Both the suspended p-n junction InGaN/GaN multiple quantum well (MQW) devices and the suspended waveguides are obtained on a GaN-on-silicon platform by wafer-level processing. Two suspended p-n junction InGaN/GaN MQW devices that can both emit and detect light simultaneously are connected using suspended waveguides to form an in-plane visible light communication (VLC) system. The light that is emitted from one suspended p-n junction InGaN/GaN MQW device can induce a current in the device located at the other end of the waveguide via in-plane light coupling, thus leading to full duplex communication using visible light. This proof-of-concept in-plane VLC system paves the way towards the implementation of a full duplex communications system operating at the same frequency using visible light on a single chip.

preprint2016arXiv

Semi-coherent Detection and Performance Analysis for Ambient Backscatter System

We study a novel communication mechanism, ambient backscatter, that utilizes radio frequency (RF) signals transmitted from an ambient source as both energy supply and information carrier to enable communications between low-power devices. Different from existing non-coherent schemes, we here design the semi-coherent detection, where channel parameters can be obtained from unknown data symbols and a few pilot symbols. We first derive the optimal detector for the complex Gaussian ambient RF signal from likelihood ratio test and compute the corresponding closed-form bit error rate (BER). To release the requirement for prior knowledge of the ambient RF signal, we next design a suboptimal energy detector with ambient RF signals being either the complex Gaussian or the phase shift keying (PSK). The corresponding detection thresholds, the analytical BER, and the outage probability are also obtained in closed-form. Interestingly, the complex Gaussian source would cause an error floor while the PSK source does not, which brings nontrivial indication of constellation design as opposed to the popular Gaussian-embedded literatures. Simulations are provided to corroborate the theoretical studies.

preprint2016arXiv

Wireless Information and Power Transfer Design for Energy Cooperation Distributed Antenna Systems

Distributed antenna systems (DAS) have been widely implemented in state-of-the-art cellular communication systems to cover dead spots. Recent studies have also indicated that DAS have advantages in wireless energy transfer (WET). In this paper, we study simultaneous wireless information and power transfer (SWIPT) for a multiple-input single-output (MISO) DAS in the downlink which consists of arbitrarily distributed remote antenna units (RAUs). In order to save the energy cost, we adopt energy cooperation of energy harvesting (EH) and two-way energy flows to let the RAUs trade their harvested energy through the smart grid network. Under individual EH constraints, per-RAU power constraints and various smart grid considerations, we investigate a power management strategy that determines how to utilize the stochastically spatially distributed harvested energy at the RAUs and how to trade the energy with the smart grid simultaneously to supply maximum wireless information transfer (WIT) with a minimum WET constraint for a receiver adopting power splitting (PS). Our analysis shows that the optimal design can be achieved in two steps. The first step is to maximize a new objective that can simultaneously maximize both WET and WIT, considering both the smart grid profitable and smart grid neutral cases. For the grid-profitable case, we derive the optimal full power strategy and provide a closed-form result to see under what condition this strategy is used. On the other hand, for the grid-neutral case, we illustrate that the optimal power policy has a double-threshold structure and present an optimal allocation strategy. The second step is then to solve the whole problem by obtaining the splitting power ratio based on the minimum WET constraint. Simulation results are provided to evaluate the performance under various settings and characterize the double-threshold structure.

preprint2015arXiv

Monolithic photonic integration of suspended light emitting diode, waveguide and photodetector

We report here a monolithic photonic integration of light emitting diode (LED) with waveguide and photodetector to build a highly-integrated photonic system to perform functionalities on the GaN-on-silicon platform. Suspended p-n junction InGaN/GaN multiple quantum wells (MQWs) are used for device fabrication. Part of the LED emission is coupled into suspended waveguide and then, the guided light laterally propagates along the waveguide and is finally sensed by the photodetector. Planar optical communication experimentally demonstrates that the proof-of-concept monolithic photonic integration system can achieve the on-chip optical interconnects. This work paves the way towards novel active electro-optical sensing system and planar optical communication in the visible range.

preprint2015arXiv

Outage Balancing in Downlink Non-Orthogonal Multiple Access With Statistical Channel State Information

This paper considers a downlink non-orthogonal multiple access (NOMA) system where the source intends to transmit independent information to the users at targeted data rates under statistical channel state information at the transmitter. The problem of outage balancing among the users is studied with the issues of power allocation, decoding order selection, and user grouping being taken into account. Specifically, with regard to the max-min fairness criterion, we derive the optimal power allocation in closed-form and prove the corresponding optimal decoding order for the elementary downlink NOMA system. By assigning a weighting factor for each user, the analytical results can be used to evaluate the outage performance of the downlink NOMA system under various fairness constraints. Further, we investigate the case with user grouping, in which each user group can be treated as an elementary downlink NOMA system. The associated problems of power and resource allocation among different user groups are solved. The implementation complexity issue of NOMA is also considered with focus on that caused by successive interference cancellation and user grouping. The complexity and performance tradeoff is analyzed by simulations, which provides fruitful insights for the practical application of NOMA. The simulation results substantiate our analysis and show considerable performance gain of NOMA when compared with orthogonal multiple access.

preprint2015arXiv

Power Allocation Schemes for Multicell Massive MIMO Systems

This paper investigates the sum-rate gains brought by power allocation strategies in multicell massive multipleinput multiple-output systems, assuming time-division duplex transmission. For both uplink and downlink, we derive tractable expressions for the achievable rate with zero-forcing receivers and precoders respectively. To avoid high complexity joint optimization across the network, we propose a scheduling mechanism for power allocation, where in a single time slot, only cells that do not interfere with each other adjust their transmit powers. Based on this, corresponding transmit power allocation strategies are derived, aimed at maximizing the sum rate per-cell. These schemes are shown to bring considerable gains over equal power allocation for practical antenna configurations (e.g., up to a few hundred). However, with fixed number of users (N), these gains diminish as M turns to infinity, and equal power allocation becomes optimal. A different conclusion is drawn for the case where both M and N grow large together, in which case: (i) improved rates are achieved as M grows with fixed M/N ratio, and (ii) the relative gains over the equal power allocation diminish as M/N grows. Moreover, we also provide applicable values of M/N under an acceptable power allocation gain threshold, which can be used as to determine when the proposed power allocation schemes yield appreciable gains, and when they do not. From the network point of view, the proposed scheduling approach can achieve almost the same performance as the joint power allocation after one scheduling round, with much reduced complexity.

preprint2015arXiv

Study of the Beamstrahlung Effects at the CEPC

The discovery of a 125 GeV Higgs boson at the LHC marked a breakthrough in particle physics. The relative lightness of the new particle inspires the consideration of a high luminosity Circular Electron Positron Collider (CEPC) as a Higgs Factory to study the Higgs boson in a clean environment. At the CEPC, the beamstrahlung might represent one of the most important sources of beam-induced backgrounds that will impact the detector. It will introduce additional backgrounds to the CEPC detector through the subsequent electron-positron pair production and the hadronic process. Therefore its impacts should be carefully evaluated. In this paper, the beamstrahlung-induced backgrounds are first estimated with analytical methods and are further evaluated in detail with Monte Carlo simulation. The detector occupancy due to the beamstrahlung at the location where the first vertex detector layer may be placed is found to be well below 0.5%. Radiation levels characterised as non-ionising energy loss (NIEL) and total ionising dose (TID) are estimated to be ~ $10^{11} 1 $ MeV $ n_{eq}/cm^2/$yr and ~ 300 kRad/yr, respectively.

preprint2014arXiv

Power Scaling of Uplink Massive MIMO Systems with Arbitrary-Rank Channel Means

This paper investigates the uplink achievable rates of massive multiple-input multiple-output (MIMO) antenna systems in Ricean fading channels, using maximal-ratio combining (MRC) and zero-forcing (ZF) receivers, assuming perfect and imperfect channel state information (CSI). In contrast to previous relevant works, the fast fading MIMO channel matrix is assumed to have an arbitrary-rank deterministic component as well as a Rayleigh-distributed random component. We derive tractable expressions for the achievable uplink rate in the large-antenna limit, along with approximating results that hold for any finite number of antennas. Based on these analytical results, we obtain the scaling law that the users' transmit power should satisfy, while maintaining a desirable quality of service. In particular, it is found that regardless of the Ricean $K$-factor, in the case of perfect CSI, the approximations converge to the same constant value as the exact results, as the number of base station antennas, $M$, grows large, while the transmit power of each user can be scaled down proportionally to $1/M$. If CSI is estimated with uncertainty, the same result holds true but only when the Ricean $K$-factor is non-zero. Otherwise, if the channel experiences Rayleigh fading, we can only cut the transmit power of each user proportionally to $1/\sqrt M$. In addition, we show that with an increasing Ricean $K$-factor, the uplink rates will converge to fixed values for both MRC and ZF receivers.