Source author record

Zhenhua Liu

Zhenhua Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision math.DG Networking and Internet Architecture Artificial Intelligence Computational Engineering, Finance, and Science Distributed, Parallel, and Cluster Computing eess.IV math.OC Operating Systems Robotics Systems and Control

Catalog footprint

What is connected

10works

11topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

TaskGround: Structured Executable Task Inference for Full-Scene Household Reasoning

In real home deployments, household agents must often operate from a complete household scene and a situated household request, rather than from a clean task specification. Such requests require agents to identify task-relevant entities, recover intended task conditions, and resolve ordering constraints from the surrounding scene context. We formalize this capability as full-scene household reasoning: given a complete household scene and a situated household request, an agent must infer executable task structure before producing a grounded skill-level action sequence. This setting is challenging because complete household scenes contain substantial task-irrelevant information, making direct complete-scene prompting inefficient and error-prone. In practical deployment, this challenge is further amplified by privacy and local compute constraints, which favor compact open-weight models with limited long-context reasoning ability. We propose TaskGround, a training-free and model-agnostic Ground-Infer-Execute framework that grounds complete scenes into compact task-relevant scene slices, infers executable task structure, and compiles it into grounded skill-level action sequences. To evaluate this setting, we introduce FullHome, a human-validated evaluation suite of 400 household tasks spanning diverse home-scale environments and both goal-oriented and process-constrained requirements. On FullHome, TaskGround improves task success rates by large margins across both proprietary and open-weight models. Notably, it makes Qwen3.5-9B competitive with GPT-5 under direct complete-scene prompting while reducing total input-token cost by up to 18x. Our results identify executable task-structure inference as a central bottleneck in full-scene household reasoning and show that structured grounding can make compact local models substantially more effective for practical household deployment.

preprint2025arXiv

On a conjecture of Almgren II: area-minimizing submanifolds with fractal singular sets on almost any manifold

This paper is the second in a two-part solution to Almgren's conjecture on the existence of area-minimizing submanifolds with fractal singular sets. In part one, we construct area-minimizing submanifolds with fractal singular sets on certain special manifolds. Here we continue our work and show that area-minimizing submanifolds with fractal singular sets exist on almost any smooth manifold.

preprint2022arXiv

GhostSR: Learning Ghost Features for Efficient Image Super-Resolution

Modern single image super-resolution (SISR) system based on convolutional neural networks (CNNs) achieves fancy performance while requires huge computational costs. The problem on feature redundancy is well studied in visual recognition task, but rarely discussed in SISR. Based on the observation that many features in SISR models are also similar to each other, we propose to use shift operation to generate the redundant features (i.e., ghost features). Compared with depth-wise convolution which is time-consuming on GPU-like devices, shift operation can bring a practical inference acceleration for CNNs on common hardwares. We analyze the benefits of shift operation on SISR task and make the shift orientation learnable based on Gumbel-Softmax trick. Besides, a clustering procedure is explored based on pre-trained models to identify the intrinsic filters for generating intrinsic features. The ghost features will be derived by moving these intrinsic features along a specific orientation. Finally, the complete output features are constructed by concatenating the intrinsic and ghost features together. Extensive experiments on several benchmark models and datasets demonstrate that both the non-compact and lightweight SISR models embedded with the proposed method can achieve a comparable performance to that of their baselines with a large reduction of parameters, FLOPs and GPU inference latency. For instance, we reduce the parameters by 46%, FLOPs by 46% and GPU inference latency by 42% of $\times2$ EDSR network with basically lossless performance.

preprint2022arXiv

Multi-Entanglement Routing Design over Quantum Networks

Quantum networks are considered as a promising future platform for quantum information exchange and quantum applications, which have capabilities far beyond the traditional communication networks. Remote quantum entanglement is an essential component of a quantum network. How to efficiently design a multi-routing entanglement protocol is a fundamental yet challenging problem. In this paper, we study a quantum entanglement routing problem to simultaneously maximize the number of quantum-user pairs and their expected throughput. Our approach is to formulate the problem as two sequential integer programming steps. We propose efficient entanglement routing algorithms for the two integer programming steps and analyze their time complexity and performance bounds. Results of evaluation highlight that our approach outperforms existing solutions in both served quantum-user pairs numbers and the network expected throughput.

preprint2022arXiv

P-STMO: Pre-Trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation

This paper introduces a novel Pre-trained Spatial Temporal Many-to-One (P-STMO) model for 2D-to-3D human pose estimation task. To reduce the difficulty of capturing spatial and temporal information, we divide this task into two stages: pre-training (Stage I) and fine-tuning (Stage II). In Stage I, a self-supervised pre-training sub-task, termed masked pose modeling, is proposed. The human joints in the input sequence are randomly masked in both spatial and temporal domains. A general form of denoising auto-encoder is exploited to recover the original 2D poses and the encoder is capable of capturing spatial and temporal dependencies in this way. In Stage II, the pre-trained encoder is loaded to STMO model and fine-tuned. The encoder is followed by a many-to-one frame aggregator to predict the 3D pose in the current frame. Especially, an MLP block is utilized as the spatial feature extractor in STMO, which yields better performance than other methods. In addition, a temporal downsampling strategy is proposed to diminish data redundancy. Extensive experiments on two benchmarks show that our method outperforms state-of-the-art methods with fewer parameters and less computational overhead. For example, our P-STMO model achieves 42.1mm MPJPE on Human3.6M dataset when using 2D poses from CPN as inputs. Meanwhile, it brings a 1.5-7.1 times speedup to state-of-the-art methods. Code is available at https://github.com/paTRICK-swk/P-STMO.

preprint2020arXiv

Flex: Closing the Gaps between Usage and Allocation

Data centers are giant factories of Internet data and services. Worldwide data centers consume energy and emit emissions more than airline industry. Unfortunately, most of data centers are significantly underutilized. One of the major reasons is the big gaps between the real usage and the provisioned resources because users tend to over-estimate their demand and data center operators often rely on users' requests for resource allocation. In this paper, we first conduct an in-depth analysis of a Google cluster trace to unveil the root causes for low utilization and highlight the great potential to improve it. We then developed an online resource manager Flex to maximize the cluster utilization while satisfying the Quality of Service (QoS). Large-scale evaluations based on real-world traces show that Flex admits up to 1.74x more requests and 1.6x higher utilization compared to tradition schedulers while maintaining the QoS.

preprint2020arXiv

The Existence of Embedded $G$-Invariant Minimal Hypersurface

For a compact connected Lie group $G$ acting as isometries on a compact orientable Riemannian manifold $M^{n+1},$ and cohomogeneity not equal to 0 or 2, we prove the existence of a nontrivial embedded $G$-invariant minimal hypersurface, that is smooth outside a set of Hausdorff dimension at most $n-7.

preprint2020arXiv

Weak Counterexamples to $L^2$ Curvature Estimates for Minimizing Surfaces

We construct a sequence of smooth minimizing surfaces in a sequence of metrics converging to the standard Euclidean metric, so that they have diverging $L^2$ norm of second fundamental form.

preprint2016arXiv

Optimizing Energy Storage Participation in Emerging Power Markets

The growing amount of intermittent renewables in power generation creates challenges for real-time matching of supply and demand in the power grid. Emerging ancillary power markets provide new incentives to consumers (e.g., electrical vehicles, data centers, and others) to perform demand response to help stabilize the electricity grid. A promising class of potential demand response providers includes energy storage systems (ESSs). This paper evaluates the benefits of using various types of novel ESS technologies for a variety of emerging smart grid demand response programs, such as regulation services reserves (RSRs), contingency reserves, and peak shaving. We model, formulate and solve optimization problems to maximize the net profit of ESSs in providing each demand response. Our solution selects the optimal power and energy capacities of the ESS, determines the optimal reserve value to provide as well as the ESS real-time operational policy for program participation. Our results highlight that applying ultra-capacitors and flywheels in RSR has the potential to be up to 30 times more profitable than using common battery technologies such as LI and LA batteries for peak shaving.

preprint2015arXiv

Cyber physical modeling of distributed resources for distribution system operations

Co-simulation platforms are necessary to study the interactions of complex systems integrated in future smart grids. The Virtual Grid Integration Laboratory (VirGIL) is a modular co-simulation platform designed to study interactions between demand response strategies, building comfort, communication networks, and power system operation. This paper presents the coupling of power systems, buildings, communications and control under a master algorithm. There are two objectives. First, to use a modular architecture for VirGIL, based on the Functional Mock-up Interface (FMI), where several different modules can be added, exchanged, and tested. Second, to use a commercial power system simulation platform, familiar to power system operators, such as DIgSILENT Powerfactory. This will help reduce the barriers to the industry for adopting such platforms, investigate and subsequently deploy demand response strategies in their daily operation. VirGIL further introduces the integration of the Quantized State System (QSS) methods for simulation in this co-simulation platform. Results on how these systems interact using a real network and consumption data are also presented.

Zhenhua Liu

What is connected

Connect this record

See the researcher in context

Building this map preview

10 published item(s)

TaskGround: Structured Executable Task Inference for Full-Scene Household Reasoning

On a conjecture of Almgren II: area-minimizing submanifolds with fractal singular sets on almost any manifold

GhostSR: Learning Ghost Features for Efficient Image Super-Resolution

Multi-Entanglement Routing Design over Quantum Networks

P-STMO: Pre-Trained Spatial Temporal Many-to-One Model for 3D Human Pose Estimation

Flex: Closing the Gaps between Usage and Allocation

The Existence of Embedded $G$-Invariant Minimal Hypersurface

Weak Counterexamples to $L^2$ Curvature Estimates for Minimizing Surfaces

Optimizing Energy Storage Participation in Emerging Power Markets

Cyber physical modeling of distributed resources for distribution system operations