Source author record

Yuhang Liu

Yuhang Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Computer Vision, cond-mat.mes-hall, cond-mat.mtrl-sci, Machine Learning, Artificial Intelligence, eess.IV, math.DG, math.OC, Multimedia, Performance, physics.app-ph, physics.soc-ph, Robotics

Catalog footprint

What is connected

11 works

13 topics

4 close collaborators

preprint, 2026, arXiv

S2Aligner: Pair-Efficient and Transferable Pre-Training for Sparse Text-Attributed Graphs

Pre-training on text-attributed graphs (TAGs) is central to building transferable graph foundation models, where LLM-as-Aligner methods align graph and text representations through the semantic knowledge of large language models. However, these methods usually assume that node texts provide sufficient and reliable supervision, an assumption often violated in real-world sparse TAGs. When textual anchors are missing, noisy, or uneven across domains, graph structures must be aligned with weak semantic evidence, leading to unreliable structure-semantics correspondence and sparsity-induced transfer bias. This paper presents S2Aligner, a sparsity-aware and structure-enhanced LLM-as-Aligner framework for graph-text pre-training on sparse TAGs. The key idea is to decouple semantic alignment from structural modeling, allowing topology-aware signals to enhance alignment without contaminating the shared semantic space. Specifically, S2Aligner decomposes graph-text representations into semantic and structural components, uses structure-oriented reconstruction with consistency control to inject reliable topology cues into text representations, and suppresses inconsistent structural signals under textual sparsity. Moreover, S2Aligner introduces sparsity-aware cross-domain risk balancing, which calibrates domain risks through a global-domain density ratio and downweights unreliable sparse samples via graph reliability estimation. Theoretical analysis shows that this objective reduces cross-domain generalization gaps by controlling domain risk discrepancy. Extensive experiments across diverse graph domains, sparsity levels, and downstream tasks demonstrate that S2Aligner consistently outperforms existing baselines.

preprint, 2026, arXiv

What Makes a Representation Good for Single-Cell Perturbation Prediction?

Single-cell perturbation modeling is fundamental for understanding and predicting cellular responses to genetic perturbations. However, existing approaches, from causal representation learning to foundation models, often struggle with an overlooked challenge: gene expression is dominated by perturbation-invariant information, while perturbation-specific signals are intrinsically sparse. As a result, learned representations either entangle invariant and perturbation-specific information, leading to spurious and non-generalizable predictors, or suppress perturbation-specific signals altogether, rendering them ineffective for prediction. To address this, we propose PerturbedVAE, a general framework designed to resolve this signal imbalance. The framework explicitly separates perturbation-specific information from dominant invariant structure and recovers causal representations to effectively utilize such information for prediction. We further provide an identifiability analysis that characterizes the conditions under which sparse perturbation effects can be reliably recovered, thereby clarifying how the framework can be concretely specified under such conditions. Empirically, PerturbedVAE achieves state-of-the-art performance on a widely used benchmark across multiple evaluation settings, yielding significant gains on out-of-distribution combinatorial predictions and uncovering interpretable perturbation-response programs.

preprint, 2023, arXiv

Structural tuning magnetism and topology in a magnetic topological insulator

To date, the most widely studied quantum anomalous Hall insulator (QAHI) platform is achieved by dilute doping of magnetic ions into thin films of the alloyed tetradymite topological insulator (TI) (Bi$_{1-x}$Sb$_x$)$_2$Te$_3$ (BST). In these films, long-range magnetic ordering of the transition metal substituents opens an exchange gap $\Delta$ in the topological surface states, stabilizing spin-polarized, dissipationless edge channels with a nonzero Chern number $\mathcal{C}$. The long-range ordering of the spatially separated magnetic ions is itself mediated by electronic states in the host TI, leading to a sophisticated feedback between magnetic and electronic properties. Here we present a study of the electronic and magnetic response of a BST-based QAHI system to structural tuning via hydrostatic pressure. We identify a systematic closure of the topological gap under compressive strain, accompanied by a simultaneous enhancement in the magnetic ordering strength. Combining these experimental results with first-principles calculations, we identify structural deformation as a strong tuning parameter to traverse a rich topological phase space and modify magnetism in the magnetically doped BST system.

preprint, 2022, arXiv

Automated Noncontact Trapping of Moving Micro-particle with Ultrasonic Phased Array System and Microscopic Vision

Noncontact particle manipulation (NPM) technology has significantly extended humanity's analytical capability to the micro and nano scales, which in turn has greatly promoted the development of materials science and life science. Although NPM by means of electric, magnetic, and optical fields has achieved great success, from a robotics perspective it remains labor-intensive, since professional human assistance is still required in the early preparation stage. Developing automated noncontact trapping of moving particles is therefore worthwhile, particularly for applications where particle samples are rare, fragile, or contact-sensitive. Taking advantage of the latest dynamic acoustic field modulation technology, and in particular the scalability of acoustic manipulation from the micro scale to the sub-centimeter scale, we propose automated noncontact trapping of moving micro-particles with an ultrasonic phased array system and microscopic vision. The main contribution of this work is that, to the best of our knowledge, we achieve for the first time fully automated trapping of moving micro-particles in the acoustic NPM field using a robotic approach. In short, the motion of the particle is observed and predicted by a binocular microscopic vision system, and the acoustic trapping zone is then calculated and generated from this prediction to capture and stably hold the particle. The hand-eye relationship problem for the noncontact robotic end-effector is also solved in this work. Experiments demonstrate the effectiveness of the approach.

preprint, 2022, arXiv

Declaration-based Prompt Tuning for Visual Question Answering

In recent years, the pre-training-then-fine-tuning paradigm has yielded immense success on a wide spectrum of cross-modal tasks, such as visual question answering (VQA), in which a visual-language (VL) model is first optimized via self-supervised task objectives, e.g., masked language modeling (MLM) and image-text matching (ITM), and then fine-tuned to adapt to a downstream task (e.g., VQA) via a brand-new objective function, e.g., answer prediction. The inconsistency between the objective forms not only severely limits the generalization of pre-trained VL models to downstream tasks, but also requires a large amount of labeled data for fine-tuning. To alleviate the problem, we propose an innovative VL fine-tuning paradigm (named Declaration-based Prompt Tuning, abbreviated as DPT), which jointly optimizes the objectives of pre-training and fine-tuning of a VQA model, boosting the effective adaptation of pre-trained VL models to the downstream task. Specifically, DPT reformulates the objective form of the VQA task via (1) textual adaptation, which converts the given questions into declarative sentence form for prompt tuning, and (2) task adaptation, which optimizes the objective function of the VQA problem in the manner of the pre-training phase. Experimental results on the GQA dataset show that DPT outperforms the fine-tuned counterpart by a large margin in accuracy in both fully-supervised (2.68%) and zero-shot/few-shot (over 31%) settings. All the data and code will be made available to facilitate future research.

preprint, 2022, arXiv

NTIRE 2022 Challenge on High Dynamic Range Imaging: Methods and Results

This paper reviews the challenge on constrained high dynamic range (HDR) imaging that was part of the New Trends in Image Restoration and Enhancement (NTIRE) workshop, held in conjunction with CVPR 2022. This manuscript focuses on the competition set-up, datasets, the proposed methods and their results. The challenge aims at estimating an HDR image from multiple respective low dynamic range (LDR) observations, which might suffer from under- or over-exposed regions and different sources of noise. The challenge is composed of two tracks with an emphasis on fidelity and complexity constraints. In Track 1, participants are asked to optimize objective fidelity scores while imposing a low-complexity constraint (i.e., solutions cannot exceed a given number of operations). In Track 2, participants are asked to minimize the complexity of their solutions while imposing a constraint on fidelity scores (i.e., solutions are required to obtain a higher fidelity score than the prescribed baseline). Both tracks use the same data and metrics: fidelity is measured by means of PSNR with respect to a ground-truth HDR image (computed both directly and with a canonical tonemapping operation), while complexity metrics include the number of Multiply-Accumulate (MAC) operations and runtime (in seconds).
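The two fidelity scores mentioned in the abstract can be illustrated with a minimal sketch. It assumes pixel values already normalized to [0, 1] and uses a mu-law curve with mu = 5000 as the tonemapping operation; both choices are assumptions for illustration, since the abstract only says "canonical tonemapping operation". The function names are hypothetical.

```python
import math

def mu_law(x, mu=5000.0):
    """Compress a linear HDR value in [0, 1] with a mu-law curve (assumed tonemapper)."""
    return math.log(1.0 + mu * x) / math.log(1.0 + mu)

def psnr(pred, target, peak=1.0):
    """PSNR in dB between two equal-length sequences of pixel values."""
    mse = sum((p - t) ** 2 for p, t in zip(pred, target)) / len(target)
    if mse == 0:
        return math.inf  # identical inputs
    return 10.0 * math.log10(peak ** 2 / mse)

def psnr_pair(pred, target):
    """Return (PSNR on linear values, PSNR after mu-law tonemapping)."""
    linear = psnr(pred, target)
    mapped = psnr([mu_law(p) for p in pred], [mu_law(t) for t in target])
    return linear, mapped
```

Because the mu-law curve expands dark values, the same absolute error in a dark region lowers the tonemapped score more than the linear one, which is one reason to report both.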

preprint, 2022, arXiv

PointScatter: Point Set Representation for Tubular Structure Extraction

This paper explores the point set representation for tubular structure extraction tasks. Compared with the traditional mask representation, the point set representation offers greater flexibility and representational ability, since it is not restricted to a fixed grid as a mask is. Inspired by this, we propose PointScatter, an alternative to segmentation models for the tubular structure extraction task. PointScatter splits the image into scatter regions and predicts points for each scatter region in parallel. We further propose a greedy-based region-wise bipartite matching algorithm to train the network end-to-end and efficiently. We benchmark PointScatter on four public tubular datasets, and extensive experiments on tubular structure segmentation and centerline extraction tasks demonstrate the effectiveness of our approach. Code is available at https://github.com/zhangzhao2022/pointscatter.
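As a rough illustration of what greedy bipartite matching inside one region can look like, the sketch below pairs predicted and ground-truth points by ascending Euclidean distance. It is a generic greedy matcher written for this page, not the algorithm from the paper or its repository; the function name and the distance-only cost are assumptions.

```python
import math

def greedy_match(pred, gt):
    """Greedily pair predicted and ground-truth points by ascending distance.

    Returns a list of (pred_index, gt_index, distance); each point is used at most once.
    """
    # All candidate pairs, cheapest first.
    pairs = sorted(
        (math.dist(p, g), i, j)
        for i, p in enumerate(pred)
        for j, g in enumerate(gt)
    )
    used_pred, used_gt, matches = set(), set(), []
    for d, i, j in pairs:
        if i in used_pred or j in used_gt:
            continue  # one side is already matched
        used_pred.add(i)
        used_gt.add(j)
        matches.append((i, j, d))
    return matches
```

A greedy pass like this costs a sort over the candidate pairs in a region and avoids solving a full assignment problem, at the price of not always finding the minimum-cost matching.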

preprint, 2022, arXiv

Rotational hypersurfaces with constant Gauss-Kronecker curvature

We study rotational hypersurfaces with constant Gauss-Kronecker curvature. We solve the ODE for the generating curves of such hypersurfaces and analyze several geometric properties of such hypersurfaces. In particular, we discover a class of non-compact rotational hypersurfaces with constant and negative Gauss-Kronecker curvature and finite volume, which can be seen as the higher-dimensional generalization of the pseudo-sphere. Finally we investigate other types of rotational hypersurfaces with similar curvature constraints, including those with prescribed Gauss-Kronecker curvature.
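For orientation, the curvature condition can be written out in a standard form. The parametrization below (a profile function $f(x) > 0$ rotated about the $x$-axis in $\mathbb{R}^{n+1}$) and the sign convention are assumptions; the paper may use a different setup.

```latex
% Hypersurface of revolution: (x, \omega) \mapsto (x, f(x)\,\omega), \quad \omega \in S^{n-1}
\kappa_1 = -\frac{f''}{(1+f'^2)^{3/2}}, \qquad
\kappa_2 = \dots = \kappa_n = \frac{1}{f\,(1+f'^2)^{1/2}},
\qquad
K = \prod_{i=1}^{n} \kappa_i = -\frac{f''}{f^{\,n-1}\,(1+f'^2)^{(n+2)/2}} .
```

Setting $K$ equal to a constant gives a second-order ODE for the generating curve. For $n = 2$ the expression reduces to the Gaussian curvature of a surface of revolution, the setting in which the classical pseudo-sphere arises.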

preprint, 2019, arXiv

Phonon and Thermal Properties of Quasi-Two-Dimensional FePS3 and MnPS3 Antiferromagnetic Semiconductor Materials

We report the results of an investigation of the phonon and thermal properties of exfoliated films of layered single crystals of the antiferromagnetic semiconductors FePS3 and MnPS3. Raman spectroscopy was conducted using three different excitation lasers with wavelengths of 325 nm (UV), 488 nm (blue), and 633 nm (red). The resonant UV-Raman spectroscopy reveals new spectral features that are not detectable via visible Raman light scattering. The thermal conductivity of FePS3 and MnPS3 thin films was measured by two different techniques: the steady-state Raman optothermal technique and the transient time-resolved magneto-optical Kerr effect. The Raman optothermal measurements gave an orientation-averaged thermal conductivity of FePS3 of 1.35 W/mK at room temperature. The transient measurements revealed that the through-plane and in-plane thermal conductivities of FePS3 are 0.85 W/mK and 2.7 W/mK, respectively. The films of MnPS3 have higher thermal conductivities of 1.1 W/mK through-plane and 6.3 W/mK in-plane. The data obtained by both techniques reveal strong thermal anisotropy of the films and the dominant contribution of phonons to heat conduction. Our results are important for the proposed applications of antiferromagnetic semiconductor thin films in spintronic devices.
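The anisotropy can be made concrete by taking the ratio of the in-plane to through-plane values quoted in the abstract. The symbols $k_{\parallel}$ (in-plane) and $k_{\perp}$ (through-plane) are notation introduced here, not taken from the paper.

```latex
\left.\frac{k_{\parallel}}{k_{\perp}}\right|_{\mathrm{FePS_3}} = \frac{2.7}{0.85} \approx 3.2,
\qquad
\left.\frac{k_{\parallel}}{k_{\perp}}\right|_{\mathrm{MnPS_3}} = \frac{6.3}{1.1} \approx 5.7
```

By these figures, heat flows roughly three times more readily in-plane than through-plane in FePS3, and nearly six times more readily in MnPS3.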

preprint, 2013, arXiv

A Simple Policy for Multiple Queues with Size-Independent Service Times

We consider a service system with two Poisson arrival queues. A server chooses which queue to serve at each moment. Once a queue is served, all the customers will be served within a fixed amount of time. This model is useful in studying airport shuttling or certain online computing systems. We propose a simple yet optimal state-independent policy for this problem which is not only easy to implement, but also performs very well.
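The model in the abstract can be explored with a small simulation. The abstract does not say which state-independent policy is optimal, so this sketch uses a fixed repeating visit pattern as an assumed example. It also assumes that a visit clears only the customers already waiting when the visit starts, and that each visit takes the same fixed time; all names are hypothetical.

```python
import random

def poisson_arrivals(rate, horizon, rng):
    """Arrival times of a Poisson process with the given rate on [0, horizon)."""
    times, t = [], rng.expovariate(rate)
    while t < horizon:
        times.append(t)
        t += rng.expovariate(rate)
    return times

def simulate_cyclic(rates, pattern, service_time, horizon, seed=0):
    """Mean time in system under a fixed visit pattern (a state-independent policy).

    Each visit takes `service_time` and clears every customer who was already
    waiting at the visited queue when the visit started (assumed model detail).
    """
    rng = random.Random(seed)
    arrivals = [poisson_arrivals(r, horizon, rng) for r in rates]
    next_idx = [0] * len(rates)  # first unserved customer in each queue
    total_delay, served = 0.0, 0
    t, k = 0.0, 0
    while t < horizon:
        q = pattern[k % len(pattern)]  # the choice ignores queue lengths
        done = t + service_time
        times, i = arrivals[q], next_idx[q]
        while i < len(times) and times[i] <= t:
            total_delay += done - times[i]  # waiting plus service
            served += 1
            i += 1
        next_idx[q] = i
        t, k = done, k + 1
    return total_delay / served if served else 0.0
```

For example, `simulate_cyclic((1.0, 1.0), (0, 1), 1.0, 2000.0)` alternates between two equally loaded queues; changing `pattern` to something like `(0, 0, 1)` lets you compare schedules when the arrival rates differ.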

preprint, 2013, arXiv

Some Results on Ethnic Conflicts Based on Evolutionary Game Simulation

The force of ethnic separatism, which essentially originates from the negative effects of ethnic identity, is damaging the stability and harmony of multiethnic countries. To eliminate the foundation of ethnic separatism and establish harmonious ethnic relationships, some scholars have proposed that ethnic harmony could be promoted by popularizing civic identity. However, this viewpoint has been discussed only from a philosophical perspective and still lacks supporting scientific evidence. Because ethnic groups and ethnic identity are products of evolution, and ethnic identity is a parochialism strategy from the perspective of game theory, this paper proposes an evolutionary game simulation model, based on evolutionary game theory, to study the relationship between civic identity and ethnic conflict. The simulation results indicate that: 1) the ratio of individuals with civic identity has a positive association with the frequency of ethnic conflicts; 2) ethnic conflict will not die out by killing all ethnic members once and for all, and it also cannot be reduced by forcible pressure, i.e., increasing the ratio of individuals with civic identity; 3) the average frequency of conflicts can stay at a low level by promoting civic identity periodically and persistently.
