Source author record

Yuhan Zhang

Yuhan Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mtrl-sci Computer Vision cond-mat.mes-hall Artificial Intelligence Distributed, Parallel, and Cluster Computing Machine Learning math-ph math.DS math.MP

Catalog footprint

What is connected

9works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Phase transitions for unique codings of fat Sierpinski gaskets with multiple digits

Given an integer $M\ge 1$ and $β\in(1, M+1)$, let $S_{β, M}$ be the fat Sierpinski gasket in $\mathbb R^2$ generated by the iterated function system $\left\{f_d(x)=\frac{x+d}β: d\inΩ_M\right\}$, where $Ω_M=\{(i,j)\in\mathbb Z_{\ge 0}^2: i+j\le M\}$. Then each $x\in S_{β, M}$ may be represented as a series $x=\sum_{i=1}^\infty\frac{d_i}{β^i}=:Π_β((d_i))$, and the infinite sequence $(d_i)\inΩ_M^{\mathbb N}$ is called a \emph{coding} of $x$. Since $β<M+1$, a point in $S_{β, M}$ may have multiple codings. Let $U_{β, M}$ be the set of $x\in S_{β, M}$ having a unique coding, that is \[ U_{β, M}=\left\{x\in S_{β, M}: \#Π_β^{-1}(x)=1\right\}. \] When $M=1$, Kong and Li [2020, Nonlinearity] described two critical bases for the phase transitions of the intrinsic univoque set $\widetilde U_{β, 1}$, which is a subset of $U_{β, 1}$. In this paper we consider $M\ge 2$, and characterize the two critical bases $β_G(M)$ and $β_c(M)$ for the phase transitions of $U_{β, M}$: (i) if $β\in(1, β_G(M)]$, then $U_{β, M}$ is finite; (ii) if $β\in(β_G(M), β_c(M))$ then $U_{β, M}$ is countably infinite; (iii) if $β=β_c(M)$ then $U_{β, M}$ is uncountable and has zero Hausdorff dimension; (iv) if $β>β_c(M)$ then $U_{β, M}$ has positive Hausdorff dimension. Our results can also be applied to the intrinsic univoque set $\widetilde{U}_{β, M}$. Moreover, we show that the first critical base $β_G(M)$ is a Perron number, while the second critical base $β_c(M)$ is a transcendental number.

preprint2026arXiv

RelayGR: Scaling Long-Sequence Generative Recommendation via Cross-Stage Relay-Race Inference

Real-time recommender systems execute multi-stage cascades (retrieval, pre-processing, fine-grained ranking) under strict tail-latency SLOs, leaving only tens of milliseconds for ranking. Generative recommendation (GR) models can improve quality by consuming long user-behavior sequences, but in production their online sequence length is tightly capped by the ranking-stage P99 budget. We observe that the majority of GR tokens encode user behaviors that are independent of the item candidates, suggesting an opportunity to pre-infer a user-behavior prefix once and reuse it during ranking rather than recomputing it on the critical path. Realizing this idea at industrial scale is non-trivial: the prefix cache must survive across multiple pipeline stages before the final ranking instance is determined, the user population implies cache footprints far beyond a single device, and indiscriminate pre-inference would overload shared resources under high QPS. We present RelayGR, a production system that enables in-HBM relay-race inference for GR. RelayGR selectively pre-infers long-term user prefixes, keeps their KV caches resident in HBM over the request lifecycle, and ensures the subsequent ranking can consume them without remote fetches. RelayGR combines three techniques: 1) a sequence-aware trigger that admits only at-risk requests under a bounded cache footprint and pre-inference load, 2) an affinity-aware router that co-locates cache production and consumption by routing both the auxiliary pre-infer signal and the ranking request to the same instance, and 3) a memory-aware expander that uses server-local DRAM to capture short-term cross-request reuse while avoiding redundant reloads. We implement RelayGR on Huawei Ascend NPUs and evaluate it with real queries. Under a fixed P99 SLO, RelayGR supports up to 1.5$\times$ longer sequences and improves SLO-compliant throughput by up to 3.6$\times$.

preprint2026arXiv

Test-time generative augmentation for medical image segmentation

Medical image segmentation is critical for clinical diagnosis, treatment planning, and monitoring, yet segmentation models often struggle with uncertainties stemming from occlusions, ambiguous boundaries, and variations in imaging devices. Traditional test-time augmentation (TTA) techniques typically rely on predefined geometric and photometric transformations, limiting their adaptability and effectiveness in complex medical scenarios. In this study, we introduced Test-Time Generative Augmentation (TTGA), a novel augmentation strategy specifically tailored for medical image segmentation at inference time. Different from conventional augmentation strategies that suffer from excessive randomness or limited flexibility, TTGA leverages a domain-fine-tuned generative model to produce contextually relevant and diverse augmentations tailored to the characteristics of each test image. Built upon diffusion model inversion, a masked null-text inversion method is proposed to enable region-specific augmentations during sampling. Furthermore, a dual denoising pathway is designed to balance precise identity preservation with controlled variability. We demonstrate the efficacy of our TTGA through extensive experiments across three distinct segmentation tasks spanning nine datasets. Our results consistently demonstrate that TTGA not only improves segmentation accuracy (with DSC gains ranging from 0.1% to 2.3% over the baseline) but also offers pixel-wise error estimation (with DSC gains ranging from 1.1% to 29.0% over the baseline). The source code and demonstration are available at: https://github.com/maxiao0234/TTGA.

preprint2023arXiv

Semantic Interleaving Global Channel Attention for Multilabel Remote Sensing Image Classification

Multi-Label Remote Sensing Image Classification (MLRSIC) has received increasing research interest. Taking the cooccurrence relationship of multiple labels as additional information helps to improve the performance of this task. Current methods focus on using it to constrain the final feature output of a Convolutional Neural Network (CNN). On the one hand, these methods do not make full use of label correlation to form feature representation. On the other hand, they increase the label noise sensitivity of the system, resulting in poor robustness. In this paper, a novel method called Semantic Interleaving Global Channel Attention (SIGNA) is proposed for MLRSIC. First, the label co-occurrence graph is obtained according to the statistical information of the data set. The label co-occurrence graph is used as the input of the Graph Neural Network (GNN) to generate optimal feature representations. Then, the semantic features and visual features are interleaved, to guide the feature expression of the image from the original feature space to the semantic feature space with embedded label relations. SIGNA triggers global attention of feature maps channels in a new semantic feature space to extract more important visual features. Multihead SIGNA based feature adaptive weighting networks are proposed to act on any layer of CNN in a plug-and-play manner. For remote sensing images, better classification performance can be achieved by inserting CNN into the shallow layer. We conduct extensive experimental comparisons on three data sets: UCM data set, AID data set, and DFC15 data set. Experimental results demonstrate that the proposed SIGNA achieves superior classification performance compared to state-of-the-art (SOTA) methods. It is worth mentioning that the codes of this paper will be open to the community for reproducibility research. Our codes are available at https://github.com/kyle-one/SIGNA.

preprint2020arXiv

Circle Loss: A Unified Perspective of Pair Similarity Optimization

This paper provides a pair similarity optimization viewpoint on deep feature learning, aiming to maximize the within-class similarity $s_p$ and minimize the between-class similarity $s_n$. We find a majority of loss functions, including the triplet loss and the softmax plus cross-entropy loss, embed $s_n$ and $s_p$ into similarity pairs and seek to reduce $(s_n-s_p)$. Such an optimization manner is inflexible, because the penalty strength on every single similarity score is restricted to be equal. Our intuition is that if a similarity score deviates far from the optimum, it should be emphasized. To this end, we simply re-weight each similarity to highlight the less-optimized similarity scores. It results in a Circle loss, which is named due to its circular decision boundary. The Circle loss has a unified formula for two elemental deep feature learning approaches, i.e. learning with class-level labels and pair-wise labels. Analytically, we show that the Circle loss offers a more flexible optimization approach towards a more definite convergence target, compared with the loss functions optimizing $(s_n-s_p)$. Experimentally, we demonstrate the superiority of the Circle loss on a variety of deep feature learning tasks. On face recognition, person re-identification, as well as several fine-grained image retrieval datasets, the achieved performance is on par with the state of the art.

preprint2016arXiv

Probing Carrier Transport and Structure-property Relationship of Highly Ordered Organic Semiconductors at Two-dimensional Limit

One of the basic assumptions in organic field-effect transistors, the most fundamental device unit in organic electronics, is that charge transport occurs two-dimensionally in the first few molecular layers near the dielectric interface. Although the mobility of bulk organic semiconductors has increased dramatically, direct probing of intrinsic charge transport in the two-dimensional limit has not been possible due to excessive disorders and traps in ultrathin organic thin films. Here, highly ordered mono- to tetra-layer pentacene crystals are realized by van der Waals (vdW) epitaxy on hexagonal BN. We find that the charge transport is dominated by hopping in the first conductive layer, but transforms to band-like in subsequent layers. Such abrupt phase transition is attributed to strong modulation of the molecular packing by interfacial vdW interactions, as corroborated by quantitative structural characterization and density functional theory calculations. The structural modulation becomes negligible beyond the second conductive layer, leading to a mobility saturation thickness of only ~3nm. Highly ordered organic ultrathin films provide a platform for new physics and device structures (such as heterostructures and quantum wells) that are not possible in conventional bulk crystals.

preprint2015arXiv

A van der Waals pn heterojunction with organic/inorganic semiconductors

van der Waals (vdW) heterojunctions formed by two-dimensional (2D) materials have attracted tremendous attention due to their excellent electrical/optical properties and device applications. However, current 2D heterojunctions are largely limited to atomic crystals, and hybrid organic/inorganic structures are rarely explored. Here, we fabricate hybrid 2D heterostructures with p-type dioctylbenzothienobenzothiophene (C8-BTBT) and n-type MoS2. We find that few-layer C8-BTBT molecular crystals can be grown on monolayer MoS2 by vdW epitaxy, with pristine interface and controllable thickness down to monolayer. The operation of the C8-BTBT/MoS2 vertical heterojunction devices is highly tunable by bias and gate voltages between three different regimes: interfacial recombination, tunneling and blocking. The pn junction shows diode-like behavior with rectifying ratio up to 105 at the room temperature. Our devices also exhibit photovoltaic responses with power conversion efficiency of 0.31% and photoresponsivity of 22mA/W. With wide material combinations, such hybrid 2D structures will offer possibilities for opto-electronic devices that are not possible from individual constituents.

preprint2014arXiv

Two-dimensional Quasi-Freestanding Molecular Crystals for High-Performance Organic Field-Effect Transistors

Two-dimensional atomic crystals are extensively studied in recent years due to their exciting physics and device applications. However, a molecular counterpart, with scalable processability and competitive device performance, is still challenging. Here, we demonstrate that high-quality few-layer dioctylbenzothienobenzothiophene molecular crystals can be grown on graphene or boron nitride substrate via van der Waals epitaxy, with precisely controlled thickness down to monolayer, large-area single crystal, low process temperature and patterning capability. The crystalline layers are atomically smooth and effectively decoupled from the substrate due to weak van der Waals interactions, affording a pristine interface for high-performance organic transistors. As a result, monolayer dioctylbenzothienobenzothiophene molecular crystal field-effect transistors on boron nitride show record-high carrier mobility up to 10cm2V-1s-1 and aggressively scaled saturation voltage around 1V. Our work unveils an exciting new class of two-dimensional molecular materials for electronic and optoelectronic applications.

preprint2014arXiv

Unveiling the Structural Origin of the High Carrier Mobility of a Molecular Monolayer on Boron Nitride

Very recently, it was demonstrated that the carrier mobility of a molecular monolayer dioctylbenzothienobenzothiophene (C8-BTBT) on boron nitride can reach 10 cm2/Vs, the highest among the previously reported monolayer molecular field-effect transistors. Here we show that the high-quality single crystal of the C8-BTBT monolayer may be the key origin of the record-high carrier mobility. We discover that the C8-BTBT molecules prefer layer-by-layer growth on both hexagonal boron nitride and graphene. The flatness of these substrates substantially decreases the C8-BTBT nucleation density and enables repeatable growth of large-area single crystal of the C8-BTBT monolayer. Our experimental result indicates that only out-of-plane roughness greater than 0.6 nm of the substrates could induce disturbance in the crystal growth and consequently affect the charge transport. This information would be important in guiding the growth of high-quality epitaxy molecular film.

Yuhan Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

9 published item(s)

Phase transitions for unique codings of fat Sierpinski gaskets with multiple digits

RelayGR: Scaling Long-Sequence Generative Recommendation via Cross-Stage Relay-Race Inference

Test-time generative augmentation for medical image segmentation

Semantic Interleaving Global Channel Attention for Multilabel Remote Sensing Image Classification

Circle Loss: A Unified Perspective of Pair Similarity Optimization

Probing Carrier Transport and Structure-property Relationship of Highly Ordered Organic Semiconductors at Two-dimensional Limit

A van der Waals pn heterojunction with organic/inorganic semiconductors

Two-dimensional Quasi-Freestanding Molecular Crystals for High-Performance Organic Field-Effect Transistors

Unveiling the Structural Origin of the High Carrier Mobility of a Molecular Monolayer on Boron Nitride