Source author record

Yanjing Li

Yanjing Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mes-hall Machine Learning Artificial Intelligence Computer Vision

Catalog footprint

What is connected

4works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

SURGE: Surrogate Gradient Adaptation in Binary Neural Networks

The training of Binary Neural Networks (BNNs) is fundamentally based on gradient approximation for non-differentiable binarization operations (e.g., sign function). However, prevailing methods including the Straight-Through Estimator (STE) and its improved variants, rely on hand-crafted designs that suffer from gradient mismatch problem and information loss induced by fixed-range gradient clipping. To address this, we propose SURrogate GradiEnt Adaptation (SURGE), a novel learnable gradient compensation framework with theoretical grounding. SURGE mitigates gradient mismatch through auxiliary backpropagation. Specifically, we design a Dual-Path Gradient Compensator (DPGC) that constructs a parallel full-precision auxiliary branch for each binarized layer, decoupling gradient flow via output decomposition during backpropagation. DPGC enables bias-reduced gradient estimation by leveraging the full-precision branch to estimate components beyond STE's first-order approximation. To further enhance training stability, we introduce an Adaptive Gradient Scaler (AGS) based on an optimal scale factor to dynamically balance inter-branch gradient contributions via norm-based scaling. Experiments on image classification, object detection, and language understanding tasks demonstrate that SURGE performs best over state-of-the-art methods.

preprint2022arXiv

TerViT: An Efficient Ternary Vision Transformer

Vision transformers (ViTs) have demonstrated great potential in various visual tasks, but suffer from expensive computational and memory cost problems when deployed on resource-constrained devices. In this paper, we introduce a ternary vision transformer (TerViT) to ternarize the weights in ViTs, which are challenged by the large loss surface gap between real-valued and ternary parameters. To address the issue, we introduce a progressive training scheme by first training 8-bit transformers and then TerViT, and achieve a better optimization than conventional methods. Furthermore, we introduce channel-wise ternarization, by partitioning each matrix to different channels, each of which is with an unique distribution and ternarization interval. We apply our methods to popular DeiT and Swin backbones, and extensive results show that we can achieve competitive performance. For example, TerViT can quantize Swin-S to 13.1MB model size while achieving above 79% Top-1 accuracy on ImageNet dataset.

preprint2013arXiv

From Coulomb Blockade to Resonant Transmission in a MoS2 Nanoribbon

We have measured a nanoribbon of MoS2 at low temperature, and observed the evolution of the system from a regime of multiple small quantum dots in series to one where the entire nanoribbon acts as a single quantum dot. At higher Fermi energies, resonant transmission through disorder-induced potential wells is evident. Our findings shed light on the length scale of quasi-ballistic transport in the material.

preprint2012arXiv

Tunneling Spectroscopy of Graphene using Planar Pb Probes

We show that evaporating lead (Pb) directly on graphene can create high-yield, high-quality tunnel probes, and we demonstrate high magnetic field/low temperature spectroscopy using these probes. Comparisons of Pb, Al and Ti/Au probes shows that after oxidation a well-formed self-limited tunnel barrier is created only between the Pb and the graphene. Tunneling spectroscopy using the Pb probes manifests energy-dependent features such as scattering resonances and localization behavior, and can thus be used to probe the microscopic electronics of graphene.