Source author record

Lijun Li

Lijun Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

cond-mat.mtrl-sci cond-mat.str-el cond-mat.mes-hall cond-mat.supr-con Artificial Intelligence Computation and Language Computer Science and Game Theory Computer Vision cond-mat.other Distributed, Parallel, and Cluster Computing econ.TH Logic in Computer Science Machine Learning

Catalog footprint

What is connected

13works

13topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Relay Buffer Independent Communication over Pooled HBM for Efficient MoE Inference on Ascend

Mixture-of-Experts (MoE) inference requires large-scale token exchange across devices, making dispatch and combine major bottlenecks in both prefill and decode. Beyond network transfer, routing-driven layout transformation, temporary relay, and output restoration can add substantial overhead. Existing MoE communication paths are often buffer-centric, using explicit inter-process relay and reordering buffers around collective transfer. This report presents a relay-buffer-free communication design for MoE inference acceleration on Ascend systems. The design reorganizes dispatch and combine around direct placement into destination expert windows and direct reading from remote expert windows. Built on globally pooled high-bandwidth memory and symmetric-memory allocation, it removes most intermediate relay and reordering buffers while retaining only lightweight control state, including counts, offsets, and synchronization metadata. We instantiate the design as two schedules for the main phases of MoE inference: a prefill schedule with richer planning state for throughput-oriented execution, and a compact decode schedule for latency-sensitive execution. Experiments on Ascend-based MoE workloads show reduced dispatch and combine latency in both settings. At the serving level, the implementation improves time to first token (TTFT), preserves competitive time per output token (TPOT), and enlarges the feasible scheduling space under practical latency constraints. These results indicate that, on platforms with globally addressable device memory, reducing intermediate buffering and output restoration around expert execution is an effective direction for accelerating MoE inference.

preprint2026arXiv

ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

While LLM-based agents can interact with environments via invoking external tools, their expanded capabilities also amplify security risks. Monitoring step-level tool invocation behaviors in real time and proactively intervening before unsafe execution is critical for agent deployment, yet remains under-explored. In this work, we first construct TS-Bench, a novel benchmark for step-level tool invocation safety detection in LLM agents. We then develop a guardrail model, TS-Guard, using multi-task reinforcement learning. The model proactively detects unsafe tool invocation actions before execution by reasoning over the interaction history. It assesses request harmfulness and action-attack correlations, producing interpretable and generalizable safety judgments and feedback. Furthermore, we introduce TS-Flow, a guardrail-feedback-driven reasoning framework for LLM agents, which reduces harmful tool invocations of ReAct-style agents by 65 percent on average and improves benign task completion by approximately 10 percent under prompt injection attacks.

preprint2022arXiv

Approximate Group Fairness for Clustering

We incorporate group fairness into the algorithmic centroid clustering problem, where $k$ centers are to be located to serve $n$ agents distributed in a metric space. We refine the notion of proportional fairness proposed in [Chen et al., ICML 2019] as {\em core fairness}, and $k$-clustering is in the core if no coalition containing at least $n/k$ agents can strictly decrease their total distance by deviating to a new center together. Our solution concept is motivated by the situation where agents are able to coordinate and utilities are transferable. A string of existence, hardness and approximability results is provided. Particularly, we propose two dimensions to relax core requirements: one is on the degree of distance improvement, and the other is on the size of deviating coalition. For both relaxations and their combination, we study the extent to which relaxed core fairness can be satisfied in metric spaces including line, tree and general metric space, and design approximation algorithms accordingly.

preprint2022arXiv

One-stage Action Detection Transformer

In this work, we introduce our solution to the EPIC-KITCHENS-100 2022 Action Detection challenge. One-stage Action Detection Transformer (OADT) is proposed to model the temporal connection of video segments. With the help of OADT, both the category and time boundary can be recognized simultaneously. After ensembling multiple OADT models trained from different features, our model can reach 21.28\% action mAP and ranks the 1st on the test-set of the Action detection challenge.

preprint2020arXiv

Gate-Tunable Reversible Rashba-Edelstein Effect in a Few-Layer Graphene/2H-TaS2 Heterostructure at Room Temperature

We report the observation of current-induced spin polarization, the Rashba-Edelstein effect (REE), and its Onsager reciprocal phenomenon, the spin galvanic effect (SGE), in a few-layer graphene/2H-TaS2 heterostructure at room temperature. Spin-sensitive electrical measurements unveil full spin-polarization reversal by an applied gate voltage. The observed gate-tunable charge-to-spin conversion is explained by the ideal work function mismatch between 2H-TaS2 and graphene, which allows strong interface-induced Bychkov-Rashba interaction with a spin-gap reaching 70 meV, while keeping the Dirac nature of the spectrum intact across electron and hole sectors. The reversible electrical generation and control of the nonequilibrium spin polarization vector, not previously observed in a nonmagnetic material, are elegant manifestations of emergent 2D Dirac fermions with robust spin-helical structure. Our experimental findings, supported by first-principles relativistic electronic structure and transport calculations, demonstrate a route to design low-power spin-logic circuits from layered materials.

preprint2016arXiv

Electron-hole asymmetry, Dirac fermions, and quantum magnetoresistance in BaMnBi2

We report two-dimensional quantum transport and Dirac fermions in BaMnBi2 single crystals. BaMnBi2 is a layered bad metal with highly anisotropic conductivity and magnetic order below 290 K. Magnetotransport properties, nonzero Berry phase, small cyclotronmass, and the first-principles band structure calculations indicate the presence of Dirac fermions in Bi square nets. Quantum oscillations in the Hall channel suggest the presence of both electron and hole pockets, whereas Dirac and parabolic states coexist at the Fermi level.

preprint2016arXiv

Model-Checking of Linear-Time Properties Based on Possibility Measure

We study the LTL model-checking in possibilistic Kripke structure using possibility measure. First, the notion of possibilistic Kripke structure and the related possibility measure are introduced, then model-checking of reachability and repeated reachability linear-time properties in finite possibilistic Kripke structure are studied. Standard safety property and -regular property in possibilistic Kripke structure are introduced, the verification of regular safety property and -regular property using finite automata are thoroughly studied. It has been shown that the verification of regular safety property and -regular property in finite possibilistic Kripke structure can be transformed into the verification of reachability property and repeated reachability property in the product possibilistic Kripke structure introduced in this paper. Several examples are given to illustrate the methods presented in the paper.

preprint2016arXiv

Superconductivity and Charge Density Wave in ZrTe$_{3-x}$Se$_{x}$

Charge density wave (CDW), the periodic modulation of the electronic charge density, will open a gap on the Fermi surface that commonly leads to decreased or vanishing conductivity. On the other hand superconductivity, a commonly believed competing order, features a Fermi surface gap that results in infinite conductivity. Here we report that superconductivity emerges upon Se doping in CDW conductor ZrTe$_{3}$ when the long range CDW order is gradually suppressed. Superconducting critical temperature $T_c(x)$ in ZrTe$_{3-x}$Se$_x$ (${0\leq}x\leq0.1$) increases up to 4 K plateau for $0.04$$\leq$$x$$\leq$$0.07$. Further increase in Se content results in diminishing $T_{c}$ and filametary superconductivity. The CDW modes from Raman spectra are observed in $x$ = 0.04 and 0.1 crystals, where signature of ZrTe$_{3}$ CDW order in resistivity vanishes. The electronic-scattering for high $T_{c}$ crystals is dominated by local CDW fluctuations at high temperures, the resistivity is linear up to highest measured $T=300K$ and contributes to substantial in-plane anisotropy.

preprint2014arXiv

Anisotropic giant magnetoresistance in NbSb2

The extremely large transverse magnetoreistance (the magnetoresistant ratio $\sim 1.3\times10^5\%$ in 2 K and 9 T field, and $4.3\times 10^6\%$ in 0.4 K and 32 T field, without saturation), and the metal-semiconductor crossover induced by magnetic field, are reported in NbSb$_2$ single crystal with electric current parallel to the $b$-axis. The metal-semiconductor crossover is preserved when the current is along the $ac$-plane but the magnetoresistant ratio is significantly suppressed. The sign reversal of the Hall resistivity in the field close to the crossover point, and the electronic structure calculation reveals the coexistence of a small number of holes with very high mobility and a large number of electrons with low mobility. These effects are attributed to the change of the Fermi surface induced by the magnetic field.

preprint2014arXiv

Microscopic evidence for strong periodic lattice distortion in 2D charge-density wave systems

In the quasi-2D electron systems of the layered transition metal dichalcogenides (TMD) there is still a controversy about the nature of the transitions to charge-density wave (CDW) phases, i.e. whether they are described by a Peierls-type mechanism or by a lattice-driven model. By performing scanning tunneling microscopy (STM) experiments on the canonical TMD-CDW systems, we have imaged the electronic modulation and the lattice distortion separately in 2H-TaS$_2$, TaSe$_2$, and NbSe$_2$. Across the three materials, we found dominant lattice contributions instead of the electronic modulation expected from Peierls transitions, in contrast to the CDW states that show the hallmark of contrast inversion between filled and empty states. Our results imply that the periodic lattice distortion (PLD) plays a vital role in the formation of CDW phases in the TMDs and illustrate the importance of taking into account the more complicated lattice degree of freedom when studying correlated electron systems.

preprint2013arXiv

Computing with Non-equilibrium Ratchets

Electronic ratchets transduce local spatial asymmetries into directed currents in the absence of a global drain bias, by rectifying temporal signals that reside far from thermal equilibrium. We show that the absence of a drain bias can provide distinct energy advantages for computation, specifically, reducing static dissipation in a logic circuit. Since the ratchet functions as a gate voltage-controlled current source, it also potentially reduces the dynamic dissipation associated with charging/discharging capacitors. In addition, the unique charging mechanism eliminates timing related constraints on logic inputs, in principle allowing for adiabatic charging. We calculate the ratchet currents in classical and quantum limits, and show how a sequence of ratchets can be cascaded to realize universal Boolean logic.

preprint2013arXiv

New Superconductivity in Layered 1T-TaS2-xSex Single Crystals Fabricated by Chemical Vapor Transport

Layered transition-metal dichalcogenides 1T-TaS2-xSex (0<=x<=2) single crystals have been successfully fabricated by using a chemical vapor transport technique in which Ta locates in octahedral coordination with S and Se atoms. This is the first superconducting example by the substitution of S site, which violates an initial rule based on the fact that superconductivity merely emerges in 1T-TaS2 by applying the high pressure or substitution of Ta site. We demonstrate the appearance of a series of electronic states in 1T-TaS2-xSex with Se content. Namely, the Mott phase melts into a nearly commensurate charge-density-wave (NCCDW) phase, superconductivity in a wide x range develops within the NCCDW state, and finally commensurate charge-density-wave (CCDW) phase reproduces for heavy Se content. The present results reveal that superconductivity is only characterized by robust Ta 5d band, demonstrating the universal nature in 1T-TaS2 systems that superconductivity and NCCDW phase coexist in the real space.

preprint2009arXiv

Superconductivity and single crystal growth of Ni0:05TaS2

Superconductivity was discovered in a Ni0:05TaS2 single crystal. A Ni0:05TaS2 single crystal was successfully grown via the NaCl/KCl flux method. The obtained lattice constant c of Ni0:05TaS2 is 1.1999 nm, which is significantly smaller than that of 2H-TaS2 (1.208 nm). Electrical resistivity and magnetization measurements reveal that the superconductivity transition temperature of Ni0:05TaS2 is enhanced from 0.8 K (2H-TaS2) to 3.9 K. The charge-density-wave transition of the matrix compound 2H-TaS2 is suppressed in Ni0:05TaS2. The success of Ni0:05TaS2 single crystal growth via a NaCl/KCl flux demonstrates that NaCl/KCl flux method will be a feasible method for single crystal growth of the layered transition metal dichalcogenides.

Lijun Li

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

Relay Buffer Independent Communication over Pooled HBM for Efficient MoE Inference on Ascend

ToolSafe: Enhancing Tool Invocation Safety of LLM-based agents via Proactive Step-level Guardrail and Feedback

Approximate Group Fairness for Clustering

One-stage Action Detection Transformer

Gate-Tunable Reversible Rashba-Edelstein Effect in a Few-Layer Graphene/2H-TaS2 Heterostructure at Room Temperature

Electron-hole asymmetry, Dirac fermions, and quantum magnetoresistance in BaMnBi2

Model-Checking of Linear-Time Properties Based on Possibility Measure

Superconductivity and Charge Density Wave in ZrTe$_{3-x}$Se$_{x}$

Anisotropic giant magnetoresistance in NbSb2

Microscopic evidence for strong periodic lattice distortion in 2D charge-density wave systems

Computing with Non-equilibrium Ratchets

New Superconductivity in Layered 1T-TaS2-xSex Single Crystals Fabricated by Chemical Vapor Transport

Superconductivity and single crystal growth of Ni0:05TaS2