Source author record

Jing Long

Jing Long appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher

Unclaimed source record

Topics: Machine Learning; physics.optics; Artificial Intelligence; cond-mat.mes-hall; Distributed, Parallel, and Cluster Computing; Information Retrieval

Catalog footprint

What is connected

4 works

6 topics

4 close collaborators


preprint, 2026, arXiv

AdaptiveLoad: Towards Efficient Video Diffusion Transformer Training

In video generation models, particularly world models, training large-scale video diffusion Transformers (such as DiT and MMDiT) poses significant computational challenges due to the extreme variance in sequence lengths within mixed-mode datasets. Existing bucket-based data loading strategies typically rely on "equal token length" constraints. This approach fails to account for the quadratic complexity of self-attention mechanisms, leading to severe load imbalance and underutilization of GPU resources. This paper proposes AdaptiveLoad, an integrated optimization framework consisting of two core components: (1) a dual-constraint adaptive load balancing system, which eliminates long-sequence bottlenecks by simultaneously limiting memory consumption and computational load ($B \times S^p \le M_{\text{comp}}$); (2) a fused LayerNorm-Modulate CUDA kernel, which utilizes a D-tile coalesced reduction strategy to increase throughput and alleviate memory pressure. Experimental results on the Wan 2.1 world model demonstrate that our method reduces the computational imbalance rate from 39% to 18.9%, improves peak VRAM utilization efficiency by 22.7%, and achieves an overall training throughput increase of 27.2%.
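The dual-constraint idea in the abstract above can be illustrated with a minimal sketch. This is not the paper's implementation: the memory constraint is assumed here to be a plain token budget ($B \times S \le M_{\text{mem}}$), the exponent p = 2 is assumed to model quadratic self-attention, and the function name and budget values are hypothetical.

```python
def max_batch_size(seq_len, mem_budget, comp_budget, p=2.0):
    """Largest batch size B for one bucket of sequence length S = seq_len.

    Assumed constraints (illustrative only):
      memory:  B * S    <= mem_budget   (the usual equal-token-length rule)
      compute: B * S**p <= comp_budget  (the B x S^p <= M_comp bound)
    """
    by_memory = mem_budget // seq_len
    by_compute = int(comp_budget // (seq_len ** p))
    # Fall back to a single sample if a bucket exceeds both budgets.
    return max(1, min(by_memory, by_compute))


# Hypothetical budgets: 65,536 tokens of memory, and a compute budget
# chosen so that the 4,096-token bucket is exactly balanced.
MEM_BUDGET = 65_536
COMP_BUDGET = 16 * 4_096 ** 2

batch_sizes = {
    s: max_batch_size(s, MEM_BUDGET, COMP_BUDGET)
    for s in (1_024, 4_096, 16_384)
}
print(batch_sizes)  # {1024: 64, 4096: 16, 16384: 1}
```

With a token budget alone, the 16,384-token bucket would get a batch size of 4, so its B × S² cost would be four times that of the 4,096-token bucket. Under the assumed compute bound, its batch size drops to 1 and the two buckets cost the same per step.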

preprint, 2022, arXiv

Decentralized Collaborative Learning Framework for Next POI Recommendation

Next Point-of-Interest (POI) recommendation has become an indispensable functionality in Location-based Social Networks (LBSNs) due to its effectiveness in helping people decide the next POI to visit. However, accurate recommendation requires a vast amount of historical check-in data, thus threatening user privacy as the location-sensitive data needs to be handled by cloud servers. Although there have been several on-device frameworks for privacy-preserving POI recommendations, they are still resource-intensive when it comes to storage and computation, and show limited robustness to the high sparsity of user-POI interactions. On this basis, we propose a novel decentralized collaborative learning framework for POI recommendation (DCLR), which allows users to train their personalized models locally in a collaborative manner. DCLR significantly reduces the local models' dependence on the cloud for training, and can be used to expand arbitrary centralized recommendation models. To counteract the sparsity of on-device user data when learning each local model, we design two self-supervision signals to pretrain the POI representations on the server with geographical and categorical correlations of POIs. To facilitate collaborative learning, we innovatively propose to incorporate knowledge from either geographically or semantically similar users into each local model with attentive aggregation and mutual information maximization. The collaborative learning process makes use of communications between devices while requiring only minor engagement from the central server for identifying user groups, and is compatible with common privacy preservation mechanisms like differential privacy. We evaluate DCLR with two real-world datasets, where the results show that DCLR outperforms state-of-the-art on-device frameworks and yields competitive results compared with centralized counterparts.

preprint, 2015, arXiv

Plasmonic Crystal Cavity on Single-Mode Optical Fiber End Facet for Label-Free Biosensing

All surface plasmon resonance (SPR) devices reported to date on the end facets of single-mode optical fibers (SMFs) are limited by severely broad and shallow resonance spectra. The consequent poor performance when they are used as refractive index sensors, together with the challenge of nanofabrication on fiber end facets, has held back the development of such devices for label-free biosensing. Meanwhile, their plane-wave-coupled, multimode-fiber and fiber-sidewall SPR counterparts are extensively employed for label-free biosensing. In this paper, we report the design, fabrication and characterization of a plasmonic crystal cavity on an SMF end facet, which shows high-performance label-free sensing capability that comes from a steep cavity resonance near the plasmonic band edge. The experimental figure of merit is 68 RIU^-1, an improvement of more than twenty times over previous reports. The refractive index detection limit is 3.5 × 10^-6 RIU at 1 s integration time. We also describe a novel glue-and-strip process to transfer gold nanostructures onto fiber end facets.

preprint, 2015, arXiv

Reproducible Ultrahigh Electromagnetic SERS Enhancement in Nanosphere-Plane Junctions

Great hopes have been placed on surface-enhanced Raman scattering (SERS) in nanoscale hotspots for identifying minimal chemical traces and for in-situ investigation of single-molecule structures and dynamics. However, previous work has delivered either irreproducible enhancement factors (EFs) from random aggregates or moderate EFs with better reproducibility. Consequently, systematic study of SERS at the single- and few-molecule level is still very limited, and the promised applications are far from being realized. Here we report EFs as high as those of the most intense hotspots in previous work, yet achieved in a reproducible and well-controlled manner: electromagnetic EFs (EMEFs) of 10^9 to 10^10 with an error down to 10^(±0.08), from gold nanospheres on atomically flat gold planes under radially polarized (RP) laser excitation. In addition, our experiment reveals an unexpected nonlinearity of the EF at laser powers as low as hundreds of nanowatts.
