Researcher profile

Jiaming Wang

Jiaming Wang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

Interplanetary magnetic correlation and low-frequency spectrum over many solar rotations

Fluctuations and structure across a wide range of spatial and temporal scales are frequently studied in the solar wind. The properties of the low-frequency fluctuations are of relevance to turbulent energy injection into the plasma and the transport of high-energy cosmic rays. Correlation analysis of decade-long intervals of interplanetary data permits study of fluctuations at time scales much longer than suitably defined correlation times, and therefore at frequencies well below those associated with the Kolmogorov inertial range of {\it in situ} turbulence. At the frequencies of interest, we study the familiar occurrence of the $1/f$ spectral signature. We also study point spectral features due to solar rotation and their relation with the $1/f$ signal. We report novel properties at timescales ranging from minutes up to years, using data selected by wind speed, phase of solar cycle, and cartesian components of the magnetic field. A surprising finding is that the power in solar rotation harmonics is consistent with an extension of the $1/f$ spectrum, down to frequencies as low as around $\unit[5 \times 10^{-7}]{Hz}$. The presence of a broadband $1/f$ spectrum across different wind types supports the interpretation that $1/f$ signals may be related to or even originate from the solar dynamo.

preprint2026arXiv

SOP-Maze: Evaluating Large Language Models on Complicated Business Standard Operating Procedures

As large language models (LLMs) are widely deployed as domain-specific agents, many benchmarks have been proposed to evaluate their ability to follow instructions and make decisions in real-world scenarios. However, business scenarios often involve complex standard operating procedures (SOPs), and the evaluation of LLM capabilities in such contexts has not been fully explored. To bridge this gap, we propose SOP-Maze, a benchmark constructed from real-world business data and adapted into a collection of 397 instances and 3422 subtasks from 23 complex SOP scenarios. We further categorize SOP tasks into two broad classes: Lateral Root System (LRS), representing wide-option tasks that demand precise selection; and Heart Root System (HRS), which emphasizes deep logical reasoning with complex branches. Extensive experiments reveal that nearly all state-of-the-art models struggle with SOP-Maze. We conduct a comprehensive analysis and identify three key error categories: (i) route blindness: difficulty following procedures; (ii) conversational fragility: inability to handle real dialogue nuances; and (iii) calculation errors: mistakes in time or arithmetic reasoning under complex contexts. The systematic study explores LLM performance across SOP tasks that challenge both breadth and depth, offering new insights for improving model capabilities. We have open-sourced our work on: https://github.com/meituan-longcat/SOP-Maze.

preprint2026arXiv

TextAlign: Preference Alignment for Text Rendering with Hierarchical Rewards

Faithful text rendering remains a persistent weakness of large text-to-image generative models, as it requires both semantic instruction following and fine-grained glyph-level structure. Prior methods often improve this ability through architecture-specific modules or encoder modifications, which complicate deployment across foundation models. We study text rendering as a post-training preference-alignment problem and propose TextAlign, a non-invasive framework that keeps the generator architecture unchanged. The key component is a hierarchical vision-language model (VLM)-based reward that decomposes rendering errors into global, word, and glyph levels, then converts binary defect judgments into a scalar preference signal. The resulting signal supports both Group Relative Policy Optimization (GRPO) and Direct Preference Optimization (DPO). Experiments on FLUX.1-dev and Z-Image-Turbo show consistent gains in OCR-based text accuracy without degrading general generation quality. Compared with strong foundation and text-rendering baselines, including SD3.5, Qwen-Image, AnyText, and TextDiffuser, these results indicate that reward design offers a scalable alternative to model redesign for improving text rendering.

preprint2022arXiv

Emergence of spin singlets with inhomogeneous gaps in the kagome Heisenberg antiferromagnets Zn-barlowite and herbertsmithite

The kagome Heisenberg antiferromagnet formed by frustrated spins arranged in a lattice of corner-sharing triangles is a prime candidate for hosting a quantum spin liquid (QSL) ground state consisting of entangled spin singlets. But the existence of various competing states makes a convincing theoretical prediction of the QSL ground state difficult, calling for experimental clues from model materials. The kagome lattice materials Zn-barlowite ZnCu$_{3}$(OD)$_{6}$FBr and herbertsmithite ZnCu$_{3}$(OD)$_{6}$Cl$_2$ do not exhibit long range order, and they are considered the best realizations of the kagome Heisenberg antiferromagnet known to date. Here we use $^{63}$Cu nuclear quadrupole resonance combined with the inverse Laplace transform (ILT) to probe locally the inhomogeneity of delicate quantum ground states affected by disorder. We present direct evidence for the gradual emergence of spin singlets with spatially varying excitation gaps, but even at temperatures far below the super-exchange energy scale their fraction is limited to approximately 60\% of the total spins. Theoretical models need to incorporate the role of disorder to account for the observed inhomogeneously gapped behaviour.

preprint2022arXiv

Emergence of the spin polarized domains in the kagome lattice Heisenberg antiferromagnet Zn-barlowite (Zn$_{0.95}$Cu$_{0.05}$)Cu$_{3}$(OD)$_{6}$FBr

Kagome lattice Heisenberg antiferromagnets are known to be highly sensitive to perturbations caused by structural disorder. NMR is a local probe ideally suited for investigating such disorder-induced effects, but in practice large distributions in the conventional one-dimensional NMR data make it difficult to distinguish the intrinsic behavior expected for pristine kagome quantum spin liquids from disorder induced effects. Here we report the development of a two-dimensional NMR data acquisition scheme applied to Zn-barlowite (Zn$_{0.95}$Cu$_{0.05}$)Cu$_{3}$(OD)$_{6}$FBr kagome lattice, and successfully correlate the distribution of the low energy spin excitations with that of the local spin susceptibility. We present evidence for the gradual growth of domains with a local spin polarization induced by 5\% Cu$^{2+}$ defect spins occupying the interlayer non-magnetic Zn$^{2+}$ sites. These spin polarized domains account for $\sim60$\% of the sample volume at 2~K, where gapless excitations induced by interlayer defects dominate the low energy sector of spin excitations within the kagome planes.

preprint2022arXiv

Freezing of the Lattice in the Kagome Lattice Heisenberg Antiferromagnet Zn-barlowite ZnCu$_3$(OD)$_6$FBr

We use $^{79}$Br nuclear quadrupole resonance (NQR) to demonstrate that ultra slow lattice dynamics set in below the temperature scale set by the Cu-Cu super-exchange interaction $J$~($\simeq160$~K) in the kagome lattice Heisenberg antiferromagnet Zn-barlowite. The lattice completely freezes below 50~K, and $^{79}$Br NQR lineshapes become twice broader due to increased lattice distortions. Moreover, the frozen lattice exhibits an oscillatory component in the transverse spin echo decay, a typical signature of pairing of nuclear spins by indirect nuclear spin-spin interaction. This indicates that some Br sites form structural dimers via a pair of kagome Cu sites prior to the gradual emergence of spin singlets below $\sim30$~K. Our findings underscore the significant roles played by subtle structural distortions in determining the nature of the disordered magnetic ground state of the kagome lattice.

preprint2022arXiv

GLSD: The Global Large-Scale Ship Database and Baseline Evaluations

In this paper, we introduce a challenging global large-scale ship database (called GLSD), designed specifically for ship detection tasks. The designed GLSD database includes a total of 212,357 annotated instances from 152,576 images. Based on the collected images, we propose 13 ship categories that widely exist in international routes. These categories include Sailing boat, Fishing boat, Passenger ship, Warship, General cargo ship, Container ship, Bulk cargo carrier, Barge, Ore carrier, Speed boat, Canoe, Oil carrier, and Tug. The motivations of developing GLSD include the following: 1) providing a refine and extensive ship detection database that benefits the object detection community, 2) establishing a database with exhaustive labels (bounding boxes and ship class categories) in a uniform classification scheme, and 3) providing a large-scale ship database with geographic information (covering more than 3000 ports and 33 routes) that benefits multi-modal analysis. In addition, we discuss the evaluation protocols corresponding to image characteristics in GLSD and analyze the performance of selected state-of-the-art object detection algorithms on GSLD, aiming to establish baselines for future studies. More information regarding the designed GLSD can be found at https://github.com/jiaming-wang/GLSD.

preprint2022arXiv

Spin excitations of a proximate Kitaev quantum spin liquid realized in Cu$_2$IrO$_3$

Magnetic moments arranged at the corners of a honeycomb lattice are predicted to form a novel state of matter, Kitaev quantum spin liquid, under the influence of frustration effects between bond-dependent Ising interactions. Some layered honeycomb iridates and related materials, such as Na$_{2}$IrO$_{3}$ and $α$-RuCl$_{3}$, are proximate to Kitaev quantum spin liquid, but bosonic spin-wave excitations associated with undesirable antiferromagnetic long-range order mask the inherent properties of Kitaev Hamiltonian. Here, we use $^{63}$Cu nuclear quadrupole resonance to uncover the low energy spin excitations in the nearly ideal honeycomb lattice of effective spin $S = 1/2$ at the Ir$^{4+}$ sites in Cu$_{2}$IrO$_{3}$. We demonstrate that, unlike Na$_{2}$IrO$_{3}$, Ir spin fluctuations exhibit no evidence for critical slowing down toward magnetic long range order in zero external magnetic field. Moreover, the low energy spin excitation spectrum is dominated by a mode that has a large excitation gap comparable to the Ising interactions, a signature expected for Majorana fermions of Kitaev quantum spin liquid.

preprint2021arXiv

SSCAN: A Spatial-spectral Cross Attention Network for Hyperspectral Image Denoising

Hyperspectral images (HSIs) have been widely used in a variety of applications thanks to the rich spectral information they are able to provide. Among all HSI processing tasks, HSI denoising is a crucial step. Recently, deep learning-based image denoising methods have made great progress and achieved great performance. However, existing methods tend to ignore the correlations between adjacent spectral bands, leading to problems such as spectral distortion and blurred edges in denoised results. In this study, we propose a novel HSI denoising network, termed SSCAN, that combines group convolutions and attention modules. Specifically, we use a group convolution with a spatial attention module to facilitate feature extraction by directing models' attention to band-wise important features. We propose a spectral-spatial attention block (SSAB) to exploit the spatial and spectral information in hyperspectral images in an effective manner. In addition, we adopt residual learning operations with skip connections to ensure training stability. The experimental results indicate that the proposed SSCAN outperforms several state-of-the-art HSI denoising algorithms.

preprint2020arXiv

Stroke Constrained Attention Network for Online Handwritten Mathematical Expression Recognition

In this paper, we propose a novel stroke constrained attention network (SCAN) which treats stroke as the basic unit for encoder-decoder based online handwritten mathematical expression recognition (HMER). Unlike previous methods which use trace points or image pixels as basic units, SCAN makes full use of stroke-level information for better alignment and representation. The proposed SCAN can be adopted in both single-modal (online or offline) and multi-modal HMER. For single-modal HMER, SCAN first employs a CNN-GRU encoder to extract point-level features from input traces in online mode and employs a CNN encoder to extract pixel-level features from input images in offline mode, then use stroke constrained information to convert them into online and offline stroke-level features. Using stroke-level features can explicitly group points or pixels belonging to the same stroke, therefore reduces the difficulty of symbol segmentation and recognition via the decoder with attention mechanism. For multi-modal HMER, other than fusing multi-modal information in decoder, SCAN can also fuse multi-modal information in encoder by utilizing the stroke based alignments between online and offline modalities. The encoder fusion is a better way for combining multi-modal information as it implements the information interaction one step before the decoder fusion so that the advantages of multiple modalities can be exploited earlier and more adequately when training the encoder-decoder model. Evaluated on a benchmark published by CROHME competition, the proposed SCAN achieves the state-of-the-art performance.