Researcher profile

Chong Li

Chong Li contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
16works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

16 published item(s)

preprint2026arXiv

TokAlign++: Advancing Vocabulary Adaptation via Better Token Alignment

Tokenization is a foundational step in the text process of Large Language Models (LLMs). Texts must be first tokenized into token IDs, which are then input to LLMs. Inefficient tokenization results in long token-ID sequences and will slow down the training and inference of LLMs. The fine-grained knowledge transfer between LLMs, like token-level distillation, is also impeded by the mismatch in vocabulary. To bridge this gap, we introduce a method named TokAlign++ to improve vocabulary adaptation performance by learning better token alignment lexicon. The source and target vocabularies are taken as two different languages, and the bilingual token alignment lexicon is learned from monolingual token representations. Model parameters are rearranged following this bilingual lexicon for new vocabulary, and progressively fine-tuned for adaptation. Experimental results on 15 languages show that our method boosts the multilingual text compression rates and preserves most of the multilingual ability of vanilla models. It costs as few as 1k steps to restore the performance of the vanilla model. After unifying vocabularies between vanilla models, token-level distillation remarkably improves the base model with only 235M tokens.

preprint2025arXiv

Structured and sparse partial least squares coherence for multivariate cortico-muscular analysis

Multivariate cortico-muscular analysis has recently emerged as a promising approach for evaluating the corticospinal neural pathway. However, current multivariate approaches encounter challenges such as high dimensionality and limited sample sizes, thus restricting their further applications. In this paper, we propose a structured and sparse partial least squares coherence algorithm (ssPLSC) to extract shared latent space representations related to cortico-muscular interactions. Our approach leverages an embedded optimization framework by integrating a partial least squares (PLS)-based objective function, a sparsity constraint and a connectivity-based structured constraint, addressing the generalizability, interpretability and spatial structure. To solve the optimization problem, we develop an efficient alternating iterative algorithm within a unified framework and prove its convergence experimentally. Extensive experimental results from one synthetic and several real-world datasets have demonstrated that ssPLSC can achieve competitive or better performance over some representative multivariate cortico-muscular fusion methods, particularly in scenarios characterized by limited sample sizes and high noise levels. This study provides a novel multivariate fusion method for cortico-muscular analysis, offering a transformative tool for the evaluation of corticospinal pathway integrity in neurological disorders.

preprint2022arXiv

Engineering the microwave to infrared noise photon flux for superconducting quantum systems

Electromagnetic filtering is essential for the coherent control, operation and readout of superconducting quantum circuits at milliKelvin temperatures. The suppression of spurious modes around transition frequencies of a few GHz is well understood and mainly achieved by on-chip and package considerations. Noise photons of higher frequencies -- beyond the pair-breaking energies -- cause decoherence and require spectral engineering before reaching the packaged quantum chip. The external wires that pass into the refrigerator and go down to the quantum circuit provide a direct path for these photons. This article contains quantitative analysis and experimental data for the noise photon flux through coaxial, filtered wiring. The attenuation of the coaxial cable at room temperature and the noise photon flux estimates for typical wiring configurations are provided. Compact cryogenic microwave low-pass filters with CR-110 and Esorb-230 absorptive dielectric fillings are presented along with experimental data at room and cryogenic temperatures up to 70 GHz. Filter cut-off frequencies between 1 to 10 GHz are set by the filter length, and the roll-off is material dependent. The relative dielectric permittivity and magnetic permeability for the Esorb-230 material in the pair-breaking frequency range of 75 to 110 GHz are measured, and the filter properties in this frequency range are calculated. The estimated dramatic suppression of the noise photon flux due to the filter proves its usefulness for experiments with superconducting quantum systems.

preprint2022arXiv

Frame-wise Action Representations for Long Videos via Sequence Contrastive Learning

Prior works on action representation learning mainly focus on designing various architectures to extract the global representations for short video clips. In contrast, many practical applications such as video alignment have strong demand for learning dense representations for long videos. In this paper, we introduce a novel contrastive action representation learning (CARL) framework to learn frame-wise action representations, especially for long videos, in a self-supervised manner. Concretely, we introduce a simple yet efficient video encoder that considers spatio-temporal context to extract frame-wise representations. Inspired by the recent progress of self-supervised learning, we present a novel sequence contrastive loss (SCL) applied on two correlated views obtained through a series of spatio-temporal data augmentations. SCL optimizes the embedding space by minimizing the KL-divergence between the sequence similarity of two augmented views and a prior Gaussian distribution of timestamp distance. Experiments on FineGym, PennAction and Pouring datasets show that our method outperforms previous state-of-the-art by a large margin for downstream fine-grained action classification. Surprisingly, although without training on paired videos, our approach also shows outstanding performance on video alignment and fine-grained frame retrieval tasks. Code and models are available at https://github.com/minghchen/CARL_code.

preprint2022arXiv

Gas Column Density Distribution of Molecular Clouds in the Third Quadrant of the Milky Way

We have obtained column density maps for an unbiased sample of 120 molecular clouds in the third quadrant of the Milky Way mid-plane (b$\le |5|^{\circ}$) within the galactic longitude range from 195$^{\circ}$ to 225$^{\circ}$, using the high sensitivity $^{12}$CO and $^{13}$CO ($J=1-0$) data from the Milky Way Imaging Scroll Painting (MWISP) project. The probability density functions of the molecular hydrogen column density of the clouds, N-PDFs, are fitted with both log-normal (LN) function and log-normal plus power-law (LN+PL) function. The molecular clouds are classified into three categories according to their shapes of N-PDFs, i.e., LN, LN+PL, and UN (unclear), respectively. About 72\% of the molecular clouds fall into the LN category, while 18\% and 10\% into the LN+PL and UN categories, respectively. A power-law scaling relation, $σ_s\propto N_{H_2}^{0.44}$, exists between the width of the N-PDF, $σ_s$, and the average column density, $N_{H_2}$, of the molecular clouds. However, $σ_s$ shows no correlation with the mass of the clouds. A correlation is found between the dispersion of normalized column density, $σ_{N/\rm <N>}$, and the sonic Mach number, $\mathcal{M}$, of molecular clouds. Overall, as predicted by numerical simulations, the N-PDFs of the molecular clouds with active star formation activity tend to have N-PDFs with power-law high-density tails.

preprint2022arXiv

Molecular Clouds Associated with {H \small{II}} regions and Candidates within l = 106.65$^\circ$ to 109.50$^\circ$ and b = ${-}$1.85$^\circ$ to 0.95$^\circ$

We present a large-scale simultaneous survey of the CO isotopologues ($\rm {}^{12}{CO}$, $\rm{}^{13}{CO}$, and $\rm{C}{}^{18}{O}$) J = 1 ${-}$ 0 line emission toward the Galactic plane region of l = 106.65$^\circ$ to 109.50$^\circ$ and b = ${-}$1.85$^\circ$ to 0.95$^\circ$ using the Purple Mountain Observatory 13.7 m millimeter-wavelength telescope. Except for the molecular gas in the solar neighborhood, the emission from the molecular gas in this region is concentrated in the velocity range of [${-}$60, ${-}$35] $\rm km~s^{-1}$. The gas in the region can be divided into four clouds, with mass in the range of $\sim$10$^{3}$ to 10$^{4}$\,${M_{\sun}}$. We have identified 25 filaments based on the $\rm {}^{13}{CO}$ data. The median excitation temperature, length, line mass, line width, and virial parameter of the filaments are 10.89 K, 8.49 pc, 146.11 ${M}_{\odot}~ \rm pc^{-1}$, 1.01 $\rm km~s^{-1}$, and 3.14, respectively. Among these filaments, eight have virial parameters of less than 2, suggesting that they are gravitationally bound and can lead to star formation. Nineteen {H \small {II}} regions or candidates have previously been found in the region and we investigate the relationships between these {H \small {II}} regions/candidates and surrounding molecular clouds in detail. Using morphology similarity and radial velocity consistency between {H \small {II}} regions/candidates and molecular clouds as evidence for association, and raised temperature and velocity broadening as signatures of interaction, we propose that 12 {H \small {II}} regions/candidates are associated with their surrounding molecular clouds. In the case of the {H \small {II}} region of S142, the energy of the {H \small {II}} region is sufficient to maintain the turbulence in the surrounding molecular gas.

preprint2022arXiv

Valley Piezoelectric Mechanism for Interpreting and Optimizing Piezoelectricity in Quantum Materials via Anomalous Hall Effect

Quantum materials have exhibited attractive electro-mechanical responses, but their piezoelectric coefficients are far from satisfactory due to the lack of fundamental mechanisms to benefit from the quantum effects. We discovered the valley piezoelectric mechanism that is absent in traditional piezoelectric theory yet promising to overcome this challenge. A theoretical model was developed to elucidate the valley piezoelectricity as the Valley Hall effect driven by pseudoelectric field, which can be significant in quantum systems with broken time reversal symmetry. Consistent tight-binding and density-functional-theory (DFT) calculations validate the model and unveil the crucial dependence of valley piezoelectricity on valley splitting, hybridization energy, bandgap, and Poisson ratio. Doping, passivation, and external stress are proposed as rational strategies to optimize piezoelectricity, with a more than 130% increase of piezoelectricity demonstrated by DFT simulations. The general valley piezoelectric model bridges the gap between electro-mechanical response and quantum effects, which opens an opportunity to achieve outstanding piezoelectricity in quantum materials via optimizing spin-valley and spin-orbit couplings.

preprint2022arXiv

Writable spin wave nanochannels in an artificial-spin-ice-mediated ferromagnetic thin film

Magnonics, which employs spin-waves to transmit and process information, is a promising venue for low-power data processing. One of the major challenges is the local control of the spin-wave propagation path. Here, we introduce the concept of writable magnonics by taking advantage of the highly flexible reconfigurability and rewritability of artificial spin ice systems. Using micromagnetic simulations, we show that globally switchable spin-wave propagation and the locally writable spin-wave nanochannels can be realized in a ferromagnetic thin film underlying an artificial pinwheel spin ice. The rewritable magnonics enabled by reconfigurable spin wave nanochannels provides a unique setting to design programmable magnonic circuits and logic devices for ultra-low power applications.

preprint2021arXiv

Impact of Distributed Rate Limiting on Load Distribution in a Latency-sensitive Messaging Service

The cloud&#39;s flexibility and promise of seamless auto-scaling notwithstanding, its ability to meet service level objectives (SLOs) typically calls for some form of control in resource usage. This seemingly traditional problem gives rise to new challenges in a cloud setting, and in particular a subtle yet significant trade-off involving load-distribution decisions (the distribution of workload across available cloud resources to optimize performance), and rate limiting (the capping of individual workloads to prevent global over-commitment). This paper investigates that trade-off through the design and implementation of a real-time messaging system motivated by Internet-of-Things (IoT) applications, and demonstrates a solution capable of realizing an effective compromise. The paper&#39;s contributions are in both explicating the source of this trade-off, and in demonstrating a possible solution.

preprint2020arXiv

Automatic Crack Detection on Road Pavements Using Encoder Decoder Architecture

Inspired by the development of deep learning in computer vision and object detection, the proposed algorithm considers an encoder-decoder architecture with hierarchical feature learning and dilated convolution, named U-Hierarchical Dilated Network (U-HDN), to perform crack detection in an end-to-end method. Crack characteristics with multiple context information are automatically able to learn and perform end-to-end crack detection. Then, a multi-dilation module embedded in an encoder-decoder architecture is proposed. The crack features of multiple context sizes can be integrated into the multi-dilation module by dilation convolution with different dilatation rates, which can obtain much more cracks information. Finally, the hierarchical feature learning module is designed to obtain a multi-scale features from the high to low-level convolutional layers, which are integrated to predict pixel-wise crack detection. Some experiments on public crack databases using 118 images were performed and the results were compared with those obtained with other methods on the same images. The results show that the proposed U-HDN method achieves high performance because it can extract and fuse different context sizes and different levels of feature maps than other algorithms.

preprint2020arXiv

Ensemble of Deep Convolutional Neural Networks for Automatic Pavement Crack Detection and Measurement

Automated pavement crack detection and measurement are important road issues. Agencies have to guarantee the improvement of road safety. Conventional crack detection and measurement algorithms can be extremely time-consuming and low efficiency. Therefore, recently, innovative algorithms have received increased attention from researchers. In this paper, we propose an ensemble of convolutional neural networks (without a pooling layer) based on probability fusion for automated pavement crack detection and measurement. Specifically, an ensemble of convolutional neural networks was employed to identify the structure of small cracks with raw images. Secondly, outputs of the individual convolutional neural network model for the ensemble were averaged to produce the final crack probability value of each pixel, which can obtain a predicted probability map. Finally, the predicted morphological features of the cracks were measured by using the skeleton extraction algorithm. To validate the proposed method, some experiments were performed on two public crack databases (CFD and AigleRN) and the results of the different state-of-the-art methods were compared. The experimental results show that the proposed method outperforms the other methods. For crack measurement, the crack length and width can be measure based on different crack types (complex, common, thin, and intersecting cracks.). The results show that the proposed algorithm can be effectively applied for crack measurement.

preprint2020arXiv

Simultaneous blockade of a photon phonon, and magnon induced by a two-level atom

The hybrid microwave optomechanical-magnetic system has recently emerged as a promising candidate for coherent information processing because of the ultrastrong microwave photon-magnon coupling and the longlife of the magnon and phonon. As a quantum information processing device, the realization of a single excitation holds special meaning for the hybrid system. In this paper, we introduce a single two-level atom into the optomechanical-magnetic system and show that an unconventional blockade due to destructive interference cannot offer a blockade of both the photon and magnon. Meanwhile, under the condition of single excitation resonance, the blockade of the photon, phonon, and magnon can be achieved simultaneously even in a weak optomechanical region, but the phonon blockade still requires the cryogenic temperature condition.

preprint2020arXiv

The Molecular Clouds associated with the H II Regions/Candidates between $l=207.7^\circ$ and $l=211.7^\circ$

Using the PMO-13.7 m millimeter telescope at Delingha in China, we have conducted a large-scale simultaneous survey of $^{12}$CO, $^{13}$CO, and C$^{18}$O $J=1-0$ emission toward the sky region centered at $l$=$209.7^\circ$, $b$=$-$2.25$^\circ$ with a coverage of $4.0^\circ \times 4.5^\circ$. The majority of the emission in the region comes from the clouds with velocities lying in the range from $-$3 km s$^{-1}$ to 55 km s$^{-1}$, at kinematic distances from 0.5 kpc to 7.0 kpc. The molecular clouds in the region are concentrated into three velocity ranges. The molecular clouds associated with the ten H II regions/candidates are identified and their physical properties are presented. Massive stars are found within Sh2-280, Sh2-282, Sh2-283, and BFS54, and we suggest them to be the candidate excitation sources of the H II regions. The distributions of excitation temperature and line width with the projected distance from the center of H II region/candidate suggest that the majority of the ten H II regions/candidates and their associated molecular gas are three-dimensional structures, rather than two-dimensional structures.

preprint2019arXiv

Assessing the Performance of Molecular Gas Clump Identification Algorithms

The detection of clumps(cores) in molecular clouds is an important issue in sub-millimetre astronomy. However, the completeness of the identification and the accuracy of the returned parameters of the automated clump identification algorithms are still not clear by now. In this work, we test the performance and bias of the GaussClumps, ClumpFind, Fellwalker, Reinhold, and Dendrograms algorithms in identifying simulated clumps. By designing the simulated clumps with various sizes, peak brightness, and crowdedness, we investigate the characteristics of the algorithms and their performance. In the aspect of detection completeness, Fellwalker, Dendrograms, and Gaussclumps are the first, second, and third best algorithms, respectively. The numbers of correct identifications of the six algorithms gradually increase as the size and SNR of the simulated clumps increase and they decrease as the crowdedness increases. In the aspect of the accuracy of retrieved parameters, Fellwalker and Dendrograms exhibit better performance than the other algorithms. The average deviations in clump parameters for all algorithms gradually increase as the size and SNR of clumps increase. Most of the algorithms except Fellwalker exhibit significant deviation in extracting the total flux of clumps. Taken altogether, Fellwalker, Gaussclumps, and Dendrograms exhibit the best performance in detection completeness and extracting parameters. The deviation in virial parameter for the six algorithms is relatively low. When applying the six algorithms to the clump identification for the Rosette molecular cloud, ClumpFind1994, ClumpFind2006, Gaussclumps, Fellwalker, and Reinhold exhibit performance that is consistent with the results from the simulated test.

preprint2019arXiv

Ground-state cooling of an magnomechanical resonator induced by magnetic damping

Quantum manipulation of mechanical resonators has been widely applied in fundamental physics and quantum information processing. Among them, cooling the mechanical system to its quantum ground state is regarded as a key step. In this work, we propose a scheme which one can realize ground-state cooling of resonator in a cavity magnomechanical system. The system consists of a microwave cavity and a small ferromagnetic sphere, in which phonon-magnon coupling and cavity photon-magnon coupling can be achieved via magnetostrictive interaction and magnetic dipole interaction, respectively. After adiabatically eliminating the cavity mode, an effective Hamiltonian which consists of magnon and mechanical modes is obtained. Within experimentally feasible parameters, we demonstrate that the ground-state cooling of the magnomechanical resonator can be achieved by extra magnetic damping. Unlike optomechanical cooling, magnomechanical interaction is utilized to realize the cooling of resonators. We further illustrate the ground-state cooling can be effectively controlled by the external magnetic field.

preprint2019arXiv

The Column Density Structure of Orion A Depicted by N-PDF

We have conducted a large-field simultaneous survey of $^{12}$CO, $^{13}$CO, and C$^{18}$O $J=1-0$ emission toward the Orion A giant molecular cloud (GMC) with a sky coverage of $\sim$ 4.4 deg$^2$ using the PMO-13.7 m millimeter-wavelength telescope. We use the probability distribution function of the column density (N-PDF) to investigate the distribution of molecular hydrogen in the Orion A GMC. The H$_2$ column density, derived from the $^{13}$CO emission, of the GMC is dominated by log-normal distribution in the range from $\sim$4$\times10^{21}$ to $\sim$1.5$\times10^{23}$ cm$^{-2}$ with excesses both at the low-density and high-density ends. The excess of the low-density end is possibly caused by an extended and low-temperature ($\sim$10 K) component with velocities in the range of 5$-$8 km s$^{-1}$. Compared with the northern sub-regions, the southern sub-regions of the Orion A GMC contain less gas with column density in $N_{H_2} > 1.25\times 10^{22}\ \rm{cm}^{-2}$. The dispersions of the N-PDFs of the sub-regions are found to correlate with the evolutionary stages of the clouds across the Orion A GMC. The structure hierarchy of Orion A GMC is explored with the DENDROGRAM algorithm, and it is found that the GMC is composed of two branches. All structures except one in the tree have virial parameters less than 2, indicating self-gravity is important on the spatial scales from $\sim$0.3 to $\sim$4 pc. Although power-laws and departures from log-normal distributions are found at the high-density end of N-PDFs of active star-forming regions, the N-PDFs of structures in the Orion A GMC are predominantly log-normal on scales from R$\sim$0.4 to 4 pc.