Researcher profile

Song Cheng

Song Cheng contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 17 - UnverifiedVerification L1Unclaimed author
4works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2026arXiv

Dynamical Correlation of the Post-quench Non-thermal Equilibrium State

After a quantum quench, the integrable system is expected to relax to a non-thermal equilibrium state (NTES) whose local properties are believed to be governed by a generalized Gibbs ensemble (GGE). Combining quench action and the form factor approach, we compute the field-field correlation in the NTES produced by an interaction quench of the Lieb-Liniger model. The spectral distribution is shown to be qualitatively different from that of a thermal equilibrium state (TES): a new dispersion branch appears whose microscopic mechanism can be traced to the algebraic decaying tail for the root density distribution function, and indicates the existence of a broader family of NTES featuring similar spectral property.

preprint2024arXiv

Two-dimensional polarized superfluids under the prism of the fermion sign problem

Understanding if attractive fermions in an unbalanced occupation of its flavors can give rise to a superfluid state in two dimensions (2D), realizing the Fulde-Ferrel-Larkin-Ovchinnikov (FFLO) state, presents a long-standing question. A limitation on its solution by numerics is posed by the sign problem, which constrains the applicability of quantum Monte Carlo techniques at sufficiently low temperatures and large lattice sizes, where a potential signature of polarized superfluidity would be unambiguous. By using a recently explored argument that the sign problem may be used instead to infer quantum critical behavior, we explore the regime where partial polarization occurs in the phase diagram, further showing that the average sign $\langle {\cal S}\rangle$ of quantum Monte Carlo weights tracks the criticality between balanced (or fully polarized) and polarized phases. Using the attractive Hubbard model with an unbalanced population, our investigation expands the scope of problems in which $\langle {\cal S}\rangle$ can be used for monitoring critical behavior, providing compelling albeit indirect evidence for the robustness of an FFLO phase in 2D.

preprint2020arXiv

Compressing deep neural networks by matrix product operators

A deep neural network is a parametrization of a multilayer mapping of signals in terms of many alternatively arranged linear and nonlinear transformations. The linear transformations, which are generally used in the fully connected as well as convolutional layers, contain most of the variational parameters that are trained and stored. Compressing a deep neural network to reduce its number of variational parameters but not its prediction power is an important but challenging problem toward the establishment of an optimized scheme in training efficiently these parameters and in lowering the risk of overfitting. Here we show that this problem can be effectively solved by representing linear transformations with matrix product operators (MPOs), which is a tensor network originally proposed in physics to characterize the short-range entanglement in one-dimensional quantum states. We have tested this approach in five typical neural networks, including FC2, LeNet-5, VGG, ResNet, and DenseNet on two widely used data sets, namely, MNIST and CIFAR-10, and found that this MPO representation indeed sets up a faithful and efficient mapping between input and output signals, which can keep or even improve the prediction accuracy with a dramatically reduced number of parameters. Our method greatly simplifies the representations in deep learning, and opens a possible route toward establishing a framework of modern neural networks which might be simpler and cheaper, but more efficient.

preprint2020arXiv

Domain Adaption for Knowledge Tracing

With the rapid development of online education system, knowledge tracing which aims at predicting students' knowledge state is becoming a critical and fundamental task in personalized education. Traditionally, existing methods are domain-specified. However, there are a larger number of domains (e.g., subjects, schools) in the real world and the lacking of data in some domains, how to utilize the knowledge and information in other domains to help train a knowledge tracing model for target domains is increasingly important. We refer to this problem as domain adaptation for knowledge tracing (DAKT) which contains two aspects: (1) how to achieve great knowledge tracing performance in each domain. (2) how to transfer good performed knowledge tracing model between domains. To this end, in this paper, we propose a novel adaptable framework, namely adaptable knowledge tracing (AKT) to address the DAKT problem. Specifically, for the first aspect, we incorporate the educational characteristics (e.g., slip, guess, question texts) based on the deep knowledge tracing (DKT) to obtain a good performed knowledge tracing model. For the second aspect, we propose and adopt three domain adaptation processes. First, we pre-train an auto-encoder to select useful source instances for target model training. Second, we minimize the domain-specific knowledge state distribution discrepancy under maximum mean discrepancy (MMD) measurement to achieve domain adaptation. Third, we adopt fine-tuning to deal with the problem that the output dimension of source and target domain are different to make the model suitable for target domains. Extensive experimental results on two private datasets and seven public datasets clearly prove the effectiveness of AKT for great knowledge tracing performance and its superior transferable ability.