Source author record

Rui Lin

Rui Lin appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision cond-mat.quant-gas Machine Learning Artificial Intelligence cond-mat.mtrl-sci cond-mat.str-el eess.AS quant-ph Sound

Catalog footprint

What is connected

13works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

VULCAN: Tool-Augmented Multi Agents for Iterative 3D Object Arrangement

Despite the remarkable progress of Multimodal Large Language Models (MLLMs) in 2D vision-language tasks, their application to complex 3D scene manipulation remains underexplored. In this paper, we bridge this critical gap by tackling three key challenges in 3D object arrangement task using MLLMs. First, to address the weak visual grounding of MLLMs, which struggle to link programmatic edits with precise 3D outcomes, we introduce an MCP-based API. This shifts the interaction from brittle raw code manipulation to more robust, function-level updates. Second, we augment the MLLM's 3D scene understanding with a suite of specialized visual tools to analyze scene state, gather spatial information, and validate action outcomes. This perceptual feedback loop is critical for closing the gap between language-based updates and precise 3D-aware manipulation. Third, to manage the iterative, error-prone updates, we propose a collaborative multi-agent framework with designated roles for planning, execution, and verification. This decomposition allows the system to robustly handle multi-step instructions and recover from intermediate errors. We demonstrate the effectiveness of our approach on a diverse set of 25 complex object arrangement tasks, where it significantly outperforms existing baselines. Website: vulcan-3d.github.io

preprint2025arXiv

Hear: Hierarchically Enhanced Aesthetic Representations For Multidimensional Music Evaluation

Evaluating song aesthetics is challenging due to the multidimensional nature of musical perception and the scarcity of labeled data. We propose HEAR, a robust music aesthetic evaluation framework that combines: (1) a multi-source multi-scale representations module to obtain complementary segment- and track-level features, (2) a hierarchical augmentation strategy to mitigate overfitting, and (3) a hybrid training objective that integrates regression and ranking losses for accurate scoring and reliable top-tier song identification. Experiments demonstrate that HEAR consistently outperforms the baseline across all metrics on both tracks of the ICASSP 2026 SongEval benchmark. The code and trained model weights are available at https://github.com/Eps-Acoustic-Revolution-Lab/EAR_HEAR.

preprint2022arXiv

Coarse to Fine: Image Restoration Boosted by Multi-Scale Low-Rank Tensor Completion

Existing low-rank tensor completion (LRTC) approaches aim at restoring a partially observed tensor by imposing a global low-rank constraint on the underlying completed tensor. However, such a global rank assumption suffers the trade-off between restoring the originally details-lacking parts and neglecting the potentially complex objects, making the completion performance unsatisfactory on both sides. To address this problem, we propose a novel and practical strategy for image restoration that restores the partially observed tensor in a coarse-to-fine (C2F) manner, which gets rid of such trade-off by searching proper local ranks for both low- and high-rank parts. Extensive experiments are conducted to demonstrate the superiority of the proposed C2F scheme. The codes are available at: https://github.com/RuiLin0212/C2FLRTC.

preprint2022arXiv

Deformable Butterfly: A Highly Structured and Sparse Linear Transform

We introduce a new kind of linear transform named Deformable Butterfly (DeBut) that generalizes the conventional butterfly matrices and can be adapted to various input-output dimensions. It inherits the fine-to-coarse-grained learnable hierarchy of traditional butterflies and when deployed to neural networks, the prominent structures and sparsity in a DeBut layer constitutes a new way for network compression. We apply DeBut as a drop-in replacement of standard fully connected and convolutional layers, and demonstrate its superiority in homogenizing a neural network and rendering it favorable properties such as light weight and low inference complexity, without compromising accuracy. The natural complexity-accuracy tradeoff arising from the myriad deformations of a DeBut layer also opens up new rooms for analytical and practical research. The codes and Appendix are publicly available at: https://github.com/ruilin0212/DeBut.

preprint2022arXiv

Dissipation-engineered family of nearly dark states in many-body cavity-atom systems

Three-level atomic systems coupled to light have the capacity to host dark states. We study a system of V-shaped three-level atoms coherently coupled to the two quadratures of a dissipative cavity. The interplay between the atomic level structure and dissipation makes the phase diagram of the open system drastically different from the closed one. In particular, it leads to the stabilization of a continuous family of dark and nearly dark excited many-body states with inverted atomic populations as the steady states. The multistability of these states can be probed via their distinct fluctuations and excitation spectra, as well as the system's Liouvillian dynamics which are highly sensitive to ramp protocols. Our model can be implemented experimentally by encoding the two higher-energy modes in orthogonal density-modulated states in a bosonic quantum gas. This implementation offers prospects for potential applications like the realization of quantum optical random walks and microscopy with subwavelength spatial resolution.

preprint2022arXiv

Observing dynamical currents in a non-Hermitian momentum lattice

We report on the experimental realization and detection of dynamical currents in a spin-textured lattice in momentum space. Collective tunneling is implemented via cavity-assisted Raman scattering of photons by a spinor Bose-Einstein condensate into an optical cavity. The photon field inducing the tunneling processes is subject to cavity dissipation, resulting in effective directional dynamics in a non-Hermitian setting. We observe that the individual tunneling events are superradiant in nature and locally resolve them in the lattice by performing real-time, frequency-resolved measurements of the leaking cavity field. The results can be extended to a regime exhibiting a cascade of currents and simultaneous coherences between multiple lattice sites, where numerical simulations provide further understanding of the dynamics. Our observations showcase dynamical tunneling in momentum-space lattices and provide prospects to realize dynamical gauge fields in driven-dissipative settings.

preprint2022arXiv

PECAN: A Product-Quantized Content Addressable Memory Network

A novel deep neural network (DNN) architecture is proposed wherein the filtering and linear transform are realized solely with product quantization (PQ). This results in a natural implementation via content addressable memory (CAM), which transcends regular DNN layer operations and requires only simple table lookup. Two schemes are developed for the end-to-end PQ prototype training, namely, through angle- and distance-based similarities, which differ in their multiplicative and additive natures with different complexity-accuracy tradeoffs. Even more, the distance-based scheme constitutes a truly multiplier-free DNN solution. Experiments confirm the feasibility of such Product-Quantized Content Addressable Memory Network (PECAN), which has strong implication on hardware-efficient deployments especially for in-memory computing.

preprint2022arXiv

What Do Adversarially trained Neural Networks Focus: A Fourier Domain-based Study

Although many fields have witnessed the superior performance brought about by deep learning, the robustness of neural networks remains an open issue. Specifically, a small adversarial perturbation on the input may cause the model to produce a completely different output. Such poor robustness implies many potential hazards, especially in security-critical applications, e.g., autonomous driving and mobile robotics. This work studies what information the adversarially trained model focuses on. Empirically, we notice that the differences between the clean and adversarial data are mainly distributed in the low-frequency region. We then find that an adversarially-trained model is more robust than its naturally-trained counterpart due to the reason that the former pays more attention to learning the dominant information in low-frequency components. In addition, we consider two common ways to improve model robustness, namely, by data augmentation and by using stronger network architectures, and understand these techniques from a frequency-domain perspective. We are hopeful this work can shed light on the design of more robust neural networks.

preprint2021arXiv

FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation

Learning convolutional neural networks (CNNs) with low bitwidth is challenging because performance may drop significantly after quantization. Prior arts often discretize the network weights by carefully tuning hyper-parameters of quantization (e.g. non-uniform stepsize and layer-wise bitwidths), which are complicated and sub-optimal because the full-precision and low-precision models have a large discrepancy. This work presents a novel quantization pipeline, Frequency-Aware Transformation (FAT), which has several appealing benefits. (1) Rather than designing complicated quantizers like existing works, FAT learns to transform network weights in the frequency domain before quantization, making them more amenable to training in low bitwidth. (2) With FAT, CNNs can be easily trained in low precision using simple standard quantizers without tedious hyper-parameter tuning. Theoretical analysis shows that FAT improves both uniform and non-uniform quantizers. (3) FAT can be easily plugged into many CNN architectures. When training ResNet-18 and MobileNet-V2 in 4 bits, FAT plus a simple rounding operation already achieves 70.5% and 69.2% top-1 accuracy on ImageNet without bells and whistles, outperforming recent state-of-the-art by reducing 54.9X and 45.7X computations against full-precision models. We hope FAT provides a novel perspective for model quantization. Code is available at \url{https://github.com/ChaofanTao/FAT_Quantization}.

preprint2020arXiv

HOTCAKE: Higher Order Tucker Articulated Kernels for Deeper CNN Compression

The emerging edge computing has promoted immense interests in compacting a neural network without sacrificing much accuracy. In this regard, low-rank tensor decomposition constitutes a powerful tool to compress convolutional neural networks (CNNs) by decomposing the 4-way kernel tensor into multi-stage smaller ones. Building on top of Tucker-2 decomposition, we propose a generalized Higher Order Tucker Articulated Kernels (HOTCAKE) scheme comprising four steps: input channel decomposition, guided Tucker rank selection, higher order Tucker decomposition and fine-tuning. By subjecting each CONV layer to HOTCAKE, a highly compressed CNN model with graceful accuracy trade-off is obtained. Experiments show HOTCAKE can compress even pre-compressed models and produce state-of-the-art lightweight networks.

preprint2020arXiv

MCTDH-X: The multiconfigurational time-dependent Hartree method for indistinguishable particles software

We introduce and describe the multiconfigurational time-depenent Hartree for indistinguishable particles (MCTDH-X) software. This powerful tool allows the investigation of ground state properties and dynamics of interacting quantum many-body systems in different spatial dimensions. The MCTDH-X software is a set of programs and scripts to compute, analyze, and visualize solutions for the time-dependent and time-independent many-body Schrödinger equation for indistinguishable quantum particles. As the MCTDH-X software represents a general solver for the Schrödinger equation, it is applicable to a wide range of problems in the fields of atomic, optical, molecular physics as well as condensed matter systems. In particular, it can be used to study light-matter interactions, correlated dynamics of electrons, as well as some aspects related to quantum information and computing. The MCTDH-X software solves a set of non-linear coupled working equations based on the application of the variational principle to the Schrödinger equation. These equations are obtained by using an ansatz for the many-body wavefunction that is a time-dependent expansion in a set of time-dependent many-body basis states. The time-dependence of the basis set enables MCTDH-X to deal with quantum dynamics at a superior accuracy as compared to, for instance, exact diagonalization approaches. Herein, we give an introduction to the MCTDH-X software via an easy-to-follow tutorial with a focus on accessibility. We use the double well to illustrate the fermionization of bosonic particles, the crystallization of fermionic particles, characteristics of the superfluid and Mott-insulator quantum phases in Hubbard models, and even dynamical quantum phase transitions. Our tutorial guides the potential user to apply the MCTDH-X software also to more complex systems.

preprint2020arXiv

Pathway to chaos through hierarchical superfluidity in a cavity-BEC system

We explore the role of atomic correlations in a harmonically trapped Bose-Einstein condensate coupled to a dissipative cavity, where both the atoms and the cavity are blue detuned from the external pumping laser. Using a genuine many-body approach that goes beyond mean-field, we extract density distributions and many-body correlations to unveil a pathway to chaos at large pump power through a hierarchical self-organization of the atoms, where the atoms transition from a single-well optical lattice to a double-well optical lattice. Correlated states of the atoms emerge and are characterized by local superfluid correlations in phases which are globally superfluid or Mott insulating. Local superfluid-Mott transitions are precluded by a dynamical instability to chaos which occurs via quasiperiodic attractors. Our results explain the mechanism behind the dynamical instabilities observed in experiments.

preprint2019arXiv

Spiral order from orientationally correlated random bonds in classical XY models

We discuss the stability of ferromagnetic long-range order in three-dimensional classical XY ferromagnets upon substitution of a small subset of equally oriented bonds by impurity bonds, on which the ferromagnetic exchange J_perp > 0 is replaced by a strong antiferromagnetic coupling J_imp < 0. In the presence of a single impurity bond, once the absolute value of the frustrating coupling J_imp < 0 exceeds a threshold J_c > 0, the ground state becomes two-fold degenerate, corresponding to either clockwise or anticlockwise canting of the spins in the vicinity of the impurity bond. In the presence of a small concentration of impurity bonds, the effective low-energy Hamiltonian is that of Ising variables encoding the sense of rotation of the local canting around the impurities. Those degrees of freedom interact through a dipolar interaction mediated by spin waves. A ferromagnetic Ising ground state indicates the instability of the XY ferromagnet towards a spiral state with a wave vector proportional to the concentration of impurity bonds. To analyze under which circumstances such a ground state arises, we study first impurities forming superlattices. For a subclass of those, we can rigorously establish the existence of spiral order. For another class of superlattices, the Ising variables order ferromagnetically in planes perpendicular to the orientation of impurity bonds, but antiferromagnetically parallel to it, which results in a fan-like XY ground state. Second, we consider the case when the impurity bonds are randomly distributed on the three-dimensional host lattice according to a Poisson process. We show the phenomenon of spiral order by disorder with an ordering wave vector proportional to the impurity concentration. The analytical predictions are confirmed by Monte Carlo simulations and are relevant for magnetic materials such as YBaCuFeO_5.

Rui Lin

What is connected

Connect this record

See the researcher in context

Building this map preview

13 published item(s)

VULCAN: Tool-Augmented Multi Agents for Iterative 3D Object Arrangement

Hear: Hierarchically Enhanced Aesthetic Representations For Multidimensional Music Evaluation

Coarse to Fine: Image Restoration Boosted by Multi-Scale Low-Rank Tensor Completion

Deformable Butterfly: A Highly Structured and Sparse Linear Transform

Dissipation-engineered family of nearly dark states in many-body cavity-atom systems

Observing dynamical currents in a non-Hermitian momentum lattice

PECAN: A Product-Quantized Content Addressable Memory Network

What Do Adversarially trained Neural Networks Focus: A Fourier Domain-based Study

FAT: Learning Low-Bitwidth Parametric Representation via Frequency-Aware Transformation

HOTCAKE: Higher Order Tucker Articulated Kernels for Deeper CNN Compression

MCTDH-X: The multiconfigurational time-dependent Hartree method for indistinguishable particles software

Pathway to chaos through hierarchical superfluidity in a cavity-BEC system

Spiral order from orientationally correlated random bonds in classical XY models