Source author record

Ming Lu

Ming Lu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Computer Vision, math.QA, math.RT, Artificial Intelligence, cond-mat.mes-hall, cond-mat.mtrl-sci, eess.IV, math.CO, physics.app-ph, quant-ph

Catalog footprint

What is connected

8 works

10 topics

4 close collaborators

preprint, 2026, arXiv

iQuantum groups and iHopf algebras II: dual canonical bases

Building on the iHopf algebra realization of quasi-split universal iquantum groups developed in a prequel, we construct the dual canonical basis for a universal iquantum group of arbitrary finite type, which is further shown to be preserved by the ibraid group action; this recovers the results of Lu-Pan in ADE type obtained earlier by a geometric approach. Moreover, we identify the dual canonical basis for the Drinfeld double quantum group of arbitrary finite type, which is realized via the iHopf algebra on the double Borel, with Berenstein-Greenstein's double canonical basis, settling several of their conjectures.

preprint, 2026, arXiv

ParkGaussian: Surround-view 3D Gaussian Splatting for Autonomous Parking

Parking is a critical task for autonomous driving systems (ADS), with unique challenges in crowded parking slots and GPS-denied environments. However, existing works focus on 2D parking slot perception, mapping, and localization, while 3D reconstruction, which is crucial for capturing complex spatial geometry in parking scenarios, remains underexplored. Naively improving the visual quality of reconstructed parking scenes does not directly benefit autonomous parking, as the key entry point for parking is the slot perception module. To address these limitations, we curate the first benchmark, named ParkRecon3D, specifically designed for parking scene reconstruction. It includes sensor data from four surround-view fisheye cameras with calibrated extrinsics and dense parking slot annotations. We then propose ParkGaussian, the first framework that integrates 3D Gaussian Splatting (3DGS) for parking scene reconstruction. To further improve the alignment between reconstruction and downstream parking slot detection, we introduce a slot-aware reconstruction strategy that leverages existing parking perception methods to enhance the synthesis quality of slot regions. Experiments on ParkRecon3D demonstrate that ParkGaussian achieves state-of-the-art reconstruction quality and better preserves perception consistency for downstream tasks. The code and dataset will be released at: https://github.com/wm-research/ParkGaussian

preprint, 2026, arXiv

ResTok: Learning Hierarchical Residuals in 1D Visual Tokenizers for Autoregressive Image Generation

Existing 1D visual tokenizers for autoregressive (AR) generation largely follow the design principles of language modeling, as they are built directly upon transformers whose priors originate in language, yielding single-hierarchy latent tokens and treating visual data as flat sequential token streams. However, this language-like formulation overlooks key properties of vision, particularly the hierarchical and residual network designs that have long been essential for convergence and efficiency in visual models. To bring "vision" back to vision, we propose the Residual Tokenizer (ResTok), a 1D visual tokenizer that builds hierarchical residuals for both image tokens and latent tokens. The hierarchical representations obtained through progressive merging enable cross-level feature fusion at each layer, substantially enhancing representational capacity. Meanwhile, the semantic residuals between hierarchies prevent information overlap, yielding more concentrated latent distributions that are easier for AR modeling. Cross-level bindings consequently emerge without any explicit constraints. To accelerate the generation process, we further introduce a hierarchical AR generator that substantially reduces sampling steps by predicting an entire level of latent tokens at once rather than generating them strictly token-by-token. Extensive experiments demonstrate that restoring hierarchical residual priors in visual tokenization significantly improves AR image generation, achieving a gFID of 2.34 on ImageNet-256 with only 9 sampling steps. Code is available at https://github.com/Kwai-Kolors/ResTok.

preprint, 2026, arXiv

Three-dimensional quantum anomalous Hall effect in Weyl semimetals

The quantum anomalous Hall effect (QAHE) is a quantum phenomenon in which a two-dimensional system exhibits a quantized Hall resistance $h/e^2$ in the absence of a magnetic field, where $h$ is the Planck constant and $e$ is the electron charge. In this work, we extend this novel phase to three dimensions and thus propose a three-dimensional QAHE exhibiting richer and more versatile transport behaviors. We first confirm this three-dimensional QAHE through the quantized Chern number, then establish its bulk-boundary correspondence, and finally reaffirm it via the distinctive transport properties. Remarkably, we find that the three-dimensional QAHE hosts two chiral surface states along one spatial direction and a pair of chiral hinge states along another direction, and the location of the hinge states depends sensitively on the Fermi energy. These two types of boundary states are further connected through perpendicular chiral surface states, whose chirality is also Fermi-energy dependent. Consequently, depending on the transport direction, its Hall resistance can quantize to $0$, $h/e^2$, or $\pm h/e^2$ when the Fermi energy is tuned across the charge neutral point. This three-dimensional QAHE not only fills the gap in the Hall effect family but also holds significant potential for device applications such as in-memory computing.

preprint, 2026, arXiv

Xiaomi EV World Model: A Joint World Model Integrating Reconstruction and Generation for Autonomous Driving

This report presents a unified technical system addressing the two core capabilities of world models for autonomous driving: world representation and world generation. For world representation, we propose WorldRec, a feed-forward reconstruction architecture driven by sparse scene queries. WorldRec initializes structured queries in 3D space, leveraging them to aggregate cross-view, cross-temporal features, thereby naturally enforcing spatial consistency across frames and yielding compact yet high-fidelity 3D Gaussian scene representations. For world generation, we propose WorldGen, a two-stage training framework of bidirectional pretraining followed by causal fine-tuning through three progressive stages (Teacher Forcing, ODE distillation, and DMD), enabling high-quality online causal video generation in as few as 4 denoising steps. Building on both modules, we further introduce the Joint World Model (JWM), which deeply integrates WorldRec and WorldGen to achieve synergistic gains in generation stability, cross-frame consistency, and visual fidelity, providing a solid foundation for closed-loop simulation, data synthesis, and end-to-end training in autonomous driving.

preprint, 2026, arXiv

YODA: Yet Another One-step Diffusion-based Video Compressor

While one-step diffusion models have recently excelled in perceptual image compression, their application to video remains limited. Prior efforts typically rely on pretrained 2D autoencoders that generate per-frame latent representations independently, thereby neglecting temporal dependencies. We present YODA--Yet Another One-step Diffusion-based Video Compressor--which embeds multiscale features from temporal references for both latent generation and latent coding to better exploit spatial-temporal correlations for more compact representation, and employs a linear Diffusion Transformer (DiT) for efficient one-step denoising. YODA achieves state-of-the-art perceptual performance, consistently outperforming traditional and deep-learning baselines on LPIPS, DISTS, FID, and KID. Source code will be publicly available at https://github.com/NJUVISION/YODA.

preprint, 2023, arXiv

$\imath$Hall algebra of Jordan quiver and $\imath$Hall-Littlewood functions

We show that the $\imath$Hall algebra of the Jordan quiver is a polynomial ring in infinitely many generators and obtain transition relations among several generating sets. We establish a ring isomorphism from this $\imath$Hall algebra to the ring of symmetric functions in two parameters $t, \theta$, which maps the $\imath$Hall basis to a class of (modified) inhomogeneous Hall-Littlewood ($\imath$HL) functions. The (modified) $\imath$HL functions admit a formulation via raising and lowering operators. We formulate and prove Pieri rules for (modified) $\imath$HL functions. The modified $\imath$HL functions specialize at $\theta=0$ to the modified HL functions; they specialize at $\theta=1$ to the deformed universal characters of type C, which further specialize at $(t=0, \theta=1)$ to the universal characters of type C.

preprint, 2022, arXiv

$\imath$Hall algebras and $\imath$quantum groups

We survey recent developments in the theory of $\imath$Hall algebras. Starting from $\imath$quivers (aka quivers with involutions), we construct a class of 1-Gorenstein algebras called $\imath$quiver algebras, whose semi-derived Hall algebras give us $\imath$Hall algebras. We then use these $\imath$Hall algebras to realize quasi-split $\imath$quantum groups arising from quantum symmetric pairs. Relative braid group symmetries on $\imath$quantum groups are realized via reflection functors. In the case of the Jordan $\imath$quiver, the $\imath$Hall algebra is commutative and connections to $\imath$Hall-Littlewood symmetric functions are developed. In the case of $\imath$quivers of diagonal type, our construction amounts to a reformulation of the Bridgeland-Hall algebra realization of the Drinfeld double quantum groups (which in turn generalizes the Ringel-Hall algebra realization of halves of quantum groups). Many rank 1 and rank 2 computations are supplied to illustrate the general constructions. We also briefly review $\imath$Hall algebras of weighted projective lines, and use them to realize Drinfeld type presentations of $\imath$quantum loop algebras.

Institution

Affiliation not imported yet

This author record came from a source that does not expose affiliation metadata. Once the author claims the profile or we enrich the record from another provider, this section will link to the concrete institution.

Topic footprint