Researcher profile

Junkai Zhang

Junkai Zhang contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

ICU-Bench:Benchmarking Continual Unlearning in Multimodal Large Language Models

Although Multimodal Large Language Models (MLLMs) have achieved remarkable progress across many domains, their training on large-scale multimodal datasets raises serious privacy concerns, making effective machine unlearning increasingly necessary. However, existing benchmarks mainly focus on static or short-sequence settings, offering limited support for evaluating continual privacy deletion requests in realistic deployments. To bridge this gap, we introduce ICU-Bench, a continual multimodal unlearning benchmark built on privacy-critical document data. ICU-Bench contains 1,000 privacy-sensitive profiles from two document domains, medical reports and labor contracts, with 9,500 images, 16,000 question-answer pairs, and 100 forget tasks. Additionally, new continual unlearning metrics are introduced, facilitating a comprehensive analysis of forgetting effectiveness, historical forgetting preservation, retained utility, and stability throughout the continual unlearning process. Through extensive experiments with representative unlearning methods on ICU-Bench, we show that existing methods generally struggle in continual settings and exhibit clear limitations in balancing forgetting quality, utility preservation, and scalability over long task sequences. These findings highlight the need for multimodal unlearning methods explicitly designed for continual privacy deletion.

preprint2026arXiv

METASYMBO: Multi-Agent Language-Guided Metamaterial Discovery via Symbolic Latent Evolution

Metamaterial discovery seeks microstructured materials whose geometry induces targeted mechanical behavior. Existing inverse-design methods can efficiently generate candidates, but they typically require explicit numerical property targets and are less suitable for early-stage exploration, where researchers often begin with incomplete constraints and qualitative intents expressed in natural language. Large language models can interpret such intents, but they lack geometric awareness and physical property validity. To address this gap, we propose MetaSymbO, a multi-agent framework for language-guided Metamaterial discovery via Symbolic-driven latent evOlution. Specifically, MetaSymbO contains three agents: a Designer that interprets free-form design intents and retrieves a semantically consistent scaffold, a Generator that synthesizes candidate microstructures in a disentangled latent space, and a Supervisor that provides fast property-aware feedback for iterative refinement. To move beyond the limitations of reproducing known samples from literature and training data, we further introduce symbolic-driven latent evolution, which applies programmable operators over disentangled latent factors to compose, modify, and refine structures at inference time. Extensive experiments demonstrate that (i) MetaSymbO improves structural validity by up to 34% in symmetry and nearly 98% in periodicity compared to state-of-the-art baselines; (ii) MetaSymbO achieves about 6-7% higher language-guidance scores while maintaining superior structure novelty compared to advanced reasoning LLMs; (iii) qualitative analyses confirm the effectiveness of symbolic logic operators in enabling programmable semantic alignment; and (iv) realworld case studies on auxetic, high-stiffness metamaterial design further validate its practical capability.

preprint2026arXiv

RDKV: Rate-Distortion Bit Allocation for Joint Eviction and Quantization of the KV Cache

Large language models (LLMs) have shown strong performance across diverse tasks, but their inference with long input contexts is bottlenecked by memory size and bandwidth. The Key-Value (KV) cache size grows linearly with sequence length and needs to be re-read from off-chip high-bandwidth memory (HBM) to on-chip memory at every decoding step, resulting in memory-bound inference. Existing methods reduce the cache by either eviction or quantization, but typically treat the two in isolation. In this paper, we cast KV cache compression as a rate-distortion problem, under which eviction and quantization are two end-points of the same bit allocation scheme. This exposes the need to optimize them jointly, motivating our method, RDKV (Rate-Distortion KV cache compression). RDKV derives the weight of each token or channel from the distortion that compression induces on the attention computation. Based on these weights, it assigns each token or channel a bit-width ranging from full precision down to zero bits guided by reverse water-filling, applied once after the prefilling stage. Experiments on LongBench, RULER, and InfiniteBench show that RDKV outperforms the best evaluated baseline by 9.1% on average. On LongBench it recovers 97.81% of full-cache accuracy with only 2.48% cache retention. Compared with full-cache FlashAttention-2 decoding, it achieves 4.5x decode speedup and 1.9x peak memory reduction with 128K context length, while maintaining comparable performance.

preprint2022arXiv

3D intrinsic shapes of quiescent galaxies in observations and simulations

We study the intrinsic 3D shapes of quiescent galaxies over the last half of cosmic history based on their axial ratio distribution. To this end, we construct a sample of unprecedented size, exploiting multi-wavelength $u$-to-$K_s$ photometry from the deep wide area surveys KiDS+VIKING paired with high-quality $i$-band imaging from HSC-SSP. Dependencies of the shapes on mass, redshift, photometric bulge prominence and environment are considered. For comparison, the intrinsic shapes of quenched galaxies in the IllustrisTNG simulations are analyzed and contrasted to their formation history. We find that over the full $0<z<0.9$ range, and in both simulations and observations, spheroidal 3D shapes become more abundant at $M_* > 10^{11}\ M_{\odot}$, with the effect being most pronounced at lower redshifts. In TNG, the most massive galaxies feature the highest ex-situ stellar mass fractions, pointing to violent relaxation via mergers as the mechanism responsible for their 3D shape transformation. Larger differences between observed and simulated shapes are found at low to intermediate masses. At any mass, the most spheroidal quiescent galaxies in TNG feature the highest bulge mass fractions, and conversely observed quiescent galaxies with the highest bulge-to-total ratios are found to be intrinsically the roundest. Finally, we detect an environmental influence on galaxy shape, at least at the highest masses, such that at fixed mass and redshift quiescent galaxies tend to be rounder in denser environments.

preprint2022arXiv

Cool outflows in MaNGA: a systematic study and comparison to the warm phase

This paper investigates the neutral gas phase of galactic winds via the Na I D$λλ5890,5895$Å feature within $z \sim 0.04$ MaNGA galaxies, and directly compares their incidence and strength to the ionized winds detected within the same parent sample. We find evidence for neutral outflows in 127 galaxies ($\sim 5$ per cent of the analysed line-emitting sample). Na I D winds are preferentially seen in galaxies with dustier central regions and both wind phases are more often found in systems with elevated SFR surface densities, especially when there has been a recent upturn in the star formation activity according to the SFR$_{5Myr}$/SFR$_{800Myr}$ parameter. We find the ionized outflow kinematics to be in line with what we measure in the neutral phase. This demonstrates that, despite their small contributions to the total outflow mass budget, there is value to collecting empirical measurements of the ionized wind phase to provide information on the bulk motion in the outflow. Depending on dust corrections applied to the ionized gas diagnostics, the neutral phase has $\sim 1.2 - 1.8$ dex higher mass outflow rates ($\dot{M}_{out}$), on average, compared to the ionized phase. We quantify scaling relations between $\dot{M}_{out}$ and the strengths of the physical wind drivers (SFR, $L_{AGN}$). Using a radial-azimuthal stacking method, and by considering inclination dependencies, we find results consistent with biconical outflows orthogonal to the disk plane. Our work complements other multi-phase outflow studies in the literature which consider smaller samples, more extreme objects, or proceed via stacking of larger samples.