Source author record

Ying Ma

Ying Ma appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

cond-mat.str-el, Computation and Language, Computer Vision, cond-mat.mtrl-sci, cond-mat.supr-con, eess.AS, Information Retrieval, Machine Learning, math.QA, Sound

Catalog footprint

What is connected

6 works

10 topics

4 close collaborators

Preprint, 2026, arXiv

Adaptive Diffusion-based Augmentation for Recommendation

Recommendation systems often rely on implicit feedback, where only positive user-item interactions can be observed. Negative sampling is therefore crucial to provide proper negative training signals. However, existing methods tend to mislabel potentially positive but unobserved items as negatives and lack precise control over negative sample selection. We aim to address these issues by generating controllable negative samples, rather than sampling from the existing item pool. In this context, we propose Adaptive Diffusion-based Augmentation for Recommendation (ADAR), a novel and model-agnostic module that leverages diffusion to synthesize informative negatives. Inspired by the progressive corruption process in diffusion, ADAR simulates a continuous transition from positive to negative, allowing for fine-grained control over sample hardness. To mine suitable negative samples, we theoretically identify the transition point at which a positive sample turns negative and derive a score-aware function to adaptively determine the optimal sampling timestep. By identifying this transition point, ADAR generates challenging negative samples that effectively refine the model's decision boundary. Experiments confirm that ADAR is broadly compatible and substantially boosts the performance of existing recommendation models, including collaborative filtering and sequential recommendation models, without architectural modifications.

Preprint, 2026, arXiv

Cooperative concurrence of 4f and 3d flat bands in kagome heavy-fermion metal YbCr6Ge6

Flat-band (FB) systems, whether originating from special lattice geometry as in kagome metals or from localized orbitals as in heavy-fermion (HF) compounds, have attracted intense interest owing to their band topology and strong electron correlation effects, which lead to emergent quantum states of matter. However, the question of how these two distinct types of FB coexist and interact remains unsettled. Here, we report that YbCr6Ge6, which hosts both a Cr-kagome lattice and Yb-4f electrons, exhibits HF behavior and a robust antiferromagnetic ground state with transition temperature TN = 3 K, significantly higher than that of other similar kagome metals with Yb ions. Angle-resolved photoemission spectroscopy measurements reveal the coexistence of FBs originating from both the Cr-kagome lattice and localized Yb-4f electrons near the Fermi level EF. More importantly, the clear spectroscopic signatures of hybridization between the Yb-4f FB and kagome-lattice-derived conduction bands, together with the high density of states of the Cr-kagome FB near EF, provide the underlying microscopic mechanisms of the HF behavior and enhanced antiferromagnetism in YbCr6Ge6. Our findings demonstrate that novel kagome HF metals not only host the cooperative coexistence of two different types of FBs but also provide a paradigm material platform for exploring exotic correlated topological quantum phenomena.

Preprint, 2026, arXiv

Nematic-fluctuation-mediated superconductivity in CuxTiSe2

The interplay among electronic nematicity, charge density waves, and superconductivity in correlated electronic systems has attracted extensive research interest. Here, we discover nematic fluctuations in TiSe2 single crystals and investigate their evolution with Cu intercalation. The elastoresistivity coefficient mEg exhibits a divergent temperature dependence following a Curie-Weiss law at high temperature. Upon Cu intercalation, the characteristic temperature T* of the nematic fluctuations is progressively suppressed and approaches zero when superconductivity is optimized. Further Cu intercalation leads to a sign change of T* and the suppression of superconductivity. These results strongly indicate that a nematic phase transition may play a vital role in enhancing superconductivity in CuxTiSe2. Therefore, CuxTiSe2 provides a unique material platform for exploring nematic-fluctuation-mediated superconductivity.

Preprint, 2026, arXiv

RTV-Bench: Benchmarking MLLM Continuous Perception, Understanding and Reasoning through Real-Time Video

Multimodal Large Language Models (MLLMs) have made rapid progress in perception, understanding, and reasoning, yet existing benchmarks fall short in evaluating these abilities under continuous and dynamic real-world video streams. Such settings require models to maintain coherent understanding and reasoning as visual scenes evolve over time. We introduce RTV-Bench, a fine-grained benchmark for real-time video analysis with MLLMs. It is built upon three key principles: multi-timestamp question answering, hierarchical question structures spanning perception and reasoning, and multi-dimensional evaluation of continuous perception, understanding, and reasoning. RTV-Bench comprises 552 diverse videos and 4,608 carefully curated QA pairs covering a wide range of dynamic scenarios. We evaluate a broad range of state-of-the-art MLLMs, including proprietary, open-source offline, and open-source real-time models. Our results show that real-time models generally outperform offline counterparts but still lag behind leading proprietary systems. While scaling model capacity generally yields performance gains, simply increasing the density of sampled input frames does not consistently translate into improved results. These observations suggest inherent limitations in current architectures when handling long-horizon video streams, underscoring the need for models explicitly designed for streaming video processing and analysis.

Preprint, 2022, arXiv

Diagram automorphisms and canonical bases for quantum affine algebras, II

Let ${\mathbf U}_q^-$ be the negative part of the quantum enveloping algebra, and $\sigma$ the algebra automorphism on ${\mathbf U}_q^-$ induced from a diagram automorphism. Let $\underline{\mathbf U}_q^-$ be the quantum algebra obtained from $\sigma$, and $\widetilde{\mathbf B}$ (resp. $\widetilde{\underline{\mathbf B}}$) the canonical signed basis of ${\mathbf U}_q^-$ (resp. $\underline{\mathbf U}_q^-$). Assume that ${\mathbf U}_q^-$ is simply-laced of finite or affine type. In our previous papers [SZ1, 2], we proved, by an elementary method, that there exists a natural bijection $\widetilde{\mathbf B}^\sigma \simeq \widetilde{\underline{\mathbf B}}$ in the case where $\sigma$ is admissible. In this paper, we show that such a bijection exists even if $\sigma$ is not admissible, except possibly in some small-rank cases.

Preprint, 2020, arXiv

Lattice-based Improvements for Voice Triggering Using Graph Neural Networks

Voice-triggered smart assistants often rely on detection of a trigger phrase before they start listening for the user request. Mitigation of false triggers is an important aspect of building a privacy-centric, non-intrusive smart assistant. In this paper, we address the task of false trigger mitigation (FTM) using a novel approach based on analyzing automatic speech recognition (ASR) lattices using graph neural networks (GNN). The proposed approach exploits the fact that the decoding lattice of falsely triggered audio exhibits uncertainty, in the form of many alternative paths and unexpected words on the lattice arcs, compared with the lattice of correctly triggered audio. A pure trigger-phrase detector model does not fully utilize the intent of the user's speech, whereas by using the complete decoding lattice of the user audio, we can effectively mitigate speech not intended for the smart assistant. We deploy two variants of GNNs in this paper, based on (1) graph convolution layers and (2) a self-attention mechanism, respectively. Our experiments demonstrate that GNNs are highly accurate on the FTM task, mitigating ~87% of false triggers at 99% true positive rate (TPR). Furthermore, the proposed models are fast to train and efficient in parameter requirements.
