Researcher profile

Yubo Ma

Yubo Ma contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

Qwen-Scope: Turning Sparse Features into Development Tools for Large Language Models

Large language models have achieved remarkable capabilities across diverse tasks, yet their internal decision-making processes remain largely opaque, limiting our ability to inspect, control, and systematically improve them. This opacity motivates a growing body of research in mechanistic interpretability, with sparse autoencoders (SAEs) emerging as one of the most promising tools for decomposing model activations into sparse, interpretable feature representations. We introduce Qwen-Scope, an open-source suite of SAEs built on the Qwen model family, comprising 14 groups of SAEs across 7 model variants from the Qwen3 and Qwen3.5 series, covering both dense and mixture-of-expert architectures. Built on top of these SAEs, we show that SAEs can go beyond post-hoc analysis to serve as practical interfaces for model development along four directions: (i) inference-time steering, where SAE feature directions control language, concepts, and preferences without modifying model weights; (ii) evaluation analysis, where activated SAE features provide a representation-level proxy for benchmark redundancy and capability coverage; (iii) data-centric workflows, where SAE features support multilingual toxicity classification and safety-oriented data synthesis; and (iv) post-training optimization, where SAE-derived signals are incorporated into supervised fine-tuning and reinforcement learning objectives to mitigate undesirable behaviors such as code-switching and repetition. Together, these results demonstrate that SAEs can serve not only as post-hoc analysis tools, but also as reusable representation-level interfaces for diagnosing, controlling, evaluating, and improving large language models. By open-sourcing Qwen-Scope, we aim to support mechanistic research and accelerate practical workflows that connect model internals to downstream behavior.

preprint2022arXiv

Prompt for Extraction? PAIE: Prompting Argument Interaction for Event Argument Extraction

In this paper, we propose an effective yet efficient model PAIE for both sentence-level and document-level Event Argument Extraction (EAE), which also generalizes well when there is a lack of training data. On the one hand, PAIE utilizes prompt tuning for extractive objectives to take the best advantages of Pre-trained Language Models (PLMs). It introduces two span selectors based on the prompt to select start/end tokens among input texts for each role. On the other hand, it captures argument interactions via multi-role prompts and conducts joint optimization with optimal span assignments via a bipartite matching loss. Also, with a flexible prompt design, PAIE can extract multiple arguments with the same role instead of conventional heuristic threshold tuning. We have conducted extensive experiments on three benchmarks, including both sentence- and document-level EAE. The results present promising improvements from PAIE (3.5\% and 2.3\% F1 gains in average on three benchmarks, for PAIE-base and PAIE-large respectively). Further analysis demonstrates the efficiency, generalization to few-shot settings, and effectiveness of different extractive prompt tuning strategies. Our code is available at https://github.com/mayubo2333/PAIE.

preprint2020arXiv

Phase transition and entropic force of de Sitter black hole in massive gravity

It is well known that de Sitter(dS) black holes generally have a black hole horizon and a cosmological horizon, both of which have Hawking radiation. But the radiation temperature of the two horizons is generally different, so dS black holes do not meet the requirements of thermal equilibrium stability, which brings certain difficulties to the study of the thermodynamic characteristics of black holes. In this paper, dS black hole is regarded as a thermodynamic system, and the effective thermodynamic quantities of the system are obtained. The influence of various state parameters on the effective thermodynamic quantities in the massive gravity space-time is discussed. The condition of the phase transition of the de Sitter black hole in massive gravity space-time is given. We consider that the total entropy of the dS black hole is the sum of the corresponding entropy of the two horizons plus an extra term from the correlation of the two horizons. By comparing the entropic force of interaction between black hole horizon and the cosmological horizon with Lennard-Jones force between two particles, we find that the change rule of entropic force between the two system is surprisingly the same. The research will help us to explore the real reason of accelerating expansion of the universe.

preprint2020arXiv

Thermodynamic properties of higher-dimensional dS black holes in dRGT massive gravity

On the basis of the state parameter of de Sitter space-time satisfying the first law of thermodynamics,we can derive some effective thermodynamic quantities.When the temperature of the black hole horizon is equal to that of the cosmological horizon, we think that the effective temperature of the space-time should have the same value. Using this condition, we obtain a differential equation of the entropy of the de Sitter black hole in the higherdimensional de Rham, Gabadadze and Tolley (dRGT) massive gravity. Solving the differential equation, we obtain the corrected entropy and effective thermodynamic quantities of the de Sitter black hole. The results show that for multiparameter black holes, the entropy satisfied differential equation is invariable with different independent state parameters. Therefore, the entropy of higher-dimensional dS black holes in dRGT massive gravity is only a function of the position of the black hole horizon, and is independent of other state parameters. It is consistent with the corresponding entropy of the black hole horizon and the cosmological horizon. The thermodynamic quantities of self-consistent de Sitter spacetime are given theoretically, and the equivalent thermodynamic quantities have the second-order phase transformation similar to AdS black hole, but unlike AdS black hole, the equivalent temperature of de Sitter space-time has a maximum value. By satisfying the requirement of thermodynamic equilibrium and stability of space-time, the conditions for the existence of dS black holes in the universe are obtained.

preprint2019arXiv

Phase transitions and entropy force of charged de Sitter black holes with cloud of string and quintessence

In this paper, we investigate the combined effects of the cloud of strings and quintessence on the thermodynamics of a Reissner-Nordström-de Sitter black hole. Based on the equivalent thermodynamic quantities considering the correlation between the black hole horizon and the cosmological horizon, we extensively discuss the phase transitions of the space-time. Our analysis prove that similar to the case in AdS space-time, second-order phase transitions could take place under certain conditions, with the absence of first-order phase transition in the charged de Sitter black holes with cloud of string and quintessence. The effects of different thermodynamic quantities on the phase transitions are also quantitatively discussed, which provides a new approach to study the thermodynamic qualities of unstable dS space-time. Focusing on the entropy force generated by the interaction between the black hole horizon and the cosmological horizon, as well as the Lennard-Jones force between two particles, our results demonstrate the strong degeneracy between the entropy force of the two horizons and the ratio of the horizon positions, which follows the surprisingly similar law given the relation between the Lennard-Jones force and the ratio of two particle positions. Therefore, the study of the entropy force between two horizons, is not only beneficial to the deep exploration of the three modes of cosmic evolution, but also helpful to understand the correlation between the microstates of particles in black holes and those in ordinary thermodynamic systems.