Researcher profile

Yadong Xue

Yadong Xue contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2026arXiv

GLM-4.5V and GLM-4.1V-Thinking: Towards Versatile Multimodal Reasoning with Scalable Reinforcement Learning

We present GLM-4.1V-Thinking, GLM-4.5V, and GLM-4.6V, a family of vision-language models (VLMs) designed to advance general-purpose multimodal understanding and reasoning. In this report, we share our key findings in the development of the reasoning-centric training framework. We first develop a capable vision foundation model with significant potential through large-scale pre-training, which arguably sets the upper bound for the final performance. We then propose Reinforcement Learning with Curriculum Sampling (RLCS) to unlock the full potential of the model, leading to comprehensive capability enhancement across a diverse range of tasks, including STEM problem solving, video understanding, content recognition, coding, grounding, GUI-based agents, and long document interpretation. In a comprehensive evaluation across 42 public benchmarks, GLM-4.5V achieves state-of-the-art performance on nearly all tasks among open-source models of similar size, and demonstrates competitive or even superior results compared to closed-source models such as Gemini-2.5-Flash on challenging tasks including Coding and GUI Agents. Meanwhile, the smaller GLM-4.1V-9B-Thinking remains highly competitive-achieving superior results to the much larger Qwen2.5-VL-72B on 29 benchmarks. We open-source both GLM-4.1V-9B-Thinking and GLM-4.5V. We further introduce the GLM-4.6V series, open-source multimodal models with native tool use and a 128K context window. A brief overview is available at https://z.ai/blog/glm-4.6v. Code, models and more information are released at https://github.com/zai-org/GLM-V.

preprint2026arXiv

GLM-5V-Turbo: Toward a Native Foundation Model for Multimodal Agents

We present GLM-5V-Turbo, a step toward native foundation models for multimodal agents. As foundation models are increasingly deployed in real environments, agentic capability depends not only on language reasoning, but also on the ability to perceive, interpret, and act over heterogeneous contexts such as images, videos, webpages, documents, GUIs. GLM-5V-Turbo is built around this objective: multimodal perception is integrated as a core component of reasoning, planning, tool use, and execution, rather than as an auxiliary interface to a language model. This report summarizes the main improvements behind GLM-5V-Turbo across model design, multimodal training, reinforcement learning, toolchain expansion, and integration with agent frameworks. These developments lead to strong performance in multimodal coding, visual tool use, and framework-based agentic tasks, while preserving competitive text-only coding capability. More importantly, our development process offers practical insights for building multimodal agents, highlighting the central role of multimodal perception, hierarchical optimization, and reliable end-to-end verification.

preprint2025arXiv

The transverse-traceless gauge and the gauge problem of second order gravitational waves

The gauge problem arises in the second order gravitational waves due to the mode mixing. Here, we introduce the transverse-traceless (TT) gauge to cosmological backgrounds, and find that if we choose the TT gauge at first order, the second order tensor mode would be gauge invariant. Analogous to the Ricci flat spacetime, the vacuum condition is the key to guarantee the existence of the TT gauge on cosmological backgrounds. When we have the vacuum condition, the Poisson gauge, the uniform curvature gauge, the synchronous gauge and the total matter gauge are all equivalent to the TT gauge. Once the vacuum condition is approximately satisfied, the Poisson gauge would reduce to the TT gauge at the same order of approximation. With the sub-horizon limit, the vacuum condition could be obtained approximately, and the Poisson gauge, the uniform curvature gauge and the synchronous gauge are all approximated TT gauge. Our findings explain several existing results in the literature and indicate that the proposed TT gauge is useful to discuss higher order gravitational waves.

preprint2022arXiv

Effects of Born-Infeld electrodynamics on black hole shadows

In this work, we study the shadow of Born-Infeld (BI) black holes with magnetic monopoles and Schwarzschild black holes immersed in the BI uniform magnetic field. Illuminated by a celestial sphere, black hole images are obtained by using the backward ray-tracing method. For magnetically charged BI black holes, we find that the shadow radius increases with the increase of nonlinear electromagnetics effects. For Schwarzschild black holes immersed in the BI uniform magnetic field, photons tend to move towards the axis of symmetric, resulting in stretched shadows along the equatorial plane.

preprint2022arXiv

Shadow and Photon Sphere of Black Hole in Clouds of Strings and Quintessence

In this work, we study the shadow and photon sphere of the black bole in clouds of strings and quintessence with static and infalling spherical accretions. We obtain the geodesics of the photons near a black hole with different impact parameters $b$. The string clouds model and quintessence influence the specific intensity by affecting the geodesic and the average radial position of photons. And the range of string clouds parameter $a$ is constrained to ensure that the shadow can be observed. Moreover, we use a model of the photon emissivity $j(ν_e)$ to get the specific intensities. The light sources in the accretion follow a normal distribution with an attenuation factor $γ$. The shadow with static spherical accretion is plotted. The apparent shape of the shadow is a perfect circle, and the value of $γ$ affects the brightness of the photon sphere. We investigate the profile and specific intensity of the shadows with static and infalling spherical accretions respectively. The interior of the shadows with an infalling spherical accretion will be darker than that with the static spherical accretion, and the specific intensity with both static and infalling spherical accretion gradually converges.

preprint2022arXiv

Temporal and Spatial Chaos of RN-AdS Black Holes Immersed in Perfect Fluid Dark Matter

We investigate the thermodynamic chaos of RN-AdS black holes immersed in Perfect Fluid Dark Matter by considering the dynamical equations of the fluid system evolved in the spinodal region. Based on the Melnikov method, it is shown that there exists a critical amplitude that affects the temporal chaos. And the influence of black holes charge and state parameter on the critical amplitude is investigated with specific initial temperature. Then, for inevitable spatial chaos, three different types of portraits are discssued according to the difference between the phase transition pressure and the ambient pressure. Additionally, we check the local equilibrium near saddle points which shows that spatial chaos always exists regardless of the perturbation intensity.