Researcher profile

Xin Han

Xin Han contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
10topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

Awaking Spatial Intelligence in Unified Multimodal Understanding and Generation

We present JoyAI-Image, a unified multimodal foundation model for visual understanding, text-to-image generation, and instruction-guided image editing. JoyAI-Image couples a spatially enhanced Multimodal Large Language Model (MLLM) with a Multimodal Diffusion Transformer (MMDiT), allowing perception and generation to interact through a shared multimodal interface. Around this architecture, we build a scalable training recipe that combines unified instruction tuning, long-text rendering supervision, spatially grounded data, and both general and spatial editing signals. This design gives the model broad multimodal capability while strengthening geometry-aware reasoning and controllable visual synthesis. Experiments across understanding, generation, long-text rendering, and editing benchmarks show that JoyAI-Image achieves state-of-the-art or highly competitive performance. More importantly, the bidirectional loop between enhanced understanding, controllable spatial editing, and novel-view-assisted reasoning enables the model to move beyond general visual competence toward stronger spatial intelligence. These results suggest a promising path for unified visual models in downstream applications such as vision-language-action systems and world models.

preprint2023arXiv

Insulator-to-metal Mott transition facilitated by lattice deformation in monolayer $α$-RuCl$_3$ on graphite

Creating heterostructures with graphene/graphite is a practical method for charge-doping $α$-RuCl$_3$, but not sufficient to cause the insulator-to-metal transition. In this study, detailed scanning tunneling microscopy/spectroscopy measurements on $α$-RuCl$_3$ with various lattice deformations reveal that both in-plane and out-of-plane lattice distortions may collapse the Mott-gap in the case of monolayer $α$-RuCl$_3$ in proximity to graphite, but have little impact on its bulk form alone. In the Mott-Hubbard framework, the transition is attributed to the lattice distortion-facilitated substantial modulation of the electron correlation parameter. Observation of the orbital textures on a highly compressed monolayer $α$-RuCl$_3$ flake on graphite provides valuable evidence that electrons are efficiently transferred from the heterointerface into Cl3$p$ orbitals under the lattice distortion. It is believed that the splitting of Ru $t_{2g}$ bands within the trigonal distortion of Ru-Cl-Ru octahedra bonds generated the electrons transfer pathways. The increase of the Cl3$p$ states enhance the hopping integral in the Mott-Hubbard bands, resulting in the Mott-transition. These findings suggest a new route for implementing the insulator-to-metal transition upon doping in $α$-RuCl$_3$ by deforming the lattice in addition to the formation of heterostructure.

preprint2022arXiv

KST-GCN: A Knowledge-Driven Spatial-Temporal Graph Convolutional Network for Traffic Forecasting

While considering the spatial and temporal features of traffic, capturing the impacts of various external factors on travel is an essential step towards achieving accurate traffic forecasting. However, existing studies seldom consider external factors or neglect the effect of the complex correlations among external factors on traffic. Intuitively, knowledge graphs can naturally describe these correlations. Since knowledge graphs and traffic networks are essentially heterogeneous networks, it is challenging to integrate the information in both networks. On this background, this study presents a knowledge representation-driven traffic forecasting method based on spatial-temporal graph convolutional networks. We first construct a knowledge graph for traffic forecasting and derive knowledge representations by a knowledge representation learning method named KR-EAR. Then, we propose the Knowledge Fusion Cell (KF-Cell) to combine the knowledge and traffic features as the input of a spatial-temporal graph convolutional backbone network. Experimental results on the real-world dataset show that our strategy enhances the forecasting performances of backbones at various prediction horizons. The ablation and perturbation analysis further verify the effectiveness and robustness of the proposed method. To the best of our knowledge, this is the first study that constructs and utilizes a knowledge graph to facilitate traffic forecasting; it also offers a promising direction to integrate external information and spatial-temporal information for traffic forecasting. The source code is available at https://github.com/lehaifeng/T-GCN/tree/master/KST-GCN.

preprint2022arXiv

Observation of topological flat bands in the kagome semiconductor Nb$_3$Cl$_8$

The destructive interference of wavefunctions in a kagome lattice can give rise to topological flat bands (TFBs) with a highly degenerate state of electrons. Recently, TFBs have been observed in several kagome metals, including Fe$_3$Sn$_2$, FeSn, CoSn, and YMn$_6$Sn$_6$. Nonetheless, kagome materials that are both exfoliable and semiconducting are lacking, which seriously hinders their device applications. Herein, we show that Nb$_3$Cl$_8$, which hosts a breathing kagome lattice, is gapped out because of the absence of inversion symmetry, while the TFBs survive because of the protection of the mirror reflection symmetry. By angle-resolved photoemission spectroscopy measurements and first-principles calculations, we directly observe the TFB and a moderate band gap in Nb$_3$Cl$_8$. By mechanical exfoliation, we successfully obtain monolayers of Nb$_3$Cl$_8$ and confirm that they are stable under ambient conditions. In addition, our calculations show that monolayers of Nb$_3$Cl$_8$ have a magnetic ground state, thus providing opportunities to study the interplay between geometry, topology, and magnetism.

preprint2021arXiv

The method to increase the thrust of high Mach number Scramjets

The problem of engine unstart of scramjets has not been resolved. In this paper, the mechanism of engine unstart is discussed from the point of view of shock/shock interaction and deflagration-to-detonation transition. The shock/shock interaction leads to the nonlinear, transient and discontinuous process of the supersonic combustion flow field. This process is similar to the deflagration-to-detonation transition process. If the velocity of pre-combustion shock wave is faster than the velocity in the isolator, it will propagate upstream and cause the engine unstart. The C-J detonation velocity is defined as the stable operation boundary of scramjets, which is the maximum shock wave produced by combustion theoretically. The scramjets will work stable if the velocity in the isolator is faster than the corresponding C-J detonation velocity. The combustion characteristics and propulsive performance of scramjets is theoretically analyzed by using C-J detonation theory. For high Mach number scramjets, the velocity in the isolator is much faster than the C-J detonation velocity. Therefore, extra fuel and oxygen can be injected into the combustor to increase the thrust as long as the shock wave velocity driven by the combustion products is slower than the air velocity in the isolator. The theoretical results agree well with the existing experimental results, which can be used as a baseline for the development of scramjets.