Researcher profile

Xu Hou

Xu Hou contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 13 - UnverifiedVerification L1Unclaimed author
2works
0followers
2topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

2 published item(s)

preprint2026arXiv

ELMM: Efficient Lightweight Multimodal Large Language Models for Multimodal Knowledge Graph Completion

Multimodal Knowledge Graphs (MKGs) extend traditional knowledge graphs by incorporating visual and textual modalities, enabling richer and more expressive entity representations. However, existing MKGs often suffer from incompleteness, which hinder their effectiveness in downstream tasks. Therefore, multimodal knowledge graph completion (MKGC) task is receiving increasing attention. While large language models (LLMs) have shown promise for knowledge graph completion (KGC), their application to the multimodal setting remains underexplored. Moreover, applying Multimodal Large Language Models (MLLMs) to the task of MKGC introduces significant challenges: (1) the large number of image tokens per entity leads to semantic noise and modality conflicts, and (2) the high computational cost of processing large token inputs. To address these issues, we propose Efficient Lightweight Multimodal Large Language Models (ELMM) for MKGC. ELMM proposes a Multi-view Visual Token Compressor (MVTC) based on multi-head attention mechanism, which adaptively compresses image tokens from both textual and visual views, thereby effectively reducing redundancy while retaining necessary information and avoiding modality conflicts. Additionally, we design an attention pruning strategy to remove redundant attention layers from MLLMs, thereby significantly reducing the inference cost. We further introduce a linear projection to compensate for the performance degradation caused by pruning. Extensive experiments on four benchmark datasets demonstrate that ELMM achieves state-of-the-art performance.

preprint2020arXiv

Creating topological polar structure in a nonpolar matter

Nontrivial topological structures offer rich playground in condensed matter physics including fluid dynamics, superconductivity, and ferromagnetism, and they promise alternative device configurations for post-Moore spintronics and electronics. Indeed, magnetic skyrmions are actively pursued for high-density data storage, while polar vortices with exotic negative capacitance may enable ultralow power consumption in microelectronics. Following extensive investigations on a variety of magnetic textures including vortices, domain walls and skyrmions in the past decades, studies on polar topologies have taken off in recent years, resulting in discoveries of closure domains, vortices, and skyrmions in ferroelectric materials. Nevertheless, the atomic-scale creation of topological polar structures is largely confined in a single ferroelectric system, PbTiO3 (PTO) with large polarization, casting doubt on the generality of polar topologies and limiting their potential applications. In this work, we successfully create previously unrealized atomic-scale polar antivortices in the nominally nonpolar SrTiO3 (STO), expanding the reaches of topological structures and completing an important missing link in polar topologies. The work shed considerable new insight into the formation of topological polar structures, and offers guidance in searching for new polar textures.