Source author record

Wenhao Liu

Wenhao Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Computation and Language, astro-ph.GA, cond-mat.mtrl-sci, Artificial Intelligence, astro-ph.HE, cond-mat.str-el, cond-mat.mes-hall, astro-ph.CO, Computer Vision, cond-mat.supr-con, Cryptography and Security, Human-Computer Interaction, Information Retrieval

Catalog footprint

What is connected

24 works

13 topics

4 close collaborators

Preprint · 2026 · arXiv

Benchmark^2: Systematic Evaluation of LLM Benchmarks

The rapid proliferation of benchmarks for evaluating large language models (LLMs) has created an urgent need for systematic methods to assess benchmark quality itself. We propose Benchmark^2, a comprehensive framework comprising three complementary metrics: (1) Cross-Benchmark Ranking Consistency, measuring whether a benchmark produces model rankings aligned with peer benchmarks; (2) Discriminability Score, quantifying a benchmark's ability to differentiate between models; and (3) Capability Alignment Deviation, identifying problematic instances where stronger models fail but weaker models succeed within the same model family. We conduct extensive experiments across 15 benchmarks spanning mathematics, reasoning, and knowledge domains, evaluating 11 LLMs across four model families. Our analysis reveals significant quality variations among existing benchmarks and demonstrates that selective benchmark construction based on our metrics can achieve comparable evaluation performance with substantially reduced test sets.

Preprint · 2025 · arXiv

Atomic-scale spin sensing of a 2D $d$-wave altermagnet via helical tunneling

Altermagnetism simultaneously possesses nonrelativistic spin responses and zero net magnetization, thus combining advantages of ferromagnetism and antiferromagnetism. This superiority originates from its unique dual feature, i.e., opposite-magnetic sublattices in real space and alternating spin polarization in momentum space enforced by the same crystal symmetry. Therefore, the determination of an altermagnetic order and its unique spin response inherently necessitates atomic-scale spin-resolved measurements in real and momentum spaces, an experimental milestone yet to be achieved. Here, by using the helical edge (hinge) modes of a higher-order topological insulator as the spin sensor, we realize spin-resolved scanning tunneling microscopy which enables us to pin down the dual-space feature of a layered $d$-wave altermagnet, KV$_2$Se$_2$O. In real space, atomic-registered mapping demonstrates the checkerboard antiferromagnetic order together with density-wave lattice modulation, and in momentum space, spin-resolved spectroscopic imaging provides a direct visualization of $d$-wave spin splitting of the band structure. Critically, using this new topology-guaranteed spin filter we directly reveal the unidirectional, spin-polarized quasiparticle excitations originating from the crystal symmetry-paired X and Y valleys around opposite magnetic sublattices simultaneously, the unique spin response for $d$-wave altermagnetism. Our experiments establish a solid basis for the exploration and utilization of altermagnetism in layered materials and further facilitate access to atomic-scale spin sensing and manipulation of 2D quantum materials.

Preprint · 2025 · arXiv

Observation of robust one-dimensional edge channels in a three-dimensional quantum spin Hall insulator

Topologically protected edge channels show prospects for quantum devices. They have been found experimentally in two-dimensional (2D) quantum spin Hall insulators (QSHIs), weak topological insulators and higher-order topological insulators (HOTIs), but the number of materials realizing these topologies is still quite limited. Here, we provide evidence for topological edge states within a novel topology named three-dimensional (3D) QSHIs. Its topology originates solely from a nonzero $S_z$ spin Chern number for each $k_z$ plane of the crystal and is realized in bulk $\alpha$-Bi$_4$I$_4$ with trivial symmetry indicators, as we show by density functional theory calculations. We experimentally observe the related edge states at each type of monolayer and bilayer step of this material by scanning tunneling microscopy. Consistently, the edge states are neither interrupted nor backscattered by defects at the step edges, corroborating their helical character as expected from the nontrivial topology. Furthermore, two individual edge channels are directly observed at bilayer steps without visible interaction gap opening, demonstrating the robustness of these edge modes against vertical stacking. Our results establish $\alpha$-Bi$_4$I$_4$ as the first material realization of a 3D QSHI whose definition goes beyond the scope of topological symmetry indicators, and provide a pathway for realizing nearly-quantized spin Hall conductivity per unit cell in a bulk crystal.

Preprint · 2024 · arXiv

The Dust Attenuation Scaling Relation of Star-Forming Galaxies in the EAGLE Simulations

Dust attenuation in star-forming galaxies (SFGs), as parameterized by the infrared excess (IRX $\equiv L_{\rm IR}/L_{\rm UV}$), is found to be tightly correlated with star formation rate (SFR), metallicity and galaxy size, following a universal IRX relation up to $z=3$. This scaling relation can provide a fundamental constraint for theoretical models to reconcile galaxy star formation, chemical enrichment, and structural evolution across cosmic time. We attempt to reproduce the universal IRX relation over $0.1\leq z\leq 2.5$ using the EAGLE hydrodynamical simulations and examine sensitive parameters in determining galaxy dust attenuation. Our findings show that while the predicted universal IRX relation from EAGLE approximately aligns with observations at $z\leq 0.5$, noticeable disparities arise at different stellar masses and higher redshifts. Specifically, we investigate how modifying various galaxy parameters can affect the predicted universal IRX relation in comparison to the observed data. We demonstrate that the simulated gas-phase metallicity is the critical quantity for the shape of the predicted universal IRX relation. We find that the influence of the infrared luminosity and infrared excess is less important while galaxy size has virtually no significant effect. Overall, the EAGLE simulations are not able to replicate some of the observed characteristics between IRX and galaxy parameters of SFGs, emphasizing the need for further investigation and testing for our current state-of-the-art theoretical models.

24 published items

Benchmark^2: Systematic Evaluation of LLM Benchmarks

Atomic-scale spin sensing of a 2D $d$-wave altermagnet via helical tunneling

Observation of robust one-dimensional edge channels in a three-dimensional quantum spin Hall insulator

The Dust Attenuation Scaling Relation of Star-Forming Galaxies in the EAGLE Simulations

A Generative Language Model for Few-shot Aspect-Based Sentiment Analysis

A three-stage magnetic phase transition revealed in ultrahigh-quality van der Waals magnet CrSBr

CaPE: Contrastive Parameter Ensembling for Reducing Hallucination in Abstractive Summarization

Chandra view of Abell 407: the central compact group of galaxies and the interaction between the radio AGN and the ICM

Converse: A Tree-Based Modular Task-Oriented Dialogue System

DialFact: A Benchmark for Fact-Checking in Dialogue

Exploring Neural Models for Query-Focused Summarization

MixQG: Neural Question Generation with Mixed Answer Types

Open Vocabulary Object Detection with Pseudo Bounding-Box Labels

QAConv: Question Answering on Informative Conversations

QAFactEval: Improved QA-Based Factual Consistency Evaluation for Summarization

Quiz Design Task: Helping Teachers Create Quizzes with Automated Question Generation

Structure Extraction in Task-Oriented Dialogues with Slot Clustering

Submillimetre galaxies in two massive protoclusters at z = 2.24: witnessing the enrichment of extreme starbursts in the outskirts of HAE density peaks

Systematic biases in determining dust attenuation curves through galaxy SED fitting

The Physical Properties of Star-Forming Galaxies with Strong [O III] Lines at z=3.25

Enhanced Superconductivity in the Se-substituted 1T-PdTe$_2$

AGN feedback in the FR II galaxy 3C 220.1

Efficient Certificateless Signcryption Tag-KEMs for Resource-constrained Devices

Evidence for a very low-column density hole in the Galactic halo in the direction of the high latitude molecular cloud MBM 16