Researcher profile

Yanzeng Li

Yanzeng Li contributes to research discovery and scholarly infrastructure.

Researcher
Affiliation not imported
Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
7 works
0 followers
4 topics
4 close collaborators

Published work

7 published items

preprint, 2026, arXiv

Detecting Hallucinations in Retrieval-Augmented Generation via Semantic-level Internal Reasoning Graph

Retrieval-augmented generation (RAG) systems based on large language models (LLMs) have made significant progress. They can effectively reduce factuality hallucinations, but faithfulness hallucinations still exist. Previous methods for detecting faithfulness hallucinations either neglect to capture the models' internal reasoning processes or handle those features coarsely, making it difficult for discriminators to learn. This paper proposes a semantic-level internal reasoning graph-based method for detecting faithfulness hallucinations. Specifically, we first extend the layer-wise relevance propagation algorithm from the token level to the semantic level, constructing an internal reasoning graph based on attribution vectors. This provides a more faithful semantic-level representation of dependency. Furthermore, we design a general framework based on a small pre-trained language model to utilize the dependencies in the LLM's reasoning for training and hallucination detection, which can dynamically adjust the pass rate of correct samples through a threshold. Experimental results demonstrate that our method achieves better overall performance compared to state-of-the-art baselines on RAGTruth and Dolly-15k.

preprint, 2022, arXiv

Crake: Causal-Enhanced Table-Filler for Question Answering over Large Scale Knowledge Base

Semantic parsing solves knowledge base (KB) question answering (KBQA) by composing a KB query, which generally involves node extraction (NE) and graph composition (GC) to detect and connect related nodes in a query. Despite the strong causal effects between NE and GC, previous works fail to directly model such causalities in their pipeline, hindering the learning of subtask correlations. Also, the sequence-generation process for GC in previous works induces ambiguity and exposure bias, which further harms accuracy. In this work, we formalize semantic parsing into two stages. In the first stage (graph structure generation), we propose a causal-enhanced table-filler to overcome the issues in sequence modelling and to learn the internal causalities. In the second stage (relation extraction), an efficient beam-search algorithm is presented to scale to complex queries on large-scale KBs. Experiments on LC-QuAD 1.0 indicate that our method surpasses the previous state of the art by a large margin (17%) while remaining time- and space-efficient. The code and models are available at https://github.com/AOZMH/Crake.

preprint, 2022, arXiv

VGStore: A Multimodal Extension to SPARQL for Querying RDF Scene Graph

Semantic Web technology has successfully facilitated many RDF models with rich data representation methods. It also has the potential to represent and store multimodal knowledge bases such as multimodal scene graphs. However, most existing query languages, especially SPARQL, barely explore implicit multimodal relationships such as semantic similarity and spatial relations. We first explored this issue by organizing a large-scale scene graph dataset, namely Visual Genome, in an RDF graph database. Based on the proposed RDF-stored multimodal scene graph, we extended SPARQL queries to answer questions containing relational reasoning about color, spatial relations, etc. A further demo (i.e., VGStore) shows the effectiveness of customized queries and of displaying multimodal data.

preprint, 2020, arXiv

Enhancing Pre-trained Chinese Character Representation with Word-aligned Attention

Most Chinese pre-trained models take the character as the basic unit and learn representations according to the character's external contexts, ignoring the semantics expressed in the word, which is the smallest meaningful utterance in Chinese. Hence, we propose a novel word-aligned attention to exploit explicit word information, which is complementary to various character-based Chinese pre-trained language models. Specifically, we devise a pooling mechanism to align the character-level attention to the word level and propose to alleviate the potential issue of segmentation error propagation by multi-source information fusion. As a result, word and character information are explicitly integrated during the fine-tuning procedure. Experimental results on five Chinese NLP benchmark tasks demonstrate that our model brings a further significant gain over several pre-trained models.

preprint, 2020, arXiv

Metasurfaces for the Infrared Spectral Range Fabricated Using Two-Photon Polymerization

Fabrication of metasurfaces is often time-consuming and expensive, involving complex lithographic processes. The maskless fabrication of metasurfaces composed of rectangular Au bars is reported as a suitable alternative, providing cost-effective, rapid prototyping of metasurfaces. The investigated metasurfaces were fabricated using a simple three-step process which is discussed in detail. The fabrication process establishes a simple method for producing high-fidelity 2D patterns suitable to synthesize metasurfaces for chemical sensing, beam steering, and perfect reflection/transmission. Comprehensive polarization-sensitive reflection data reveal multiple resonances in the infrared spectral range. In addition to the dipole and substrate resonances, a resonance which is attributed to a coupling between the excitation of the metasurface and the substrate phonon mode is observed.

preprint, 2020, arXiv

Reciprocal plasmonic metasurfaces: Theory and applications

A new configuration for metasurface construction is presented to achieve multi-functional capabilities including perfect absorption, bio/chem sensing, and surface-mode lasing. The reciprocal plasmonic metasurfaces discussed here are composed of two plasmonic surfaces of reciprocal geometries separated by a dielectric spacer. Compared to conventional metasurfaces, this simple geometry exhibits enhanced optical performance. The discussed reciprocal metasurface design further enables effective structural optimization and allows for simple and scalable fabrication. The physical principle and potential applications of the reciprocal plasmonic metasurfaces are demonstrated using numerical and analytical approaches.

preprint, 2019, arXiv

THz optical properties of polymethacrylates after thermal annealing

Polymer-based stereolithographic additive manufacturing has been established for the rapid and low-cost fabrication of THz optical components due to its ability to construct complex 3D geometries with high resolution. For polymer-based or integrated optics, thermal annealing processes are often used to optimize material properties. However, despite the growing interest in THz optics fabricated using stereolithography, the effects of thermal annealing on the THz dielectric properties of polymethacrylates compatible with stereolithography have not yet been studied. In this manuscript we report on the THz ellipsometric response of thermally annealed polymethacrylates prepared using UV polymerization. Our findings indicate that the investigated polymethacrylates maintain a stable optical response in the THz spectral range from 650 to 950 GHz after thermal annealing at temperatures up to 70 °C for several hours.