Source author record

Xiaoguang Li

Xiaoguang Li appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computation and Language cond-mat.mtrl-sci cond-mat.mes-hall Computer Vision Artificial Intelligence eess.IV Information Retrieval Machine Learning cond-mat.str-el Cryptography and Security hep-ph math-ph math.MP math.NA math.PR Molecular Networks Multimedia nucl-th Numerical Analysis

Catalog footprint

What is connected

22works

19topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

ELAIPBench: A Benchmark for Expert-Level Artificial Intelligence Paper Understanding

While large language models (LLMs) excel at many domain-specific tasks, their ability to deeply comprehend and reason about full-length academic papers remains underexplored. Existing benchmarks often fall short of capturing such depth, either due to surface-level question design or unreliable evaluation metrics. To address this gap, we introduce ELAIPBench, a benchmark curated by domain experts to evaluate LLMs' comprehension of artificial intelligence (AI) research papers. Developed through an incentive-driven, adversarial annotation process, ELAIPBench features 403 multiple-choice questions from 137 papers. It spans three difficulty levels and emphasizes non-trivial reasoning rather than shallow retrieval. Our experiments show that the best-performing LLM achieves an accuracy of only 39.95%, far below human performance. Moreover, we observe that frontier LLMs equipped with a thinking mode or a retrieval-augmented generation (RAG) system fail to improve final results-even harming accuracy due to overthinking or noisy retrieval. These findings underscore the significant gap between current LLM capabilities and genuine comprehension of academic papers.

preprint2022arXiv

How Pre-trained Language Models Capture Factual Knowledge? A Causal-Inspired Analysis

Recently, there has been a trend to investigate the factual knowledge captured by Pre-trained Language Models (PLMs). Many works show the PLMs' ability to fill in the missing factual words in cloze-style prompts such as "Dante was born in [MASK]." However, it is still a mystery how PLMs generate the results correctly: relying on effective clues or shortcut patterns? We try to answer this question by a causal-inspired analysis that quantitatively measures and evaluates the word-level patterns that PLMs depend on to generate the missing words. We check the words that have three typical associations with the missing words: knowledge-dependent, positionally close, and highly co-occurred. Our analysis shows: (1) PLMs generate the missing factual words more by the positionally close and highly co-occurred words than the knowledge-dependent words; (2) the dependence on the knowledge-dependent words is more effective than the positionally close and highly co-occurred words. Accordingly, we conclude that the PLMs capture the factual knowledge ineffectively because of depending on the inadequate associations.

preprint2022arXiv

Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question Answering

To alleviate the data scarcity problem in training question answering systems, recent works propose additional intermediate pre-training for dense passage retrieval (DPR). However, there still remains a large discrepancy between the provided upstream signals and the downstream question-passage relevance, which leads to less improvement. To bridge this gap, we propose the HyperLink-induced Pre-training (HLP), a method to pre-train the dense retriever with the text relevance induced by hyperlink-based topology within Web documents. We demonstrate that the hyperlink-based structures of dual-link and co-mention can provide effective relevance signals for large-scale pre-training that better facilitate downstream passage retrieval. We investigate the effectiveness of our approach across a wide range of open-domain QA datasets under zero-shot, few-shot, multi-hop, and out-of-domain scenarios. The experiments show our HLP outperforms the BM25 by up to 7 points as well as other pre-training methods by more than 10 points in terms of top-20 retrieval accuracy under the zero-shot scenario. Furthermore, HLP significantly outperforms other pre-training methods under the other scenarios.

preprint2022arXiv

MISF: Multi-level Interactive Siamese Filtering for High-Fidelity Image Inpainting

Although achieving significant progress, existing deep generative inpainting methods are far from real-world applications due to the low generalization across different scenes. As a result, the generated images usually contain artifacts or the filled pixels differ greatly from the ground truth. Image-level predictive filtering is a widely used image restoration technique, predicting suitable kernels adaptively according to different input scenes. Inspired by this inherent advantage, we explore the possibility of addressing image inpainting as a filtering task. To this end, we first study the advantages and challenges of image-level predictive filtering for image inpainting: the method can preserve local structures and avoid artifacts but fails to fill large missing areas. Then, we propose semantic filtering by conducting filtering on the deep feature level, which fills the missing semantic information but fails to recover the details. To address the issues while adopting the respective advantages, we propose a novel filtering technique, i.e., Multilevel Interactive Siamese Filtering (MISF), which contains two branches: kernel prediction branch (KPB) and semantic & image filtering branch (SIFB). These two branches are interactively linked: SIFB provides multi-level features for KPB while KPB predicts dynamic kernels for SIFB. As a result, the final method takes the advantage of effective semantic & image-level filling for high-fidelity inpainting. We validate our method on three challenging datasets, i.e., Dunhuang, Places2, and CelebA. Our method outperforms state-of-the-art baselines on four metrics, i.e., L1, PSNR, SSIM, and LPIPS. Please try the released code and model at https://github.com/tsingqguo/misf.

preprint2022arXiv

Nonreciprocal dynamics of ferrimagnetic bimerons

Magnetic bimerons are topologically nontrivial spin textures in in-plane easy-axis magnets, which can be used as particle-like information carriers. Here, we report a theoretical study on the nonreciprocal dynamics of asymmetrical ferrimagnetic (FiM) bimerons induced by spin currents. The FiM bimerons have the ability to move at a speed of kilometers per second and do not show the skyrmion Hall effect at the angular momentum compensation point. Our micromagnetic simulations and analytical results demonstrate that spin currents are able to induce the nonreciprocal transport and a drift motion of the FiM bimeron even if the system is at the angular momentum compensation point. By analyzing the current-induced effective fields, we find that the nonreciprocal transport is attributed to the asymmetry of the bimeron structure. Our results are useful for understanding the physics of bimerons in ferrimagnets and may provide guidelines for building bimeron-based spintronic devices.

preprint2022arXiv

Read before Generate! Faithful Long Form Question Answering with Machine Reading

Long-form question answering (LFQA) aims to generate a paragraph-length answer for a given question. While current work on LFQA using large pre-trained model for generation are effective at producing fluent and somewhat relevant content, one primary challenge lies in how to generate a faithful answer that has less hallucinated content. We propose a new end-to-end framework that jointly models answer generation and machine reading. The key idea is to augment the generation model with fine-grained, answer-related salient information which can be viewed as an emphasis on faithful facts. State-of-the-art results on two LFQA datasets, ELI5 and MS MARCO, demonstrate the effectiveness of our method, in comparison with strong baselines on automatic and human evaluation metrics. A detailed analysis further proves the competency of our methods in generating fluent, relevant, and more faithful answers.

preprint2022arXiv

Stabilization and application of asymmetric Néel skyrmions in hybrid nanostructures

Increasing amounts of information force the continuous improvement of information storage and processing technologies, further device miniaturization, and their efficiency increase. Magnetic skyrmions, topological quasiparticles, and the smallest stable magnetic textures possess intriguing properties and potential for data storage applications. Hybrid nanostructures with elements of different magnetization orientations can offer additional advantages for developing skyrmion-based spintronic and magnonic devices. We show that an Néel-type skyrmion confined within a nanodot placed on top of a ferromagnetic stripe produces a unique and compelling platform for exploring mutual coupling between magnetization textures. The skyrmion induces an imprint upon the stripe, which, in turn, asymmetrically squeezes the skyrmion in the dot, increasing their size and the range of skyrmion stability for small values of DMI, as well as introducing skyrmion bi-stability. At the end, we present a proof-of-concept technique for unconstrained transport of a skyrmion along a racetrack based on proposed hybrid systems. Our results demonstrate a hybrid structure that is promising for applications in magnonics and spintronics.

preprint2021arXiv

Non-invasive Self-attention for Side Information Fusion in Sequential Recommendation

Sequential recommender systems aim to model users' evolving interests from their historical behaviors, and hence make customized time-relevant recommendations. Compared with traditional models, deep learning approaches such as CNN and RNN have achieved remarkable advancements in recommendation tasks. Recently, the BERT framework also emerges as a promising method, benefited from its self-attention mechanism in processing sequential data. However, one limitation of the original BERT framework is that it only considers one input source of the natural language tokens. It is still an open question to leverage various types of information under the BERT framework. Nonetheless, it is intuitively appealing to utilize other side information, such as item category or tag, for more comprehensive depictions and better recommendations. In our pilot experiments, we found naive approaches, which directly fuse types of side information into the item embeddings, usually bring very little or even negative effects. Therefore, in this paper, we propose the NOninVasive self-attention mechanism (NOVA) to leverage side information effectively under the BERT framework. NOVA makes use of side information to generate better attention distribution, rather than directly altering the item embedding, which may cause information overwhelming. We validate the NOVA-BERT model on both public and commercial datasets, and our method can stably outperform the state-of-the-art models with negligible computational overheads.

preprint2020arXiv

A lateral semicircular canal segmentation based geometric calibration for human temporal bone CT Image

Computed Tomography (CT) of the temporal bone has become an important method for diagnosing ear diseases. Due to the different posture of the subject and the settings of CT scanners, the CT image of the human temporal bone should be geometrically calibrated to ensure the symmetry of the bilateral anatomical structure. Manual calibration is a time-consuming task for radiologists and an important pre-processing step for further computer-aided CT analysis. We propose an automatic calibration algorithm for temporal bone CT images. The lateral semicircular canals (LSCs) are segmented as anchors at first. Then, we define a standard 3D coordinate system. The key step is the LSC segmentation. We design a novel 3D LSC segmentation encoder-decoder network, which introduces a 3D dilated convolution and a multi-pooling scheme for feature fusion in the encoding stage. The experimental results show that our LSC segmentation network achieved a higher segmentation accuracy. Our proposed method can help to perform calibration of temporal bone CT images efficiently.

preprint2020arXiv

Blur-Attention: A boosting mechanism for non-uniform blurred image restoration

Dynamic scene deblurring is a challenging problem in computer vision. It is difficult to accurately estimate the spatially varying blur kernel by traditional methods. Data-driven-based methods usually employ kernel-free end-to-end mapping schemes, which are apt to overlook the kernel estimation. To address this issue, we propose a blur-attention module to dynamically capture the spatially varying features of non-uniform blurred images. The module consists of a DenseBlock unit and a spatial attention unit with multi-pooling feature fusion, which can effectively extract complex spatially varying blur features. We design a multi-level residual connection structure to connect multiple blur-attention modules to form a blur-attention network. By introducing the blur-attention network into a conditional generation adversarial framework, we propose an end-to-end blind motion deblurring method, namely Blur-Attention-GAN (BAG), for a single image. Our method can adaptively select the weights of the extracted features according to the spatially varying blur features, and dynamically restore the images. Experimental results show that the deblurring capability of our method achieved outstanding objective performance in terms of PSNR, SSIM, and subjective visual quality. Furthermore, by visualizing the features extracted by the blur-attention module, comprehensive discussions are provided on its effectiveness.

preprint2020arXiv

DUMA: Reading Comprehension with Transposition Thinking

Multi-choice Machine Reading Comprehension (MRC) requires model to decide the correct answer from a set of answer options when given a passage and a question. Thus in addition to a powerful Pre-trained Language Model (PrLM) as encoder, multi-choice MRC especially relies on a matching network design which is supposed to effectively capture the relationships among the triplet of passage, question and answers. While the newer and more powerful PrLMs have shown their mightiness even without the support from a matching network, we propose a new DUal Multi-head Co-Attention (DUMA) model, which is inspired by human's transposition thinking process solving the multi-choice MRC problem: respectively considering each other's focus from the standpoint of passage and question. The proposed DUMA has been shown effective and is capable of generally promoting PrLMs. Our proposed method is evaluated on two benchmark multi-choice MRC tasks, DREAM and RACE, showing that in terms of powerful PrLMs, DUMA can still boost the model to reach new state-of-the-art performance.

preprint2020arXiv

HopRetriever: Retrieve Hops over Wikipedia to Answer Complex Questions

Collecting supporting evidence from large corpora of text (e.g., Wikipedia) is of great challenge for open-domain Question Answering (QA). Especially, for multi-hop open-domain QA, scattered evidence pieces are required to be gathered together to support the answer extraction. In this paper, we propose a new retrieval target, hop, to collect the hidden reasoning evidence from Wikipedia for complex question answering. Specifically, the hop in this paper is defined as the combination of a hyperlink and the corresponding outbound link document. The hyperlink is encoded as the mention embedding which models the structured knowledge of how the outbound link entity is mentioned in the textual context, and the corresponding outbound link document is encoded as the document embedding representing the unstructured knowledge within it. Accordingly, we build HopRetriever which retrieves hops over Wikipedia to answer complex questions. Experiments on the HotpotQA dataset demonstrate that HopRetriever outperforms previously published evidence retrieval methods by large margins. Moreover, our approach also yields quantifiable interpretations of the evidence collection process.

preprint2020arXiv

Mitigating Query-Flooding Parameter Duplication Attack on Regression Models with High-Dimensional Gaussian Mechanism

Public intelligent services enabled by machine learning algorithms are vulnerable to model extraction attacks that can steal confidential information of the learning models through public queries. Differential privacy (DP) has been considered a promising technique to mitigate this attack. However, we find that the vulnerability persists when regression models are being protected by current DP solutions. We show that the adversary can launch a query-flooding parameter duplication (QPD) attack to infer the model information by repeated queries. To defend against the QPD attack on logistic and linear regression models, we propose a novel High-Dimensional Gaussian (HDG) mechanism to prevent unauthorized information disclosure without interrupting the intended services. In contrast to prior work, the proposed HDG mechanism will dynamically generate the privacy budget and random noise for different queries and their results to enhance the obfuscation. Besides, for the first time, HDG enables an optimal privacy budget allocation that automatically determines the minimum amount of noise to be added per user-desired privacy level on each dimension. We comprehensively evaluate the performance of HDG using real-world datasets and shows that HDG effectively mitigates the QPD attack while satisfying the privacy requirements. We also prepare to open-source the relevant codes to the community for further research.

preprint2020arXiv

The Graph Limit of The Minimizer of The Onsager-Machlup Functional and Its Computation

The Onsager-Machlup (OM) functional is well-known for characterizing the most probable transition path of a diffusion process with non-vanishing noise. However, it suffers from a notorious issue that the functional is unbounded below when the specified transition time $T$ goes to infinity. This hinders the interpretation of the results obtained by minimizing the OM functional. We provide a new perspective on this issue. Under mild conditions, we show that although the infimum of the OM functional becomes unbounded when $T$ goes to infinity, the sequence of minimizers does contain convergent subsequences on the space of curves. The graph limit of this minimizing subsequence is an extremal of the abbreviated action functional, which is related to the OM functional via the Maupertuis principle with an optimal energy. We further propose an energy-climbing geometric minimization algorithm (EGMA) which identifies the optimal energy and the graph limit of the transition path simultaneously. This algorithm is successfully applied to several typical examples in rare event studies. Some interesting comparisons with the Freidlin-Wentzell action functional are also made.

preprint2014arXiv

Finding Transition Pathways on Manifolds

We consider noise-induced transition paths in randomly perturbed dynami- cal systems on a smooth manifold. The classical Freidlin-Wentzell large devia- tion theory in Euclidean spaces is generalized and new forms of action functionals are derived in the spaces of functions and the space of curves to accommodate the intrinsic constraints associated with the manifold. Numerical meth- ods are proposed to compute the minimum action paths for the systems with constraints. The examples of conformational transition paths for a single and double rod molecules arising in polymer science are numerically investigated.

preprint2014arXiv

Proximity Effects in Topological Insulator Heterostructures

Topological insulators (TIs) are bulk insulators that possess robust helical conducting states along their interfaces with conventional insulators. A tremendous research effort has recently been devoted to TI-based heterostructures, in which conventional proximity effects give rise to a series of exotic physical phenomena. This paper reviews our recent works on the potential existence of topological proximity effects at the interface between a topological insulator and a normal insulator or other topologically trivial systems. Using first-principles approaches, we have established the tunability of the vertical location of the topological helical state via intriguing dual-proximity effects. To further elucidate the control parameters of this effect, we have used the graphene-based heterostructures as prototypical systems to reveal a more complete phase diagram. On the application side of the topological helical states, we have presented a catalysis example, where the topological helical state plays an essential role in facilitating surface reactions by serving as an effective electron bath. These discoveries lay the foundation for accurate manipulation of the real space properties of the topological helical state in TI-based heterostructures and pave the way for realization of the salient functionality of topological insulators in future device applications.

preprint2014arXiv

The examination of stable charge states of vacancies in Cu2ZnSnS4

The stable charge states of vacancies in the solar cell absorber material Cu2ZnSnS4 are investigated using Kohn-Sham (KS) defect-induced single particle levels analysis by concerning the screened Coulomb hybrid functional. We found out that the Cu, Zn and S vacancies (denoted by VCu, VZn, VS) do not induce single particle defect levels in the vicinity of the band gap thus each of them has only one stable charge state corresponding to the fully occupied valence band VCu1-, VZn2- and VS0, respectively (and therefore cannot account for any defect transition energy levels). The Sn vacancy (VSn) has three stable charge states VSn2-, VSn3- and VSn4-, which may account for two charge transition energy levels. By comparing with previous charge transition energy levels studies, our results indicate that the examination of stable charge states is a necessary and important step which should be done before charge transition energy levels calculations.

preprint2013arXiv

Colossal Magnetoresistance Manganites and Related Prototype Devices

We review colossal magnetoresistance in single phase manganites, as related to the field sensitive spin charge interactions and phase separation; the rectifying property and negative/positive magnetoresistance in manganite/Nb:SrTiO3 pn junctions in relation to the special interface electronic structure; magnetoelectric coupling in manganite/ferroelectric structures that takes advantage of strain, carrier density, and magnetic field sensitivity; tunneling magnetoresistance in tunnel junctions with dielectric, ferroelectric, and organic semiconductor spacers using the fully spin polarized nature of manganites; and the effect of particle size on magnetic properties in manganite nanoparticles

preprint2013arXiv

Topological Proximity Effects in Graphene Nanoribbon Heterostructures

Topological insulators (TI) are bulk insulators that possess robust chiral conducting states along their interfaces with normal insulators. A tremendous research effort has recently been devoted to TI-based heterostructures, in which conventional proximity effects give rise to many exotic physical phenomena. Here we establish the potential existence of "topological proximity effects" at the interface of a topological graphene nanoribbon (GNR) and a normal GNR. Specifically, we show that the location of the topological edge states exhibits versatile tunability as a function of the interface orientation, as well as the strengths of the interface coupling and spin-orbit coupling in the normal GNR. For zigzag and bearded GNRs, the topological edge state can be tuned to be either at the interface or outer edge of the normal ribbon. For armchair GNR, the potential location of the topological edge state can be further enriched to be at the edge of or within the normal ribbon, at the interface, or diving into the topological GNR. We also discuss potential experimental realization of the predicted topological proximity effects, which may pave the way for integrating the salient functionality of TI and graphene in future device applications.

preprint2013arXiv

Tuning the vertical location of helical surface states in topological insulator heterostructures via dual-proximity effects

In integrating topological insulators (TIs) with conventional materials, one crucial issue is how the topological surface states (TSS) will behave in such heterostructures. We use first-principles approaches to establish accurate tunability of the vertical location of the TSS via intriguing dual-proximity effects. By depositing a conventional insulator (CI) overlayer onto a TI substrate (Bi2Se3 or Bi2Te3), we demonstrate that, the TSS can float to the top of the CI film, or stay put at the CI/TI interface, or be pushed down deeper into the otherwise structurally homogeneous TI substrate. These contrasting behaviors imply a rich variety of possible quantum phase transitions in the hybrid systems, dictated by key material-specific properties of the CI. These discoveries lay the foundation for accurate manipulation of the real space properties of TSS in TI heterostructures of diverse technological significance.

preprint2012arXiv

Transition Path, Quasi-potential Energy Landscape and Stability of Genetic Switches

One of the fundamental cellular processes governed by genetic regulatory networks in cells is the transition among different states under the intrinsic and extrinsic noise. Based on a two-state genetic switching model with positive feedback, we develop a framework to understand the metastability in gene expressions. This framework is comprised of identifying the transition path, reconstructing the global quasi-potential energy landscape, analyzing the uphill and downhill transition paths, etc. It is successfully utilized to investigate the stability of genetic switching models and fluctuation properties in different regimes of gene expression with positive feedback. The quasi-potential energy landscape, which is the rationalized version of Waddington potential, provides a quantitative tool to understand the metastability in more general biological processes with intrinsic noise.

preprint2010arXiv

Meson Emission Model of Psi to N Nbar m Charmonium Strong Decays

In this paper we consider a sequential "meson emission" mechanism for charmonium decays of the type Psi -> N Nbar m, where Psi is a generic charmonium state, N is a nucleon and m is a light meson. This decay mechanism, which may not be dominant in general, assumes that an NNbar pair is created during charmonium annihilation, and the light meson m is emitted from the outgoing nucleon or antinucleon line. A straightforward generalization of this model can incorporate intermediate N* resonances. We derive Dalitz plot event densities for the cases Psi = eta_c, J/psi, chi_c0, chi_c1} and psi' and m = pi0, f0 and omega (and implicitly, any 0^{-+}, 0^{++} or 1^{--} final light meson). It may be possible to separate the contribution of this decay mechanism to the full decay amplitude through characteristic event densities. For the decay subset Psi -> p pbar pi0 the two model parameters are known, so we are able to predict absolute numerical partial widths for Gamma(Psi -> p pbar pi0). In the specific case J/psi -> p pbar pi0 the predicted partial width and M_{p pi0} event distribution are intriguingly close to experiment. We also consider the possibility of scalar meson and glueball searches in Psi -> p pbar f0. If the meson emission contributions to Psi -> N Nbar m decays can be isolated and quantified, they can be used to estimate meson-nucleon strong couplings {g_NNm}, which are typically poorly known, and are a crucial input in meson exchange models of the NN interaction. The determination of g_NNpi from Jψ-> p pbar pi0 and the (poorly known) g_NNomega and the anomalous "strong magnetic" coupling kappa_{NNomega} from J/psi -> p pbar omega are considered as examples.

Xiaoguang Li

What is connected

Connect this record

See the researcher in context

Building this map preview

22 published item(s)

ELAIPBench: A Benchmark for Expert-Level Artificial Intelligence Paper Understanding

How Pre-trained Language Models Capture Factual Knowledge? A Causal-Inspired Analysis

Hyperlink-induced Pre-training for Passage Retrieval in Open-domain Question Answering

MISF: Multi-level Interactive Siamese Filtering for High-Fidelity Image Inpainting

Nonreciprocal dynamics of ferrimagnetic bimerons

Read before Generate! Faithful Long Form Question Answering with Machine Reading

Stabilization and application of asymmetric Néel skyrmions in hybrid nanostructures

Non-invasive Self-attention for Side Information Fusion in Sequential Recommendation

A lateral semicircular canal segmentation based geometric calibration for human temporal bone CT Image

Blur-Attention: A boosting mechanism for non-uniform blurred image restoration

DUMA: Reading Comprehension with Transposition Thinking

HopRetriever: Retrieve Hops over Wikipedia to Answer Complex Questions

Mitigating Query-Flooding Parameter Duplication Attack on Regression Models with High-Dimensional Gaussian Mechanism

The Graph Limit of The Minimizer of The Onsager-Machlup Functional and Its Computation

Finding Transition Pathways on Manifolds

Proximity Effects in Topological Insulator Heterostructures

The examination of stable charge states of vacancies in Cu2ZnSnS4

Colossal Magnetoresistance Manganites and Related Prototype Devices

Topological Proximity Effects in Graphene Nanoribbon Heterostructures

Tuning the vertical location of helical surface states in topological insulator heterostructures via dual-proximity effects

Transition Path, Quasi-potential Energy Landscape and Stability of Genetic Switches

Meson Emission Model of Psi to N Nbar m Charmonium Strong Decays