Researcher profile

Yiqun Liu

Yiqun Liu contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
26works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

26 published item(s)

preprint2026arXiv

Decoupling Knowledge and Task Subspaces for Composable Parametric Retrieval Augmented Generation

Parametric Retrieval-Augmented Generation (PRAG) encodes external documents into lightweight parameter modules that can be retrieved and merged at inference time, offering a promising alternative to in-context retrieval augmentation. Despite its potential, many PRAG implementations train document adapters with task-supervised objectives, which may cause each adapter to encode both document-specific facts and reusable task-solving behavior. This entanglement may make adapter composition less reliable: when multiple adapters are merged at inference time, their overlapping task behaviors can accumulate together with document-specific updates, potentially making the merged adapter less stable and less focused on the intended document knowledge. To examine this issue, we explore Orthogonal Subspace Decomposition (OSD), an adapter-training setup that separates reusable task behavior from document-specific knowledge adapters. Concretely, we first train a Task LoRA to capture reusable task behavior, and then train document LoRAs to encode document-specific knowledge in a orthogonal subspace. This setup provides a controlled way to examine how orthogonalizing task and document LoRA updates affects adapter composition in multi-document PRAG. Experiments across multiple knowledge-intensive tasks and model scales suggest that this orthogonalization strategy can improve compositional robustness in parametric RAG, especially when multiple document adapters are merged.

preprint2026arXiv

Enhancing Judgment Document Generation via Agentic Legal Information Collection and Rubric-Guided Optimization

Automating the drafting of judgment documents is pivotal to judicial efficiency, yet it remains challenging due to the dual requirements of comprehensive retrieval of legal information and rigorous logical reasoning. Existing approaches, typically relying on standard Retrieval-Augmented Generation and Supervised Fine-Tuning, often suffer from insufficient evidence recall, hallucinated statutory references, and logically flawed legal reasoning. To bridge this gap, we propose Judge-R1, a unified framework designed to enhance LLM-based judgment document generation by jointly improving legal information collection and judgment document generation. First, we introduce Agentic Legal Information Collection, which employs a dynamic planning agent to retrieve precise statutes and precedents from multiple sources. Second, we implement Rubric-Guided Optimization, a reinforcement learning phase utilizing Group Relative Policy Optimization (GRPO) with a comprehensive legal reward function to enforce adherence to judicial standards and reasoning logic. Extensive experiments on the JuDGE benchmark demonstrate that Judge-R1 significantly outperforms state-of-the-art baselines in both legal accuracy and generation quality.

preprint2026arXiv

How do Humans Process AI-generated Hallucination Contents: a Neuroimaging Study

While AI-generated hallucinations pose considerable risks, the underlying cognitive mechanisms by which humans can successfully recognize or be misled by these hallucinations remain unclear. To address this problem, this paper explores humans' neural dynamics to characterize how the brain processes hallucinated content. We record EEG signals from 27 participants while they are performing a verification task to judge the correctness of image descriptions generated by a multi-modal large language model (MLLM). Based on an averaged event-related potential (ERP) study, we reveal that multiple cognitive processes, e.g., semantic integration, inferential processing, memory retrieval, and cognitive load, exhibit distinct patterns when humans process hallucinated versus non-hallucinated content. Notably, neural responses to hallucinations that were misjudged versus correctly judged by human participants showed significant differences. This indicates that misjudged AI-generated hallucinations failed to trigger the standard neurocognitive fact verification pathway.

preprint2024arXiv

Caseformer: Pre-training for Legal Case Retrieval Based on Inter-Case Distinctions

Legal case retrieval aims to help legal workers find relevant cases related to their cases at hand, which is important for the guarantee of fairness and justice in legal judgments. While recent advances in neural retrieval methods have significantly improved the performance of open-domain retrieval tasks (e.g., Web search), their advantages have not been observed in legal case retrieval due to their thirst for annotated data. As annotating large-scale training data in legal domains is prohibitive due to the need for domain expertise, traditional search techniques based on lexical matching such as TF-IDF, BM25, and Query Likelihood are still prevalent in legal case retrieval systems. While previous studies have designed several pre-training methods for IR models in open-domain tasks, these methods are usually suboptimal in legal case retrieval because they cannot understand and capture the key knowledge and data structures in the legal corpus. To this end, we propose a novel pre-training framework named Caseformer that enables the pre-trained models to learn legal knowledge and domain-specific relevance information in legal case retrieval without any human-labeled data. Through three unsupervised learning tasks, Caseformer is able to capture the special language, document structure, and relevance patterns of legal case documents, making it a strong backbone for downstream legal case retrieval tasks. Experimental results show that our model has achieved state-of-the-art performance in both zero-shot and full-data fine-tuning settings. Also, experiments on both Chinese and English legal datasets demonstrate that the effectiveness of Caseformer is language-independent in legal case retrieval.

preprint2024arXiv

Wikiformer: Pre-training with Structured Information of Wikipedia for Ad-hoc Retrieval

With the development of deep learning and natural language processing techniques, pre-trained language models have been widely used to solve information retrieval (IR) problems. Benefiting from the pre-training and fine-tuning paradigm, these models achieve state-of-the-art performance. In previous works, plain texts in Wikipedia have been widely used in the pre-training stage. However, the rich structured information in Wikipedia, such as the titles, abstracts, hierarchical heading (multi-level title) structure, relationship between articles, references, hyperlink structures, and the writing organizations, has not been fully explored. In this paper, we devise four pre-training objectives tailored for IR tasks based on the structured knowledge of Wikipedia. Compared to existing pre-training methods, our approach can better capture the semantic knowledge in the training corpus by leveraging the human-edited structured data from Wikipedia. Experimental results on multiple IR benchmark datasets show the superior performance of our model in both zero-shot and fine-tuning settings compared to existing strong retrieval baselines. Besides, experimental results in biomedical and legal domains demonstrate that our approach achieves better performance in vertical domains compared to previous models, especially in scenarios where long text similarity matching is needed.

preprint2022arXiv

A Survey on Dropout Methods and Experimental Verification in Recommendation

Overfitting is a common problem in machine learning, which means the model too closely fits the training data while performing poorly in the test data. Among various methods of coping with overfitting, dropout is one of the representative ways. From randomly dropping neurons to dropping neural structures, dropout has achieved great success in improving model performances. Although various dropout methods have been designed and widely applied in past years, their effectiveness, application scenarios, and contributions have not been comprehensively summarized and empirically compared by far. It is the right time to make a comprehensive survey. In this paper, we systematically review previous dropout methods and classify them into three major categories according to the stage where dropout operation is performed. Specifically, more than seventy dropout methods published in top AI conferences or journals (e.g., TKDE, KDD, TheWebConf, SIGIR) are involved. The designed taxonomy is easy to understand and capable of including new dropout methods. Then, we further discuss their application scenarios, connections, and contributions. To verify the effectiveness of distinct dropout methods, extensive experiments are conducted on recommendation scenarios with abundant heterogeneous information. Finally, we propose some open problems and potential research directions about dropout that worth to be further explored.

preprint2022arXiv

A Survey on the Fairness of Recommender Systems

Recommender systems are an essential tool to relieve the information overload challenge and play an important role in people's daily lives. Since recommendations involve allocations of social resources (e.g., job recommendation), an important issue is whether recommendations are fair. Unfair recommendations are not only unethical but also harm the long-term interests of the recommender system itself. As a result, fairness issues in recommender systems have recently attracted increasing attention. However, due to multiple complex resource allocation processes and various fairness definitions, the research on fairness in recommendation is scattered. To fill this gap, we review over 60 papers published in top conferences/journals, including TOIS, SIGIR, and WWW. First, we summarize fairness definitions in the recommendation and provide several views to classify fairness issues. Then, we review recommendation datasets and measurements in fairness studies and provide an elaborate taxonomy of fairness methods in the recommendation. Finally, we conclude this survey by outlining some promising future directions.

preprint2022arXiv

Brain Topography Adaptive Network for Satisfaction Modeling in Interactive Information Access System

With the growth of information on the Web, most users heavily rely on information access systems (e.g., search engines, recommender systems, etc.) in their daily lives. During this procedure, modeling users' satisfaction status plays an essential part in improving their experiences with the systems. In this paper, we aim to explore the benefits of using Electroencephalography (EEG) signals for satisfaction modeling in interactive information access system design. Different from existing EEG classification tasks, the arisen of satisfaction involves multiple brain functions, such as arousal, prototypicality, and appraisals, which are related to different brain topographical areas. Thus modeling user satisfaction raises great challenges to existing solutions. To address this challenge, we propose BTA, a Brain Topography Adaptive network with a multi-centrality encoding module and a spatial attention mechanism module to capture cognitive connectivities in different spatial distances. We explore the effectiveness of BTA for satisfaction modeling in two popular information access scenarios, i.e., search and recommendation. Extensive experiments on two real-world datasets verify the effectiveness of introducing brain topography adaptive strategy in satisfaction modeling. Furthermore, we also conduct search result re-ranking task and video rating prediction task based on the satisfaction inferred from brain signals on search and recommendation scenarios, respectively. Experimental results show that brain signals extracted with BTA help improve the performance of interactive information access systems significantly.

preprint2022arXiv

CorpusBrain: Pre-train a Generative Retrieval Model for Knowledge-Intensive Language Tasks

Knowledge-intensive language tasks (KILT) usually require a large body of information to provide correct answers. A popular paradigm to solve this problem is to combine a search system with a machine reader, where the former retrieves supporting evidences and the latter examines them to produce answers. Recently, the reader component has witnessed significant advances with the help of large-scale pre-trained generative models. Meanwhile most existing solutions in the search component rely on the traditional ``index-retrieve-then-rank'' pipeline, which suffers from large memory footprint and difficulty in end-to-end optimization. Inspired by recent efforts in constructing model-based IR models, we propose to replace the traditional multi-step search pipeline with a novel single-step generative model, which can dramatically simplify the search process and be optimized in an end-to-end manner. We show that a strong generative retrieval model can be learned with a set of adequately designed pre-training tasks, and be adopted to improve a variety of downstream KILT tasks with further fine-tuning. We name the pre-trained generative retrieval model as CorpusBrain as all information about the corpus is encoded in its parameters without the need of constructing additional index. Empirical results show that CorpusBrain can significantly outperform strong baselines for the retrieval task on the KILT benchmark and establish new state-of-the-art downstream performances. We also show that CorpusBrain works well under zero- and low-resource settings.

preprint2022arXiv

Disentangled Modeling of Domain and Relevance for Adaptable Dense Retrieval

Recent advance in Dense Retrieval (DR) techniques has significantly improved the effectiveness of first-stage retrieval. Trained with large-scale supervised data, DR models can encode queries and documents into a low-dimensional dense space and conduct effective semantic matching. However, previous studies have shown that the effectiveness of DR models would drop by a large margin when the trained DR models are adopted in a target domain that is different from the domain of the labeled data. One of the possible reasons is that the DR model has never seen the target corpus and thus might be incapable of mitigating the difference between the training and target domains. In practice, unfortunately, training a DR model for each target domain to avoid domain shift is often a difficult task as it requires additional time, storage, and domain-specific data labeling, which are not always available. To address this problem, in this paper, we propose a novel DR framework named Disentangled Dense Retrieval (DDR) to support effective and flexible domain adaptation for DR models. DDR consists of a Relevance Estimation Module (REM) for modeling domain-invariant matching patterns and several Domain Adaption Modules (DAMs) for modeling domain-specific features of multiple target corpora. By making the REM and DAMs disentangled, DDR enables a flexible training paradigm in which REM is trained with supervision once and DAMs are trained with unsupervised data. Comprehensive experiments in different domains and languages show that DDR significantly improves ranking performance compared to strong DR baselines and substantially outperforms traditional retrieval methods in most scenarios.

preprint2022arXiv

Evaluating Interpolation and Extrapolation Performance of Neural Retrieval Models

A retrieval model should not only interpolate the training data but also extrapolate well to the queries that are different from the training data. While neural retrieval models have demonstrated impressive performance on ad-hoc search benchmarks, we still know little about how they perform in terms of interpolation and extrapolation. In this paper, we demonstrate the importance of separately evaluating the two capabilities of neural retrieval models. Firstly, we examine existing ad-hoc search benchmarks from the two perspectives. We investigate the distribution of training and test data and find a considerable overlap in query entities, query intent, and relevance labels. This finding implies that the evaluation on these test sets is biased toward interpolation and cannot accurately reflect the extrapolation capacity. Secondly, we propose a novel evaluation protocol to separately evaluate the interpolation and extrapolation performance on existing benchmark datasets. It resamples the training and test data based on query similarity and utilizes the resampled dataset for training and evaluation. Finally, we leverage the proposed evaluation protocol to comprehensively revisit a number of widely-adopted neural retrieval models. Results show models perform differently when moving from interpolation to extrapolation. For example, representation-based retrieval models perform almost as well as interaction-based retrieval models in terms of interpolation but not extrapolation. Therefore, it is necessary to separately evaluate both interpolation and extrapolation performance and the proposed resampling method serves as a simple yet effective evaluation tool for future IR studies.

preprint2022arXiv

HelixFold: An Efficient Implementation of AlphaFold2 using PaddlePaddle

Accurate protein structure prediction can significantly accelerate the development of life science. The accuracy of AlphaFold2, a frontier end-to-end structure prediction system, is already close to that of the experimental determination techniques. Due to the complex model architecture and large memory consumption, it requires lots of computational resources and time to implement the training and inference of AlphaFold2 from scratch. The cost of running the original AlphaFold2 is expensive for most individuals and institutions. Therefore, reducing this cost could accelerate the development of life science. We implement AlphaFold2 using PaddlePaddle, namely HelixFold, to improve training and inference speed and reduce memory consumption. The performance is improved by operator fusion, tensor fusion, and hybrid parallelism computation, while the memory is optimized through Recompute, BFloat16, and memory read/write in-place. Compared with the original AlphaFold2 (implemented with Jax) and OpenFold (implemented with PyTorch), HelixFold needs only 7.5 days to complete the full end-to-end training and only 5.3 days when using hybrid parallelism, while both AlphaFold2 and OpenFold take about 11 days. HelixFold saves 1x training time. We verified that HelixFold's accuracy could be on par with AlphaFold2 on the CASP14 and CAMEO datasets. HelixFold's code is available on GitHub for free download: https://github.com/PaddlePaddle/PaddleHelix/tree/dev/apps/protein_folding/helixfold, and we also provide stable web services on https://paddlehelix.baidu.com/app/drug/protein/forecast.

preprint2022arXiv

NxtPost: User to Post Recommendations in Facebook Groups

In this paper, we present NxtPost, a deployed user-to-post content-based sequential recommender system for Facebook Groups. Inspired by recent advances in NLP, we have adapted a Transformer-based model to the domain of sequential recommendation. We explore causal masked multi-head attention that optimizes both short and long-term user interests. From a user's past activities validated by defined safety process, NxtPost seeks to learn a representation for the user's dynamic content preference and to predict the next post user may be interested in. In contrast to previous Transformer-based methods, we do not assume that the recommendable posts have a fixed corpus. Accordingly, we use an external item/token embedding to extend a sequence-based approach to a large vocabulary. We achieve 49% abs. improvement in offline evaluation. As a result of NxtPost deployment, 0.6% more users are meeting new people, engaging with the community, sharing knowledge and getting support. The paper shares our experience in developing a personalized sequential recommender system, lessons deploying the model for cold start users, how to deal with freshness, and tuning strategies to reach higher efficiency in online A/B experiments.

preprint2022arXiv

Towards a Better Understanding Human Reading Comprehension with Brain Signals

Reading comprehension is a complex cognitive process involving many human brain activities. Plenty of works have studied the patterns and attention allocations of reading comprehension in information retrieval related scenarios. However, little is known about what happens in human brain during reading comprehension and how these cognitive activities can affect information retrieval process. Additionally, with the advances in brain imaging techniques such as electroencephalogram (EEG), it is possible to collect brain signals in almost real time and explore whether it can be utilized as feedback to facilitate information acquisition performance. In this paper, we carefully design a lab-based user study to investigate brain activities during reading comprehension. Our findings show that neural responses vary with different types of reading contents, i.e., contents that can satisfy users' information needs and contents that cannot. We suggest that various cognitive activities, e.g., cognitive loading, semantic-thematic understanding, and inferential processing, underpin these neural responses at the micro-time scale during reading comprehension. From these findings, we illustrate several insights for information retrieval tasks, such as ranking models construction and interface design. Besides, we suggest the possibility of detecting reading comprehension status for a proactive real-world system. To this end, we propose a Unified framework for EEG-based Reading Comprehension Modeling (UERCM). To verify its effectiveness, we conduct extensive experiments based on EEG features for two reading comprehension tasks: answer sentence classification and answer extraction. Results show that it is feasible to improve the performance of two tasks with brain signals.

preprint2022arXiv

Towards Representation Alignment and Uniformity in Collaborative Filtering

Collaborative filtering (CF) plays a critical role in the development of recommender systems. Most CF methods utilize an encoder to embed users and items into the same representation space, and the Bayesian personalized ranking (BPR) loss is usually adopted as the objective function to learn informative encoders. Existing studies mainly focus on designing more powerful encoders (e.g., graph neural network) to learn better representations. However, few efforts have been devoted to investigating the desired properties of representations in CF, which is important to understand the rationale of existing CF methods and design new learning objectives. In this paper, we measure the representation quality in CF from the perspective of alignment and uniformity on the hypersphere. We first theoretically reveal the connection between the BPR loss and these two properties. Then, we empirically analyze the learning dynamics of typical CF methods in terms of quantified alignment and uniformity, which shows that better alignment or uniformity both contribute to higher recommendation performance. Based on the analyses results, a learning objective that directly optimizes these two properties is proposed, named DirectAU. We conduct extensive experiments on three public datasets, and the proposed learning framework with a simple matrix factorization model leads to significant performance improvements compared to state-of-the-art CF methods. Our implementations are publicly available at https://github.com/THUwangcy/DirectAU.

preprint2021arXiv

Anisotropic dressing of electrons in electron-doped cuprate superconductors

The recent experiments revealed a remarkable possibility for the absence of the disparity between the phase diagrams of the electron- and hole-doped cuprate superconductors, while such an aspect should be also reflected in the dressing of the electrons. Here the phase diagram of the electron-doped cuprate superconductors and the related exotic features of the anisotropic dressing of the electrons are studied based on the kinetic-energy driven superconductivity. It is shown that although the optimized Tc in the electron-doped side is much smaller than that in the hole-doped case, the electron- and hole-doped cuprate superconductors rather resemble each other in the doping range of the superconducting dome, indicating an absence of the disparity between the phase diagrams of the electron- and hole-doped cuprate superconductors. In particular, the anisotropic dressing of the electrons due to the strong electron's coupling to a strongly dispersive spin excitation leads to that the electron Fermi surface is truncated to form the disconnected Fermi arcs centered around the nodal region. Concomitantly, the dip in the peak-dip-hump structure of the quasiparticle excitation spectrum is directly associated with the corresponding peak in the quasiparticle scattering rate, while the dispersion kink is always accompanied by the corresponding inflection point in the total self-energy, as the dip in the peak-dip-hump structure and dispersion kink in the hole-doped counterparts. The theory also predicts that both the normal and anomalous self-energies exhibit the well-pronounced low-energy peak-structures.

preprint2021arXiv

Characteristic energy of the nematic-order state and its connection to enhancement of superconductivity in cuprate superconductors

The new development in sublattice-phase-resolved imaging of electronic structure now allow for the visualisation of the nematic-order state characteristic energy of cuprate superconductors in a wide doping regime. However, it is still unclear how this characteristic energy of the nematic-order state is correlated with the enhancement of superconductivity. Here the doping dependence of the nematic-order state characteristic energy in cuprate superconductors and of its possible connection to the enhancement of superconductivity is investigated within the framework of the kinetic-energy-driven superconductivity. It is shown that the characteristic energy of the nematic-order state is found to be particularly large in the underdoped regime, then it smoothly decreases upon the increase of doping, in full agreement with the corresponding experimental observations. Moreover, the characteristic energy of the nematic-order state as a function of the nematic-order state strength in the underdoped regime presents a similar behavior of the superconducting transition temperature. This suggests a possible connection between the nematic-order state characteristic energy and the enhancement of the superconductivity.

preprint2021arXiv

Enhancement of superconductivity by electronic nematicity in cuprate superconductors

The cuprate superconductors are characterized by numerous ordering tendencies, with the nematic order being the most distinct form of order. Here the intertwinement of the electronic nematicity with superconductivity in cuprate superconductors is studied based on the kinetic-energy-driven superconductivity. It is shown that the optimized Tc takes a dome-like shape with the weak and strong strength regions on each side of the optimal strength of the electronic nematicity, where the optimized Tc reaches its maximum. This dome-like shape nematic-order strength dependence of Tc indicates that the electronic nematicity enhances superconductivity. Moreover, this nematic order induces the anisotropy of the electron Fermi surface (EFS), where although the original EFS with the four-fold rotation symmetry is broken up into that with a residual two-fold rotation symmetry, this EFS with the two-fold rotation symmetry still is truncated to form the Fermi arcs with the most spectral weight that locates at the tips of the Fermi arcs. Concomitantly, these tips of the Fermi arcs connected by the wave vectors ${\bf q}_{i}$ construct an octet scattering model, however, the partial wave vectors and their respective symmetry-corresponding partners occur with unequal amplitudes, leading to these ordered states being broken both rotation and translation symmetries. As a natural consequence, the electronic structure is inequivalent between the $k_{x}$ and $k_{y}$ directions. These anisotropic features of the electronic structure are also confirmed via the result of the autocorrelation of the single-particle excitation spectra, where the breaking of the rotation symmetry is verified by the inequivalence on the average of the electronic structure at the two Bragg scattering sites. Furthermore, the strong energy dependence of the order parameter of the electronic nematicity is also discussed.

preprint2021arXiv

Peak-structure in self-energy of cuprate superconductors

The recently deduced normal and anomalous self-energies from photoemission spectra of cuprate superconductors via the machine learning technique are calling for an explanation. Here the normal and anomalous self-energies in cuprate superconductors are analyzed within the framework of the kinetic-energy-driven superconductivity. It is shown that the exchanged spin excitations give rise to the well-pronounced low-energy peak-structures in both the normal and anomalous self-energies, however, they do not cancel in the total self-energy. In particular, the peak-structure in the normal self-energy is mainly responsible for the peak-dip-hump structure in the single-particle excitation spectrum, and can persist into the normal-state, while the sharp peak in the anomalous self-energy gives rise to a crucial contribution to the superconducting gap, and vanishes in the normal-state. Moreover, the evolution of the peak-structure with doping and momentum are also analyzed.

preprint2021arXiv

Renormalization of dispersion in electron-doped bilayer cuprate superconductors

The renormalization of the electrons in cuprate superconductors is characterized by the kink in the quasiparticle dispersion. Here the bilayer coupling effect on the quasiparticle dispersion kink in the electron-doped bilayer cuprate superconductors is studied based on the kinetic-energy-driven superconductivity. It is shown that the kink in the quasiparticle dispersion is present all around the electron Fermi surface, as the quasiparticle dispersion kink in the single-layer case. However, in comparison with the corresponding single-layer case, the kink effect in the quasiparticle dispersion at around the antinodal region becomes the most pronounced, indicating that the kink effect in the quasiparticle dispersion at around the antinodal region is enhanced by the bilayer coupling.

preprint2020arXiv

An Empirical Study of Clarifying Question-Based Systems

Search and recommender systems that take the initiative to ask clarifying questions to better understand users' information needs are receiving increasing attention from the research community. However, to the best of our knowledge, there is no empirical study to quantify whether and to what extent users are willing or able to answer these questions. In this work, we conduct an online experiment by deploying an experimental system, which interacts with users by asking clarifying questions against a product repository. We collect both implicit interaction behavior data and explicit feedback from users showing that: (a) users are willing to answer a good number of clarifying questions (11-21 on average), but not many more than that; (b) most users answer questions until they reach the target product, but also a fraction of them stops due to fatigue or due to receiving irrelevant questions; (c) part of the users' answers (12-17%) are actually opposite to the description of the target product; while (d) most of the users (66-84%) find the question-based system helpful towards completing their tasks. Some of the findings of the study contradict current assumptions on simulated evaluations in the field, while they point towards improvements in the evaluation framework and can inspire future interactive search/recommender system designs.

preprint2020arXiv

ARPES autocorrelation in electron-doped cuprate superconductors

The angle-resolved photoemission spectroscopy (ARPES) autocorrelation in the electron-doped cuprate superconductors is studied based on the kinetic-energy driven superconducting (SC) mechanism. It is shown that the strong electron correlation induces the electron Fermi surface (EFS) reconstruction, where the most of the quasiparticles locate at around the hot spots on EFS, and then these hot spots connected by the scattering wave vectors ${\bf q}_{i}$ construct an {\it octet} scattering model. In a striking analogy to the hole-doped case, the sharp ARPES autocorrelation peaks are directly correlated with the scattering wave vectors ${\bf q}_{i}$, and are weakly dispersive in momentum space. However, in a clear contrast to the hole-doped counterparts, the position of the ARPES autocorrelation peaks move toward to the opposite direction with the increase of doping. The theory also indicates that there is an intrinsic connection between the ARPES autocorrelation and quasiparticle scattering interference (QSI) in the electron-doped cuprate superconductors.

preprint2020arXiv

Renormalization of electrons in bilayer cuprate superconductors

The characteristic features of the renormalization of the electrons in the bilayer cuprate superconductors are investigated within the kinetic-energy driven superconductivity. It is shown that the quasiparticle excitation spectrum is split into its bonding and antibonding components due to the presence of the bilayer coupling, with each component that is independent. However, in the underdoped and optimally doped regimes, although the bonding and antibonding electron Fermi surface (EFS) contours deriving from the bonding and antibonding layers are truncated to form the bonding and antibonding Fermi arcs, almost all spectral weights in the bonding and antibonding Fermi arcs are reduced to the tips of the bonding and antibonding Fermi arcs, which in this case coincide with the bonding and antibonding hot spots. These hot spots connected by the scattering wave vectors ${\bf q}_{i} $ construct an octet scattering model, and then the enhancement of the quasiparticle scattering processes with the scattering wave vectors ${\bf q}_{i}$ is confirmed via the result of the autocorrelation of the ARPES spectral intensities. Moreover, the peak-dip-hump (PDH) structure developed in each component of the quasiparticle excitation spectrum along the corresponding EFS is directly related with the peak structure in the quasiparticle scattering rate except for at around the hot spots, where the PDH structure is caused mainly by the bilayer coupling. Although the kink in the quasiparticle dispersion is present all around EFS, when the momentum moves away from the node to the antinode, the kink energy smoothly decreases, while the dispersion kink becomes more pronounced, and in particular, near the cut close to the antinode, develops into a break separating of the fasting dispersing high-energy part of the quasiparticle excitation spectrum from the slower dispersing low-energy part.

preprint2020arXiv

RepBERT: Contextualized Text Embeddings for First-Stage Retrieval

Although exact term match between queries and documents is the dominant method to perform first-stage retrieval, we propose a different approach, called RepBERT, to represent documents and queries with fixed-length contextualized embeddings. The inner products of query and document embeddings are regarded as relevance scores. On MS MARCO Passage Ranking task, RepBERT achieves state-of-the-art results among all initial retrieval techniques. And its efficiency is comparable to bag-of-words methods.

preprint2020arXiv

STAS: Adaptive Selecting Spatio-Temporal Deep Features for Improving Bias Correction on Precipitation

Numerical Weather Prediction (NWP) can reduce human suffering by predicting disastrous precipitation in time. A commonly-used NWP in the world is the European Centre for medium-range weather forecasts (EC). However, it is necessary to correct EC forecast through Bias Correcting on Precipitation (BCoP) since we still have not fully understood the mechanism of precipitation, making EC often have some biases. The existing BCoPs suffers from limited prior data and the fixed Spatio-Temporal (ST) scale. We thus propose an end-to-end deep-learning BCoP model named Spatio-Temporal feature Auto-Selective (STAS) model to select optimal ST regularity from EC via the ST Feature-selective Mechanisms (SFM/TFM). Given different input features, these two mechanisms can automatically adjust the spatial and temporal scales for correcting. Experiments on an EC public dataset indicate that compared with 8 published BCoP methods, STAS shows state-of-the-art performance on several criteria of BCoP, named threat scores (TS). Further, ablation studies justify that the SFM/TFM indeed work well in boosting the performance of BCoP, especially on the heavy precipitation.

preprint2019arXiv

Doping dependence of electromagnetic response in cuprate superconductors

The study of the electromagnetic response in cuprate superconductors plays a crucial role in the understanding of the essential physics of these materials. Here the doping dependence of the electromagnetic response in cuprate superconductors is studied within the kinetic-energy driven superconducting mechanism. The kernel of the response function is evaluated based on the linear response approximation for a purely transverse vector potential, and can be broken up into its diamagnetic and paramagnetic parts. In particular, this paramagnetic part exactly cancels the corresponding diamagnetic part in the normal-state, and then the Meissner effect is obtained within the entire superconducting phase. Following this kernel of the response function, the electromagnetic response calculation in terms of the specular reflection model qualitatively reproduces many of the striking features observed in the experiments. In particular, the local magnetic-field profile follows an exponential law, while the superfluid density exhibits the nonlinear temperature behavior at the lowest temperatures, followed by the linear temperature dependence extending over the most of the superconducting temperature range. Moreover, the maximal value of the superfluid density occurs at around the critical doping $δ_{\rm critical}\sim 0.16$, and then decreases in both lower doped and higher doped regimes. The theory also shows that the nonlinear temperature dependence of the superfluid density at the lowest temperatures can be attributed to the nonlocal effects induced by the d-wave gap nodes on the electron Fermi surface.