Researcher profile

Xinyu Dai

Xinyu Dai contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
12works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

12 published item(s)

preprint2026arXiv

Causal Evidence for Attention Head Imbalance in Modality Conflict Hallucination

Modality-conflict hallucination occurs when multimodal large language models (MLLMs) prioritize erroneous textual premises over contradictory visual evidence. To understand why visual evidence fails to prevail during generation, we take a mechanistic perspective and examine which internal components drive or resist this failure. We perform head-level causal analysis using path patching across five open-source MLLMs and identify two groups of attention heads with opposing causal roles: hallucination-driving heads and hallucination-resisting heads. We find a consistent asymmetry: driving effects are more broadly distributed and carry greater aggregate weight, whereas resisting effects concentrate in a small number of high-importance heads. Ablation experiments further confirm that these groups exert opposing effects during generation: distributed driving influence and localized resistance together form an imbalanced routing structure that biases generation toward the erroneous premise. Motivated by this finding, we propose MACI (Modality-conflict-Aware Causal Intervention), a conditional intervention that suppresses causally identified hallucination-driving heads only when conflict is detected. Across five MLLMs, MACI achieves the largest hallucination reduction among compared inference-time baselines on the MMMC benchmark with a favorable hallucination-accuracy trade-off, and transfers zero-shot to the SCI-SemanticConflict test.

preprint2022arXiv

$\textit{latent}$-GLAT: Glancing at Latent Variables for Parallel Text Generation

Recently, parallel text generation has received widespread attention due to its success in generation efficiency. Although many advanced techniques are proposed to improve its generation quality, they still need the help of an autoregressive model for training to overcome the one-to-many multi-modal phenomenon in the dataset, limiting their applications. In this paper, we propose $\textit{latent}$-GLAT, which employs the discrete latent variables to capture word categorical information and invoke an advanced curriculum learning technique, alleviating the multi-modality problem. Experiment results show that our method outperforms strong baselines without the help of an autoregressive model, which further broadens the application scenarios of the parallel decoding paradigm.

preprint2022arXiv

Analyzing the Intensity of Complaints on Social Media

Complaining is a speech act that expresses a negative inconsistency between reality and human expectations. While prior studies mostly focus on identifying the existence or the type of complaints, in this work, we present the first study in computational linguistics of measuring the intensity of complaints from text. Analyzing complaints from such perspective is particularly useful, as complaints of certain degrees may cause severe consequences for companies or organizations. We create the first Chinese dataset containing 3,103 posts about complaints from Weibo, a popular Chinese social media platform. These posts are then annotated with complaints intensity scores using Best-Worst Scaling (BWS) method. We show that complaints intensity can be accurately estimated by computational models with the best mean square error achieving 0.11. Furthermore, we conduct a comprehensive linguistic analysis around complaints, including the connections between complaints and sentiment, and a cross-lingual comparison for complaints expressions used by Chinese and English speakers. We finally show that our complaints intensity scores can be incorporated for better estimating the popularity of posts on social media.

preprint2022arXiv

Variability Selected Active Galactic Nuclei from ASAS-SN Survey: Constraining the Low Luminosity AGN Population

Low luminosity active galactic nuclei (LLAGN) probe accretion physics in the low Eddington regime and can provide additional clues about galaxy evolution. AGN variability is ubiquitous and thus provides a reliable tool for finding AGN. We analyze the All-Sky Automated Survey for SuperNovae light curves of 1218 galaxies with $g<14$ mag and Sloan Digital Sky Survey spectra in search of AGN. We find 37 objects that are both variable and have AGN-like structure functions, which is about 3% of the sample. The majority of the variability selected AGN are LLAGN with Eddington ratios ranging from $10^{-4}$ to $10^{-2}$. We thus estimate the fraction of LLAGN in the population of galaxies as 2% down to a median Eddington ratio of $2\times10^{-3}$. Combining the BPT line ratio diagnostics and the broad-line AGN, up to $\sim$60% of the AGN candidates are confirmed spectroscopically. The BPT diagnostics also classified 10-30% of the candidates as star forming galaxies rather than AGN.

preprint2021arXiv

Conditional Neural Process for non-parametric modeling of AGN light curve

The consequences of complex disturbed environments in the vicinity of a supermassive black hole are not well represented by standard statistical models of optical variability in active galactic nuclei (AGN). Thus, developing new methodologies for investigating and modeling AGN light curves is crucial. Conditional Neural Processes (CNPs) are nonlinear function models that forecast stochastic time-series based on a finite amount of known data without the use of any additional parameters or prior knowledge (kernels). We provide a CNP algorithm that is specifically designed for simulating AGN light curves. It was trained using data from the All-Sky Automated Survey for Supernovae, which included 153 AGN. We present CNP modeling performance for a subsample of five AGNs with distinctive difficult-to-model properties. The performance of CNP in predicting temporal flux fluctuation was assessed using a minimizing loss function, and the results demonstrated the algorithm&#39;s usefulness. Our preliminary parallelization experiments show that CNP can efficiently handle large amounts of data. These results imply that CNP can be more effective than standard tools in modeling large volumes of AGN data (as anticipated from time-domain surveys such as the Vera C. Rubin Observatory&#39;s Legacy Survey of Space and Time).

preprint2021arXiv

Optical Confirmation of X-ray selected Galaxy clusters from the Swift AGN and Cluster survey with MDM and Pan-STARRS Data (Paper III)

To understand structure formation in the universe and impose stronger constraints on the cluster mass function and cosmological models, it is important to have large galaxy cluster catalogs. The Swift AGN and Cluster Survey is a serendipitous X-ray survey aimed at building a large statistically selected X-ray cluster catalog with 442 cluster candidates in its first release. Our initial SDSS follow-up study confirmed $50\%$ of clusters in the SDSS footprint as z $<$ 0.5 clusters. Here, we present further optical follow-up analysis of 248 (out of 442) cluster candidates from the Swift cluster catalog using multi-band imaging from the MDM $2.4m$ telescope and the Pan-STARRS survey. We report the optical confirmation of 55 clusters with $> 3σ$ galaxy overdensities and detectable red sequences in the color-magnitude space. The majority of these confirmed clusters have redshifts z $<$ 0.6. The remaining candidates are potentially higher redshift clusters that are excellent targets for infrared observations. We report the X-ray luminosity and the optical richness for these confirmed clusters. We also discuss the distinction between X-ray and optical observables for the detected and non-detected cluster candidates.

preprint2020arXiv

A Reinforced Generation of Adversarial Examples for Neural Machine Translation

Neural machine translation systems tend to fail on less decent inputs despite its significant efficacy, which may significantly harm the credibility of this systems-fathoming how and when neural-based systems fail in such cases is critical for industrial maintenance. Instead of collecting and analyzing bad cases using limited handcrafted error features, here we investigate this issue by generating adversarial examples via a new paradigm based on reinforcement learning. Our paradigm could expose pitfalls for a given performance metric, e.g., BLEU, and could target any given neural machine translation architecture. We conduct experiments of adversarial attacks on two mainstream neural machine translation architectures, RNN-search, and Transformer. The results show that our method efficiently produces stable attacks with meaning-preserving adversarial examples. We also present a qualitative and quantitative analysis for the preference pattern of the attack, demonstrating its capability of pitfall exposure.

preprint2020arXiv

Adding A Filter Based on The Discriminator to Improve Unconditional Text Generation

The autoregressive language model (ALM) trained with maximum likelihood estimation (MLE) is widely used in unconditional text generation. Due to exposure bias, the generated texts still suffer from low quality and diversity. This presents statistically as a discrepancy between the real text and generated text. Some research shows a discriminator can detect this discrepancy. Because the discriminator can encode more information than the generator, discriminator has the potentiality to improve generator. To alleviate the exposure bias, generative adversarial networks (GAN) use the discriminator to update the generator&#39;s parameters directly, but they fail by being evaluated precisely. A critical reason for the failure is the difference between the discriminator input and the ALM input. We propose a novel mechanism by adding a filter which has the same input as the discriminator. First, discriminator detects the discrepancy signals and passes to filter directly (or by learning). Then, we use the filter to reject some generated samples with a sampling-based method. Thus, the original generative distribution is revised to reduce the discrepancy. Two ALMs, RNN-based and Transformer-based, are experimented. Evaluated precisely by three metrics, our mechanism consistently outperforms the ALMs and all kinds of GANs across two benchmark data sets.

preprint2020arXiv

Confirmed short periodic variability of subparsec supermassive binary black hole candidate Mrk 231

Here we confirm the short periodic variability of a subparsec supermassive binary black hole (SMBBH) candidate Mrk 231 in the extended optical photometric data set collected by the Catalina Real-Time Transient Survey (CRTS) and All-Sky Automated Survey for Supernovae (ASAS-SN). Using the Lomb-Scargle periodogram and 2DHybrid method, we detected the significant periodicity of ~ 1.1 yr beyond a damped random walk model in the CRTS+ASAS-SN optical data set. Mrk 231 has been previously proposed as a SMBBH candidate with a highly unequal mass ratio (q~ 0.03), very tight mutual separation of ~590 AU, and an orbital period of ~1.2 yr. Hence, our result further supports, even though not prove, the intriguing hypothesis that SMBBHs with low mass ratios may be more common than close-equal mass SMBBHs. This result, however, was obtained from the contribution of CRTS data with limited sampling cadence and photometric accuracy, and further monitoring of Mrk 231 is crucial to confirm the periodicity.

preprint2020arXiv

Prompt Agnostic Essay Scorer: A Domain Generalization Approach to Cross-prompt Automated Essay Scoring

Cross-prompt automated essay scoring (AES) requires the system to use non target-prompt essays to award scores to a target-prompt essay. Since obtaining a large quantity of pre-graded essays to a particular prompt is often difficult and unrealistic, the task of cross-prompt AES is vital for the development of real-world AES systems, yet it remains an under-explored area of research. Models designed for prompt-specific AES rely heavily on prompt-specific knowledge and perform poorly in the cross-prompt setting, whereas current approaches to cross-prompt AES either require a certain quantity of labelled target-prompt essays or require a large quantity of unlabelled target-prompt essays to perform transfer learning in a multi-step manner. To address these issues, we introduce Prompt Agnostic Essay Scorer (PAES) for cross-prompt AES. Our method requires no access to labelled or unlabelled target-prompt data during training and is a single-stage approach. PAES is easy to apply in practice and achieves state-of-the-art performance on the Automated Student Assessment Prize (ASAP) dataset.

preprint2020arXiv

R3: A Reading Comprehension Benchmark Requiring Reasoning Processes

Existing question answering systems can only predict answers without explicit reasoning processes, which hinder their explainability and make us overestimate their ability of understanding and reasoning over natural language. In this work, we propose a novel task of reading comprehension, in which a model is required to provide final answers and reasoning processes. To this end, we introduce a formalism for reasoning over unstructured text, namely Text Reasoning Meaning Representation (TRMR). TRMR consists of three phrases, which is expressive enough to characterize the reasoning process to answer reading comprehension questions. We develop an annotation platform to facilitate TRMR&#39;s annotation, and release the R3 dataset, a \textbf{R}eading comprehension benchmark \textbf{R}equiring \textbf{R}easoning processes. R3 contains over 60K pairs of question-answer pairs and their TRMRs. Our dataset is available at: \url{http://anonymous}.

preprint2019arXiv

Confirmation of Planet-Mass Objects in Extragalactic Systems

Quasar microlensing serves as a unique probe of discrete objects within galaxies and galaxy clusters. Recent advancement of the technique shows that it can constrain planet-scale objects beyond our native galaxy by studying their induced microlensing signatures, the energy shift of emission lines originated in the vicinity of the black hole of high redshift background quasars. We employ this technique to exert effective constraints on the planet-mass object distribution within two additional lens systems, Q J0158$-$4325 ($z_l = 0.317$) and SDSS J1004+4112 ($z_l = 0.68$) using Chandra observations of the two gravitationally-lensed quasars. The observed variations of the emission line peak energy can be explained as microlensing of the FeK$α$ emission region induced by planet-mass microlenses. To corroborate this, we perform microlensing simulations to determine the probability of a caustic transiting the source region and compare this with the observed line shift rates. Our analysis yields constraints on the sub-stellar population, with masses ranging from Moon ($10^{-8} M_{\odot}$) to Jupiter ($10^{-3} M_{\odot}$) sized bodies, within these galaxy or cluster scale structures, with total mass fractions of $\sim 3\times10^{-4}$ and $\sim 1\times10^{-4}$ with respect to halo mass for Q J0158$-$4325 and SDSS J1004+4112, respectively. Our analysis suggests that unbound planet-mass objects are universal in galaxies, and we surmise the objects to be either free-floating planets or primordial black holes. We present the first-ever constraints on the sub-stellar mass distribution in the intra-cluster light of a galaxy cluster. Our analysis yields the most stringent limit for primordial black holes at the mass range.