Researcher profile

Yue Dai

Yue Dai contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

BEAM: Binary Expert Activation Masking for Dynamic Routing in MoE

Mixture-of-Experts (MoE) architectures enhance the efficiency of large language models by activating only a subset of experts per token. However, standard MoE employs a fixed Top-K routing strategy, leading to redundant computation and suboptimal inference latency. Existing acceleration methods either require costly retraining with architectural changes or suffer from severe performance drop at high sparsity due to train-inference mismatch. To address these limitations, we propose BEAM (Binary Expert Activation Masking), a novel method that learns token-adaptive expert selection via trainable binary masks. With a straight-through estimator and an auxiliary regularization loss, BEAM induces dynamic expert sparsity through end-to-end training while maintaining model capability. We further implement an efficient custom CUDA kernel for BEAM, ensuring seamless integration with the vLLM inference framework. Experiments show that BEAM retains over 98\% of the original model's performance while reducing MoE layer FLOPs by up to 85\%, achieving up to 2.5$\times$ faster decoding and 1.4$\times$ higher throughput, demonstrating its effectiveness as a practical, plug-and-play solution for efficient MoE inference.

preprint2022arXiv

An Analysis of Deep Reinforcement Learning Agents for Text-based Games

Text-based games(TBG) are complex environments which allow users or computer agents to make textual interactions and achieve game goals.In TBG agent design and training process, balancing the efficiency and performance of the agent models is a major challenge. Finding TBG agent deep learning modules' performance in standardized environments, and testing their performance among different evaluation types is also important for TBG agent research. We constructed a standardized TBG agent with no hand-crafted rules, formally categorized TBG evaluation types, and analyzed selected methods in our environment.

preprint2022arXiv

ImageArg: A Multi-modal Tweet Dataset for Image Persuasiveness Mining

The growing interest in developing corpora of persuasive texts has promoted applications in automated systems, e.g., debating and essay scoring systems; however, there is little prior work mining image persuasiveness from an argumentative perspective. To expand persuasiveness mining into a multi-modal realm, we present a multi-modal dataset, ImageArg, consisting of annotations of image persuasiveness in tweets. The annotations are based on a persuasion taxonomy we developed to explore image functionalities and the means of persuasion. We benchmark image persuasiveness tasks on ImageArg using widely-used multi-modal learning methods. The experimental results show that our dataset offers a useful resource for this rich and challenging topic, and there is ample room for modeling improvement.

preprint2022arXiv

Infrared Radiation of Graphene Electrothermal Film Triggered Alpha and Theta Brainwaves

The alpha and theta frequency brainwave activity in Electroencephalogram (EEG) signal has been correlated with attention, inhibitory processes, memory, perceptual abilities, and sleep. The enhanced alpha and theta brainwave activity may bring positive behavioral modifications such as promoting creativity and a quick sleep. Herein, we discover that infrared radiation from multilayer graphene electrothermal film can obviously promote the appearance of alpha and theta brainwave in human mind. In particular, the occurrence frequency of the alpha and theta waves in EEG can be effectively enhanced up to 2.3 and 3.0 times, respectively. And the duration time of the alpha and theta waves in EEG can also be effectively extended. The mechanism may be attributed to the efficient infrared radiation caused by graphene mainly focused on the range from 7 to 14 micron, coinciding with the radiation wavelength of natural human body, which can be effectively absorbed by the human skin and speed up the blood microcirculation and metabolism. The comparative effect of different working temperature and heating materials such as water, Cu and even monolayer graphene are systematically investigated, indicating the infrared radiation from the multilayer graphene electrothermal film at 50 degrees has the largest enhancement effect of alpha and theta brainwaves. The multilayer graphene film electrical heater represents a convenient and surprising way for triggering the alpha and theta brainwaves, which has many potential applications in the area of enlarged health cerements.

preprint2022arXiv

Not all entangled states are useful for ancilla-assisted quantum process tomography

It is well known that one can extract all the information of an unknown quantum channel by means of quantum process tomography, such as standard quantum-process tomography and ancilla-assisted quantum process tomography (AAQPT). Furthermore, it has been shown that entanglement is not necessary for AAQPT, there exist separable states which are also useful for it. Surprisingly, in this work we find that not all entangled states are useful for AAQPT, there also exist some entangled states which are useless. The realignment operation used in entanglement detection can be related to the question whether a bipartite state is useful for AAQPT. We derive the relationship between them and show the process of extracting the complete information of an unknown channel by the realignment operation. Based on this relationship, we present examples of a two-qutrit entangled state and a two-qutrit bound entangled state. Both of these two examples are entangled but they cannot be used for AAQPT. Last but not least, experimental verification has also been performed on the IBM platform.

preprint2021arXiv

Observation of the tradeoff between internal quantum nonseparability and external classical correlations

The monogamy relations of entanglement are highly significant. However, they involve only amounts of entanglement shared by different subsystems. Results on monogamy relations between entanglement and other kinds of correlations, and particularly classical correlations, are very scarce. Here we experimentally observe a tradeoff relation between internal quantum nonseparability and external total correlations in a photonic system and found that even purely classical external correlations have a detrimental effect on internal nonseparability. The nonseparability we consider, measured by the concurrence, is between different degrees of freedom within the same photon, and the external classical correlations, measured by the standard quantum mutual information, are generated between the photons of a photon pair using the time-bin method. Our observations show that to preserve the internal entanglement in a system, it is necessary to maintain low external correlations, including classical ones, between the system and its environment.

preprint2020arXiv

Experimentally accessible lower bounds for genuine multipartite entanglement and coherence measures

Experimentally quantifying entanglement and coherence are extremely important for quantum resource theory. However, because the quantum state tomography requires exponentially growing measurements with the number of qubits, it is hard to quantify entanglement and coherence based on the full information of the experimentally realized multipartite states. Fortunately, other methods have been found to directly measure the fidelity of experimental states without quantum state tomography. Here we present a fidelity-based method to derive experimentally accessible lower bounds for measures of genuine multipartite entanglement and coherence. On the one hand, the method works for genuine multipartite entanglement measures including the convex-roof extended negativity, the concurrence, the G-concurrence, and the geometric measure for genuine multipartite entanglement. On the other hand, the method also delivers observable lower bounds for the convex roof of the $l_{1}$-norm of coherence, the geometric measure of coherence, and the coherence of formation. Furthermore, all the lower bounds are based on the fidelity between the chosen pure state and the target state, and we obtain the lower bounds of several real experimental states as examples of our results.

preprint2020arXiv

Numerical and analytical results for geometric measure of coherence and geometric measure of entanglement

Quantifying coherence and entanglement is extremely important in quantum information processing. Here, we present numerical and analytical results for the geometric measure of coherence, and also present numerical results for the geometric measure of entanglement. On the one hand, we first provide a semidefinite algorithm to numerically calculate geometric measure of coherence for arbitrary finite-dimensional mixed states. Based on this semidefinite algorithm, we test randomly generated single-qubit states, single-qutrit states, and a special kind of $d$-dimensional mixed states. Moreover, we also obtain an analytical solution of geometric measure of coherence for a special kind of mixed states. On the other hand, another algorithm is proposed to calculate the geometric measure of entanglement for arbitrary two-qubit and qubit-qutrit states, and some special kinds of higher dimensional mixed states. For other states, the algorithm can get a lower bound of the geometric measure of entanglement. Randomly generated two-qubit states, the isotropic states and the Werner states are tested. Furthermore, we compare our numerical results with some analytical results, which coincide with each other.