Source author record

Yue Dai

Yue Dai appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

quant-ph Artificial Intelligence Computation and Language gr-qc hep-th Multimedia physics.app-ph

Catalog footprint

What is connected

10works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

BEAM: Binary Expert Activation Masking for Dynamic Routing in MoE

Mixture-of-Experts (MoE) architectures enhance the efficiency of large language models by activating only a subset of experts per token. However, standard MoE employs a fixed Top-K routing strategy, leading to redundant computation and suboptimal inference latency. Existing acceleration methods either require costly retraining with architectural changes or suffer from severe performance drop at high sparsity due to train-inference mismatch. To address these limitations, we propose BEAM (Binary Expert Activation Masking), a novel method that learns token-adaptive expert selection via trainable binary masks. With a straight-through estimator and an auxiliary regularization loss, BEAM induces dynamic expert sparsity through end-to-end training while maintaining model capability. We further implement an efficient custom CUDA kernel for BEAM, ensuring seamless integration with the vLLM inference framework. Experiments show that BEAM retains over 98\% of the original model's performance while reducing MoE layer FLOPs by up to 85\%, achieving up to 2.5$\times$ faster decoding and 1.4$\times$ higher throughput, demonstrating its effectiveness as a practical, plug-and-play solution for efficient MoE inference.

preprint2022arXiv

An Analysis of Deep Reinforcement Learning Agents for Text-based Games

Text-based games(TBG) are complex environments which allow users or computer agents to make textual interactions and achieve game goals.In TBG agent design and training process, balancing the efficiency and performance of the agent models is a major challenge. Finding TBG agent deep learning modules' performance in standardized environments, and testing their performance among different evaluation types is also important for TBG agent research. We constructed a standardized TBG agent with no hand-crafted rules, formally categorized TBG evaluation types, and analyzed selected methods in our environment.

preprint2022arXiv

ImageArg: A Multi-modal Tweet Dataset for Image Persuasiveness Mining

The growing interest in developing corpora of persuasive texts has promoted applications in automated systems, e.g., debating and essay scoring systems; however, there is little prior work mining image persuasiveness from an argumentative perspective. To expand persuasiveness mining into a multi-modal realm, we present a multi-modal dataset, ImageArg, consisting of annotations of image persuasiveness in tweets. The annotations are based on a persuasion taxonomy we developed to explore image functionalities and the means of persuasion. We benchmark image persuasiveness tasks on ImageArg using widely-used multi-modal learning methods. The experimental results show that our dataset offers a useful resource for this rich and challenging topic, and there is ample room for modeling improvement.

preprint2022arXiv

Infrared Radiation of Graphene Electrothermal Film Triggered Alpha and Theta Brainwaves

The alpha and theta frequency brainwave activity in Electroencephalogram (EEG) signal has been correlated with attention, inhibitory processes, memory, perceptual abilities, and sleep. The enhanced alpha and theta brainwave activity may bring positive behavioral modifications such as promoting creativity and a quick sleep. Herein, we discover that infrared radiation from multilayer graphene electrothermal film can obviously promote the appearance of alpha and theta brainwave in human mind. In particular, the occurrence frequency of the alpha and theta waves in EEG can be effectively enhanced up to 2.3 and 3.0 times, respectively. And the duration time of the alpha and theta waves in EEG can also be effectively extended. The mechanism may be attributed to the efficient infrared radiation caused by graphene mainly focused on the range from 7 to 14 micron, coinciding with the radiation wavelength of natural human body, which can be effectively absorbed by the human skin and speed up the blood microcirculation and metabolism. The comparative effect of different working temperature and heating materials such as water, Cu and even monolayer graphene are systematically investigated, indicating the infrared radiation from the multilayer graphene electrothermal film at 50 degrees has the largest enhancement effect of alpha and theta brainwaves. The multilayer graphene film electrical heater represents a convenient and surprising way for triggering the alpha and theta brainwaves, which has many potential applications in the area of enlarged health cerements.

preprint2022arXiv

Not all entangled states are useful for ancilla-assisted quantum process tomography

It is well known that one can extract all the information of an unknown quantum channel by means of quantum process tomography, such as standard quantum-process tomography and ancilla-assisted quantum process tomography (AAQPT). Furthermore, it has been shown that entanglement is not necessary for AAQPT, there exist separable states which are also useful for it. Surprisingly, in this work we find that not all entangled states are useful for AAQPT, there also exist some entangled states which are useless. The realignment operation used in entanglement detection can be related to the question whether a bipartite state is useful for AAQPT. We derive the relationship between them and show the process of extracting the complete information of an unknown channel by the realignment operation. Based on this relationship, we present examples of a two-qutrit entangled state and a two-qutrit bound entangled state. Both of these two examples are entangled but they cannot be used for AAQPT. Last but not least, experimental verification has also been performed on the IBM platform.

preprint2021arXiv

Observation of the tradeoff between internal quantum nonseparability and external classical correlations

The monogamy relations of entanglement are highly significant. However, they involve only amounts of entanglement shared by different subsystems. Results on monogamy relations between entanglement and other kinds of correlations, and particularly classical correlations, are very scarce. Here we experimentally observe a tradeoff relation between internal quantum nonseparability and external total correlations in a photonic system and found that even purely classical external correlations have a detrimental effect on internal nonseparability. The nonseparability we consider, measured by the concurrence, is between different degrees of freedom within the same photon, and the external classical correlations, measured by the standard quantum mutual information, are generated between the photons of a photon pair using the time-bin method. Our observations show that to preserve the internal entanglement in a system, it is necessary to maintain low external correlations, including classical ones, between the system and its environment.

preprint2020arXiv

Experimentally accessible lower bounds for genuine multipartite entanglement and coherence measures

Experimentally quantifying entanglement and coherence are extremely important for quantum resource theory. However, because the quantum state tomography requires exponentially growing measurements with the number of qubits, it is hard to quantify entanglement and coherence based on the full information of the experimentally realized multipartite states. Fortunately, other methods have been found to directly measure the fidelity of experimental states without quantum state tomography. Here we present a fidelity-based method to derive experimentally accessible lower bounds for measures of genuine multipartite entanglement and coherence. On the one hand, the method works for genuine multipartite entanglement measures including the convex-roof extended negativity, the concurrence, the G-concurrence, and the geometric measure for genuine multipartite entanglement. On the other hand, the method also delivers observable lower bounds for the convex roof of the $l_{1}$-norm of coherence, the geometric measure of coherence, and the coherence of formation. Furthermore, all the lower bounds are based on the fidelity between the chosen pure state and the target state, and we obtain the lower bounds of several real experimental states as examples of our results.

preprint2020arXiv

Numerical and analytical results for geometric measure of coherence and geometric measure of entanglement

Quantifying coherence and entanglement is extremely important in quantum information processing. Here, we present numerical and analytical results for the geometric measure of coherence, and also present numerical results for the geometric measure of entanglement. On the one hand, we first provide a semidefinite algorithm to numerically calculate geometric measure of coherence for arbitrary finite-dimensional mixed states. Based on this semidefinite algorithm, we test randomly generated single-qubit states, single-qutrit states, and a special kind of $d$-dimensional mixed states. Moreover, we also obtain an analytical solution of geometric measure of coherence for a special kind of mixed states. On the other hand, another algorithm is proposed to calculate the geometric measure of entanglement for arbitrary two-qubit and qubit-qutrit states, and some special kinds of higher dimensional mixed states. For other states, the algorithm can get a lower bound of the geometric measure of entanglement. Randomly generated two-qubit states, the isotropic states and the Werner states are tested. Furthermore, we compare our numerical results with some analytical results, which coincide with each other.

preprint2016arXiv

Quantum entanglement in three accelerating qubits coupled to scalar field

We consider quantum entanglement of three accelerating qubits, each of which is locally coupled with a real scalar field, without causal influence among the qubits or among the fields. The initial states are assumed to be the GHZ and W states, which are the two representative three-partite entangled states. For each initial state, we study how various kinds of entanglement depend on the accelerations of the three qubits. All kinds of entanglement eventually suddenly die if at least two of three qubits have large enough accelerations. This result implies the eventual sudden death of all kinds of entanglement among three particles coupled with scalar fields when they are sufficiently close to the horizon of a black hole.

preprint2015arXiv

Killing quantum entanglement by acceleration or a black hole

We consider two entangled accelerating qubits coupled with real scalar fields, each described by the Unruh-Wald model. It is demonstrated that because of the Unruh effect, the bipartite entanglement of the two qubits suddenly dies when the acceleration of one or more qubits are large enough. We also consider three entangled accelerating qubits in GHZ state and in W state, with equal acceleration-frequency ratio, and found that in either state, the tripartite entanglement suddenly dies at a certain value of acceleration-frequency ratio. The equivalence between the Rindler metric and the Schwarzchild metric in the vicinity of the horizon of a black hole implies that for the two entangled qubits outside a black hole, the entanglement suddenly dies when one or both of the qubits are close enough to the horizon, while for the three entangled qubits in GHZ or W state, the tripartite entanglement suddenly dies when these qubits are close enough to the horizon.

Yue Dai

What is connected

Connect this record

See the researcher in context

Building this map preview

10 published item(s)

BEAM: Binary Expert Activation Masking for Dynamic Routing in MoE

An Analysis of Deep Reinforcement Learning Agents for Text-based Games

ImageArg: A Multi-modal Tweet Dataset for Image Persuasiveness Mining

Infrared Radiation of Graphene Electrothermal Film Triggered Alpha and Theta Brainwaves

Not all entangled states are useful for ancilla-assisted quantum process tomography

Observation of the tradeoff between internal quantum nonseparability and external classical correlations

Experimentally accessible lower bounds for genuine multipartite entanglement and coherence measures

Numerical and analytical results for geometric measure of coherence and geometric measure of entanglement

Quantum entanglement in three accelerating qubits coupled to scalar field

Killing quantum entanglement by acceleration or a black hole