Researcher profile

Yi Zhong

Yi Zhong contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
13works
0followers
11topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

13 published item(s)

preprint2026arXiv

How do Humans Process AI-generated Hallucination Contents: a Neuroimaging Study

While AI-generated hallucinations pose considerable risks, the underlying cognitive mechanisms by which humans can successfully recognize or be misled by these hallucinations remain unclear. To address this problem, this paper explores humans' neural dynamics to characterize how the brain processes hallucinated content. We record EEG signals from 27 participants while they are performing a verification task to judge the correctness of image descriptions generated by a multi-modal large language model (MLLM). Based on an averaged event-related potential (ERP) study, we reveal that multiple cognitive processes, e.g., semantic integration, inferential processing, memory retrieval, and cognitive load, exhibit distinct patterns when humans process hallucinated versus non-hallucinated content. Notably, neural responses to hallucinations that were misjudged versus correctly judged by human participants showed significant differences. This indicates that misjudged AI-generated hallucinations failed to trigger the standard neurocognitive fact verification pathway.

preprint2023arXiv

Thick branes in Born-Infeld determinantal gravity in Weitzenböck spacetime

By adopting the idea of Born-Infeld electromagnetism, the Born-Infeld determinantal gravity in Weitzenböck spacetime provides a way to smooth the Big Bang singularity at the classical level. We consider a thick braneworld scenario in the higher-dimensional extension of this gravity, and investigate the torsion effects on the brane structure and gravitational perturbation. For three particular parameter choices, analytic domain wall solutions are obtained. They have a similar brane configuration that the brane thickness becomes thinner as the spacetime torsion gets stronger. For each model, the massless graviton is localized on the brane with the width of localization decreasing with the enhancement of the spacetime torsion, while the massive gravitons propagate in the bulk and contribute a correction term proportional to ${1}/{(k r)^{3}}$ to the Newtonian potential. A sparsity constraint on the fundamental 5-dimensional gravitational scale is estimated from the gravitational experiment. Moreover, the parameter ranges in which the Kaluza-Klein gravitons are tachyonic free are analyzed.

preprint2022arXiv

CoSCL: Cooperation of Small Continual Learners is Stronger than a Big One

Continual learning requires incremental compatibility with a sequence of tasks. However, the design of model architecture remains an open question: In general, learning all tasks with a shared set of parameters suffers from severe interference between tasks; while learning each task with a dedicated parameter subspace is limited by scalability. In this work, we theoretically analyze the generalization errors for learning plasticity and memory stability in continual learning, which can be uniformly upper-bounded by (1) discrepancy between task distributions, (2) flatness of loss landscape and (3) cover of parameter space. Then, inspired by the robust biological learning system that processes sequential experiences with multiple parallel compartments, we propose Cooperation of Small Continual Learners (CoSCL) as a general strategy for continual learning. Specifically, we present an architecture with a fixed number of narrower sub-networks to learn all incremental tasks in parallel, which can naturally reduce the two errors through improving the three components of the upper bound. To strengthen this advantage, we encourage to cooperate these sub-networks by penalizing the difference of predictions made by their feature representations. With a fixed parameter budget, CoSCL can improve a variety of representative continual learning approaches by a large margin (e.g., up to 10.64% on CIFAR-100-SC, 9.33% on CIFAR-100-RS, 11.45% on CUB-200-2011 and 6.72% on Tiny-ImageNet) and achieve the new state-of-the-art performance.

preprint2022arXiv

Interference-Limited Ultra-Reliable and Low-Latency Communications: Graph Neural Networks or Stochastic Geometry?

In this paper, we aim to improve the Quality-of-Service (QoS) of Ultra-Reliability and Low-Latency Communications (URLLC) in interference-limited wireless networks. To obtain time diversity within the channel coherence time, we first put forward a random repetition scheme that randomizes the interference power. Then, we optimize the number of reserved slots and the number of repetitions for each packet to minimize the QoS violation probability, defined as the percentage of users that cannot achieve URLLC. We build a cascaded Random Edge Graph Neural Network (REGNN) to represent the repetition scheme and develop a model-free unsupervised learning method to train it. We analyze the QoS violation probability using stochastic geometry in a symmetric scenario and apply a model-based Exhaustive Search (ES) method to find the optimal solution. Simulation results show that in the symmetric scenario, the QoS violation probabilities achieved by the model-free learning method and the model-based ES method are nearly the same. In more general scenarios, the cascaded REGNN generalizes very well in wireless networks with different scales, network topologies, cell densities, and frequency reuse factors. It outperforms the model-based ES method in the presence of the model mismatch.

preprint2022arXiv

Memory Replay with Data Compression for Continual Learning

Continual learning needs to overcome catastrophic forgetting of the past. Memory replay of representative old training samples has been shown as an effective solution, and achieves the state-of-the-art (SOTA) performance. However, existing work is mainly built on a small memory buffer containing a few original data, which cannot fully characterize the old data distribution. In this work, we propose memory replay with data compression (MRDC) to reduce the storage cost of old training samples and thus increase their amount that can be stored in the memory buffer. Observing that the trade-off between the quality and quantity of compressed data is highly nontrivial for the efficacy of memory replay, we propose a novel method based on determinantal point processes (DPPs) to efficiently determine an appropriate compression quality for currently-arrived training samples. In this way, using a naive data compression algorithm with a properly selected quality can largely boost recent strong baselines by saving more compressed data in a limited storage space. We extensively validate this across several benchmarks of class-incremental learning and in a realistic scenario of object detection for autonomous driving.

preprint2021arXiv

First-order formalism and thick branes in mimetic gravity

In this paper, we investigate thick branes generated by a scalar field in mimetic gravity theory. By introducing two auxiliary super-potentials, we transform the second-order field equations of the system into a set of first-order equations. With this first-order formalism, several types of analytical thick brane solutions are obtained. Then, tensor and scalar perturbations are analysed. We find that both kinds of perturbations are stable. The effective potentials for the tensor and scalar perturbations are dual to each other. The tensor zero mode can be localized on the brane while the scalar zero mode cannot. Thus, the four-dimensional Newtonian potential can be recovered on the brane.

preprint2021arXiv

On Meta Distribution and Local Delay for Cache-Enabled Networks with Random DTX: Analysis and Optimization

A fine-grained analysis of the cache-enabled networks is crucial for system design. In this paper, we focus on the meta distribution of the signal-to-interference ratio (SIR) for the cache-enabled networks where the locations of the base stations (BSs) are modeled as a Poisson point process (PPP). With the application of the random caching and the random discontinuous transmission (DTX) schemes, we derive the moments of the conditional successful transmission probability (STP), the exact meta distribution and its beta approximation by utilizing stochastic geometry. The closed-form expressions of the mean and variance of the local delay (i.e., the network jitter) are also derived. We then consider the maximization of the mean STP and the minimization of the average system transmission delay by jointly optimizing the caching probability and the BS active probability. Finally, the numerical results demonstrate the superiority of the proposed optimization schemes over the existing caching strategies and reveal the impacts of the key network parameters on the cache-enabled networks in terms of mean STP, STP variance, meta distribution, mean local delay and network jitter.

preprint2021arXiv

Sequential Convolutional Recurrent Neural Networks for Fast Automatic Modulation Classification

A novel and efficient end-to-end learning model for automatic modulation classification is proposed for wireless spectrum monitoring applications, which automatically learns from the time domain in-phase and quadrature data without requiring the design of hand-crafted expert features. With the intuition of convolutional layers with pooling serving as the role of front-end feature distillation and dimensionality reduction, sequential convolutional recurrent neural networks are developed to take complementary advantage of parallel computing capability of convolutional neural networks and temporal sensitivity of recurrent neural networks. Experimental results demonstrate that the proposed architecture delivers overall superior performance in signal to noise ratio range above -10~dB, and achieves significantly improved classification accuracy from 80\% to 92.1\% at high signal to noise ratio range, while drastically reduces the average training and prediction time by approximately 74% and 67%, respectively. Response patterns learned by the proposed architecture are visualized to better understand the physics of the model. Furthermore, a comparative study is performed to investigate the impacts of various sequential convolutional recurrent neural network structure settings on classification performance. A representative sequential convolutional recurrent neural network architecture with the two-layer convolutional neural network and subsequent two-layer long short-term memory neural network is developed to suggest the option for fast automatic modulation classification.

preprint2020arXiv

An Actor-Critic-Based UAV-BSs Deployment Method for Dynamic Environments

In this paper, the real-time deployment of unmanned aerial vehicles (UAVs) as flying base stations (BSs) for optimizing the throughput of mobile users is investigated for UAV networks. This problem is formulated as a time-varying mixed-integer non-convex programming (MINP) problem, which is challenging to find an optimal solution in a short time with conventional optimization techniques. Hence, we propose an actor-critic-based (AC-based) deep reinforcement learning (DRL) method to find near-optimal UAV positions at every moment. In the proposed method, the process searching for the solution iteratively at a particular moment is modeled as a Markov decision process (MDP). To handle infinite state and action spaces and improve the robustness of the decision process, two powerful neural networks (NNs) are configured to evaluate the UAV position adjustments and make decisions, respectively. Compared with the heuristic algorithm, sequential least-squares programming and fixed UAVs methods, simulation results have shown that the proposed method outperforms these three benchmarks in terms of the throughput at every moment in UAV networks.

preprint2020arXiv

Effect of Spatial and Temporal Traffic Statistics on the Performance of Wireless Networks

The traffic in wireless networks has become diverse and fluctuating both spatially and temporally due to the emergence of new wireless applications and the complexity of scenarios. The purpose of this paper is to quantitatively analyze the impact of the wireless traffic, which fluctuates both spatially and temporally, on the performance of the wireless networks. Specially, we propose to combine the tools from stochastic geometry and queueing theory to model the spatial and temporal fluctuation of traffic, which to our best knowledge has seldom been evaluated analytically. We derive the spatial and temporal statistics, the total arrival rate, the stability of queues and the delay of users by considering two different spatial properties of traffic, i.e., the uniformly and non-uniformly distributed cases. The numerical results indicate that although the fluctuation of traffic (reflected by the variance of total arrival rate) when the users are clustered is much fiercer than that when the users are uniformly distributed, the unstable probability is smaller. Our work provides a useful reference for the design of wireless networks when the complex spatio-temporal fluctuation of the traffic is considered.

preprint2020arXiv

Spatio-temporal Modeling for Massive and Sporadic Access

The vision for smart city imperiously appeals to the implementation of Internet-of-Things (IoT), some features of which, such as massive access and bursty short packet transmissions, require new methods to enable the cellular system to seamlessly support its integration. Rigorous theoretical analysis is indispensable to obtain constructive insight for the networking design of massive access. In this paper, we propose and define the notion of massive and sporadic access (MSA) to quantitatively describe the massive access of IoT devices. We evaluate the temporal correlation of interference and successful transmission events, and verify that such correlation is negligible in the scenario of MSA. In view of this, in order to resolve the difficulty in any precise spatio-temporal analysis where complex interactions persist among the queues, we propose an approximation that all nodes are moving so fast that their locations are independent at different time slots. Furthermore, we compare the original static network and the equivalent network with high mobility to demonstrate the effectiveness of the proposed approximation approach. The proposed approach is promising for providing a convenient and general solution to evaluate and design the IoT network with massive and sporadic access.

preprint2020arXiv

Triple Memory Networks: a Brain-Inspired Method for Continual Learning

Continual acquisition of novel experience without interfering previously learned knowledge, i.e. continual learning, is critical for artificial neural networks, but limited by catastrophic forgetting. A neural network adjusts its parameters when learning a new task, but then fails to conduct the old tasks well. By contrast, the brain has a powerful ability to continually learn new experience without catastrophic interference. The underlying neural mechanisms possibly attribute to the interplay of hippocampus-dependent memory system and neocortex-dependent memory system, mediated by prefrontal cortex. Specifically, the two memory systems develop specialized mechanisms to consolidate information as more specific forms and more generalized forms, respectively, and complement the two forms of information in the interplay. Inspired by such brain strategy, we propose a novel approach named triple memory networks (TMNs) for continual learning. TMNs model the interplay of hippocampus, prefrontal cortex and sensory cortex (a neocortex region) as a triple-network architecture of generative adversarial networks (GAN). The input information is encoded as specific representation of the data distributions in a generator, or generalized knowledge of solving tasks in a discriminator and a classifier, with implementing appropriate brain-inspired algorithms to alleviate catastrophic forgetting in each module. Particularly, the generator replays generated data of the learned tasks to the discriminator and the classifier, both of which are implemented with a weight consolidation regularizer to complement the lost information in generation process. TMNs achieve new state-of-the-art performance on a variety of class-incremental learning benchmarks on MNIST, SVHN, CIFAR-10 and ImageNet-50, comparing with strong baseline methods.

preprint2020arXiv

Using Deep Convolutional Neural Networks to Diagnose COVID-19 From Chest X-Ray Images

The COVID-19 epidemic has become a major safety and health threat worldwide. Imaging diagnosis is one of the most effective ways to screen COVID-19. This project utilizes several open-source or public datasets to present an open-source dataset of COVID-19 CXRs, named COVID-19-CXR-Dataset, and introduces a deep convolutional neural network model. The model validates on 740 test images and achieves 87.3% accuracy, 89.67 % precision, and 84.46% recall, and correctly classifies 98 out of 100 COVID-19 x-ray images in test set with more than 81% prediction probability under the condition of 95% confidence interval. This project may serve as a reference for other researchers aiming to advance the development of deep learning applications in medical imaging.