Source author record

Yi Zhong

Yi Zhong appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Networking and Internet Architecture gr-qc hep-th Information Theory math.IT eess.SP Artificial Intelligence cond-mat.stat-mech physics.class-ph Computation and Language Computer Vision eess.IV nlin.CD Performance

Catalog footprint

What is connected

24works

15topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

How do Humans Process AI-generated Hallucination Contents: a Neuroimaging Study

While AI-generated hallucinations pose considerable risks, the underlying cognitive mechanisms by which humans can successfully recognize or be misled by these hallucinations remain unclear. To address this problem, this paper explores humans' neural dynamics to characterize how the brain processes hallucinated content. We record EEG signals from 27 participants while they are performing a verification task to judge the correctness of image descriptions generated by a multi-modal large language model (MLLM). Based on an averaged event-related potential (ERP) study, we reveal that multiple cognitive processes, e.g., semantic integration, inferential processing, memory retrieval, and cognitive load, exhibit distinct patterns when humans process hallucinated versus non-hallucinated content. Notably, neural responses to hallucinations that were misjudged versus correctly judged by human participants showed significant differences. This indicates that misjudged AI-generated hallucinations failed to trigger the standard neurocognitive fact verification pathway.

preprint2023arXiv

Thick branes in Born-Infeld determinantal gravity in Weitzenböck spacetime

By adopting the idea of Born-Infeld electromagnetism, the Born-Infeld determinantal gravity in Weitzenböck spacetime provides a way to smooth the Big Bang singularity at the classical level. We consider a thick braneworld scenario in the higher-dimensional extension of this gravity, and investigate the torsion effects on the brane structure and gravitational perturbation. For three particular parameter choices, analytic domain wall solutions are obtained. They have a similar brane configuration that the brane thickness becomes thinner as the spacetime torsion gets stronger. For each model, the massless graviton is localized on the brane with the width of localization decreasing with the enhancement of the spacetime torsion, while the massive gravitons propagate in the bulk and contribute a correction term proportional to ${1}/{(k r)^{3}}$ to the Newtonian potential. A sparsity constraint on the fundamental 5-dimensional gravitational scale is estimated from the gravitational experiment. Moreover, the parameter ranges in which the Kaluza-Klein gravitons are tachyonic free are analyzed.

preprint2022arXiv

CoSCL: Cooperation of Small Continual Learners is Stronger than a Big One

Continual learning requires incremental compatibility with a sequence of tasks. However, the design of model architecture remains an open question: In general, learning all tasks with a shared set of parameters suffers from severe interference between tasks; while learning each task with a dedicated parameter subspace is limited by scalability. In this work, we theoretically analyze the generalization errors for learning plasticity and memory stability in continual learning, which can be uniformly upper-bounded by (1) discrepancy between task distributions, (2) flatness of loss landscape and (3) cover of parameter space. Then, inspired by the robust biological learning system that processes sequential experiences with multiple parallel compartments, we propose Cooperation of Small Continual Learners (CoSCL) as a general strategy for continual learning. Specifically, we present an architecture with a fixed number of narrower sub-networks to learn all incremental tasks in parallel, which can naturally reduce the two errors through improving the three components of the upper bound. To strengthen this advantage, we encourage to cooperate these sub-networks by penalizing the difference of predictions made by their feature representations. With a fixed parameter budget, CoSCL can improve a variety of representative continual learning approaches by a large margin (e.g., up to 10.64% on CIFAR-100-SC, 9.33% on CIFAR-100-RS, 11.45% on CUB-200-2011 and 6.72% on Tiny-ImageNet) and achieve the new state-of-the-art performance.

preprint2022arXiv

Interference-Limited Ultra-Reliable and Low-Latency Communications: Graph Neural Networks or Stochastic Geometry?

In this paper, we aim to improve the Quality-of-Service (QoS) of Ultra-Reliability and Low-Latency Communications (URLLC) in interference-limited wireless networks. To obtain time diversity within the channel coherence time, we first put forward a random repetition scheme that randomizes the interference power. Then, we optimize the number of reserved slots and the number of repetitions for each packet to minimize the QoS violation probability, defined as the percentage of users that cannot achieve URLLC. We build a cascaded Random Edge Graph Neural Network (REGNN) to represent the repetition scheme and develop a model-free unsupervised learning method to train it. We analyze the QoS violation probability using stochastic geometry in a symmetric scenario and apply a model-based Exhaustive Search (ES) method to find the optimal solution. Simulation results show that in the symmetric scenario, the QoS violation probabilities achieved by the model-free learning method and the model-based ES method are nearly the same. In more general scenarios, the cascaded REGNN generalizes very well in wireless networks with different scales, network topologies, cell densities, and frequency reuse factors. It outperforms the model-based ES method in the presence of the model mismatch.

preprint2022arXiv

Memory Replay with Data Compression for Continual Learning

Continual learning needs to overcome catastrophic forgetting of the past. Memory replay of representative old training samples has been shown as an effective solution, and achieves the state-of-the-art (SOTA) performance. However, existing work is mainly built on a small memory buffer containing a few original data, which cannot fully characterize the old data distribution. In this work, we propose memory replay with data compression (MRDC) to reduce the storage cost of old training samples and thus increase their amount that can be stored in the memory buffer. Observing that the trade-off between the quality and quantity of compressed data is highly nontrivial for the efficacy of memory replay, we propose a novel method based on determinantal point processes (DPPs) to efficiently determine an appropriate compression quality for currently-arrived training samples. In this way, using a naive data compression algorithm with a properly selected quality can largely boost recent strong baselines by saving more compressed data in a limited storage space. We extensively validate this across several benchmarks of class-incremental learning and in a realistic scenario of object detection for autonomous driving.

preprint2021arXiv

First-order formalism and thick branes in mimetic gravity

In this paper, we investigate thick branes generated by a scalar field in mimetic gravity theory. By introducing two auxiliary super-potentials, we transform the second-order field equations of the system into a set of first-order equations. With this first-order formalism, several types of analytical thick brane solutions are obtained. Then, tensor and scalar perturbations are analysed. We find that both kinds of perturbations are stable. The effective potentials for the tensor and scalar perturbations are dual to each other. The tensor zero mode can be localized on the brane while the scalar zero mode cannot. Thus, the four-dimensional Newtonian potential can be recovered on the brane.

preprint2021arXiv

On Meta Distribution and Local Delay for Cache-Enabled Networks with Random DTX: Analysis and Optimization

A fine-grained analysis of the cache-enabled networks is crucial for system design. In this paper, we focus on the meta distribution of the signal-to-interference ratio (SIR) for the cache-enabled networks where the locations of the base stations (BSs) are modeled as a Poisson point process (PPP). With the application of the random caching and the random discontinuous transmission (DTX) schemes, we derive the moments of the conditional successful transmission probability (STP), the exact meta distribution and its beta approximation by utilizing stochastic geometry. The closed-form expressions of the mean and variance of the local delay (i.e., the network jitter) are also derived. We then consider the maximization of the mean STP and the minimization of the average system transmission delay by jointly optimizing the caching probability and the BS active probability. Finally, the numerical results demonstrate the superiority of the proposed optimization schemes over the existing caching strategies and reveal the impacts of the key network parameters on the cache-enabled networks in terms of mean STP, STP variance, meta distribution, mean local delay and network jitter.

preprint2021arXiv

Sequential Convolutional Recurrent Neural Networks for Fast Automatic Modulation Classification

A novel and efficient end-to-end learning model for automatic modulation classification is proposed for wireless spectrum monitoring applications, which automatically learns from the time domain in-phase and quadrature data without requiring the design of hand-crafted expert features. With the intuition of convolutional layers with pooling serving as the role of front-end feature distillation and dimensionality reduction, sequential convolutional recurrent neural networks are developed to take complementary advantage of parallel computing capability of convolutional neural networks and temporal sensitivity of recurrent neural networks. Experimental results demonstrate that the proposed architecture delivers overall superior performance in signal to noise ratio range above -10~dB, and achieves significantly improved classification accuracy from 80\% to 92.1\% at high signal to noise ratio range, while drastically reduces the average training and prediction time by approximately 74% and 67%, respectively. Response patterns learned by the proposed architecture are visualized to better understand the physics of the model. Furthermore, a comparative study is performed to investigate the impacts of various sequential convolutional recurrent neural network structure settings on classification performance. A representative sequential convolutional recurrent neural network architecture with the two-layer convolutional neural network and subsequent two-layer long short-term memory neural network is developed to suggest the option for fast automatic modulation classification.

preprint2020arXiv

An Actor-Critic-Based UAV-BSs Deployment Method for Dynamic Environments

In this paper, the real-time deployment of unmanned aerial vehicles (UAVs) as flying base stations (BSs) for optimizing the throughput of mobile users is investigated for UAV networks. This problem is formulated as a time-varying mixed-integer non-convex programming (MINP) problem, which is challenging to find an optimal solution in a short time with conventional optimization techniques. Hence, we propose an actor-critic-based (AC-based) deep reinforcement learning (DRL) method to find near-optimal UAV positions at every moment. In the proposed method, the process searching for the solution iteratively at a particular moment is modeled as a Markov decision process (MDP). To handle infinite state and action spaces and improve the robustness of the decision process, two powerful neural networks (NNs) are configured to evaluate the UAV position adjustments and make decisions, respectively. Compared with the heuristic algorithm, sequential least-squares programming and fixed UAVs methods, simulation results have shown that the proposed method outperforms these three benchmarks in terms of the throughput at every moment in UAV networks.

preprint2020arXiv

Effect of Spatial and Temporal Traffic Statistics on the Performance of Wireless Networks

The traffic in wireless networks has become diverse and fluctuating both spatially and temporally due to the emergence of new wireless applications and the complexity of scenarios. The purpose of this paper is to quantitatively analyze the impact of the wireless traffic, which fluctuates both spatially and temporally, on the performance of the wireless networks. Specially, we propose to combine the tools from stochastic geometry and queueing theory to model the spatial and temporal fluctuation of traffic, which to our best knowledge has seldom been evaluated analytically. We derive the spatial and temporal statistics, the total arrival rate, the stability of queues and the delay of users by considering two different spatial properties of traffic, i.e., the uniformly and non-uniformly distributed cases. The numerical results indicate that although the fluctuation of traffic (reflected by the variance of total arrival rate) when the users are clustered is much fiercer than that when the users are uniformly distributed, the unstable probability is smaller. Our work provides a useful reference for the design of wireless networks when the complex spatio-temporal fluctuation of the traffic is considered.

preprint2020arXiv

Spatio-temporal Modeling for Massive and Sporadic Access

The vision for smart city imperiously appeals to the implementation of Internet-of-Things (IoT), some features of which, such as massive access and bursty short packet transmissions, require new methods to enable the cellular system to seamlessly support its integration. Rigorous theoretical analysis is indispensable to obtain constructive insight for the networking design of massive access. In this paper, we propose and define the notion of massive and sporadic access (MSA) to quantitatively describe the massive access of IoT devices. We evaluate the temporal correlation of interference and successful transmission events, and verify that such correlation is negligible in the scenario of MSA. In view of this, in order to resolve the difficulty in any precise spatio-temporal analysis where complex interactions persist among the queues, we propose an approximation that all nodes are moving so fast that their locations are independent at different time slots. Furthermore, we compare the original static network and the equivalent network with high mobility to demonstrate the effectiveness of the proposed approximation approach. The proposed approach is promising for providing a convenient and general solution to evaluate and design the IoT network with massive and sporadic access.

preprint2020arXiv

Triple Memory Networks: a Brain-Inspired Method for Continual Learning

Continual acquisition of novel experience without interfering previously learned knowledge, i.e. continual learning, is critical for artificial neural networks, but limited by catastrophic forgetting. A neural network adjusts its parameters when learning a new task, but then fails to conduct the old tasks well. By contrast, the brain has a powerful ability to continually learn new experience without catastrophic interference. The underlying neural mechanisms possibly attribute to the interplay of hippocampus-dependent memory system and neocortex-dependent memory system, mediated by prefrontal cortex. Specifically, the two memory systems develop specialized mechanisms to consolidate information as more specific forms and more generalized forms, respectively, and complement the two forms of information in the interplay. Inspired by such brain strategy, we propose a novel approach named triple memory networks (TMNs) for continual learning. TMNs model the interplay of hippocampus, prefrontal cortex and sensory cortex (a neocortex region) as a triple-network architecture of generative adversarial networks (GAN). The input information is encoded as specific representation of the data distributions in a generator, or generalized knowledge of solving tasks in a discriminator and a classifier, with implementing appropriate brain-inspired algorithms to alleviate catastrophic forgetting in each module. Particularly, the generator replays generated data of the learned tasks to the discriminator and the classifier, both of which are implemented with a weight consolidation regularizer to complement the lost information in generation process. TMNs achieve new state-of-the-art performance on a variety of class-incremental learning benchmarks on MNIST, SVHN, CIFAR-10 and ImageNet-50, comparing with strong baseline methods.

preprint2020arXiv

Using Deep Convolutional Neural Networks to Diagnose COVID-19 From Chest X-Ray Images

The COVID-19 epidemic has become a major safety and health threat worldwide. Imaging diagnosis is one of the most effective ways to screen COVID-19. This project utilizes several open-source or public datasets to present an open-source dataset of COVID-19 CXRs, named COVID-19-CXR-Dataset, and introduces a deep convolutional neural network model. The model validates on 740 test images and achieves 87.3% accuracy, 89.67 % precision, and 84.46% recall, and correctly classifies 98 out of 100 COVID-19 x-ray images in test set with more than 81% prediction probability under the condition of 95% confidence interval. This project may serve as a reference for other researchers aiming to advance the development of deep learning applications in medical imaging.

preprint2016arXiv

De Sitter and power-law solutions in some models of modified gravity

Inspired by some recent works of Lovelock Brans-Dicke gravity and mimetic gravity, cosmology solutions in extensions of these two modified gravities are investigated. A non-local term is added to the Lovelock Brans-Dicke action and Gauss-Bonnet terms to the mimetic action,correspondingly. De Sitter and power scale factor solutions are then obtained in both theories. They can provide natural new approaches to a more accurate description of the unverse evolution.

preprint2016arXiv

Time-Dependent Scalar Fields in Modified Gravities in a Stationary Spacetime

Most no-hair theorems involve the assumption that the scalar field is independent of time. Recently in [Phys. Rev. D90 (2014) 041501(R)] the existence of time-dependent scalar hair outside a stationary black hole in general relativity was ruled out. We generalize this work to modified gravities and non-minimally coupled scalar field with an additional assumption that the spacetime is axisymmetric. It is shown that in higher-order gravity such as metric $f(R)$ gravity the time-dependent scalar hair doesn't exist. While in Palatini $f(R)$ gravity and non-minimally coupled case the time-dependent scalar hair may exist.

preprint2015arXiv

Localization and mass spectra of various matter fields on scalar-tensor brane

Recently, a new scalar-tensor braneworld model was presented in [Phys. Rev. \textbf{D 86} (2012) 127502]. It not only solves the gauge hierarchy problem but also reproduces a correct Friedmann-like equation on the brane. In this new model, there are two different brane solutions, for which the mass spectra of gravity on the brane are the same. In this paper, we investigate localization and mass spectra of various bulk matter fields (i.e., scalar, vector, Kalb-Ramond, and fermion fields) on the brane. It is shown that the zero modes of all the matter fields can be localized on the positive tension brane under some conditions, and the mass spectra of each kind of bulk matter field for the two brane solutions are different, which implies that the two brane solutions are not physically equivalent.

preprint2015arXiv

Normal heat conduction in lattice models with asymmetry harmonic interparticle interactions

We study the thermal conduction behaviors of one-dimensional lattice models with asymmetry harmonic interparticle interactions in this paper. Normal thermal conductivity independent of the system size is observed when the lattice chains are long enough. Because only the harmonic interactions are involved, the result confirms without ambiguous interpretation that the asymmetry plays key role in resulting in the normal thermal conduction in one-dimensional momentum conserving lattices. Both equilibrium and nonequilibrium simulations are performed to support the conclusion.

preprint2015arXiv

Shadow of noncommutative geometry inspired black hole

In this paper, the shadow casted by the rotating black hole inspired by noncommutative geometry is investigated. In addition to the dimensionless spin parameter $a/M_{0}$ with $M_{0}$ black hole mass and inclination angle $i$, the dimensionless noncommutative parameter $\sqrt{\vartheta}/M_{0}$ is also found to affect the shape of the black hole shadow. The result shows that the size of the shadow slightly decreases with the parameter $\sqrt{\vartheta}/M_{0}$, while the distortion increases with it. Compared to the Kerr black hole, the parameter $\sqrt{\vartheta}/M_{0}$ increases the deformation of the shadow. This may offer a way to distinguish noncommutative geometry inspired black hole from Kerr one via astronomical instruments in the near future.

preprint2014arXiv

Warped Brane worlds in Critical Gravity

We investigate the brane models in arbitrary dimensional critical gravity presented in [Phys. Rev. Lett. 106, 181302 (2011)]. For the model of the thin branes with codimension one, the Gibbons-Hawking surface term and the junction conditions are derived, with which the analytical solutions for the flat, AdS, and dS branes are obtained at the critical point of the critical gravity. It is found that all these branes are embedded in an AdS$_{n}$ spacetime, but, in general, the effective cosmological constant $Λ$ of the AdS$_{n}$ spacetime is not equal to the naked one $Λ_0$ in the critical gravity, which can be positive, zero, and negative. Another interesting result is that the brane tension can also be positive, zero, or negative, depending on the symmetry of the thin brane and the values of the parameters of the theory, which is very different from the case in general relativity. It is shown that the mass hierarchy problem can be solved in the braneworld model in the higher-derivative critical gravity. We also study the thick brane model and find analytical and numerical solutions of the flat, AdS, and dS branes. It is find that some branes will have inner structure when some parameters of the theory are larger than their critical values, which may result in resonant KK modes for some bulk matter fields. The flat branes with positive energy density and AdS branes with negative energy density are embedded in an $n$-dimensional AdS spacetime, while the dS branes with positive energy density are embedded in an $n$-dimensional Minkowski one.

preprint2013arXiv

Managing Interference Correlation Through Random Medium Access

The capacity of wireless networks is fundamentally limited by interference. However, little research has focused on the interference correlation, which may greatly increase the local delay (namely the number of time slots required for a node to successfully transmit a packet). This paper focuses on the question that whether increasing randomness in the MAC, such as frequency-hopping multiple access (FHMA) and ALOHA, helps to reduce the effect of interference correlation. We derive closed-form results for the mean and variance of the local delay for the two MAC protocols and evaluate the optimal parameters that minimize the mean local delay. Based on the optimal parameters, we propose the definitions of two operation regimes: correlation-limited regime and bandwidth-limited regime. Our results reveal that while the mean local delays for FHMA with N sub-bands and for ALOHA with transmit probability p are the same when p=1/N with thermal noise ignored, significant difference exists between the variances. At last, we evaluate the mean delay-jitter tradeoff and the bounds on the tail probability of the local delay, which shed key insights into the system design.

preprint2013arXiv

Multi-channel Hybrid Access Femtocells: A Stochastic Geometric Analysis

For two-tier networks consisting of macrocells and femtocells, the channel access mechanism can be configured to be open access, closed access, or hybrid access. Hybrid access arises as a compromise between open and closed access mechanisms, in which a fraction of available spectrum resource is shared to nonsubscribers while the remaining reserved for subscribers. This paper focuses on a hybrid access mechanism for multi-channel femtocells which employ orthogonal spectrum access schemes. Considering a randomized channel assignment strategy, we analyze the performance in the downlink. Using stochastic geometry as technical tools, we model the distribution of femtocells as Poisson point process or Neyman-Scott cluster process and derive the distributions of signal-to-interference-plus-noise ratios, and mean achievable rates, of both nonsubscribers and subscribers. The established expressions are amenable to numerical evaluation, and shed key insights into the performance tradeoff between subscribers and nonsubscribers. The analytical results are corroborated by numerical simulations.

preprint2013arXiv

Resonances of Kalb-Ramond field on symmetric and asymmetric thick branes

In this paper, we investigate the localization of the Kalb-Ramond field on symmetric and asymmetric thick branes, which are generated by a background scalar field. In order to localize the Kalb-Ramond field, we introduce a coupling with the background scalar field, and find that there exist some Kaluza-Klein resonant modes. For the case of symmetric brane, we seek the resonances by using the relative probability method and transfer matrix method, and obtain the same result for the two methods. For the asymmetric case, we use the transfer matrix method. We find that the number of resonances will decrease with the increase of the asymmetry.

preprint2012arXiv

Normal heat conduction in one dimensional momentum conserving lattices with asymmetric interactions

The heat conduction behavior of one dimensional momentum conserving lattice systems with asymmetric interparticle interactions is numerically investigated. It is found that with certain degree of interaction asymmetry, the heat conductivity measured in nonequilibrium stationary states converges in the thermodynamical limit, in clear contrast to the well accepted viewpoint that Fourier's law is generally violated in low dimensional momentum conserving systems. It suggests in nonequilibrium stationary states the mass gradient resulted from the asymmetric interactions may provide an additional phonon scattering mechanism other than that due to the nonlinear interactions.

preprint2012arXiv

Stochastic Analysis of Mean Interference for RTS/CTS Mechanism

The RTS/CTS handshake mechanism in WLAN is studied using stochastic geometry. The effect of RTS/CTS is treated as a thinning procedure for a spatially distributed point process that models the potential transceivers in a WLAN, and the resulting concurrent transmission processes are described. Exact formulas for the intensity of the concurrent transmission processes and the mean interference experienced by a typical receiver are established. The analysis yields useful results for understanding how the design parameters of RTS/CTS affect the network interference.

Yi Zhong

What is connected

Connect this record

See the researcher in context

Building this map preview

24 published item(s)

How do Humans Process AI-generated Hallucination Contents: a Neuroimaging Study

Thick branes in Born-Infeld determinantal gravity in Weitzenböck spacetime

CoSCL: Cooperation of Small Continual Learners is Stronger than a Big One

Interference-Limited Ultra-Reliable and Low-Latency Communications: Graph Neural Networks or Stochastic Geometry?

Memory Replay with Data Compression for Continual Learning

First-order formalism and thick branes in mimetic gravity

On Meta Distribution and Local Delay for Cache-Enabled Networks with Random DTX: Analysis and Optimization

Sequential Convolutional Recurrent Neural Networks for Fast Automatic Modulation Classification

An Actor-Critic-Based UAV-BSs Deployment Method for Dynamic Environments

Effect of Spatial and Temporal Traffic Statistics on the Performance of Wireless Networks

Spatio-temporal Modeling for Massive and Sporadic Access

Triple Memory Networks: a Brain-Inspired Method for Continual Learning

Using Deep Convolutional Neural Networks to Diagnose COVID-19 From Chest X-Ray Images

De Sitter and power-law solutions in some models of modified gravity

Time-Dependent Scalar Fields in Modified Gravities in a Stationary Spacetime

Localization and mass spectra of various matter fields on scalar-tensor brane

Normal heat conduction in lattice models with asymmetry harmonic interparticle interactions

Shadow of noncommutative geometry inspired black hole

Warped Brane worlds in Critical Gravity

Managing Interference Correlation Through Random Medium Access

Multi-channel Hybrid Access Femtocells: A Stochastic Geometric Analysis

Resonances of Kalb-Ramond field on symmetric and asymmetric thick branes

Normal heat conduction in one dimensional momentum conserving lattices with asymmetric interactions

Stochastic Analysis of Mean Interference for RTS/CTS Mechanism