Source author record

Zhi Zhou

Zhi Zhou appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.NA Numerical Analysis Networking and Internet Architecture Distributed, Parallel, and Cluster Computing Artificial Intelligence Social and Information Networks Machine Learning math.OC Computer Vision eess.SP math.AP physics.flu-dyn Robotics

Catalog footprint

What is connected

36works

13topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

VT-Bench: A Unified Benchmark for Visual-Tabular Multi-Modal Learning

Multi-model learning has attracted great attention in visual-text tasks. However, visual-tabular data, which plays a pivotal role in high-stakes domains like healthcare and industry, remains underexplored. In this paper, we introduce \textit{VT-Bench}, the first unified benchmark for standardizing vision-tabular discriminative prediction and generative reasoning tasks. VT-Bench aggregates 14 datasets across 9 domains (medical-centric, while covering pets, media, and transportation) with over 756K samples. We evaluate 23 representative models, including unimodal experts, specialized visual-tabular models, general-purpose vision-language models (VLMs), and tool-augmented methods, highlighting substantial challenges of visual-tabular learning. We believe VT-Bench will stimulate the community to build more powerful multi-modal vision-tabular foundation models. Benchmark: https://github.com/Ziyi-Jia990/VT-Bench

preprint2023arXiv

HiFlash: Communication-Efficient Hierarchical Federated Learning with Adaptive Staleness Control and Heterogeneity-aware Client-Edge Association

Federated learning (FL) is a promising paradigm that enables collaboratively learning a shared model across massive clients while keeping the training data locally. However, for many existing FL systems, clients need to frequently exchange model parameters of large data size with the remote cloud server directly via wide-area networks (WAN), leading to significant communication overhead and long transmission time. To mitigate the communication bottleneck, we resort to the hierarchical federated learning paradigm of HiFL, which reaps the benefits of mobile edge computing and combines synchronous client-edge model aggregation and asynchronous edge-cloud model aggregation together to greatly reduce the traffic volumes of WAN transmissions. Specifically, we first analyze the convergence bound of HiFL theoretically and identify the key controllable factors for model performance improvement. We then advocate an enhanced design of HiFlash by innovatively integrating deep reinforcement learning based adaptive staleness control and heterogeneity-aware client-edge association strategy to boost the system efficiency and mitigate the staleness effect without compromising model accuracy. Extensive experiments corroborate the superior performance of HiFlash in model accuracy, communication reduction, and system efficiency.

preprint2022arXiv

Drag force on spherical particles trapped at a liquid interface

The dynamics of particles attached to an interface separating two immiscible fluids are encountered in a wide variety of applications. Here we present a combined asymptotic and numerical investigation of the fluid motion past spherical particles attached to a deformable interface undergoing uniform creeping flows in the limit of small Capillary number and small deviation of the contact angle from 90 degrees. Under the assumption of a constant three-phase contact angle, we calculate the interfacial deformation around an isolated particle and a particle pair. Applying the Lorentz reciprocal theorem to the zeroth-order approximation corresponding to spherical particles at a flat interface and the first correction in Capillary number and correction contact angle allows us to obtain explicit analytical expressions for the hydrodynamic drag in terms of the zeroth-order approximations and the correction deformations. The drag coefficients are computed as a function of the three-phase contact angle, the viscosity ratio of the two fluids, the Bond number, and the separation distance between the particles. In addition, the capillary force acting on the particles due to the interfacial deformation is calculated.

preprint2022arXiv

Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping

With the wide penetration of smart robots in multifarious fields, Simultaneous Localization and Mapping (SLAM) technique in robotics has attracted growing attention in the community. Yet collaborating SLAM over multiple robots still remains challenging due to performance contradiction between the intensive graphics computation of SLAM and the limited computing capability of robots. While traditional solutions resort to the powerful cloud servers acting as an external computation provider, we show by real-world measurements that the significant communication overhead in data offloading prevents its practicability to real deployment. To tackle these challenges, this paper promotes the emerging edge computing paradigm into multi-robot SLAM and proposes RecSLAM, a multi-robot laser SLAM system that focuses on accelerating map construction process under the robot-edge-cloud architecture. In contrast to conventional multi-robot SLAM that generates graphic maps on robots and completely merges them on the cloud, RecSLAM develops a hierarchical map fusion technique that directs robots' raw data to edge servers for real-time fusion and then sends to the cloud for global merging. To optimize the overall pipeline, an efficient multi-robot SLAM collaborative processing framework is introduced to adaptively optimize robot-to-edge offloading tailored to heterogeneous edge resource conditions, meanwhile ensuring the workload balancing among the edge servers. Extensive evaluations show RecSLAM can achieve up to 39% processing latency reduction over the state-of-the-art. Besides, a proof-of-concept prototype is developed and deployed in real scenes to demonstrate its effectiveness.

preprint2022arXiv

Identification of potential in diffusion equations from terminal observation: analysis and discrete approximation

The aim of this paper is to study the recovery of a spatially dependent potential in a (sub)diffusion equation from overposed final time data. We construct a monotone operator one of whose fixed points is the unknown potential. The uniqueness of the identification is theoretically verified by using the monotonicity of the operator and a fixed point argument. Moreover, we show a conditional stability in Hilbert spaces under some suitable conditions on the problem data. Next, a completely discrete scheme is developed, by using Galerkin finite element method in space and finite difference method in time, and then a fixed point iteration is applied to reconstruct the potential. We prove the linear convergence of the iterative algorithm by the contraction mapping theorem, and present a thorough error analysis for the reconstructed potential. Our derived \textsl{a priori} error estimate provides a guideline to choose discretization parameters according to the noise level. The analysis relies heavily on some suitable nonstandard error estimates for the direct problem as well as the aforementioned conditional stability. Numerical experiments are provided to illustrate and complement our theoretical analysis.

preprint2022arXiv

Modified BDF2 schemes for subdiffusion models with a singular source term

The aim of this paper is to study the time stepping scheme for approximately solving the subdiffusion equation with a weakly singular source term. In this case, many popular time stepping schemes, including the correction of high-order BDF methods, may lose their high-order accuracy. To fill in this gap, in this paper, we develop a novel time stepping scheme, where the source term is regularized by using a $k$-fold integral-derivative and the equation is discretized by using a modified BDF2 convolution quadrature. We prove that the proposed time stepping scheme is second-order, even if the source term is nonsmooth in time and incompatible with the initial data. Numerical results are presented to support the theoretical results.

preprint2021arXiv

Arbitrarily High-order Maximum Bound Preserving Schemes with Cut-off Postprocessing for Allen-Cahn Equations

We develop and analyze a class of maximum bound preserving schemes for approximately solving Allen--Cahn equations. We apply a $k$th-order single-step scheme in time (where the nonlinear term is linearized by multi-step extrapolation), and a lumped mass finite element method in space with piecewise $r$th-order polynomials and Gauss--Lobatto quadrature. At each time level, a cut-off post-processing is proposed to eliminate extra values violating the maximum bound principle at the finite element nodal points. As a result, the numerical solution satisfies the maximum bound principle (at all nodal points), and the optimal error bound $O(τ^k+h^{r+1})$ is theoretically proved for a certain class of schemes. These time stepping schemes under consideration includes algebraically stable collocation-type methods, which could be arbitrarily high-order in both space and time. Moreover, combining the cut-off strategy with the scalar auxiliary value (SAV) technique, we develop a class of energy-stable and maximum bound preserving schemes, which is arbitrarily high-order in time. Numerical results are provided to illustrate the accuracy of the proposed method.

preprint2021arXiv

Deep Reinforcement Learning with Spatio-temporal Traffic Forecasting for Data-Driven Base Station Sleep Control

To meet the ever increasing mobile traffic demand in 5G era, base stations (BSs) have been densely deployed in radio access networks (RANs) to increase the network coverage and capacity. However, as the high density of BSs is designed to accommodate peak traffic, it would consume an unnecessarily large amount of energy if BSs are on during off-peak time. To save the energy consumption of cellular networks, an effective way is to deactivate some idle base stations that do not serve any traffic demand. In this paper, we develop a traffic-aware dynamic BS sleep control framework, named DeepBSC, which presents a novel data-driven learning approach to determine the BS active/sleep modes while meeting lower energy consumption and satisfactory Quality of Service (QoS) requirements. Specifically, the traffic demands are predicted by the proposed GS-STN model, which leverages the geographical and semantic spatial-temporal correlations of mobile traffic. With accurate mobile traffic forecasting, the BS sleep control problem is cast as a Markov Decision Process that is solved by Actor-Critic reinforcement learning methods. To reduce the variance of cost estimation in the dynamic environment, we propose a benchmark transformation method that provides robust performance indicator for policy update. To expedite the training process, we adopt a Deep Deterministic Policy Gradient (DDPG) approach, together with an explorer network, which can strengthen the exploration further. Extensive experiments with a real-world dataset corroborate that our proposed framework significantly outperforms the existing methods.

preprint2021arXiv

Numerical Estimation of a Diffusion Coefficient in Subdiffusion

In this work, we consider the numerical recovery of a spatially dependent diffusion coefficient in a subdiffusion model from distributed observations. The subdiffusion model involves a Caputo fractional derivative of order $α\in(0,1)$ in time. The numerical estimation is based on the regularized output least-squares formulation, with an $H^1(Ω)$ penalty. We prove the well-posedness of the continuous formulation, e.g., existence and stability. Next, we develop a fully discrete scheme based on the Galerkin finite element method in space and backward Euler convolution quadrature in time. We prove the subsequential convergence of the sequence of discrete solutions to a solution of the continuous problem as the discretization parameters (mesh size and time step size) tend to zero. Further, under an additional regularity condition on the exact coefficient, we derive convergence rates in a weighted $L^2(Ω)$ norm for the discrete approximations to the exact coefficient {in the one- and two-dimensional cases}. The analysis relies heavily on suitable nonstandard nonsmooth data error estimates for the direct problem. We provide illustrative numerical results to support the theoretical study.

preprint2020arXiv

Age of Processing: Age-driven Status Sampling and Processing Offloading for Edge Computing-enabled Real-time IoT Applications

The freshness of status information is of great importance for time-critical Internet of Things (IoT) applications. A metric measuring status freshness is the age-of-information (AoI), which captures the time elapsed from the status being generated at the source node (e.g., a sensor) to the latest status update.However, in intelligent IoT applications such as video surveillance, the status information is revealed after some computation intensive and time-consuming data processing operations, which would affect the status freshness. In this paper, we propose a novel metric, age-of-processing (AoP), to quantify such status freshness, which captures the time elapsed of the newest received processed status data since it is generated. Compared with AoI, AoP further takes the data processing time into account. Since an IoT device has limited computation and energy resource, the device can choose to offload the data processing to the nearby edge server under constrained status sampling frequency.We aim to minimize the average AoP in a long-term process by jointly optimizing the status sampling frequency and processing offloading policy. We formulate this online problem as an infinite-horizon constrained Markov decision process (CMDP) with average reward criterion. We then transform the CMDP problem into an unconstrained Markov decision process (MDP) by leveraging a Lagrangian method, and propose a Lagrangian transformation framework for the original CMDP problem. Furthermore, we integrate the framework with perturbation based refinement for achieving the optimal policy of the CMDP problem. Extensive numerical evaluations show that the proposed algorithm outperforms the benchmarks, with an average AoP reduction up to 30%.

preprint2020arXiv

An Inverse Potential Problem for Subdiffusion: Stability and Reconstruction

In this work, we study the inverse problem of recovering a potential coefficient in the subdiffusion model, which involves a Djrbashian-Caputo derivative of order $α\in(0,1)$ in time, from the terminal data. We prove that the inverse problem is locally Lipschitz for small terminal time, under certain conditions on the initial data. This result extends the result in Choulli and Yamamoto (1997) for the standard parabolic case to the fractional case. The analysis relies on refined properties of two-parameter Mittag-Leffler functions, e.g., complete monotonicity and asymptotics. Further, we develop an efficient and easy-to-implement algorithm for numerically recovering the coefficient based on (preconditioned) fixed point iteration and Anderson acceleration. The efficiency and accuracy of the algorithm is illustrated with several numerical examples.

preprint2020arXiv

DeepCP: Deep Learning Driven Cascade Prediction Based Autonomous Content Placement in Closed Social Network

Online social networks (OSNs) are emerging as the most popular mainstream platform for content cascade diffusion. In order to provide satisfactory quality of experience (QoE) for users in OSNs, much research dedicates to proactive content placement by using the propagation pattern, user's personal profiles and social relationships in open social network scenarios (e.g., Twitter and Weibo). In this paper, we take a new direction of popularity-aware content placement in a closed social network (e.g., WeChat Moment) where user's privacy is highly enhanced. We propose a novel data-driven holistic deep learning framework, namely DeepCP, for joint diffusion-aware cascade prediction and autonomous content placement without utilizing users' personal and social information. We first devise a time-window LSTM model for content popularity prediction and cascade geo-distribution estimation. Accordingly, we further propose a novel autonomous content placement mechanism CP-GAN which adopts the generative adversarial network (GAN) for agile placement decision making to reduce the content access latency and enhance users' QoE. We conduct extensive experiments using cascade diffusion traces in WeChat Moment (WM). Evaluation results corroborate that the proposed DeepCP framework can predict the content popularity with a high accuracy, generate efficient placement decision in a real-time manner, and achieve significant content access latency reduction over existing schemes.

preprint2020arXiv

HFEL: Joint Edge Association and Resource Allocation for Cost-Efficient Hierarchical Federated Edge Learning

Federated Learning (FL) has been proposed as an appealing approach to handle data privacy issue of mobile devices compared to conventional machine learning at the remote cloud with raw user data uploading. By leveraging edge servers as intermediaries to perform partial model aggregation in proximity and relieve core network transmission overhead, it enables great potentials in low-latency and energy-efficient FL. Hence we introduce a novel Hierarchical Federated Edge Learning (HFEL) framework in which model aggregation is partially migrated to edge servers from the cloud. We further formulate a joint computation and communication resource allocation and edge association problem for device users under HFEL framework to achieve global cost minimization. To solve the problem, we propose an efficient resource scheduling algorithm in the HFEL framework. It can be decomposed into two subproblems: \emph{resource allocation} given a scheduled set of devices for each edge server and \emph{edge association} of device users across all the edge servers. With the optimal policy of the convex resource allocation subproblem for a set of devices under a single edge server, an efficient edge association strategy can be achieved through iterative global cost reduction adjustment process, which is shown to converge to a stable system point. Extensive performance evaluations demonstrate that our HFEL framework outperforms the proposed benchmarks in global cost saving and achieves better training performance compared to conventional federated learning.

preprint2020arXiv

HierTrain: Fast Hierarchical Edge AI Learning with Hybrid Parallelism in Mobile-Edge-Cloud Computing

Nowadays, deep neural networks (DNNs) are the core enablers for many emerging edge AI applications. Conventional approaches to training DNNs are generally implemented at central servers or cloud centers for centralized learning, which is typically time-consuming and resource-demanding due to the transmission of a large amount of data samples from the device to the remote cloud. To overcome these disadvantages, we consider accelerating the learning process of DNNs on the Mobile-Edge-Cloud Computing (MECC) paradigm. In this paper, we propose HierTrain, a hierarchical edge AI learning framework, which efficiently deploys the DNN training task over the hierarchical MECC architecture. We develop a novel \textit{hybrid parallelism} method, which is the key to HierTrain, to adaptively assign the DNN model layers and the data samples across the three levels of edge device, edge server and cloud center. We then formulate the problem of scheduling the DNN training tasks at both layer-granularity and sample-granularity. Solving this optimization problem enables us to achieve the minimum training time. We further implement a hardware prototype consisting of an edge device, an edge server and a cloud server, and conduct extensive experiments on it. Experimental results demonstrate that HierTrain can achieve up to 6.9x speedup compared to the cloud-based hierarchical training approach.

preprint2020arXiv

High-order Time Stepping Schemes for Semilinear Subdiffusion Equations

The aim of this paper is to develop and analyze high-order time stepping schemes for solving semilinear subdiffusion equations. We apply the $k$-step BDF convolution quadrature to discretize the time-fractional derivative with order $α\in (0,1)$, and modify the starting steps in order to achieve optimal convergence rate. This method has already been well-studied for the linear fractional evolution equations in Jin, Li and Zhou \cite{JinLiZhou:correction}, while the numerical analysis for the nonlinear problem is still missing in the literature. By splitting the nonlinear potential term into an irregular linear part and a smoother nonlinear part, and using the generating function technique, we prove that the convergence order of the corrected BDF$k$ scheme is $O(τ^{\min(k,1+2α-ε)})$, without imposing further assumption on the regularity of the solution. Numerical examples are provided to support our theoretical results.

preprint2020arXiv

Incomplete Iterative Solution of the Subdiffusion Problem

In this work, we develop an efficient incomplete iterative scheme for the numerical solution of the subdiffusion model involving a Caputo derivative of order $α\in(0,1)$ in time. It is based on piecewise linear Galerkin finite element method in space and backward Euler convolution quadrature in time and solves one linear algebraic system inexactly by an iterative algorithm at each time step. We present theoretical results for both smooth and nonsmooth solutions, using novel weighted estimates of the time-stepping scheme. The analysis indicates that with the number of iterations at each time level chosen properly, the error estimates are nearly identical with that for the exact linear solver, and the theoretical findings provide guidelines on the choice. Illustrative numerical results are presented to complement the theoretical analysis.

preprint2020arXiv

Leveraging the Power of Prediction: Predictive Service Placement for Latency-Sensitive Mobile Edge Computing

Mobile edge computing (MEC) is emerging to support delay-sensitive 5G applications at the edge of mobile networks. When a user moves erratically among multiple MEC nodes, the challenge of how to dynamically migrate its service to maintain service performance (i.e., user-perceived latency) arises. However, frequent service migration can significantly increase operational cost, incurring the conflict between improving performance and reducing cost. To address these mis-aligned objectives, this paper studies the performance optimization of mobile edge service placement under the constraint of long-term cost budget. It is challenging because the budget involves the future uncertain information (e.g., user mobility). To overcome this difficulty, we devote to leveraging the power of prediction and advocate predictive service placement with predicted near-future information. By using two-timescale Lyapunov optimization method, we propose a T-slot predictive service placement (PSP) algorithm to incorporate the prediction of user mobility based on a frame-based design. We characterize the performance bounds of PSP in terms of cost-delay trade-off theoretically. Furthermore, we propose a new weight adjustment scheme for the queue in each frame named PSP-WU to exploit the historical queue information, which greatly reduces the length of queue while improving the quality of user-perceived latency. Rigorous theoretical analysis and extensive evaluations using realistic data traces demonstrate the superior performance of the proposed predictive schemes.

preprint2020arXiv

Nonlocal-in-time dynamics and crossover of diffusive regimes

We study a simple nonlocal-in-time dynamic system proposed for the effective modeling of complex diffusive regimes in heterogeneous media. We present its solutions and their commonly studied statistics such as the mean square distance. This interesting model employs a nonlocal operator to replace the conventional first-order time-derivative. It introduces a finite memory effect of a constant length encoded through a kernel function. The nonlocal-in-time operator is related to fractional time derivatives that rely on the entire time-history on one hand, while reduces to, on the other hand, the classical time derivative if the length of the memory window diminishes. This allows us to demonstrate the effectiveness of the nonlocal-in-time model in capturing the crossover widely observed in nature between the initial sub-diffusion and the long time normal diffusion.

preprint2020arXiv

Subdiffusion with Time-Dependent Coefficients: Improved Regularity and Second-Order Time Stepping

This article concerns second-order time discretization of subdiffusion equations with time-dependent diffusion coefficients. High-order differentiability and regularity estimates are established for subdiffusion equations with time-dependent coefficients. Using these regularity results and a perturbation argument of freezing the diffusion coefficient, we prove that the convolution quadrature generated by the second-order backward differentiation formula, with proper correction at the first time step, can achieve second-order convergence for both nonsmooth initial data and incompatible source term. Numerical experiments are consistent with the theoretical results.

preprint2020arXiv

The energy technique for the six-step BDF method

In combination with the Grenander--Szegö theorem, we observe that a relaxed positivity condition on multipliers, milder than the basic %fundamental requirement of the Nevanlinna--Odeh multipliers that the sum of the absolute values of their components is strictly less than $1$, makes the energy technique applicable to the stability analysis of BDF methods for parabolic equations with selfadjoint elliptic part. This is particularly useful for the six-step BDF method for which no Nevanlinna--Odeh multiplier exists. We introduce multipliers satisfying the positivity property for the six-step BDF method and establish stability of the method for parabolic equations.

preprint2020arXiv

When Deep Reinforcement Learning Meets Federated Learning: Intelligent Multi-Timescale Resource Management for Multi-access Edge Computing in 5G Ultra Dense Network

Ultra-dense edge computing (UDEC) has great potential, especially in the 5G era, but it still faces challenges in its current solutions, such as the lack of: i) efficient utilization of multiple 5G resources (e.g., computation, communication, storage and service resources); ii) low overhead offloading decision making and resource allocation strategies; and iii) privacy and security protection schemes. Thus, we first propose an intelligent ultra-dense edge computing (I-UDEC) framework, which integrates blockchain and Artificial Intelligence (AI) into 5G ultra-dense edge computing networks. First, we show the architecture of the framework. Then, in order to achieve real-time and low overhead computation offloading decisions and resource allocation strategies, we design a novel two-timescale deep reinforcement learning (\textit{2Ts-DRL}) approach, consisting of a fast-timescale and a slow-timescale learning process, respectively. The primary objective is to minimize the total offloading delay and network resource usage by jointly optimizing computation offloading, resource allocation and service caching placement. We also leverage federated learning (FL) to train the \textit{2Ts-DRL} model in a distributed manner, aiming to protect the edge devices' data privacy. Simulation results corroborate the effectiveness of both the \textit{2Ts-DRL} and FL in the I-UDEC framework and prove that our proposed algorithm can reduce task execution time up to 31.87%.

preprint2016arXiv

An Analysis of Galerkin Proper Orthogonal Decomposition for Subdiffusion

In this work, we develop a novel Galerkin-L1-POD scheme for the subdiffusion model with a Caputo fractional derivative of order $α\in (0,1)$ in time, which is often used to describe anomalous diffusion processes in heterogeneous media. The nonlocality of the fractional derivative requires storing all the solutions from time zero. The proposed scheme is based on continuous piecewise linear finite elements, L1 time stepping, and proper orthogonal decomposition (POD). By constructing an effective reduced-order scheme using problem-adapted basis functions, it can significantly reduce the computational complexity and storage requirement. We shall provide a complete error analysis of the scheme under realistic regularity assumptions by means of a novel energy argument. Extensive numerical experiments are presented to verify the convergence analysis and the efficiency of the proposed scheme.

preprint2016arXiv

Evaluating Stochastic Methods in Power System Operations with Wind Power

Wind power is playing an increasingly important role in electricity markets. However, it's inherent variability and uncertainty cause operational challenges and costs as more operating reserves are needed to maintain system reliability. Several operational strategies have been proposed to address these challenges, including advanced probabilistic wind forecasting techniques, dynamic operating reserves, and various unit commitment (UC) and economic dispatch (ED) strategies under uncertainty. This paper presents a consistent framework to evaluate different operational strategies in power system operations with renewable energy. We use conditional Kernel Density Estimation (KDE) for probabilistic wind power forecasting. Forecast scenarios are generated considering spatio-temporal correlations, and further reduced to lower the computational burden. Scenario-based stochastic programming with different decomposition techniques and interval optimization are tested to examine economic, reliability, and computational performance compared to deterministic UC/ED benchmarks. We present numerical results for a modified IEEE-118 bus system with realistic system load and wind data.

preprint2015arXiv

A Finite Element Method With Singularity Reconstruction for Fractional Boundary Value Problems

We consider a two-point boundary value problem involving a Riemann-Liouville fractional derivative of order $\al\in (1,2)$ in the leading term on the unit interval $(0,1)$. Generally the standard Galerkin finite element method can only give a low-order convergence even if the source term is very smooth due to the presence of the singularity term $x^{\al-1}$ in the solution representation. In order to enhance the convergence, we develop a simple singularity reconstruction strategy by splitting the solution into a singular part and a regular part, where the former captures explicitly the singularity. We derive a new variational formulation for the regular part, and establish that the Galerkin approximation of the regular part can achieve a better convergence order in the $L^2(0,1)$, $H^{\al/2}(0,1)$ and $L^\infty(0,1)$-norms than the standard Galerkin approach, with a convergence rate for the recovered singularity strength identical with the $L^2(0,1)$ error estimate. The reconstruction approach is very flexible in handling explicit singularity, and it is further extended to the case of a Neumann type boundary condition on the left end point, which involves a strong singularity $x^{\al-2}$. Extensive numerical results confirm the theoretical study and efficiency of the proposed approach.

preprint2015arXiv

A Petrov-Galerkin Finite Element Method for Fractional Convection-Diffusion Equations

In this work, we develop variational formulations of Petrov-Galerkin type for one-dimensional fractional boundary value problems involving either a Riemann-Liouville or Caputo derivative of order $α\in(3/2, 2)$ in the leading term and both convection and potential terms. They arise in the mathematical modeling of asymmetric super-diffusion processes in heterogeneous media. The well-posedness of the formulations and sharp regularity pickup of the variational solutions are established. A novel finite element method is developed, which employs continuous piecewise linear finite elements and "shifted" fractional powers for the trial and test space, respectively. The new approach has a number of distinct features: It allows deriving optimal error estimates in both $L^2(D)$ and $H^1(D)$ norms; and on a uniform mesh, the stiffness matrix of the leading term is diagonal and the resulting linear system is well conditioned. Further, in the Riemann-Liouville case, an enriched FEM is proposed to improve the convergence. Extensive numerical results are presented to verify the theoretical analysis and robustness of the numerical scheme.

preprint2015arXiv

A simple finite element method for the boundary value problem with a Riemann-Liouville derivative

We consider a boundary value problem involving a Riemann-Liouville fractional derivative of order $α\in (3/2,2)$ on the unit interval $(0,1)$. The standard Galerkin finite element approximation converges slowly due to the presence of singularity term $x^{α-1}$ in the solution representation. In this work, we develop a simple technique, by transforming it into a second-order two-point boundary value problem with nonlocal low order terms, whose solution can reconstruct directly the solution to the original problem. The stability of the variational formulation, and the optimal regularity pickup of the solution are analyzed. A novel Galerkin finite element method with piecewise linear or quadratic finite elements is developed, and $L^2(D)$ error estimates are provided. The approach is then applied to the corresponding fractional Sturm-Liouville problem, and error estimates of the eigenvalue approximations are given. Extensive numerical results fully confirm our theoretical study.

preprint2015arXiv

An analysis of the L1 Scheme for the subdiffusion equation with nonsmooth data

The subdiffusion equation with a Caputo fractional derivative of order $α\in(0,1)$ in time arises in a wide variety of practical applications, and it is often adopted to model anomalous subdiffusion processes in heterogeneous media. The L1 scheme is one of the most popular and successful numerical methods for discretizing the Caputo fractional derivative in time. The scheme was analyzed earlier independently by Lin and Xu (2007) and Sun and Wu (2006), and an $O(τ^{2-α})$ convergence rate was established, under the assumption that the solution is twice continuously differentiable in time. However, in view of the smoothing property of the subdiffusion equation, this regularity condition is restrictive, since it does not hold even for the homogeneous problem with a smooth initial data. In this work, we revisit the error analysis of the scheme, and establish an $O(τ)$ convergence rate for both smooth and nonsmooth initial data. The analysis is valid for more general sectorial operators. In particular, the L1 scheme is applied to one-dimensional space-time fractional diffusion equations, which involves also a Riemann-Liouville derivative of order $β\in(3/2,2)$ in space, and error estimates are provided for the fully discrete scheme. Numerical experiments are provided to verify the sharpness of the error estimates, and robustness of the scheme with respect to data regularity.

preprint2015arXiv

An Analysis of the Rayleigh-Stokes problem for a Generalized Second-Grade Fluid

We study the Rayleigh-Stokes problem for a generalized second-grade fluid which involves a Riemann-Liouville fractional derivative in time, and present an analysis of the problem in the continuous, space semidiscrete and fully discrete formulations. We establish the Sobolev regularity of the homogeneous problem for both smooth and nonsmooth initial data $v$, including $v\in L^2(Ω)$. A space semidiscrete Galerkin scheme using continuous piecewise linear finite elements is developed, and optimal with respect to initial data regularity error estimates for the finite element approximations are derived. Further, two fully discrete schemes based on the backward Euler method and second-order backward difference method and the related convolution quadrature are developed, and optimal error estimates are derived for the fully discrete approximations for both smooth and nonsmooth initial data. Numerical results for one- and two-dimensional examples with smooth and nonsmooth initial data are presented to illustrate the efficiency of the method, and to verify the convergence theory.

preprint2015arXiv

Error Estimates for Approximations of Distributed Order Time Fractional Diffusion with Nonsmooth Data

In this work, we consider the numerical solution of an initial boundary value problem for the distributed order time fractional diffusion equation. The model arises in the mathematical modeling of ultra-slow diffusion processes observed in some physical problems, whose solution decays only logarithmically as the time $t$ tends to infinity. We develop a space semidiscrete scheme based on the standard Galerkin finite element method, and establish error estimates optimal with respect to data regularity in $L^2(D)$ and $H^1(D)$ norms for both smooth and nonsmooth initial data. Further, we propose two fully discrete schemes, based on the Laplace transform and convolution quadrature generated by the backward Euler method, respectively, and provide optimal convergence rates in the $L^2(D)$ norm, which exhibits exponential convergence and first-order convergence in time, respectively. Extensive numerical experiments are provided to verify the error estimates for both smooth and nonsmooth initial data, and to examine the asymptotic behavior of the solution.

preprint2015arXiv

On Nonnegativity Preservation in Finite Element Methods for Subdiffusion Equations

We consider three types of subdiffusion models, namely single-term, multi-term and distributed order fractional diffusion equations, for which the maximum-principle holds and which, in particular, preserve nonnegativity. Hence the solution is nonnegative for nonnegative initial data. Following earlier work on the heat equation, our purpose is to study whether this property is inherited by certain spatially semidiscrete and fully discrete piecewise linear finite element methods, including the standard Galerkin method, the lumped mass method and the finite volume element method. It is shown that, as for the heat equation, when the mass matrix is nondiagonal, nonnegativity is not preserved for small time or time-step, but may reappear after a positivity threshold. For the lumped mass method nonnegativity is preserved if and only if the triangulation in the finite element space is of Delaunay type. Numerical experiments illustrate and complement the theoretical results.

preprint2015arXiv

Two Schemes for Fractional Diffusion and Diffusion-Wave Equations with Nonsmooth Data

We consider the initial/boundary value problem for the fractional diffusion and diffusion-wave equations involving a Caputo fractional derivative in time. We develop two "simple" fully discrete schemes based on the Galerkin finite element method in space and convolution quadrature in time with the generating function given by the implicit backward Euler method/second-order backward difference method, and establish error estimates optimal with respect to the regularity of the initial data. These two schemes are first and second-order accurate in time for nonsmooth initial data. Extensive numerical experiments for one and two-dimensional problems confirm the convergence analysis. A detailed comparison with several popular time stepping schemes is also performed. The numerical results indicate that the proposed fully discrete schemes are accurate and robust for nonsmooth data, and competitive with existing schemes.

preprint2014arXiv

The Galerkin Finite Element Method for A Multi-term Time-Fractional Diffusion equation

We consider the initial/boundary value problem for a diffusion equation involving multiple time-fractional derivatives on a bounded convex polyhedral domain. We analyze a space semidiscrete scheme based on the standard Galerkin finite element method using continuous piecewise linear functions. Nearly optimal error estimates for both cases of initial data and inhomogeneous term are derived, which cover both smooth and nonsmooth data. Further we develop a fully discrete scheme based on a finite difference discretization of the time-fractional derivatives, and discuss its stability and error estimate. Extensive numerical experiments for one and two-dimension problems confirm the convergence rates of the theoretical results.

preprint2013arXiv

Error Analysis of Finite Element Methods for Space-Fractional Parabolic Equations

We consider an initial/boundary value problem for one-dimensional fractional-order parabolic equations with a space fractional derivative of Riemann-Liouville type and order $α\in (1,2)$. We study a spatial semidiscrete scheme with the standard Galerkin finite element method with piecewise linear finite elements, as well as fully discrete schemes based on the backward Euler method and Crank-Nicolson method. Error estimates in the $L^2\II$- and $H^{α/2}\II$-norm are derived for the semidiscrete scheme, and in the $L^2\II$-norm for the fully discrete schemes. These estimates are for both smooth and nonsmooth initial data, and are expressed directly in terms of the smoothness of the initial data. Extensive numerical results are presented to illustrate the theoretical results.

preprint2013arXiv

Error Analysis of Semidiscrete Finite Element Methods for Inhomogeneous Time-Fractional Diffusion

We consider the initial boundary value problem for the inhomogeneous time-fractional diffusion equation with a homogeneous Dirichlet boundary condition and a nonsmooth right hand side data in a bounded convex polyhedral domain. We analyze two semidiscrete schemes based on the standard Galerkin and lumped mass finite element methods. Almost optimal error estimates are obtained for right hand side data $f(x,t)\in L^\infty(0,T;\dot H^q(Ω))$, $-1< q \le 1$, for both semidiscrete schemes. For lumped mass method, the optimal $L^2(Ω)$-norm error estimate requires symmetric meshes. Finally, numerical experiments for one- and two-dimensional examples are presented to verify our theoretical results.

preprint2013arXiv

Galerkin FEM for fractional order parabolic equations with initial data in $H^{-s},~0 < s \le 1$

We investigate semi-discrete numerical schemes based on the standard Galerkin and lumped mass Galerkin finite element methods for an initial-boundary value problem for homogeneous fractional diffusion problems with non-smooth initial data. We assume that $Ω\subset \mathbb{R}^d$, $d=1,2,3$ is a convex polygonal (polyhedral) domain. We theoretically justify optimal order error estimates in $L_2$- and $H^1$-norms for initial data in $H^{-s}(Ω),~0\le s \le 1$. We confirm our theoretical findings with a number of numerical tests that include initial data $v$ being a Dirac $δ$-function supported on a $(d-1)$-dimensional manifold.

preprint2012arXiv

Error estimates for a semidiscrete finite element method for fractional order parabolic equations

We consider the initial boundary value problem for the homogeneous time-fractional diffusion equation $\partial^α_t u - \De u =0$ ($0< α< 1$) with initial condition $u(x,0)=v(x)$ and a homogeneous Dirichlet boundary condition in a bounded polygonal domain $Ω$. We shall study two semidiscrete approximation schemes, i.e., Galerkin FEM and lumped mass Galerkin FEM, by using piecewise linear functions. We establish optimal with respect to the regularity of the solution error estimates, including the case of nonsmooth initial data, i.e., $v \in L_2(Ω)$.

Zhi Zhou

What is connected

Connect this record

See the researcher in context

Building this map preview

36 published item(s)

VT-Bench: A Unified Benchmark for Visual-Tabular Multi-Modal Learning

HiFlash: Communication-Efficient Hierarchical Federated Learning with Adaptive Staleness Control and Heterogeneity-aware Client-Edge Association

Drag force on spherical particles trapped at a liquid interface

Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping

Identification of potential in diffusion equations from terminal observation: analysis and discrete approximation

Modified BDF2 schemes for subdiffusion models with a singular source term

Arbitrarily High-order Maximum Bound Preserving Schemes with Cut-off Postprocessing for Allen-Cahn Equations

Deep Reinforcement Learning with Spatio-temporal Traffic Forecasting for Data-Driven Base Station Sleep Control

Numerical Estimation of a Diffusion Coefficient in Subdiffusion

Age of Processing: Age-driven Status Sampling and Processing Offloading for Edge Computing-enabled Real-time IoT Applications

An Inverse Potential Problem for Subdiffusion: Stability and Reconstruction

DeepCP: Deep Learning Driven Cascade Prediction Based Autonomous Content Placement in Closed Social Network

HFEL: Joint Edge Association and Resource Allocation for Cost-Efficient Hierarchical Federated Edge Learning

HierTrain: Fast Hierarchical Edge AI Learning with Hybrid Parallelism in Mobile-Edge-Cloud Computing

High-order Time Stepping Schemes for Semilinear Subdiffusion Equations

Incomplete Iterative Solution of the Subdiffusion Problem

Leveraging the Power of Prediction: Predictive Service Placement for Latency-Sensitive Mobile Edge Computing

Nonlocal-in-time dynamics and crossover of diffusive regimes

Subdiffusion with Time-Dependent Coefficients: Improved Regularity and Second-Order Time Stepping

The energy technique for the six-step BDF method

When Deep Reinforcement Learning Meets Federated Learning: Intelligent Multi-Timescale Resource Management for Multi-access Edge Computing in 5G Ultra Dense Network

An Analysis of Galerkin Proper Orthogonal Decomposition for Subdiffusion

Evaluating Stochastic Methods in Power System Operations with Wind Power

A Finite Element Method With Singularity Reconstruction for Fractional Boundary Value Problems

A Petrov-Galerkin Finite Element Method for Fractional Convection-Diffusion Equations

A simple finite element method for the boundary value problem with a Riemann-Liouville derivative

An analysis of the L1 Scheme for the subdiffusion equation with nonsmooth data

An Analysis of the Rayleigh-Stokes problem for a Generalized Second-Grade Fluid

Error Estimates for Approximations of Distributed Order Time Fractional Diffusion with Nonsmooth Data

On Nonnegativity Preservation in Finite Element Methods for Subdiffusion Equations

Two Schemes for Fractional Diffusion and Diffusion-Wave Equations with Nonsmooth Data

The Galerkin Finite Element Method for A Multi-term Time-Fractional Diffusion equation

Error Analysis of Finite Element Methods for Space-Fractional Parabolic Equations

Error Analysis of Semidiscrete Finite Element Methods for Inhomogeneous Time-Fractional Diffusion

Galerkin FEM for fractional order parabolic equations with initial data in $H^{-s},~0 < s \le 1$

Error estimates for a semidiscrete finite element method for fractional order parabolic equations