Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
21works
0followers
13topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

21 published item(s)

preprint2026arXiv

VT-Bench: A Unified Benchmark for Visual-Tabular Multi-Modal Learning

Multi-model learning has attracted great attention in visual-text tasks. However, visual-tabular data, which plays a pivotal role in high-stakes domains like healthcare and industry, remains underexplored. In this paper, we introduce \textit{VT-Bench}, the first unified benchmark for standardizing vision-tabular discriminative prediction and generative reasoning tasks. VT-Bench aggregates 14 datasets across 9 domains (medical-centric, while covering pets, media, and transportation) with over 756K samples. We evaluate 23 representative models, including unimodal experts, specialized visual-tabular models, general-purpose vision-language models (VLMs), and tool-augmented methods, highlighting substantial challenges of visual-tabular learning. We believe VT-Bench will stimulate the community to build more powerful multi-modal vision-tabular foundation models. Benchmark: https://github.com/Ziyi-Jia990/VT-Bench

preprint2023arXiv

HiFlash: Communication-Efficient Hierarchical Federated Learning with Adaptive Staleness Control and Heterogeneity-aware Client-Edge Association

Federated learning (FL) is a promising paradigm that enables collaboratively learning a shared model across massive clients while keeping the training data locally. However, for many existing FL systems, clients need to frequently exchange model parameters of large data size with the remote cloud server directly via wide-area networks (WAN), leading to significant communication overhead and long transmission time. To mitigate the communication bottleneck, we resort to the hierarchical federated learning paradigm of HiFL, which reaps the benefits of mobile edge computing and combines synchronous client-edge model aggregation and asynchronous edge-cloud model aggregation together to greatly reduce the traffic volumes of WAN transmissions. Specifically, we first analyze the convergence bound of HiFL theoretically and identify the key controllable factors for model performance improvement. We then advocate an enhanced design of HiFlash by innovatively integrating deep reinforcement learning based adaptive staleness control and heterogeneity-aware client-edge association strategy to boost the system efficiency and mitigate the staleness effect without compromising model accuracy. Extensive experiments corroborate the superior performance of HiFlash in model accuracy, communication reduction, and system efficiency.

preprint2022arXiv

Drag force on spherical particles trapped at a liquid interface

The dynamics of particles attached to an interface separating two immiscible fluids are encountered in a wide variety of applications. Here we present a combined asymptotic and numerical investigation of the fluid motion past spherical particles attached to a deformable interface undergoing uniform creeping flows in the limit of small Capillary number and small deviation of the contact angle from 90 degrees. Under the assumption of a constant three-phase contact angle, we calculate the interfacial deformation around an isolated particle and a particle pair. Applying the Lorentz reciprocal theorem to the zeroth-order approximation corresponding to spherical particles at a flat interface and the first correction in Capillary number and correction contact angle allows us to obtain explicit analytical expressions for the hydrodynamic drag in terms of the zeroth-order approximations and the correction deformations. The drag coefficients are computed as a function of the three-phase contact angle, the viscosity ratio of the two fluids, the Bond number, and the separation distance between the particles. In addition, the capillary force acting on the particles due to the interfacial deformation is calculated.

preprint2022arXiv

Edge Robotics: Edge-Computing-Accelerated Multi-Robot Simultaneous Localization and Mapping

With the wide penetration of smart robots in multifarious fields, Simultaneous Localization and Mapping (SLAM) technique in robotics has attracted growing attention in the community. Yet collaborating SLAM over multiple robots still remains challenging due to performance contradiction between the intensive graphics computation of SLAM and the limited computing capability of robots. While traditional solutions resort to the powerful cloud servers acting as an external computation provider, we show by real-world measurements that the significant communication overhead in data offloading prevents its practicability to real deployment. To tackle these challenges, this paper promotes the emerging edge computing paradigm into multi-robot SLAM and proposes RecSLAM, a multi-robot laser SLAM system that focuses on accelerating map construction process under the robot-edge-cloud architecture. In contrast to conventional multi-robot SLAM that generates graphic maps on robots and completely merges them on the cloud, RecSLAM develops a hierarchical map fusion technique that directs robots' raw data to edge servers for real-time fusion and then sends to the cloud for global merging. To optimize the overall pipeline, an efficient multi-robot SLAM collaborative processing framework is introduced to adaptively optimize robot-to-edge offloading tailored to heterogeneous edge resource conditions, meanwhile ensuring the workload balancing among the edge servers. Extensive evaluations show RecSLAM can achieve up to 39% processing latency reduction over the state-of-the-art. Besides, a proof-of-concept prototype is developed and deployed in real scenes to demonstrate its effectiveness.

preprint2022arXiv

Identification of potential in diffusion equations from terminal observation: analysis and discrete approximation

The aim of this paper is to study the recovery of a spatially dependent potential in a (sub)diffusion equation from overposed final time data. We construct a monotone operator one of whose fixed points is the unknown potential. The uniqueness of the identification is theoretically verified by using the monotonicity of the operator and a fixed point argument. Moreover, we show a conditional stability in Hilbert spaces under some suitable conditions on the problem data. Next, a completely discrete scheme is developed, by using Galerkin finite element method in space and finite difference method in time, and then a fixed point iteration is applied to reconstruct the potential. We prove the linear convergence of the iterative algorithm by the contraction mapping theorem, and present a thorough error analysis for the reconstructed potential. Our derived \textsl{a priori} error estimate provides a guideline to choose discretization parameters according to the noise level. The analysis relies heavily on some suitable nonstandard error estimates for the direct problem as well as the aforementioned conditional stability. Numerical experiments are provided to illustrate and complement our theoretical analysis.

preprint2022arXiv

Modified BDF2 schemes for subdiffusion models with a singular source term

The aim of this paper is to study the time stepping scheme for approximately solving the subdiffusion equation with a weakly singular source term. In this case, many popular time stepping schemes, including the correction of high-order BDF methods, may lose their high-order accuracy. To fill in this gap, in this paper, we develop a novel time stepping scheme, where the source term is regularized by using a $k$-fold integral-derivative and the equation is discretized by using a modified BDF2 convolution quadrature. We prove that the proposed time stepping scheme is second-order, even if the source term is nonsmooth in time and incompatible with the initial data. Numerical results are presented to support the theoretical results.

preprint2021arXiv

Arbitrarily High-order Maximum Bound Preserving Schemes with Cut-off Postprocessing for Allen-Cahn Equations

We develop and analyze a class of maximum bound preserving schemes for approximately solving Allen--Cahn equations. We apply a $k$th-order single-step scheme in time (where the nonlinear term is linearized by multi-step extrapolation), and a lumped mass finite element method in space with piecewise $r$th-order polynomials and Gauss--Lobatto quadrature. At each time level, a cut-off post-processing is proposed to eliminate extra values violating the maximum bound principle at the finite element nodal points. As a result, the numerical solution satisfies the maximum bound principle (at all nodal points), and the optimal error bound $O(τ^k+h^{r+1})$ is theoretically proved for a certain class of schemes. These time stepping schemes under consideration includes algebraically stable collocation-type methods, which could be arbitrarily high-order in both space and time. Moreover, combining the cut-off strategy with the scalar auxiliary value (SAV) technique, we develop a class of energy-stable and maximum bound preserving schemes, which is arbitrarily high-order in time. Numerical results are provided to illustrate the accuracy of the proposed method.

preprint2021arXiv

Deep Reinforcement Learning with Spatio-temporal Traffic Forecasting for Data-Driven Base Station Sleep Control

To meet the ever increasing mobile traffic demand in 5G era, base stations (BSs) have been densely deployed in radio access networks (RANs) to increase the network coverage and capacity. However, as the high density of BSs is designed to accommodate peak traffic, it would consume an unnecessarily large amount of energy if BSs are on during off-peak time. To save the energy consumption of cellular networks, an effective way is to deactivate some idle base stations that do not serve any traffic demand. In this paper, we develop a traffic-aware dynamic BS sleep control framework, named DeepBSC, which presents a novel data-driven learning approach to determine the BS active/sleep modes while meeting lower energy consumption and satisfactory Quality of Service (QoS) requirements. Specifically, the traffic demands are predicted by the proposed GS-STN model, which leverages the geographical and semantic spatial-temporal correlations of mobile traffic. With accurate mobile traffic forecasting, the BS sleep control problem is cast as a Markov Decision Process that is solved by Actor-Critic reinforcement learning methods. To reduce the variance of cost estimation in the dynamic environment, we propose a benchmark transformation method that provides robust performance indicator for policy update. To expedite the training process, we adopt a Deep Deterministic Policy Gradient (DDPG) approach, together with an explorer network, which can strengthen the exploration further. Extensive experiments with a real-world dataset corroborate that our proposed framework significantly outperforms the existing methods.

preprint2021arXiv

Numerical Estimation of a Diffusion Coefficient in Subdiffusion

In this work, we consider the numerical recovery of a spatially dependent diffusion coefficient in a subdiffusion model from distributed observations. The subdiffusion model involves a Caputo fractional derivative of order $α\in(0,1)$ in time. The numerical estimation is based on the regularized output least-squares formulation, with an $H^1(Ω)$ penalty. We prove the well-posedness of the continuous formulation, e.g., existence and stability. Next, we develop a fully discrete scheme based on the Galerkin finite element method in space and backward Euler convolution quadrature in time. We prove the subsequential convergence of the sequence of discrete solutions to a solution of the continuous problem as the discretization parameters (mesh size and time step size) tend to zero. Further, under an additional regularity condition on the exact coefficient, we derive convergence rates in a weighted $L^2(Ω)$ norm for the discrete approximations to the exact coefficient {in the one- and two-dimensional cases}. The analysis relies heavily on suitable nonstandard nonsmooth data error estimates for the direct problem. We provide illustrative numerical results to support the theoretical study.

preprint2020arXiv

Age of Processing: Age-driven Status Sampling and Processing Offloading for Edge Computing-enabled Real-time IoT Applications

The freshness of status information is of great importance for time-critical Internet of Things (IoT) applications. A metric measuring status freshness is the age-of-information (AoI), which captures the time elapsed from the status being generated at the source node (e.g., a sensor) to the latest status update.However, in intelligent IoT applications such as video surveillance, the status information is revealed after some computation intensive and time-consuming data processing operations, which would affect the status freshness. In this paper, we propose a novel metric, age-of-processing (AoP), to quantify such status freshness, which captures the time elapsed of the newest received processed status data since it is generated. Compared with AoI, AoP further takes the data processing time into account. Since an IoT device has limited computation and energy resource, the device can choose to offload the data processing to the nearby edge server under constrained status sampling frequency.We aim to minimize the average AoP in a long-term process by jointly optimizing the status sampling frequency and processing offloading policy. We formulate this online problem as an infinite-horizon constrained Markov decision process (CMDP) with average reward criterion. We then transform the CMDP problem into an unconstrained Markov decision process (MDP) by leveraging a Lagrangian method, and propose a Lagrangian transformation framework for the original CMDP problem. Furthermore, we integrate the framework with perturbation based refinement for achieving the optimal policy of the CMDP problem. Extensive numerical evaluations show that the proposed algorithm outperforms the benchmarks, with an average AoP reduction up to 30%.

preprint2020arXiv

An Inverse Potential Problem for Subdiffusion: Stability and Reconstruction

In this work, we study the inverse problem of recovering a potential coefficient in the subdiffusion model, which involves a Djrbashian-Caputo derivative of order $α\in(0,1)$ in time, from the terminal data. We prove that the inverse problem is locally Lipschitz for small terminal time, under certain conditions on the initial data. This result extends the result in Choulli and Yamamoto (1997) for the standard parabolic case to the fractional case. The analysis relies on refined properties of two-parameter Mittag-Leffler functions, e.g., complete monotonicity and asymptotics. Further, we develop an efficient and easy-to-implement algorithm for numerically recovering the coefficient based on (preconditioned) fixed point iteration and Anderson acceleration. The efficiency and accuracy of the algorithm is illustrated with several numerical examples.

preprint2020arXiv

DeepCP: Deep Learning Driven Cascade Prediction Based Autonomous Content Placement in Closed Social Network

Online social networks (OSNs) are emerging as the most popular mainstream platform for content cascade diffusion. In order to provide satisfactory quality of experience (QoE) for users in OSNs, much research dedicates to proactive content placement by using the propagation pattern, user's personal profiles and social relationships in open social network scenarios (e.g., Twitter and Weibo). In this paper, we take a new direction of popularity-aware content placement in a closed social network (e.g., WeChat Moment) where user's privacy is highly enhanced. We propose a novel data-driven holistic deep learning framework, namely DeepCP, for joint diffusion-aware cascade prediction and autonomous content placement without utilizing users' personal and social information. We first devise a time-window LSTM model for content popularity prediction and cascade geo-distribution estimation. Accordingly, we further propose a novel autonomous content placement mechanism CP-GAN which adopts the generative adversarial network (GAN) for agile placement decision making to reduce the content access latency and enhance users' QoE. We conduct extensive experiments using cascade diffusion traces in WeChat Moment (WM). Evaluation results corroborate that the proposed DeepCP framework can predict the content popularity with a high accuracy, generate efficient placement decision in a real-time manner, and achieve significant content access latency reduction over existing schemes.

preprint2020arXiv

HFEL: Joint Edge Association and Resource Allocation for Cost-Efficient Hierarchical Federated Edge Learning

Federated Learning (FL) has been proposed as an appealing approach to handle data privacy issue of mobile devices compared to conventional machine learning at the remote cloud with raw user data uploading. By leveraging edge servers as intermediaries to perform partial model aggregation in proximity and relieve core network transmission overhead, it enables great potentials in low-latency and energy-efficient FL. Hence we introduce a novel Hierarchical Federated Edge Learning (HFEL) framework in which model aggregation is partially migrated to edge servers from the cloud. We further formulate a joint computation and communication resource allocation and edge association problem for device users under HFEL framework to achieve global cost minimization. To solve the problem, we propose an efficient resource scheduling algorithm in the HFEL framework. It can be decomposed into two subproblems: \emph{resource allocation} given a scheduled set of devices for each edge server and \emph{edge association} of device users across all the edge servers. With the optimal policy of the convex resource allocation subproblem for a set of devices under a single edge server, an efficient edge association strategy can be achieved through iterative global cost reduction adjustment process, which is shown to converge to a stable system point. Extensive performance evaluations demonstrate that our HFEL framework outperforms the proposed benchmarks in global cost saving and achieves better training performance compared to conventional federated learning.

preprint2020arXiv

HierTrain: Fast Hierarchical Edge AI Learning with Hybrid Parallelism in Mobile-Edge-Cloud Computing

Nowadays, deep neural networks (DNNs) are the core enablers for many emerging edge AI applications. Conventional approaches to training DNNs are generally implemented at central servers or cloud centers for centralized learning, which is typically time-consuming and resource-demanding due to the transmission of a large amount of data samples from the device to the remote cloud. To overcome these disadvantages, we consider accelerating the learning process of DNNs on the Mobile-Edge-Cloud Computing (MECC) paradigm. In this paper, we propose HierTrain, a hierarchical edge AI learning framework, which efficiently deploys the DNN training task over the hierarchical MECC architecture. We develop a novel \textit{hybrid parallelism} method, which is the key to HierTrain, to adaptively assign the DNN model layers and the data samples across the three levels of edge device, edge server and cloud center. We then formulate the problem of scheduling the DNN training tasks at both layer-granularity and sample-granularity. Solving this optimization problem enables us to achieve the minimum training time. We further implement a hardware prototype consisting of an edge device, an edge server and a cloud server, and conduct extensive experiments on it. Experimental results demonstrate that HierTrain can achieve up to 6.9x speedup compared to the cloud-based hierarchical training approach.

preprint2020arXiv

High-order Time Stepping Schemes for Semilinear Subdiffusion Equations

The aim of this paper is to develop and analyze high-order time stepping schemes for solving semilinear subdiffusion equations. We apply the $k$-step BDF convolution quadrature to discretize the time-fractional derivative with order $α\in (0,1)$, and modify the starting steps in order to achieve optimal convergence rate. This method has already been well-studied for the linear fractional evolution equations in Jin, Li and Zhou \cite{JinLiZhou:correction}, while the numerical analysis for the nonlinear problem is still missing in the literature. By splitting the nonlinear potential term into an irregular linear part and a smoother nonlinear part, and using the generating function technique, we prove that the convergence order of the corrected BDF$k$ scheme is $O(τ^{\min(k,1+2α-ε)})$, without imposing further assumption on the regularity of the solution. Numerical examples are provided to support our theoretical results.

preprint2020arXiv

Incomplete Iterative Solution of the Subdiffusion Problem

In this work, we develop an efficient incomplete iterative scheme for the numerical solution of the subdiffusion model involving a Caputo derivative of order $α\in(0,1)$ in time. It is based on piecewise linear Galerkin finite element method in space and backward Euler convolution quadrature in time and solves one linear algebraic system inexactly by an iterative algorithm at each time step. We present theoretical results for both smooth and nonsmooth solutions, using novel weighted estimates of the time-stepping scheme. The analysis indicates that with the number of iterations at each time level chosen properly, the error estimates are nearly identical with that for the exact linear solver, and the theoretical findings provide guidelines on the choice. Illustrative numerical results are presented to complement the theoretical analysis.

preprint2020arXiv

Leveraging the Power of Prediction: Predictive Service Placement for Latency-Sensitive Mobile Edge Computing

Mobile edge computing (MEC) is emerging to support delay-sensitive 5G applications at the edge of mobile networks. When a user moves erratically among multiple MEC nodes, the challenge of how to dynamically migrate its service to maintain service performance (i.e., user-perceived latency) arises. However, frequent service migration can significantly increase operational cost, incurring the conflict between improving performance and reducing cost. To address these mis-aligned objectives, this paper studies the performance optimization of mobile edge service placement under the constraint of long-term cost budget. It is challenging because the budget involves the future uncertain information (e.g., user mobility). To overcome this difficulty, we devote to leveraging the power of prediction and advocate predictive service placement with predicted near-future information. By using two-timescale Lyapunov optimization method, we propose a T-slot predictive service placement (PSP) algorithm to incorporate the prediction of user mobility based on a frame-based design. We characterize the performance bounds of PSP in terms of cost-delay trade-off theoretically. Furthermore, we propose a new weight adjustment scheme for the queue in each frame named PSP-WU to exploit the historical queue information, which greatly reduces the length of queue while improving the quality of user-perceived latency. Rigorous theoretical analysis and extensive evaluations using realistic data traces demonstrate the superior performance of the proposed predictive schemes.

preprint2020arXiv

Nonlocal-in-time dynamics and crossover of diffusive regimes

We study a simple nonlocal-in-time dynamic system proposed for the effective modeling of complex diffusive regimes in heterogeneous media. We present its solutions and their commonly studied statistics such as the mean square distance. This interesting model employs a nonlocal operator to replace the conventional first-order time-derivative. It introduces a finite memory effect of a constant length encoded through a kernel function. The nonlocal-in-time operator is related to fractional time derivatives that rely on the entire time-history on one hand, while reduces to, on the other hand, the classical time derivative if the length of the memory window diminishes. This allows us to demonstrate the effectiveness of the nonlocal-in-time model in capturing the crossover widely observed in nature between the initial sub-diffusion and the long time normal diffusion.

preprint2020arXiv

Subdiffusion with Time-Dependent Coefficients: Improved Regularity and Second-Order Time Stepping

This article concerns second-order time discretization of subdiffusion equations with time-dependent diffusion coefficients. High-order differentiability and regularity estimates are established for subdiffusion equations with time-dependent coefficients. Using these regularity results and a perturbation argument of freezing the diffusion coefficient, we prove that the convolution quadrature generated by the second-order backward differentiation formula, with proper correction at the first time step, can achieve second-order convergence for both nonsmooth initial data and incompatible source term. Numerical experiments are consistent with the theoretical results.

preprint2020arXiv

The energy technique for the six-step BDF method

In combination with the Grenander--Szegö theorem, we observe that a relaxed positivity condition on multipliers, milder than the basic %fundamental requirement of the Nevanlinna--Odeh multipliers that the sum of the absolute values of their components is strictly less than $1$, makes the energy technique applicable to the stability analysis of BDF methods for parabolic equations with selfadjoint elliptic part. This is particularly useful for the six-step BDF method for which no Nevanlinna--Odeh multiplier exists. We introduce multipliers satisfying the positivity property for the six-step BDF method and establish stability of the method for parabolic equations.

preprint2020arXiv

When Deep Reinforcement Learning Meets Federated Learning: Intelligent Multi-Timescale Resource Management for Multi-access Edge Computing in 5G Ultra Dense Network

Ultra-dense edge computing (UDEC) has great potential, especially in the 5G era, but it still faces challenges in its current solutions, such as the lack of: i) efficient utilization of multiple 5G resources (e.g., computation, communication, storage and service resources); ii) low overhead offloading decision making and resource allocation strategies; and iii) privacy and security protection schemes. Thus, we first propose an intelligent ultra-dense edge computing (I-UDEC) framework, which integrates blockchain and Artificial Intelligence (AI) into 5G ultra-dense edge computing networks. First, we show the architecture of the framework. Then, in order to achieve real-time and low overhead computation offloading decisions and resource allocation strategies, we design a novel two-timescale deep reinforcement learning (\textit{2Ts-DRL}) approach, consisting of a fast-timescale and a slow-timescale learning process, respectively. The primary objective is to minimize the total offloading delay and network resource usage by jointly optimizing computation offloading, resource allocation and service caching placement. We also leverage federated learning (FL) to train the \textit{2Ts-DRL} model in a distributed manner, aiming to protect the edge devices' data privacy. Simulation results corroborate the effectiveness of both the \textit{2Ts-DRL} and FL in the I-UDEC framework and prove that our proposed algorithm can reduce task execution time up to 31.87%.