Source author record

Chang Tian

Chang Tian appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Science and Game Theory Machine Learning math.DG Social and Information Networks

Catalog footprint

What is connected

3works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

On Minkowskian Product of Finsler Manifolds

Let (M_1,F_1) and (M_2,F_2) be a pair of Finsler manifolds. The Minkowskian product Finsler manifold (M,F) of (M_1,F_1) and (M_2,F_2) with respect to a product function f is the product manifold M=M_1\times M_2 endowed with the Finsler metric F^2=f(K,H), where K=(F_1)^2,H=(F_2)^2. In this paper, the Cartan connection and Berwald connection of (M,F) are derived in terms of the corresponding objects of (M_1,F_1) and (M_2,F_2). Necessary and sufficient conditions for (M,F) to be Berwald (resp. weakly Berwald, Landsberg, weakly Landsberg) manifold are obtained. Thus an effective method for constructing special Finsler manifolds mentioned above is given.

preprint2021arXiv

Successive Convex Approximation Based Off-Policy Optimization for Constrained Reinforcement Learning

We propose a successive convex approximation based off-policy optimization (SCAOPO) algorithm to solve the general constrained reinforcement learning problem, which is formulated as a constrained Markov decision process (CMDP) in the context of average cost. The SCAOPO is based on solving a sequence of convex objective/feasibility optimization problems obtained by replacing the objective and constraint functions in the original problems with convex surrogate functions. At each iteration, the convex surrogate problem can be efficiently solved by Lagrange dual method even the policy is parameterized by a high-dimensional function. Moreover, the SCAOPO enables to reuse old experiences from previous updates, thereby significantly reducing the implementation cost when deployed in the real-world engineering systems that need to online learn the environment. In spite of the time-varying state distribution and the stochastic bias incurred by the off-policy learning, the SCAOPO with a feasible initial point can still provably converge to a Karush-Kuhn-Tucker (KKT) point of the original problem almost surely.

preprint2016arXiv

Providing Probabilistic Robustness Guarantee for Crowdsensing

Due to its flexible and pervasive sensing ability, crowdsensing has been extensively studied recently in research communities. However, the fundamental issue of how to meet the requirement of sensing robustness in crowdsensing remains largely unsolved. Specifically, from the task owner's perspective, how to minimize the total payment in crowdsensing while guaranteeing the sensing data quality is a critical issue to be resolved. We elegantly model the robustness requirement over sensing data quality as chance constraints, and investigate both hard and soft chance constraints for different crowdsensing applications. For the former, we reformulate the problem through Boole's Inequality, and explore the optimal value gap between the original problem and the reformulated problem. For the latter, we study a serial of a general payment minimization problem, and propose a binary search algorithm that achieves both feasibility and low payment. The performance gap between our solution and the optimal solution is also theoretically analyzed. Extensive simulations validate our theoretical analysis.