Source author record

James Zhang

James Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.RA Machine Learning math.QA Artificial Intelligence Computation and Language math.RT Multiagent Systems

Catalog footprint

What is connected

15works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

A Meta Reinforcement Learning Approach for Predictive Autoscaling in the Cloud

Predictive autoscaling (autoscaling with workload forecasting) is an important mechanism that supports autonomous adjustment of computing resources in accordance with fluctuating workload demands in the Cloud. In recent works, Reinforcement Learning (RL) has been introduced as a promising approach to learn the resource management policies to guide the scaling actions under the dynamic and uncertain cloud environment. However, RL methods face the following challenges in steering predictive autoscaling, such as lack of accuracy in decision-making, inefficient sampling and significant variability in workload patterns that may cause policies to fail at test time. To this end, we propose an end-to-end predictive meta model-based RL algorithm, aiming to optimally allocate resource to maintain a stable CPU utilization level, which incorporates a specially-designed deep periodic workload prediction model as the input and embeds the Neural Process to guide the learning of the optimal scaling actions over numerous application services in the Cloud. Our algorithm not only ensures the predictability and accuracy of the scaling strategy, but also enables the scaling decisions to adapt to the changing workloads with high sample efficiency. Our method has achieved significant performance improvement compared to the existing algorithms and has been deployed online at Alipay, supporting the autoscaling of applications for the world-leading payment platform.

preprint2022arXiv

Learning Large-scale Universal User Representation with Sparse Mixture of Experts

Learning user sequence behaviour embedding is very sophisticated and challenging due to the complicated feature interactions over time and high dimensions of user features. Recent emerging foundation models, e.g., BERT and its variants, encourage a large body of researchers to investigate in this field. However, unlike natural language processing (NLP) tasks, the parameters of user behaviour model come mostly from user embedding layer, which makes most existing works fail in training a universal user embedding of large scale. Furthermore, user representations are learned from multiple downstream tasks, and the past research work do not address the seesaw phenomenon. In this paper, we propose SUPERMOE, a generic framework to obtain high quality user representation from multiple tasks. Specifically, the user behaviour sequences are encoded by MoE transformer, and we can thus increase the model capacity to billions of parameters, or even to trillions of parameters. In order to deal with seesaw phenomenon when learning across multiple tasks, we design a new loss function with task indicators. We perform extensive offline experiments on public datasets and online experiments on private real-world business scenarios. Our approach achieves the best performance over state-of-the-art models, and the results demonstrate the effectiveness of our framework.

preprint2022arXiv

Unit Ball Model for Embedding Hierarchical Structures in the Complex Hyperbolic Space

Learning the representation of data with hierarchical structures in the hyperbolic space attracts increasing attention in recent years. Due to the constant negative curvature, the hyperbolic space resembles tree metrics and captures the tree-like properties naturally, which enables the hyperbolic embeddings to improve over traditional Euclidean models. However, many real-world hierarchically structured data such as taxonomies and multitree networks have varying local structures and they are not trees, thus they do not ubiquitously match the constant curvature property of the hyperbolic space. To address this limitation of hyperbolic embeddings, we explore the complex hyperbolic space, which has the variable negative curvature, for representation learning. Specifically, we propose to learn the embeddings of hierarchically structured data in the unit ball model of the complex hyperbolic space. The unit ball model based embeddings have a more powerful representation capacity to capture a variety of hierarchical structures. Through experiments on synthetic and real-world data, we show that our approach improves over the hyperbolic embedding models significantly. We also explore the competence of complex hyperbolic geometry on the multitree structure and $1$-$N$ structure.

preprint2022arXiv

Variational Policy Propagation for Multi-agent Reinforcement Learning

We propose a \emph{collaborative} multi-agent reinforcement learning algorithm named variational policy propagation (VPP) to learn a \emph{joint} policy through the interactions over agents. We prove that the joint policy is a Markov Random Field under some mild conditions, which in turn reduces the policy space effectively. We integrate the variational inference as special differentiable layers in policy such that the actions can be efficiently sampled from the Markov Random Field and the overall policy is differentiable. We evaluate our algorithm on several large scale challenging tasks and demonstrate that it outperforms previous state-of-the-arts.

preprint2020arXiv

Model Embedding Model-Based Reinforcement Learning

Model-based reinforcement learning (MBRL) has shown its advantages in sample-efficiency over model-free reinforcement learning (MFRL). Despite the impressive results it achieves, it still faces a trade-off between the ease of data generation and model bias. In this paper, we propose a simple and elegant model-embedding model-based reinforcement learning (MEMB) algorithm in the framework of the probabilistic reinforcement learning. To balance the sample-efficiency and model bias, we exploit both real and imaginary data in the training. In particular, we embed the model in the policy update and learn $Q$ and $V$ functions from the real data set. We provide the theoretical analysis of MEMB with the Lipschitz continuity assumption on the model and policy. At last, we evaluate MEMB on several benchmarks and demonstrate our algorithm can achieve state-of-the-art performance.

preprint2020arXiv

Neural Physicist: Learning Physical Dynamics from Image Sequences

We present a novel architecture named Neural Physicist (NeurPhy) to learn physical dynamics directly from image sequences using deep neural networks. For any physical system, given the global system parameters, the time evolution of states is governed by the underlying physical laws. How to learn meaningful system representations in an end-to-end way and estimate accurate state transition dynamics facilitating long-term prediction have been long-standing challenges. In this paper, by leveraging recent progresses in representation learning and state space models (SSMs), we propose NeurPhy, which uses variational auto-encoder (VAE) to extract underlying Markovian dynamic state at each time step, neural process (NP) to extract the global system parameters, and a non-linear non-recurrent stochastic state space model to learn the physical dynamic transition. We apply NeurPhy to two physical experimental environments, i.e., damped pendulum and planetary orbits motion, and achieve promising results. Our model can not only extract the physically meaningful state representations, but also learn the state transition dynamics enabling long-term predictions for unseen image sequences. Furthermore, from the manifold dimension of the latent state space, we can easily identify the degree of freedom (DoF) of the underlying physical systems.

preprint2020arXiv

Riemannian Proximal Policy Optimization

In this paper, We propose a general Riemannian proximal optimization algorithm with guaranteed convergence to solve Markov decision process (MDP) problems. To model policy functions in MDP, we employ Gaussian mixture model (GMM) and formulate it as a nonconvex optimization problem in the Riemannian space of positive semidefinite matrices. For two given policy functions, we also provide its lower bound on policy improvement by using bounds derived from the Wasserstein distance of GMMs. Preliminary experiments show the efficacy of our proposed Riemannian proximal policy optimization algorithm.

preprint2016arXiv

Discriminant Formulas and Applications

We solve two conjectures of Ceken-Palmieri-Wang-Zhang concerning discriminants and give some applications.

preprint2016arXiv

Discriminants and Automorphism Groups of Veronese subrings of skew polynomial rings

We study important invariants and properties of the Veronese subalgebras of $q$-skew polynomial rings, including their discriminant, center and automorphism group, as well as cancellation property and the Tits alternative.

preprint2015arXiv

Invariant theory for quantum Weyl algebras under finite group action

We study the invariant theory of a class of quantum Weyl algebras under group actions and prove that the fixed subrings are always Gorenstein. We also verify the Tits alternative for the automorphism groups of these quantum Weyl algebras.

preprint2014arXiv

Hopf actions and Nakayama automorphisms

Let H be a Hopf algebra with antipode S, and let A be an N-Koszul Artin-Schelter regular algebra. We study connections between the Nakayama automorphism of A and S^2 of H when H coacts on A inner-faithfully. Several applications pertaining to Hopf actions on Artin-Schelter regular algebras are given.

preprint2014arXiv

Quantum binary polyhedral groups and their actions on quantum planes

We classify quantum analogues of actions of finite subgroups G of SL_2(k) on commutative polynomial rings k[u,v]. More precisely, we produce a classification of pairs (H,R), where H is a finite dimensional Hopf algebra that acts inner faithfully and preserves the grading of an Artin-Schelter regular algebra R of global dimension two. Remarkably, the corresponding invariant rings R^H share similar regularity and Gorenstein properties as the invariant rings k[u,v]^G in the classic setting. We also present several questions and directions for expanding this work in noncommutative invariant theory.

preprint2014arXiv

The discriminant controls automorphism groups of noncommutative algebras

We use the discriminant to determine the automorphism groups of some noncommutative algebras, and we prove that a family of noncommutative algebras has tractable automorphism groups.

preprint2014arXiv

The discriminant criterion and automorphism groups of quantized algebras

We compute the automorphism groups of some quantized algebras, including tensor products of quantum Weyl algebras and some skew polynomial rings.

preprint2013arXiv

Hopf actions on filtered regular algebras

We study finite dimensional Hopf algebra actions on so-called filtered Artin-Schelter regular algebras of dimension n, particularly on those of dimension 2. The first Weyl algebra is an example of such on algebra with n=2, for instance. Results on the Gorenstein condition and on the global dimension of the corresponding fixed subrings are also provided.

James Zhang

What is connected

Connect this record

See the researcher in context

Building this map preview

15 published item(s)

A Meta Reinforcement Learning Approach for Predictive Autoscaling in the Cloud

Learning Large-scale Universal User Representation with Sparse Mixture of Experts

Unit Ball Model for Embedding Hierarchical Structures in the Complex Hyperbolic Space

Variational Policy Propagation for Multi-agent Reinforcement Learning

Model Embedding Model-Based Reinforcement Learning

Neural Physicist: Learning Physical Dynamics from Image Sequences

Riemannian Proximal Policy Optimization

Discriminant Formulas and Applications

Discriminants and Automorphism Groups of Veronese subrings of skew polynomial rings

Invariant theory for quantum Weyl algebras under finite group action

Hopf actions and Nakayama automorphisms

Quantum binary polyhedral groups and their actions on quantum planes

The discriminant controls automorphism groups of noncommutative algebras

The discriminant criterion and automorphism groups of quantized algebras

Hopf actions on filtered regular algebras