Source author record

Keqin Liu

Keqin Liu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

math.RA, Machine Learning, math.OC, math.PR, Networking and Internet Architecture, Systems and Control, Computer Science and Game Theory, math.AG, math.CV, math.DS, math.GM, math.NA, math.ST, Numerical Analysis, Statistics Theory

Catalog footprint

What is connected

17 works

15 topics

4 close collaborators


preprint, 2022, arXiv

Noncommutative Partial Derivative

We introduce the axiomatic definition of the point-derivative for noncommutative algebras and present the counterparts of the ordinary multi-variable chain rule and Clairaut's Theorem in the context of partial point-derivatives.

preprint, 2020, arXiv

Automatic Integration

The purpose of this paper is to introduce the concept of automatic integration and to present a new way of approximating definite integrals using automatic integration based on an associative algebra with zero divisors.

preprint, 2020, arXiv

How to Define Automatic Differentiation

Based on a class of associative algebras with zero divisors, which we call real-like algebras, we introduce a way of defining automatic differentiation and present different ways of doing automatic differentiation to compute the first, second, and third derivatives of a function exactly and simultaneously.
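The idea of reading derivatives off an algebra with zero divisors can be illustrated with a minimal sketch. It uses the standard truncated polynomial algebra R[e]/(e^4), which is an assumption made here for illustration and is not claimed to be one of the paper's real-like algebras. The names `Jet3`, `derivatives`, and `cubic` are hypothetical. Evaluating f at x + e yields f(x), f'(x), f''(x)/2, and f'''(x)/6 as coefficients.

```python
from math import factorial


class Jet3:
    """Element a0 + a1*e + a2*e^2 + a3*e^3 of R[e]/(e^4).

    e is nilpotent (e^4 = 0), so the algebra has zero divisors: e * e^3 = 0.
    This is an illustrative stand-in, not the paper's construction.
    """

    def __init__(self, a0, a1=0.0, a2=0.0, a3=0.0):
        self.c = (float(a0), float(a1), float(a2), float(a3))

    @staticmethod
    def _lift(x):
        # Treat plain numbers as constants of the algebra.
        return x if isinstance(x, Jet3) else Jet3(x)

    def __add__(self, other):
        other = Jet3._lift(other)
        return Jet3(*(p + q for p, q in zip(self.c, other.c)))

    __radd__ = __add__

    def __mul__(self, other):
        other = Jet3._lift(other)
        out = [0.0] * 4
        for i, p in enumerate(self.c):
            for j, q in enumerate(other.c):
                if i + j < 4:  # terms of degree >= 4 vanish because e^4 = 0
                    out[i + j] += p * q
        return Jet3(*out)

    __rmul__ = __mul__  # the algebra is commutative, so this is safe


def derivatives(f, x):
    """Return (f(x), f'(x), f''(x), f'''(x)) for f built from + and *."""
    y = f(Jet3(x, 1.0))
    # The coefficient of e^k is the k-th derivative divided by k!.
    return tuple(coef * factorial(k) for k, coef in enumerate(y.c))


def cubic(t):
    return t * t * t + 2 * t  # f(t) = t^3 + 2t


print(derivatives(cubic, 2.0))  # (12.0, 14.0, 12.0, 6.0)
```

All three derivatives come out of a single evaluation, with no finite-difference error, which is the sense in which such computations are exact and simultaneous.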

preprint, 2015, arXiv

Algebraic Regularity over Quaternions and Regular Four-Manifolds

Based on a new generalization of the Cauchy-Riemann system presented in this paper, we introduce a class of quaternion-valued functions of a quaternionic variable, which are called algebraic regular functions. The set of algebraic regular functions is not only a real associative algebra, but also respects the composition of functions. Using algebraic regular functions as transition maps, we introduce a class of four-manifolds called regular four-manifolds.

preprint, 2014, arXiv

Some Results about Triangular Representations of Lie Algebras

We introduce the concept of a triangular representation of a Lie algebra, give a counterpart of Ado's theorem, and discuss $2$-irreducible triangular modules over a nonreductive Lie algebra.

preprint, 2013, arXiv

Deterministic Sequencing of Exploration and Exploitation for Multi-Armed Bandit Problems

In the Multi-Armed Bandit (MAB) problem, there is a given set of arms with unknown reward models. At each time, a player selects one arm to play, aiming to maximize the total expected reward over a horizon of length T. An approach based on a Deterministic Sequencing of Exploration and Exploitation (DSEE) is developed for constructing sequential arm selection policies. It is shown that for all light-tailed reward distributions, DSEE achieves the optimal logarithmic order of the regret, where regret is defined as the total expected reward loss against the ideal case with known reward models. For heavy-tailed reward distributions, DSEE achieves $O(T^{1/p})$ regret when the moments of the reward distributions exist up to the $p$th order for $1 < p \le 2$, and $O(T^{1/(1+p/2)})$ regret for $p > 2$. With knowledge of an upper bound on a finite moment of the heavy-tailed reward distributions, DSEE offers the optimal logarithmic regret order. The proposed DSEE approach complements existing work on MAB by providing corresponding results for general reward distributions. Furthermore, with a clearly defined tunable parameter (the cardinality of the exploration sequence), the DSEE approach is easily extendable to variations of MAB, including MAB with various objectives, decentralized MAB with multiple players and incomplete reward observations under collisions, MAB with unknown Markov dynamics, and combinatorial MAB with dependent arms that often arise in network optimization problems such as the shortest path, the minimum spanning tree, and the dominating set problems under unknown random weights.
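The separation of exploration and exploitation described above can be sketched as follows. This is a minimal illustration under stated assumptions, not the paper's exact policy: the threshold rule (explore while fewer than `n * ceil(w * log(t + 1))` exploration steps have been taken), the constant `w`, the use of exploration observations only for the sample means, and the names `dsee` and `bernoulli` are all hypothetical choices made here.

```python
import math
import random


def dsee(arms, horizon, w=5.0, seed=0):
    """Play `arms` (callables taking an rng and returning a reward) for `horizon` steps.

    Assumed rule: step t is an exploration step while fewer than
    n * ceil(w * log(t + 1)) exploration steps have been taken, so the
    cardinality of the exploration sequence grows logarithmically in t.
    log(t + 1) is used so that the threshold is positive at t = 1.
    """
    rng = random.Random(seed)
    n = len(arms)
    sums = [0.0] * n   # reward totals, from exploration steps only
    counts = [0] * n   # number of exploration pulls per arm
    explored = 0
    pulls = []
    for t in range(1, horizon + 1):
        if explored < n * math.ceil(w * math.log(t + 1)):
            # Exploration: cycle through the arms in round-robin order.
            arm = explored % n
            reward = arms[arm](rng)
            sums[arm] += reward
            counts[arm] += 1
            explored += 1
        else:
            # Exploitation: play the arm with the largest sample mean.
            arm = max(range(n), key=lambda i: sums[i] / counts[i])
            arms[arm](rng)
        pulls.append(arm)
    return pulls, explored


def bernoulli(p):
    """Hypothetical reward model: reward 1 with probability p, else 0."""
    return lambda rng: 1.0 if rng.random() < p else 0.0


pulls, explored = dsee([bernoulli(0.2), bernoulli(0.8)], horizon=2000)
print(explored, pulls.count(1))
```

Because the exploration schedule depends only on t and not on observed rewards, the single parameter `w` controls how much time is spent exploring, which is what makes this structure easy to adapt to other bandit variants.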

preprint, 2012, arXiv

Adaptive Shortest-Path Routing under Unknown and Stochastically Varying Link States

We consider the adaptive shortest-path routing problem in wireless networks under unknown and stochastically varying link states. In this problem, we aim to optimize the quality of communication between a source and a destination through adaptive path selection. Due to the randomness and uncertainties in the network dynamics, the quality of each link varies over time according to a stochastic process with unknown distributions. After a path is selected for communication, the aggregated quality of all links on this path (e.g., total path delay) is observed. The quality of each individual link is not observable. We formulate this problem as a multi-armed bandit with dependent arms. We show that by exploiting arm dependencies, a regret that is polynomial in network size can be achieved while maintaining the optimal logarithmic order in time. This is in sharp contrast with the regret of exponential order in network size offered by a direct application of the classic MAB policies that ignore arm dependencies. Furthermore, our results are obtained under a general model of link-quality distributions (including heavy-tailed distributions) and find applications in cognitive radio and ad hoc networks with unknown and dynamic communication environments.

preprint, 2012, arXiv

Distributed Flow Scheduling in an Unknown Environment

Flow scheduling is one of the oldest and most stubborn problems in networking. It becomes more crucial in next-generation networks, due to fast-changing link states and the tremendous cost of exploring the global structure. In such situations, distributed algorithms often dominate. In this paper, we design a distributed virtual game to solve the flow scheduling problem and then generalize it to unknown environments, where online learning schemes are utilized. In the virtual game, we use incentives to stimulate selfish users to reach a Nash equilibrium point, which is shown to be valid through an analysis of the 'Price of Anarchy'. In the unknown-environment generalization, our ultimate goal is to minimize the long-run cost. To balance exploration of routing costs against exploitation based on limited information, we model this problem as a multi-armed bandit and combine the newly proposed DSEE with the virtual game design. Armed with these tools, we obtain a fully distributed algorithm that ensures logarithmic growth of regret with time, which is optimal in the classic multi-armed bandit problem. Theoretical proof and simulation results both affirm this claim. To our knowledge, this is the first work to combine multi-armed bandits with distributed flow scheduling.

preprint, 2012, arXiv

Representations$^{6-th}$ of Lie Algebras

We introduce representations$^{6-th}$ of Lie algebras, and study the counterparts of the P-B-W Theorem and the Hopf algebra structure for the enveloping algebras of Lie algebras in the context of representations$^{6-th}$ of Lie algebras.

preprint, 2012, arXiv

Sheaf Structures On a Class of Noncommutative Spectra

We introduce a class of noncommutative spectra and give the sheaf structure on the class of noncommutative spectra.

preprint, 2011, arXiv

A Class of Noncommutative Spectra

We construct a class of noncommutative spectra and give the basic properties of the class of noncommutative spectra.

preprint, 2011, arXiv

Dynamic Intrusion Detection in Resource-Constrained Cyber Networks

We consider a large-scale cyber network with N components (e.g., paths, servers, subnets). Each component is either in a healthy state (0) or an abnormal state (1). Due to random intrusions, the state of each component transits from 0 to 1 over time according to a certain stochastic process. At each time, a subset of K (K < N) components is checked and those observed in abnormal states are fixed. The objective is to design the optimal scheduling for intrusion detection such that the long-term network cost incurred by all abnormal components is minimized. We formulate the problem as a special class of Restless Multi-Armed Bandit (RMAB) process. A general RMAB suffers from the curse of dimensionality (PSPACE-hard) and numerical methods are often inapplicable. We show that, for this class of RMAB, the Whittle index exists and can be obtained in closed form, leading to a low-complexity implementation of the Whittle index policy with strong performance. For homogeneous components, the Whittle index policy is shown to have a simple structure that does not require any prior knowledge of the intrusion processes. Based on this structure, the Whittle index policy is further shown to be optimal over a finite time horizon of arbitrary length. Beyond intrusion detection, these results also find applications in queueing networks with finite-size buffers.

preprint, 2011, arXiv

Invariant Algebras

We introduce invariant algebras and representation$^{(c_1,..., c_8)}$ of algebras, and give many ways of constructing Lie algebras, Jordan algebras, Leibniz algebras, pre-Lie algebras and left-symmetric algebras in an invariant algebra.

preprint, 2011, arXiv

Learning in A Changing World: Restless Multi-Armed Bandit with Unknown Dynamics

We consider the restless multi-armed bandit (RMAB) problem with unknown dynamics in which a player chooses M out of N arms to play at each time. The reward state of each arm transits according to an unknown Markovian rule when it is played and evolves according to an arbitrary unknown random process when it is passive. The performance of an arm selection policy is measured by regret, defined as the reward loss with respect to the case where the player knows which M arms are the most rewarding and always plays the M best arms. We construct a policy with an interleaving exploration and exploitation epoch structure that achieves a regret with logarithmic order when arbitrary (but nontrivial) bounds on certain system parameters are known. When no knowledge about the system is available, we show that the proposed policy achieves a regret arbitrarily close to the logarithmic order. We further extend the problem to a decentralized setting where multiple distributed players share the arms without information exchange. Under both an exogenous restless model and an endogenous restless model, we show that a decentralized extension of the proposed policy preserves the logarithmic regret order as in the centralized setting. The results apply to adaptive learning in various dynamic systems and communication networks, as well as financial investment.

preprint, 2010, arXiv

Distributed Learning in Multi-Armed Bandit with Multiple Players

We formulate and study a decentralized multi-armed bandit (MAB) problem. There are M distributed players competing for N independent arms. Each arm, when played, offers an i.i.d. reward according to a distribution with an unknown parameter. At each time, each player chooses one arm to play without exchanging observations or any information with other players. Players choosing the same arm collide, and, depending on the collision model, either no one receives reward or the colliding players share the reward in an arbitrary way. We show that the minimum system regret of the decentralized MAB grows with time at the same logarithmic order as in the centralized counterpart where players act collectively as a single entity by exchanging observations and making decisions jointly. A decentralized policy is constructed to achieve this optimal order while ensuring fairness among players and without assuming any pre-agreement or information exchange among players. Based on a Time Division Fair Sharing (TDFS) of the M best arms, the proposed policy is constructed and its order optimality is proven under a general reward model. Furthermore, the basic structure of the TDFS policy can be used with any order-optimal single-player policy to achieve order optimality in the decentralized setting. We also establish a lower bound on the system regret growth rate for a general class of decentralized policies, to which the proposed policy belongs. This problem finds potential applications in cognitive radio networks, multi-channel communication systems, multi-agent systems, web search and advertising, and social networks.

preprint, 2010, arXiv

Hopf-like Algebras and Extended P-B-W Theorems

Based on invariant algebras, we introduce representations$^{6-th}$ of Lie algebras and representations$^{< 4-th>}$ of Leibniz algebras, give the extended P-B-W Theorems in the context of the new representations of Lie algebras and Leibniz algebras, and generalize the Hopf-algebra structure on the enveloping algebras of Lie algebras.

preprint, 2008, arXiv

On Myopic Sensing for Multi-Channel Opportunistic Access: Structure, Optimality, and Performance

We consider a multi-channel opportunistic communication system where the states of these channels evolve as independent and statistically identical Markov chains (the Gilbert-Elliott channel model). A user chooses one channel to sense and access in each slot and collects a reward determined by the state of the chosen channel. The problem is to design a sensing policy for channel selection to maximize the average reward, which can be formulated as a multi-arm restless bandit process. In this paper, we study the structure, optimality, and performance of the myopic sensing policy. We show that the myopic sensing policy has a simple robust structure that reduces channel selection to a round-robin procedure and obviates the need for knowing the channel transition probabilities. The optimality of this simple policy is established for the two-channel case and conjectured for the general case based on numerical results. The performance of the myopic sensing policy is analyzed, which, based on the optimality of myopic sensing, characterizes the maximum throughput of a multi-channel opportunistic communication system and its scaling behavior with respect to the number of channels. These results apply to cognitive radio networks, opportunistic transmission in fading environments, and resource-constrained jamming and anti-jamming.
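The round-robin structure mentioned above can be sketched in a few lines. The rule used here is an assumption for the case of positively correlated channels: stay on a channel while it is observed good, and after a bad observation move to the channel visited longest ago. The function name `myopic_round_robin` and the `observe` callback are hypothetical, and the abstract itself does not spell out the rule.

```python
from collections import deque


def myopic_round_robin(observe, n_channels, n_slots):
    """Sense one channel per slot and return the list of chosen channels.

    `observe(channel, slot)` returns True if the sensed channel is good.
    Assumed rule (positively correlated channels): keep the current channel
    after a good observation; after a bad one, send it to the back of the
    queue and take the channel at the front next.
    """
    order = deque(range(n_channels))
    chosen = []
    for slot in range(n_slots):
        channel = order[0]
        chosen.append(channel)
        if not observe(channel, slot):
            order.rotate(-1)  # current channel moves to the back of the queue
    return chosen


# With every observation bad, the policy cycles through the channels in order.
print(myopic_round_robin(lambda ch, slot: False, 3, 6))  # [0, 1, 2, 0, 1, 2]
```

Note that the procedure only uses the observed states, never the transition probabilities, which matches the robustness property the abstract describes.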
