Source author record

Apostolos N. Burnetas

Apostolos N. Burnetas appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.OC Machine Learning

Catalog footprint

What is connected

2works

2topics

2close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2015arXiv

Asymptotically Optimal Multi-Armed Bandit Policies under a Cost Constraint

We develop asymptotically optimal policies for the multi armed bandit (MAB), problem, under a cost constraint. This model is applicable in situations where each sample (or activation) from a population (bandit) incurs a known bandit dependent cost. Successive samples from each population are iid random variables with unknown distribution. The objective is to design a feasible policy for deciding from which population to sample from, so as to maximize the expected sum of outcomes of $n$ total samples or equivalently to minimize the regret due to lack on information on sample distributions, For this problem we consider the class of feasible uniformly fast (f-UF) convergent policies, that satisfy the cost constraint sample-path wise. We first establish a necessary asymptotic lower bound for the rate of increase of the regret function of f-UF policies. Then we construct a class of f-UF policies and provide conditions under which they are asymptotically optimal within the class of f-UF policies, achieving this asymptotic lower bound. At the end we provide the explicit form of such policies for the case in which the unknown distributions are Normal with unknown means and known variances.

preprint2015arXiv

Inventory Policies for Two Products under Poisson Demand: Interaction between Demand Substitution, Limited Storage Capacity and Replenishment Time Uncertainty

We consider a two-product inventory system with independent Poisson demands, limited joint storage capacity and partial demand substitution. Replenishment is performed simultaneously for both products and the replenishment time may be fixed or exponentially distributed. For both cases we develop a Continuous Time Markov Chain model for the inventory levels and derive expressions for the expected profit per unit time. We prove that the profit function is submodular in the order quantities, which allows for a more efficient algorithm to determine the optimal ordering policy. Using computational experiments we assess the effect of substitution and replenishment time uncertainty on the order quantities and the profit as a function of the storage capacity.

Apostolos N. Burnetas

What is connected

Connect this record

See the researcher in context

Building this map preview

2 published item(s)

Asymptotically Optimal Multi-Armed Bandit Policies under a Cost Constraint

Inventory Policies for Two Products under Poisson Demand: Interaction between Demand Substitution, Limited Storage Capacity and Replenishment Time Uncertainty