Source author record

Ayfer Ozgur

Ayfer Ozgur appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Information Theory; math.IT; Machine Learning; math.ST; Statistics Theory; Distributed, Parallel, and Cluster Computing; math.FA; math.PR; Networking and Internet Architecture

Catalog footprint

What is connected

19 works

9 topics

4 close collaborators


preprint, 2021, arXiv

Global Multiclass Classification and Dataset Construction via Heterogeneous Local Experts

In the domains of dataset construction and crowdsourcing, a notable challenge is to aggregate labels from a heterogeneous set of labelers, each of whom is potentially an expert in some subset of tasks (and less reliable in others). To reduce costs of hiring human labelers or training automated labeling systems, it is of interest to minimize the number of labelers while ensuring the reliability of the resulting dataset. We model this as the problem of performing $K$-class classification using the predictions of smaller classifiers, each trained on a subset of $[K]$, and derive bounds on the number of classifiers needed to accurately infer the true class of an unlabeled sample under both adversarial and stochastic assumptions. By exploiting a connection to the classical set cover problem, we produce a near-optimal scheme for designing such configurations of classifiers which recovers the well known one-vs.-one classification approach as a special case. Experiments with the MNIST and CIFAR-10 datasets demonstrate the favorable accuracy (compared to a centralized classifier) of our aggregation scheme applied to classifiers trained on subsets of the data. These results suggest a new way to automatically label data or adapt an existing set of local classifiers to larger-scale multiclass problems.
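As a minimal sketch of the one-vs.-one special case mentioned in the abstract (not the paper's general set-cover construction), the following Python snippet aggregates pairwise classifiers by majority vote. The `pairwise_winner` callable is a hypothetical stand-in for a local classifier trained on two classes; here it is assumed to be correct whenever the true class is one of the two classes it knows, and arbitrary otherwise.

```python
from collections import Counter
from itertools import combinations


def one_vs_one_predict(pairwise_winner, num_classes):
    """Aggregate all pairwise classifiers by counting votes per class."""
    votes = Counter()
    for a, b in combinations(range(num_classes), 2):
        # Each local classifier only chooses between its two classes.
        votes[pairwise_winner(a, b)] += 1
    # The class with the most pairwise wins is the global prediction.
    return max(range(num_classes), key=lambda c: votes[c])


def make_oracle(true_class):
    """Hypothetical local experts: correct when the true class is in the
    pair, and arbitrary (here: the smaller label) when it is not."""
    def pairwise_winner(a, b):
        if true_class in (a, b):
            return true_class
        return a
    return pairwise_winner


prediction = one_vs_one_predict(make_oracle(true_class=3), num_classes=5)
```

Under this assumption the true class wins all of its K-1 contests, while every other class loses at least one (the contest against the true class), so the vote recovers the true class.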

preprint, 2021, arXiv

Over-the-Air Statistical Estimation

We study schemes and lower bounds for distributed minimax statistical estimation over a Gaussian multiple-access channel (MAC) under squared error loss, in a framework combining statistical estimation and wireless communication. First, we develop "analog" joint estimation-communication schemes that exploit the superposition property of the Gaussian MAC and we characterize their risk in terms of the number of nodes and dimension of the parameter space. Then, we derive information-theoretic lower bounds on the minimax risk of any estimation scheme restricted to communicate the samples over a given number of uses of the channel and show that the risk achieved by our proposed schemes is within a logarithmic factor of these lower bounds. We compare both achievability and lower bound results to previous "digital" lower bounds, where nodes transmit errorless bits at the Shannon capacity of the MAC, showing that estimation schemes that leverage the physical layer offer a drastic reduction in estimation error over digital schemes relying on a physical-layer abstraction.
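The superposition idea behind the "analog" schemes can be illustrated with a toy simulation. This is only a sketch under simplifying assumptions that are not taken from the paper: scalar samples, a single channel use, unit channel gains, and no transmit power constraint or scaling. The function name `over_the_air_mean` and all parameter values are hypothetical.

```python
import random


def over_the_air_mean(samples, noise_std, rng):
    """Estimate the sample mean from one use of an additive Gaussian channel."""
    # All nodes transmit at once, so the channel adds their signals together.
    superposed = sum(samples)
    # The receiver observes the sum corrupted by Gaussian noise.
    received = superposed + rng.gauss(0.0, noise_std)
    # Dividing by the number of nodes turns the noisy sum into a mean estimate.
    return received / len(samples)


rng = random.Random(0)
theta = 2.0  # hypothetical true parameter
samples = [theta + rng.gauss(0.0, 1.0) for _ in range(1000)]
estimate = over_the_air_mean(samples, noise_std=1.0, rng=rng)
```

Because the channel itself computes the sum, the channel noise is divided by the number of nodes in the final estimate, which is the intuition for why such schemes can compare favorably with schemes that first decode each node's bits separately.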

preprint, 2020, arXiv

Fisher information under local differential privacy

We develop data processing inequalities that describe how Fisher information from statistical samples can scale with the privacy parameter $\varepsilon$ under local differential privacy constraints. These bounds are valid under general conditions on the distribution of the score of the statistical model, and they elucidate under which conditions the dependence on $\varepsilon$ is linear, quadratic, or exponential. We show how these inequalities imply order optimal lower bounds for private estimation for both the Gaussian location model and discrete distribution estimation for all levels of privacy $\varepsilon>0$. We further apply these inequalities to sparse Bernoulli models and demonstrate privacy mechanisms and estimators with order-matching squared $\ell^2$ error.

preprint, 2020, arXiv

Information Constrained Optimal Transport: From Talagrand, to Marton, to Cover

The optimal transport problem studies how to transport one measure to another in the most cost-effective way and has a wide range of applications from economics to machine learning. In this paper, we introduce and study an information constrained variation of this problem. Our study yields a strengthening and generalization of Talagrand's celebrated transportation cost inequality. Following Marton's approach, we show that the new transportation cost inequality can be used to recover old and new concentration of measure results. Finally, we provide an application of this new inequality to network information theory. We show that it can be used to recover almost immediately a recent solution to a long-standing open problem posed by Cover regarding the capacity of the relay channel.

preprint, 2018, arXiv

Minimax Learning for Remote Prediction

The classical problem of supervised learning is to infer an accurate predictor of a target variable $Y$ from a measured variable $X$ by using a finite number of labeled training samples. Motivated by the increasingly distributed nature of data and decision making, in this paper we consider a variation of this classical problem in which the prediction is performed remotely based on a rate-constrained description $M$ of $X$. Upon receiving $M$, the remote node computes an estimate $\hat Y$ of $Y$. We follow the recent minimax approach to study this learning problem and show that it corresponds to a one-shot minimax noisy source coding problem. We then establish information theoretic bounds on the risk-rate Lagrangian cost and a general method to design a near-optimal descriptor-estimator pair, which can be viewed as a rate-constrained analog to the maximum conditional entropy principle used in the classical minimax learning problem. Our results show that a naive estimate-compress scheme for rate-constrained prediction is not in general optimal.

preprint, 2016, arXiv

Capacity of the Energy Harvesting Channel with a Finite Battery

We consider an energy harvesting channel, in which the transmitter is powered by an exogenous stochastic energy harvesting process $E_t$, such that $0\leq E_t\leq\bar{E}$, which can be stored in a battery of finite size $\bar{B}$. We provide a simple and insightful formula for the approximate capacity of this channel with bounded guarantee on the approximation gap independent of system parameters. This approximate characterization of the capacity identifies two qualitatively different operating regimes for this channel: in the large battery regime, when $\bar{B}\geq \bar{E}$, the capacity is approximately equal to that of an AWGN channel with an average power constraint equal to the average energy harvesting rate, i.e. it depends only on the mean of $E_t$ and is (almost) independent of the distribution of $E_t$ and the exact value of $\bar{B}$. In particular, this suggests that a battery size $\bar{B}\approx\bar{E}$ is approximately sufficient to extract the infinite battery capacity of the system. In the small battery regime, when $\bar{B}<\bar{E}$, we clarify the dependence of the capacity on the distribution of $E_t$ and the value of $\bar{B}$. There are three steps to proving this result which can be of interest in their own right: 1) we characterize the capacity of this channel as an $n$-letter mutual information rate under various assumptions on the availability of energy arrival information; 2) we characterize the approximately optimal online power control policy that maximizes the long-term average throughput of the system; 3) we show that the information-theoretic capacity of this channel is equal, within a constant gap, to its long-term average throughput. This last result provides a connection between the information- and communication-theoretic formulations of the energy-harvesting communication problem that have been so far studied in isolation.
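For the large battery regime described above, the approximation amounts to plugging the mean energy arrival into the standard AWGN capacity formula. The snippet below is a rough sketch of that calculation only; it assumes a real-valued channel with unit noise variance, ignores the bounded approximation gap, and does not model the small battery regime. The function names and the example arrival values are hypothetical.

```python
import math


def awgn_capacity(avg_power, noise_var=1.0):
    """AWGN capacity in bits per real channel use: 0.5 * log2(1 + P / N)."""
    return 0.5 * math.log2(1.0 + avg_power / noise_var)


def large_battery_capacity_estimate(energy_arrivals, noise_var=1.0):
    """Approximate capacity when the battery is at least as large as the
    maximum arrival: only the mean of the arrivals enters the formula."""
    mean_energy = sum(energy_arrivals) / len(energy_arrivals)
    return awgn_capacity(mean_energy, noise_var)


# Hypothetical energy arrivals with mean 3.0 (assumed units of power per use).
arrivals = [0.0, 6.0, 3.0, 3.0]
estimate = large_battery_capacity_estimate(arrivals)
```

With these example values the mean arrival is 3.0, so the estimate is 0.5 * log2(4) = 1 bit per channel use, regardless of how the arrivals are distributed around that mean.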

preprint, 2016, arXiv

Capacity of the Energy Harvesting Gaussian MAC

We consider an energy harvesting multiple access channel (MAC) where the transmitters are powered by an exogenous stochastic energy harvesting process and equipped with finite batteries. We characterize the capacity region of this channel as an $n$-letter mutual information rate and develop inner and outer bounds that differ by a constant gap. An interesting conclusion that emerges from our results is that the sum-capacity approaches that of a standard AWGN MAC (with only an average constraint on the transmitted power), as the number of users in the MAC becomes large.

preprint, 2016, arXiv

Cut-Set Bound Is Loose for Gaussian Relay Networks

The cut-set bound developed by Cover and El Gamal in 1979 has since remained the best known upper bound on the capacity of the Gaussian relay channel. We develop a new upper bound on the capacity of the Gaussian primitive relay channel which is tighter than the cut-set bound. Our proof is based on typicality arguments and concentration of Gaussian measure. Combined with a simple tensorization argument proposed by Courtade and Ozgur in 2015, our result also implies that the current capacity approximations for Gaussian relay networks, which have linear gap to the cut-set bound in the number of nodes, are order-optimal and leads to a lower bound on the pre-constant.

preprint, 2016, arXiv

Improving on the Cut-Set Bound via Geometric Analysis of Typical Sets

We consider the discrete memoryless symmetric primitive relay channel, where a source $X$ wants to send information to a destination $Y$ with the help of a relay $Z$ and the relay can communicate to the destination via an error-free digital link of rate $R_0$, while $Y$ and $Z$ are conditionally independent and identically distributed given $X$. We develop two new upper bounds on the capacity of this channel that are tighter than existing bounds, including the celebrated cut-set bound. Our approach significantly deviates from the standard information-theoretic approach for proving upper bounds on the capacity of multi-user channels. We build on the blowing-up lemma to analyze the probabilistic geometric relations between the typical sets of the $n$-letter random variables associated with a reliable code for communicating over this channel. These relations translate to new entropy inequalities between the $n$-letter random variables involved. As an application of our bounds, we study an open question posed by Cover (1987), namely, what is the minimum $Z$-$Y$ link rate $R_0^*$ needed for the capacity of the relay channel to be equal to that of the broadcast cut. We consider the special case when the $X$-$Y$ and $X$-$Z$ links are both binary symmetric channels. Our tighter bounds on the capacity of the relay channel immediately translate to tighter lower bounds for $R_0^*$. More interestingly, we show that when $p\to 1/2$, $R_0^*\geq 0.1803$; even though the broadcast channel becomes completely noisy as $p\to 1/2$ and its capacity, and therefore the capacity of the relay channel, goes to zero, a strictly positive rate $R_0$ is required for the relay channel capacity to be equal to the broadcast bound.

preprint, 2015, arXiv

Can Feedback Increase the Capacity of the Energy Harvesting Channel?

We investigate if feedback can increase the capacity of an energy harvesting communication channel where a transmitter powered by an exogenous energy arrival process and equipped with a finite battery communicates to a receiver over a memoryless channel. For a simple special case where the energy arrival process is deterministic and the channel is a BEC, we explicitly compute the feed-forward and feedback capacities and show that feedback can strictly increase the capacity of this channel. Building on this example, we also show that feedback can increase the capacity when the energy arrivals are i.i.d. known noncausally at the transmitter and the receiver.

preprint, 2015, arXiv

Capacity of the AWGN Channel with Random Battery Recharges

We consider communication over the AWGN channel with a transmitter whose battery is recharged with RF energy transfer at random times known to the receiver. We assume that the recharging process is i.i.d. Bernoulli. We characterize the capacity of this channel as the limit of an $n$-letter maximum mutual information rate under both causal and noncausal transmitter knowledge of the battery recharges. With noncausal knowledge, it is possible to explicitly identify the maximizing input distribution, which we use to demonstrate that the capacity with noncausal knowledge of the battery recharges is strictly larger than that with causal knowledge. We then proceed to derive explicit upper and lower bounds on the capacity, which are within 1.05 bits/s/Hz of each other for all parameter values.

preprint, 2015, arXiv

Cut-Set Bound Is Loose for Gaussian Relay Networks

preprint, 2015, arXiv

STAC: Simultaneous Transmitting and Air Computing in Wireless Data Center Networks

The data center network (DCN), wired or wireless, features large amounts of Many-to-One (M2O) sessions. Each M2O session is currently operated based on Point-to-Point (P2P) communications and Store-and-Forward (SAF) relays, and is generally followed by certain further computation at the destination. In contrast to this separate P2P/SAF-based transmission and computation strategy, this paper proposes STAC, a novel physical layer scheme that achieves Simultaneous Transmission and Air Computation in wireless DCNs. In particular, STAC takes advantage of the superposition nature of electromagnetic (EM) waves, and allows multiple transmitters to transmit in the same time slot with appropriately chosen parameters, such that the received superimposed signal can be directly transformed to the needed summation at the receiver. Exploiting the static channel environment and compact space in DCN, we propose an enhanced Software Defined Network (SDN) architecture to enable STAC, where wired connections are established to provide the wireless transceivers with external reference signals. Theoretical analysis and simulation show that with STAC, both the bandwidth and energy efficiencies can be improved severalfold.

preprint, 2014, arXiv

Achieving Full DoF in Heterogeneous Parallel Broadcast Channels with Outdated CSIT

We consider communication over heterogeneous parallel channels, where a transmitter is connected to two users via two parallel channels: a MIMO broadcast channel (BC) and a noiseless rate-limited multicast channel. We characterize the optimal degrees of freedom (DoF) region of this setting when the transmitter has delayed channel state information (CSIT) regarding the MIMO BC. Our results show that jointly coding over the two channels strictly outperforms simple channel aggregation and can even achieve the instantaneous CSIT performance with completely outdated CSIT on the MIMO BC in the sum DoF sense; this happens when the multicast rate of the second channel is larger than a certain threshold. The main idea is to send information over the MIMO BC at a rate above its capacity and then use the second channel to send additional side information to allow for reliable decoding at both receivers. We call this scheme a two-phase overload-multicast strategy. We show that such a strategy is also sum DoF optimal for the K-user MIMO BC with a parallel multicast channel when the rate of the multicast channel is high enough and can again achieve the instantaneous CSIT performance (optimal sum DoF) with completely outdated CSIT. For the regime where the capacity of the multicast channel is small, we propose another joint coding strategy which is sum DoF optimal.

preprint, 2014, arXiv

Feedback through Overhearing

In this paper we examine the value of feedback that comes from overhearing, without dedicated feedback resources. We focus on a simple model for this purpose: a deterministic two-hop interference channel, where feedback comes from overhearing the forward-links. A new aspect brought by this setup is the dual-role of the relay signal. While the relay signal needs to convey the source message to its corresponding destination, it can also provide a feedback signal which can potentially increase the capacity of the first hop. We derive inner and outer bounds on the sum capacity which match for a large range of the parameter values. Our results identify the parameter ranges where overhearing can provide non-negative capacity gain and can even achieve the performance with dedicated-feedback resources. The results also provide insights into which transmissions are most useful to overhear.

preprint, 2012, arXiv

Approximately achieving Gaussian relay network capacity with lattice codes

Recently, it has been shown that a quantize-map-and-forward scheme approximately achieves (within a constant number of bits) the Gaussian relay network capacity for arbitrary topologies. This was established using Gaussian codebooks for transmission and random mappings at the relays. In this paper, we show that the same approximation result can be established by using lattices for transmission and quantization along with structured mappings at the relays.

preprint, 2012, arXiv

Wireless Network Simplification: the Gaussian N-Relay Diamond Network

We consider the Gaussian N-relay diamond network, where a source wants to communicate to a destination node through a layer of N relay nodes. We investigate the following question: what fraction of the capacity can we maintain by using only k out of the N available relays? We show that independent of the channel configurations and the operating SNR, we can always find a subset of k relays which alone provide a rate kC/(k+1) - G, where C is the information theoretic cutset upper bound on the capacity of the whole network and G is a constant that depends only on N and k (logarithmic in N and linear in k). In particular, for k = 1, this means that half of the capacity of any N-relay diamond network can be approximately achieved by routing information over a single relay. We also show that this fraction is tight: there are configurations of the N-relay diamond network where every subset of k relays alone can at most provide approximately a fraction k/(k+1) of the total capacity. These high-capacity k-relay subnetworks can also be discovered efficiently. We propose an algorithm that computes a constant gap approximation to the capacity of the Gaussian N-relay diamond network in O(N log N) running time and discovers a high-capacity k-relay subnetwork in O(kN) running time. This result also provides a new approximation to the capacity of the Gaussian N-relay diamond network which is hybrid in nature: it has both multiplicative and additive gaps. In the intermediate SNR regime, this hybrid approximation is tighter than existing purely additive or purely multiplicative approximations to the capacity of this network.

preprint, 2010, arXiv

Linear Capacity Scaling in Wireless Networks: Beyond Physical Limits?

We investigate the role of cooperation in wireless networks subject to a spatial degrees of freedom limitation. To address the worst case scenario, we consider a free-space line-of-sight type environment with no scattering and no fading. We identify three qualitatively different operating regimes that are determined by how the area of the network $A$, normalized with respect to the wavelength $\lambda$, compares to the number of users $n$. In networks with $\sqrt{A}/\lambda < \sqrt{n}$, the limitation in spatial degrees of freedom does not allow a capacity scaling better than $\sqrt{n}$, and this performance can be readily achieved by multi-hopping. This result has been recently shown by Franceschetti et al. However, for networks with $\sqrt{A}/\lambda > \sqrt{n}$, the number of available degrees of freedom is $\min(n, \sqrt{A}/\lambda)$, larger than what can be achieved by multi-hopping. We show that the optimal capacity scaling in this regime is achieved by hierarchical cooperation. In particular, in networks with $\sqrt{A}/\lambda > n$, hierarchical cooperation can achieve linear scaling.

preprint, 2009, arXiv

Throughput-Delay Trade-off for Hierarchical Cooperation in Ad Hoc Wireless Networks

Hierarchical cooperation has recently been shown to achieve better throughput scaling than classical multihop schemes under certain assumptions on the channel model in static wireless networks. However, the end-to-end delay of this scheme turns out to be significantly larger than those of multihop schemes. A modification of the scheme is proposed here that achieves a throughput-delay trade-off $D(n)=(\log n)^2 T(n)$ for $T(n)$ between $\Theta(\sqrt{n}/\log n)$ and $\Theta(n/\log n)$, where $D(n)$ and $T(n)$ are respectively the average delay per bit and the aggregate throughput in a network of $n$ nodes. This trade-off complements the previous results of El Gamal et al., which show that the throughput-delay trade-off for multihop schemes is given by $D(n)=T(n)$ where $T(n)$ lies between $\Theta(1)$ and $\Theta(\sqrt{n})$. Meanwhile, the present paper considers the network multiple-access problem, which may be of interest in its own right.
