Source author record

Upamanyu Madhow

Upamanyu Madhow appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Information Theory Machine Learning math.IT eess.SP Networking and Internet Architecture eess.SY Multiagent Systems Neural and Evolutionary Computing Systems and Control

Catalog footprint

What is connected

16works

9topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Neuro-Inspired Deep Neural Networks with Sparse, Strong Activations

While end-to-end training of Deep Neural Networks (DNNs) yields state of the art performance in an increasing array of applications, it does not provide insight into, or control over, the features being extracted. We report here on a promising neuro-inspired approach to DNNs with sparser and stronger activations. We use standard stochastic gradient training, supplementing the end-to-end discriminative cost function with layer-wise costs promoting Hebbian ("fire together," "wire together") updates for highly active neurons, and anti-Hebbian updates for the remaining neurons. Instead of batch norm, we use divisive normalization of activations (suppressing weak outputs using strong outputs), along with implicit $\ell_2$ normalization of neuronal weights. Experiments with standard image classification tasks on CIFAR-10 demonstrate that, relative to baseline end-to-end trained architectures, our proposed architecture (a) leads to sparser activations (with only a slight compromise on accuracy), (b) exhibits more robustness to noise (without being trained on noisy data), (c) exhibits more robustness to adversarial perturbations (without adversarial training).

preprint2022arXiv

Self-supervised Speaker Recognition Training Using Human-Machine Dialogues

Speaker recognition, recognizing speaker identities based on voice alone, enables important downstream applications, such as personalization and authentication. Learning speaker representations, in the context of supervised learning, heavily depends on both clean and sufficient labeled data, which is always difficult to acquire. Noisy unlabeled data, on the other hand, also provides valuable information that can be exploited using self-supervised training methods. In this work, we investigate how to pretrain speaker recognition models by leveraging dialogues between customers and smart-speaker devices. However, the supervisory information in such dialogues is inherently noisy, as multiple speakers may speak to a device in the course of the same dialogue. To address this issue, we propose an effective rejection mechanism that selectively learns from dialogues based on their acoustic homogeneity. Both reconstruction-based and contrastive-learning-based self-supervised methods are compared. Experiments demonstrate that the proposed method provides significant performance improvements, superior to earlier work. Dialogue pretraining when combined with the rejection mechanism yields 27.10% equal error rate (EER) reduction in speaker recognition, compared to a model without self-supervised pretraining.

preprint2021arXiv

Generalized Likelihood Ratio Test for Adversarially Robust Hypothesis Testing

Machine learning models are known to be susceptible to adversarial attacks which can cause misclassification by introducing small but well designed perturbations. In this paper, we consider a classical hypothesis testing problem in order to develop fundamental insight into defending against such adversarial perturbations. We interpret an adversarial perturbation as a nuisance parameter, and propose a defense based on applying the generalized likelihood ratio test (GLRT) to the resulting composite hypothesis testing problem, jointly estimating the class of interest and the adversarial perturbation. While the GLRT approach is applicable to general multi-class hypothesis testing, we first evaluate it for binary hypothesis testing in white Gaussian noise under $\ell_{\infty}$ norm-bounded adversarial perturbations, for which a known minimax defense optimizing for the worst-case attack provides a benchmark. We derive the worst-case attack for the GLRT defense, and show that its asymptotic performance (as the dimension of the data increases) approaches that of the minimax defense. For non-asymptotic regimes, we show via simulations that the GLRT defense is competitive with the minimax approach under the worst-case attack, while yielding a better robustness-accuracy tradeoff under weaker attacks. We also illustrate the GLRT approach for a multi-class hypothesis testing problem, for which a minimax strategy is not known, evaluating its performance under both noise-agnostic and noise-aware adversarial settings, by providing a method to find optimal noise-aware attacks, and heuristics to find noise-agnostic attacks that are close to optimal in the high SNR regime.

preprint2021arXiv

Wireless Fingerprinting via Deep Learning: The Impact of Confounding Factors

Can we distinguish between two wireless transmitters sending exactly the same message, using the same protocol? The opportunity for doing so arises due to subtle nonlinear variations across transmitters, even those made by the same manufacturer. Since these effects are difficult to model explicitly, we investigate learning device fingerprints using complex-valued deep neural networks (DNNs) that take as input the complex baseband signal at the receiver. We ask whether such fingerprints can be made robust to distribution shifts across time and locations due to clock drift and variations in the wireless channel. In this paper, we point out that, unless proactively discouraged from doing so, DNNs learn these strong confounding features rather than the nonlinear device-specific characteristics that we seek to learn. We propose and evaluate strategies, based on augmentation and estimation, to promote generalization across realizations of these confounding factors, using data from WiFi and ADS-B protocols. We conclude that, while DNN training has the advantage of not requiring explicit signal models, significant modeling insights are required to focus the learning on the effects we wish to capture.

preprint2020arXiv

A design framework for all-digital mmWave massive MIMO with per-antenna nonlinearities

Millimeter wave MIMO combines the benefits of compact antenna arrays with a large number of elements and massive bandwidths, so that fully digital beamforming has the potential of supporting a large number of simultaneous users with {\it per user} data rates of multiple gigabits/sec (Gbps). In this paper, we develop an analytical model for the impact of nonlinearities in such a system, and illustrate its utility in providing hardware design guidelines regarding two key challenges: the low available precision of analog-to-digital conversion at high sampling rates, and nonlinearities in ultra-high speed radio frequency (RF) and baseband circuits. We consider linear minimum mean square error (LMMSE) reception for a multiuser MIMO uplink, and provide performance guarantees based on two key concepts: (a) summarization of the impact of per-antenna nonlinearities via a quantity that we term the "intrinsic SNR", (b) using linear MMSE performance in an ideal system without nonlinearities to bound that in our non-ideal system. For our numerical results, we employ nominal parameters corresponding to outdoor picocells operating at a carrier frequency of 140 GHz, with a data rate of 10 Gbps per user.

preprint2020arXiv

Polarizing Front Ends for Robust CNNs

The vulnerability of deep neural networks to small, adversarially designed perturbations can be attributed to their "excessive linearity." In this paper, we propose a bottom-up strategy for attenuating adversarial perturbations using a nonlinear front end which polarizes and quantizes the data. We observe that ideal polarization can be utilized to completely eliminate perturbations, develop algorithms to learn approximately polarizing bases for data, and investigate the effectiveness of the proposed strategy on the MNIST and Fashion MNIST datasets.

preprint2016arXiv

Learning Sparse, Distributed Representations using the Hebbian Principle

The "fire together, wire together" Hebbian model is a central principle for learning in neuroscience, but surprisingly, it has found limited applicability in modern machine learning. In this paper, we take a first step towards bridging this gap, by developing flavors of competitive Hebbian learning which produce sparse, distributed neural codes using online adaptation with minimal tuning. We propose an unsupervised algorithm, termed Adaptive Hebbian Learning (AHL). We illustrate the distributed nature of the learned representations via output entropy computations for synthetic data, and demonstrate superior performance, compared to standard alternatives such as autoencoders, in training a deep convolutional net on standard image datasets.

preprint2016arXiv

Newtonized Orthogonal Matching Pursuit: Frequency Estimation over the Continuum

We propose a fast sequential algorithm for the fundamental problem of estimating frequencies and amplitudes of a noisy mixture of sinusoids. The algorithm is a natural generalization of Orthogonal Matching Pursuit (OMP) to the continuum using Newton refinements, and hence is termed Newtonized OMP (NOMP). Each iteration consists of two phases: detection of a new sinusoid, and sequential Newton refinements of the parameters of already detected sinusoids. The refinements play a critical role in two ways: (1) sidestepping the potential basis mismatch from discretizing a continuous parameter space, (2) providing feedback for locally refining parameters estimated in previous iterations. We characterize convergence, and provide a Constant False Alarm Rate (CFAR) based termination criterion. By benchmarking against the Cramer Rao Bound, we show that NOMP achieves near-optimal performance under a variety of conditions. We compare the performance of NOMP with classical algorithms such as MUSIC and more recent Atomic norm Soft Thresholding (AST) and Lasso algorithms, both in terms of frequency estimation accuracy and run time.

preprint2015arXiv

A scalable architecture for distributed receive beamforming: analysis and experimental demonstration

We propose, analyze and demonstrate an architecture for scalable cooperative reception. In a cluster of N + 1 receive nodes, one node is designated as the final receiver, and the N other nodes act as amplify-and-forward relays which adapt their phases such that the relayed signals add up constructively at the designated receiver. This yields received SNR scaling linearly with N, while avoiding the linear increase in overhead incurred by a direct approach in which received signals are separately quantized and transmitted for centralized processing. By transforming the task of long-distance distributed receive beamforming into one of local distributed transmit beamforming, we can leverage a scalable one-bit feedback algorithm for phase synchronization. We show that time division between the long-distance and local links eliminates the need for explicit frequency synchronization. We provide an analytical framework, whose results closely match Monte Carlo simulations, to evaluate the impact of phase noise due to relaying delay on the performance of the one-bit feedback algorithm. Experimental results from our prototype implementation on software-defined radios demonstrate the expected gains in received signal strength despite significant oscillator drift, and are consistent with results from our analytical framework.

preprint2015arXiv

Compressive channel estimation and tracking for large arrays in mm wave picocells

We propose and investigate a compressive architecture for estimation and tracking of sparse spatial channels in millimeter (mm) wave picocellular networks. The base stations are equipped with antenna arrays with a large number of elements (which can fit within compact form factors because of the small carrier wavelength) and employ radio frequency (RF) beamforming, so that standard least squares adaptation techniques (which require access to individual antenna elements) are not applicable. We focus on the downlink, and show that "compressive beacons," transmitted using pseudorandom phase settings at the base station array, and compressively processed using pseudorandom phase settings at the mobile array, provide information sufficient for accurate estimation of the two-dimensional (2D) spatial frequencies associated with the directions of departure of the dominant rays from the base station, and the associated complex gains. This compressive approach is compatible with coarse phase-only control, and is based on a near-optimal sequential algorithm for frequency estimation which can exploit the geometric continuity of the channel across successive beaconing intervals to reduce the overhead to less than 1% even for very large (32 x 32) arrays. Compressive beaconing is essentially omnidirectional, and hence does not enjoy the SNR and spatial reuse benefits of beamforming obtained during data transmission. We therefore discuss system level design considerations for ensuring that the beacon SNR is sufficient for accurate channel estimation, and that inter-cell beacon interference is controlled by an appropriate reuse scheme.

preprint2015arXiv

Compressive spectral embedding: sidestepping the SVD

Spectral embedding based on the Singular Value Decomposition (SVD) is a widely used "preprocessing" step in many learning tasks, typically leading to dimensionality reduction by projecting onto a number of dominant singular vectors and rescaling the coordinate axes (by a predefined function of the singular value). However, the number of such vectors required to capture problem structure grows with problem size, and even partial SVD computation becomes a bottleneck. In this paper, we propose a low-complexity it compressive spectral embedding algorithm, which employs random projections and finite order polynomial expansions to compute approximations to SVD-based embedding. For an m times n matrix with T non-zeros, its time complexity is O((T+m+n)log(m+n)), and the embedding dimension is O(log(m+n)), both of which are independent of the number of singular vectors whose effect we wish to capture. To the best of our knowledge, this is the first work to circumvent this dependence on the number of singular vectors for general SVD-based embeddings. The key to sidestepping the SVD is the observation that, for downstream inference tasks such as clustering and classification, we are only interested in using the resulting embedding to evaluate pairwise similarity metrics derived from the euclidean norm, rather than capturing the effect of the underlying matrix on arbitrary vectors as a partial SVD tries to do. Our numerical results on network datasets demonstrate the efficacy of the proposed method, and motivate further exploration of its application to large-scale inference tasks.

preprint2014arXiv

Compressive parameter estimation in AWGN

Compressed sensing is by now well-established as an effective tool for extracting sparsely distributed information, where sparsity is a discrete concept, referring to the number of dominant nonzero signal components in some basis for the signal space. In this paper, we establish a framework for estimation of continuous-valued parameters based on compressive measurements on a signal corrupted by additive white Gaussian noise (AWGN). While standard compressed sensing based on naive discretization has been shown to suffer from performance loss due to basis mismatch, we demonstrate that this is not an inherent property of compressive measurements. Our contributions are summarized as follows: (a) We identify the isometries required to preserve fundamental estimation-theoretic quantities such as the Ziv-Zakai bound (ZZB) and the Cramer-Rao bound (CRB). Under such isometries, compressive projections can be interpreted simply as a reduction in "effective SNR." (b) We show that the threshold behavior of the ZZB provides a criterion for determining the minimum number of measurements for "accurate" parameter estimation. (c) We provide detailed computations of the number of measurements needed for the isometries in (a) to hold for the problem of frequency estimation in a mixture of sinusoids. We show via simulations that the design criterion in (b) is accurate for estimating the frequency of a single sinusoid.

preprint2014arXiv

Scalable and Efficient Geographic Routing in Mobile Ad Hoc Wireless Networks

We propose and evaluate a scalable position-publish and an accompanying routing protocol which is efficient despite operating with imperfect information regarding the destination's location. The traffic generated by our position-publish protocol fits within the transport capacity of large mobile ad hoc networks (MANETs) with constant communication bandwidth allocated for routing overhead, even as the network size increases. The routing protocol guarantees, with high probability, routes whose lengths are within a constant "stretch" factor of the shortest path from source to destination. The key idea underlying the scalability of the publish protocol is for each potential destination node to send location updates (with frequency decaying with distance) to a subset of network nodes, structured as annular regions around it (the natural approach of updating circular regions in distance-dependent fashion does not scale). The routing protocol must therefore account for the fact that the source and/or relay nodes may not have estimates of the destination's location (or may have stale estimates). Spatial and temporal scaling of protocol parameters are chosen so as to guarantee scalability, route reliability and route stretch, and these analytical design prescriptions are verified using simulations.

preprint2013arXiv

Noncoherent Trellis Coded Quantization: A Practical Limited Feedback Technique for Massive MIMO Systems

Accurate channel state information (CSI) is essential for attaining beamforming gains in single-user (SU) multiple-input multiple-output (MIMO) and multiplexing gains in multi-user (MU) MIMO wireless communication systems. State-of-the-art limited feedback schemes, which rely on pre-defined codebooks for channel quantization, are only appropriate for a small number of transmit antennas and low feedback overhead. In order to scale informed transmitter schemes to emerging massive MIMO systems with a large number of transmit antennas at the base station, one common approach is to employ time division duplexing (TDD) and to exploit the implicit feedback obtained from channel reciprocity. However, most existing cellular deployments are based on frequency division duplexing (FDD), hence it is of great interest to explore backwards compatible massive MIMO upgrades of such systems. For a fixed feedback rate per antenna, the number of codewords for quantizing the channel grows exponentially with the number of antennas, hence generating feedback based on look-up from a standard vector quantized codebook does not scale. In this paper, we propose noncoherent trellis-coded quantization (NTCQ), whose encoding complexity scales linearly with the number of antennas. The approach exploits the duality between source encoding in a Grassmannian manifold and noncoherent sequence detection. Furthermore, since noncoherent detection can be realized near-optimally using a bank of coherent detectors, we obtain a low-complexity implementation of NTCQ encoding using an off-the-shelf Viterbi algorithm applied to standard trellis coded quantization. We also develop advanced NTCQ schemes which utilize various channel properties such as temporal/spatial correlations. Simulation results show the proposed NTCQ and its extensions can achieve near-optimal performance with moderate complexity and feedback overhead.

preprint2012arXiv

Cooperative localization using angle of arrival measurements: sequential algorithms and non-line-of-sight suppression

We investigate localization of a source based on angle of arrival (AoA) measurements made at a geographically dispersed network of cooperating receivers. The goal is to efficiently compute accurate estimates despite outliers in the AoA measurements due to multipath reflections in non-line-of-sight (NLOS) environments. Maximal likelihood (ML) location estimation in such a setting requires exhaustive testing of estimates from all possible subsets of "good" measurements, which has exponential complexity in the number of measurements. We provide a randomized algorithm that approaches ML performance with linear complexity in the number of measurements. The building block for this algorithm is a low-complexity sequential algorithm for updating the source location estimates under line-of-sight (LOS) environments. Our Bayesian framework can exploit the ability to resolve multiple paths in wideband systems to provide significant performance gains over narrowband systems in NLOS environments, and easily extends to accommodate additional information such as range measurements and prior information about location.

preprint2011arXiv

Phase-Quantized Block Noncoherent Communication

Analog-to-digital conversion (ADC) is a key bottleneck in scaling DSP-centric receiver architectures to multiGigabit/s speeds. Recent information-theoretic results, obtained under ideal channel conditions (perfect synchronization, no dispersion), indicate that low-precision ADC (1-4 bits) could be a suitable choice for designing such high speed systems. In this work, we study the impact of employing low-precision ADC in a {\it carrier asynchronous} system. Specifically, we consider transmission over the block noncoherent Additive White Gaussian Noise (AWGN) channel, and investigate the achievable performance under low-precision output quantization. We focus attention on an architecture in which the receiver quantizes {\it only the phase} of the received signal: this has the advantage of being implementable without automatic gain control, using multiple 1-bit ADCs preceded by analog multipliers. For standard uniform Phase Shift Keying (PSK) modulation, we study the structure of the transition density of the resulting phase-quantized block noncoherent channel. Several results, based on the symmetry inherent in the channel model, are provided to characterize this transition density. Low-complexity procedures for computing the channel capacity, and for block demodulation, are obtained using these results. Numerical computations are performed to compare the performance of quantized and unquantized systems, for different quantization precisions, and different block lengths. It is observed, for example, that with QPSK modulation, 8-bin phase quantization of the received signal recovers about 80-85% of the capacity attained with unquantized observations, while 12-bin phase quantization recovers more than 90% of the unquantized capacity. Dithering the constellation is shown to improve the performance in the face of drastic quantization.

Upamanyu Madhow

What is connected

Connect this record

See the researcher in context

Building this map preview

16 published item(s)

Neuro-Inspired Deep Neural Networks with Sparse, Strong Activations

Self-supervised Speaker Recognition Training Using Human-Machine Dialogues

Generalized Likelihood Ratio Test for Adversarially Robust Hypothesis Testing

Wireless Fingerprinting via Deep Learning: The Impact of Confounding Factors

A design framework for all-digital mmWave massive MIMO with per-antenna nonlinearities

Polarizing Front Ends for Robust CNNs

Learning Sparse, Distributed Representations using the Hebbian Principle

Newtonized Orthogonal Matching Pursuit: Frequency Estimation over the Continuum

A scalable architecture for distributed receive beamforming: analysis and experimental demonstration

Compressive channel estimation and tracking for large arrays in mm wave picocells

Compressive spectral embedding: sidestepping the SVD

Compressive parameter estimation in AWGN

Scalable and Efficient Geographic Routing in Mobile Ad Hoc Wireless Networks

Noncoherent Trellis Coded Quantization: A Practical Limited Feedback Technique for Massive MIMO Systems

Cooperative localization using angle of arrival measurements: sequential algorithms and non-line-of-sight suppression

Phase-Quantized Block Noncoherent Communication