Source author record

Alexander Wei

Alexander Wei appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning cond-mat.mtrl-sci Data Structures and Algorithms physics.app-ph physics.optics

Catalog footprint

What is connected

5works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

More Than a Toy: Random Matrix Models Predict How Real-World Neural Representations Generalize

Of theories for why large-scale machine learning models generalize despite being vastly overparameterized, which of their assumptions are needed to capture the qualitative phenomena of generalization in the real world? On one hand, we find that most theoretical analyses fall short of capturing these qualitative phenomena even for kernel regression, when applied to kernels derived from large-scale neural networks (e.g., ResNet-50) and real data (e.g., CIFAR-100). On the other hand, we find that the classical GCV estimator (Craven and Wahba, 1978) accurately predicts generalization risk even in such overparameterized settings. To bolster this empirical finding, we prove that the GCV estimator converges to the generalization risk whenever a local random matrix law holds. Finally, we apply this random matrix theory lens to explain why pretrained representations generalize better as well as what factors govern scaling laws for kernel regression. Our findings suggest that random matrix theory, rather than just being a toy model, may be central to understanding the properties of neural representations in practice.

preprint2022arXiv

Predicting Out-of-Distribution Error with the Projection Norm

We propose a metric -- Projection Norm -- to predict a model's performance on out-of-distribution (OOD) data without access to ground truth labels. Projection Norm first uses model predictions to pseudo-label test samples and then trains a new model on the pseudo-labels. The more the new model's parameters differ from an in-distribution model, the greater the predicted OOD error. Empirically, our approach outperforms existing methods on both image and text classification tasks and across different network architectures. Theoretically, we connect our approach to a bound on the test error for overparameterized linear models. Furthermore, we find that Projection Norm is the only approach that achieves non-trivial detection performance on adversarial examples. Our code is available at https://github.com/yaodongyu/ProjNorm.

preprint2020arXiv

Better and Simpler Learning-Augmented Online Caching

Lykouris and Vassilvitskii (ICML 2018) introduce a model of online caching with machine-learned advice, where each page request additionally comes with a prediction of when that page will next be requested. In this model, a natural goal is to design algorithms that (1) perform well when the advice is accurate and (2) remain robust in the worst case a la traditional competitive analysis. Lykouris and Vassilvitskii give such an algorithm by adapting the Marker algorithm to the learning-augmented setting. In a recent work, Rohatgi (SODA 2020) improves on their result with an approach also inspired by randomized marking. We continue the study of this problem, but with a somewhat different approach: We consider combining the BlindOracle algorithm, which just naïvely follows the predictions, with an optimal competitive algorithm for online caching in a black-box manner. The resulting algorithm outperforms all existing approaches while being significantly simpler. Moreover, we show that combining BlindOracle with LRU is in fact optimal among deterministic algorithms for this problem.

preprint2020arXiv

Cost-Effective Methods to Nanopattern Thermally Stable Platforms on Kapton HN Flexible Films Using Inkjet Printing Technology to Produce Printable Nitrate Sensors, Mercury Aptasensors, Protein Sensors, and Organic Thin Film Transistors

Kapton HN films, adopted worldwide due to their superior thermal durability (up to 400 °C), allow the high temperature sintering of nanoparticle based metal inks. By carefully selecting inks and Kapton substrates, outstanding thermal stability and anti-delaminating features are obtained in both aqueous and organic solutions and were applied to four novel devices: a solid state ion selective nitrate sensor, an ssDNA based mercury aptasensor, a low cost protein sensor, and a long lasting organic thin film transistor (OTFT). Many experimental studies on parameter combinations were conducted during the development of the above devices. The results showed that the ion selective nitrate sensor displayed a linear sensitivity range with a limit of detection of 2 ppm. The mercury sensor exhibited a linear correlation between the RCT values and the increasing concentrations of mercury. The protein printed circuit board (PCB) sensor provided a much simpler method of protein detection. Finally, the OTFT demonstrated a stable performance with mobility values for the linear and saturation regimes, and the threshold voltage. These devices have shown their value and reveal possibilities that could be pursued.

preprint2016arXiv

Lasing Action with Gold Nanorod Hyperbolic Metamaterials

Coherent nanoscale photon sources are of paramount importance to achieving all-optical communication. Several nanolasers smaller than the diffraction limit have been theoretically proposed and experimentally demonstrated using plasmonic cavities to confine optical fields. Such compact cavities exhibit large Purcell factors, thereby enhancing spontaneous emission, which feeds into the lasing mode. However, most plasmonic nanolasers reported so far have employed resonant nanostructures and therefore had the lasing restricted to the proximity of the resonance wavelength. Here, we report on an approach based on gold nanorod hyperbolic metamaterials for lasing. Hyperbolic metamaterials provide broadband Purcell enhancement due to large photonic density of optical states, while also supporting surface plasmon modes to deliver optical feedback for lasing due to nonlocal effects in nanorod media. We experimentally demonstrate the advantage of hyperbolic metamaterials in achieving lasing action by its comparison with that obtained in a metamaterial with elliptic dispersion. The conclusions from the experimental results are supported with numerical simulations comparing the Purcell factors and surface plasmon modes for the metamaterials with different dispersions. We show that although the metamaterials of both types support lasing, emission with hyperbolic samples is about twice as strong with 35% lower threshold vs. the elliptic ones. Hence, hyperbolic metamaterials can serve as a convenient platform of choice for nanoscale coherent photon sources in a broad wavelength range.