Researcher profile

Liam Collins

Liam Collins contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2026arXiv

Exploiting ID-Text Complementarity via Ensembling for Sequential Recommendation

Modern Sequential Recommendation (SR) models commonly utilize modality features to represent items, motivated in large part by recent advancements in language and vision modeling. To do so, several works completely replace ID embeddings with modality embeddings, claiming that modality embeddings render ID embeddings unnecessary because they can match or even exceed ID embedding performance. On the other hand, many works jointly utilize ID and modality features, but posit that complex fusion strategies, such as multi-stage training and/or intricate alignment architectures, are necessary for this joint utilization. However, underlying both these lines of work is a lack of understanding of the complementarity of ID and modality features. In this work, we address this gap by studying the complementarity of ID- and text-based SR models. We show that these models do learn complementary signals, meaning that either should provide performance gain when used properly alongside the other. Motivated by this, we propose a new SR method that preserves ID-text complementarity through independent model training, then harnesses it through a simple ensembling strategy. Despite this method's simplicity, we show it outperforms several competitive SR baselines, implying that both ID and text features are necessary to achieve state-of-the-art SR performance but complex fusion architectures are not.

preprint2022arXiv

FedAvg with Fine Tuning: Local Updates Lead to Representation Learning

The Federated Averaging (FedAvg) algorithm, which consists of alternating between a few local stochastic gradient updates at client nodes, followed by a model averaging update at the server, is perhaps the most commonly used method in Federated Learning. Notwithstanding its simplicity, several empirical studies have illustrated that the output model of FedAvg, after a few fine-tuning steps, leads to a model that generalizes well to new unseen tasks. This surprising performance of such a simple method, however, is not fully understood from a theoretical point of view. In this paper, we formally investigate this phenomenon in the multi-task linear representation setting. We show that the reason behind generalizability of the FedAvg's output is its power in learning the common data representation among the clients' tasks, by leveraging the diversity among client data distributions via local updates. We formally establish the iteration complexity required by the clients for proving such result in the setting where the underlying shared representation is a linear map. To the best of our knowledge, this is the first such result for any setting. We also provide empirical evidence demonstrating FedAvg's representation learning ability in federated image classification with heterogeneous data.

preprint2022arXiv

How Does the Task Landscape Affect MAML Performance?

Model-Agnostic Meta-Learning (MAML) has become increasingly popular for training models that can quickly adapt to new tasks via one or few stochastic gradient descent steps. However, the MAML objective is significantly more difficult to optimize compared to standard non-adaptive learning (NAL), and little is understood about how much MAML improves over NAL in terms of the fast adaptability of their solutions in various scenarios. We analytically address this issue in a linear regression setting consisting of a mixture of easy and hard tasks, where hardness is related to the rate that gradient descent converges on the task. Specifically, we prove that in order for MAML to achieve substantial gain over NAL, (i) there must be some discrepancy in hardness among the tasks, and (ii) the optimal solutions of the hard tasks must be closely packed with the center far from the center of the easy tasks optimal solutions. We also give numerical and analytical results suggesting that these insights apply to two-layer neural networks. Finally, we provide few-shot image classification experiments that support our insights for when MAML should be used and emphasize the importance of training MAML on hard tasks in practice.

preprint2021arXiv

Super-R BiFeO$_3$: Epitaxial stabilization of a low-symmetry phase with giant electromechanical response

Piezoelectrics interconvert mechanical energy and electric charge and are widely used in actuators and sensors. The best performing materials are ferroelectrics at a morphotropic phase boundary (MPB), where several phases can intimately coexist. Switching between these phases by electric field produces a large electromechanical response. In the ferroelectric BiFeO$_3$, strain can be used to create an MPB-like phase mixture and thus to generate large electric field dependent strains. However, this enhanced response occurs at localized, randomly positioned regions of the film, which potentially complicates nanodevice design. Here, we use epitaxial strain and orientation engineering in tandem - anisotropic epitaxy - to craft a hitherto unavailable low-symmetry phase of BiFeO$_3$ which acts as a structural bridge between the rhombohedral-like and tetragonal-like polymorphs. Interferometric displacement sensor measurements and first-principle calculations reveal that under external electric bias, this phase undergoes a transition to the tetragonal-like polymorph, generating a piezoelectric response enhanced by over 200%, and associated giant field-induced reversible strain. These results offer a new route to engineer giant electromechanical properties in thin films, with broader perspectives for other functional oxide systems.

preprint2020arXiv

Exploring Nanoscale Ferroelectricity in Doped Hafnium Oxide by Interferometric Piezoresponse Force Microscopy

Hafnium oxide (HfO2)-based ferroelectrics offer remarkable promise for memory and logic devices in view of their compatibility with traditional silicon CMOS technology, high switchable polarization, good endurance and thickness scalability. These factors have led to steep rise in research on this class of materials over the past number of years. At the same time, only a few reports on the direct sensing of nanoscale ferroelectric properties exist, with many questions remaining regarding the emergence of ferroelectricity in these materials. While piezoresponse force microscopy (PFM) is ideally suited to probe piezo- and ferro-electricity on the nanoscale, it is known to suffer artifacts which complicate quantitative interpretation of results and can even lead to claims of ferroelectricity in materials which are not ferroelectric. In this paper we explore the possibility of using an improved PFM method based on interferometric displacement sensing (IDS) to study nanoscale ferroelectricity in bare Si doped HfO2. Our results indicate a clear difference in the local remnant state of various HfO2 crystallites with reported values for the piezoelectric coupling in range 0.6-1.5 pm/V. In addition, we report unusual ferroelectric polarization switching including possible contributions from electrostriction and Vegard effect, which may indicate oxygen vacancies or interfacial effects influence the emergence of nanoscale ferroelectricity in HfO2.

preprint2020arXiv

Fast Scanning Probe Microscopy via Machine Learning: Non-rectangular scans with compressed sensing and Gaussian process optimization

Fast scanning probe microscopy enabled via machine learning allows for a broad range of nanoscale, temporally resolved physics to be uncovered. However, such examples for functional imaging are few in number. Here, using piezoresponse force microscopy (PFM) as a model application, we demonstrate a factor of 5.8 improvement in imaging rate using a combination of sparse spiral scanning with compressive sensing and Gaussian processing reconstruction. It is found that even extremely sparse scans offer strong reconstructions with less than 6 % error for Gaussian processing reconstructions. Further, we analyze the error associated with each reconstructive technique per reconstruction iteration finding the error is similar past approximately 15 iterations, while at initial iterations Gaussian processing outperforms compressive sensing. This study highlights the capabilities of reconstruction techniques when applied to sparse data, particularly sparse spiral PFM scans, with broad applications in scanning probe and electron microscopies.

preprint2020arXiv

Super-resolution and signal separation in contact Kelvin probe force microscopy of electrochemically active ferroelectric materials

Imaging mechanisms in contact Kelvin Probe Force Microscopy (cKPFM) are explored via information theory-based methods. Gaussian Processes are used to achieve super-resolution in the cKPFM signal, effectively extrapolating across the spatial and parameter space. Tensor matrix factorization is applied to reduce the multidimensional signal to the tensor convolution of the scalar functions that show clear trending behavior with the imaging parameters. These methods establish a workflow for the analysis of the multidimensional data sets, that can then be related to the relevant physical mechanisms. We also provide an interactive Google Colab notebook (http://bit.ly/39kMtuR) that goes through all the analysis discussed in the paper.

preprint2020arXiv

Task-Robust Model-Agnostic Meta-Learning

Meta-learning methods have shown an impressive ability to train models that rapidly learn new tasks. However, these methods only aim to perform well in expectation over tasks coming from some particular distribution that is typically equivalent across meta-training and meta-testing, rather than considering worst-case task performance. In this work we introduce the notion of "task-robustness" by reformulating the popular Model-Agnostic Meta-Learning (MAML) objective [Finn et al. 2017] such that the goal is to minimize the maximum loss over the observed meta-training tasks. The solution to this novel formulation is task-robust in the sense that it places equal importance on even the most difficult and/or rare tasks. This also means that it performs well over all distributions of the observed tasks, making it robust to shifts in the task distribution between meta-training and meta-testing. We present an algorithm to solve the proposed min-max problem, and show that it converges to an $ε$-accurate point at the optimal rate of $\mathcal{O}(1/ε^2)$ in the convex setting and to an $(ε, δ)$-stationary point at the rate of $\mathcal{O}(\max\{1/ε^5, 1/δ^5\})$ in nonconvex settings. We also provide an upper bound on the new task generalization error that captures the advantage of minimizing the worst-case task loss, and demonstrate this advantage in sinusoid regression and image classification experiments.