Source author record

Kevin Smith

Kevin Smith appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision Machine Learning eess.IV math.AP math.CV math.DG Artificial Intelligence Formal Languages and Automata Theory Networking and Internet Architecture physics.acc-ph

Catalog footprint

What is connected

11works

10topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

Coupled continuity equations for constant scalar curvature Kähler metrics

Inspired by a parabolic system of Li-Yuan-Zhang and the continuity equation of La Nave-Tian, we study a system of elliptic equations for a Kähler metric $ω$ and a closed $(1, 1)$-form $α$. Assuming a uniform estimate for $ω$, we prove higher order estimates and smooth convergence to a cscK metric coupled to a harmonic $(1, 1)$-form. A simplification of the system is used to recover existence results for Kähler-Einstein metrics when $c_1(X) < 0$. On Riemann surfaces with genus at least $2$, we show smooth convergence to the unique Kähler-Einstein metric from a large class of initial data.

preprint2022arXiv

Physion: Evaluating Physical Prediction from Vision in Humans and Machines

While current vision algorithms excel at many challenging tasks, it is unclear how well they understand the physical dynamics of real-world environments. Here we introduce Physion, a dataset and benchmark for rigorously evaluating the ability to predict how physical scenarios will evolve over time. Our dataset features realistic simulations of a wide range of physical phenomena, including rigid and soft-body collisions, stable multi-object configurations, rolling, sliding, and projectile motion, thus providing a more comprehensive challenge than previous benchmarks. We used Physion to benchmark a suite of models varying in their architecture, learning objective, input-output structure, and training data. In parallel, we obtained precise measurements of human prediction behavior on the same set of scenarios, allowing us to directly evaluate how well any model could approximate human behavior. We found that vision algorithms that learn object-centric representations generally outperform those that do not, yet still fall far short of human performance. On the other hand, graph neural networks with direct access to physical state information both perform substantially better and make predictions that are more similar to those made by humans. These results suggest that extracting physical representations of scenes is the main bottleneck to achieving human-level and human-like physical understanding in vision algorithms. We have publicly released all data and code to facilitate the use of Physion to benchmark additional models in a fully reproducible manner, enabling systematic evaluation of progress towards vision algorithms that understand physical environments as robustly as people do.

preprint2022arXiv

PSL is Dead. Long Live PSL

Property Specification Language (PSL) is a form of temporal logic that has been mainly used in discrete domains (e.g. formal hardware verification). In this paper, we show that by merging machine learning techniques with PSL monitors, we can extend PSL to work on continuous domains. We apply this technique in machine learning-based anomaly detection to analyze scenarios of real-time streaming events from continuous variables in order to detect abnormal behaviors of a system. By using machine learning with formal models, we leverage the strengths of both machine learning methods and formal semantics of time. On one hand, machine learning techniques can produce distributions on continuous variables, where abnormalities can be captured as deviations from the distributions. On the other hand, formal methods can characterize discrete temporal behaviors and relations that cannot be easily learned by machine learning techniques. Interestingly, the anomalies detected by machine learning and the underlying time representation used are discrete events. We implemented a temporal monitoring package (TEF) that operates in conjunction with normal data science packages for anomaly detection machine learning systems, and we show that TEF can be used to perform accurate interpretation of temporal correlation between events.

preprint2022arXiv

The continuity equation on Hopf and Inoue surfaces

We study the continuity equation of La Nave-Tian, extended to the Hermitian setting by Sherman-Weinkove, on Hopf and Inoue surfaces. We prove a priori estimates for solutions in both cases, and Gromov-Hausdorff convergence of Inoue surfaces to a circle.

preprint2022arXiv

What Makes Transfer Learning Work For Medical Images: Feature Reuse & Other Factors

Transfer learning is a standard technique to transfer knowledge from one domain to another. For applications in medical imaging, transfer from ImageNet has become the de-facto approach, despite differences in the tasks and image characteristics between the domains. However, it is unclear what factors determine whether - and to what extent - transfer learning to the medical domain is useful. The long-standing assumption that features from the source domain get reused has recently been called into question. Through a series of experiments on several medical image benchmark datasets, we explore the relationship between transfer learning, data size, the capacity and inductive bias of the model, as well as the distance between the source and target domain. Our findings suggest that transfer learning is beneficial in most cases, and we characterize the important role feature reuse plays in its success.

preprint2020arXiv

Adding Seemingly Uninformative Labels Helps in Low Data Regimes

Evidence suggests that networks trained on large datasets generalize well not solely because of the numerous training examples, but also class diversity which encourages learning of enriched features. This raises the question of whether this remains true when data is scarce - is there an advantage to learning with additional labels in low-data regimes? In this work, we consider a task that requires difficult-to-obtain expert annotations: tumor segmentation in mammography images. We show that, in low-data settings, performance can be improved by complementing the expert annotations with seemingly uninformative labels from non-expert annotators, turning the task into a multi-class problem. We reveal that these gains increase when less expert data is available, and uncover several interesting properties through further studies. We demonstrate our findings on CSAW-S, a new dataset that we introduce here, and confirm them on two public datasets.

preprint2020arXiv

Decoupling Inherent Risk and Early Cancer Signs in Image-based Breast Cancer Risk Models

The ability to accurately estimate risk of developing breast cancer would be invaluable for clinical decision-making. One promising new approach is to integrate image-based risk models based on deep neural networks. However, one must take care when using such models, as selection of training data influences the patterns the network will learn to identify. With this in mind, we trained networks using three different criteria to select the positive training data (i.e. images from patients that will develop cancer): an inherent risk model trained on images with no visible signs of cancer, a cancer signs model trained on images containing cancer or early signs of cancer, and a conflated model trained on all images from patients with a cancer diagnosis. We find that these three models learn distinctive features that focus on different patterns, which translates to contrasts in performance. Short-term risk is best estimated by the cancer signs model, whilst long-term risk is best estimated by the inherent risk model. Carelessly training with all images conflates inherent risk with early cancer signs, and yields sub-optimal estimates in both regimes. As a consequence, conflated models may lead physicians to recommend preventative action when early cancer signs are already visible.

preprint2020arXiv

Explanation-based Weakly-supervised Learning of Visual Relations with Graph Networks

Visual relationship detection is fundamental for holistic image understanding. However, the localization and classification of (subject, predicate, object) triplets remain challenging tasks, due to the combinatorial explosion of possible relationships, their long-tailed distribution in natural images, and an expensive annotation process. This paper introduces a novel weakly-supervised method for visual relationship detection that relies on minimal image-level predicate labels. A graph neural network is trained to classify predicates in images from a graph representation of detected objects, implicitly encoding an inductive bias for pairwise relations. We then frame relationship detection as the explanation of such a predicate classifier, i.e. we obtain a complete relation by recovering the subject and object of a predicted predicate. We present results comparable to recent fully- and weakly-supervised methods on three diverse and challenging datasets: HICO-DET for human-object interaction, Visual Relationship Detection for generic object-to-object relations, and UnRel for unusual triplets; demonstrating robustness to non-comprehensive annotations and good few-shot generalization.

preprint2020arXiv

Parabolic complex Monge-Ampère equations on compact Hermitian manifolds

We prove the long-time existence and convergence of solutions to a general class of parabolic equations, not necessarily concave in the Hessian of the unknown function, on a compact Hermitian manifold. The limiting function is identified as the solution of an elliptic complex Monge-Ampère equation.

preprint2016arXiv

New Generation Value Networks for Content Delivery

In this paper we paint a broad picture of the Internet content delivery market, by taking into consideration both economical and technical challenges that might drive the interactions among the stakeholders in the future. We focus on a few disrupting factors, namely ubiquitous encryption, traffic boost, network scalability, latency needs and network control; and try to figure out whether the current model (CDN) is robust against their variation, which optimization can be envisioned and how the most accredited option (ICN) can be of help.

preprint2015arXiv

High-gradient High-charge CW Superconducting RF gun with CsK2Sb photocathode

High-gradient CW photo-injectors operating at high accelerating gradients promise to revolutionize many sciences and applications. They can establish the basis for super-bright monochromatic X-ray free-electron lasers, super-bright hadron beams, nuclear- waste transmutation or a new generation of microchip production. In this letter we report on our operation of a superconducting RF electron gun with a record-high accelerating gradient at the CsK2Sb photocathode (i.e. ~ 20 MV/m) generating a record-high bunch charge (i.e., 3 nC). We briefly describe the system and then detail our experimental results. This achievement opens new era in generating high-power electron beams with a very high brightness.

Kevin Smith

What is connected

Connect this record

See the researcher in context

Building this map preview

11 published item(s)

Coupled continuity equations for constant scalar curvature Kähler metrics

Physion: Evaluating Physical Prediction from Vision in Humans and Machines

PSL is Dead. Long Live PSL

The continuity equation on Hopf and Inoue surfaces

What Makes Transfer Learning Work For Medical Images: Feature Reuse & Other Factors

Adding Seemingly Uninformative Labels Helps in Low Data Regimes

Decoupling Inherent Risk and Early Cancer Signs in Image-based Breast Cancer Risk Models

Explanation-based Weakly-supervised Learning of Visual Relations with Graph Networks

Parabolic complex Monge-Ampère equations on compact Hermitian manifolds

New Generation Value Networks for Content Delivery

High-gradient High-charge CW Superconducting RF gun with CsK2Sb photocathode