Source author record

Ildoo Kim

Ildoo Kim appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.AP Machine Learning Computer Vision math.FA math.PR Distributed, Parallel, and Cluster Computing physics.flu-dyn

Catalog footprint

What is connected

16works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2023arXiv

A weighted $L_q(L_p)$-theory for fully degenerate second-order evolution equations with unbounded time-measurable coefficients

We study the fully degenerate second-order evolution equation $u_t=a^{ij}(t)u_{x^ix^j} +b^i(t) u_{x^i} + c(t)u+f, \quad t>0, x\in \mathbb{R}^d$ given with the zero initial data. Here $a^{ij}(t)$, $b^i(t)$, $c(t)$ are merely locally integrable functions, and $(a^{ij}(t))_{d \times d}$ is a nonnegative symmetric matrix with the smallest eigenvalue $δ(t)\geq 0$. We show that there is a positive constant $N$ such that $\int_0^{T} \left(\int_{\mathbb{R}^d} \left(|u|+|u_{xx} |\right)^{p} dx \right)^{q/p} e^{-q\int_0^t c(s)ds} w(α(t)) δ(t) dt \leq N \int_0^{T} \left(\int_{\mathbb{R}^d} \left|f\left(t,x\right)\right|^{p} dx \right)^{q/p} e^{-q\int_0^t c(s)ds} w(α(t)) (δ(t))^{1-q} dt,$ where $p,q \in (1,\infty)$, $α(t)=\int_0^t δ(s)ds$, and $w$ is a Muckenhoupt's weight.

preprint2022arXiv

Contrastive Regularization for Semi-Supervised Learning

Consistency regularization on label predictions becomes a fundamental technique in semi-supervised learning, but it still requires a large number of training iterations for high performance. In this study, we analyze that the consistency regularization restricts the propagation of labeling information due to the exclusion of samples with unconfident pseudo-labels in the model updates. Then, we propose contrastive regularization to improve both efficiency and accuracy of the consistency regularization by well-clustered features of unlabeled data. In specific, after strongly augmented samples are assigned to clusters by their pseudo-labels, our contrastive regularization updates the model so that the features with confident pseudo-labels aggregate the features in the same cluster, while pushing away features in different clusters. As a result, the information of confident pseudo-labels can be effectively propagated into more unlabeled samples during training by the well-clustered features. On benchmarks of semi-supervised learning tasks, our contrastive regularization improves the previous consistency-based methods and achieves state-of-the-art results, especially with fewer training iterations. Our method also shows robust performance on open-set semi-supervised learning where unlabeled data includes out-of-distribution samples.

preprint2021arXiv

On the morphology of two-dimensional laminar vortex streets behind triangles

The two-dimensional laminar vortex streets behind a triangle have two morphologically distinct structures depending on the Reynolds number and the aspect ratio of the triangle. These two structures are the conventional structure and the separated rows structure, where the latter is characterized by a thin layer of irrotational fluid between two vortex rows. In this paper, by means of numerical simulation, we find that the separated rows structure occurs when the thickness of boundary layers is less than 25% of their separation distance. We also show from the linear stability analysis that the criterion is related to the coupling of two boundary layers in producing unstable modes.

preprint2020arXiv

A well-posedness theory in Sobolev spaces for the stochastic magnetohydrodynamic equations in the whole space

We prove the existence of a mild solution to the three dimensional incompressible stochastic magnetohydrodynamic equations in the whole space with the initial data which belong to the Sobolev spaces.

preprint2020arXiv

AutoCLINT: The Winning Method in AutoCV Challenge 2019

NeurIPS 2019 AutoDL challenge is a series of six automated machine learning competitions. Particularly, AutoCV challenges mainly focused on classification tasks on visual domain. In this paper, we introduce the winning method in the competition, AutoCLINT. The proposed method implements an autonomous training strategy, including efficient code optimization, and applies an automated data augmentation to achieve the fast adaptation of pretrained networks. We implement a light version of Fast AutoAugment to search for data augmentation policies efficiently for the arbitrarily given image domains. We also empirically analyze the components of the proposed method and provide ablation studies focusing on AutoCV datasets.

preprint2020arXiv

Spatially Attentive Output Layer for Image Classification

Most convolutional neural networks (CNNs) for image classification use a global average pooling (GAP) followed by a fully-connected (FC) layer for output logits. However, this spatial aggregation procedure inherently restricts the utilization of location-specific information at the output layer, although this spatial information can be beneficial for classification. In this paper, we propose a novel spatial output layer on top of the existing convolutional feature maps to explicitly exploit the location-specific output information. In specific, given the spatial feature maps, we replace the previous GAP-FC layer with a spatially attentive output layer (SAOL) by employing a attention mask on spatial logits. The proposed location-specific attention selectively aggregates spatial logits within a target region, which leads to not only the performance improvement but also spatially interpretable outputs. Moreover, the proposed SAOL also permits to fully exploit location-specific self-supervision as well as self-distillation to enhance the generalization ability during training. The proposed SAOL with self-supervision and self-distillation can be easily plugged into existing CNNs. Experimental results on various classification tasks with representative architectures show consistent performance improvements by SAOL at almost the same computational cost.

preprint2020arXiv

torchgpipe: On-the-fly Pipeline Parallelism for Training Giant Models

We design and implement a ready-to-use library in PyTorch for performing micro-batch pipeline parallelism with checkpointing proposed by GPipe (Huang et al., 2019). In particular, we develop a set of design components to enable pipeline-parallel gradient computation in PyTorch's define-by-run and eager execution environment. We show that each component is necessary to fully benefit from pipeline parallelism in such environment, and demonstrate the efficiency of the library by applying it to various network architectures including AmoebaNet-D and U-Net. Our library is available at https://github.com/kakaobrain/torchgpipe .

preprint2019arXiv

Fast AutoAugment

Data augmentation is an essential technique for improving generalization ability of deep learning models. Recently, AutoAugment has been proposed as an algorithm to automatically search for augmentation policies from a dataset and has significantly enhanced performances on many image recognition tasks. However, its search method requires thousands of GPU hours even for a relatively small dataset. In this paper, we propose an algorithm called Fast AutoAugment that finds effective augmentation policies via a more efficient search strategy based on density matching. In comparison to AutoAugment, the proposed algorithm speeds up the search time by orders of magnitude while achieves comparable performances on image recognition tasks with various models and datasets including CIFAR-10, CIFAR-100, SVHN, and ImageNet.

preprint2016arXiv

A Sobolev Space theory for stochastic partial differential equations with time-fractional derivatives

In this article we present an $L_p$-theory ($p\geq 2$) for the time-fractional quasi-linear stochastic partial differential equations (SPDEs) of type $$ \partial^α_tu=L(ω,t,x)u+f(u)+\partial^β_t \sum_{k=1}^{\infty}\int^t_0 ( Λ^k(ω,t,x)u+g^k(u))dw^k_t, $$ where $α\in (0,2)$, $β<α+\frac{1}{2}$, and $\partial^α_t$ and $\partial^β_t$ denote the Caputo derivative of order $α$ and $β$ respectively. The processes $w^k_t$, $k\in \mathbb{N}=\{1,2,\cdots\}$, are independent one-dimensional Wiener processes defined on a probability space $Ω$, $L$ is a second order operator of either divergence or non-divergence type, and $Λ^k$ are linear operators of order up to two. The coefficients of the equations depend on $ω(\in Ω), t,x$ and are allowed to be discontinuous. This class of SPDEs can be used to describe random effects on transport of particles in medium with thermal memory or particles subject to sticking and trapping.

preprint2015arXiv

An $L_q(L_p)$-theory for parabolic pseudo-differential equations: Calderón-Zygmund approach

In this paper we present a Calderón-Zygmund approach for a large class of parabolic equations with pseudo-differential operators $\mathcal{A}(t)$ of arbitrary order $γ\in(0,\infty)$. It is assumed that $\cA(t)$ is merely measurable with respect to the time variable. The unique solvability of the equation $$ \frac{\partial u}{\partial t}=\cA u-λu+f, \quad (t,x)\in \fR^{d+1} $$ and the $L_{q}(\fR,L_{p})$-estimate $$ \|u_{t}\|_{L_{q}(\fR,L_{p})}+\|(-Δ)^{γ/2}u\|_{L_{q}(\fR,L_{p})} +λ\|u\|_{L_{q}(\fR,L_{p})}\leq N\|f\|_{L_{q}(\fR,L_{p})} $$ are obtained for any $λ> 0$ and $p,q\in (1,\infty)$.

preprint2015arXiv

An $L_q(L_p)$-theory for the time fractional evolution equations with variable coefficients

We introduce an $L_q(L_p)$-theory for the quasi-linear fractional equations of the type $$ \partial^α_t u(t,x)=a^{ij}(t,x)u_{x^i x^j}(t,x)+f(t,x,u), \quad t>0, \,x\in \mathbf{R}^d. $$ Here, $α\in (0,2)$, $p,q>1$, and $\partial^α_t$ is the Caupto fractional derivative of order $α$. Uniqueness, existence, and $L_q(L_p)$-estimates of solutions are obtained. The leading coefficients $a^{ij}(t,x)$ are assumed to be piecewise continuous in $t$ and uniformly continuous in $x$. In particular $a^{ij}(t,x)$ are allowed to be discontinuous with respect to the time variable. Our approach is based on classical tools in PDE theories such as the Marcinkiewicz interpolation theorem, the Calderon-Zygmund theorem, and perturbation arguments.

preprint2015arXiv

Parabolic Littlewood-Paley inequality for a class of time-dependent operators of arbitrary order, and applications to higher order stochastic PDE

In this paper we prove a parabolic version of the Littlewood-Paley inequality for a class of time-dependent local and non-local operators of arbitrary order, and as an application we show this inequality gives a fundamental estimate for the $L_p$-theory of the stochastic partial differential equations.

preprint2014arXiv

An L_p-theory for a class of non-local elliptic equations related to nonsymmetric measurable kernels

We study the integro-differential operators $L$ with kernels $K(y) = a(y) J(y)$, where $J(y)dy$ is a Lévy measure on $\bR^d$ (i.e. $\int_{\bR^d}(1\wedge |y|^2)J(y)dy<\infty$) and $a(y)$ is an only measurable function with positive lower and upper bounds. Under few additional conditions on $J(y)$, we prove the unique solvability of the equation $Lu-λu=f$ in $L_p$-spaces and present some $L_p$-estimates of the solutions.

preprint2014arXiv

Parabolic BMO estimates for pseudo-differential operators of arbitrary order

In this article we prove the BMO-$L_{\infty}$ estimate $$ \|(-Δ)^{γ/2} u\|_{BMO(\mathbf{R}^{d+1})}\leq N \|\frac{\partial}{\partial t}u-A(t)u\|_{L_{\infty}(\mathbf{R}^{d+1})}, \quad \forall\, u\in C^{\infty}_c(\mathbf{R}^{d+1}) $$ for a wide class of pseudo-differential operators $A(t)$ of order $γ\in (0,\infty)$. The coefficients of $A(t)$ are assumed to be merely measurable in time variable. As an application to the equation $$ \frac{\partial}{\partial t}u=A(t)u+f,\quad t\in \mathbf{R} $$ we prove that for any $u\in C^{\infty}_c(\mathbf{R}^{d+1})$ $$ \|u_t\|_{L_p(\mathbf{R}^{d+1})}+\|(-Δ)^{γ/2}u\|_{L_p(\mathbf{R}^{d+1})}\leq N\|u_t-A(t)u\|_{L_p(\mathbf{R}^{d+1})}, $$ where $p\ in (1,\infty)$ and the constant $N$ is independent of $u$.

preprint2013arXiv

Parabolic Littlewood-Paley inequality for $ϕ(-Δ)$-type operators and applications to Stochastic integro-differential equations

In this paper we prove a parabolic version of the Littlewood-Paley inequality for the operators of the type $ϕ(-Δ)$, where $ϕ$ is a Bernstein function. As an application, we construct an $L_p$-theory for the stochastic integro-differential equations of the type $du=(-ϕ(-Δ)u+f)dt +gdW_t$.

preprint2010arXiv

A generalization of the Littlewood-Paley inequality for the fractional Laplacian $(-Δ)^{α/2}$

We prove a parabolic version of the Littlewood-Paley inequality for the fractional Laplacian $(-Δ)^{α/2}$, where $α\in (0,2)$.

Ildoo Kim

What is connected

Connect this record

See the researcher in context

Building this map preview

16 published item(s)

A weighted $L_q(L_p)$-theory for fully degenerate second-order evolution equations with unbounded time-measurable coefficients

Contrastive Regularization for Semi-Supervised Learning

On the morphology of two-dimensional laminar vortex streets behind triangles

A well-posedness theory in Sobolev spaces for the stochastic magnetohydrodynamic equations in the whole space

AutoCLINT: The Winning Method in AutoCV Challenge 2019

Spatially Attentive Output Layer for Image Classification

torchgpipe: On-the-fly Pipeline Parallelism for Training Giant Models

Fast AutoAugment

A Sobolev Space theory for stochastic partial differential equations with time-fractional derivatives

An $L_q(L_p)$-theory for parabolic pseudo-differential equations: Calderón-Zygmund approach

An $L_q(L_p)$-theory for the time fractional evolution equations with variable coefficients

Parabolic Littlewood-Paley inequality for a class of time-dependent operators of arbitrary order, and applications to higher order stochastic PDE

An L_p-theory for a class of non-local elliptic equations related to nonsymmetric measurable kernels

Parabolic BMO estimates for pseudo-differential operators of arbitrary order

Parabolic Littlewood-Paley inequality for $ϕ(-Δ)$-type operators and applications to Stochastic integro-differential equations

A generalization of the Littlewood-Paley inequality for the fractional Laplacian $(-Δ)^{α/2}$