Researcher profile

Ildoo Kim

Ildoo Kim contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
8works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

8 published item(s)

preprint2023arXiv

A weighted $L_q(L_p)$-theory for fully degenerate second-order evolution equations with unbounded time-measurable coefficients

We study the fully degenerate second-order evolution equation $u_t=a^{ij}(t)u_{x^ix^j} +b^i(t) u_{x^i} + c(t)u+f, \quad t>0, x\in \mathbb{R}^d$ given with the zero initial data. Here $a^{ij}(t)$, $b^i(t)$, $c(t)$ are merely locally integrable functions, and $(a^{ij}(t))_{d \times d}$ is a nonnegative symmetric matrix with the smallest eigenvalue $δ(t)\geq 0$. We show that there is a positive constant $N$ such that $\int_0^{T} \left(\int_{\mathbb{R}^d} \left(|u|+|u_{xx} |\right)^{p} dx \right)^{q/p} e^{-q\int_0^t c(s)ds} w(α(t)) δ(t) dt \leq N \int_0^{T} \left(\int_{\mathbb{R}^d} \left|f\left(t,x\right)\right|^{p} dx \right)^{q/p} e^{-q\int_0^t c(s)ds} w(α(t)) (δ(t))^{1-q} dt,$ where $p,q \in (1,\infty)$, $α(t)=\int_0^t δ(s)ds$, and $w$ is a Muckenhoupt's weight.

preprint2022arXiv

Contrastive Regularization for Semi-Supervised Learning

Consistency regularization on label predictions becomes a fundamental technique in semi-supervised learning, but it still requires a large number of training iterations for high performance. In this study, we analyze that the consistency regularization restricts the propagation of labeling information due to the exclusion of samples with unconfident pseudo-labels in the model updates. Then, we propose contrastive regularization to improve both efficiency and accuracy of the consistency regularization by well-clustered features of unlabeled data. In specific, after strongly augmented samples are assigned to clusters by their pseudo-labels, our contrastive regularization updates the model so that the features with confident pseudo-labels aggregate the features in the same cluster, while pushing away features in different clusters. As a result, the information of confident pseudo-labels can be effectively propagated into more unlabeled samples during training by the well-clustered features. On benchmarks of semi-supervised learning tasks, our contrastive regularization improves the previous consistency-based methods and achieves state-of-the-art results, especially with fewer training iterations. Our method also shows robust performance on open-set semi-supervised learning where unlabeled data includes out-of-distribution samples.

preprint2021arXiv

On the morphology of two-dimensional laminar vortex streets behind triangles

The two-dimensional laminar vortex streets behind a triangle have two morphologically distinct structures depending on the Reynolds number and the aspect ratio of the triangle. These two structures are the conventional structure and the separated rows structure, where the latter is characterized by a thin layer of irrotational fluid between two vortex rows. In this paper, by means of numerical simulation, we find that the separated rows structure occurs when the thickness of boundary layers is less than 25% of their separation distance. We also show from the linear stability analysis that the criterion is related to the coupling of two boundary layers in producing unstable modes.

preprint2020arXiv

AutoCLINT: The Winning Method in AutoCV Challenge 2019

NeurIPS 2019 AutoDL challenge is a series of six automated machine learning competitions. Particularly, AutoCV challenges mainly focused on classification tasks on visual domain. In this paper, we introduce the winning method in the competition, AutoCLINT. The proposed method implements an autonomous training strategy, including efficient code optimization, and applies an automated data augmentation to achieve the fast adaptation of pretrained networks. We implement a light version of Fast AutoAugment to search for data augmentation policies efficiently for the arbitrarily given image domains. We also empirically analyze the components of the proposed method and provide ablation studies focusing on AutoCV datasets.

preprint2020arXiv

Spatially Attentive Output Layer for Image Classification

Most convolutional neural networks (CNNs) for image classification use a global average pooling (GAP) followed by a fully-connected (FC) layer for output logits. However, this spatial aggregation procedure inherently restricts the utilization of location-specific information at the output layer, although this spatial information can be beneficial for classification. In this paper, we propose a novel spatial output layer on top of the existing convolutional feature maps to explicitly exploit the location-specific output information. In specific, given the spatial feature maps, we replace the previous GAP-FC layer with a spatially attentive output layer (SAOL) by employing a attention mask on spatial logits. The proposed location-specific attention selectively aggregates spatial logits within a target region, which leads to not only the performance improvement but also spatially interpretable outputs. Moreover, the proposed SAOL also permits to fully exploit location-specific self-supervision as well as self-distillation to enhance the generalization ability during training. The proposed SAOL with self-supervision and self-distillation can be easily plugged into existing CNNs. Experimental results on various classification tasks with representative architectures show consistent performance improvements by SAOL at almost the same computational cost.

preprint2020arXiv

torchgpipe: On-the-fly Pipeline Parallelism for Training Giant Models

We design and implement a ready-to-use library in PyTorch for performing micro-batch pipeline parallelism with checkpointing proposed by GPipe (Huang et al., 2019). In particular, we develop a set of design components to enable pipeline-parallel gradient computation in PyTorch's define-by-run and eager execution environment. We show that each component is necessary to fully benefit from pipeline parallelism in such environment, and demonstrate the efficiency of the library by applying it to various network architectures including AmoebaNet-D and U-Net. Our library is available at https://github.com/kakaobrain/torchgpipe .

preprint2019arXiv

Fast AutoAugment

Data augmentation is an essential technique for improving generalization ability of deep learning models. Recently, AutoAugment has been proposed as an algorithm to automatically search for augmentation policies from a dataset and has significantly enhanced performances on many image recognition tasks. However, its search method requires thousands of GPU hours even for a relatively small dataset. In this paper, we propose an algorithm called Fast AutoAugment that finds effective augmentation policies via a more efficient search strategy based on density matching. In comparison to AutoAugment, the proposed algorithm speeds up the search time by orders of magnitude while achieves comparable performances on image recognition tasks with various models and datasets including CIFAR-10, CIFAR-100, SVHN, and ImageNet.