Source author record

Gayoung Lee

Gayoung Lee appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision astro-ph.GA astro-ph.IM Graphics Machine Learning

Catalog footprint

What is connected

6works

5topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2022arXiv

Generator Knows What Discriminator Should Learn in Unconditional GANs

Recent methods for conditional image generation benefit from dense supervision such as segmentation label maps to achieve high-fidelity. However, it is rarely explored to employ dense supervision for unconditional image generation. Here we explore the efficacy of dense supervision in unconditional generation and find generator feature maps can be an alternative of cost-expensive semantic label maps. From our empirical evidences, we propose a new generator-guided discriminator regularization(GGDR) in which the generator feature maps supervise the discriminator to have rich semantic representations in unconditional generation. In specific, we employ an U-Net architecture for discriminator, which is trained to predict the generator feature maps given fake images as inputs. Extensive experiments on mulitple datasets show that our GGDR consistently improves the performance of baseline methods in terms of quantitative and qualitative aspects. Code is available at https://github.com/naver-ai/GGDR

preprint2022arXiv

Memory Efficient Patch-based Training for INR-based GANs

Recent studies have shown remarkable progress in GANs based on implicit neural representation (INR) - an MLP that produces an RGB value given its (x, y) coordinate. They represent an image as a continuous version of the underlying 2D signal instead of a 2D array of pixels, which opens new horizons for GAN applications (e.g., zero-shot super-resolution, image outpainting). However, training existing approaches require a heavy computational cost proportional to the image resolution, since they compute an MLP operation for every (x, y) coordinate. To alleviate this issue, we propose a multi-stage patch-based training, a novel and scalable approach that can train INR-based GANs with a flexible computational cost regardless of the image resolution. Specifically, our method allows to generate and discriminate by patch to learn the local details of the image and learn global structural information by a novel reconstruction loss to enable efficient GAN training. We conduct experiments on several benchmark datasets to demonstrate that our approach enhances baseline models in GPU memory while maintaining FIDs at a reasonable level.

preprint2022arXiv

Performance Assessment of the KASI-Deep Rolling Imaging Fast-optics Telescope pathfinder

In a $Λ$CDM universe, most galaxies evolve by mergers and accretions, leaving faint and/or diffuse structures, such as tidal streams and stellar halos. Although these structures are a good indicator of galaxies' recent mass assembly history, they have the disadvantage of being difficult to observe due to their low surface brightness (LSB). To recover these LSB features by minimizing the photometric uncertainties introduced by the optical system, we developed a new optimized telescope named K-DRIFT pathfinder, adopting a linear astigmatism free-three mirror system. Thanks to the off-axis design, it is expected to avoid the loss and scattering of light on the optical path within the telescope. To assess the performance of this prototype telescope, we investigate the photometric depth and capability to identify LSB features. We find that the surface brightness limit reaches down to $μ_{r,1σ}\sim28.5$ mag arcsec$^{-2}$ in $10^{\prime\prime}\times10^{\prime\prime}$ boxes, enabling us to identify a single stellar stream to the east of NGC 5907. We also examine the characteristics of the point spread function (PSF) and find that the PSF wing reaches a very low level. Still, however, some internal reflections appear within a radius of $\sim$6 arcmin from the center of sources. Despite a relatively small aperture (0.3 m) and short integration time (2 hr), this result demonstrates that our telescope is highly efficient in LSB detection.

preprint2022arXiv

RewriteNet: Reliable Scene Text Editing with Implicit Decomposition of Text Contents and Styles

Scene text editing (STE), which converts a text in a scene image into the desired text while preserving an original style, is a challenging task due to a complex intervention between text and style. In this paper, we propose a novel STE model, referred to as RewriteNet, that decomposes text images into content and style features and re-writes a text in the original image. Specifically, RewriteNet implicitly distinguishes the content from the style by introducing scene text recognition. Additionally, independent of the exact supervisions with synthetic examples, we propose a self-supervised training scheme for unlabeled real-world images, which bridges the domain gap between synthetic and real data. Our experiments present that RewriteNet achieves better generation performances than other comparisons. Further analysis proves the feature decomposition of RewriteNet and demonstrates the reliability and robustness through diverse experiments. Our implementation is publicly available at \url{https://github.com/clovaai/rewritenet}

preprint2020arXiv

Few-shot Compositional Font Generation with Dual Memory

Generating a new font library is a very labor-intensive and time-consuming job for glyph-rich scripts. Despite the remarkable success of existing font generation methods, they have significant drawbacks; they require a large number of reference images to generate a new font set, or they fail to capture detailed styles with only a few samples. In this paper, we focus on compositional scripts, a widely used letter system in the world, where each glyph can be decomposed by several components. By utilizing the compositionality of compositional scripts, we propose a novel font generation framework, named Dual Memory-augmented Font Generation Network (DM-Font), which enables us to generate a high-quality font library with only a few samples. We employ memory components and global-context awareness in the generator to take advantage of the compositionality. In the experiments on Korean-handwriting fonts and Thai-printing fonts, we observe that our method generates a significantly better quality of samples with faithful stylization compared to the state-of-the-art generation methods quantitatively and qualitatively. Source code is available at https://github.com/clovaai/dmfont.

preprint2016arXiv

Deep Saliency with Encoded Low level Distance Map and High Level Features

Recent advances in saliency detection have utilized deep learning to obtain high level features to detect salient regions in a scene. These advances have demonstrated superior results over previous works that utilize hand-crafted low level features for saliency detection. In this paper, we demonstrate that hand-crafted features can provide complementary information to enhance performance of saliency detection that utilizes only high level features. Our method utilizes both high level and low level features for saliency detection under a unified deep learning framework. The high level features are extracted using the VGG-net, and the low level features are compared with other parts of an image to form a low level distance map. The low level distance map is then encoded using a convolutional neural network(CNN) with multiple 1X1 convolutional and ReLU layers. We concatenate the encoded low level distance map and the high level features, and connect them to a fully connected neural network classifier to evaluate the saliency of a query region. Our experiments show that our method can further improve the performance of state-of-the-art deep learning-based saliency detection methods.

Gayoung Lee

What is connected

Connect this record

See the researcher in context

Building this map preview

6 published item(s)

Generator Knows What Discriminator Should Learn in Unconditional GANs

Memory Efficient Patch-based Training for INR-based GANs

Performance Assessment of the KASI-Deep Rolling Imaging Fast-optics Telescope pathfinder

RewriteNet: Reliable Scene Text Editing with Implicit Decomposition of Text Contents and Styles

Few-shot Compositional Font Generation with Dual Memory

Deep Saliency with Encoded Low level Distance Map and High Level Features