Source author record

Soo-Young Lee

Soo-Young Lee appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

physics.optics Computation and Language Machine Learning nlin.CD cond-mat.mes-hall quant-ph cond-mat.supr-con Neural and Evolutionary Computing

Catalog footprint

What is connected

14works

8topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2020arXiv

Hierarchical GPT with Congruent Transformers for Multi-Sentence Language Models

We report a GPT-based multi-sentence language model for dialogue generation and document understanding. First, we propose a hierarchical GPT which consists of three blocks, i.e., a sentence encoding block, a sentence generating block, and a sentence decoding block. The sentence encoding and decoding blocks are basically the encoder-decoder blocks of the standard Transformers, which work on each sentence independently. The sentence generating block is inserted between the encoding and decoding blocks, and generates the next sentence embedding vector from the previous sentence embedding vectors. We believe it is the way human make conversation and understand paragraphs and documents. Since each sentence may consist of fewer words, the sentence encoding and decoding Transformers can use much smaller dimensional embedding vectors. Secondly, we note the attention in the Transformers utilizes the inner-product similarity measure. Therefore, to compare the two vectors in the same space, we set the transform matrices for queries and keys to be the same. Otherwise, the similarity concept is incongruent. We report experimental results to show that these two modifications increase the language model performance for tasks with multiple sentences.

preprint2020arXiv

Semi-supervised Disentanglement with Independent Vector Variational Autoencoders

We aim to separate the generative factors of data into two latent vectors in a variational autoencoder. One vector captures class factors relevant to target classification tasks, while the other vector captures style factors relevant to the remaining information. To learn the discrete class features, we introduce supervision using a small amount of labeled data, which can simply yet effectively reduce the effort required for hyperparameter tuning performed in existing unsupervised methods. Furthermore, we introduce a learning objective to encourage statistical independence between the vectors. We show that (i) this vector independence term exists within the result obtained on decomposing the evidence lower bound with multiple latent vectors, and (ii) encouraging such independence along with reducing the total correlation within the vectors enhances disentanglement performance. Experiments conducted on several image datasets demonstrate that the disentanglement achieved via our method can improve classification performance and generation controllability.

preprint2016arXiv

Compositional Sentence Representation from Character within Large Context Text

This paper describes a Hierarchical Composition Recurrent Network (HCRN) consisting of a 3-level hierarchy of compositional models: character, word and sentence. This model is designed to overcome two problems of representing a sentence on the basis of a constituent word sequence. The first is a data-sparsity problem in word embedding, and the other is a no usage of inter-sentence dependency. In the HCRN, word representations are built from characters, thus resolving the data-sparsity problem, and inter-sentence dependency is embedded into sentence representation at the level of sentence composition. We adopt a hierarchy-wise learning scheme in order to alleviate the optimization difficulties of learning deep hierarchical recurrent network in end-to-end fashion. The HCRN was quantitatively and qualitatively evaluated on a dialogue act classification task. Especially, sentence representations with an inter-sentence dependency are able to capture both implicit and explicit semantics of sentence, significantly improving performance. In the end, the HCRN achieved state-of-the-art performance with a test error rate of 22.7% for dialogue act classification on the SWBD-DAMSL database.

preprint2016arXiv

Deep CNNs along the Time Axis with Intermap Pooling for Robustness to Spectral Variations

Convolutional neural networks (CNNs) with convolutional and pooling operations along the frequency axis have been proposed to attain invariance to frequency shifts of features. However, this is inappropriate with regard to the fact that acoustic features vary in frequency. In this paper, we contend that convolution along the time axis is more effective. We also propose the addition of an intermap pooling (IMP) layer to deep CNNs. In this layer, filters in each group extract common but spectrally variant features, then the layer pools the feature maps of each group. As a result, the proposed IMP CNN can achieve insensitivity to spectral variations characteristic of different speakers and utterances. The effectiveness of the IMP CNN architecture is demonstrated on several LVCSR tasks. Even without speaker adaptation techniques, the architecture achieved a WER of 12.7% on the SWB part of the Hub5'2000 evaluation test set, which is competitive with other state-of-the-art methods.

preprint2015arXiv

A Novel Analytic Approach to Model Line Edge Roughness using Stochastic Exposure Distribution in Electron-beam Lithography

The line edge roughness (LER) becomes a issue of e-beam lithography when feature size is reduced into nanometers. Therefore, minimizing the LER is a important method to increase the density of circuit patterns. One of the possible ways is through simulation. The stochastic exposure distributions in the resist is generated by the Monte Carlo simulation. In addition a resist development simulation needs to be carried out. Although there are several ways to simulate or estimate LER but none of them can reveal as much the inner relationship between LER and different parameters as theocratical analysing methods can do. In this paper, a new approach to analytically derive the LER based on the statistical exposure, is described. Our approach is based on analytic model of stochastic exposure distribution, deriving standard deviation of exposure and analyzing the variance of edge location after development. Even though it may not be a complete modeling of LER, it can still show some strong relationship between LER and some inner parameters.

preprint2015arXiv

Exceptional points in coupled dissipative dynamical systems

We study the transient behavior in coupled dissipative dynamical systems based on the linear analysis around the steady state. We find that the transient time is minimized at a specific set of system parameters and show that at this parameter set, two eigenvalues and two eigenvectors of Jacobian matrix coalesce at the same time, this degenerate point is called the exceptional point. For the case of coupled limit cycle oscillators, we investigate the transient behavior into the amplitude death state, and clarify that the exceptional point is associated with a critical point of frequency locking, as well as the transition of the envelope oscillation.

preprint2014arXiv

Abnormal high-$Q$ modes of coupled stadium-shaped microcavities

It is well known that the strongly deformed microcavity with fully chaotic ray dynamics cannot support high-Q modes due to its fast chaotic diffusion to the critical line of refractive emission. Here, we investigate how the Q factor is modified when two chaotic cavities are coupled, and show that some modes, whose Q factor is about 10 times higher than that of the corresponding single cavity, can exist. These abnormal high-Q modes are the result of an optimal combination of coupling and cavity geometry. As an example, in the coupled stadium-shaped microcavities, the mode pattern extends over both cavities such that it follows a whispering-gallery-type mode at both ends, whereas a big coupling spot forms at the closest contact of the two microcavities. The pattern of such a 'rounded bow tie' mode allows the mode to have a high-Q factor. This mode pattern minimizes the leakage of light at both ends of the microcavities as the pattern at both ends is similar to whispering gallery mode.

preprint2014arXiv

Quantum Goos-Hänchen shift and tunneling transmission at a curved step potential

We study the quantum Goos-Hänchen (GH) shift and the tunneling transmission at a curved step potential by investigating the time evolution of a wave packet. An initial wave packet is expanded in terms of the eigenmodes of a circular step potential. Its time evolution is then given by the interference of their simple eigenmode oscillations. We show that the GH shift along the step boundary can be explained by the energy-dependent phase loss upon reflection, which is defined by modifying the one-dimensional (1D) effective potential derived from the 2D circular system. We also demonstrate that the tunneling transmission of the wave packet is characterized by a free-space image distant from the boundary. The tunneling transmission exhibits a rather wide angle divergence and the direction of maximum tunneling is slightly rotated from the tangent at the incident point, which is consistent with the time delay of the tunneling wave packet computed in the 1D modified effective potential.

preprint2013arXiv

Hierarchical Data Representation Model - Multi-layer NMF

In this paper, we propose a data representation model that demonstrates hierarchical feature learning using nsNMF. We extend unit algorithm into several layers. Experiments with document and image data successfully discovered feature hierarchies. We also prove that proposed method results in much better classification and reconstruction performance, especially for small number of features. feature hierarchies.

preprint2012arXiv

Analysis of multiple exceptional points related to three interacting eigenmodes in a non-Hermitian Hamiltonian

We have investigated the exceptional points (EPs) which are degeneracies of a non-Hermitian Hamiltonian, in the case that three modes are interacting with each other. Even though the parametric evolution of the modes cannot be uniquely determined when encircling more than two EPs once, we can recover the initial configuration of the modes by encircling two EPs three times or three EPs twice. We confirm our expectation by numerically calculating the modes of an open quantum system, two dielectric microdisks, and 3$\times$3 matrix model.

preprint2012arXiv

Sticky Normal-Superconductor Interface

We study the quantum Goos-Hänchen(GH) effect for wave-packet dynamics at a normal/superconductor (NS) interface. We find that the effect is amplified by a factor $(E_F/Δ)$, with $E_F$ the Fermi energy and $Δ$ the gap. Interestingly, the GH effect appears only as a time delay $δt$ without any lateral shift, and the corresponding delay length is about $(E_F/Δ)λ_F$, with $λ_F$ the Fermi wavelength. This makes the NS interface "sticky" when $Δ\ll E_F$, since typically GH effects are of wavelength order. This "sticky" behavior can be further enhanced by a resonance mode in NSNS interface. Finally, for a large $Δ$, the resonance-mode effect makes a transition from Andreev to the specular electron reflection as the width of the sandwiched superconductor is reduced.

preprint2011arXiv

Quasiscarred modes and their branching behavior at an exceptional point

We study quasiscarring phenomenon and mode branching at an exceptional point (EP) in typically deformed microcavities. It is shown that quasiscarred (QS) modes are dominant in some mode group and their pattern can be understood by short-time ray dynamics near the critical line. As cavity deformation increases, high-Q and low-Q QS modes are branching in an opposite way, at an EP, into two robust mode types showing QS and diamond patterns, respectively. Similar branching behavior can be also found at another EP appearing at a higher deformation. This branching behavior of QS modes has its origin on the fact that an EP is a square-root branch point.

preprint2009arXiv

Coupled non-identical microdisks: avoided crossing of energy levels and unidirectional far-field emission

We investigate two coupled microdisks with non-identical radii focusing on the parametric evolution of energy levels and the unidirectional far-field emission. We show that the evolution of energy levels is characterized by the avoided crossing intrinsically associated with the exceptional point or the non-Hermitian degeneracy. These spectral properties explain highly asymmetric near-field intensity pattern of the resonance mode. The observed unidirectional far-field emission is shown to be understood by considering the forbidden inter-disk coupling in the ray picture induced by the frustrated total internal reflection near the closest point between two disks when the inter-disk distance is small enough.

preprint2009arXiv

Observation of an exceptional point in a chaotic optical microcavity

We present spectroscopic observation of an exceptional point or the transition point between diabatic crossing and avoided crossing of neighboring quasi-eigenmodes in a chaotic optical microcavity with a large size parameter. The transition to the avoided crossing was impeded until the degree of deformation exceeded a threshold deformation owing to the system's openness also enhanced by the shape deformation. As a result, a singular topology was observed around the exceptional point on the eigenfrequency surfaces, resulting in fundamental inconsistency in mode labeling.

Soo-Young Lee

What is connected

Connect this record

See the researcher in context

Building this map preview

14 published item(s)

Hierarchical GPT with Congruent Transformers for Multi-Sentence Language Models

Semi-supervised Disentanglement with Independent Vector Variational Autoencoders

Compositional Sentence Representation from Character within Large Context Text

Deep CNNs along the Time Axis with Intermap Pooling for Robustness to Spectral Variations

A Novel Analytic Approach to Model Line Edge Roughness using Stochastic Exposure Distribution in Electron-beam Lithography

Exceptional points in coupled dissipative dynamical systems

Abnormal high-$Q$ modes of coupled stadium-shaped microcavities

Quantum Goos-Hänchen shift and tunneling transmission at a curved step potential

Hierarchical Data Representation Model - Multi-layer NMF

Analysis of multiple exceptional points related to three interacting eigenmodes in a non-Hermitian Hamiltonian

Sticky Normal-Superconductor Interface

Quasiscarred modes and their branching behavior at an exceptional point

Coupled non-identical microdisks: avoided crossing of energy levels and unidirectional far-field emission

Observation of an exceptional point in a chaotic optical microcavity