Researcher profile

Sunghun Kim

Sunghun Kim contributes to research discovery and scholarly infrastructure.

Researcher. Affiliation not imported. Open to collaborate.

Trust snapshot

Quick read

Trust 21 (Emerging), Verification L1, Unclaimed author
7 works
0 followers
7 topics
4 close collaborators

Published work

7 published item(s)

Preprint, 2026, arXiv

Solar Open Technical Report

We introduce Solar Open, a 102B-parameter bilingual Mixture-of-Experts language model for underserved languages. Solar Open demonstrates a systematic methodology for building competitive LLMs by addressing three interconnected challenges. First, to train effectively despite data scarcity for underserved languages, we synthesize 4.5T tokens of high-quality, domain-specific, and RL-oriented data. Second, we coordinate this data through a progressive curriculum jointly optimizing composition, quality thresholds, and domain coverage across 20 trillion tokens. Third, to enable reasoning capabilities through scalable RL, we apply our proposed framework SnapPO for efficient optimization. Across benchmarks in English and Korean, Solar Open achieves competitive performance, demonstrating the effectiveness of this methodology for underserved language AI development.

Preprint, 2022, arXiv

Decoupled Side Information Fusion for Sequential Recommendation

Side information fusion for sequential recommendation (SR) aims to effectively leverage various side information to enhance the performance of next-item prediction. Most state-of-the-art methods build on self-attention networks and focus on exploring various solutions to integrate the item embedding and side information embeddings before the attention layer. However, our analysis shows that the early integration of various types of embeddings limits the expressiveness of attention matrices due to a rank bottleneck and constrains the flexibility of gradients. Also, it involves mixed correlations among the different heterogeneous information resources, which brings extra disturbance to attention calculation. Motivated by this, we propose Decoupled Side Information Fusion for Sequential Recommendation (DIF-SR), which moves the side information from the input to the attention layer and decouples the attention calculation of various side information and item representation. We theoretically and empirically show that the proposed solution allows higher-rank attention matrices and flexible gradients to enhance the modeling capacity of side information fusion. Also, auxiliary attribute predictors are proposed to further activate the beneficial interaction between side information and item representation learning. Extensive experiments on four real-world datasets demonstrate that our proposed solution stably outperforms state-of-the-art SR models. Further studies show that our proposed solution can be readily incorporated into current attention-based SR models and significantly boost performance. Our source code is available at https://github.com/AIM-SE/DIF-SR.
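The contrast the abstract draws between early fusion and decoupled fusion can be illustrated with a minimal single-head sketch in plain Python. This is not the authors' implementation (see the linked repository for that). The function names, fusing the score matrices by addition, taking values from the item embeddings only, and omitting the causal mask, positional embeddings, and auxiliary attribute predictors are all assumptions made for illustration.

```python
import math

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(x * y for x, y in zip(row, col)) for col in zip(*b)] for row in a]

def transpose(m):
    return [list(r) for r in zip(*m)]

def add(a, b):
    """Element-wise sum of two equally shaped matrices."""
    return [[x + y for x, y in zip(ra, rb)] for ra, rb in zip(a, b)]

def softmax_rows(m):
    out = []
    for row in m:
        mx = max(row)
        exps = [math.exp(v - mx) for v in row]
        total = sum(exps)
        out.append([e / total for e in exps])
    return out

def scores(x, w_q, w_k):
    """Scaled dot-product attention scores for one embedding type."""
    q, k = matmul(x, w_q), matmul(x, w_k)
    d = len(w_q[0])
    return [[v / math.sqrt(d) for v in row] for row in matmul(q, transpose(k))]

def early_fusion_attention(item, side, w_q, w_k, w_v):
    """Baseline: sum the embeddings first, then run one attention over the mix."""
    fused = add(item, side)
    attn = softmax_rows(scores(fused, w_q, w_k))
    return matmul(attn, matmul(fused, w_v))

def decoupled_attention(item, side, item_proj, side_proj, w_v):
    """Decoupled sketch: separate score matrices per embedding type,
    fused (here by addition, an assumption) before the softmax.
    Values are computed from item embeddings only (also an assumption)."""
    fused_scores = add(scores(item, *item_proj), scores(side, *side_proj))
    attn = softmax_rows(fused_scores)
    return attn, matmul(attn, matmul(item, w_v))
```

In the early-fusion baseline a single query/key pair has to account for every embedding type at once, whereas the decoupled variant gives each type its own projections and combines only the resulting score matrices, which is the property the abstract credits for higher-rank attention.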

Preprint, 2022, arXiv

Evolutionary Preference Learning via Graph Nested GRU ODE for Session-based Recommendation

Session-based recommendation (SBR) aims to predict the user's next action based on the ongoing session. Recently, there has been increasing interest in modeling the evolution of user preferences to capture fine-grained user interests. While latent user preferences behind the sessions drift continuously over time, most existing approaches still model the temporal session data in discrete state spaces, which are incapable of capturing the fine-grained preference evolution and result in sub-optimal solutions. To this end, we propose Graph Nested GRU ordinary differential equation (ODE), namely GNG-ODE, a novel continuum model that extends the idea of neural ODEs to continuous-time temporal session graphs. The proposed model preserves the continuous nature of dynamic user preferences, encoding both temporal and structural patterns of item transitions into continuous-time dynamic embeddings. As existing ODE solvers do not consider graph structure change and thus cannot be directly applied to dynamic graphs, we propose a time alignment technique, called t-Alignment, to align the updating time steps of the temporal session graphs within a batch. Empirical results on three benchmark datasets show that GNG-ODE significantly outperforms other baselines.

Preprint, 2020, arXiv

A weak topological insulator state in quasi-one-dimensional superconductor TaSe$_3$

A well-established way to find novel Majorana particles in a solid-state system is to have superconductivity arising from the topological electronic structure. To this end, the heterostructure systems that consist of normal superconductor and topological material have been actively explored in the past decade. However, a search for the single material system that simultaneously exhibits intrinsic superconductivity and topological phase has been largely limited, although such a system is far more favorable especially for the quantum device applications. Here, we report the electronic structure study of a quasi-one-dimensional (q1D) superconductor TaSe$_3$. Our results of angle-resolved photoemission spectroscopy (ARPES) and first-principles calculation clearly show that TaSe$_3$ is a topological superconductor. The characteristic bulk inversion gap, in-gap state and its shape of non-Dirac dispersion concurrently point to the topologically nontrivial nature of this material. The further investigations of the Z$_2$ indices and the topologically distinctive surface band crossings disclose that it belongs to the weak topological insulator (WTI) class. Hereby, TaSe$_3$ becomes the first verified example of an intrinsic 1D topological superconductor. It hopefully provides a promising platform for future applications utilizing Majorana bound states localized at the end of 1D intrinsic topological superconductors.

Preprint, 2020, arXiv

ClovaCall: Korean Goal-Oriented Dialog Speech Corpus for Automatic Speech Recognition of Contact Centers

Automatic speech recognition (ASR) via call is essential for various applications, including AI for contact center (AICC) services. Despite advances in ASR, however, most publicly available call-based speech corpora, such as Switchboard, are dated. Also, most existing call corpora are in English and mainly focus on open-domain dialog or general scenarios such as audiobooks. Here we introduce ClovaCall, a new large-scale Korean call-based speech corpus collected under a goal-oriented dialog scenario from more than 11,000 people. ClovaCall includes approximately 60,000 pairs of a short sentence and its corresponding spoken utterance in a restaurant reservation domain. We validate the effectiveness of our dataset with intensive experiments using two standard ASR models. Furthermore, we release the ClovaCall dataset and baseline source code at https://github.com/ClovaAI/ClovaCall.

Preprint, 2020, arXiv

Proximity-induced hidden order transition in a correlated heterostructure Sr$_2$VO$_3$FeAs

Symmetry is one of the most significant concepts in physics, and its importance is manifested largely in phase transitions through its spontaneous breaking. In strongly correlated systems, however, enigmatic phase transitions that elude a symmetry description have been discovered and are often dubbed hidden order transitions, as found in, e.g., high-$T_C$ cuprates, heavy fermion superconductors, and quantum spin liquid candidates. Here, we report a new type of hidden order transition in a correlated heterostructure Sr$_2$VO$_3$FeAs, whose origin is attributed to an unusually enhanced Kondo-type proximity coupling between localized spins of V and itinerant electrons of FeAs. Most notably, a fully isotropic gap opening, identified by angle-resolved photoemission spectroscopy, occurs selectively in one of the Fermi surfaces below $T_{\rm HO} \sim 150$ K, associated with a singular behavior of the specific heat and a strong enhancement of the anisotropic magnetoresistance. These observations are incompatible with the prevalent broken-symmetry-driven scenarios of electronic gap opening and highlight a critical role of proximity coupling. Our findings demonstrate that correlated heterostructures offer a novel platform for design and engineering of exotic hidden order phases.

Preprint, 2019, arXiv

Lifted electron pocket and reversed orbital occupancy imbalance in FeSe

The FeSe nematic phase has been the focus of recent research on iron-based superconductors (IBSs) due to its unique properties. A number of electronic structure studies were performed to find the origin of the phase. However, these attempts produced conflicting results and caused additional controversies. Here, we report results from angle-resolved photoemission and X-ray absorption spectroscopy studies on FeSe detwinned by a piezo stack. We have fully resolved band dispersions with orbital characters near the Brillouin zone corner, which reveal the absence of a Fermi pocket at the Y point in the 1Fe Brillouin zone. In addition, the occupation imbalance between dxz and dyz orbitals is found to be opposite to that of iron pnictides, which is consistent with the identified band characters. These results settle controversial issues in the FeSe nematic phase and shed light on the origin of nematic phases in IBSs.