Researcher profile

Jungin Lee

Jungin Lee contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
6works
0followers
7topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

6 published item(s)

preprint2026arXiv

Distribution of the cokernels of determinantal row-sparse matrices

We study the distribution of the cokernels of random row-sparse integral matrices $A_n$ according to the determinantal measure from a structured matrix $B_n$ with a parameter $k_n \ge 3$. Under a mild assumption on the growth rate of $k_n$, we prove that the distribution of the $p$-Sylow subgroup of the cokernel of $A_n$ converges to that of Cohen--Lenstra for every prime $p$. Our result extends the work of A. Mészáros which established convergence to the Cohen--Lenstra distribution when $p \ge 5$ and $k_n=3$ for all positive integers $n$.

preprint2022arXiv

Jordan--Landau theorem for matrices over finite fields

Given a positive integer $r$ and a prime power $q$, we estimate the probability that the characteristic polynomial $f_{A}(t)$ of a random matrix $A$ in $\mathrm{GL}_{n}(\mathbb{F}_{q})$ is square-free with $r$ (monic) irreducible factors when $n$ is large. We also estimate the analogous probability that $f_{A}(t)$ has $r$ irreducible factors counting with multiplicity. In either case, the main term $(\log n)^{r-1}((r-1)!n)^{-1}$ and the error term $O((\log n)^{r-2}n^{-1})$, whose implied constant only depends on $r$ but not on $q$ nor $n$, coincide with the probability that a random permutation on $n$ letters is a product of $r$ disjoint cycles. The main ingredient of our proof is a recursion argument due to S. D. Cohen, which was previously used to estimate the probability that a random degree $n$ monic polynomial in $\mathbb{F}_{q}[t]$ is square-free with $r$ irreducible factors and the analogous probability that the polynomial has $r$ irreducible factors counting with multiplicity. We obtain our result by carefully modifying Cohen's recursion argument in the matrix setting, using Reiner's theorem that counts the number of $n \times n$ matrices with a fixed characteristic polynomial over $\mathbb{F}_{q}$.

preprint2021arXiv

Adaptable Multi-Domain Language Model for Transformer ASR

We propose an adapter based multi-domain Transformer based language model (LM) for Transformer ASR. The model consists of a big size common LM and small size adapters. The model can perform multi-domain adaptation with only the small size adapters and its related layers. The proposed model can reuse the full fine-tuned LM which is fine-tuned using all layers of an original model. The proposed LM can be expanded to new domains by adding about 2% of parameters for a first domain and 13% parameters for after second domain. The proposed model is also effective in reducing the model maintenance cost because it is possible to omit the costly and time-consuming common LM pre-training process. Using proposed adapter based approach, we observed that a general LM with adapter can outperform a dedicated music domain LM in terms of word error rate (WER).

preprint2020arXiv

Attention based on-device streaming speech recognition with large speech corpus

In this paper, we present a new on-device automatic speech recognition (ASR) system based on monotonic chunk-wise attention (MoChA) models trained with large (> 10K hours) corpus. We attained around 90% of a word recognition rate for general domain mainly by using joint training of connectionist temporal classifier (CTC) and cross entropy (CE) losses, minimum word error rate (MWER) training, layer-wise pre-training and data augmentation methods. In addition, we compressed our models by more than 3.4 times smaller using an iterative hyper low-rank approximation (LRA) method while minimizing the degradation in recognition accuracy. The memory footprint was further reduced with 8-bit quantization to bring down the final model size to lower than 39 MB. For on-demand adaptation, we fused the MoChA models with statistical n-gram models, and we could achieve a relatively 36% improvement on average in word error rate (WER) for target domains including the general domain.

preprint2019arXiv

On a number of isogeny classes of simple abelian varieties over finite fields

In this paper, we investigate the asymptotic behavior of the number $s_q(g)$ of isogeny classes of simple abelian varieties of dimension $g$ over a finite field $\mathbb{F}_q$. We prove that the logarithmic asymptotic of $s_q(g)$ is the same as the logarithmic asymptotic of the number $m_q(g)$ of isogeny classes of all abelian varieties of dimension $g$ over $\mathbb{F}_q$. We also prove that $$ \limsup_{g \rightarrow \infty} \frac{s_q(g)}{m_q(g)}=1. $$ This suggests that there are much more simple isogeny classes of abelian varieties over $\mathbb{F}_q$ of dimension $g$ than non-simple ones for sufficiently large $g$, which can be understood as the opposite situation to a main result of Lipnowski and Tsimerman (Duke Math 167:3403-3453, 2018).