Researcher profile

Si-Hyeon Lee

Si-Hyeon Lee contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
10works
0followers
9topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

10 published item(s)

preprint2026arXiv

Accelerating LMO-Based Optimization via Implicit Gradient Transport

Recent optimizers such as Lion and Muon have demonstrated strong empirical performance by normalizing gradient momentum via linear minimization oracles (LMOs). While variance reduction has been explored to accelerate LMO-based methods, it typically incurs substantial computational overhead due to additional gradient evaluations. At the same time, the theoretical understanding of LMO-based methods remains fragmented across unconstrained and constrained formulations. Motivated by these limitations, we propose \emph{LMO-IGT}, a new class of stochastic LMO-based methods leveraging implicit gradient transport (IGT). We further introduce a unified framework for stochastic LMO-based optimization together with a new stationarity measure, the \emph{regularized support function} (RSF), which bridges gradient-norm and Frank--Wolfe-gap notions within a common framework. By evaluating stochastic gradients at transported points, LMO-IGT accelerates convergence while retaining the single-gradient-per-iteration structure of standard stochastic LMO. Our analysis establishes that stochastic LMO achieves an iteration complexity of $\mathcal{O}(\varepsilon^{-4})$, variance-reduced LMO achieves $\mathcal{O}(\varepsilon^{-3})$ at the cost of additional gradient evaluations, and LMO-IGT achieves $\mathcal{O}(\varepsilon^{-3.5})$ using only a single stochastic gradient per iteration. Empirically, LMO-IGT consistently improves over stochastic LMO counterparts with negligible overhead. Among its instantiations, Muon-IGT achieves the strongest overall performance across evaluated settings, demonstrating that IGT provides an effective and practical acceleration mechanism for modern LMO-based optimization.

preprint2026arXiv

Enhancing Sum Capacity via Quantum and No-Signaling Cooperation Between Transmitters

We consider a communication scenario over a discrete memoryless interference channel or multiple access channel without feedback, where transmitters exploit classical, quantum, or no-signaling cooperation. In this scenario, several previous works have shown that the sum capacities of channels involving pseudo-telepathy games can be enhanced by quantum or no-signaling cooperation. However, a full characterization of which channels admit such an improvement remains open. By focusing on the common characteristics of previously studied channels, we propose a broader class of channels for which quantum or no-signaling cooperation increases the sum capacity. Channels in this class are associated with a pseudo-telepathy game, with channel inputs specified as tuples of questions and answers from the game. In addition, when the channel inputs satisfy the winning condition of the game, the channel decomposes into parallel weakly symmetric sub-channels and is less noisy compared to the case when the inputs do not meet the winning condition.

preprint2026arXiv

LENS: Low-Frequency Eigen Noise Shaping for Efficient Diffusion Sampling

Distilled diffusion models accelerate image generation by reducing the number of denoising steps, but often suffer from degraded image quality. To mitigate this trade-off, test-time optimization methods improve quality, yet their iterative nature incurs substantial computational overhead and leads to slow inference, limiting practical usability. Recent hypernetwork-based approaches amortize this process during training, but still require costly noise modulation in high-dimensional latent spaces. In this work, we propose LENS (Low-frequency Eigen Noise Shaping), an efficient noise modulation framework that operates in a low-dimensional subspace. Our approach is motivated by the observation that low-frequency components of the noise largely determine the global structure and visual fidelity of generated images. Based on this observation, we provide a theoretical justification for restricting modulation to the low-frequency subspace and derive a principled training objective. Building on this, LENS employs a lightweight, standalone network to selectively modulate these components, enabling efficient and targeted noise modulation. Extensive experiments demonstrate that LENS achieves competitive image quality while reducing FLOPs by 400-700$\times$, model parameters by 25-75$\times$, and inference-time overhead by 10-20$\times$ compared to prior methods.

preprint2022arXiv

Anti-Jamming Games in Multi-Band Wireless Ad Hoc Networks

For multi-band wireless ad hoc networks of multiple users, an anti-jamming game between the users and a jammer is studied. In this game, the users (resp. jammer) want to maximize (resp. minimize) the expected rewards of the users taking into account various factors such as communication rate, hopping cost, and jamming loss. We analyze the arms race of the game and derive an optimal frequency hopping policy at each stage of the arms race based on the Markov decision process (MDP). It is analytically shown that the arms race reaches an equilibrium after a few rounds, and a frequency hopping policy and a jamming strategy at the equilibrium are characterized. We propose two kinds of collision avoidance protocols to ensure that at most one user communicates in each frequency band, and provide various numerical results that show the effects of the reward parameters and collision avoidance protocols on the optimal frequency hopping policy and the expected rewards at the equilibrium. Moreover, we discuss about equilibria for the case where the jammer adopts some unpredictable jamming strategies.

preprint2021arXiv

Some results on r-truncated degenerate Poisson Random Variables

The zero-truncated Poisson distributions are certain discrete probability distributions whose supports are the set of positive integers, which are also known as the conditional Poisson distributions or the positive Poisson distributions. In this paper, we introduce the r-truncated degenerate Poisson random variable with parameter a and investigate various properties of this random variable

preprint2020arXiv

Mobility-Assisted Covert Communication over Wireless Ad Hoc Networks

We study the effect of node mobility on the throughput scaling of the covert communication over a wireless adhoc network. It is assumed that $n$ mobile nodes want to communicate each other in a unit disk while keeping the presence of the communication secret from each of $Θ(n^s)$ non-colluding wardens ($s>0$). Our results show that the node mobility greatly improves the throughput scaling, compared to the case of fixed node location. In particular, for $0<s<1$, the aggregate throughput scaling is shown to be linear in $n$ when the number of channel uses that each warden uses to judge the presence of communication is not too large compared to $n$. For the achievability, we modify the two-hop based scheme by Grossglauser and Tse (2002), which was proposed for a wireless ad hoc network without a covertness constraint, by introducing a preservation region around each warden in which the senders are not allowed to transmit and by carefully analyzing the effect of covertness constraint on the transmit power and the resultant transmission rates. This scheme is shown to be optimal for $0<s<1$ under an assumption that each node outside preservation regions around wardens uses the same transmit power.

preprint2020arXiv

Secrecy Capacity of a Gaussian Wiretap Channel With ADCs is Always Positive

We consider a complex Gaussian wiretap channel with finite-resolution analog-to-digital converters (ADCs) at both the legitimate receiver and the eavesdropper. For this channel, we show that a positive secrecy rate is always achievable as long as the channel gains at the legitimate receiver and at the eavesdropper are different, regardless of the quantization levels of the ADCs. For the achievability, we first consider the case of one-bit ADCs at the legitimate receiver and apply a binary input distribution where the two input points have the same phase when the channel gain at the legitimate receiver is less than that at the eavesdropper, and otherwise the opposite phase. Then the result is generalized for the case of arbitrary finite-resolution ADCs at the legitimate receiver by translating the input distribution appropriately. For the special case of the real Gaussian wiretap channel with one-bit ADCs at both the legitimate receiver and the eavesdropper, we show that our choice of input distribution satisfies a necessary condition of optimal distributions for Wyner codes.

preprint2020arXiv

Treating Interference as Noise is Optimal for Covert Communication over Interference Channels

We study the covert communication over K-user discrete memoryless interference channels (DM-ICs) with a warden. It is assumed that the warden&#39;s channel output distribution induced by K &#34;off&#34; input symbols, which are sent when no communication occurs, is not a convex combination of those induced by any other combination of input symbols (otherwise, the square-root law does not hold). We derive the exact covert capacity region and show that a simple point-to-point based scheme with treating interference as noise is optimal. In addition, we analyze the secret key length required for the reliable and covert communication with the desired rates, and present a channel condition where a secret key between each user pair is unnecessary. The results are extended to the Gaussian case and the case with multiple wardens.

preprint2013arXiv

A New Achievable Scheme for Interference Relay Channels

We establish an achievable rate region for discrete memoryless interference relay channels that consist of two source-destination pairs and one or more relays. We develop an achievable scheme combining Han-Kobayashi and noisy network coding schemes. We apply our achievability to two cases. First, we characterize the capacity region of a class of discrete memoryless interference relay channels. This class naturally generalizes the injective deterministic discrete memoryless interference channel by El Gamal and Costa and the deterministic discrete memoryless relay channel with orthogonal receiver components by Kim. Moreover, for the Gaussian interference relay channel with orthogonal receiver components, we show that our scheme achieves a better sum rate than that of noisy network coding.

preprint2011arXiv

Capacity Scaling of Wireless Ad Hoc Networks: Shannon Meets Maxwell

In this paper, we characterize the information-theoretic capacity scaling of wireless ad hoc networks with $n$ randomly distributed nodes. By using an exact channel model from Maxwell&#39;s equations, we successfully resolve the conflict in the literature between the linear capacity scaling by Özgür et al. and the degrees of freedom limit given as the ratio of the network diameter and the wavelength $λ$ by Franceschetti et al. In dense networks where the network area is fixed, the capacity scaling is given as the minimum of $n$ and the degrees of freedom limit $λ^{-1}$ to within an arbitrarily small exponent. In extended networks where the network area is linear in $n$, the capacity scaling is given as the minimum of $n$ and the degrees of freedom limit $\sqrt{n}λ^{-1}$ to within an arbitrarily small exponent. Hence, we recover the linear capacity scaling by Özgür et al. if $λ=O(n^{-1})$ in dense networks and if $λ=O(n^{-1/2})$ in extended networks. Otherwise, the capacity scaling is given as the degrees of freedom limit characterized by Franceschetti et al. For achievability, a modified hierarchical cooperation is proposed based on a lower bound on the capacity of multiple-input multiple-output channel between two node clusters using our channel model.