Researcher profile

Sunjin Choi

Sunjin Choi contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
3topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

Rethinking Network Topologies for Cost-Effective Mixture-of-Experts LLM Serving

Mixture-of-experts (MoE) architectures have turned LLM serving into a cluster-scale workload in which communication consumes a considerable portion of LLM serving runtime. This has prompted industry to invest heavily in expensive high-bandwidth scale-up networks. We question whether such costly infrastructure is strictly necessary. We present the first systematic cross-layer analysis of network cost-effectiveness for MoE LLM serving, comparing four representative XPU (e.g., GPU/TPU) topologies (scale-up, scale-out, 3D torus, and 3D full-mesh). We find that lower-cost switchless topologies are more cost-effective than the scale-up topology across all serving scenarios explored, improving cost-effectiveness by 20.6-56.2%. In particular, the 3D full-mesh topology is Pareto-optimal in terms of the performance-cost tradeoff. We also find that current scale-up link bandwidths are over-provisioned: reducing the link bandwidth improves throughput per cost by up to 27%. A forward-looking analysis of upcoming GPU generations indicates that the cost-performance advantage of switchless networks will likely persist.

preprint2022arXiv

From giant gravitons to black holes

We study AdS$_5$ black holes from a recently suggested giant graviton expansion formula for the index of $U(N)$ maximal super-Yang-Mills theory. We compute the large $N$ entropy at fixed charges and giant graviton numbers $n_I$ by a saddle point analysis, and further maximize it in $n_I$. This agrees with the dual black hole entropy in the small black hole limit. To get black holes at general sizes, one should note that various giant graviton indices cancel because gauge theory does not suffer from a Hagedorn-like pathology by an infinite baryonic tower. With one assumption on the mechanism of this cancellation, we account for the dual black hole entropy at general sizes. We interpret our results as analytic continuations of the large $N$ free energies of SCFTs, and based on it compute the entropies of AdS$_{4,7}$ black holes from M5, M2 giant gravitons.

preprint2022arXiv

Supersymmetric Spectral Form Factor and Euclidean Black Holes

The late-time behavior of spectral form factor (SFF) encodes the inherent discreteness of a quantum system, which should be generically non-vanishing. We study an index analog of the microcanonical spectrum form factor in four-dimensional $\mathcal{N}=4$ super Yang-Mills theory. In the large $N$ limit and at large enough energy, the most dominant saddle corresponds to the black hole in the AdS bulk. This gives rise to the slope that decreases exponentially for a small imaginary chemical potential, which is a natural analog of an early time. We find that the `late-time' behavior is governed by the multi-cut saddles that arise in the index matrix model, which are non-perturbatively sub-dominant at early times. These saddles become dominant at late times, preventing the SFF from decaying. These multi-cut saddles correspond to the orbifolded Euclidean black holes in the AdS bulk, therefore giving the geometrical interpretation of the `ramp.' Our analysis is done in the standard AdS/CFT setting without ensemble average or wormholes.

preprint2020arXiv

Universal 3d Cardy Block and Black Hole Entropy

We discuss the Cardy limit of 3d supersymmetric partition functions which allow the factorization into the hemisphere indices: the generalized superconformal index, the refined topologically twisted index and the squashed sphere partition function. In the Cardy limit, the hemisphere index can be evaluated by the saddle point approximation where there exists a dominant saddle point contribution, which we call the Cardy block. The Cardy block turns out to be a simple but powerful object as it is a building block of other partition functions in the Cardy limit. The factorization to the Cardy block allows us to find universal relations among the partition functions, which we formulate as index theorems. Furthermore, if we consider a holographic 3d SCFT and its large $N$ limit, those partition functions relate to various entropic quantities of the dual gravity theory in AdS$_4$. As a result, our result provides the microscopic derivation of the universal relations among those entropic quantities of the gravity theory. We also discuss explicit examples, which confirm our general index theorems.