Source author record

Hanbaek Lyu

Hanbaek Lyu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning math.PR math.CO math.DS math.OC math.ST nlin.CG nlin.PS nlin.SI Statistics Theory nlin.AO Populations and Evolution

Catalog footprint

What is connected

11works

12topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2023arXiv

Sampling random graph homomorphisms and applications to network data analysis

A graph homomorphism is a map between two graphs that preserves adjacency relations. We consider the problem of sampling a random graph homomorphism from a graph into a large network. We propose two complementary MCMC algorithms for sampling random graph homomorphisms and establish bounds on their mixing times and the concentration of their time averages. Based on our sampling algorithms, we propose a novel framework for network data analysis that circumvents some of the drawbacks in methods based on independent and neighborhood sampling. Various time averages of the MCMC trajectory give us various computable observables, including well-known ones such as homomorphism density and average clustering coefficient and their generalizations. Furthermore, we show that these network observables are stable with respect to a suitably renormalized cut distance between networks. We provide various examples and simulations demonstrating our framework through synthetic networks. We also \commHL{demonstrate the performance of} our framework on the tasks of network clustering and subgraph classification on the Facebook100 dataset and on Word Adjacency Networks of a set of classic novels.

preprint2022arXiv

Learning to predict synchronization of coupled oscillators on randomly generated graphs

Suppose we are given a system of coupled oscillators on an unknown graph along with the trajectory of the system during some period. Can we predict whether the system will eventually synchronize? Even with a known underlying graph structure, this is an important yet analytically intractable question in general. In this work, we take an alternative approach to the synchronization prediction problem by viewing it as a classification problem based on the fact that any given system will eventually synchronize or converge to a non-synchronizing limit cycle. By only using some basic statistics of the underlying graphs such as edge density and diameter, our method can achieve perfect accuracy when there is a significant difference in the topology of the underlying graphs between the synchronizing and the non-synchronizing examples. However, in the problem setting where these graph statistics cannot distinguish the two classes very well (e.g., when the graphs are generated from the same random graph model), we find that pairing a few iterations of the initial dynamics along with the graph statistics as the input to our classification algorithms can lead to significant improvement in accuracy; far exceeding what is known by the classical oscillator theory. More surprisingly, we find that in almost all such settings, dropping out the basic graph statistics and training our algorithms with only initial dynamics achieves nearly the same accuracy. We demonstrate our method on three models of continuous and discrete coupled oscillators -- the Kuramoto model, Firefly Cellular Automata, and Greenberg-Hastings model. Finally, we also propose an "ensemble prediction" algorithm that successfully scales our method to large graphs by training on dynamics observed from multiple random subgraphs.

preprint2022arXiv

Online nonnegative CP-dictionary learning for Markovian data

Online Tensor Factorization (OTF) is a fundamental tool in learning low-dimensional interpretable features from streaming multi-modal data. While various algorithmic and theoretical aspects of OTF have been investigated recently, a general convergence guarantee to stationary points of the objective function without any incoherence or sparsity assumptions is still lacking even for the i.i.d. case. In this work, we introduce a novel algorithm that learns a CANDECOMP/PARAFAC (CP) basis from a given stream of tensor-valued data under general constraints, including nonnegativity constraints that induce interpretability of the learned CP basis. We prove that our algorithm converges almost surely to the set of stationary points of the objective function under the hypothesis that the sequence of data tensors is generated by an underlying Markov chain. Our setting covers the classical i.i.d. case as well as a wide range of application contexts including data streams generated by independent or MCMC sampling. Our result closes a gap between OTF and Online Matrix Factorization in global convergence analysis \commHL{for CP-decompositions}. Experimentally, we show that our algorithm converges much faster than standard algorithms for nonnegative tensor factorization tasks on both synthetic and real-world data. Also, we demonstrate the utility of our algorithm on a diverse set of examples from image, video, and time-series data, illustrating how one may learn qualitatively different CP-dictionaries from the same tensor data by exploiting the tensor structure in multiple ways.

preprint2022arXiv

Supervised Dictionary Learning with Auxiliary Covariates

Supervised dictionary learning (SDL) is a classical machine learning method that simultaneously seeks feature extraction and classification tasks, which are not necessarily a priori aligned objectives. The goal of SDL is to learn a class-discriminative dictionary, which is a set of latent feature vectors that can well-explain both the features as well as labels of observed data. In this paper, we provide a systematic study of SDL, including the theory, algorithm, and applications of SDL. First, we provide a novel framework that `lifts' SDL as a convex problem in a combined factor space and propose a low-rank projected gradient descent algorithm that converges exponentially to the global minimizer of the objective. We also formulate generative models of SDL and provide global estimation guarantees of the true parameters depending on the hyperparameter regime. Second, viewed as a nonconvex constrained optimization problem, we provided an efficient block coordinate descent algorithm for SDL that is guaranteed to find an $\varepsilon$-stationary point of the objective in $O(\varepsilon^{-1}(\log \varepsilon^{-1})^{2})$ iterations. For the corresponding generative model, we establish a novel non-asymptotic local consistency result for constrained and regularized maximum likelihood estimation problems, which may be of independent interest. Third, we apply SDL for imbalanced document classification by supervised topic modeling and also for pneumonia detection from chest X-ray images. We also provide simulation studies to demonstrate that SDL becomes more effective when there is a discrepancy between the best reconstructive and the best discriminative dictionaries.

preprint2020arXiv

COVID-19 Time-series Prediction by Joint Dictionary Learning and Online NMF

Predicting the spread and containment of COVID-19 is a challenge of utmost importance that the broader scientific community is currently facing. One of the main sources of difficulty is that a very limited amount of daily COVID-19 case data is available, and with few exceptions, the majority of countries are currently in the "exponential spread stage," and thus there is scarce information available which would enable one to predict the phase transition between spread and containment. In this paper, we propose a novel approach to predicting the spread of COVID-19 based on dictionary learning and online nonnegative matrix factorization (online NMF). The key idea is to learn dictionary patterns of short evolution instances of the new daily cases in multiple countries at the same time, so that their latent correlation structures are captured in the dictionary patterns. We first learn such patterns by minibatch learning from the entire time-series and then further adapt them to the time-series by online NMF. As we progressively adapt and improve the learned dictionary patterns to the more recent observations, we also use them to make one-step predictions by the partial fitting. Lastly, by recursively applying the one-step predictions, we can extrapolate our predictions into the near future. Our prediction results can be directly attributed to the learned dictionary patterns due to their interpretability.

preprint2020arXiv

Double jump phase transition in a soliton cellular automaton

In this paper, we consider the soliton cellular automaton introduced in [Takahashi 1990] with a random initial configuration. We give multiple constructions of a Young diagram describing various statistics of the system in terms of familiar objects like birth-and-death chains and Galton-Watson forests. Using these ideas, we establish limit theorems showing that if the first $n$ boxes are occupied independently with probability $p\in(0,1)$, then the number of solitons is of order $n$ for all $p$, and the length of the longest soliton is of order $\log n$ for $p<1/2$, order $\sqrt{n}$ for $p=1/2$, and order $n$ for $p>1/2$. Additionally, we uncover a condensation phenomenon in the supercritical regime: For each fixed $j\geq 1$, the top $j$ soliton lengths have the same order as the longest for $p\leq 1/2$, whereas all but the longest have order at most $\log n$ for $p>1/2$. As an application, we obtain scaling limits for the lengths of the $k^{\text{th}}$ longest increasing and decreasing subsequences in a random stack-sortable permutation of length $n$ in terms of random walks and Brownian excursions.

preprint2020arXiv

Phase transition in random contingency tables with non-uniform margins

For parameters $n,δ,B,$ and $C$, let $X=(X_{k\ell})$ be the random uniform contingency table whose first $\lfloor n^δ \rfloor $ rows and columns have margin $\lfloor BCn \rfloor$ and the last $n$ rows and columns have margin $\lfloor Cn \rfloor$. For every $0<δ<1$, we establish a sharp phase transition of the limiting distribution of each entry of $X$ at the critical value $B_{c}=1+\sqrt{1+1/C}$. In particular, for $1/2<δ<1$, we show that the distribution of each entry converges to a geometric distribution in total variation distance, whose mean depends sensitively on whether $B<B_{c}$ or $B>B_{c}$. Our main result shows that $\mathbb{E}[X_{11}]$ is uniformly bounded for $B<B_{c}$, but has sharp asymptotic $C(B-B_{c}) n^{1-δ}$ for $B>B_{c}$. We also establish a strong law of large numbers for the row sums in top right and top left blocks.

preprint2020arXiv

Stretched exponential decay for subcritical parking times on $\mathbb{Z}^d$

In the parking model on $\mathbb{Z}^d$, each vertex is initially occupied by a car (with probability $p$) or by a vacant parking spot (with probability $1-p$). Cars perform independent random walks and when they enter a vacant spot, they park there, thereby rendering the spot occupied. Cars visiting occupied spots simply keep driving (continuing their random walk). It is known that $p=1/2$ is a critical value in the sense that the origin is a.s. visited by finitely many distinct cars when $p<1/2$, and by infinitely many distinct cars when $p\geq 1/2$. Furthermore, any given car a.s. eventually parks for $p \leq 1/2$ and with positive probability does not park for $p > 1/2$. We study the subcritical phase and prove that the tail of the parking time $τ$ of the car initially at the origin obeys the bounds \[ \exp\left( - C_1 t^{\frac{d}{d+2}}\right) \leq \mathbb{P}_p(τ> t) \leq \exp\left( - c_2 t^{\frac{d}{d+2}}\right) \] for $p>0$ sufficiently small. For $d=1$, we prove these inequalities for all $p \in [0,1/2)$. This result presents an asymmetry with the supercritical phase ($p>1/2$), where methods of Bramson--Lebowitz imply that for $d=1$ the corresponding tail of the parking time of the parking spot of the origin decays like $e^{-c\sqrt{t}}$. Our exponent $d/(d+2)$ also differs from those previously obtained in the case of moving obstacles.

preprint2019arXiv

Large deviations and one-sided scaling limit of randomized multicolor box-ball system

The basic $κ$-color box-ball (BBS) system is an integrable cellular automaton on one dimensional lattice whose local states take $\{0,1,\cdots,κ\}$ with $0$ regarded as an empty box. The time evolution is defined by a combinatorial rule of quantum group theoretical origin, and the complete set of conserved quantities is given by a $κ$-tuple of Young diagrams. In the randomized BBS, a probability distribution on $\{0,1,\cdots,κ\}$ to independently fill the consecutive $n$ sites in the initial state induces a highly nontrivial probability measure on the $κ$-tuple of those invariant Young diagrams. In a recent work \cite{kuniba2018randomized}, their large $n$ `equilibrium shape' has been determined in terms of Schur polynomials by a Markov chain method and also by a very different approach of Thermodynamic Bethe Ansatz (TBA). In this paper, we establish a large deviations principle for the row lengths of the invariant Young diagrams. As a corollary, they are shown to converge almost surely to the equilibrium shape at an exponential rate. We also refine the TBA analysis and obtain the exact scaling form of the vacancy, the row length and the column multiplicity, which exhibit nontrivial factorization in a one-parameter specialization.

preprint2012arXiv

A Note on Graph Characteristics and Hadwiger's Conjecture

This is a note on three graph parameters motivated by the Euler-Poincare characteristic for simplicial complex. We show those three graph parameters of a given connected graph $G$ is greater than or equal to that of the complete graph with $\max(h(G),χ(G))$ vertices. This will yield three different simultaneous upperbounds of both the hadwiger number and chromatic number by means of the number of particular types of induced subgraphs. Some applications to Hadwiger's Conjecture is also discussed.

preprint2012arXiv

Four-Dimensional Discrete-time Lotka-Volterra Models with an Application to Ecology

This paper presents a study of the two-predators-two-preys discrete-time Lotka-Volterra model with self- inhibition terms for preys with direct applications to ecological problems. Parameters in the model are modified so that each of them has its own biological meaning, enabling more intuitive interpretation of biological conditions to the model . Moreover, the modified version is applicable to simulate a part of a large ecosystem, not only a closed predator-prey system with four species. An easy graphical method of analysis of the conditions on parameters ensuring long persistence under coexistence is presented. As an application, it is explained that a predator specie who feed on relatively small number of preys compared to the other predator species must be selective on the available preys in order for long persistence of the ecosystem. This may be regarded as a theoretical explanation of the existence of flush-pursuer birds, those who uses highly specialized hunting strategy and cross-adapts to the ecosystem relative to the ordinary bird species.

Hanbaek Lyu

What is connected

Connect this record

See the researcher in context

Building this map preview

11 published item(s)

Sampling random graph homomorphisms and applications to network data analysis

Learning to predict synchronization of coupled oscillators on randomly generated graphs

Online nonnegative CP-dictionary learning for Markovian data

Supervised Dictionary Learning with Auxiliary Covariates

COVID-19 Time-series Prediction by Joint Dictionary Learning and Online NMF

Double jump phase transition in a soliton cellular automaton

Phase transition in random contingency tables with non-uniform margins

Stretched exponential decay for subcritical parking times on $\mathbb{Z}^d$

Large deviations and one-sided scaling limit of randomized multicolor box-ball system

A Note on Graph Characteristics and Hadwiger's Conjecture

Four-Dimensional Discrete-time Lotka-Volterra Models with an Application to Ecology