Source author record

Minki Kim

Minki Kim appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

math.CO Cryptography and Security Machine Learning Distributed, Parallel, and Cluster Computing Networking and Internet Architecture Neural and Evolutionary Computing

Catalog footprint

What is connected

8works

6topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2021arXiv

A system of disjoint representatives of line segments with given $k$ directions

We prove that for all positive integers $n$ and $k$, there exists an integer $N = N(n,k)$ satisfying the following. If $U$ is a set of $k$ direction vectors in the plane and $\mathcal{J}_U$ is the set of all line segments in direction $u$ for some $u\in U$, then for every $N$ families $\mathcal{F}_1, \ldots, \mathcal{F}_N$, each consisting of $n$ mutually disjoint segments in $\mathcal{J}_U$, there is a set $\{A_1, \ldots, A_n\}$ of $n$ disjoint segments in $\bigcup_{1\leq i\leq N}\mathcal{F}_i$ and distinct integers $p_1, \ldots, p_n\in \{1, \ldots, N\}$ satisfying that $A_j\in \mathcal{F}_{p_j}$ for all $j\in \{1, \ldots, n\}$. We generalize this property for underlying lines on fixed $k$ directions to $k$ families of simple curves with certain conditions.

preprint2021arXiv

Badges and rainbow matchings

Drisko proved that $2n-1$ matchings of size $n$ in a bipartite graph have a rainbow matching of size $n$. For general graphs it is conjectured that $2n$ matchings suffice for this purpose (and that $2n-1$ matchings suffice when $n$ is even). The known graphs showing sharpness of this conjecture for $n$ even are called badges. We improve the previously best known bound from $3n-2$ to $3n-3$, using a new line of proof that involves analysis of the appearance of badges. We also prove a "cooperative" generalization: for $t>0$ and $n \geq 3$, any $3n-4+t$ sets of edges, the union of every $t$ of which contains a matching of size $n$, have a rainbow matching of size $n$.

preprint2021arXiv

Domination numbers and noncover complexes of hypergraphs

Let $\mathcal{H}$ be a hypergraph on a finite set $V$. A {\em cover} of $\mathcal{H}$ is a set of vertices that meets all edges of $\mathcal{H}$. If $W$ is not a cover of $\mathcal{H}$, then $W$ is said to be a {\em noncover} of $\mathcal{H}$. The {\em noncover complex} of $\mathcal{H}$ is the abstract simplicial complex whose faces are the noncovers of $\mathcal{H}$. In this paper, we study homological properties of noncover complexes of hypergraphs. In particular, we obtain an upper bound on their Leray numbers. The bound is in terms of hypergraph domination numbers. Also, our proof idea is applied to compute the homotopy type of the noncover complexes of certain uniform hypergraphs, called {\em tight paths} and {\em tight cycles}. This extends to hypergraphs known results on graphs.

preprint2021arXiv

Evaluation and Optimization of Distributed Machine Learning Techniques for Internet of Things

Federated learning (FL) and split learning (SL) are state-of-the-art distributed machine learning techniques to enable machine learning training without accessing raw data on clients or end devices. However, their \emph{comparative training performance} under real-world resource-restricted Internet of Things (IoT) device settings, e.g., Raspberry Pi, remains barely studied, which, to our knowledge, have not yet been evaluated and compared, rendering inconvenient reference for practitioners. This work firstly provides empirical comparisons of FL and SL in real-world IoT settings regarding (i) learning performance with heterogeneous data distributions and (ii) on-device execution overhead. Our analyses in this work demonstrate that the learning performance of SL is better than FL under an imbalanced data distribution but worse than FL under an extreme non-IID data distribution. Recently, FL and SL are combined to form splitfed learning (SFL) to leverage each of their benefits (e.g., parallel training of FL and lightweight on-device computation requirement of SL). This work then considers FL, SL, and SFL, and mount them on Raspberry Pi devices to evaluate their performance, including training time, communication overhead, power consumption, and memory usage. Besides evaluations, we apply two optimizations. Firstly, we generalize SFL by carefully examining the possibility of a hybrid type of model training at the server-side. The generalized SFL merges sequential (dependent) and parallel (independent) processes of model training and is thus beneficial for a system with large-scaled IoT devices, specifically at the server-side operations. Secondly, we propose pragmatic techniques to substantially reduce the communication overhead by up to four times for the SL and (generalized) SFL.

preprint2020arXiv

Can We Use Split Learning on 1D CNN Models for Privacy Preserving Training?

A new collaborative learning, called split learning, was recently introduced, aiming to protect user data privacy without revealing raw input data to a server. It collaboratively runs a deep neural network model where the model is split into two parts, one for the client and the other for the server. Therefore, the server has no direct access to raw data processed at the client. Until now, the split learning is believed to be a promising approach to protect the client's raw data; for example, the client's data was protected in healthcare image applications using 2D convolutional neural network (CNN) models. However, it is still unclear whether the split learning can be applied to other deep learning models, in particular, 1D CNN. In this paper, we examine whether split learning can be used to perform privacy-preserving training for 1D CNN models. To answer this, we first design and implement an 1D CNN model under split learning and validate its efficacy in detecting heart abnormalities using medical ECG data. We observed that the 1D CNN model under split learning can achieve the same accuracy of 98.9\% like the original (non-split) model. However, our evaluation demonstrates that split learning may fail to protect the raw data privacy on 1D CNN models. To address the observed privacy leakage in split learning, we adopt two privacy leakage mitigation techniques: 1) adding more hidden layers to the client side and 2) applying differential privacy. Although those mitigation techniques are helpful in reducing privacy leakage, they have a significant impact on model accuracy. Hence, based on those results, we conclude that split learning alone would not be sufficient to maintain the confidentiality of raw sequential data in 1D CNN models.

preprint2020arXiv

End-to-End Evaluation of Federated Learning and Split Learning for Internet of Things

This work is the first attempt to evaluate and compare felderated learning (FL) and split neural networks (SplitNN) in real-world IoT settings in terms of learning performance and device implementation overhead. We consider a variety of datasets, different model architectures, multiple clients, and various performance metrics. For learning performance, which is specified by the model accuracy and convergence speed metrics, we empirically evaluate both FL and SplitNN under different types of data distributions such as imbalanced and non-independent and identically distributed (non-IID) data. We show that the learning performance of SplitNN is better than FL under an imbalanced data distribution, but worse than FL under an extreme non-IID data distribution. For implementation overhead, we end-to-end mount both FL and SplitNN on Raspberry Pis, and comprehensively evaluate overheads including training time, communication overhead under the real LAN setting, power consumption and memory usage. Our key observations are that under IoT scenario where the communication traffic is the main concern, the FL appears to perform better over SplitNN because FL has the significantly lower communication overhead compared with SplitNN, which empirically corroborate previous statistical analysis. In addition, we reveal several unrecognized limitations about SplitNN, forming the basis for future research.

preprint2020arXiv

The layer number of $α$-evenly distributed point sets

For a finite point set in $\mathbb{R}^d$, we consider a peeling process where the vertices of the convex hull are removed at each step. The layer number $L(X)$ of a given point set $X$ is defined as the number of steps of the peeling process in order to delete all points in $X$. It is known that if $X$ is a set of random points in $\mathbb{R}^d$, then the expectation of $L(X)$ is $Θ(|X|^{2/(d+1)})$, and recently it was shown that if $X$ is a point set of the square grid on the plane, then $L(X)=Θ(|X|^{2/3})$. In this paper, we investigate the layer number of $α$-evenly distributed point sets for $α>1$; these point sets share the regularity aspect of random point sets but in a more general setting. The set of lattice points is also an $α$-evenly distributed point set for some $α>1$. We find an upper bound of $O(|X|^{3/4})$ for the layer number of an $α$-evenly distributed point set $X$ in a unit disk on the plane for some $α>1$, and provide an explicit construction that shows the growth rate of this upper bound cannot be improved. In addition, we give an upper bound of $O(|X|^{\frac{d+1}{2d}})$ for the layer number of an $α$-evenly distributed point set $X$ in a unit ball in $\mathbb{R}^d$ for some $α>1$ and $d\geq 3$.

preprint2016arXiv

On tiling the integers with $4$-sets of the same gap sequence

Partitioning a set into similar, if not, identical, parts is a fundamental research topic in combinatorics. The question of partitioning the integers in various ways has been considered throughout history. Given a set $\{x_1, \ldots, x_n\}$ of integers where $x_1<\cdots<x_n$, let the {\it gap sequence} of this set be the nondecreasing sequence $d_1, \ldots, d_{n-1}$ where $\{d_1, \ldots, d_{n-1}\}$ equals $\{x_{i+1}-x_i:i\in\{1,\ldots, n-1\}\}$ as a multiset. This paper addresses the following question, which was explicitly asked by Nakamigawa: can the set of integers be partitioned into sets with the same gap sequence? The question is known to be true for any set where the gap sequence has length at most two. This paper provides evidence that the question is true when the gap sequence has length three. Namely, we prove that given positive integers $p$ and $q$, there is a positive integer $r_0$ such that for all $r\geq r_0$, the set of integers can be partitioned into $4$-sets with gap sequence $p, q$, $r$.

Minki Kim

What is connected

Connect this record

See the researcher in context

Building this map preview

8 published item(s)

A system of disjoint representatives of line segments with given $k$ directions

Badges and rainbow matchings

Domination numbers and noncover complexes of hypergraphs

Evaluation and Optimization of Distributed Machine Learning Techniques for Internet of Things

Can We Use Split Learning on 1D CNN Models for Privacy Preserving Training?

End-to-End Evaluation of Federated Learning and Split Learning for Internet of Things

The layer number of $α$-evenly distributed point sets

On tiling the integers with $4$-sets of the same gap sequence