Source author record

Yoshiaki Inoue

Yoshiaki Inoue appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Machine Learning, math.PR, math.ST, Networking and Internet Architecture, Performance, Statistics Theory

Catalog footprint

What is connected

3 works

6 topics

4 close collaborators

preprint, 2022, arXiv

Estimating customer impatience in a service system with unobserved balking

This paper studies a service system in which arriving customers are provided with information about the delay they will experience. Based on this information, they decide whether to wait for service or leave the system. Specifically, every customer has a patience threshold and balks if the observed delay is above the threshold. The main objective is to estimate the parameters of the customers' patience-level distribution and the corresponding potential arrival rate, using knowledge of the actual queue-length process only. The main complication, and the distinguishing feature of our setup, is that customers who decide not to join are not observed. Nevertheless, we devise a procedure to estimate the underlying patience and arrival rate parameters. The model is a multi-server queue with a Poisson stream of customers, enabling evaluation of the corresponding likelihood function of the state-dependent effective arrival process. We establish strong consistency of the MLE and derive the asymptotic distribution of the estimation error. Several applications and extensions of the method are discussed. The performance is further assessed through a series of numerical experiments. By fitting parameters of hyperexponential and generalized-hyperexponential distributions, our method provides a robust estimation framework for any continuous patience-level distribution.
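The idea of maximizing the likelihood of a state-dependent effective arrival process can be illustrated with a much simpler stand-in model than the one in the paper. The sketch below assumes, purely for illustration, that a customer who sees n customers present joins with probability exp(-theta * n), so the effective arrival rate in state n is lam * exp(-theta * n). The function name `profile_mle`, the joining rule, and the summary statistics `arrivals` and `time_in_state` are all hypothetical and not taken from the paper.

```python
import math

def profile_mle(arrivals, time_in_state, theta_max=5.0, iters=200):
    """Estimate (lam, theta) in a toy model with effective rate lam * exp(-theta * n).

    arrivals[n]      -- number of joins observed while n customers were present
    time_in_state[n] -- total time spent with n customers present
    Balking customers are never counted; only joins and sojourn times are used.
    """
    total_arrivals = sum(arrivals)

    def lam_hat(theta):
        # For fixed theta, the maximizing lam is (observed joins) / (weighted exposure).
        exposure = sum(math.exp(-theta * n) * t for n, t in enumerate(time_in_state))
        return total_arrivals / exposure

    def profile_loglik(theta):
        # Log-likelihood of a Poisson process with state-dependent rate,
        # evaluated at the best lam for this theta.
        lam = lam_hat(theta)
        return sum(
            a * (math.log(lam) - theta * n) - lam * math.exp(-theta * n) * t
            for n, (a, t) in enumerate(zip(arrivals, time_in_state))
        )

    # Ternary search; the profile log-likelihood is concave in theta for this toy model.
    lo, hi = 0.0, theta_max
    for _ in range(iters):
        m1 = lo + (hi - lo) / 3
        m2 = hi - (hi - lo) / 3
        if profile_loglik(m1) < profile_loglik(m2):
            lo = m1
        else:
            hi = m2
    theta = (lo + hi) / 2
    return lam_hat(theta), theta
```

Calling `profile_mle(arrivals, time_in_state)` returns a pair `(lam, theta)`. The point of the sketch is only that the potential arrival rate and the impatience parameter can both be identified from joins and time spent per state, even though balking customers leave no trace in the data.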

preprint, 2022, arXiv

Stochastic Image Transmission with CoAP for Extreme Environments

Communication in extreme environments is an important research topic for various use cases including environmental monitoring. A typical example is underwater acoustic communication for 6G mobile networks. The major challenges in such environments are extremely high latency and a high error rate, which make real-time image transmission difficult with existing communication protocols. This is partly because frequent retransmission in noisy networks increases latency and seriously degrades real-time performance. To address this problem, this paper proposes a stochastic image transmission scheme using the Constrained Application Protocol (CoAP) for extreme environments. The goal of the proposed idea is to achieve approximate real-time image transmission without retransmission using CoAP over UDP. To this end, an image is divided into blocks, and a value is assigned to each block based on the requirement. By the stochastic transmission of blocks, the reception probability is guaranteed without retransmission even when packets are lost in networks. We implemented the proposed scheme on a Raspberry Pi 4 to demonstrate its feasibility. The performance of the proposed image transmission was confirmed by experimental results.

preprint, 2021, arXiv

Queueing Analysis of GPU-Based Inference Servers with Dynamic Batching: A Closed-Form Characterization

GPU-accelerated computing is a key technology to realize high-speed inference servers using deep neural networks (DNNs). An important characteristic of GPU-based inference is that the computational efficiency, in terms of the processing speed and energy consumption, increases drastically when multiple jobs are processed together in a batch. In this paper, we formulate GPU-based inference servers as a batch service queueing model with batch-size dependent processing times. We first show that the energy efficiency of the server monotonically increases with the arrival rate of inference jobs, which suggests that it is energy-efficient to operate the inference server at as high a utilization level as the latency requirement of inference jobs allows. We then derive a closed-form upper bound for the mean latency, which provides a simple characterization of the latency performance. Through simulation and numerical experiments, we show that the exact value of the mean latency is well approximated by this upper bound. We further compare this upper bound with the latency curve measured in a real implementation of GPU-based inference servers, and we show that the real performance curve is well explained by the derived simple formula.
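A batch service queue of this kind can be explored with a small simulation. The sketch below assumes Poisson arrivals, a single server that takes every waiting job into one batch whenever it becomes free, and a linear processing time alpha * b + tau0 for a batch of size b. The linear form, the parameter names, and the function `simulate_batch_server` are illustrative assumptions, not details from the paper, and the code does not reproduce the paper's closed-form bound.

```python
import random

def simulate_batch_server(lam, alpha, tau0, n_jobs=20000, max_batch=None, seed=0):
    """Simulate a dynamic-batching server; return (mean latency, mean batch size).

    lam         -- Poisson arrival rate of inference jobs
    alpha, tau0 -- assumed linear batch processing time: alpha * b + tau0
    max_batch   -- optional cap on the batch size (None means unlimited)
    """
    rng = random.Random(seed)

    # Generate arrival epochs of a Poisson process.
    t = 0.0
    arrivals = []
    for _ in range(n_jobs):
        t += rng.expovariate(lam)
        arrivals.append(t)

    clock = 0.0          # time at which the server is next free
    latency_sum = 0.0
    batches = 0
    i = 0                # index of the first job not yet served
    while i < n_jobs:
        # If nothing is waiting, the server idles until the next arrival.
        if arrivals[i] > clock:
            clock = arrivals[i]
        # Batch together every job that has arrived by now.
        j = i
        while j < n_jobs and arrivals[j] <= clock:
            j += 1
        if max_batch is not None:
            j = min(j, i + max_batch)
        b = j - i
        clock += alpha * b + tau0            # batch-size dependent service time
        latency_sum += sum(clock - arrivals[k] for k in range(i, j))
        batches += 1
        i = j
    return latency_sum / n_jobs, n_jobs / batches
```

Running the function at a low and a high arrival rate (with lam * alpha below 1 so the unlimited-batch system stays stable) shows batches growing with load, which is the mechanism behind the efficiency gain described in the abstract.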