Researcher profile

Yoshiaki Inoue

Yoshiaki Inoue contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 15 - UnverifiedVerification L1Unclaimed author
3works
0followers
6topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

3 published item(s)

preprint2022arXiv

Estimating customer impatience in a service system with unobserved balking

This paper studies a service system in which arriving customers are provided with information about the delay they will experience. Based on this information they decide to wait for service or to leave the system. Specifically, every customer has a patience threshold and they balk if the observed delay is above the threshold. The main objective is to estimate the parameters of the customers' patience-level distribution and the corresponding potential arrival rate, using knowledge of the actual queue-length process only. The main complication, and distinguishing feature of our setup, lies in the fact that customers who decide not to join are not observed, remarkably, we manage to devise a procedure to estimate the underlying patience and arrival rate parameters. The model is a multi-server queue with a Poisson stream of customers, enabling evaluation of the corresponding likelihood function of the state-dependent effective arrival process. We establish strong consistency of the MLE and derive the asymptotic distribution of the estimation error. Several applications and extensions of the method are discussed. The performance is further assessed through a series of numerical experiments. By fitting parameters of hyperexponential and generalized-hyperexponential distributions our method provides a robust estimation framework for any continuous patience-level distribution.

preprint2022arXiv

Stochastic Image Transmission with CoAP for Extreme Environments

Communication in extreme environments is an important research topic for various use cases including environmental monitoring. A typical example is underwater acoustic communication for 6G mobile networks. The major challenges in such environments are extremely high-latency and high-error rate. They make real-time image transmission difficult using existing communication protocols. This is partly because frequent retransmission in noisy networks increases latency and leads to serious deterioration of real-timeness. To address this problem, this paper proposes a stochastic image transmission with Constrained Application Protocol (CoAP) for extreme environments. The goal of the proposed idea is to achieve approximate real-time image transmission without retransmission using CoAP over UDP. To this end, an image is divided into blocks, and value is assigned for each block based on the requirement. By the stochastic transmission of blocks, the reception probability is guaranteed without retransmission even when packets are lost in networks. We implemented the proposed scheme using Raspberry Pi 4 to demonstrate the feasibility. The performance of the proposed image transmission was confirmed from the experimental results.

preprint2021arXiv

Queueing Analysis of GPU-Based Inference Servers with Dynamic Batching: A Closed-Form Characterization

GPU-accelerated computing is a key technology to realize high-speed inference servers using deep neural networks (DNNs). An important characteristic of GPU-based inference is that the computational efficiency, in terms of the processing speed and energy consumption, drastically increases by processing multiple jobs together in a batch. In this paper, we formulate GPU-based inference servers as a batch service queueing model with batch-size dependent processing times. We first show that the energy efficiency of the server monotonically increases with the arrival rate of inference jobs, which suggests that it is energy-efficient to operate the inference server under a utilization level as high as possible within a latency requirement of inference jobs. We then derive a closed-form upper bound for the mean latency, which provides a simple characterization of the latency performance. Through simulation and numerical experiments, we show that the exact value of the mean latency is well approximated by this upper bound. We further compare this upper bound with the latency curve measured in real implementation of GPU-based inference servers and we show that the real performance curve is well explained by the derived simple formula.