Source author record

Takashi Onishi

Takashi Onishi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher, unclaimed source record

Artificial Intelligence, Machine Learning, cond-mat.mtrl-sci, eess.SY, math.OC, Systems and Control

Catalog footprint

What is connected

4 works

6 topics

4 close collaborators

Preprint, 2022, arXiv

Dropout Q-Functions for Doubly Efficient Reinforcement Learning

Randomized ensembled double Q-learning (REDQ) (Chen et al., 2021b) has recently achieved state-of-the-art sample efficiency on continuous-action reinforcement learning benchmarks. This superior sample efficiency is made possible by using a large Q-function ensemble. However, REDQ is much less computationally efficient than non-ensemble counterparts such as Soft Actor-Critic (SAC) (Haarnoja et al., 2018a). To make REDQ more computationally efficient, we propose DroQ, a variant of REDQ that uses a small ensemble of dropout Q-functions. Our dropout Q-functions are simple Q-functions equipped with dropout connections and layer normalization. Despite its simplicity of implementation, our experimental results indicate that DroQ is doubly (sample and computationally) efficient. It achieved sample efficiency comparable to that of REDQ, much better computational efficiency than REDQ, and computational efficiency comparable to that of SAC.
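As a rough illustration of what "a Q-function equipped with dropout and layer normalization" can look like, here is a minimal pure-Python sketch. It is not the authors' implementation: the class name `DropoutQFunction`, the single hidden layer, the ordering (linear, dropout, layer normalization, ReLU), the weight initialization, and the dropout rate are all assumptions made for this example.

```python
import math
import random


def layer_norm(xs, eps=1e-5):
    """Normalize a vector to zero mean and unit variance (no learned scale or shift)."""
    mean = sum(xs) / len(xs)
    var = sum((x - mean) ** 2 for x in xs) / len(xs)
    return [(x - mean) / math.sqrt(var + eps) for x in xs]


def dropout(xs, rate, rng, training=True):
    """Inverted dropout: zero each unit with probability `rate`, rescale the survivors."""
    if not training or rate == 0.0:
        return list(xs)
    keep = 1.0 - rate
    return [x / keep if rng.random() < keep else 0.0 for x in xs]


class DropoutQFunction:
    """Hypothetical Q(s, a): linear -> dropout -> layer norm -> ReLU -> linear."""

    def __init__(self, input_dim, hidden_dim, dropout_rate, seed=0):
        self.rng = random.Random(seed)
        self.dropout_rate = dropout_rate
        s1 = 1.0 / math.sqrt(input_dim)
        s2 = 1.0 / math.sqrt(hidden_dim)
        # Assumed initialization: small uniform weights, zero biases.
        self.w1 = [[self.rng.uniform(-s1, s1) for _ in range(input_dim)]
                   for _ in range(hidden_dim)]
        self.b1 = [0.0] * hidden_dim
        self.w2 = [self.rng.uniform(-s2, s2) for _ in range(hidden_dim)]
        self.b2 = 0.0

    def __call__(self, state, action, training=True):
        x = list(state) + list(action)  # concatenate state and action
        h = [sum(w * xi for w, xi in zip(row, x)) + b
             for row, b in zip(self.w1, self.b1)]
        h = dropout(h, self.dropout_rate, self.rng, training)
        h = layer_norm(h)
        h = [max(0.0, v) for v in h]  # ReLU
        return sum(w * v for w, v in zip(self.w2, h)) + self.b2


# A small ensemble of two Q-functions. Taking the minimum over the ensemble
# is an assumption borrowed from clipped double Q-learning, not a detail
# stated in the abstract.
ensemble = [DropoutQFunction(3, 16, dropout_rate=0.01, seed=s) for s in (0, 1)]
state, action = [0.1, -0.2], [0.5]
q_target = min(q(state, action) for q in ensemble)
```

The point of the sketch is the cost profile: each forward pass is an ordinary small network, and the dropout noise supplies the variability that a large ensemble would otherwise provide.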

Preprint, 2022, arXiv

Railway Operation Rescheduling System via Dynamic Simulation and Reinforcement Learning

The number of railway service disruptions has been increasing owing to the intensification of natural disasters. In addition, abrupt changes in social situations such as the COVID-19 pandemic require railway companies to modify their traffic schedules frequently. Therefore, automatic support for optimal scheduling is anticipated. In this study, an automatic railway scheduling system is presented. The system leverages reinforcement learning and a dynamic simulator that can simulate the railway traffic and passenger flow of a whole line. The proposed system enables rapid generation of the traffic schedule of a whole line because the optimization process is conducted in advance, during training. The system is evaluated using an interruption scenario, and the results demonstrate that the system can generate optimized schedules of the whole line in a few minutes.

Preprint, 2022, arXiv

Soft Sensors and Process Control using AI and Dynamic Simulation

During the operation of a chemical plant, product quality must be consistently maintained, and the production of off-specification products should be minimized. Accordingly, process variables related to the product quality, such as the temperature and composition of materials in various parts of the plant, must be measured, and appropriate operations (that is, control) must be performed based on the measurements. Some process variables, such as temperature and flow rate, can be measured continuously and instantaneously. However, other variables, such as composition and viscosity, can only be obtained through time-consuming analysis after sampling substances from the plant. Soft sensors have been proposed for estimating process variables that cannot be obtained in real time from easily measurable variables. However, the estimation accuracy of conventional statistical soft sensors, which are constructed from recorded measurements, can be very poor in unrecorded situations (extrapolation). In this study, we estimate the internal state variables of a plant by using a dynamic simulator that can estimate and predict even unrecorded situations on the basis of chemical engineering knowledge and an artificial intelligence (AI) technology called reinforcement learning, and we propose using the estimated internal state variables as soft sensors. In addition, we describe the prospects for plant operation and control using such soft sensors and the methodology for obtaining the necessary prediction models (i.e., simulators) for the proposed system.

Preprint, 2014, arXiv

Spin-Orbit Interaction Effects in the Electronic Structure of B20-type CoSi: First-Principles Density Functional Study

We have performed fully relativistic first-principles density functional calculations for non-magnetic B20-type CoSi. The spin-orbit interaction has crucial effects on the electronic structure of a chiral crystal. The calculated band structure around the Fermi energy shows Bloch vector $k$-linear dispersion expressed by a real-spin Weyl Hamiltonian, i.e., a massless Dirac Hamiltonian. We found a hedgehog-like spin texture in Bloch $\boldsymbol k$-vector space (momentum space) on the isoenergy surface around the $Γ$ point. The Fermi velocity for the $k$-linear dispersion is about 0.22$v^g_F$, where $v^g_F$ is the Fermi velocity of graphene.
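For orientation, the generic isotropic form of a Weyl Hamiltonian with $k$-linear dispersion can be written as below. This is the textbook expression, not necessarily the exact Hamiltonian used in the paper. The symbols $\hbar$, $v_F$, and the Pauli matrices $\boldsymbol\sigma$ (acting on real spin) are notation assumed here, and the overall sign (chirality) is left unspecified.

```latex
H(\boldsymbol k) = \pm \hbar v_F \, \boldsymbol\sigma \cdot \boldsymbol k,
\qquad
E(\boldsymbol k) = \pm \hbar v_F \, |\boldsymbol k|,
\qquad
v_F \approx 0.22 \, v^g_F .
```

In this form the spin expectation value on an isoenergy surface is parallel or antiparallel to $\boldsymbol k$, which is one way to picture a hedgehog-like spin texture around the $Γ$ point.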
