Researcher profile

Takashi Mori

Takashi Mori contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 21 - EmergingVerification L1Unclaimed author
14works
0followers
5topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

14 published item(s)

preprint2022arXiv

Heating Rates under Fast Periodic Driving beyond Linear Response

Heating under periodic driving is a generic nonequilibrium phenomenon, and it is a challenging problem in nonequilibrium statistical physics to derive a quantitatively accurate heating rate. In this work, we provide a simple formula on the heating rate under fast and strong periodic driving in classical and quantum many-body systems. The key idea behind the formula is constructing a time-dependent dressed Hamiltonian by moving to a rotating frame, which is found by a truncation of the high-frequency expansion of the micromotion operator, and applying the linear-response theory. It is confirmed for specific classical and quantum models that the second-order truncation of the high-frequency expansion yields quantitatively accurate heating rates beyond the linear-response regime. Our result implies that the information on heating dynamics is encoded in the first few terms of the high-frequency expansion, although heating is often associated with an asymptotically divergent behavior of the high-frequency expansion.

preprint2022arXiv

Interplay between depth of neural networks and locality of target functions

It has been recognized that heavily overparameterized deep neural networks (DNNs) exhibit surprisingly good generalization performance in various machine-learning tasks. Although benefits of depth have been investigated from different perspectives such as the approximation theory and the statistical learning theory, existing theories do not adequately explain the empirical success of overparameterized DNNs. In this work, we report a remarkable interplay between depth and locality of a target function. We introduce $k$-local and $k$-global functions, and find that depth is beneficial for learning local functions but detrimental to learning global functions. This interplay is not properly captured by the neural tangent kernel, which describes an infinitely wide neural network within the lazy learning regime.

preprint2022arXiv

Power-law escape rate of SGD

Stochastic gradient descent (SGD) undergoes complicated multiplicative noise for the mean-square loss. We use this property of SGD noise to derive a stochastic differential equation (SDE) with simpler additive noise by performing a random time change. Using this formalism, we show that the log loss barrier $Δ\log L=\log[L(θ^s)/L(θ^*)]$ between a local minimum $θ^*$ and a saddle $θ^s$ determines the escape rate of SGD from the local minimum, contrary to the previous results borrowing from physics that the linear loss barrier $ΔL=L(θ^s)-L(θ^*)$ decides the escape rate. Our escape-rate formula strongly depends on the typical magnitude $h^*$ and the number $n$ of the outlier eigenvalues of the Hessian. This result explains an empirical fact that SGD prefers flat minima with low effective dimensions, giving an insight into implicit biases of SGD.

preprint2022arXiv

Strength of Minibatch Noise in SGD

The noise in stochastic gradient descent (SGD), caused by minibatch sampling, is poorly understood despite its practical importance in deep learning. This work presents the first systematic study of the SGD noise and fluctuations close to a local minimum. We first analyze the SGD noise in linear regression in detail and then derive a general formula for approximating SGD noise in different types of minima. For application, our results (1) provide insight into the stability of training a neural network, (2) suggest that a large learning rate can help generalization by introducing an implicit regularization, (3) explain why the linear learning rate-batchsize scaling law fails at a large learning rate or at a small batchsize and (4) can provide an understanding of how discrete-time nature of SGD affects the recently discovered power-law phenomenon of SGD.

preprint2021arXiv

Is deeper better? It depends on locality of relevant features

It has been recognized that a heavily overparameterized artificial neural network exhibits surprisingly good generalization performance in various machine-learning tasks. Recent theoretical studies have made attempts to unveil the mystery of the overparameterization. In most of those previous works, the overparameterization is achieved by increasing the width of the network, while the effect of increasing the depth has remained less well understood. In this work, we investigate the effect of increasing the depth within an overparameterized regime. To gain an insight into the advantage of depth, we introduce local and global labels as abstract but simple classification rules. It turns out that the locality of the relevant feature for a given classification rule plays a key role; our experimental results suggest that deeper is better for local labels, whereas shallower is better for global labels. We also compare the results of finite networks with those of the neural tangent kernel (NTK), which is equivalent to an infinitely wide network with a proper initialization and an infinitesimal learning rate. It is shown that the NTK does not correctly capture the depth dependence of the generalization performance, which indicates the importance of the feature learning rather than the lazy learning.

preprint2020arXiv

Thermalization in open many-body systems based on eigenstate thermalization hypothesis

We investigate steady states of macroscopic quantum systems under dissipation not obeying the detailed balance condition. We argue that the Gibbs state at an effective temperature gives a good description of the steady state provided that the system Hamiltonian obeys the eigenstate thermalization hypothesis (ETH) and the perturbation theory in the weak system-environment coupling is valid in the thermodynamic limit. We derive a criterion to guarantee the validity of the perturbation theory, which is satisfied in the thermodynamic limit for sufficiently weak dissipation when the Liouvillian is gapped for bulk-dissipated systems, while the perturbation theory breaks down in boundary-dissipated chaotic systems due to the presence of diffusive transports. We numerically confirm these theoretical predictions. This work suggests a connection between steady states of macroscopic open quantum systems and the ETH.

preprint2013arXiv

Exactness of the mean-field dynamics in optical cavity systems

Validity of the mean-field approach to open system dynamics in the optical cavity system is examined. It is rigorously shown that the mean-field approach is justified in the thermodynamic limit. The result is applicable to nonequilibrium situations, e.g. the thermal reservoirs may have different temperatures, and the system may be subject to a time-dependent external field. The result of this work will lead to further studies on macroscopic open quantum systems.

preprint2013arXiv

Nonadditivity in Quasiequilibrium States of Spin Systems with Lattice Distortion

It is pointed out that there exists a short-range interacting system, i.e. the elastic spin model, which is extensive but nonadditive. It is numerically shown that, depending on the statistical ensemble, the specific heat or the susceptibility becomes negative in a certain parameter region, which shows ensemble inequivalence in this model. Further, we numerically estimate the effective Hamiltonian for spin variables, and it is clarified that the effective interaction among spin variables is long-ranged. Remarkably, the so called Kac's prescription, which is usually regarded as a mathematical operation to make the system extensive, naturally holds in the effective interaction.

preprint2013arXiv

Phase transitions in systems with non-additive long-range interactions

We consider spin systems with long-range interactions in nonadditive regime. When the non-additive scaling limit is employed, the energy and the entropy compete and the system exhibits some phase transitions. Such systems do not satisfy the additivity, which results in some unfamiliar properties related to phase transitions. In this paper, the concept of additivity and its consequence are explained and the recent progress on statistical mechanics of long-range interacting systems are reviewed. It is shown that the parameter space is clearly decomposed into the three regions according to the stability of the uniform state predicted by the mean-field theory. Based on this parameter space decomposition, recent results on the exactness of MF theory are explained. When the interaction is non-negative (ferromagnetic), the analysis of the mean-field theory is exact and a typical spin configuration is always uniform in the canonical ensemble. However, in the restricted canonical ensemble, i.e., the canonical ensemble with a restriction of the value of the magnetization, it is shown that the mean-field theory does not necessarily give the exact description of the system and phase transitions between the mean-field uniform states (MF phase) and the inhomogeneous states (non-MF phase) occur. A new finding is that when the interaction potential changes its sign depending on the distance, the non-MF phase appears even in the canonical ensemble.

preprint2012arXiv

Critical temperature and correlation length of an elastic interaction model for spin-crossover materials

It has previously been pointed out that the coexistence of infinite-range and short-range interactions causes a system to have a phase transition of the mean-field universality class, in which the cluster size is finite even at the critical point. In the present paper, we study this property in a model of bistable molecules, whose size changes depending on the bistable states. The molecules can move in space, interacting via an elastic interaction. It is known that due to the different sizes, an effective long-range interaction between the spins appears, and thus this model has a mean-field type of phase transition. It is found that the scaling properties of the shift of the critical temperature from the pure short-range limit in the model with infinite-range and short-range interactions hold also in the present model, regarding the ratio of the size of the two states as a control parameter for the strength of the long-range interaction. By studying the structure factor, it is shown that the dependence of the cluster size at the critical temperature also shows the same scaling properties as a previously studied model with both infinite-range and short-range interactions. We therefore conclude that these scaling relations hold universally in hybrid models with both short-range and weak long-range interactions.

preprint2012arXiv

Microcanonical Analysis of Exactness of the Mean-Field Theory in Long-Range Interacting Systems

Classical spin systems with nonadditive long-range interactions are studied in the microcanonical ensemble. It is expected that the entropy of such a system is identical to that of the corresponding mean-field model, which is called "exactness of the mean-field theory". It is found out that this expectation is not necessarily true if the microcanonical ensemble is not equivalent to the canonical ensemble in the mean-field model. Moreover, necessary and sufficient conditions for exactness of the mean-field theory are obtained. These conditions are investigated for two concrete models, the α-Potts model with annealed vacancies and the α-Potts model with invisible states.

preprint2011arXiv

Crossover between a Short-range and a Long-range Ising model

Recently, it has been found that an effective long-range interaction is realized among local bistable variables (spins) in systems where the elastic interaction causes ordering of the spins. In such systems, generally we expect both long-range and short-range interactions to exist. In the short-range Ising model, the correlation length diverges at the critical point. In contrast, in the long-range interacting model the spin configuration is always uniform and the correlation length is zero. As long as a system has non-zero long-range interactions, it shows criticality in the mean-field universality class, and the spin configuration is uniform beyond a certain scale. Here we study the crossover from the pure short-range interacting model to the long-range interacting model. We investigate the infinite-range model (Husimi-Temperley model) as a prototype of this competition, and we study how the critical temperature changes as a function of the strength of the long-range interaction. This model can also be interpreted as an approximation for the Ising model on a small-world network. We derive a formula for the critical temperature as a function of the strength of the long-range interaction. We also propose a scaling form for the spin correlation length at the critical point, which is finite as long as the long-range interaction is included, though it diverges in the limit of the pure short-range model. These properties are confirmed by extensive Monte Carlo simulations.

preprint2011arXiv

Instability of the mean-field states and generalization of phase separation in long-range interacting systems

Equilibrium properties of long-range interacting systems on lattices are investigated. There was a conjecture by Cannas et. al. that the mean-field theory is exact for spin systems with non-additive long-range interactions. This is called "exactness of the mean-field theory". We show that the exactness of the mean-field theory holds for systems on a lattice with non-additive two body long-range interactions in the canonical ensemble with non-fixed order parameters. We also show that in canonical ensemble with fixed order parameters (e.g. lattice gas model with a fixed number of particles), exactness of the mean-field theory does not hold in some parameter region, which we call "non-MF region". In the non-MF region, an inhomogeneous configuration appears contrary to the uniform configuration in the region where the mean-field theory holds. This inhomogeneous configuration is not the one given by the standard phase separation. Therefore, the mean-field picture is not adequate to describe these states. We discuss phase transitions between the MF region and the non-MF region. Exactness of the mean-field theory in spin glasses is also discussed.

preprint2009arXiv

Scaling properties of the relaxation time near the mean-field spinodal

We study the relaxation processes of the infinitely long-range interaction model (the Husimi-Temperley model) near the spinodal point. We propose a unified finite-size scaling function near the spinodal point, including the metastable region, the spinodal point, and the unstable region. We explicitly adopt the Glauber dynamics, derive a master equation for the probability distribution of the total magnetization, and perform the so-called van Kampen Omega expansion (an expansion in terms of the inverse of the systems size), which leads to a Fokker-Planck equation. We analyze the scaling properties of the Fokker-Planck equation and confirm the obtained scaling plot by direct numerical solution of the original master equation, and by kinetic Monte Carlo simulation of the stochastic decay process.