Source author record

Yoshihiro Yamada

Yoshihiro Yamada appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Computer Vision cond-mat.str-el Machine Learning Computation and Language

Catalog footprint

What is connected

6works

4topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

CAT: Circular-Convolutional Attention for Sub-Quadratic Transformers

Transformers have driven remarkable breakthroughs in natural language processing and computer vision, yet their standard attention mechanism still imposes O(N^2) complexity, hindering scalability to longer sequences. We introduce Circular-convolutional ATtention (CAT), a Fourier-based approach that efficiently applies circular convolutions to reduce complexity without sacrificing representational power. CAT achieves O(NlogN) computations, requires fewer learnable parameters by streamlining fully connected layers, and introduces no additional heavy operations, resulting in consistent accuracy improvements and about a 10% speedup in naive PyTorch implementations. Based on the Engineering-Isomorphic Transformers (EITs) framework, CAT's design not only offers practical efficiency and ease of implementation, but also provides insights to guide the development of future high-performance Transformer architectures. Finally, our ablation studies highlight the key conditions underlying CAT's success, shedding light on broader principles for scalable attention mechanisms.

preprint2021arXiv

Joint Search of Data Augmentation Policies and Network Architectures

The common pipeline of training deep neural networks consists of several building blocks such as data augmentation and network architecture selection. AutoML is a research field that aims at automatically designing those parts, but most methods explore each part independently because it is more challenging to simultaneously search all the parts. In this paper, we propose a joint optimization method for data augmentation policies and network architectures to bring more automation to the design of training pipeline. The core idea of our approach is to make the whole part differentiable. The proposed method combines differentiable methods for augmentation policy search and network architecture search to jointly optimize them in the end-to-end manner. The experimental results show our method achieves competitive or superior performance to the independently searched results.

preprint2020arXiv

ShakeDrop Regularization for Deep Residual Learning

Overfitting is a crucial problem in deep neural networks, even in the latest network architectures. In this paper, to relieve the overfitting effect of ResNet and its improvements (i.e., Wide ResNet, PyramidNet, and ResNeXt), we propose a new regularization method called ShakeDrop regularization. ShakeDrop is inspired by Shake-Shake, which is an effective regularization method, but can be applied to ResNeXt only. ShakeDrop is more effective than Shake-Shake and can be applied not only to ResNeXt but also ResNet, Wide ResNet, and PyramidNet. An important key is to achieve stability of training. Because effective regularization often causes unstable training, we introduce a training stabilizer, which is an unusual use of an existing regularizer. Through experiments under various conditions, we demonstrate the conditions under which ShakeDrop works well.

preprint2016arXiv

Deep Pyramidal Residual Networks with Separated Stochastic Depth

On general object recognition, Deep Convolutional Neural Networks (DCNNs) achieve high accuracy. In particular, ResNet and its improvements have broken the lowest error rate records. In this paper, we propose a method to successfully combine two ResNet improvements, ResDrop and PyramidNet. We confirmed that the proposed network outperformed the conventional methods; on CIFAR-100, the proposed network achieved an error rate of 16.18% in contrast to PiramidNet achieving that of 18.29% and ResNeXt 17.31%.

preprint2016arXiv

Doping Effects on the Electronic Structure of an Anisotropic Kondo Semiconductor CeOs$_2$Al$_{10}$: An Optical Study with Re and Ir Substitution

An anisotropic Kondo semiconductor CeOs$_2$Al$_{10}$ exhibits an unusual antiferromagnetic order at rather high transition temperature $T_0$ of 28.5 K. Two possible origins of the magnetic order have been proposed so far, one is the Kondo coupling of the hybridization between the conduction ($c$) and the $4f$ states and the other is the charge-density wave/charge ordering along the orthorhombic $b$ axis. To clarify the origin of the magnetic order, we have investigated the electronic structure of hole- and electron-doped CeOs$_2$Al$_{10}$ [Ce(Os$_{1-y}$Re$_y$)$_2$Al$_{10}$ and Ce(Os$_{1-x}$Ir$_x$)$_2$Al$_{10}$, respectively] by using optical conductivity spectra along the $b$ axis. The intensity of the $c$-$f$ hybridization gap at $\hbarω\sim50$ meV continuously decreases from $y=0.10$ to $x=0.12$ via $x=y=0$. The intensity of the charge excitation observed at $\hbarω\sim20$ meV has the maximum at $x=y=0$ as similar with the doping dependence of $T_{\rm 0}$. The fact that the charge excitation is strongly related to the magnetic order strengthens the possibility of the charge density wave/charge ordering as the origin of the magnetic order.

preprint2016arXiv

Quantum phase transitions and multicriticality in Ta(Fe1-xVx)2

We present a comprehensive study of synthesis, structure analysis, transport and thermodynamic properties of the C14 Laves phase Ta(Fe1-xVx)2. Our measurements confirm the appearance of spin-density wave (SDW) order within a dome-like region of the x - T phase diagram with vanadium content 0.02 < x < 0.3. Our results indicate that on approaching TaFe2 from the vanadium-rich side, ferromagnetic (FM) correlations increase faster than the antiferromagnetic (AFM) ones. This results in an exchange-enhanced susceptibility and in the suppression of the SDW transition temperature for x < 0.13 forming the dome-like shape of the phase diagram. This effect is strictly related to a significant lattice distortion of the crystal structure manifested in the c/a ratio. At x = 0.02 both FM and AFM energy scales have similar strength and the system remains paramagnetic down to 2 K with an extremely large Stoner enhancement factor of about 400. Here, spin fluctuations dominate the temperature dependence of the resistivity ρ~ T ^ 3/2 and of the specific heat C/T ~ - log(T) which deviate from their conventional Fermi liquid forms, inferring the presence of a quantum critical point of dual nature.

Yoshihiro Yamada

What is connected

Connect this record

See the researcher in context

Building this map preview

6 published item(s)

CAT: Circular-Convolutional Attention for Sub-Quadratic Transformers

Joint Search of Data Augmentation Policies and Network Architectures

ShakeDrop Regularization for Deep Residual Learning

Deep Pyramidal Residual Networks with Separated Stochastic Depth

Doping Effects on the Electronic Structure of an Anisotropic Kondo Semiconductor CeOs$_2$Al$_{10}$: An Optical Study with Re and Ir Substitution

Quantum phase transitions and multicriticality in Ta(Fe1-xVx)2