Ichiro Takeuchi

Ichiro Takeuchi appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Topics: Machine Learning, cond-mat.mtrl-sci, cond-mat.supr-con, cond-mat.str-el, Computer Vision, cond-mat.mes-hall, Methodology, physics.app-ph, physics.data-an, Cryptography and Security, Digital Libraries, eess.IV, physics.optics, quant-ph

Catalog footprint: 56 works, 14 topics, 4 close collaborators

Preprint, 2026, arXiv

Post-ADC Inference: Valid Inference After Active Data Collection

The validity of statistical inference depends critically on how data are collected. When data gathered through active data collection (ADC) are reused for a post-hoc inferential task, conventional inference can fail because the sampling is adaptively biased toward regions favored by the collection strategy. This issue is especially pronounced in black-box optimization, where sequential model-based optimization (SMBO) methods such as the tree-structured Parzen estimator (TPE) and Gaussian process upper confidence bound (GP-UCB) preferentially concentrate evaluations in promising regions. We study statistical inference on actively collected data when the inferential target is constructed in a data-dependent manner after data collection. To enable valid inference in this setting, we propose post-ADC inference, a framework that accounts for the biases arising from both the active data collection process and the subsequent data-driven target construction. Our method builds on selective inference and provides valid $p$-values and confidence intervals that correct for both sources of bias. The framework applies to a broad class of ADC processes by imposing only assumptions on the observation noise, without requiring any assumptions on the underlying black-box function or the surrogate model used by the SMBO algorithm. Empirical results also show that post-ADC inference provides valid inference for data collected by GP-UCB and TPE.

Preprint, 2026, arXiv

Quantum Kernel Machine Learning for Autonomous Materials Science

Autonomous materials science, where active learning is used to navigate large compositional phase space, has emerged as a powerful vehicle to rapidly explore new materials. A crucial aspect of autonomous materials science is exploring new materials using as little data as possible. Gaussian process-based active learning allows effective charting of multi-dimensional parameter space with a limited amount of training data, and thus is a common algorithmic choice for autonomous materials science. An integral part of the autonomous workflow is the application of kernel functions for quantifying similarities among measured data points. A recent theoretical breakthrough has shown that quantum kernel models can achieve similar performance with less training data than classical models. This signals the possible advantage of applying quantum kernel machine learning to autonomous materials discovery. In this work, we compare quantum and classical kernels for their utility in sequential phase space navigation for autonomous materials science. Specifically, we compute a quantum kernel and several classical kernels for x-ray diffraction patterns taken from an Fe-Ga-Pd ternary composition spread library. We conduct our study on both IonQ's Aria trapped ion quantum computer hardware and the corresponding classical noisy simulator. We experimentally verify that a quantum kernel model can outperform some classical kernel models. The results highlight the potential of quantum kernel machine learning methods for accelerating materials discovery and suggest that complex x-ray diffraction data is a candidate for robust quantum kernel model advantage.

Preprint, 2026, arXiv

Real-time Multi-instrument Autonomous Discovery of Novel Phase-change Memory Materials

Autonomous labs enable the integration of automated experiment execution, data analysis and decision making. The main challenge remains the integration of diverse data streams from multiple instruments, where the data is often heterogeneous and unsynchronized. The standard learning process of undetermined synthesis-process-structure-property relationships (SPSPR) usually relies on post-experiment analysis after data is fully collected, not during live experiments, and decision making is carried out independently across characterization equipment. Here, we demonstrate the Multi-instrument Autonomous Discovery (MAD) framework, which combines structural property mapping and functional property optimization simultaneously in a closed-loop manner. As an example, we applied MAD to phase change memory (PCM) materials, in particular the Mn-Sb-Te ternary, a previously unexplored materials system for PCM. A multi-output model is employed to merge data from x-ray diffraction (XRD) and electrical resistance measurements simultaneously through a co-regionalization kernel that models the relationship between them. The output probabilistic posterior and uncertainty quantification facilitate decision making with shared knowledge, while the goals are different across tasks. We aimed to maximize the knowledge of crystal structure distribution using non-negative matrix factorization (NMF) while, in parallel, finding the composition with the maximum resistance value, an important figure of merit for PCM. Leveraging MAD, we found promising electrical PCMs and identified the SPSPR within 25 closed-loop iterations, corresponding to a seven-fold speed-up. The framework opens a new path of study in large-scale autonomous facilities, where future experiments can be run jointly in parallel rather than independently.

Preprint, 2023, arXiv

Valid P-Value for Deep Learning-Driven Salient Region

Various saliency map methods have been proposed to interpret and explain predictions of deep learning models. Saliency maps allow us to interpret which parts of the input signals have a strong influence on the prediction results. However, since a saliency map is obtained by complex computations in deep learning models, it is often difficult to know how reliable the saliency map itself is. In this study, we propose a method to quantify the reliability of a salient region in the form of p-values. Our idea is to consider a salient region as a selected hypothesis by the trained deep learning model and employ the selective inference framework. The proposed method can provably control the probability of false positive detections of salient regions. We demonstrate the validity of the proposed method through numerical examples in synthetic and real datasets. Furthermore, we develop a Keras-based framework for conducting the proposed selective inference for a wide class of CNNs without additional implementation cost.

Preprint, 2022, arXiv

Bayesian Optimization for Distributionally Robust Chance-constrained Problem

In black-box function optimization, we need to consider not only controllable design variables but also uncontrollable stochastic environmental variables. In such cases, it is necessary to solve the optimization problem by taking into account the uncertainty of the environmental variables. The chance-constrained (CC) problem, the problem of maximizing the expected value under a certain level of constraint satisfaction probability, is one of the practically important problems in the presence of environmental variables. In this study, we consider the distributionally robust CC (DRCC) problem and propose a novel DRCC Bayesian optimization method for the case where the distribution of the environmental variables cannot be precisely specified. We show that the proposed method can find an arbitrarily accurate solution with high probability in a finite number of trials, and confirm the usefulness of the proposed method through numerical experiments.

Preprint, 2022, arXiv

Benchmarking Active Learning Strategies for Materials Optimization and Discovery

Autonomous physical science is revolutionizing materials science. In these systems, machine learning controls experiment design, execution, and analysis in a closed loop. Active learning, the machine learning field of optimal experiment design, selects each subsequent experiment to maximize knowledge toward the user goal. Autonomous system performance can be further improved with implementation of scientific machine learning, also known as inductive bias-engineered artificial intelligence, which folds prior knowledge of physical laws (e.g., Gibbs phase rule) into the algorithm. As the number, diversity, and uses for active learning strategies grow, there is an associated growing necessity for real-world reference datasets to benchmark strategies. We present a reference dataset and demonstrate its use to benchmark active learning strategies in the form of various acquisition functions. Active learning strategies are used to rapidly identify materials with optimal physical properties within a ternary materials system. The data is from an actual Fe-Co-Ni thin-film library and includes previously acquired experimental data for materials compositions, X-ray diffraction patterns, and two functional properties of magnetic coercivity and the Kerr rotation. Popular active learning methods along with a recent scientific active learning method are benchmarked for their materials optimization performance. We discuss the relationship between algorithm performance, materials search space complexity, and the incorporation of prior knowledge.

Preprint, 2022, arXiv

Chiral Spin Bobbers in Exchange-Coupled Hard-Soft Magnetic Bilayers

The spin structure of exchange-coupled MnBi:Co-Fe bilayers is investigated by X-ray magnetic circular dichroism (XMCD), polarized neutron reflectometry (PNR), and micromagnetic simulations. The purpose of the present research is two-fold. First, the current search for new permanent-magnet materials includes hard-soft nanocomposites, and the analysis of coercivity mechanisms in these structures is an important aspect of this quest. Second, topological micromagnetic structures such as skyrmions have recently become the subject of intense fundamental and applied research, for example in the context of spin-based electronics. We find that the magnetization reversal of the MnBi:Co-Fe bilayer structure involves a curling-type twisting of the magnetization in the film plane. This curling in the exchange-coupled hard-soft magnetic bilayers is reminiscent of chiral spin structures known as bobbers and, in fact, establishes a new type of skyrmionic spin structure.

Preprint, 2022, arXiv

Conditional Selective Inference for Robust Regression and Outlier Detection using Piecewise-Linear Homotopy Continuation

In practical data analysis in noisy environments, it is common to first use robust methods to identify outliers, and then to conduct further analysis after removing the outliers. In this paper, we consider statistical inference of the model estimated after outliers are removed, which can be interpreted as a selective inference (SI) problem. To use the conditional SI framework, it is necessary to characterize the events of how the robust method identifies outliers. Unfortunately, the existing methods cannot be directly used here because they are applicable only to the case where the selection events can be represented by linear/quadratic constraints. In this paper, we propose a conditional SI method for popular robust regressions by using a homotopy method. We show that the proposed conditional SI method is applicable to a wide class of robust regression and outlier detection methods and has good empirical performance on both synthetic data and real data experiments.

Preprint, 2022, arXiv

Exact Statistical Inference for the Wasserstein Distance by Selective Inference

In this paper, we study statistical inference for the Wasserstein distance, which has attracted much attention and has been applied to various machine learning tasks. Several methods have been proposed in the literature, but almost all of them are based on asymptotic approximation and do not have finite-sample validity. In this study, we propose an exact (non-asymptotic) inference method for the Wasserstein distance inspired by the concept of conditional Selective Inference (SI). To our knowledge, this is the first method that can provide a valid confidence interval (CI) for the Wasserstein distance with a finite-sample coverage guarantee, which can be applied not only to one-dimensional problems but also to multi-dimensional problems. We evaluate the performance of the proposed method on both synthetic and real-world datasets.

Preprint, 2022, arXiv

Hypothesis Learning in Automated Experiment: Application to Combinatorial Materials Libraries

Machine learning is rapidly becoming an integral part of experimental physical discovery via automated and high-throughput synthesis, and active experiments in scattering and electron/probe microscopy. This, in turn, necessitates the development of active learning methods capable of exploring relevant parameter spaces with the smallest number of steps. Here we introduce an active learning approach based on co-navigation of the hypothesis and experimental spaces. This is realized by combining the structured Gaussian Processes containing probabilistic models of the possible system's behaviors (hypotheses) with reinforcement learning policy refinement (discovery). This approach closely resembles classical human-driven physical discovery, when several alternative hypotheses realized via models with adjustable parameters are tested during an experiment. We demonstrate this approach for exploring concentration-induced phase transitions in combinatorial libraries of Sm-doped BiFeO3 using Piezoresponse Force Microscopy, but it is straightforward to extend it to higher-dimensional parameter spaces and more complex physical problems once the experimental workflow and hypothesis-generation are available.

Preprint, 2022, arXiv

Physics in the Machine: Integrating Physical Knowledge in Autonomous Phase-Mapping

Application of artificial intelligence (AI), and more specifically machine learning, to the physical sciences has expanded significantly over the past decades. In particular, science-informed AI, also known as scientific AI or inductive bias AI, has grown from a focus on data analysis to now controlling experiment design, simulation, execution and analysis in closed-loop autonomous systems. The CAMEO (closed-loop autonomous materials exploration and optimization) algorithm employs scientific AI to address two tasks: learning a material system's composition-structure relationship and identifying materials compositions with optimal functional properties. By integrating these, accelerated materials screening across compositional phase diagrams was demonstrated, resulting in the discovery of a best-in-class phase change memory material. Key to this success is the ability to guide subsequent measurements to maximize knowledge of the composition-structure relationship, or phase map. In this work we investigate the benefits of incorporating varying levels of prior physical knowledge into CAMEO's autonomous phase-mapping. This includes the use of ab-initio phase boundary data from the AFLOW repositories, which has been shown to optimize CAMEO's search when used as a prior.

Preprint, 2021, arXiv

Active learning for distributionally robust level-set estimation

Many cases exist in which a black-box function $f$ with high evaluation cost depends on two types of variables $\boldsymbol{x}$ and $\boldsymbol{w}$, where $\boldsymbol{x}$ is a controllable design variable and $\boldsymbol{w}$ are uncontrollable environmental variables that have random variation following a certain distribution $P$. In such cases, an important task is to find the range of design variables $\boldsymbol{x}$ such that the function $f(\boldsymbol{x}, \boldsymbol{w})$ has the desired properties by incorporating the random variation of the environmental variables $\boldsymbol{w}$. A natural measure of robustness is the probability that $f(\boldsymbol{x}, \boldsymbol{w})$ exceeds a given threshold $h$, which is known as the probability threshold robustness (PTR) measure in the literature on robust optimization. However, this robustness measure cannot be correctly evaluated when the distribution $P$ is unknown. In this study, we addressed this problem by considering the distributionally robust PTR (DRPTR) measure, which considers the worst-case PTR within given candidate distributions. Specifically, we studied the problem of efficiently identifying a reliable set $H$, which is defined as a region in which the DRPTR measure exceeds a certain desired probability $\alpha$, which can be interpreted as a level set estimation (LSE) problem for DRPTR. We propose a theoretically grounded and computationally efficient active learning method for this problem. We show that the proposed method has theoretical guarantees on convergence and accuracy, and confirmed through numerical experiments that the proposed method outperforms existing methods.
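
The PTR and DRPTR definitions in this abstract can be made concrete with a small example. The following is a minimal Python sketch, not the paper's method: it assumes a known toy function, an environmental variable with finite support, and two hand-picked candidate distributions, whereas the paper treats $f$ as an expensive black box and uses active learning. The names `ptr`, `drptr`, `reliable_set` and `toy_f` are hypothetical.

```python
from typing import Callable, List, Sequence

Func = Callable[[float, float], float]


def ptr(f: Func, x: float, support: Sequence[float],
        probs: Sequence[float], h: float) -> float:
    """PTR under one distribution: probability that f(x, w) exceeds h."""
    return sum(p for w, p in zip(support, probs) if f(x, w) > h)


def drptr(f: Func, x: float, support: Sequence[float],
          candidates: Sequence[Sequence[float]], h: float) -> float:
    """DRPTR: worst-case (minimum) PTR over the candidate distributions."""
    return min(ptr(f, x, support, probs, h) for probs in candidates)


def reliable_set(f: Func, xs: Sequence[float], support: Sequence[float],
                 candidates: Sequence[Sequence[float]],
                 h: float, alpha: float) -> List[float]:
    """Design points whose DRPTR exceeds the desired probability alpha."""
    return [x for x in xs if drptr(f, x, support, candidates, h) > alpha]


# Toy setup; every value below is an illustrative assumption.
def toy_f(x: float, w: float) -> float:
    return 1.0 - (x - w) ** 2


support = [-0.5, 0.0, 0.5]                       # possible values of w
candidates = [[0.2, 0.6, 0.2], [0.6, 0.2, 0.2]]  # candidate distributions of w
xs = [-1.0, 0.0, 1.0]                            # design points to screen

H = reliable_set(toy_f, xs, support, candidates, h=0.5, alpha=0.9)
print(H)
```

In this toy setup only the design point `0.0` keeps $f$ above the threshold for every value of $w$, so it is the only member of the reliable set.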

Preprint, 2021, arXiv

Computing Valid p-value for Optimal Changepoint by Selective Inference using Dynamic Programming

There is a vast body of literature related to methods for detecting changepoints (CP). However, less attention has been paid to assessing the statistical reliability of the detected CPs. In this paper, we introduce a novel method to perform statistical inference on the significance of the CPs, estimated by a Dynamic Programming (DP)-based optimal CP detection algorithm. Based on the selective inference (SI) framework, we propose an exact (non-asymptotic) approach to compute valid p-values for testing the significance of the CPs. Although it is well-known that SI has low statistical power because of over-conditioning, we address this disadvantage by introducing parametric programming techniques. Then, we propose an efficient method to conduct SI with the minimum amount of conditioning, leading to high statistical power. We conduct experiments on both synthetic and real-world datasets, through which we offer evidence that our proposed method is more powerful than existing methods, has decent performance in terms of computational efficiency, and provides good results in many practical applications.
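
For readers unfamiliar with DP-based changepoint detection, the sketch below shows the detection step only, not the selective inference that the paper contributes. It assumes a mean-shift model with squared-error cost and a fixed number of changepoints; the abstract does not state these details, and the function names are hypothetical.

```python
from typing import List, Sequence


def optimal_changepoints(y: Sequence[float], k: int) -> List[int]:
    """Return k changepoint indices that minimize the total within-segment
    squared error, found by dynamic programming.
    A changepoint i means a new segment starts at y[i]."""
    n = len(y)
    if k < 0 or n < k + 1:
        raise ValueError("need at least k + 1 observations")

    # Prefix sums give each segment cost in O(1).
    prefix, prefix_sq = [0.0], [0.0]
    for v in y:
        prefix.append(prefix[-1] + v)
        prefix_sq.append(prefix_sq[-1] + v * v)

    def cost(i: int, j: int) -> float:
        """Squared error of y[i:j] around its own mean."""
        s = prefix[j] - prefix[i]
        return (prefix_sq[j] - prefix_sq[i]) - s * s / (j - i)

    inf = float("inf")
    # best[m][j]: minimum cost of splitting y[:j] into m segments.
    best = [[inf] * (n + 1) for _ in range(k + 2)]
    back = [[0] * (n + 1) for _ in range(k + 2)]
    best[0][0] = 0.0
    for m in range(1, k + 2):
        for j in range(m, n + 1):
            for i in range(m - 1, j):
                c = best[m - 1][i] + cost(i, j)
                if c < best[m][j]:
                    best[m][j] = c
                    back[m][j] = i

    # Walk back from the full series to recover the segment starts.
    cps: List[int] = []
    j = n
    for m in range(k + 1, 0, -1):
        i = back[m][j]
        if m > 1:
            cps.append(i)
        j = i
    return cps[::-1]


print(optimal_changepoints([0, 0, 0, 5, 5, 5, 1, 1, 1], k=2))
```

The triple loop makes this cubic in the series length for fixed `k`, which is fine for illustration; the question the paper addresses is how to attach valid p-values to the changepoints such a procedure returns.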

Preprint, 2021, arXiv

Exploring physics of ferroelectric domain walls via Bayesian analysis of atomically resolved STEM data

The physics of ferroelectric domain walls is explored using the Bayesian inference analysis of atomically resolved STEM data. We demonstrate that domain wall profile shapes are ultimately sensitive to the nature of the order parameter in the material, including the functional form of Ginzburg-Landau-Devonshire expansion, and numerical value of the corresponding parameters. The preexisting materials knowledge naturally folds in the Bayesian framework in the form of prior distributions, with the different order parameters forming competing (or hierarchical) models. Here, we explore the physics of the ferroelectric domain walls in BiFeO3 using this method, and derive the posterior estimates of relevant parameters. More generally, this inference approach both allows learning materials physics from experimental data with associated uncertainty quantification, and establishing guidelines for instrumental development answering questions on what resolution and information limits are necessary for reliable observation of specific physical mechanisms of interest.

Preprint, 2021, arXiv

Mapping causal patterns in crystalline solids

The evolution of the atomic structures of the combinatorial library of Sm-substituted thin film BiFeO3 along the phase transition boundary from the ferroelectric rhombohedral phase to the non-ferroelectric orthorhombic phase is explored using scanning transmission electron microscopy (STEM). Localized properties including polarization, lattice parameter, and chemical composition are parameterized from atomic-scale imaging and their causal relationships are reconstructed using a linear non-Gaussian acyclic model (LiNGAM). This approach is further extended toward exploring the spatial variability of the causal coupling using the sliding window transform method, which revealed that new causal relationships emerged not only at the expected locations, such as domain walls and interfaces, but also at additional regions forming clusters in the vicinity of the walls or spatially distributed features. While the exact physical origins of these relationships are unclear, they likely represent nanophase separated regions in the morphotropic phase boundaries. Overall, we pose that an in-depth understanding of complex disordered materials away from thermodynamic equilibrium necessitates understanding not only of the generative processes that can lead to observed microscopic states, but also the causal links between multiple interacting subsystems.

Preprint, 2021, arXiv

Parametric Programming Approach for More Powerful and General Lasso Selective Inference

Selective Inference (SI) has been actively studied in the past few years for conducting inference on the features of linear models that are adaptively selected by feature selection methods such as Lasso. The basic idea of SI is to make inference conditional on the selection event. Unfortunately, the main limitation of the original SI approach for Lasso is that the inference is conducted not only conditional on the selected features but also on their signs -- this leads to loss of power because of over-conditioning. Although this limitation can be circumvented by considering the union of such selection events for all possible combinations of signs, this is only feasible when the number of selected features is sufficiently small. To address this computational bottleneck, we propose a parametric programming-based method that can conduct SI without conditioning on signs even when we have thousands of active features. The main idea is to compute the continuum path of Lasso solutions in the direction of a test statistic, and identify the subset of the data space corresponding to the feature selection event by following the solution path. The proposed parametric programming-based method not only avoids the aforementioned computational bottleneck but also improves the performance and practicality of SI for Lasso in various respects. We conduct several experiments to demonstrate the effectiveness and efficiency of our proposed method.

Preprint, 2021, arXiv

Topic Analysis of Superconductivity Literature by Semantic Non-negative Matrix Factorization

We utilize a recently developed topic modeling method called SeNMFk, which extends the standard Non-negative Matrix Factorization (NMF) methods by incorporating the semantic structure of the text and adding a robust system for determining the number of topics. With SeNMFk, we were able to extract coherent topics validated by human experts. Of these topics, a few are relatively general and cover broad concepts, while the majority can be precisely mapped to specific scientific effects or measurement techniques. The topics also differ by ubiquity, with only three topics prevalent in almost 40 percent of the abstracts, while each specific topic tends to dominate a small subset of the abstracts. These results demonstrate the ability of SeNMFk to produce a layered and nuanced analysis of large scientific corpora.

Preprint, 2021, arXiv

Universal scaling of the critical temperature and the strange-metal scattering rate in unconventional superconductors

Dramatic evolution of properties with minute change in the doping level is a hallmark of the complex chemistry which governs cuprate superconductivity, as manifested in the celebrated superconducting domes as well as quantum criticality taking place at precise compositions. The strange metal state, where the resistivity varies linearly with temperature, has emerged as a central feature in the normal state of cuprate superconductors. The ubiquity of this behavior signals an intimate link between the scattering mechanism and superconductivity. However, a clear quantitative picture of the correlation has been lacking. Here, we report observation of quantitative scaling laws between the superconducting transition temperature $T_{\rm c}$ and the scattering rate associated with the strange metal state in electron-doped cuprate $\rm La_{2-x}Ce_xCuO_4$ (LCCO) as a precise function of the doping level. High-resolution characterization of epitaxial composition-spread films, which encompass the entire overdoped range of LCCO, has allowed us to systematically map its structural and transport properties with unprecedented accuracy and an increment of $\Delta x = 0.0015$. We have uncovered the relations $T_{\rm c}\sim(x_{\rm c}-x)^{0.5}\sim(A_1^\square)^{0.5}$, where $x_{\rm c}$ is the critical doping where superconductivity disappears on the overdoped side and $A_1^\square$ is the scattering rate of perfect $T$-linear resistivity per CuO$_2$ plane. We argue that the striking similarity of the $T_{\rm c}$ vs $A_1^\square$ relation among cuprates, iron-based and organic superconductors is an indication of a common mechanism of the strange metal behavior and unconventional superconductivity in these systems.
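
The two scaling relations quoted in this abstract can be combined in one step. Reading $\sim$ as proportionality (an assumption about the notation, not something the abstract states), squaring both relations gives:

```latex
T_{\rm c} \sim (x_{\rm c}-x)^{0.5}
\quad\text{and}\quad
T_{\rm c} \sim (A_1^\square)^{0.5}
\;\Longrightarrow\;
T_{\rm c}^{2} \;\propto\; x_{\rm c}-x \;\propto\; A_1^\square .
```

Under this reading, the scattering rate $A_1^\square$ would scale linearly with the distance $x_{\rm c}-x$ to the critical doping.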

Preprint, 2020, arXiv

A Sampling Strategy in Efficient Potential Energy Surface Mapping for Predicting Atomic Diffusivity in Crystals by Machine Learning

We propose a machine-learning-based (ML-based) method for efficiently predicting atomic diffusivity in crystals, in which the potential energy surface (PES) of a diffusion carrier is partially evaluated by first-principles calculations. To preferentially evaluate the region of interest governing the atomic diffusivity, a statistical PES model based on a Gaussian process (GP-PES) is constructed and updated iteratively from known information on already-computed potential energies (PEs). In the proposed method, all local energy minima (stable & metastable sites) and elementary processes of atomic diffusion (atomic jumps) are explored on the predictive mean of the GP-PES. The uncertainty of jump frequency in each elementary process is then estimated on the basis of the variance of the GP-PES. The acquisition function determining the next grid point to be computed is designed to reflect the impacts of the uncertainties of jump frequencies on the uncertainty of the macroscopic atomic diffusivity. The numerical solution of the master equation is here employed to readily estimate the atomic diffusivity, which enables us to design the acquisition function reflecting the centrality of each elementary process.

Preprint, 2020, arXiv

Bayesian Quadrature Optimization for Probability Threshold Robustness Measure

In many product development problems, the performance of the product is governed by two types of parameters, called design parameters and environmental parameters. While the former are fully controllable, the latter vary depending on the environment in which the product is used. The challenge of such a problem is to find the design parameter that maximizes the probability that the performance of the product will meet the desired requisite level given the variation of the environmental parameter. In this paper, we formulate this practical problem as active learning (AL) problems and propose efficient algorithms with theoretically guaranteed performance. Our basic idea is to use a Gaussian process (GP) model as the surrogate model of the product development process, and then to formulate our AL problems as Bayesian Quadrature Optimization problems for the probability threshold robustness (PTR) measure. We derive credible intervals for the PTR measure and propose AL algorithms for the optimization and level set estimation of the PTR measure. We clarify the theoretical properties of the proposed algorithms and demonstrate their efficiency in both synthetic and real-world product development problems.

Preprint, 2020, arXiv

Causal analysis of competing atomistic mechanisms in ferroelectric materials from high-resolution Scanning Transmission Electron Microscopy data

Machine learning has emerged as a powerful tool for the analysis of mesoscopic and atomically resolved images and spectroscopy in electron and scanning probe microscopy, with the applications ranging from feature extraction to information compression and elucidation of relevant order parameters to inversion of imaging data to reconstruct structural models. However, the fundamental limitation of machine learning methods is their correlative nature, leading to extreme susceptibility to confounding factors. Here, we implement the workflow for causal analysis of structural scanning transmission electron microscopy (STEM) data and explore the interplay between physical and chemical effects in ferroelectric perovskite across the ferroelectric-antiferroelectric phase transitions. The combinatorial library of the Sm-doped BiFeO3 is grown to cover the composition range from pure ferroelectric BFO to orthorhombic 20% Sm-doped BFO. Atomically resolved STEM images are acquired for selected compositions and are used to create a set of local compositional, structural, and polarization field descriptors. The information-geometric causal inference (IGCI) and additive noise model (ANM) analysis are used to establish the pairwise causal directions between the descriptors, ordering the data set in the causal direction. The causal chain for IGCI and ANM across the composition is compared and suggests the presence of common causal mechanisms across the composition series. Ultimately, we believe that the causal analysis of the multimodal data will allow exploring the causal links between multiple competing mechanisms that control the emergence of unique functionalities of morphotropic materials and ferroelectric relaxors.

Preprint, 2020, arXiv

CRYSPNet: Crystal Structure Predictions via Neural Network

Structure is the most basic and important property of crystalline solids; it determines directly or indirectly most materials characteristics. However, predicting the crystal structure of solids remains a formidable and not fully solved problem. Standard theoretical tools for this task are computationally expensive and at times inaccurate. Here we present an alternative approach utilizing machine learning for crystal structure prediction. We developed a tool called Crystal Structure Prediction Network (CRYSPNet) that can predict the Bravais lattice, space group, and lattice parameters of an inorganic material based only on its chemical composition. CRYSPNet consists of a series of neural network models, using as inputs predictors aggregating the properties of the elements constituting the compound. It was trained and validated on more than 100,000 entries from the Inorganic Crystal Structure Database. The tool demonstrates robust predictive capability and outperforms alternative strategies by a large margin. Made available to the public (at https://github.com/AuroraLHT/cryspnet), it can be used either as an independent prediction engine or as a method to generate candidate structures for further computational and/or experimental validation.

Preprint, 2020, arXiv

Mean-Variance Analysis in Bayesian Optimization under Uncertainty

We consider active learning (AL) in an uncertain environment in which the trade-off between multiple risk measures needs to be considered. As an AL problem in such an uncertain environment, we study the Mean-Variance Analysis in Bayesian Optimization (MVA-BO) setting. Mean-variance analysis was developed in the field of financial engineering and has been used to make decisions that take into account the trade-off between the average and variance of investment uncertainty. In this paper, we specifically focus on the BO setting with an uncertain component and consider multi-task, multi-objective, and constrained optimization scenarios for the mean-variance trade-off of the uncertain component. When the target black-box function is modeled by a Gaussian process (GP), we derive the bounds of the two risk measures and propose an AL algorithm for each of the above three problems based on the risk measure bounds. We show the effectiveness of the proposed AL algorithms through theoretical analysis and numerical experiments.
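
The mean-variance trade-off in the multi-objective scenario can be illustrated on a toy problem. The sketch below is not the MVA-BO algorithm: it assumes the function is known and the uncertain component takes finitely many values with known probabilities, whereas the paper works with GP-based bounds on an unknown function. All names and values are hypothetical.

```python
from typing import Callable, List, Sequence, Tuple

Func = Callable[[float, float], float]


def mean_and_variance(f: Func, x: float, support: Sequence[float],
                      probs: Sequence[float]) -> Tuple[float, float]:
    """Mean and variance of f(x, w) over a discrete distribution of w."""
    values = [f(x, w) for w in support]
    mean = sum(p * v for p, v in zip(probs, values))
    var = sum(p * (v - mean) ** 2 for p, v in zip(probs, values))
    return mean, var


def pareto_designs(f: Func, xs: Sequence[float], support: Sequence[float],
                   probs: Sequence[float]) -> List[float]:
    """Designs not dominated by another design with a mean at least as
    high and a variance at least as low (one of the two strictly better)."""
    stats = [(x, *mean_and_variance(f, x, support, probs)) for x in xs]
    front = []
    for x, m, v in stats:
        dominated = any(
            m2 >= m and v2 <= v and (m2 > m or v2 < v)
            for _, m2, v2 in stats
        )
        if not dominated:
            front.append(x)
    return front


# Toy setup; every value below is an illustrative assumption.
def toy_f(x: float, w: float) -> float:
    return x * (1.0 + w)


support = [-1.0, 1.0]   # possible values of the uncertain component w
probs = [0.5, 0.5]      # their probabilities
xs = [-1.0, 0.0, 1.0, 2.0]

front = pareto_designs(toy_f, xs, support, probs)
print(front)
```

Here the mean equals `x` and the variance equals `x ** 2`, so a larger mean is paid for with a larger variance; the design `-1.0` is dropped because `0.0` has both a higher mean and a lower variance.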

preprint2020arXiv

Multi-fidelity Bayesian Optimization with Max-value Entropy Search and its parallelization

In a standard setting of Bayesian optimization (BO), the objective function evaluation is assumed to be highly expensive. Multi-fidelity Bayesian optimization (MFBO) accelerates BO by incorporating lower fidelity observations available with a lower sampling cost. In this paper, we focus on the information-based approach, which is a popular and empirically successful approach in BO. For MFBO, however, existing information-based methods are plagued by difficulty in estimating the information gain. We propose an approach based on max-value entropy search (MES), which greatly facilitates computations by considering the entropy of the optimal function value instead of the optimal input point. We show that, in our multi-fidelity MES (MF-MES), most of the additional computation, compared with the usual MES, reduces to analytical computations. Although an additional numerical integration is necessary for the information across different fidelities, this is only in a one-dimensional space and can be performed efficiently and accurately. Further, we also propose a parallelization of MF-MES. Since there exist a variety of different sampling costs, queries typically occur asynchronously in MFBO. We show that similarly simple computations can be derived for asynchronous parallel MFBO. We demonstrate the effectiveness of our approach by using benchmark datasets and a real-world application to materials science data.

preprint2020arXiv

Multi-scale Domain-adversarial Multiple-instance CNN for Cancer Subtype Classification with Unannotated Histopathological Images

We propose a new method for cancer subtype classification from histopathological images, which can automatically detect tumor-specific features in a given whole slide image (WSI). The cancer subtype should be classified by referring to a WSI, i.e., a large-sized image (typically 40,000x40,000 pixels) of an entire pathological tissue slide, which consists of cancer and non-cancer portions. One difficulty arises from the high cost associated with annotating tumor regions in WSIs. Furthermore, both global and local image features must be extracted from the WSI by changing the magnifications of the image. In addition, the image features should be stably detected despite differences in staining conditions among hospitals and specimens. In this paper, we develop a new CNN-based cancer subtype classification method by effectively combining multiple-instance, domain adversarial, and multi-scale learning frameworks in order to overcome these practical difficulties. When the proposed method was applied to malignant lymphoma subtype classifications of 196 cases collected from multiple hospitals, the classification performance was significantly better than the standard CNN or other conventional methods, and the accuracy compared favorably with that of standard pathologists.

preprint2020arXiv

Programmable Phase-change Metasurfaces on Waveguides for Multimode Photonic Convolutional Neural Network

Neuromorphic photonics has recently emerged as a promising hardware accelerator, with significant potential speed and energy advantages over digital electronics, for machine learning algorithms such as neural networks of various types. Integrated photonic networks are particularly powerful in performing analog computing of matrix-vector multiplication (MVM) as they afford unparalleled speed and bandwidth density for data transmission. Incorporating nonvolatile phase-change materials in integrated photonic devices enables indispensable programming and in-memory computing capabilities for on-chip optical computing. Here, we demonstrate a multimode photonic computing core consisting of an array of programmable mode converters based on metasurfaces made of phase-change materials. The programmable converters utilize the refractive index change of the phase-change material Ge-Sb-Te during phase transition to control the waveguide spatial modes with a very high precision of up to 64 levels in modal contrast. This contrast is used to represent the matrix elements, with 6-bit resolution and both positive and negative values, to perform MVM computation in neural network algorithms. We demonstrate an optical convolutional neural network that can perform image processing and classification tasks with high accuracy. With a broad operation bandwidth and a compact device footprint, the demonstrated multimode photonic core is very promising toward a large-scale photonic processor for high-throughput optical neural networks.
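The abstract describes matrix elements stored at 64 levels (6 bits) with both signs. The sketch below shows what such a quantized matrix-vector multiplication looks like numerically. The uniform level spacing, the weight range of [-1, 1], and the kernel and patch sizes are illustrative assumptions and do not describe the actual device encoding.

```python
import numpy as np

LEVELS = 64  # 6-bit resolution, as stated in the abstract

def quantize(weights, levels=LEVELS, w_max=1.0):
    # Snap each weight in [-w_max, w_max] to the nearest of `levels`
    # evenly spaced values (assumed uniform spacing, signed range).
    step = 2.0 * w_max / (levels - 1)
    clipped = np.clip(weights, -w_max, w_max)
    return np.round((clipped + w_max) / step) * step - w_max

rng = np.random.default_rng(0)
W = rng.uniform(-1.0, 1.0, size=(4, 9))  # e.g. four flattened 3x3 kernels
x = rng.uniform(0.0, 1.0, size=9)        # one flattened image patch

W_q = quantize(W)
y_exact = W @ x    # ideal matrix-vector product
y_quant = W_q @ x  # product with 6-bit matrix elements
max_output_error = np.abs(y_exact - y_quant).max()
```

With uniform spacing, each weight is off by at most half a step (1/63 here), so the output error for a nine-element patch with inputs in [0, 1] is bounded by 9/63.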

preprint2019arXiv

Microwave Meissner Screening of Proximity coupled Topological Insulator / Superconductor Bilayers

The proximity coupled topological insulator / superconductor (TI/SC) bilayer system is a representative system to realize topological superconductivity. In order to better understand this unique state and design devices from the TI/SC bilayer, a comprehensive understanding of the microscopic properties of the bilayer is required. In this work, a microwave Meissner screening study, which exploits a high-precision microwave resonator technique, is conducted on the SmB6/YB6 thin film bilayers as an example TI/SC system. The study reveals spatially dependent electrodynamic screening response of the TI/SC system that is not accessible to other techniques, from which the corresponding microscopic properties of a TI/SC bilayer can be obtained. The TI thickness dependence of the effective penetration depth suggests the existence of a bulk insulating region in the TI layer. The spatially dependent electrodynamic screening model analysis provides an estimate for the characteristic lengths of the TI/SC bilayer: normal penetration depth, normal coherence length, and the thickness of the surface states. We also discuss implications of these characteristic lengths on the design of a vortex Majorana device such as the radius of the vortex core, the energy splitting due to intervortex tunneling, and the minimum thickness required for a device.

preprint2016arXiv

A computational high-throughput search for new ternary superalloys

In 2006, a novel cobalt-based superalloy was discovered [1] with mechanical properties better than some conventional nickel-based superalloys. As with conventional superalloys, its high performance arises from the precipitate-hardening effect of a coherent L1$_2$ phase, which is in two-phase equilibrium with the fcc matrix. Inspired by this unexpected discovery of an L1$_2$ ternary phase, we performed a first-principles search through 2224 ternary metallic systems for analogous precipitate-hardening phases of the form $X_{3}$[$A_{0.5}, B_{0.5}$], where $X$ = Ni, Co, or Fe, and [$A,B$] = Li, Be, Mg, Al, Si, Ca, Sc, Ti, V, Cr, Mn, Fe, Co, Ni, Cu, Zn, Ga, Sr, Y, Zr, Nb, Mo, Tc, Ru, Rh, Pd, Ag, Cd, In, Sn, Sb, Hf, Ta, W, Re, Os, Ir, Pt, Au, Hg, or Tl. We found 102 systems that have a smaller decomposition energy and a lower formation enthalpy than the Co$_{3}$(Al, W) superalloy. They have a stable two-phase equilibrium with the host matrix within the concentration range $0<x<1$ ($X_{3}$[$A_{x}, B_{1-x}$]) and have a relative lattice mismatch with the host matrix of less than or equal to 5%. These new candidates, narrowed from 2224 systems, suggest possible experimental exploration for identifying new superalloys. Of these 102 systems, 37 are new; they have no reported phase diagrams in standard databases. Based on cost, experimental difficulty, and toxicity, we limit these 37 to a shorter list of six promising candidates of immediate interest. Our calculations are consistent with current experimental literature where data exists.
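The figure of 2224 ternary systems can be reproduced with a short enumeration, under one assumed counting convention: a system is an unordered set of three distinct elements drawn from the 41 listed in the abstract, and it must contain at least one of the host elements Ni, Co, or Fe. The convention is an inference from the numbers, not something the abstract states.

```python
from itertools import combinations

# Host elements X named in the abstract.
HOSTS = {"Ni", "Co", "Fe"}

# The 41 elements listed for [A, B] in the abstract.
ELEMENTS = [
    "Li", "Be", "Mg", "Al", "Si", "Ca", "Sc", "Ti", "V", "Cr",
    "Mn", "Fe", "Co", "Ni", "Cu", "Zn", "Ga", "Sr", "Y", "Zr",
    "Nb", "Mo", "Tc", "Ru", "Rh", "Pd", "Ag", "Cd", "In", "Sn",
    "Sb", "Hf", "Ta", "W", "Re", "Os", "Ir", "Pt", "Au", "Hg", "Tl",
]

# Assumed convention: unordered triples containing at least one host element.
systems = [s for s in combinations(ELEMENTS, 3) if HOSTS & set(s)]

print(len(ELEMENTS))  # 41
print(len(systems))   # 2224
```

Equivalently, the count is C(41, 3) - C(38, 3) = 10660 - 8436 = 2224: all triples minus those that contain none of the three hosts.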

preprint2016arXiv

Efficiently Bounding Optimal Solutions after Small Data Modification in Large-Scale Empirical Risk Minimization

We study large-scale classification problems in changing environments where a small part of the dataset is modified, and the effect of the data modification must be quickly incorporated into the classifier. When the entire dataset is large, even if the amount of the data modification is fairly small, the computational cost of re-training the classifier would be prohibitively large. In this paper, we propose a novel method for efficiently incorporating such a data modification effect into the classifier without actually re-training it. The proposed method provides bounds on the unknown optimal classifier with the cost only proportional to the size of the data modification. We demonstrate through numerical experiments that the proposed method provides sufficiently tight bounds with negligible computational costs, especially when a small part of the dataset is modified in a large-scale classification problem.

preprint2016arXiv

Observation of the superconducting proximity effect in the surface state of SmB6 thin films

The proximity effect at the interface between a topological insulator (TI) and a superconductor is predicted to give rise to chiral topological superconductivity and Majorana fermion excitations. In most TIs studied to date, however, the conducting bulk states have overwhelmed the transport properties and precluded the investigation of the interplay of the topological surface state and Cooper pairs. Here, we demonstrate the superconducting proximity effect in the surface state of SmB6 thin films which display bulk insulation at low temperatures. The Fermi velocity in the surface state deduced from the proximity effect is found to be as large as 10^5 m/s, in good agreement with the value obtained from a separate transport measurement. We show that high transparency between the TI and a superconductor is crucial for the proximity effect. The finding here opens the door to investigation of exotic quantum phenomena using all-thin-film multilayers with high-transparency interfaces.

preprint2016arXiv

Post Selection Inference with Kernels

We propose a novel kernel-based post selection inference (PSI) algorithm, which can handle not only non-linearity in data but also structured outputs such as multi-dimensional and multi-label outputs. Specifically, we develop a PSI algorithm for independence measures, and propose the Hilbert-Schmidt Independence Criterion (HSIC) based PSI algorithm (hsicInf). The novelty of the proposed algorithm is that it can handle non-linearity and/or structured data through kernels. Namely, the proposed algorithm can be used for a wider range of applications including nonlinear multi-class classification and multi-variate regressions, while existing PSI algorithms cannot handle them. Through synthetic experiments, we show that the proposed approach can find a set of statistically significant features for both regression and classification problems. Moreover, we apply the hsicInf algorithm to real-world data, and show that hsicInf can successfully identify important features.

preprint2016arXiv

Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining

In this paper we study predictive pattern mining problems where the goal is to construct a predictive model based on a subset of predictive patterns in the database. Our main contribution is to introduce a novel method called safe pattern pruning (SPP) for a class of predictive pattern mining problems. The SPP method allows us to efficiently find a superset of all the predictive patterns in the database that are needed for the optimal predictive model. The advantage of the SPP method over existing boosting-type methods is that the former can find the superset by a single search over the database, while the latter require multiple searches. The SPP method is inspired by recent developments in safe feature screening. In order to extend the idea of safe feature screening into predictive pattern mining, we derive a novel pruning rule called the safe pattern pruning (SPP) rule that can be used for searching over the tree defined among patterns in the database. The SPP rule has the property that, if a node corresponding to a pattern in the database is pruned out by the SPP rule, then it is guaranteed that all the patterns corresponding to its descendant nodes are never needed for the optimal predictive model. We apply the SPP method to graph mining and item-set mining problems, and demonstrate its computational advantage.

preprint2016arXiv

Secure Approximation Guarantee for Cryptographically Private Empirical Risk Minimization

Privacy concerns have become increasingly important in many machine learning (ML) problems. We study empirical risk minimization (ERM) problems under secure multi-party computation (MPC) frameworks. The main technical tools for MPC have been developed based on cryptography. One limitation of current cryptographically private ML is that it is computationally intractable to evaluate non-linear functions such as logarithmic functions or exponential functions. Therefore, for a class of ERM problems such as logistic regression in which non-linear function evaluations are required, one can only obtain approximate solutions. In this paper, we introduce a novel cryptographically private tool called the secure approximation guarantee (SAG) method. The key property of the SAG method is that, given an arbitrary approximate solution, it can provide a non-probabilistic assumption-free bound on the approximation quality under a cryptographically secure computation framework. We demonstrate the benefit of the SAG method by applying it to several problems including a practical privacy-preserving data analysis task on genomic and clinical information.

preprint2016arXiv

Selective Inference Approach for Statistically Sound Predictive Pattern Mining

Discovering statistically significant patterns from databases is an important and challenging problem. The main obstacle is the difficulty of taking into account the selection bias, i.e., the bias arising from the fact that patterns are selected from an extremely large number of candidates in databases. In this paper, we introduce a new approach for predictive pattern mining problems that can address the selection bias issue. Our approach is built on a recently popularized statistical inference framework called selective inference. In selective inference, statistical inferences (such as statistical hypothesis testing) are conducted based on sampling distributions conditional on a selection event. If the selection event is characterized in a tractable way, statistical inferences can be made without concern for the selection bias issue. However, in pattern mining problems, it is difficult to characterize the entire selection process of mining algorithms. Our main contribution in this paper is to solve this challenging problem for a class of predictive pattern mining problems by introducing a novel algorithmic framework. We demonstrate that our approach is useful for finding statistically significant patterns from databases.

preprint2016arXiv

Simultaneous Safe Screening of Features and Samples in Doubly Sparse Modeling

The problem of learning a sparse model is conceptually interpreted as the process of identifying active features/samples and then optimizing the model over them. Recently introduced safe screening allows us to identify a part of the non-active features/samples. So far, safe screening has been studied separately for either feature screening or sample screening. In this paper, we introduce a new approach for safely screening features and samples simultaneously by alternately iterating feature and sample screening steps. A significant advantage of considering them simultaneously rather than individually is that they have a synergy effect in the sense that the results of the previous safe feature screening can be exploited for improving the next safe sample screening performance, and vice-versa. We first theoretically investigate the synergy effect, and then illustrate the practical advantage through intensive numerical experiments for problems with large numbers of features and samples.

preprint2015arXiv

A machine learning-based selective sampling procedure for identifying the low energy region in a potential energy surface: a case study on proton conduction in oxides

In this paper, we propose a selective sampling procedure to preferentially evaluate a potential energy surface (PES) in a part of the configuration space governing a physical property of interest. The proposed sampling procedure is based on a machine learning method called the Gaussian process (GP), which is used to construct a statistical model of the PES for identifying the region of interest in the configuration space. We demonstrate the efficacy of the proposed procedure for atomic diffusion and ionic conduction, specifically the proton conduction in a well-studied proton-conducting oxide, barium zirconate BaZrO3. The results of the demonstration study indicate that our procedure can efficiently identify the low-energy region characterizing the proton conduction in the host crystal lattice, and that the descriptors used for the statistical PES model have a great influence on the performance.

preprint2015arXiv

Evolution of electronic states in n-type copper oxide superconductor via electric double layer gating

Since the discovery of n-type copper oxide superconductors, the evolution of electron- and hole-bands and its relation to the superconductivity have been seen as a key factor in unveiling the mechanism of high-Tc superconductors. So far, the occurrence of electrons and holes in n-type copper oxides has been achieved by chemical doping, pressure, and/or deoxygenation. However, the observed electronic properties are blurred by concomitant effects such as changes of lattice structure, disorder, etc. Here, we report on successfully tuning the electronic band structure of n-type Pr2-xCexCuO4 (x = 0.15) ultrathin films via the electric double layer transistor technique. Abnormal transport properties, such as multiple sign reversals of Hall resistivity in normal and mixed states, have been revealed within an electrostatic field in the range of -2 V to +2 V, as well as by varying the temperature and magnetic field. In the mixed state, the intrinsic anomalous Hall conductivity invokes the contribution of both electron and hole-bands as well as the energy dependent density of states near the Fermi level. The two-band model can also describe the normal state transport properties well, whereas the carrier concentrations of electrons and holes are always enhanced or depressed simultaneously in electric fields. This is in contrast to the scenario of Fermi surface reconstruction by antiferromagnetism, where an anti-correlation between electrons and holes is commonly expected. Our findings paint the picture where Coulomb repulsion plays an important role in the evolution of the electronic states in n-type cuprate superconductors.

preprint2015arXiv

First principles thermodynamical modeling of the binodal and spinodal curves in lead chalcogenides

High-throughput ab-initio calculations, cluster expansion techniques and thermodynamic modeling have been synergistically combined to characterize the binodal and spinodal decomposition features in the pseudo-binary lead chalcogenides PbSe-PbTe, PbS-PbTe, and PbS-PbSe. While our results agree with the available experimental data, our consolute temperatures substantially improve with respect to previous computational modeling. The computed phase diagrams corroborate that the formation of spinodal nanostructures causes low thermal conductivities in these alloys. The presented approach, making a rational use of online quantum repositories, can be extended to study thermodynamical and kinetic properties of materials of technological interest.

preprint2015arXiv

Homotopy Continuation Approaches for Robust SV Classification and Regression

In support vector machine (SVM) applications with unreliable data that contains a portion of outliers, non-robustness of SVMs often causes considerable performance deterioration. Although many approaches for improving the robustness of SVMs have been studied, two major challenges remain in robust SVM learning. First, robust learning algorithms are essentially formulated as non-convex optimization problems. It is thus important to develop a non-convex optimization method for robust SVM that can find a good local optimal solution. The second practical issue is how one can tune the hyperparameter that controls the balance between robustness and efficiency. Unfortunately, due to the non-convexity, robust SVM solutions with slightly different hyper-parameter values can be significantly different, which makes model selection highly unstable. In this paper, we address these two issues simultaneously by introducing a novel homotopy approach to non-convex robust SVM learning. Our basic idea is to introduce parametrized formulations of robust SVM which bridge the standard SVM and fully robust SVM via the parameter that represents the influence of outliers. We characterize the necessary and sufficient conditions of the local optimal solutions of robust SVM, and develop an algorithm that can trace a path of local optimal solutions when the influence of outliers is gradually decreased. An advantage of our homotopy approach is that it can be interpreted as simulated annealing, a common approach for finding a good local optimal solution in non-convex optimization problems. In addition, our homotopy method allows stable and efficient model selection based on the path of local optimal solutions. Empirical performances of the proposed approach are demonstrated through intensive numerical experiments both on robust classification and regression problems.

preprint2015arXiv

Quick sensitivity analysis for incremental data modification and its application to leave-one-out CV in linear classification problems

We introduce a novel sensitivity analysis framework for large scale classification problems that can be used when a small number of instances are incrementally added or removed. For quickly updating the classifier in such a situation, incremental learning algorithms have been intensively studied in the literature. Although they are much more efficient than solving the optimization problem from scratch, their computational complexity still depends on the entire training set size. This means that, if the original training set is large, completely solving an incremental learning problem might still be rather expensive. To circumvent this computational issue, we propose a novel framework that allows us to make an inference about the updated classifier without actually re-optimizing it. Specifically, the proposed framework can quickly provide lower and upper bounds on a quantity of the unknown updated classifier. The main advantage of the proposed framework is that the computational cost of computing these bounds depends only on the number of updated instances. This property is quite advantageous in a typical sensitivity analysis task where only a small number of instances are updated. In this paper we demonstrate that the proposed framework is applicable to various practical sensitivity analysis tasks, and the bounds provided by the framework are often sufficiently tight for making desired inferences.

preprint2015arXiv

Regularization Path of Cross-Validation Error Lower Bounds

Careful tuning of a regularization parameter is indispensable in many machine learning tasks because it has a significant impact on generalization performance. Nevertheless, current practice of regularization parameter tuning is more of an art than a science, e.g., it is hard to tell how many grid-points would be needed in cross-validation (CV) for obtaining a solution with sufficiently small CV error. In this paper we propose a novel framework for computing a lower bound of the CV errors as a function of the regularization parameter, which we call the regularization path of CV error lower bounds. The proposed framework can be used for providing a theoretical approximation guarantee on a set of solutions, in the sense of how far the CV error of the current best solution could be from the best possible CV error in the entire range of the regularization parameters. We demonstrate through numerical experiments that a theoretically guaranteed choice of regularization parameter in the above sense is possible with reasonable computational costs.

preprint2015arXiv

Robust Surface States indicated by Magnetotransport in SmB6 Thin Films

SmB6 has been predicted and verified as a prototype of topological Kondo insulators (TKIs). Here we report longitudinal magnetoresistance and Hall coefficient measurements on co-sputtered nanocrystalline SmB6 films and try to find possible signatures of their topological properties. The magnetoresistance (MR) at 2 K is positive and linear (LPMR) at low field and becomes negative and quadratic at higher field. While the negative part is known from the reduction of the hybridization gap due to Zeeman splitting, the positive dependence is similar to what has been observed in other topological insulators (TI). We conclude that the LPMR is a characteristic feature of TI and is related to the linear dispersion near the Dirac cone. The Hall resistance shows a sign change around 50 K. It peaks and becomes nonlinear at around 10 K then decreases below 10 K. This indicates that carriers with opposite signs emerge below 50 K. Two films with different geometries (thickness and lateral dimension) show contrasting behavior below and above 50 K, which proves the surface origin of the low temperature carriers in these films. The temperature dependence of magnetoresistance and the Hall data indicates that the surface states are likely non-trivial.

preprint2015arXiv

Safe Feature Pruning for Sparse High-Order Interaction Models

Taking into account high-order interactions among covariates is valuable in many practical regression problems. This is, however, a computationally challenging task because the number of high-order interaction features to be considered would be extremely large unless the number of covariates is sufficiently small. In this paper, we propose a novel efficient algorithm for LASSO-based sparse learning of such high-order interaction models. Our basic strategy for reducing the number of features is to employ the recently proposed safe feature screening (SFS) rule. An SFS rule has the property that, if a feature satisfies the rule, then the feature is guaranteed to be non-active in the LASSO solution, meaning that it can be safely screened out prior to the LASSO training process. If a large number of features can be screened out before training the LASSO, the computational cost and the memory requirement can be dramatically reduced. However, applying such an SFS rule to each of the extremely large number of high-order interaction features would be computationally infeasible. Our key idea for solving this computational issue is to exploit the underlying tree structure among high-order interaction features. Specifically, we introduce a pruning condition called the safe feature pruning (SFP) rule, which has the property that, if the rule is satisfied in a certain node of the tree, then all the high-order interaction features corresponding to its descendant nodes can be guaranteed to be non-active at the optimal solution. Our algorithm is extremely efficient, making it possible to work, e.g., with 3rd order interactions of 10,000 original covariates, where the number of possible high-order interaction features is greater than 10^{12}.

preprint2015arXiv

Sparse selection of bases in neural-network potential for crystalline and liquid Si

A neural-network interatomic potential for crystalline and liquid Si has been developed using the forward stepwise regression technique to reduce the number of bases while maintaining the accuracy of the potential. This approach to building the neural-network potential enables us to construct accurate interatomic potentials with fewer, more important bases that are selected systematically rather than heuristically. The evaluation of bulk crystalline properties and of dynamic properties of liquid Si shows good agreement between the neural-network potential and ab-initio results.

preprint2014arXiv

An Algorithmic Framework for Computing Validation Performance Bounds by Using Suboptimal Models

Practical model building processes are often time-consuming because many different models must be trained and validated. In this paper, we introduce a novel algorithm that can be used for computing the lower and the upper bounds of model validation errors without actually training the model itself. A key idea behind our algorithm is using side information available from a suboptimal model. If a reasonably good suboptimal model is available, our algorithm can compute lower and upper bounds of many useful quantities for making inferences on the unknown target model. We demonstrate the advantage of our algorithm in the context of model selection for regularized learning problems.

preprint2014arXiv

Anomalous magnetoresistance in the spinel superconductor LiTi2O4

Transition-metal oxides offer an opportunity to explore unconventional superconductors, where the superconductivity (SC) is often interrelated with novel phenomena such as spin/charge order, fluctuations, and Fermi surface instability (1-3). LiTi2O4 (LTO) is a unique compound in that it is the only known spinel oxide superconductor. In addition to electron-phonon coupling, electron-electron and spin fluctuation contributions have been suggested as playing important roles in the microscopic mechanism for its superconductivity (4-8). However, the lack of high quality single crystals has thus far prevented systematic investigation of their transport properties (9). Here, we report a careful study of transport and tunneling spectroscopy in epitaxial LTO thin films. In the superconducting state, the energy gap was found to decrease as a quadratic function of magnetic field. In the normal state, an unusual magnetoresistance (MR) was observed where it changes from anisotropic positive to isotropic negative as the temperature is increased. A constant charge carrier concentration without any abrupt change in lattice parameters as a function of temperature suggests that the isotropic MR stems from the suppression of spin scattering/fluctuations, while the anisotropic term originates from an orbital contribution. These observations point to an important role strong correlations play in this unique superconductor.

preprint2014arXiv

Change in the magnetic structure of (Bi,Sm)FeO3 thin films at the morphotropic phase boundary probed by neutron diffraction

We report on the evolution of the magnetic structure of BiFeO3 thin films grown on SrTiO3 substrates as a function of Sm doping. We determined the magnetic structure using neutron diffraction. We found that as Sm increases, the magnetic structure evolves from a cycloid to a G-type antiferromagnet at the morphotropic phase boundary, where there is a large piezoelectric response due to an electric-field induced structural transition. The occurrence of the magnetic structural transition at the morphotropic phase boundary offers another route towards room temperature multiferroic devices.

preprint2014arXiv

Safe Sample Screening for Support Vector Machines

Sparse classifiers such as support vector machines (SVMs) are efficient in test phases because the classifier is characterized only by a subset of the samples called support vectors (SVs), and the rest of the samples (non-SVs) have no influence on the classification result. However, the advantage of the sparsity has not been fully exploited in training phases because it is generally difficult to know beforehand which samples turn out to be SVs. In this paper, we introduce a new approach called safe sample screening that enables us to identify a subset of the non-SVs and screen them out prior to the training phase. Our approach is different from existing heuristic approaches in the sense that the screened samples are guaranteed to be non-SVs at the optimal solution. We investigate the advantage of the safe sample screening approach through intensive numerical experiments, and demonstrate that it can substantially decrease the computational cost of state-of-the-art SVM solvers such as LIBSVM. In the current big data era, we believe that safe sample screening would be of great practical importance since the data size can be reduced without sacrificing the optimality of the final solution.

preprint 2013 arXiv

Combinatorial search of superconductivity in Fe-B composition spreads

We have fabricated Fe-B thin film composition spreads in search of possible superconducting phases following a theoretical prediction by Kolmogorov et al. [1]. Co-sputtering was used to deposit spreads covering a large compositional region of the Fe-B binary phase diagram. A trace of a superconducting phase was found in the nanocrystalline part of the spread, where the film undergoes a metal-to-insulator transition as a function of composition in a region with an average composition of FeB2. The resistance drop occurs at 4 K, and a diamagnetic signal has also been detected at the same temperature. The superconductivity can be suppressed by a magnetic field of up to 2 tesla.

preprint 2012 arXiv

Density-Difference Estimation

We address the problem of estimating the difference between two probability densities. A naive approach is a two-step procedure of first estimating two densities separately and then computing their difference. However, such a two-step procedure does not necessarily work well because the first step is performed without regard to the second step and thus a small error incurred in the first stage can cause a big error in the second stage. In this paper, we propose a single-shot procedure for directly estimating the density difference without separately estimating two densities. We derive a non-parametric finite-sample error bound for the proposed single-shot density-difference estimator and show that it achieves the optimal convergence rate. The usefulness of the proposed method is also demonstrated experimentally.
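For reference, the naive two-step baseline criticized above can be sketched in a few lines. This is the baseline only, not the proposed single-shot estimator. The Gaussian kernel, the fixed `bandwidth`, and the synthetic one-dimensional samples are illustrative assumptions that do not come from the abstract.

```python
import math
import random

def gaussian_kde(samples, bandwidth):
    """Return a function that estimates the density of 1-D samples."""
    norm = 1.0 / (len(samples) * bandwidth * math.sqrt(2.0 * math.pi))

    def density(x):
        return norm * sum(
            math.exp(-0.5 * ((x - s) / bandwidth) ** 2) for s in samples
        )

    return density

def naive_density_difference(samples_p, samples_q, bandwidth):
    """Two-step baseline: estimate each density separately, then subtract."""
    p_hat = gaussian_kde(samples_p, bandwidth)  # step 1a
    q_hat = gaussian_kde(samples_q, bandwidth)  # step 1b
    return lambda x: p_hat(x) - q_hat(x)        # step 2

# Synthetic example: two unit-variance Gaussians with means 0 and 1.
rng = random.Random(0)
samples_p = [rng.gauss(0.0, 1.0) for _ in range(500)]
samples_q = [rng.gauss(1.0, 1.0) for _ in range(500)]
diff = naive_density_difference(samples_p, samples_q, bandwidth=0.4)
```

Each density estimate is tuned without regard to the subtraction that follows, which is the weakness the abstract points out: the errors of the two separate fits need not cancel in the difference.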

preprint 2012 arXiv

Probing the Order Parameter of Superconducting LiFeAs using Pb/LiFeAs and Au/LiFeAs Point-Contact Spectroscopy

We have fabricated c-axis point contact junctions between high-quality LiFeAs single crystals and Pb or Au tips in order to study the nature of the superconducting order parameter of LiFeAs, one of the few stoichiometric iron-based superconductors. The observation of the Josephson current in c-axis junctions with a conventional s-wave superconductor as the counterelectrode indicates that the pairing symmetry in LiFeAs is not pure d-wave or pure spin-triplet p-wave. A superconducting gap is clearly observed in point contact Andreev reflection measurements performed on both Pb/LiFeAs and Au/LiFeAs junctions. The conductance spectra can be well described by the Blonder-Tinkham-Klapwijk model with a lifetime broadening term, resulting in a gap value of ≈ 1.6 meV (2Δ/kBTC ≈ 2.2).
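The two numbers quoted at the end of the abstract can be related with a short calculation. The sketch below only rearranges the definition of the ratio 2Δ/kBTC. The function names are illustrative, and the resulting temperature is what the two rounded values imply, not a figure stated in the abstract.

```python
# Boltzmann constant in meV per kelvin.
K_B_MEV_PER_K = 8.617333262e-2

def gap_ratio(delta_mev, tc_kelvin):
    """Dimensionless ratio 2*Delta / (k_B * T_c)."""
    return 2.0 * delta_mev / (K_B_MEV_PER_K * tc_kelvin)

def implied_tc(delta_mev, ratio):
    """Transition temperature (K) implied by a gap and a gap ratio."""
    return 2.0 * delta_mev / (K_B_MEV_PER_K * ratio)

# Values quoted in the abstract: Delta of about 1.6 meV, ratio of about 2.2.
tc = implied_tc(1.6, 2.2)  # roughly 17 K
```

Since both inputs are rounded, the implied temperature should be read as an order-of-magnitude consistency check only.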

preprint 2011 arXiv

Suboptimal Solution Path Algorithm for Support Vector Machine

We consider a suboptimal solution path algorithm for the Support Vector Machine. The solution path algorithm is an effective tool for solving a sequence of parametrized optimization problems in machine learning. The path of solutions provided by this algorithm is very accurate, and the solutions satisfy the optimality conditions more strictly than those of other SVM optimization algorithms. In many machine learning applications, however, this strict optimality is often unnecessary, and it adversely affects the computational efficiency. Our algorithm can generate the path of suboptimal solutions within an arbitrary user-specified tolerance level. It allows us to control the trade-off between the accuracy of the solution and the computational cost. Moreover, we show that our suboptimal solutions can be interpreted as the solution of a perturbed optimization problem derived from the original one. We provide some theoretical analyses of our algorithm based on this novel interpretation. The experimental results also demonstrate the effectiveness of our algorithm.

preprint 2010 arXiv

Active microcantilevers based on piezoresistive ferromagnetic thin films

We report piezoresistivity in magnetic thin films of FeGa and their use for fabricating self-transducing microcantilevers. The actuation occurs as a consequence of both the ferromagnetic and magnetostrictive properties of FeGa thin films, while the deflection readout is achieved by exploiting the piezoresistivity of these films. This self-sensing, self-actuating micromechanical system involves a very simple bilayer structure, which eliminates the need for the more complex piezoelectric stack that is commonly used in active cantilevers. Thus, it potentially opens opportunities for remotely actuated, cantilever-based sensors.

preprint 2010 arXiv

Atomic resolution imaging at 2.5 GHz using near-field microwave microscopy

Atomic resolution imaging is demonstrated using a hybrid scanning tunneling/near-field microwave microscope (microwave-STM). The microwave channels of the microscope correspond to the resonant frequency and quality factor of a coaxial microwave resonator, which is built into the STM scan head and coupled to the probe tip. We find that when the tip-sample distance is within the tunneling regime, we obtain atomic resolution images using the microwave channels of the microwave-STM. We attribute the atomic contrast in the microwave channels to GHz-frequency current through the tip-sample tunnel junction. Images of the surfaces of HOPG and Au(111) are presented.

preprint 2010 arXiv

Evidence of a universal and isotropic 2Δ/kBTC ratio in 122-type iron pnictide superconductors over a wide doping range

We have systematically investigated the doping and directional dependence of the gap structure in the 122-type iron pnictide superconductors by point contact Andreev reflection spectroscopy. The studies were performed on single crystals of Ba1-xKxFe2As2 (x = 0.29, 0.49, and 0.77) and SrFe1.74Co0.26As2 with a sharp tip of Pb or Au pressed along the c-axis or the ab-plane direction. The conductance spectra obtained on highly transparent contacts clearly show evidence of a robust superconducting gap. The normalized curves can be well described by the Blonder-Tinkham-Klapwijk model with a lifetime broadening. The determined gap value scales very well with the transition temperature, giving a 2Δ/kBTC value of ~3.1. The results suggest the presence of a universal coupling behavior in this class of iron pnictides over a broad doping range and independent of the sign of the doping. Moreover, conductance spectra obtained on c-axis junctions and ab-plane junctions indicate that the observed gap is isotropic in these superconductors.

preprint 2010 arXiv

Multi-parametric Solution-path Algorithm for Instance-weighted Support Vector Machines

An instance-weighted variant of the support vector machine (SVM) has attracted considerable attention recently since it is useful in various machine learning tasks such as non-stationary data analysis, heteroscedastic data modeling, transfer learning, learning to rank, and transduction. An important challenge in these scenarios is to overcome the computational bottleneck: instance weights often change dynamically or adaptively, and thus the weighted SVM solutions must be repeatedly computed. In this paper, we develop an algorithm that can efficiently and exactly update the weighted SVM solutions for arbitrary changes of instance weights. Technically, this contribution can be regarded as an extension of the conventional solution-path algorithm for a single regularization parameter to multiple instance-weight parameters. However, this extension gives rise to a significant problem: breakpoints (at which the solution path turns) have to be identified in a high-dimensional space. To facilitate this, we introduce a parametric representation of instance weights. We also provide a geometric interpretation in weight space using the notion of a critical region: a polyhedron in which the current affine solution remains optimal. We then find breakpoints at intersections of the solution path and the boundaries of these polyhedra. Through extensive experiments on various practical applications, we demonstrate the usefulness of the proposed algorithm.

56 published item(s)

Post-ADC Inference: Valid Inference After Active Data Collection

Quantum Kernel Machine Learning for Autonomous Materials Science

Real-time Multi-instrument Autonomous Discovery of Novel Phase-change Memory Materials

Valid P-Value for Deep Learning-Driven Salient Region

Bayesian Optimization for Distributionally Robust Chance-constrained Problem

Benchmarking Active Learning Strategies for Materials Optimization and Discovery

Chiral Spin Bobbers in Exchange-Coupled Hard-Soft Magnetic Bilayers

Conditional Selective Inference for Robust Regression and Outlier Detection using Piecewise-Linear Homotopy Continuation

Exact Statistical Inference for the Wasserstein Distance by Selective Inference

Hypothesis Learning in Automated Experiment: Application to Combinatorial Materials Libraries

Physics in the Machine: Integrating Physical Knowledge in Autonomous Phase-Mapping

Active learning for distributionally robust level-set estimation

Computing Valid p-value for Optimal Changepoint by Selective Inference using Dynamic Programming

Exploring physics of ferroelectric domain walls via Bayesian analysis of atomically resolved STEM data

Mapping causal patterns in crystalline solids

Parametric Programming Approach for More Powerful and General Lasso Selective Inference

Topic Analysis of Superconductivity Literature by Semantic Non-negative Matrix Factorization

Universal scaling of the critical temperature and the strange-metal scattering rate in unconventional superconductors

A Sampling Strategy in Efficient Potential Energy Surface Mapping for Predicting Atomic Diffusivity in Crystals by Machine Learning

Bayesian Quadrature Optimization for Probability Threshold Robustness Measure

Causal analysis of competing atomistic mechanisms in ferroelectric materials from high-resolution Scanning Transmission Electron Microscopy data

CRYSPNet: Crystal Structure Predictions via Neural Network

Mean-Variance Analysis in Bayesian Optimization under Uncertainty

Multi-fidelity Bayesian Optimization with Max-value Entropy Search and its parallelization

Multi-scale Domain-adversarial Multiple-instance CNN for Cancer Subtype Classification with Unannotated Histopathological Images

Programmable Phase-change Metasurfaces on Waveguides for Multimode Photonic Convolutional Neural Network

Microwave Meissner Screening of Proximity coupled Topological Insulator / Superconductor Bilayers

A computational high-throughput search for new ternary superalloys

Efficiently Bounding Optimal Solutions after Small Data Modification in Large-Scale Empirical Risk Minimization

Observation of the superconducting proximity effect in the surface state of SmB6 thin films

Post Selection Inference with Kernels

Safe Pattern Pruning: An Efficient Approach for Predictive Pattern Mining

Secure Approximation Guarantee for Cryptographically Private Empirical Risk Minimization

Selective Inference Approach for Statistically Sound Predictive Pattern Mining

Simultaneous Safe Screening of Features and Samples in Doubly Sparse Modeling

A machine learning-based selective sampling procedure for identifying the low energy region in a potential energy surface: a case study on proton conduction in oxides

Evolution of electronic states in n-type copper oxide superconductor via electric double layer gating

First principles thermodynamical modeling of the binodal and spinodal curves in lead chalcogenides

Homotopy Continuation Approaches for Robust SV Classification and Regression

Quick sensitivity analysis for incremental data modification and its application to leave-one-out CV in linear classification problems

Regularization Path of Cross-Validation Error Lower Bounds

Robust Surface States indicated by Magnetotransport in SmB6 Thin Films

Safe Feature Pruning for Sparse High-Order Interaction Models

Sparse selection of bases in neural-network potential for crystalline and liquid Si

An Algorithmic Framework for Computing Validation Performance Bounds by Using Suboptimal Models

Anomalous magnetoresistance in the spinel superconductor LiTi2O4

Change in the magnetic structure of (Bi,Sm)FeO3 thin films at the morphotropic phase boundary probed by neutron diffraction

Safe Sample Screening for Support Vector Machines

Combinatorial search of superconductivity in Fe-B composition spreads

Density-Difference Estimation

Probing the Order Parameter of Superconducting LiFeAs using Pb/LiFeAs and Au/LiFeAs Point-Contact Spectroscopy

Suboptimal Solution Path Algorithm for Support Vector Machine

Active microcantilevers based on piezoresistive ferromagnetic thin films

Atomic resolution imaging at 2.5 GHz using near-field microwave microscopy

Evidence of a universal and isotropic 2Δ/kBTC ratio in 122-type iron pnictide superconductors over a wide doping range

Multi-parametric Solution-path Algorithm for Instance-weighted Support Vector Machines