Researcher profile

Tomonari Sei

Tomonari Sei contributes to research discovery and scholarly infrastructure.

Researcher, affiliation not imported, open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging
Verification L1
Unclaimed author
13 works
0 followers
9 topics
4 close collaborators

Published work

13 published items

preprint, 2026, arXiv

Minimum information Markov model

The analysis of high-dimensional time series data has become increasingly important across a wide range of fields. Recently, a method for constructing the minimum information Markov kernel on finite state spaces was established. In this study, we propose a statistical model based on a parametrization of its dependence function, which we call the Minimum Information Markov Model. We show that its parametrization induces an orthogonal structure between the stationary distribution and the dependence function, and that the model arises as the optimal solution to a divergence rate minimization problem. In particular, for the Gaussian autoregressive case, we establish the existence of the optimal solution to this minimization problem, a nontrivial result requiring a rigorous proof. For parameter estimation, our approach exploits the conditional independence structure inherent in the model, which is supported by the orthogonality. Specifically, we develop several estimators, including conditional likelihood and pseudo likelihood estimators, for the minimum information Markov model in both univariate and multivariate settings. We demonstrate their practical performance through simulation studies and applications to real-world time series data.

preprint, 2022, arXiv

A proper scoring rule for minimum information copulas

Multi-dimensional distributions whose marginal distributions are uniform are called copulas. Among them, the one that satisfies given constraints on expectation and is closest to the independent distribution in the sense of Kullback-Leibler divergence is called the minimum information copula. The density function of the minimum information copula contains a set of functions called the normalizing functions, which are often difficult to compute. Although a number of proper scoring rules have been proposed for probability distributions having normalizing constants, such as exponential families, these scores are not applicable to minimum information copulas because of the normalizing functions. In this paper, we propose the conditional Kullback-Leibler score, which avoids computation of the normalizing functions. The main idea of its construction is to use pairs of observations. We show that the proposed score is strictly proper in the space of copula density functions and therefore the estimator derived from it is asymptotically consistent. Furthermore, the score is convex with respect to the parameters and can be easily optimized by gradient methods.
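
The role of pairs of observations can be illustrated with a minimal sketch. It assumes a bivariate density of the form c(u, v) = exp(theta * h(u, v) + a(u) + b(v)), where a and b play the role of the normalizing functions. Under that assumption, the conditional probability of the observed pairing of two observations, against the swapped pairing, does not involve a or b. The feature h(u, v) = u * v and the names pair_loss and full_loss are hypothetical choices for illustration, and the paper's exact definition of the conditional Kullback-Leibler score may differ.

```python
import math

def h(u, v):
    # Hypothetical dependence feature; the abstract does not fix one.
    return u * v

def pair_loss(theta, p1, p2):
    """Negative log conditional probability of the observed pairing.

    Computed from h alone: the normalizing functions never appear.
    """
    (u1, v1), (u2, v2) = p1, p2
    d = h(u1, v1) + h(u2, v2) - h(u1, v2) - h(u2, v1)
    z = -theta * d
    # Numerically stable evaluation of log(1 + exp(z)).
    return max(z, 0.0) + math.log1p(math.exp(-abs(z)))

def full_loss(theta, p1, p2, a, b):
    """Same quantity from full log-densities with arbitrary a and b."""
    def logc(u, v):
        return theta * h(u, v) + a(u) + b(v)
    (u1, v1), (u2, v2) = p1, p2
    same = logc(u1, v1) + logc(u2, v2)
    swapped = logc(u1, v2) + logc(u2, v1)
    return math.log(math.exp(same) + math.exp(swapped)) - same
```

In this sketch the loss is a logistic loss of a linear function of theta, so it is convex in theta, which is consistent with the abstract's remark that the score can be optimized by gradient methods.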

preprint, 2022, arXiv

Improving Randomization Tests under Interference Based on Power Analysis

In causal inference, we can consider a situation in which treatment on one unit affects others, i.e., interference exists. In the presence of interference, we cannot perform a classical randomization test directly because a null hypothesis is not sharp. Instead, we need to perform the randomization test restricted to a subset of units and assignments that makes the null hypothesis sharp. A previous study constructed a useful testing method, a biclique test, by reducing the selection of the appropriate subsets to searching for bicliques in a bipartite graph. However, since the power depends on the features of selected subsets, there is still room to improve the power by refining the selection procedure. In this paper, we propose a method to improve the biclique test based on a power evaluation of the randomization test. We explicitly derived an expression for the power of the randomization test under several assumptions and found that a certain quantity calculated from a given assignment set characterizes the power. Based on this fact, we propose a method to improve the power of the biclique test by modifying the selection rule for subsets of units and assignments. Through a simulation with a spatial interference setting, we confirm that the proposed method has higher power than the existing method.
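
For reference, the classical randomization test that the abstract takes as its starting point can be sketched as follows. This is a minimal illustration of the no-interference case under the sharp null of no effect for any unit, not the biclique test or the proposed refinement. The function name, the difference-in-means statistic and the toy data are assumptions made for the example.

```python
from itertools import combinations

def randomization_p_value(outcomes, treated):
    """One-sided exact p-value under the sharp null of no effect.

    Under the sharp null, outcomes are fixed regardless of assignment,
    so the statistic can be recomputed for every possible assignment.
    """
    n, k = len(outcomes), len(treated)

    def diff_in_means(group):
        group = set(group)
        t = [outcomes[i] for i in group]
        c = [outcomes[i] for i in range(n) if i not in group]
        return sum(t) / len(t) - sum(c) / len(c)

    observed = diff_in_means(treated)
    # Enumerate every assignment with the same number of treated units.
    stats = [diff_in_means(g) for g in combinations(range(n), k)]
    return sum(s >= observed for s in stats) / len(stats)

# Toy data: six units, the first three treated.
p = randomization_p_value([5, 6, 7, 1, 2, 3], treated=(0, 1, 2))
```

Under interference the null hypothesis no longer fixes every outcome, which is why the abstract restricts the test to subsets of units and assignments for which the null is sharp.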

preprint, 2013, arXiv

Calculating the normalising constant of the Bingham distribution on the sphere using the holonomic gradient method

In this paper we implement the holonomic gradient method to exactly compute the normalising constant of Bingham distributions. This idea was originally applied to general Fisher-Bingham distributions in Nakayama et al. (2011). Here we explicitly apply this algorithm to show the exact calculation of the normalising constant, derive the Pfaffian system explicitly for this parametric case, implement the general approach for the maximum likelihood solution search, and finally adjust the method for degenerate cases, namely when the parameter values have multiplicities.

preprint, 2013, arXiv

Infinitely imbalanced binomial regression and deformed exponential families

The logistic regression model is known to converge to a Poisson point process model if the binary response tends to be infinitely imbalanced. In this paper, it is shown that this phenomenon is universal in a wide class of link functions for binomial regression. The proof relies on extreme value theory. For the logit, probit and complementary log-log link functions, the intensity measure of the point process becomes an exponential family. For some other link functions, deformed exponential families appear. A penalized maximum likelihood estimator for the Poisson point process model is suggested.

preprint, 2013, arXiv

Properties and applications of Fisher distribution on the rotation group

We study properties of the Fisher distribution (von Mises-Fisher distribution, matrix Langevin distribution) on the rotation group SO(3). In particular we apply the holonomic gradient descent, introduced by Nakayama et al. (2011), and a method of series expansion for evaluating the normalizing constant of the distribution and for computing the maximum likelihood estimate. The rotation group can be identified with the Stiefel manifold of two orthonormal vectors. Therefore, from the viewpoint of statistical modeling, it is of interest to compare Fisher distributions on these manifolds. We illustrate the difference with an example of near-earth object data.

preprint, 2011, arXiv

Cones of elementary imsets and supermodular functions: a review and some new results

In this paper we give a review of the method of imsets introduced by Studeny (2005) from a geometric point of view. Elementary imsets span a polyhedral cone and its dual cone is the cone of supermodular functions. We review basic facts on the structure of these cones. Then we derive some new results on the following topics: i) extreme rays of the cone of standardized supermodular functions, ii) faces of the cones, iii) small relations among elementary imsets, and iv) some computational results on Markov basis for the toric ideal defined by elementary imsets.

preprint, 2011, arXiv

Hierarchical subspace models for contingency tables

For statistical analysis of multiway contingency tables we propose modeling interaction terms in each maximal compact component of a hierarchical model. By this approach we can search for parsimonious models with smaller degrees of freedom than the usual hierarchical model, while preserving conditional independence structures in the hierarchical model. We discuss estimation and exact tests of the proposed model and illustrate the advantage of the proposed modeling with some data sets.

preprint, 2011, arXiv

On optimal stationary couplings between stationary processes

By a classical result of Gray, Neuhoff and Shields (1975) the $\bar\varrho$ distance between stationary processes is identified with an optimal stationary coupling problem of the corresponding stationary measures on the infinite product spaces. This is a modification of the optimal coupling problem from Monge--Kantorovich theory. In this paper we derive some general classes of examples of optimal stationary couplings for which the $\bar\varrho$ distance can be calculated in explicit form. We also extend the $\bar\varrho$ distance to random fields and to general nonmetric distance functions and give a construction method for optimal stationary $\bar c$-couplings. In this case our assumptions require a geometric positive curvature condition.

preprint, 2011, arXiv

Properties of semi-elementary imsets as sums of elementary imsets

We study properties of semi-elementary imsets and elementary imsets introduced by Studeny (2005). The rules of the semi-graphoid axiom (decomposition, weak union and contraction) for conditional independence statements can be translated into a simple identity among three semi-elementary imsets. By recursively applying the identity, any semi-elementary imset can be written as a sum of elementary imsets, which we call a representation of the semi-elementary imset. A semi-elementary imset has many representations. We study properties of the set of possible representations of a semi-elementary imset and prove that all representations are connected by relations among four elementary imsets.
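
The identity among three semi-elementary imsets can be checked mechanically. The sketch below assumes the usual definition u(A,B|C) = delta_ABC + delta_C - delta_AC - delta_BC for pairwise disjoint A, B, C, and verifies u(A,BC|D) = u(A,B|CD) + u(A,C|D) on one example. The helper names and the dictionary representation are illustrative choices, not taken from the paper.

```python
from collections import Counter

def semi_elementary(A, B, C=()):
    """Imset u(A,B|C) as a sparse map from subsets to integer coefficients.

    A, B, C are assumed to be pairwise disjoint.
    """
    A, B, C = frozenset(A), frozenset(B), frozenset(C)
    u = Counter()
    u[A | B | C] += 1
    u[C] += 1
    u[A | C] -= 1
    u[B | C] -= 1
    # Drop zero coefficients so equal imsets compare equal.
    return {s: v for s, v in u.items() if v != 0}

def add(u, v):
    """Coefficient-wise sum of two imsets."""
    total = Counter(u)
    for s, c in v.items():
        total[s] += c
    return {s: c for s, c in total.items() if c != 0}

# Example with singletons A={1}, B={2}, C={3}, D={4}.
lhs = semi_elementary({1}, {2, 3}, {4})
rhs = add(semi_elementary({1}, {2}, {3, 4}), semi_elementary({1}, {3}, {4}))
```

Applying the same decomposition repeatedly to the terms on the right-hand side is what yields a representation as a sum of elementary imsets, in the sense described in the abstract.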

preprint, 2010, arXiv

A Jacobian inequality for gradient maps on the sphere and its application to directional statistics

In the field of optimal transport theory, an optimal map is known to be a gradient map of a potential function satisfying cost-convexity. In this paper, the Jacobian determinant of a gradient map is shown to be log-concave with respect to a convex combination of the potential functions when the underlying manifold is the sphere and the cost function is the distance squared. The proof uses the non-negative cross-curvature property of the sphere recently established by Kim and McCann, and Figalli and Rifford. As an application to statistics, a new family of probability densities on the sphere is defined in terms of cost-convex functions. The log-concave property of the likelihood function follows from the inequality.