Researcher profile

Daisuke Murakami

Daisuke Murakami contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 19 - UnverifiedVerification L1Unclaimed author
5works
0followers
2topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

5 published item(s)

preprint2026arXiv

Coarse-to-fine spatial modeling: A scalable, machine-learning-compatible spatial model

This study proposes coarse-to-fine spatial modeling (CFSM) as a scalable and machine learning-compatible alternative to conventional spatial process models. Unlike conventional covariance-based spatial models, CFSM represents spatial processes using a multiscale ensemble of local models. To ensure stable model training, larger-scale patterns that are easier to learn are modeled first, followed by smaller-scale patterns, with training terminated once the validation score stops improving. The training procedure, which is based on holdout validation, can be easily integrated with other machine learning algorithms, including random forests and neural networks. CFSM training is computationally efficient because it avoids explicit matrix inversion, which is a major computational bottleneck in conventional spatial Gaussian processes. Comparative Monte Carlo experiments demonstrated that the CFSM, as well as its integration with random forests, achieved superior predictive performance compared to existing models. Finally, we applied the proposed methods to an analysis of residential land prices in the Tokyo metropolitan area, Japan. The CFSM is implemented in an R package spCF (https://cran.r-project.org/web/packages/spCF/).

preprint2021arXiv

Improved log-Gaussian approximation for over-dispersed Poisson regression: application to spatial analysis of COVID-19

In the era of open data, Poisson and other count regression models are increasingly important. Still, conventional Poisson regression has remaining issues in terms of identifiability and computational efficiency. Especially, due to an identification problem, Poisson regression can be unstable for small samples with many zeros. Provided this, we develop a closed-form inference for an over-dispersed Poisson regression including Poisson additive mixed models. The approach is derived via mode-based log-Gaussian approximation. The resulting method is fast, practical, and free from the identification problem. Monte Carlo experiments demonstrate that the estimation error of the proposed method is a considerably smaller estimation error than the closed-form alternatives and as small as the usual Poisson regressions. For counts with many zeros, our approximation has better estimation accuracy than conventional Poisson regression. We obtained similar results in the case of Poisson additive mixed modeling considering spatial or group effects. The developed method was applied for analyzing COVID-19 data in Japan. This result suggests that influences of pedestrian density, age, and other factors on the number of cases change over periods.

preprint2020arXiv

Scalable GWR: A linear-time algorithm for large-scale geographically weighted regression with polynomial kernels

Although a number of studies have developed fast geographically weighted regression (GWR) algorithms for large samples, none of them has achieved linear-time estimation, which is considered a requisite for big data analysis in machine learning, geostatistics, and related domains. Against this backdrop, this study proposes a scalable GWR (ScaGWR) for large datasets. The key improvement is the calibration of the model through a pre-compression of the matrices and vectors whose size depends on the sample size, prior to the leave-one-out cross-validation, which is the heaviest computational step in conventional GWR. This pre-compression allows us to run the proposed GWR extension so that its computation time increases linearly with the sample size. With this improvement, the ScaGWR can be calibrated with one million observations without parallelization. Moreover, the ScaGWR estimator can be regarded as an empirical Bayesian estimator that is more stable than the conventional GWR estimator. We compare the ScaGWR with the conventional GWR in terms of estimation accuracy and computational efficiency using a Monte Carlo simulation. Then, we apply these methods to a US income analysis. The code for ScaGWR is available in the R package scgwr. The code is embedded into C++ code and implemented in another R package, GWmodel.

preprint2020arXiv

Spatiotemporal analysis of urban heatwaves using Tukey g-and-h random field models

The statistical quantification of temperature processes for the analysis of urban heat island (UHI) effects and local heat-waves is an increasingly important application domain in smart city dynamic modelling. This leads to the increased importance of real-time heatwave risk management on a fine-grained spatial resolution. This study attempts to analyze and develop new methods for modelling the spatio-temporal behavior of ground temperatures. The developed models consider higher-order stochastic spatial properties such as skewness and kurtosis, which are key components for understanding and describing local temperature fluctuations and UHI's. The developed models are applied to the greater Tokyo metropolitan area for a detailed real-world data case study. The analysis also demonstrates how to statistically incorporate a variety of real data sets. This includes remotely sensed imagery and a variety of ground-based monitoring site data to build models linking city and urban covariates to air temperature. The air temperature models are then used to capture high-resolution spatial emulator outputs for ground surface temperature modelling. The main class of processes studied includes the Tukey g-and-h processes for capturing spatial and temporal aspects of heat processes in urban environments.

preprint2020arXiv

The GWR route map: a guide to the informed application of Geographically Weighted Regression

Geographically Weighted Regression (GWR) is increasingly used in spatial analyses of social and environmental data. It allows spatial heterogeneities in processes and relationships to be investigated through a series of local regression models rather than a global one. Standard GWR assumes that the relationships between the response and predictor variables operate at the same spatial scale, which is frequently not the case. To address this, several GWR variants have been proposed. This paper describes a route map to inform the choice of whether to use a GWR model or not, and if so which of three core variants to apply: a standard GWR, a mixed GWR or a multiscale GWR (MS-GWR). The route map comprises primary steps: a basic linear regression, a MS-GWR, and investigations of the results of these. The paper provides guidance for deciding whether to use a GWR approach, and if so for determining the appropriate GWR variant. It describes the importance of investigating a number of secondary issues at global and local scales including collinearity, the influence of outliers, and dependent error terms. Code and data for the case study used to illustrate the route map are provided, and further considerations are described in an extensive Appendix.