Source author record

Michael J. Crowther

Michael J. Crowther appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher (unclaimed source record)

Topics: Methodology, Computation, Applications, stat.OT

Catalog footprint

What is connected

6 works

4 topics

4 close collaborators


Preprint, 2020, arXiv

A flexible parametric accelerated failure time model

Accelerated failure time (AFT) models are used widely in medical research, though to a much lesser extent than proportional hazards models. In an AFT model, the effect of covariates acts to accelerate or decelerate the time to the event of interest, i.e. to shorten or extend the time to event. Commonly used parametric AFT models are limited in the underlying shapes that they can capture. In this article, we propose a general parametric AFT model, and in particular concentrate on using restricted cubic splines to model the baseline to provide substantial flexibility. We then extend the model to accommodate time-dependent acceleration factors. Delayed entry is also allowed, and hence so are time-dependent covariates. We evaluate the proposed model through simulation, showing substantial improvements compared to standard parametric AFT models. We also show analytically and through simulations that the AFT models are collapsible, suggesting that this model class will be well suited to causal inference. We illustrate the methods with a dataset of patients with breast cancer. User-friendly Stata and R software packages are provided.
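As a rough illustration of what "accelerating or decelerating" time means, the sketch below uses a simple Weibull baseline rather than the spline-based baseline proposed in the paper. The function names, the parameter values and the sign convention S(t | x) = S0(t * exp(-x * beta)) are assumptions made for this example and are not taken from the authors' software.

```python
import math

def weibull_survival(t, scale, shape):
    """Baseline Weibull survival S0(t) = exp(-(t / scale) ** shape)."""
    return math.exp(-((t / scale) ** shape))

def aft_survival(t, x, beta, scale=1.0, shape=1.5):
    """AFT survival under the assumed convention S(t | x) = S0(t * exp(-x * beta))."""
    return weibull_survival(t * math.exp(-x * beta), scale, shape)

def aft_median(x, beta, scale=1.0, shape=1.5):
    """Median survival time: the baseline median rescaled by exp(x * beta)."""
    baseline_median = scale * math.log(2.0) ** (1.0 / shape)
    return baseline_median * math.exp(x * beta)

# Hypothetical binary covariate with acceleration factor exp(beta) = 2,
# i.e. event times in the x = 1 group are stretched by a factor of two.
beta = math.log(2.0)
ratio = aft_median(1, beta) / aft_median(0, beta)
print(f"median ratio (x=1 vs x=0): {ratio:.3f}")
```

Under this convention every quantile of the survival time, not only the median, is multiplied by the same factor exp(x * beta), which is the sense in which covariates rescale time in an AFT model.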

Preprint, 2020, arXiv

INTEREST: INteractive Tool for Exploring REsults from Simulation sTudies

Simulation studies allow us to explore the properties of statistical methods. They provide a powerful tool with a multiplicity of aims; among others: evaluating and comparing new or existing statistical methods, assessing violations of modelling assumptions, helping with the understanding of statistical concepts, and supporting the design of clinical trials. The increased availability of powerful computational tools and usable software has contributed to the rise of simulation studies in the current literature. However, simulation studies involve increasingly complex designs, making it difficult to provide all relevant results clearly. Dissemination of results plays a focal role in simulation studies: it can drive applied analysts to use methods that have been shown to perform well in their settings, guide researchers to develop new methods in a promising direction, and provide insights into less established methods. It is crucial that we can digest relevant results of simulation studies. Therefore, we developed INTEREST: an INteractive Tool for Exploring REsults from Simulation sTudies. The tool has been developed using the Shiny framework in R and is available as a web app or as a standalone package. It requires uploading a tidy format dataset with the results of a simulation study in R, Stata, SAS, SPSS, or comma-separated format. A variety of performance measures are estimated automatically along with Monte Carlo standard errors; results and performance summaries are displayed both in tabular and graphical fashion, with a wide variety of available plots. Consequently, the reader can focus on simulation parameters and estimands of most interest. In conclusion, INTEREST can facilitate the investigation of results from simulation studies and supplement the reporting of results, allowing researchers to share detailed results from their simulations and readers to explore them freely.
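To make the phrase "performance measures ... along with Monte Carlo standard errors" concrete, here is a minimal sketch that computes bias, empirical standard error and confidence interval coverage, each with a standard Monte Carlo standard error formula. It is not INTEREST's own code: the function name `performance`, the toy data-generating setup and the normal-approximation intervals are all assumptions of this example.

```python
import math
import random
import statistics

def performance(estimates, std_errors, truth, z=1.959964):
    """Bias, empirical SE and coverage, each with a Monte Carlo standard error."""
    n = len(estimates)
    mean_est = statistics.fmean(estimates)
    emp_se = statistics.stdev(estimates)  # sample SD of the point estimates
    covered = sum(
        1 for est, se in zip(estimates, std_errors)
        if est - z * se <= truth <= est + z * se
    )
    coverage = covered / n
    return {
        "bias": mean_est - truth,
        "bias_mcse": emp_se / math.sqrt(n),
        "empse": emp_se,
        "empse_mcse": emp_se / math.sqrt(2 * (n - 1)),
        "coverage": coverage,
        "coverage_mcse": math.sqrt(coverage * (1 - coverage) / n),
    }

# Toy simulation study (hypothetical): 1000 replications, each estimating
# the mean of 50 draws from a Normal(0.5, 1) distribution.
rng = random.Random(2020)
truth = 0.5
estimates, std_errors = [], []
for _ in range(1000):
    sample = [rng.gauss(truth, 1.0) for _ in range(50)]
    estimates.append(statistics.fmean(sample))
    std_errors.append(statistics.stdev(sample) / math.sqrt(50))

summary = performance(estimates, std_errors, truth)
for name, value in summary.items():
    print(f"{name}: {value:.4f}")
```

Reporting each measure next to its Monte Carlo standard error shows how much of an apparent difference between methods could be due to the finite number of replications.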

Preprint, 2020, arXiv

merlin: An R package for Mixed Effects Regression for Linear, Nonlinear and User-defined models

The R package merlin performs flexible joint modelling of hierarchical multi-outcome data. Increasingly, multiple longitudinal biomarker measurements, possibly censored time-to-event outcomes and baseline characteristics are available. However, there is limited software that allows all of this information to be incorporated into one model. In this paper, we present merlin, which allows for the estimation of models with unlimited numbers of continuous, binary, count and time-to-event outcomes, with unlimited levels of nested random effects. A wide variety of link functions, including the expected value, the gradient and shared random effects, are available in order to link the different outcomes in a biologically plausible way. The accompanying predict.merlin function allows for individual and population level predictions to be made from even the most complex models. There is the option to specify user-defined families, making merlin ideal for methodological research. The flexibility of merlin is illustrated using an example in patients followed up after heart valve replacement, beginning with a linear model, and finishing with a joint multiple longitudinal and competing risks survival model.

Preprint, 2019, arXiv

Impact of model misspecification in shared frailty survival models

Survival models incorporating random effects to account for unmeasured heterogeneity are being increasingly used in biostatistical and applied research. Specifically, unmeasured covariates whose lack of inclusion in the model would lead to biased, inefficient results are commonly modelled by including a subject-specific (or cluster-specific) frailty term that follows a given distribution (e.g. Gamma or log-Normal). Despite that, in the context of parametric frailty models, little is known about the impact of misspecifying the baseline hazard, the frailty distribution, or both. Therefore, our aim is to quantify the impact of such misspecification in a wide variety of clinically plausible scenarios via Monte Carlo simulation, using open source software readily available to applied researchers. We generate clustered survival data assuming various baseline hazard functions, including mixture distributions with turning points, and assess the impact of sample size, variance of the frailty, baseline hazard function, and frailty distribution. Models compared include standard parametric distributions and more flexible spline-based approaches; we also include semiparametric Cox models. The resulting bias can be clinically relevant. In conclusion, we highlight the importance of fitting models that are flexible enough and the importance of assessing model fit. We illustrate our conclusions with two applications using data on diabetic retinopathy and bladder cancer. Our results show the importance of assessing model fit with respect to the baseline hazard function and the distribution of the frailty: misspecifying the former leads to biased relative and absolute risk estimates, while misspecifying the latter affects absolute risk estimates and measures of heterogeneity.
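The data-generating step described above can be sketched as follows for the simplest case: a Weibull baseline hazard with a Gamma shared frailty, simulated by inverting the conditional survival function. This is only an illustration under assumed settings. The function name, all parameter values and the administrative censoring time are hypothetical, and the mixture baselines with turning points used in the paper are not reproduced here.

```python
import math
import random

def simulate_shared_frailty(n_clusters, cluster_size, frailty_var,
                            lam=0.5, gamma=1.2, beta=-0.5,
                            max_follow_up=5.0, seed=1):
    """Clustered survival data with hazard u_i * lam * gamma * t**(gamma-1) * exp(beta*x)."""
    rng = random.Random(seed)
    rows = []
    for cluster in range(n_clusters):
        # Gamma frailty with mean 1 and variance frailty_var (shape, scale).
        frailty = rng.gammavariate(1.0 / frailty_var, frailty_var)
        for _ in range(cluster_size):
            x = rng.randint(0, 1)      # binary covariate, e.g. treatment arm
            u = 1.0 - rng.random()     # uniform on (0, 1], avoids log(0)
            rate = frailty * lam * math.exp(beta * x)
            # Invert S(t) = exp(-rate * t**gamma) at the uniform draw.
            t = (-math.log(u) / rate) ** (1.0 / gamma)
            event = int(t <= max_follow_up)  # administrative censoring
            rows.append((cluster, x, min(t, max_follow_up), event, frailty))
    return rows

rows = simulate_shared_frailty(n_clusters=500, cluster_size=5, frailty_var=0.5)
cluster_frailties = {cluster: frailty for cluster, _, _, _, frailty in rows}
mean_frailty = sum(cluster_frailties.values()) / len(cluster_frailties)
print(f"{len(rows)} rows, mean frailty {mean_frailty:.3f}")
```

Swapping the Gamma draw for a log-Normal one, or replacing the Weibull inversion with a more complex baseline, gives the kind of scenarios in which a fitted model can be deliberately misspecified and its bias measured.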

Preprint, 2019, arXiv

Mixed effects models for healthcare longitudinal data with an informative visiting process: a Monte Carlo simulation study

Electronic health records are being increasingly used in medical research to answer more relevant and detailed clinical questions; however, they pose new and significant methodological challenges. For instance, observation times are likely correlated with the underlying disease severity: patients with worse conditions utilise health care more and may have worse biomarker values recorded. Traditional methods for analysing longitudinal data assume independence between observation times and disease severity; yet, with healthcare data, such assumptions are unlikely to hold. Through Monte Carlo simulation, we compare different analytical approaches proposed to account for an informative visiting process to assess whether they lead to unbiased results. Furthermore, we formalise a joint model for the observation process and the longitudinal outcome within an extended joint modelling framework. We illustrate our results using data from a pragmatic trial on enhanced care for individuals with chronic kidney disease, and we introduce user-friendly software that can be used to fit the joint model for the observation process and a longitudinal outcome.

Preprint, 2019, arXiv

Multilevel mixed effects parametric survival analysis: Estimation, simulation and application

In this article, I present the user-written stmixed command for the fitting of multilevel survival models, which serves as both an alternative to Stata's official mestreg, and a complementary program with substantial extensions. stmixed can fit multilevel survival models with any number of levels and random effects at each level, including flexible spline-based approaches (such as Royston-Parmar and the log hazard equivalent) or user-defined hazard models. Simple or complex time-dependent effects can be included, as well as the addition of expected mortality for a relative survival model. Left-truncation/delayed entry can be used and t-distributed random effects are provided as an alternative to Gaussian random effects. The methods are illustrated with a commonly used dataset of patients with kidney disease suffering recurrent infections, and a simulated example, illustrating a simple approach to simulating clustered survival data using survsim (Crowther and Lambert 2012, 2013). stmixed is part of the merlin family (Crowther 2017, 2018).
