Source author record

Carel F. W. Peeters

Carel F. W. Peeters appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Applications Methodology Machine Learning Quantitative Methods Computation Molecular Networks Populations and Evolution

Catalog footprint

What is connected

6works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

A latent factor approach to hyperspectral time series data for multivariate genomic prediction of grain yield in wheat

High-dimensional time series phenotypic data is becoming increasingly common within plant breeding programmes. However, analysing and integrating such data for genetic analysis and genomic prediction remains difficult. Here we show how factor analysis with Procrustes rotation on the genetic correlation matrix of hyperspectral secondary phenotype data can help in extracting relevant features for within-trial prediction. We use a subset of Centro Internacional de Mejoramiento de Maíz y Trigo (CIMMYT) elite yield wheat trial of 2014-2015, consisting of 1,033 genotypes. These were measured across three irrigation treatments at several timepoints during the season, using manned airplane flights with hyperspectral sensors capturing 62 bands in the spectrum of 385-850 nm. We perform multivariate genomic prediction using latent variables to improve within-trial genomic predictive ability (PA) of wheat grain yield within three distinct watering treatments. By integrating latent variables of the hyperspectral data in a multivariate genomic prediction model, we are able to achieve an absolute gain of .1 to .3 (on the correlation scale) in PA compared to univariate genomic prediction. Furthermore, we show which timepoints within a trial are important and how these relate to plant growth stages. This paper showcases how domain knowledge and data-driven approaches can be combined to increase PA and gain new insights from sensor data of high-throughput phenotyping platforms.

preprint2020arXiv

Targeted Fused Ridge Estimation of Inverse Covariance Matrices from Multiple High-Dimensional Data Classes

We consider the problem of jointly estimating multiple inverse covariance matrices from high-dimensional data consisting of distinct classes. An $\ell_2$-penalized maximum likelihood approach is employed. The suggested approach is flexible and generic, incorporating several other $\ell_2$-penalized estimators as special cases. In addition, the approach allows specification of target matrices through which prior knowledge may be incorporated and which can stabilize the estimation procedure in high-dimensional settings. The result is a targeted fused ridge estimator that is of use when the precision matrices of the constituent classes are believed to chiefly share the same structure while potentially differing in a number of locations of interest. It has many applications in (multi)factorial study designs. We focus on the graphical interpretation of precision matrices with the proposed estimator then serving as a basis for integrative or meta-analytic Gaussian graphical modeling. Situations are considered in which the classes are defined by data sets and subtypes of diseases. The performance of the proposed estimator in the graphical modeling setting is assessed through extensive simulation experiments. Its practical usability is illustrated by the differential network modeling of 12 large-scale gene expression data sets of diffuse large B-cell lymphoma subtypes. The estimator and its related procedures are incorporated into the R-package rags2ridges.

preprint2016arXiv

Bayesian Constrained-Model Selection for Factor Analytic Modeling

My dissertation revolves around Bayesian approaches towards constrained statistical inference in the factor analysis (FA) model. Two interconnected types of restricted-model selection are considered. These types have a natural connection to selection problems in the exploratory FA (EFA) and confirmatory FA (CFA) model and are termed Type I and Type II model selection. Type I constrained-model selection is taken to mean the determination of the appropriate dimensionality of a model. This type of constrained-model selection connects with EFA in the sense of selecting the optimal dimensionality of the latent vector. Type II model selection is taken to mean the determination of appropriate inequality, order or shape restrictions on the parameter space. The dissertation connects Type II constrained-model selection to CFA by focusing on the determination of linear inequality constraints as expressions of the direction and (relative) strength of factor loadings. The figures accompanying this article are taken from the slides of my Division 5 Awards Symposium Invited address at the APA 2015 Annual Convention in Toronto. These slides can be retrieved from \url{https://github.com/CFWP/ConventionTalk}.

preprint2016arXiv

Pathophysiological Domains Underlying the Metabolic Syndrome: An Alternative Factor Analytic Strategy

Purpose: Factor analysis (FA) has become part and parcel in metabolic syndrome (MBS) research. Both exploration- and confirmation-driven factor analyzes are rampant. However, factor analytic results on MBS differ widely. A situation that is at least in part attributable to misapplication of FA. Here, our purpose is (i) to review factor analytic efforts in the study of MBS with emphasis on misusage of the FA model and (ii) to propose an alternative factor analytic strategy. Methods: The proposed factor analytic strategy consists of four steps and confronts weaknesses in application of the FA model. At its heart lies the explicit separation of dimensionality and pattern selection as well as the direct evaluation of competing inequality-constrained loading patterns. A high-profile MBS data set with anthropometric measurements on overweight children and adolescents is reanalyzed using this strategy. Results: The reanalysis implied a more parsimonious constellation of pathophysiological domains underlying phenotypic expressions of MBS than the original analysis (and many other analyzes). The results emphasize correlated factors of impaired glucose metabolism and impaired lipid metabolism. Conclusions: Pathophysiological domains underlying phenotypic expressions of MBS included in the analysis are driven by multiple interrelated metabolic impairments. These findings indirectly point to the possible existence of a multifactorial aetiology.

preprint2016arXiv

The Spectral Condition Number Plot for Regularization Parameter Determination

Many modern statistical applications ask for the estimation of a covariance (or precision) matrix in settings where the number of variables is larger than the number of observations. There exists a broad class of ridge-type estimators that employs regularization to cope with the subsequent singularity of the sample covariance matrix. These estimators depend on a penalty parameter and choosing its value can be hard, in terms of being computationally unfeasible or tenable only for a restricted set of ridge-type estimators. Here we introduce a simple graphical tool, the spectral condition number plot, for informed heuristic penalty parameter selection. The proposed tool is computationally friendly and can be employed for the full class of ridge-type covariance (precision) estimators.

preprint2015arXiv

Ridge Estimation of Inverse Covariance Matrices from High-Dimensional Data

We study ridge estimation of the precision matrix in the high-dimensional setting where the number of variables is large relative to the sample size. We first review two archetypal ridge estimators and note that their utilized penalties do not coincide with common ridge penalties. Subsequently, starting from a common ridge penalty, analytic expressions are derived for two alternative ridge estimators of the precision matrix. The alternative estimators are compared to the archetypes with regard to eigenvalue shrinkage and risk. The alternatives are also compared to the graphical lasso within the context of graphical modeling. The comparisons may give reason to prefer the proposed alternative estimators.

Carel F. W. Peeters

What is connected

Connect this record

See the researcher in context

Building this map preview

6 published item(s)

A latent factor approach to hyperspectral time series data for multivariate genomic prediction of grain yield in wheat

Targeted Fused Ridge Estimation of Inverse Covariance Matrices from Multiple High-Dimensional Data Classes

Bayesian Constrained-Model Selection for Factor Analytic Modeling

Pathophysiological Domains Underlying the Metabolic Syndrome: An Alternative Factor Analytic Strategy

The Spectral Condition Number Plot for Regularization Parameter Determination

Ridge Estimation of Inverse Covariance Matrices from High-Dimensional Data