Researcher profile

Florence Jaffrézic

Florence Jaffrézic contributes to research discovery and scholarly infrastructure.

ResearcherAffiliation not importedOpen to collaborate

Trust snapshot

Quick read

Trust 17 - UnverifiedVerification L1Unclaimed author
4works
0followers
4topics
4close collaborators

Actions

Decide how to stay connected

Follow researcher0

Identity and collaboration

How to connect with this researcher

Claiming links this public author record to a researcher profile and unlocks direct collaboration workflows.

Log in to claim

Direct collaboration

Open a focused conversation when the fit is right

Claim this author entity first to unlock direct invitations.

Research graph

See the researcher in context

Open full explorer

Inspect adjacent work, topics, institutions and collaborators without jumping out to a separate graph page.

Building this graph slice

BZPEER is loading the nearby papers, people, topics and institutions for this page.

Published work

4 published item(s)

preprint2013arXiv

Differential meta-analysis of RNA-seq data from multiple studies

High-throughput sequencing is now regularly used for studies of the transcriptome (RNA-seq), particularly for comparisons among experimental conditions. For the time being, a limited number of biological replicates are typically considered in such experiments, leading to low detection power for differential expression. As their cost continues to decrease, it is likely that additional follow-up studies will be conducted to re-address the same biological question. We demonstrate how p-value combination techniques previously used for microarray meta-analyses can be used for the differential analysis of RNA-seq data from multiple related studies. These techniques are compared to a negative binomial generalized linear model (GLM) including a fixed study effect on simulated data and real data on human melanoma cell lines. The GLM with fixed study effect performed well for low inter-study variation and small numbers of studies, but was outperformed by the meta-analysis methods for moderate to large inter-study variability and larger numbers of studies. To conclude, the p-value combination techniques illustrated here are a valuable tool to perform differential meta-analyses of RNA-seq data by appropriately accounting for biological and technical variability within studies as well as additional study-specific effects. An R package metaRNASeq is available on the R Forge.

preprint2013arXiv

Joint estimation of causal effects from observational and intervention gene expression data

Background: Inference of gene regulatory networks from transcriptomic data has been a wide research area in recent years. Proposed methods are mainly based on the use of graphical Gaussian models for observational wild-type data and provide undirected graphs that are not able to accurately highlight the causal relationships among genes. In the present work, we seek to improve estimation of causal effects among genes by jointly modeling observational transcriptomic data with intervention data obtained by performing knock-outs or knock-downs on a subset of genes. By examining the impact of such expression perturbations on other genes, a more accurate reflection of regulatory relationships may be obtained than through the use of wild-type data alone. Results: Using the framework of Gaussian Bayesian networks, we propose a Markov chain Monte Carlo algorithm with a Mallows model and an analytical likelihood maximization to sample from the posterior distribution of causal node orderings, and in turn, to estimate causal effects. The main advantage of the proposed algorithm over previously proposed methods is that it has the flexibility to accommodate any kind of intervention design, including partial or multiple knock-out experiments. Methods were compared on simulated data as well as data from the DREAM 2007 challenge. Conclusions: The simulation study confirmed the impossibility of estimating causal orderings of genes with observation data only. The proposed algorithm was found, in most cases, to perform better than the previously proposed methods in terms of accuracy for the estimation of causal effects. In addition, multiple knock-outs proved to bring valuable additional information compared to single knock-outs. The choice of optimal intervention design therefore appears to be a crucial aspect for causal inference and an interesting challenge for future research.

preprint2013arXiv

Joint likelihood calculation for intervention and observational data from a Gaussian Bayesian network

Methodological development for the inference of gene regulatory networks from transcriptomic data is an active and important research area. Several approaches have been proposed to infer relationships among genes from observational steady-state expression data alone, mainly based on the use of graphical Gaussian models. However, these methods rely on the estimation of partial correlations and are only able to provide undirected graphs that cannot highlight causal relationships among genes. A major upcoming challenge is to jointly analyze observational transcriptomic data and intervention data obtained by performing knock-out or knock-down experiments in order to uncover causal gene regulatory relationships. To this end, in this technical note we present an explicit formula for the likelihood function for any complex intervention design in the context of Gaussian Bayesian networks, as well as its analytical maximization. This allows a direct calculation of the causal effects for known graph structure. We also show how to obtain the Fisher information in this context, which will be extremely useful for the choice of optimal intervention designs in the future.

preprint2011arXiv

Reverse engineering gene regulatory networks using approximate Bayesian computation

Gene regulatory networks are collections of genes that interact with one other and with other substances in the cell. By measuring gene expression over time using high-throughput technologies, it may be possible to reverse engineer, or infer, the structure of the gene network involved in a particular cellular process. These gene expression data typically have a high dimensionality and a limited number of biological replicates and time points. Due to these issues and the complexity of biological systems, the problem of reverse engineering networks from gene expression data demands a specialized suite of statistical tools and methodologies. We propose a non-standard adaptation of a simulation-based approach known as Approximate Bayesian Computing based on Markov chain Monte Carlo sampling. This approach is particularly well suited for the inference of gene regulatory networks from longitudinal data. The performance of this approach is investigated via simulations and using longitudinal expression data from a genetic repair system in Escherichia coli.