Source author record

Jingfei Zhang

Jingfei Zhang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher: unclaimed source record

astro-ph.CO, Methodology, gr-qc, Machine Learning, Applications, hep-th, math.ST, Statistics Theory, hep-ph

Catalog footprint

What is connected

15 works

9 topics

4 close collaborators

preprint 2022 arXiv

Bias-correction and Test for Mark-point Dependence with Replicated Marked Point Processes

Mark-point dependence plays a critical role in research problems that can be fitted into the general framework of marked point processes. In this work, we focus on adjusting for mark-point dependence when estimating the mean and covariance functions of the mark process, given independent replicates of the marked point process. We assume that the mark process is a Gaussian process and the point process is a log-Gaussian Cox process, where the mark-point dependence is generated through the dependence between two latent Gaussian processes. Under this framework, naive local linear estimators ignoring the mark-point dependence can be severely biased. We show that this bias can be corrected using a local linear estimator of the cross-covariance function and establish uniform convergence rates of the bias-corrected estimators. Furthermore, we propose a test statistic based on local linear estimators for mark-point independence, which is shown to be asymptotically normal at the parametric $\sqrt{n}$ rate. Model diagnostic tools are developed for key model assumptions and a robust functional permutation test is proposed for a more general class of mark-point processes. The effectiveness of the proposed methods is demonstrated using extensive simulations and applications to two real data examples.

preprint 2022 arXiv

High Dimensional Gaussian Graphical Regression Models with Covariates

Though Gaussian graphical models have been widely used in many scientific fields, relatively limited progress has been made to link graph structures to external covariates. We propose a Gaussian graphical regression model, which regresses both the mean and the precision matrix of a Gaussian graphical model on covariates. In the context of co-expression quantitative trait locus (QTL) studies, our method can determine how genetic variants and clinical conditions modulate the subject-level network structures, and recover both the population-level and subject-level gene networks. Our framework encourages sparsity of covariate effects on both the mean and the precision matrix. In particular, for the precision matrix, we stipulate simultaneous sparsity, i.e., group sparsity and element-wise sparsity, on effective covariates and their effects on network edges, respectively. We establish variable selection consistency first in the case of known mean parameters and then in the more challenging case of unknown means depending on external covariates, and establish in both cases the $\ell_2$ convergence rates and the selection consistency of the estimated precision parameters. The utility and efficacy of our proposed method are demonstrated through simulation studies and an application to a co-expression QTL study with brain cancer patients.

preprint 2022 arXiv

Multi-task Learning for Gaussian Graphical Regressions with High Dimensional Covariates

Gaussian graphical regression is a powerful tool that regresses the precision matrix of a Gaussian graphical model on covariates, permitting the numbers of the response variables and covariates to far exceed the sample size. Model fitting is typically carried out via separate node-wise lasso regressions, ignoring the network-induced structure among these regressions. Consequently, the error rate is high, especially when the number of nodes is large. We propose a multi-task learning estimator for fitting Gaussian graphical regression models; we design a cross-task group sparsity penalty and a within-task element-wise sparsity penalty, which govern the sparsity of active covariates and their effects on the graph, respectively. For computation, we consider an efficient augmented Lagrangian algorithm, which solves subproblems with a semi-smooth Newton method. For theory, we show that the error rate of the multi-task learning based estimates improves substantially on that of the separate node-wise lasso estimates, because the cross-task penalty borrows information across tasks. To address the main challenge that the tasks are entangled in a complicated correlation structure, we establish a new tail probability bound for correlated heavy-tailed (sub-exponential) variables with an arbitrary correlation structure, a useful theoretical result in its own right. Finally, the utility of our method is demonstrated through simulations as well as an application to a gene co-expression network study with brain cancer patients.

preprint 2022 arXiv

Statistical Inference of Cell-type Proportions Estimated from Bulk Expression Data

There is a growing interest in cell-type-specific analysis from bulk samples with a mixture of different cell types. A critical first step in such analyses is the accurate estimation of cell-type proportions in a bulk sample. Although many methods have been proposed recently, quantifying the uncertainties associated with the estimated cell-type proportions has not been well studied. Lack of consideration of these uncertainties can lead to missed or false findings in downstream analyses. In this article, we introduce a flexible statistical deconvolution framework that allows a general and subject-specific covariance of bulk gene expressions. Under this framework, we propose a decorrelated constrained least squares method called DECALS that estimates cell-type proportions as well as the sampling distribution of the estimates. Simulation studies demonstrate that DECALS can accurately quantify the uncertainties in the estimated proportions whereas other methods fail. Applying DECALS to analyze bulk gene expression data of post mortem brain samples from the ROSMAP and GTEx projects, we show that taking into account the uncertainties in the estimated cell-type proportions can lead to more accurate identifications of cell-type-specific differentially expressed genes and transcripts between different subject groups, such as between Alzheimer's disease patients and controls and between males and females.

preprint 2021 arXiv

Latent Network Structure Learning from High Dimensional Multivariate Point Processes

Learning the latent network structure from large scale multivariate point process data is an important task in a wide range of scientific and business applications. For instance, we might wish to estimate the neuronal functional connectivity network based on spiking times recorded from a collection of neurons. To characterize the complex processes underlying the observed data, we propose a new and flexible class of nonstationary Hawkes processes that allow both excitatory and inhibitory effects. We estimate the latent network structure using an efficient sparse least squares estimation approach. Using a thinning representation, we establish concentration inequalities for the first and second order statistics of the proposed Hawkes process. Such theoretical results enable us to establish the non-asymptotic error bound and the selection consistency of the estimated parameters. Furthermore, we describe a least squares loss based statistic for testing if the background intensity is constant in time. We demonstrate the efficacy of our proposed method through simulation studies and an application to a neuron spike train data set.
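The thinning idea mentioned in this abstract can be illustrated on a much simpler model than the one the paper studies. The sketch below simulates a univariate, stationary, purely excitatory Hawkes process with an exponential kernel using Ogata-style thinning. The function names, the exponential kernel and the parameters `mu`, `alpha`, `beta` are illustrative assumptions, not the authors' nonstationary, multivariate model with inhibition.

```python
import math
import random


def intensity(t, events, mu, alpha, beta):
    """Conditional intensity mu + sum_i alpha * exp(-beta * (t - t_i)) over events t_i <= t."""
    return mu + sum(alpha * math.exp(-beta * (t - ti)) for ti in events if ti <= t)


def simulate_hawkes(mu, alpha, beta, horizon, seed=0):
    """Simulate event times on (0, horizon) by thinning (assumes mu > 0, alpha >= 0, beta > 0)."""
    rng = random.Random(seed)
    events = []
    t = 0.0
    while True:
        # Between events the intensity only decays, so its value right now
        # (including the jump from an event accepted at t) is an upper bound.
        lam_bar = intensity(t, events, mu, alpha, beta)
        # Propose the next candidate from a homogeneous Poisson process of rate lam_bar.
        t += rng.expovariate(lam_bar)
        if t >= horizon:
            break
        # Accept the candidate with probability intensity(t) / lam_bar.
        if rng.random() * lam_bar <= intensity(t, events, mu, alpha, beta):
            events.append(t)
    return events


if __name__ == "__main__":
    times = simulate_hawkes(mu=0.5, alpha=0.8, beta=1.2, horizon=50.0, seed=1)
    print(len(times), "events simulated")
```

For this toy kernel the process is stationary only when `alpha / beta < 1`; the values in the usage line are chosen to satisfy that condition.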

preprint 2021 arXiv

Learning Human Activity Patterns using Clustered Point Processes with Active and Inactive States

Modeling event patterns is a central task in a wide range of disciplines. In applications such as studying human activity patterns, events often arrive clustered with sporadic and long periods of inactivity. Such heterogeneity in event patterns poses challenges for existing point process models. In this article, we propose a new class of clustered point processes that alternate between active and inactive states. The proposed model is flexible, highly interpretable, and can provide useful insights into event patterns. A composite likelihood approach and a composite EM estimation procedure are developed for efficient and numerically stable parameter estimation. We study both the computational and statistical properties of the estimator including convergence, consistency, and asymptotic normality. The proposed method is applied to Donald Trump's Twitter data to investigate if and how his behaviors evolved before, during, and after the presidential campaign. Additionally, we analyze large-scale social media data from Sina Weibo and identify interesting groups of users with distinct behaviors.

preprint 2021 arXiv

Sparse Tensor Additive Regression

Tensors are becoming prevalent in modern applications such as medical imaging and digital marketing. In this paper, we propose a sparse tensor additive regression (STAR) that models a scalar response as a flexible nonparametric function of tensor covariates. The proposed model effectively exploits the sparse and low-rank structures in the tensor additive regression. We formulate the parameter estimation as a non-convex optimization problem, and propose an efficient penalized alternating minimization algorithm. We establish a non-asymptotic error bound for the estimator obtained from each iteration of the proposed algorithm, which reveals an interplay between the optimization error and the statistical rate of convergence. We demonstrate the efficacy of STAR through extensive comparative simulation studies, and an application to the click-through-rate prediction in online advertising.

preprint 2020 arXiv

Detection of the number of principal components by extended AIC-type method

Estimating the number of principal components is one of the fundamental problems in many scientific fields such as signal processing (or the spiked covariance model). In this paper, we first demonstrate that, for fixed $p$, any penalty term of the form $k'(p-k'/2+1/2)C_n$ may lead to an asymptotically consistent estimator under the condition that $C_n\to\infty$ and $C_n/n\to0$. We also extend our results to the case $n,p\to\infty$, with $p/n\to c>0$. In this case, for $k=o(n^{\frac{1}{3}})$, we first investigate the limiting laws for the leading eigenvalues of the sample covariance matrix $S_n$ under the condition that $λ_k>1+\sqrt{c}$. At low SNR, since the AIC tends to underestimate the number of signals $k$, the AIC should be re-defined in this case. As a natural extension of the AIC for fixed $p$, we propose the extended AIC (EAIC), i.e., the AIC-type method with tuning parameter $γ=φ(c)=1/2+\sqrt{1/c}-\log(1+\sqrt{c})/c$, and demonstrate that the EAIC-type method, i.e., the AIC-type method with tuning parameter $γ>φ(c)$, can select the number of signals $k$ consistently. In the following two cases, (1) $p$ fixed, $n\to\infty$, (2) $n,p\to\infty$ with $p/n\to 0$, if the AIC is defined as the degeneration of the EAIC in the case $n,p\to\infty$ with $p/n\to c>0$, i.e., $γ=\lim_{c\rightarrow 0+0}φ(c)=1$, then we have essentially demonstrated that, to achieve the consistency of the AIC-type method in the above two cases, $γ>1$ is required. Moreover, we show that the EAIC-type method is essentially tuning-free and outperforms the well-known KN estimator proposed in Kritchman and Nadler (2008) and the BCF estimator proposed in Bai, Choi and Fujikoshi (2018). Numerical studies indicate that the proposed method works well.
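The threshold $φ(c)$ quoted in this abstract is easy to evaluate numerically. The sketch below codes the formula exactly as written above and checks that it approaches 1 as $c\to 0$, which is the limit the abstract states. The function name `phi` is an assumption made for illustration.

```python
import math


def phi(c):
    """Evaluate phi(c) = 1/2 + sqrt(1/c) - log(1 + sqrt(c)) / c for c > 0."""
    if c <= 0:
        raise ValueError("c must be positive")
    root = math.sqrt(c)
    # log1p(root) computes log(1 + root) accurately when root is small.
    return 0.5 + 1.0 / root - math.log1p(root) / c


if __name__ == "__main__":
    for c in (1e-6, 0.01, 0.5, 1.0, 2.0):
        print(f"c = {c:g}: phi(c) = {phi(c):.6f}")
```

Expanding $\log(1+\sqrt{c})$ for small $c$ gives $φ(c) = 1 - \sqrt{c}/3 + O(c)$, so the printed values should drift toward 1 as $c$ shrinks.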

preprint 2010 arXiv

A more general interacting model of holographic dark energy

So far, there have been no theories or observational data that deny the presence of interaction between dark energy and dark matter. We naturally extend the holographic dark energy (HDE) model proposed by Granda and Oliveros, in which the dark energy density includes not only the square of the Hubble scale but also its time derivative, to the case with interaction, and obtain analytic forms for the cosmic parameters under specific boundary conditions. The behavior of the cosmic expansion depends on the introduced numerical parameters, which are also constrained. The more general interacting model inherits the features of the previous HDE models, keeping the theory consistent.

preprint 2010 arXiv

Interacting model of new agegraphic dark energy: Cosmological evolution and statefinder diagnostic

The statefinder diagnostic is a useful method for distinguishing different dark energy models. In this paper, we investigate the new agegraphic dark energy model with interaction between dark energy and the matter component using the statefinder parameter pair $\{r, s\}$ and study its cosmological evolution. We plot the trajectories of the new agegraphic dark energy model for different interaction cases in the statefinder plane. As a result, the influence of the interaction on the evolution of the universe is shown in the statefinder diagrams.
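For readers unfamiliar with the pair $\{r, s\}$, the standard definitions from the statefinder literature are recalled here in terms of the scale factor $a(t)$, the Hubble parameter $H = \dot a / a$ and the deceleration parameter $q$. These are the usual textbook forms for a spatially flat universe and are not quoted from the paper itself.

```latex
q \equiv -\frac{\ddot{a}}{a H^{2}}, \qquad
r \equiv \frac{\dddot{a}}{a H^{3}}, \qquad
s \equiv \frac{r - 1}{3\left(q - \tfrac{1}{2}\right)} .
```

With these definitions, spatially flat $\Lambda$CDM sits at the fixed point $\{r, s\} = \{1, 0\}$, which is why trajectories in the statefinder plane are usually read relative to that point.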

preprint 2010 arXiv

Sandage-Loeb test for the new agegraphic and Ricci dark energy models

The Sandage-Loeb (SL) test is a unique method to explore dark energy at the "redshift desert" ($2\lesssim z\lesssim 5$), an era not covered by any other dark energy probes, by directly measuring the temporal variation of the redshift of quasar (QSO) Lyman-$α$ absorption lines. In this paper, we study the prospects for constraining the new agegraphic dark energy (NADE) model and the Ricci dark energy (RDE) model with the SL test. We show that, assuming only a ten-year survey, the SL test can constrain these two models with high significance.
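The quantity measured by the SL test is the redshift drift, which for any expansion history $H(z)$ obeys $\dot z = (1+z)H_0 - H(z)$ and is usually quoted as a velocity shift $\Delta v = c\,\Delta z/(1+z)$. The sketch below evaluates this for a flat $\Lambda$CDM background, used here only as a stand-in; the NADE and RDE expansion histories studied in the paper are not reproduced. The function names and the default values `h0=70.0` and `omega_m=0.3` are illustrative assumptions.

```python
import math

C_KM_S = 299792.458                  # speed of light in km/s
KM_PER_MPC = 3.0856775814913673e19   # kilometres per megaparsec
SEC_PER_YEAR = 3.15576e7             # seconds in a Julian year


def hubble_flat_lcdm(z, h0=70.0, omega_m=0.3):
    """H(z) in km/s/Mpc for flat LCDM (a stand-in model, not NADE or RDE)."""
    return h0 * math.sqrt(omega_m * (1.0 + z) ** 3 + 1.0 - omega_m)


def velocity_drift_cm_s(z, years=10.0, h0=70.0, omega_m=0.3):
    """Velocity shift c * dz / (1 + z), in cm/s, accumulated over `years` of observation."""
    h0_per_s = h0 / KM_PER_MPC
    hz_per_s = hubble_flat_lcdm(z, h0, omega_m) / KM_PER_MPC
    # Redshift drift: dz/dt0 = (1 + z) * H0 - H(z), integrated over the baseline.
    dz = ((1.0 + z) * h0_per_s - hz_per_s) * years * SEC_PER_YEAR
    return C_KM_S * dz / (1.0 + z) * 1.0e5  # km/s -> cm/s


if __name__ == "__main__":
    for z in (0.5, 2.0, 3.0, 5.0):
        print(f"z = {z}: drift over 10 yr = {velocity_drift_cm_s(z):+.2f} cm/s")
```

In this stand-in model the drift is positive at low redshift and negative across the redshift desert, so the sign and size of the signal at $2\lesssim z\lesssim 5$ depend on the assumed expansion history.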

preprint 2010 arXiv

Theoretical Limits on Agegraphic Quintessence from Weak Gravity Conjecture

In this paper, we investigate the possible theoretical constraint on the parameter $n$ of the agegraphic quintessence model by considering the requirement of the weak gravity conjecture that the variation of the quintessence scalar field $ϕ$ should be less than the Planck mass $M_{\rm{p}}$. We obtain the theoretical upper bound $n\lesssim 2.5$ that is inconsistent with the current observational constraint result $2.637<n<2.983$ (95.4% CL). The possible implications of the tension between observational and theoretical constraint results are discussed.

preprint 2009 arXiv

New agegraphic dark energy as a rolling tachyon

Combining general relativity and the uncertainty relation in quantum mechanics, the energy density of quantum fluctuations of space-time can be viewed as dark energy. The so-called agegraphic dark energy model is based on this viewpoint, in which the age of the universe is introduced as the length measure. Recently, the new agegraphic dark energy model was proposed, where the dynamical dark energy is measured by the conformal age of the universe. On the other hand, scalar-field dark energy models such as the tachyon are often regarded as an effective description of some underlying theory of dark energy. In this paper, we show that the new agegraphic dark energy can be described completely by a tachyon scalar field. We thus reconstruct the potential and the dynamics of the tachyon scalar field, according to the evolution of the new agegraphic dark energy.

preprint 2009 arXiv

Reconstructing quintom from WMAP 5-year observations: Generalized ghost condensate

In the 5-year WMAP data analysis, a new parametrization form for the dark energy equation of state was used, and it has been shown that the equation of state, $w(z)$, crosses the cosmological-constant boundary $w=-1$. Based on this observation, in this paper, we investigate the reconstruction of the quintom dark energy model. As a single-real-scalar-field model of dark energy, the generalized ghost condensate model provides us with a successful mechanism for realizing the quintom-like behavior. Therefore, we reconstruct this scalar-field quintom dark energy model from the WMAP 5-year observational results. As a comparison, we also discuss the quintom reconstruction based on other specific dark energy ansätze, such as the CPL parametrization and the holographic dark energy scenarios.
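As a reminder of what crossing the boundary $w=-1$ means in practice, the CPL (Chevallier-Polarski-Linder) parametrization named in this abstract is written out here in its standard form. The symbols $w_0$, $w_a$ and $z_c$ are the conventional ones and are not taken from the paper, and this is not the WMAP 5-year parametrization the abstract refers to first.

```latex
w(z) = w_0 + w_a \,\frac{z}{1+z},
\qquad
w(z_c) = -1 \;\Longrightarrow\; z_c = -\,\frac{1 + w_0}{1 + w_0 + w_a} .
```

A quintom-like crossing occurs at a physical redshift only when $z_c > 0$, which requires $1 + w_0$ and $1 + w_0 + w_a$ to have opposite signs.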

preprint 2007 arXiv

Statefinder diagnosis in a non-flat universe and the holographic model of dark energy

In this paper, we study the holographic dark energy model in a non-flat universe from the statefinder viewpoint. We plot the evolutionary trajectories of the holographic dark energy model for different values of the parameter $c$ as well as for different contributions of spatial curvature, in the statefinder parameter planes. The statefinder diagrams characterize the properties of the holographic dark energy and show the discrimination between this scenario and other dark energy models. As we show, the contributions of the spatial curvature in the model can be diagnosed explicitly by the statefinder diagrams. Furthermore, we also investigate the holographic dark energy model in the $w-w'$ plane, which provides a useful dynamical diagnosis complementary to the statefinder geometrical diagnosis.
