Source author record

Hao Yan

Hao Yan appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

Researcher · Unclaimed source record

Applications; Machine Learning; physics.med-ph; Methodology; Computer Vision; eess.SP; math.AP; astro-ph.HE; Computation and Language; cond-mat.mtrl-sci; cond-mat.str-el; Databases; Distributed, Parallel, and Cluster Computing; Information Retrieval; Molecular Networks; physics.ins-det; physics.optics

Catalog footprint

What is connected

30 works

17 topics

4 close collaborators

preprint · 2026 · arXiv

Task Vector Geometry Underlies Dual Modes of Task Inference in Transformers

Transformers are effective at inferring the latent task from context via two inference modes: recognizing a task seen during training, and adapting to a novel one. Recent interpretability studies have identified from middle-layer representations task-specific directions, or task vectors, that steer model behavior. However, a lack of rigorous foundations hinders connecting internal representations to external model behavior: existing work fails to explain how task-vector geometry is shaped by the training distribution, and what geometry enables out-of-distribution (OOD) generalization. In this paper, we study these questions in a controlled synthetic setting by training small transformers from scratch on latent-task sequence distributions, which allows a principled mathematical characterization. We show that two inference modes can coexist within a single model. In-distribution behavior is governed by Bayesian task retrieval, implemented internally through convex combinations of learned task vectors. OOD behavior, by contrast, arises through extrapolative task learning, whose representations occupy a subspace nearly orthogonal to the task-vector subspace. Taken together, our results suggest that task-vector geometry, training distributions, and generalization behaviors are closely related.

preprint · 2023 · arXiv

Quasi-monolithic Compact Interferometric Sensor Head Design with Laser Auto-alignment

Interferometers play a crucial role in high-precision displacement measurement such as gravitational-wave detection. Conventional interferometer designs require accurate laser alignment, including the laser pointing and the waist position, to maintain high interference contrast during motion. Although the corner reflector returns the reflected beam in parallel, there is still a problem of lateral beam shift which reduces the interference contrast. This paper presents a new compact interferometric sensor head design for measuring translations with auto-alignment. It works without laser beam alignment adjustment and maintains high interferometric contrast during arbitrary motion (tilts as well as lateral translation). Automatic alignment of the measuring beam with the reference beam is possible by means of a secondary reflection design with a corner reflector. A 20 × 10 × 10 mm^3 all-glass quasi-monolithic sensor head is built based on UV adhesive bonding and tested by a piezoelectric (PZT) positioning stage. Our sensor head achieved a displacement sensitivity of 1 pm/Hz^(1/2) at 1 Hz with a tilt dynamic range over ±200 mrad. This optical design can be widely used for high-precision displacement measurement over a large tilt dynamic range, such as torsion balances and seismometers.

preprint · 2022 · arXiv

Adaptive Partially-Observed Sequential Change Detection and Isolation

High-dimensional data has become popular due to the easy accessibility of sensors in modern industrial applications. However, one specific challenge is that it is often not easy to obtain complete measurements due to limited sensing power and resource constraints. Furthermore, distinct failure patterns may exist in the systems, and it is necessary to identify the true failure pattern. This work focuses on the online adaptive monitoring of high-dimensional data in resource-constrained environments with multiple potential failure modes. To achieve this, we propose to apply the Shiryaev-Roberts procedure on the failure mode level and utilize the multi-armed bandit to balance the exploration and exploitation. We further discuss the theoretical properties of the proposed algorithm to show that the proposed method can correctly isolate the failure mode. Finally, extensive simulations and two case studies demonstrate that the change point detection performance and the failure mode isolation accuracy can be greatly improved.
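
As an illustration of the Shiryaev-Roberts procedure named in this abstract, the following minimal sketch runs the textbook recursion R_n = (1 + R_{n-1}) * LR_n on a single fully observed stream. It assumes a Gaussian mean shift from N(0, 1) to N(shift, 1); the function name, shift size, threshold and data are hypothetical, and the sketch does not reproduce the paper's partially observed, bandit-driven algorithm.

```python
import math


def shiryaev_roberts(observations, shift=1.0, threshold=50.0):
    """Run the Shiryaev-Roberts recursion for an assumed N(0,1) -> N(shift,1) change.

    Returns (alarm_index, statistics); alarm_index is None if the
    threshold is never reached.
    """
    r = 0.0
    stats = []
    alarm = None
    for n, x in enumerate(observations):
        # Likelihood ratio of the post-change density N(shift, 1)
        # against the pre-change density N(0, 1), evaluated at x.
        lr = math.exp(shift * x - 0.5 * shift ** 2)
        r = (1.0 + r) * lr
        stats.append(r)
        if alarm is None and r >= threshold:
            alarm = n
    return alarm, stats


# Hypothetical stream: ten in-control readings, then a shifted mean.
data = [0.0] * 10 + [2.0] * 10
alarm, stats = shiryaev_roberts(data)
print("first alarm index:", alarm)
```

The threshold trades detection delay against false alarms; a larger value delays the alarm but makes in-control alarms rarer.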

preprint · 2022 · arXiv

Adaptive Resources Allocation CUSUM for Binomial Count Data Monitoring with Application to COVID-19 Hotspot Detection

In this paper, we present an efficient statistical method (denoted as "Adaptive Resources Allocation CUSUM") to robustly and efficiently detect the hotspot with limited sampling resources. Our main idea is to combine the multi-armed bandit (MAB) and change-point detection methods to balance the exploration and exploitation of resource allocation for hotspot detection. Further, a Bayesian weighted update is used to update the posterior distribution of the infection rate. Then, the upper confidence bound (UCB) is used for resource allocation and planning. Finally, CUSUM monitoring statistics are used to detect the change point as well as the change location. For performance evaluation, we compare the performance of the proposed method with several benchmark methods in the literature and show that the proposed algorithm is able to achieve a lower detection delay and higher detection precision. Finally, this method is applied to hotspot detection in a real case study of county-level daily positive COVID-19 cases in Washington State (WA) and demonstrates its effectiveness with very limited distributed samples.
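
The CUSUM step mentioned in this abstract can be illustrated with a minimal sketch for binomial counts at a single location. It uses the standard log-likelihood-ratio CUSUM for a shift in the success probability from p0 to p1; the function name, the values of p0, p1 and the threshold, and the counts are all hypothetical, and the bandit-based sampling and Bayesian update described in the abstract are not modeled here.

```python
import math


def binomial_cusum(counts, trials, p0=0.05, p1=0.15, threshold=4.0):
    """One-sided CUSUM for an assumed rate increase p0 -> p1 in binomial data.

    counts[t] is the number of positives out of trials[t] samples.
    Returns (alarm_index, path); alarm_index is None if no alarm fires.
    """
    w = 0.0
    path = []
    alarm = None
    for t, (k, m) in enumerate(zip(counts, trials)):
        # Binomial log-likelihood ratio of p1 against p0
        # (the binomial coefficient cancels).
        llr = k * math.log(p1 / p0) + (m - k) * math.log((1 - p1) / (1 - p0))
        # Reset to zero whenever the evidence for a change turns negative.
        w = max(0.0, w + llr)
        path.append(w)
        if alarm is None and w > threshold:
            alarm = t
    return alarm, path


# Hypothetical daily data: 20 tests per day, positives rise after day 5.
trials = [20] * 12
counts = [1, 1, 0, 2, 1, 1, 4, 5, 3, 4, 5, 4]
alarm, path = binomial_cusum(counts, trials)
print("first alarm day:", alarm)
```

Running one such statistic per location gives both the change time (when a statistic crosses the threshold) and the change location (which statistic crossed it).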

preprint · 2022 · arXiv

ANTLER: Bayesian Nonlinear Tensor Learning and Modeler for Unstructured, Varying-Size Point Cloud Data

Unstructured point clouds with varying sizes are increasingly acquired in a variety of environments through laser triangulation or Light Detection and Ranging (LiDAR). Predicting a scalar response based on unstructured point clouds is a common problem that arises in a wide variety of applications. The current literature relies on several pre-processing steps such as structured subsampling and feature extraction to analyze the point cloud data. Those techniques lead to quantization artifacts and do not consider the relationship between the regression response and the point cloud during pre-processing. Therefore, we propose a general and holistic "Bayesian Nonlinear Tensor Learning and Modeler" (ANTLER) to model the relationship of unstructured, varying-size point cloud data with a scalar or multivariate response. The proposed ANTLER simultaneously optimizes a nonlinear tensor dimensionality reduction and a nonlinear regression model with a 3D point cloud input and a scalar or multivariate response. ANTLER has the ability to consider the complex data representation, high-dimensionality, and inconsistent size of the 3D point cloud data.

preprint · 2022 · arXiv

Filters for ISI Suppression in Molecular Communication via Diffusion

Molecular communication via diffusion (MCvD) is considered one of the most feasible communication paradigms for nanonetworks, especially for bio-nanonetworks, which are usually in water-rich biological environments. Two effects that deteriorate the signal in MCvD are noise and inter-symbol interference (ISI). The expected channel impulse response of MCvD has a long and slowly attenuating tail due to molecular diffusion, which causes ISI and further limits the already slow data rate of MCvD. The extent to which ISI and noise are suppressed in an MCvD system determines its effectiveness, especially at a high data rate. Although ISI-suppression approaches have been investigated, most of them are addressed as non-essential parts of other topics, such as signal detection or modulation. Furthermore, most of the state-of-the-art ISI-suppression approaches are performed by subtracting the estimated ISI from the total signal. In this work, we investigate ISI suppression from a new perspective: using filters to remove ISI without any ISI estimation. The principles for a good design of ISI-suppression filters in MCvD are investigated. Based on these principles, an ISI-suppression filter with good anti-noise capability and an associated signal detection scheme are proposed for MCvD scenarios with both ISI and noise. We compare the proposed scheme with the state-of-the-art ISI-suppression approaches. The results show that the proposed ISI-suppression scheme can recover signals severely deteriorated by both ISI and noise, which could not be effectively detected by the state-of-the-art ISI-suppression approaches.

preprint · 2022 · arXiv

Toward a Better Monitoring Statistic for Profile Monitoring via Variational Autoencoders

Wide accessibility of imaging and profile sensors in modern industrial systems created an abundance of high-dimensional sensing variables. This led to a growing interest in the research of high-dimensional process monitoring. However, most of the approaches in the literature assume the in-control population to lie on a linear manifold with a given basis (i.e., spline, wavelet, kernel, etc.) or an unknown basis (i.e., principal component analysis and its variants), which cannot be used to efficiently model profiles with a nonlinear manifold, which is common in many real-life cases. We propose deep probabilistic autoencoders as a viable unsupervised learning approach to model such manifolds. To do so, we formulate nonlinear and probabilistic extensions of the monitoring statistics from classical approaches as the expected reconstruction error (ERE) and the KL-divergence (KLD) based monitoring statistics. Through an extensive simulation study, we provide insights on why latent-space based statistics are unreliable and why residual-space based ones typically perform much better for deep learning based approaches. Finally, we demonstrate the superiority of deep probabilistic models via both a simulation study and a real-life case study involving images of defects from a hot steel rolling process.

preprint · 2021 · arXiv

Adaptive Change Point Monitoring for High-Dimensional Data

In this paper, we propose a class of monitoring statistics for a mean shift in a sequence of high-dimensional observations. Inspired by the recent U-statistic based retrospective tests developed by Wang et al. (2019) and Zhang et al. (2020), we advance the U-statistic based approach to the sequential monitoring problem by developing a new adaptive monitoring procedure that can detect both dense and sparse changes in real time. Unlike Wang et al. (2019) and Zhang et al. (2020), where self-normalization was used in their tests, we instead introduce a class of estimators for the $q$-norm of the covariance matrix and prove their ratio consistency. To facilitate fast computation, we further develop recursive algorithms to improve the computational efficiency of the monitoring procedure. The advantage of the proposed methodology is demonstrated via simulation studies and real data illustrations.

preprint · 2020 · arXiv

Automatic Storage Structure Selection for hybrid Workload

In the use of database systems, the design of the storage engine and data model directly affects the performance of the database when performing queries. Therefore, the users of the database need to select the storage engine and design data model according to the workload encountered. However, in a hybrid workload, the query set of the database is dynamically changing, and the design of its optimal storage structure is also changing. Motivated by this, we propose an automatic storage structure selection system based on learning cost, which is used to dynamically select the optimal storage structure of the database under hybrid workloads. In the system, we introduce a machine learning method to build a cost model for the storage engine, and a column-oriented data layout generation algorithm. Experimental results show that the proposed system can choose the optimal combination of storage engine and data model according to the current workload, which greatly improves the performance of the default storage structure. And the system is designed to be compatible with different storage engines for easy use in practical applications.

preprint · 2020 · arXiv

Long-Short Term Spatiotemporal Tensor Prediction for Passenger Flow Profile

Spatiotemporal data is very common in many applications, such as manufacturing systems and transportation systems. It is typically difficult to predict accurately given its intrinsic complex spatial and temporal correlations. Most of the existing methods, based on various statistical models and regularization terms, fail to preserve innate features in data alongside their complex correlations. In this paper, we focus on tensor-based prediction and propose several practical techniques to improve prediction. For long-term prediction specifically, we propose the "Tensor Decomposition + 2-Dimensional Auto-Regressive Moving Average (2D-ARMA)" model, and an effective way to update the prediction in real time. For short-term prediction, we propose to conduct tensor completion based on tensor clustering to avoid oversimplifying and ensure accuracy. A case study based on metro passenger flow data is conducted to demonstrate the improved performance.

preprint · 2020 · arXiv

Partially Observable Online Change Detection via Smooth-Sparse Decomposition

We consider online change detection of high dimensional data streams with sparse changes, where only a subset of data streams can be observed at each sensing time point due to limited sensing capacities. On the one hand, the detection scheme should be able to deal with partially observable data and meanwhile have efficient detection power for sparse changes. On the other, the scheme should be able to adaptively and actively select the most important variables to observe to maximize the detection power. To address these two points, in this paper, we propose a novel detection scheme called CDSSD. In particular, it describes the structure of high dimensional data with sparse changes by smooth-sparse decomposition, whose parameters can be learned via spike-slab variational Bayesian inference. Then the posterior Bayes factor, which incorporates the learned parameters and sparse change information, is formulated as a detection statistic. Finally, by formulating the statistic as the reward of a combinatorial multi-armed bandit problem, an adaptive sampling strategy based on Thompson sampling is proposed. The efficacy and applicability of our method in practice are demonstrated with numerical studies and a real case study.

preprint · 2020 · arXiv

Rapid Detection of Hot-spot by Tensor Decomposition with Application to Weekly Gonorrhea Data

In many bio-surveillance and healthcare applications, data sources are measured from many spatial locations repeatedly over time, say, daily/weekly/monthly. In these applications, we are typically interested in detecting hot-spots, which are defined as some structured outliers that are sparse over the spatial domain but persistent over time. In this paper, we propose a tensor decomposition method to detect when and where the hot-spots occur. Our proposed methods represent the observed raw data as a three-dimensional tensor including a circular time dimension for daily/weekly/monthly patterns, and then decompose the tensor into three components: smooth global trend, local hot-spots, and residuals. A combination of LASSO and fused LASSO is used to estimate the model parameters, and a CUSUM procedure is applied to detect when and where the hot-spots might occur. The usefulness of our proposed methodology is validated through numerical simulation and a real-world dataset in the weekly number of gonorrhea cases from 2006 to 2018 for 50 states in the United States.

preprint · 2020 · arXiv

Rapid Detection of Hot-spots via Tensor Decomposition with applications to Crime Rate Data

We propose an efficient statistical method (denoted as SSR-Tensor) to robustly and quickly detect hot-spots that are sparse and temporally consistent in a spatial-temporal dataset through tensor decomposition. Our main idea is first to build an SSR model to decompose the tensor data into a Smooth global trend mean, Sparse local hot-spots, and Residuals. Next, tensor decomposition is utilized as follows: bases are introduced to describe within-dimension correlation, and tensor products are used for between-dimension interaction. Then, a combination of LASSO and fused LASSO is used to estimate the model parameters. For this, an efficient recursive estimation procedure is developed based on large-scale convex optimization: we first transform the general LASSO optimization into a regular LASSO optimization and then apply FISTA to solve it with the fastest convergence rate. Finally, a CUSUM procedure is applied to detect when and where the hot-spot event occurs. We compare the performance of the proposed method in a numerical simulation study and a real-world case study, which contains a dataset including a collection of three types of crime rates for U.S. mainland states during the years 1965-2014. In both cases, the proposed SSR-Tensor is able to achieve fast detection and accurate localization of the hot-spots.
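
The FISTA step referred to in this abstract can be illustrated on a regular LASSO problem, minimize 0.5 * ||A x - y||^2 + lam * ||x||_1. The sketch below is a plain-Python version of the standard FISTA iteration (gradient step, soft-thresholding, momentum update). The function names, the toy data and the use of the squared Frobenius norm as a conservative Lipschitz bound are assumptions made for illustration; the sketch does not implement the SSR-Tensor model or the transformation from the general LASSO.

```python
import math


def soft_threshold(v, t):
    """Proximal operator of t * |v|: shrink v toward zero by t."""
    return math.copysign(max(abs(v) - t, 0.0), v)


def fista_lasso(A, y, lam, n_iter=500):
    """Approximately minimize 0.5 * ||A x - y||^2 + lam * ||x||_1 with FISTA.

    A is a list of rows and y a list of responses.
    """
    n_rows, n_cols = len(A), len(A[0])
    # The squared Frobenius norm upper-bounds the largest eigenvalue of
    # A^T A, so 1 / lipschitz is a safe (conservative) step size.
    lipschitz = sum(a * a for row in A for a in row)
    x = [0.0] * n_cols
    z = x[:]  # extrapolated point used for the gradient step
    t = 1.0
    for _ in range(n_iter):
        resid = [sum(A[i][j] * z[j] for j in range(n_cols)) - y[i]
                 for i in range(n_rows)]
        grad = [sum(A[i][j] * resid[i] for i in range(n_rows))
                for j in range(n_cols)]
        x_new = [soft_threshold(z[j] - grad[j] / lipschitz, lam / lipschitz)
                 for j in range(n_cols)]
        # Nesterov-style momentum update of the extrapolation weight.
        t_new = (1.0 + math.sqrt(1.0 + 4.0 * t * t)) / 2.0
        z = [x_new[j] + ((t - 1.0) / t_new) * (x_new[j] - x[j])
             for j in range(n_cols)]
        x, t = x_new, t_new
    return x


# Toy problem with an identity design, where the exact solution is the
# soft-thresholded response vector.
A = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 1.0]]
y = [3.0, 0.2, -1.5]
x_hat = fista_lasso(A, y, lam=0.5)
print(x_hat)
```

With an identity design the small coefficient is shrunk to zero while the large ones are reduced by lam, which is the sparsity behaviour that makes LASSO suitable for isolating sparse hot-spots.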

preprint · 2020 · arXiv

Real-time Detection of Clustered Events in Video-imaging data with Applications to Additive Manufacturing

The use of video-imaging data for in-line process monitoring applications has become more and more popular in the industry. In this framework, spatio-temporal statistical process monitoring methods are needed to capture the relevant information content and signal possible out-of-control states. Video-imaging data are characterized by a spatio-temporal variability structure that depends on the underlying phenomenon, and typical out-of-control patterns are related to events that are localized both in time and space. In this paper, we propose an integrated spatio-temporal decomposition and regression approach for anomaly detection in video-imaging data. Out-of-control events are typically sparse, spatially clustered, and temporally consistent. Therefore, the goal is to not only detect the anomaly as quickly as possible ("when") but also locate it ("where"). The proposed approach works by decomposing the original spatio-temporal data into random natural events, sparse, spatially clustered, and temporally consistent anomalous events, and random noise. Recursive estimation procedures for spatio-temporal regression are presented to enable the real-time implementation of the proposed methodology. Finally, a likelihood ratio test procedure is proposed to detect when and where the hotspot happens. The proposed approach was applied to the analysis of video-imaging data to detect and locate local over-heating phenomena ("hotspots") during the layer-wise process in a metal additive manufacturing process.

preprint · 2018 · arXiv

A novel approach for fusion of heterogeneous sources of data

With advancements in sensor technology, heterogeneous sets of data containing samples of scalars, waveform signals, images, or even structured point clouds are becoming increasingly popular. A statistical model that represents the behavior of the underlying system based upon such a heterogeneous set of data can be used in monitoring, control, and optimization of the system. Unfortunately, available methods only focus on scalar and curve data and do not provide a general framework that can integrate different sources of data to construct a model. This paper poses the problem of estimating a process output, measured by a scalar, a curve, an image, or a point cloud, from a set of heterogeneous process variables such as scalar process settings, sensor readings, and images. We introduce a general approach in which each set of input data (predictor) as well as the output measurements are represented by tensors. We formulate a linear regression model between the input and output tensors and estimate the parameters by minimizing a least squares loss function. In order to avoid overfitting and to reduce the number of parameters to be estimated, we decompose the model parameters using several bases, spanning the input and output spaces. Next, we learn both the bases and their spanning coefficients when minimizing the loss function using an alternating least squares (ALS) algorithm. We show that such a minimization has a closed-form solution in each iteration and can be computed very efficiently. Through several simulation and case studies, we evaluate the performance of the proposed method. The results reveal the advantage of the proposed method over some benchmarks in the literature in terms of the mean square prediction error.

preprint · 2018 · arXiv

Structured Point Cloud Data Analysis via Regularized Tensor Regression for Process Modeling and Optimization

Advanced 3D metrology technologies such as Coordinate Measuring Machine (CMM) and laser 3D scanners have facilitated the collection of massive point cloud data, beneficial for process monitoring, control and optimization. However, due to their high dimensionality and structure complexity, modeling and analysis of point clouds are still a challenge. In this paper, we utilize multilinear algebra techniques and propose a set of tensor regression approaches to model the variational patterns of point clouds and to link them to process variables. The performance of the proposed methods is evaluated through simulations and a real case study of turning process optimization.

preprint · 2016 · arXiv

Estimating fiber orientation distribution from diffusion MRI with spherical needlets

We present a novel method for estimation of the fiber orientation distribution (FOD) function based on diffusion-weighted Magnetic Resonance Imaging (D-MRI) data. We formulate the problem of FOD estimation as a regression problem through spherical deconvolution and a sparse representation of the FOD by a spherical needlets basis that forms a multi-resolution tight frame for spherical functions. This sparse representation allows us to estimate the FOD by an $l_1$-penalized regression under a non-negativity constraint. The resulting convex optimization problem is solved by an alternating direction method of multipliers (ADMM) algorithm. The proposed method leads to a reconstruction of the FODs that is accurate, has low variability and preserves sharp features. Through extensive experiments, we demonstrate the effectiveness and favorable performance of the proposed method compared with two existing methods. In particular, we show the ability of the proposed method to successfully resolve fiber crossings at small angles and to automatically identify isotropic diffusion. We also apply the proposed method to real 3T D-MRI data sets of healthy elderly individuals. The results show realistic descriptions of crossing fibers that are more accurate and less noisy than those of competing methods, even with a relatively small number of gradient directions.

preprint · 2014 · arXiv

A globally attractive cycle driven by sequential bifurcations containing ghost effects in a 3-node yeast cell cycle model

Yeast cells produce daughter cells through a DNA replication and mitosis cycle associated with checkpoints and governed by the cell cycle regulatory network. To ensure genome stability and genetic information inheritance, this regulatory network must be dynamically robust against various fluctuations. Here we construct a simplified cell cycle model for a budding yeast to investigate the underlying mechanism that ensures robustness in this process containing sequential tasks (DNA replication and mitosis). We first establish a three-variable model and select a parameter set that qualitatively describes the yeast cell cycle process. Then, through nonlinear dynamic analysis, we demonstrate that the yeast cell cycle process is an excitable system driven by a sequence of saddle-node bifurcations with ghost effects. We further show that the yeast cell cycle trajectory is globally attractive with modularity in both state and parameter space, while the convergent manifold provides a suitable control state for cell cycle checkpoints. These results not only highlight a regulatory mechanism for executing successive cell cycle processes, but also provide a possible strategy for the synthetic network design of sequential-task processes.

preprint · 2014 · arXiv

Direct observation of the transition from indirect to direct bandgap in atomically thin epitaxial MoSe2

Quantum systems in confined geometries are host to novel physical phenomena. Examples include quantum Hall systems in semiconductors and Dirac electrons in graphene. Interest in such systems has also been intensified by the recent discovery of a large enhancement in photoluminescence quantum efficiency and a potential route to valleytronics in atomically thin layers of transition metal dichalcogenides, MX2 (M = Mo, W; X = S, Se, Te), which are closely related to the indirect to direct bandgap transition in monolayers. Here, we report the first direct observation of the transition from indirect to direct bandgap in monolayer samples by using angle resolved photoemission spectroscopy on high-quality thin films of MoSe2 with variable thickness, grown by molecular beam epitaxy. The band structure measured experimentally indicates a stronger tendency of monolayer MoSe2 towards a direct bandgap, as well as a larger gap size, than theoretically predicted. Moreover, our finding of a significant spin-splitting of 180 meV at the valence band maximum of a monolayer MoSe2 film could expand its possible application to spintronic devices.

preprint · 2014 · arXiv

Improved Scatter Correction in X-Ray Cone Beam CT with Moving Beam Stop Array Using Johns' Equation

In this paper, an improved scatter correction with a moving beam stop array (BSA) for x-ray cone beam (CB) CT is proposed. First, the correlation between neighboring CB views is deduced based on John's Equation. Then, a correlation-based algorithm is presented to complement the incomplete views by using the redundancy (over-determined information) in CB projections. Finally, by combining the algorithm with the scatter correction method using a moving BSA, where part of the primary radiation is blocked and incomplete projections are acquired, an improved correction method is proposed. Effectiveness and robustness are validated by Monte Carlo (MC) simulation with EGSnrc on a humanoid phantom.

preprint · 2014 · arXiv

Peacock: Learning Long-Tail Topic Features for Industrial Applications

Latent Dirichlet allocation (LDA) is a popular topic modeling technique in academia but less so in industry, especially in large-scale applications involving search engines and online advertising systems. A main underlying reason is that the topic models used have been too small in scale to be useful; for example, some of the largest LDA models reported in the literature have up to $10^3$ topics, which can hardly cover the long-tail semantic word sets. In this paper, we show that the number of topics is a key factor that can significantly boost the utility of topic-modeling systems. In particular, we show that a "big" LDA model with at least $10^5$ topics inferred from $10^9$ search queries can achieve a significant improvement on industrial search engine and online advertising systems, both of which serve hundreds of millions of users. We develop a novel distributed system called Peacock to learn big LDA models from big data. The main features of Peacock include a hierarchical distributed architecture, real-time prediction and topic de-duplication. We empirically demonstrate that the Peacock system is capable of providing significant benefits via highly scalable LDA topic models for several industrial applications.

preprint · 2014 · arXiv

Single-scan scatter correction in CBCT by using projection correlation based view interpolation (PC-VI) and a stationary ring-shaped beam stop array (BSA)

In scatter correction for x-ray Cone Beam (CB) CT, the single-scan scheme with a moving Beam Stop Array (BSA) offers reliable scatter measurement with low dose, and by using Projection Correlation based View Interpolation (PC-VI), the primary fluence shaded by the moving BSA (during scatter measurement) can be recovered with high accuracy. However, the moving BSA may increase the mechanical burden in real applications. For better practicability, in this paper we propose a PC-VI based single-scan scheme with a ring-shaped stationary BSA, which serves as a virtual moving BSA during the CB scan, so the primary fluence shaded by this stationary BSA can also be well recovered by PC-VI. The principle for designing the whole system is deduced and evaluated. The proposed scheme greatly enhances the practicability of the single-scan scatter correction scheme.

preprint · 2013 · arXiv

Comprehensive Evaluations of Cone-beam CT dose in Image-guided Radiation Therapy via GPU-based Monte Carlo simulations

Cone beam CT (CBCT) has been widely used for patient setup in image guided radiation therapy (IGRT). Radiation dose from CBCT scans has become a clinical concern. The purposes of this study are 1) to commission a GPU-based Monte Carlo (MC) dose calculation package gCTD for Varian On-Board Imaging (OBI) system and test the calculation accuracy, and 2) to quantitatively evaluate CBCT dose from the OBI system in typical IGRT scan protocols. We first conducted dose measurements in a water phantom. X-ray source model parameters used in gCTD are obtained through a commissioning process. gCTD accuracy is demonstrated by comparing calculations with measurements in water and in CTDI phantoms. 25 brain cancer patients are used to study dose in a standard-dose head protocol, and 25 prostate cancer patients are used to study dose in pelvis protocol and pelvis spotlight protocol. Mean dose to each organ is calculated. Mean dose to 2% voxels that have the highest dose is also computed to quantify the maximum dose. It is found that the mean dose value to an organ varies largely among patients. Moreover, dose distribution is highly non-homogeneous inside an organ. The maximum dose is found to be 1~3 times higher than the mean dose depending on the organ, and is up to 8 times higher for the entire body due to the very high dose region in bony structures. High computational efficiency has also been observed in our studies, such that MC dose calculation time is less than 5 min for a typical case.

preprint · 2013 · arXiv

Warping of accretion disk and launching of jet by a spinning black hole in NGC 4258

We fit the most updated broadband spectral energy distribution from radio to X-rays for NGC 4258 with a coupled accretion-jet model surrounding a Kerr black hole (BH), where both the jet and the warped H_2O maser disk are assumed to be triggered by a spinning BH through the Blandford-Znajek mechanism and the Bardeen-Petterson effect, respectively. The accretion flow consists of an inner radiatively inefficient accretion flow (RIAF) and an outer truncated standard thin disk, where the transition radius is R_tr ~ 3×10^3 R_g for NGC 4258 based on the width and variability of its narrow Fe Kα line. The hybrid jet formation model, a variant of the Blandford-Znajek model, is used to model the jet power. Therefore, we can estimate the accretion rate and BH spin through two observed quantities, X-ray emission and jet power, where the observed jet power is estimated from the low-frequency radio emission. Through this method, we find that the BH of NGC 4258 should be mildly spinning with dimensionless spin parameter a_* = 0.7 ± 0.2. The outer thin disk mainly radiates in the near-infrared waveband and the jet contributes predominantly in the radio waveband. Using the BH spin estimated above and the accretion rate inferred at the region of the maser disk from the physical existence of the H_2O maser, we find that the warp radius is ~8.6×10^4 R_g if it is driven by the Bardeen-Petterson effect, which is consistent with the observational result very well.

preprint2012arXiv

A comprehensive study on the relationship between image quality and imaging dose in low-dose cone beam CT

While compressed sensing (CS) based reconstructions have been developed for low-dose CBCT, a clear understanding of the relationship between image quality and imaging dose at low dose levels is needed. In this paper, we qualitatively investigate this subject in a comprehensive manner with extensive experimental and simulation studies. The basic idea is to plot image quality and imaging dose together as functions of the number of projections and the mAs per projection over the whole clinically relevant range. A clear understanding of the tradeoff between image quality and dose can then be achieved, and optimal low-dose CBCT scan protocols can be developed for various imaging tasks in IGRT. Main findings of this work include: 1) Under the CS framework, image quality degrades little over a large dose range, and the degradation becomes evident when the dose is < 100 total mAs. A dose < 40 total mAs leads to dramatic image degradation. Optimal low-dose CBCT scan protocols likely fall in the dose range of 40-100 total mAs, depending on the specific IGRT application. 2) Among different scan protocols at a constant low-dose level, the super sparse-view reconstruction with fewer than 50 projections is the most challenging case, even with strong regularization. Better image quality can be acquired with other low-mAs protocols. 3) The optimal scan protocol is the combination of a medium number of projections and a medium level of mAs/view. This is more evident when the dose is around 72.8 total mAs or below and when the ROI is a low-contrast or high-resolution object. Based on our results, the optimal number of projections is around 90 to 120. 4) The lowest clinically acceptable dose level is task dependent. In our study, 72.8 total mAs is a safe dose level for visualizing low-contrast objects, while 12.2 total mAs is sufficient for detecting high-contrast objects of diameter greater than 3 mm.
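
To make the dose figures concrete, the short sketch below converts a total-mAs budget into a per-view exposure. It assumes that total mAs is simply the number of projections multiplied by the mAs per projection; the abstract does not state this relation explicitly, and the function name is hypothetical.

```python
def mas_per_projection(total_mas, n_projections):
    """Per-view exposure, assuming total mAs = n_projections * mAs per view."""
    if n_projections <= 0:
        raise ValueError("n_projections must be positive")
    return total_mas / n_projections

# The 72.8 total mAs level spread over the reported 90 to 120 projection range.
for n in (90, 120):
    print(n, round(mas_per_projection(72.8, n), 3))
# prints:
# 90 0.809
# 120 0.607
```

Under that assumption, the "medium number of projections, medium mAs/view" regime at 72.8 total mAs corresponds to roughly 0.6 to 0.8 mAs per view.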

preprint2012arXiv

A GPU Tool for Efficient, Accurate, and Realistic Simulation of Cone Beam CT Projections

Simulation of x-ray projection images plays an important role in cone beam CT (CBCT) related research projects. A projection image contains primary signal, scatter signal, and noise. It is computationally demanding to perform accurate and realistic computations for all of these components. In this work, we develop a package on GPU, called gDRR, for the accurate and efficient computations of x-ray projection images in CBCT under clinically realistic conditions. The primary signal is computed by a tri-linear ray-tracing algorithm. A Monte Carlo (MC) simulation is then performed, yielding the primary signal and the scatter signal, both with noise. A denoising process is applied to obtain a smooth scatter signal. The noise component is then obtained by combining the difference between the MC primary and the ray-tracing primary signals, and the difference between the MC simulated scatter and the denoised scatter signals. Finally, a calibration step converts the calculated noise signal into a realistic one by scaling its amplitude. For a typical CBCT projection with a poly-energetic spectrum, the calculation time for the primary signal is 1.2~2.3 sec, while the MC simulations take 28.1~95.3 sec. Computation time for all other steps is negligible. The ray-tracing primary signal matches well with the primary part of the MC simulation result. The MC simulated scatter signal using gDRR is in agreement with EGSnrc results with a relative difference of 3.8%. A noise calibration process is conducted to calibrate gDRR against a real CBCT scanner. The calculated projections are accurate and realistic, such that beam-hardening artifacts and scatter artifacts can be reproduced using the simulated projections. The noise amplitudes in the CBCT images reconstructed from the simulated projections also agree with those in the measured images at corresponding mAs levels.
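
The noise construction described in this abstract can be sketched as follows. This is an illustration, not the gDRR implementation: it assumes that "combining" the two differences means adding them, that calibration is a single scalar scale factor, and that the final projection is the sum of primary, scatter, and noise. All function and variable names are hypothetical, and the denoised scatter is taken as an input rather than computed.

```python
import numpy as np

def noise_component(mc_primary, rt_primary, mc_scatter, denoised_scatter, scale=1.0):
    """Noise estimate from MC residuals, scaled by a calibration factor.

    Assumption: the two residuals are summed, and calibration is a
    single multiplicative factor applied to the result.
    """
    primary_noise = mc_primary - rt_primary        # MC primary minus ray-traced primary
    scatter_noise = mc_scatter - denoised_scatter  # MC scatter minus smoothed scatter
    return scale * (primary_noise + scatter_noise)

def simulated_projection(rt_primary, denoised_scatter, noise):
    """Assumed additive model: projection = primary + scatter + noise."""
    return rt_primary + denoised_scatter + noise

# Toy 2x2 detector with hand-picked residuals (arbitrary units).
rt_primary = np.full((2, 2), 10.0)
mc_primary = rt_primary + np.array([[0.5, -0.5], [0.25, -0.25]])
denoised_scatter = np.full((2, 2), 2.0)
mc_scatter = denoised_scatter + np.array([[0.1, -0.1], [0.2, -0.2]])

noise = noise_component(mc_primary, rt_primary, mc_scatter, denoised_scatter, scale=2.0)
projection = simulated_projection(rt_primary, denoised_scatter, noise)
```

Separating the smooth signals from the residuals lets the noise amplitude be rescaled without altering the noise-free primary and scatter estimates.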

preprint2012arXiv

Extracting respiratory signals from thoracic cone beam CT projections

The patient respiratory signal associated with cone beam CT (CBCT) projections is important for lung cancer radiotherapy. Instead of monitoring an external surrogate of respiration, this signal can be extracted directly from the CBCT projections. In this paper, we propose a novel local principal component analysis (LPCA) method to extract the respiratory signal by distinguishing the content change induced by respiratory motion from the content change induced by gantry rotation in the CBCT projections. The LPCA method is evaluated by comparing it with three state-of-the-art projection-based methods, namely, the Amsterdam Shroud (AS) method, the intensity analysis (IA) method, and the Fourier-transform based phase analysis (FT-p) method. Clinical CBCT projection data from eight patients, acquired under various clinical scenarios, were used to investigate the performance of each method. We found that the proposed LPCA method demonstrated the best overall performance for the cases tested and is thus a promising technique for extracting the respiratory signal. We also identified the applicability of each existing method.

preprint2011arXiv

Fast Monte Carlo Simulation for Patient-specific CT/CBCT Imaging Dose Calculation

Recently, X-ray imaging dose from computed tomography (CT) or cone beam CT (CBCT) scans has become a serious concern. Patient-specific imaging dose calculation has been proposed for the purpose of dose management. While Monte Carlo (MC) dose calculation can be quite accurate for this purpose, it suffers from low computational efficiency. In response to this problem, we have developed an MC dose calculation package, gCTD, on GPU architecture under the NVIDIA CUDA platform for fast and accurate estimation of the x-ray imaging dose received by a patient during a CT or CBCT scan. Techniques have been developed specifically for the GPU architecture to achieve high computational efficiency. Dose calculations using CBCT scanning geometry in a homogeneous water phantom and a heterogeneous Zubal head phantom have shown good agreement between gCTD and EGSnrc, indicating the accuracy of our code. In terms of efficiency, gCTD attains a speed-up of ~400 times in the homogeneous water phantom and ~76.6 times in the Zubal phantom compared to EGSnrc. As for absolute computation time, imaging dose calculation for the Zubal phantom can be accomplished in ~17 sec with an average relative standard deviation of 0.4%. Though our gCTD code has been developed and tested in the context of CBCT scans, with a simple modification of the geometry it can be used for assessing imaging dose in CT scans as well.

preprint2011arXiv

Stability of Steady Solutions to Reaction-Hyperbolic Systems for Axonal Transport

This paper is concerned with the stability of steady solutions to initial-boundary-value problems of reaction-hyperbolic systems for axonal transport. Under proper structural assumptions, we clarify the relaxation structure of the reaction-hyperbolic systems and show the time-asymptotic stability of steady solutions or relaxation boundary-layers.

preprint2010arXiv

Weak entropy solutions of nonlinear reaction-hyperbolic systems for axonal transport

This paper is concerned with a class of nonlinear reaction-hyperbolic systems as models for axonal transport in neuroscience. We show the global existence of entropy-satisfying BV-solutions to the initial-value problems by using hyperbolic-type methods. Moreover, we rigorously justify the limit as the biochemical processes are much faster than the transport ones.

30 published item(s)

Task Vector Geometry Underlies Dual Modes of Task Inference in Transformers

Quasi-monolithic Compact Interferometric Sensor Head Design with Laser Auto-alignment

Adaptive Partially-Observed Sequential Change Detection and Isolation

Adaptive Resources Allocation CUSUM for Binomial Count Data Monitoring with Application to COVID-19 Hotspot Detection

ANTLER: Bayesian Nonlinear Tensor Learning and Modeler for Unstructured, Varying-Size Point Cloud Data

Filters for ISI Suppression in Molecular Communication via Diffusion

Toward a Better Monitoring Statistic for Profile Monitoring via Variational Autoencoders

Adaptive Change Point Monitoring for High-Dimensional Data

Automatic Storage Structure Selection for hybrid Workload

Long-Short Term Spatiotemporal Tensor Prediction for Passenger Flow Profile

Partially Observable Online Change Detection via Smooth-Sparse Decomposition

Rapid Detection of Hot-spot by Tensor Decomposition with Application to Weekly Gonorrhea Data

Rapid Detection of Hot-spots via Tensor Decomposition with applications to Crime Rate Data

Real-time Detection of Clustered Events in Video-imaging data with Applications to Additive Manufacturing

A novel approach for fusion of heterogeneous sources of data

Structured Point Cloud Data Analysis via Regularized Tensor Regression for Process Modeling and Optimization

Estimating fiber orientation distribution from diffusion MRI with spherical needlets

A globally attractive cycle driven by sequential bifurcations containing ghost effects in a 3-node yeast cell cycle model

Direct observation of the transition from indirect to direct bandgap in atomically thin epitaxial MoSe2

Improved Scatter Correction in X-Ray Cone Beam CT with Moving Beam Stop Array Using Johns' Equation

Peacock: Learning Long-Tail Topic Features for Industrial Applications

Single-scan scatter correction in CBCT by using projection correlation based view interpolation (PC-VI) and a stationary ring-shaped beam stop array (BSA)

Comprehensive Evaluations of Cone-beam CT dose in Image-guided Radiation Therapy via GPU-based Monte Carlo simulations

Warping of accretion disk and launching of jet by a spinning black hole in NGC 4258

A comprehensive study on the relationship between image quality and imaging dose in low-dose cone beam CT

A GPU Tool for Efficient, Accurate, and Realistic Simulation of Cone Beam CT Projections

Extracting respiratory signals from thoracic cone beam CT projections

Fast Monte Carlo Simulation for Patient-specific CT/CBCT Imaging Dose Calculation

Stability of Steady Solutions to Reaction-Hyperbolic Systems for Axonal Transport

Weak entropy solutions of nonlinear reaction-hyperbolic systems for axonal transport