Source author record

Ge Wang

Ge Wang appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Catalog footprint

What is connected

58works

20topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2026arXiv

MobiDiary: Autoregressive Action Captioning with Wearable Devices and Wireless Signals

Human Activity Recognition (HAR) in smart homes is critical for health monitoring and assistive living. While vision-based systems are common, they face privacy concerns and environmental limitations (e.g., occlusion). In this work, we present MobiDiary, a framework that generates natural language descriptions of daily activities directly from heterogeneous physical signals (specifically IMU and Wi-Fi). Unlike conventional approaches that restrict outputs to pre-defined labels, MobiDiary produces expressive, human-readable summaries. To bridge the semantic gap between continuous, noisy physical signals and discrete linguistic descriptions, we propose a unified sensor encoder. Instead of relying on modality-specific engineering, we exploit the shared inductive biases of motion-induced signals--where both inertial and wireless data reflect underlying kinematic dynamics. Specifically, our encoder utilizes a patch-based mechanism to capture local temporal correlations and integrates heterogeneous placement embedding to unify spatial contexts across different sensors. These unified signal tokens are then fed into a Transformer-based decoder, which employs an autoregressive mechanism to generate coherent action descriptions word-by-word. We comprehensively evaluate our approach on multiple public benchmarks (XRF V2, UWash, and WiFiTAD). Experimental results demonstrate that MobiDiary effectively generalizes across modalities, achieving state-of-the-art performance on captioning metrics (e.g., BLEU@4, CIDEr, RMC) and outperforming specialized baselines in continuous action understanding.

preprint2025arXiv

SlideChain: Semantic Provenance for Lecture Understanding via Blockchain Registration

Modern vision--language models (VLMs) are increasingly used to interpret and generate educational content, yet their semantic outputs remain challenging to verify, reproduce, and audit over time. Inconsistencies across model families, inference settings, and computing environments undermine the reliability of AI-generated instructional material, particularly in high-stakes and quantitative STEM domains. This work introduces SlideChain, a blockchain-backed provenance framework designed to provide verifiable integrity for multimodal semantic extraction at scale. Using the SlideChain Slides Dataset-a curated corpus of 1,117 medical imaging lecture slides from a university course-we extract concepts and relational triples from four state-of-the-art VLMs and construct structured provenance records for every slide. SlideChain anchors cryptographic hashes of these records on a local EVM (Ethereum Virtual Machine)-compatible blockchain, providing tamper-evident auditability and persistent semantic baselines. Through the first systematic analysis of semantic disagreement, cross-model similarity, and lecture-level variability in multimodal educational content, we reveal pronounced cross-model discrepancies, including low concept overlap and near-zero agreement in relational triples on many slides. We further evaluate gas usage, throughput, and scalability under simulated deployment conditions, and demonstrate perfect tamper detection along with deterministic reproducibility across independent extraction runs. Together, these results show that SlideChain provides a practical and scalable step toward trustworthy, verifiable multimodal educational pipelines, supporting long-term auditability, reproducibility, and integrity for AI-assisted instructional systems.

preprint2024arXiv

Graph-level Protein Representation Learning by Structure Knowledge Refinement

This paper focuses on learning representation on the whole graph level in an unsupervised manner. Learning graph-level representation plays an important role in a variety of real-world issues such as molecule property prediction, protein structure feature extraction, and social network analysis. The mainstream method is utilizing contrastive learning to facilitate graph feature extraction, known as Graph Contrastive Learning (GCL). GCL, although effective, suffers from some complications in contrastive learning, such as the effect of false negative pairs. Moreover, augmentation strategies in GCL are weakly adaptive to diverse graph datasets. Motivated by these problems, we propose a novel framework called Structure Knowledge Refinement (SKR) which uses data structure to determine the probability of whether a pair is positive or negative. Meanwhile, we propose an augmentation strategy that naturally preserves the semantic meaning of the original data and is compatible with our SKR framework. Furthermore, we illustrate the effectiveness of our SKR framework through intuition and experiments. The experimental results on the tasks of graph-level classification demonstrate that our SKR framework is superior to most state-of-the-art baselines.

preprint2023arXiv

Task-based Assessment of Deep Networks for Sinogram Denoising with A Transformer-based Observer

A variety of supervise learning methods are available for low-dose CT denoising in the sinogram domain. Traditional model observers are widely employed to evaluate these methods. However, the sinogram domain evaluation remains an open challenge for deep learning-based low-dose CT denoising. Since each lesion in medical CT images corresponds to a narrow sinusoidal strip in sinogram domain, here we proposed a transformer-based model observer to evaluate sinogram domain supervised learning methods. The numerical results indicate that our transformer-based model well-approximates the Laguerre-Gauss channelized Hotelling observer (LG-CHO) for a signal-known-exactly (SKE) and background-known-statistically (BKS) task. The proposed model observer is employed to assess two classic CNN-based sinogram domain denoising methods. The results demonstrate a utility and potential of this transformer-based observer model in developing deep low-dose CT denoising methods in the sinogram domain.

preprint2022arXiv

A transformer-based deep learning approach for classifying brain metastases into primary organ sites using clinical whole brain MRI

Treatment decisions for brain metastatic disease rely on knowledge of the primary organ site, and currently made with biopsy and histology. Here we develop a novel deep learning approach for accurate non-invasive digital histology with whole-brain MRI data. Our IRB-approved single-site retrospective study was comprised of patients (n=1,399) referred for MRI treatment-planning and gamma knife radiosurgery over 21 years. Contrast-enhanced T1-weighted and T2-weighted Fluid-Attenuated Inversion Recovery brain MRI exams (n=1,582) were preprocessed and input to the proposed deep learning workflow for tumor segmentation, modality transfer, and primary site classification into one of five classes. Ten-fold cross-validation generated overall AUC of 0.878 (95%CI:0.873,0.883), lung class AUC of 0.889 (95%CI:0.883,0.895), breast class AUC of 0.873 (95%CI:0.860,0.886), melanoma class AUC of 0.852 (95%CI:0.842,0.862), renal class AUC of 0.830 (95%CI:0.809,0.851), and other class AUC of 0.822 (95%CI:0.805,0.839). These data establish that whole-brain imaging features are discriminative to allow accurate diagnosis of the primary organ site of malignancy. Our end-to-end deep radiomic approach has great potential for classifying metastatic tumor types from whole-brain MRI images. Further refinement may offer an invaluable clinical tool to expedite primary cancer site identification for precision treatment and improved outcomes.

preprint2022arXiv

DLME: Deep Local-flatness Manifold Embedding

Manifold learning (ML) aims to seek low-dimensional embedding from high-dimensional data. The problem is challenging on real-world datasets, especially with under-sampling data, and we find that previous methods perform poorly in this case. Generally, ML methods first transform input data into a low-dimensional embedding space to maintain the data's geometric structure and subsequently perform downstream tasks therein. The poor local connectivity of under-sampling data in the former step and inappropriate optimization objectives in the latter step leads to two problems: structural distortion and underconstrained embedding. This paper proposes a novel ML framework named Deep Local-flatness Manifold Embedding (DLME) to solve these problems. The proposed DLME constructs semantic manifolds by data augmentation and overcomes the structural distortion problem using a smoothness constrained based on a local flatness assumption about the manifold. To overcome the underconstrained embedding problem, we design a loss and theoretically demonstrate that it leads to a more suitable embedding based on the local flatness. Experiments on three types of datasets (toy, biological, and image) for various downstream tasks (classification, clustering, and visualization) show that our proposed DLME outperforms state-of-the-art ML and contrastive learning methods.

preprint2022arXiv

GasHis-Transformer: A Multi-scale Visual Transformer Approach for Gastric Histopathological Image Detection

In this paper, a multi-scale visual transformer model, referred as GasHis-Transformer, is proposed for Gastric Histopathological Image Detection (GHID), which enables the automatic global detection of gastric cancer images. GasHis-Transformer model consists of two key modules designed to extract global and local information using a position-encoded transformer model and a convolutional neural network with local convolution, respectively. A publicly available hematoxylin and eosin (H&E) stained gastric histopathological image dataset is used in the experiment. Furthermore, a Dropconnect based lightweight network is proposed to reduce the model size and training time of GasHis-Transformer for clinical applications with improved confidence. Moreover, a series of contrast and extended experiments verify the robustness, extensibility and stability of GasHis-Transformer. In conclusion, GasHis-Transformer demonstrates high global detection performance and shows its significant potential in GHID task.

preprint2022arXiv

HOME: High-Order Mixed-Moment-based Embedding for Representation Learning

Minimum redundancy among different elements of an embedding in a latent space is a fundamental requirement or major preference in representation learning to capture intrinsic informational structures. Current self-supervised learning methods minimize a pair-wise covariance matrix to reduce the feature redundancy and produce promising results. However, such representation features of multiple variables may contain the redundancy among more than two feature variables that cannot be minimized via the pairwise regularization. Here we propose the High-Order Mixed-Moment-based Embedding (HOME) strategy to reduce the redundancy between any sets of feature variables, which is to our best knowledge the first attempt to utilize high-order statistics/information in this context. Multivariate mutual information is minimum if and only if multiple variables are mutually independent, which suggests the necessary conditions of factorized mixed moments among multiple variables. Based on these statistical and information theoretic principles, our general HOME framework is presented for self-supervised representation learning. Our initial experiments show that a simple version in the form of a three-order HOME scheme already significantly outperforms the current two-order baseline method (i.e., Barlow Twins) in terms of the linear evaluation on representation features.

preprint2022arXiv

Quasi-Equivalence of Width and Depth of Neural Networks

While classic studies proved that wide networks allow universal approximation, recent research and successes of deep learning demonstrate the power of deep networks. Based on a symmetric consideration, we investigate if the design of artificial neural networks should have a directional preference, and what the mechanism of interaction is between the width and depth of a network. Inspired by the De Morgan law, we address this fundamental question by establishing a quasi-equivalence between the width and depth of ReLU networks in two aspects. First, we formulate two transforms for mapping an arbitrary ReLU network to a wide network and a deep network respectively for either regression or classification so that the essentially same capability of the original network can be implemented. Then, we replace the mainstream artificial neuron type with a quadratic counterpart, and utilize the factorization and continued fraction representations of the same polynomial function to construct a wide network and a deep network, respectively. Based on our findings, a deep network has a wide equivalent, and vice versa, subject to an arbitrarily small error.

preprint2022arXiv

Research Status of Deep Learning Methods for Rumor Detection

To manage the rumors in social media to reduce the harm of rumors in society. Many studies used methods of deep learning to detect rumors in open networks. To comprehensively sort out the research status of rumor detection from multiple perspectives, this paper analyzes the highly focused work from three perspectives: Feature Selection, Model Structure, and Research Methods. From the perspective of feature selection, we divide methods into content feature, social feature, and propagation structure feature of the rumors. Then, this work divides deep learning models of rumor detection into CNN, RNN, GNN, Transformer based on the model structure, which is convenient for comparison. Besides, this work summarizes 30 works into 7 rumor detection methods such as propagation trees, adversarial learning, cross-domain methods, multi-task learning, unsupervised and semi-supervised methods, based knowledge graph, and other methods for the first time. And compare the advantages of different methods to detect rumors. In addition, this review enumerate datasets available and discusses the potential issues and future work to help researchers advance the development of field.

preprint2022arXiv

SoftDropConnect (SDC) -- Effective and Efficient Quantification of the Network Uncertainty in Deep MR Image Analysis

Recently, deep learning has achieved remarkable successes in medical image analysis. Although deep neural networks generate clinically important predictions, they have inherent uncertainty. Such uncertainty is a major barrier to report these predictions with confidence. In this paper, we propose a novel yet simple Bayesian inference approach called SoftDropConnect (SDC) to quantify the network uncertainty in medical imaging tasks with gliomas segmentation and metastases classification as initial examples. Our key idea is that during training and testing SDC modulates network parameters continuously so as to allow affected information processing channels still in operation, instead of disabling them as Dropout or DropConnet does. When compared with three popular Bayesian inference methods including Bayes By Backprop, Dropout, and DropConnect, our SDC method (SDC-W after optimization) outperforms the three competing methods with a substantial margin. Quantitatively, our proposed method generates substantial improvements in prediction accuracy (by 3.4%, 2.5%, and 6.7% respectively for whole tumor segmentation in terms of dice score; and by 11.7%, 3.9%, and 8.7% respectively for brain metastases classification) and greatly reduced epistemic and aleatoric uncertainties. Our approach promises to deliver better diagnostic performance and make medical AI imaging more explainable and trustworthy.

preprint2022arXiv

Stationary Multi-source AI-powered Real-time Tomography (SMART)

Over the past decades, the development of CT technologies has been largely driven by the needs for cardiac imaging but the temporal resolution remains insufficient for clinical CT in difficult cases and rather challenging for preclinical micro-CT since small animals, as human cardiac disease models, have much higher heart rates than human. To address this challenge, here we report a Stationary Multi-source AI-based Real-time Tomography (SMART) micro-CT system. This unique scanner is featured by 29 source-detector pairs fixed on a circular track to collect x-ray signals in parallel, enabling instantaneous tomography in principle. Given the multi-source architecture, the field-of-view only covers a cardiac region. To solve this interior problem, an AI-empowered interior tomography approach is developed to synergize sparsity-based regularization and learning-based reconstruction. To demonstrate the performance and utilities of the SMART system, extensive results are obtained in physical phantom experiments and animal studies, including dead and live rats as well as live rabbits. The reconstructed volumetric images convincingly demonstrate the merits of the SMART system using the AI-empowered interior tomography approach, enabling cardiac micro-CT with the unprecedented temporal resolution of 30ms, which is an order of magnitude higher than the state of the art.

preprint2022arXiv

Suppression of Correlated Noise with Similarity-based Unsupervised Deep Learning

Image denoising is a prerequisite for downstream tasks in many fields. Low-dose and photon-counting computed tomography (CT) denoising can optimize diagnostic performance at minimized radiation dose. Supervised deep denoising methods are popular but require paired clean or noisy samples that are often unavailable in practice. Limited by the independent noise assumption, current unsupervised denoising methods cannot process correlated noises as in CT images. Here we propose the first-of-its-kind similarity-based unsupervised deep denoising approach, referred to as Noise2Sim, that works in a nonlocal and nonlinear fashion to suppress not only independent but also correlated noises. Theoretically, Noise2Sim is asymptotically equivalent to supervised learning methods under mild conditions. Experimentally, Nosie2Sim recovers intrinsic features from noisy low-dose CT and photon-counting CT images as effectively as or even better than supervised learning methods on practical datasets visually, quantitatively and statistically. Noise2Sim is a general unsupervised denoising approach and has great potential in diverse applications.

preprint2022arXiv

Top-level Design and Simulated Performance of the First Portable CT-MR scanner

Multi-modality imaging hardware can be integrated in a single gantry to collect diverse datasets for complementary information and spatiotemporal correlation, with excellent examples including PET-CT and PET-MRI. However, there is no CT-MRI prototype up to today due to technical challenges and associated cost-benefit considerations. Thanks to the rapid development of medical imaging, it becomes feasible now to integrate cost-effective CT and MRI imagers together for portability, popularity, and point of care. In this paper, we present the top-level design of the first portable CT-MRI system and evaluate its imaging performance via realistic numerical simulations. In this CT-MRI system, the magnet made of two NdFeB rings of about 40.0 cm radius forms a magnetic field of about 57 mT at the isocenter and has a gap of 11.3 cm to accommodate the rotating CT gantry. The targeted MR imaging field of view (FOV) is a sphere of ~15 cm in diameter and that of CT is approximately 20 cm diameter in axial direction and 5 cm in longitudinal direction. Our results show a great potential of such a hybrid system. The proposed CT-MRI system will be valuable in applications such as imaging in underdeveloped countries, disaster scenes and battle fields.

preprint2022arXiv

Unravelling Distance-Dependent Inter-Site Interactions and Magnetic Transition Effects of Heteronuclear Single Atom Catalysts on Electrochemical Oxygen Reduction

Inter-site interactions between single atom catalysts (SACs) in the high loading regime are critical to tuning the catalytic performance. However, the understanding on such interactions and their distance dependent effects remains elusive, especially for the heteronuclear SACs. In this study, we reveal the effects of the distance-dependent inter-site interaction on the catalytic performance of SACs. Using the density functional theory calculations, we systematically investigate the heteronuclear iron and cobalt single atoms co-supported on the nitrogen-doped graphene (FeN4-C and CoN4-C) for oxygen reduction reaction (ORR). We find that as the distance between Fe and Co SACs decreases, FeN4-C exhibits a reduced catalytic activity, which can be mitigated by the presence of an axial hydroxyl ligand, whereas the activity of CoN4-C shows a volcano-like evolution with the optimum reached at the intermediate distance. We further unravel that the transition towards the high-spin state upon adsorption of ORR intermediate adsorbates is responsible for the decreased activity of both FeN4-C and CoN4-C at short inter-site distance. Such high-spin state transition is also found to significantly shift the linear relation between hydroxyl (*OH) and hydroperoxyl (*OOH) adsorbates. These findings not only shed light on the SAC-specific effect of the distance-dependent inter-site interaction between heteronuclear SACs, but also pave a way towards shifting the long-standing linear relations observed in multiple-electron chemical reactions.

preprint2022arXiv

X-ray Dissectography Improves Lung Nodule Detection

Although radiographs are the most frequently used worldwide due to their cost-effectiveness and widespread accessibility, the structural superposition along the x-ray paths often renders suspicious or concerning lung nodules difficult to detect. In this study, we apply "X-ray dissectography" to dissect lungs digitally from a few radiographic projections, suppress the interference of irrelevant structures, and improve lung nodule detectability. For this purpose, a collaborative detection network is designed to localize lung nodules in 2D dissected projections and 3D physical space. Our experimental results show that our approach can significantly improve the average precision by 20+% in comparison with the common baseline that detects lung nodules from original projections using a popular detection network. Potentially, this approach could help re-design the current X-ray imaging protocols and workflows and improve the diagnostic performance of chest radiographs in lung diseases.

preprint2021arXiv

Noise Entangled GAN For Low-Dose CT Simulation

We propose a Noise Entangled GAN (NE-GAN) for simulating low-dose computed tomography (CT) images from a higher dose CT image. First, we present two schemes to generate a clean CT image and a noise image from the high-dose CT image. Then, given these generated images, an NE-GAN is proposed to simulate different levels of low-dose CT images, where the level of generated noise can be continuously controlled by a noise factor. NE-GAN consists of a generator and a set of discriminators, and the number of discriminators is determined by the number of noise levels during training. Compared with the traditional methods based on the projection data that are usually unavailable in real applications, NE-GAN can directly learn from the real and/or simulated CT images and may create low-dose CT images quickly without the need of raw data or other proprietary CT scanner information. The experimental results show that the proposed method has the potential to simulate realistic low-dose CT images.

preprint2021arXiv

Phase function estimation from a diffuse optical image via deep learning

The phase function is a key element of a light propagation model for Monte Carlo (MC) simulation, which is usually fitted with an analytic function with associated parameters. In recent years, machine learning methods were reported to estimate the parameters of the phase function of a particular form such as the Henyey-Greenstein phase function but, to our knowledge, no studies have been performed to determine the form of the phase function. Here we design a convolutional neural network to estimate the phase function from a diffuse optical image without any explicit assumption on the form of the phase function. Specifically, we use a Gaussian mixture model as an example to represent the phase function generally and learn the model parameters accurately. The Gaussian mixture model is selected because it provides the analytic expression of phase function to facilitate deflection angle sampling in MC simulation, and does not significantly increase the number of free parameters. Our proposed method is validated on MC-simulated reflectance images of typical biological tissues using the Henyey-Greenstein phase function with different anisotropy factors. The effects of field of view (FOV) and spatial resolution on the errors are analyzed to optimize the estimation method. The mean squared error of the phase function is 0.01 and the relative error of the anisotropy factor is 3.28%.

preprint2021arXiv

Soft Autoencoder and Its Wavelet Adaptation Interpretation

Recently, deep learning becomes the main focus of machine learning research and has greatly impacted many important fields. However, deep learning is criticized for lack of interpretability. As a successful unsupervised model in deep learning, the autoencoder embraces a wide spectrum of applications, yet it suffers from the model opaqueness as well. In this paper, we propose a new type of convolutional autoencoders, termed as Soft Autoencoder (Soft-AE), in which the activation functions of encoding layers are implemented with adaptable soft-thresholding units while decoding layers are realized with linear units. Consequently, Soft-AE can be naturally interpreted as a learned cascaded wavelet shrinkage system. Our denoising experiments demonstrate that Soft-AE not only is interpretable but also offers a competitive performance relative to its counterparts. Furthermore, we propose a generalized linear unit (GenLU) to make an autoencoder more adaptive in nonlinearly filtering images and data, such as denoising and deblurring.

preprint2020arXiv

Cine Cardiac MRI Motion Artifact Reduction Using a Recurrent Neural Network

Cine cardiac magnetic resonance imaging (MRI) is widely used for diagnosis of cardiac diseases thanks to its ability to present cardiovascular features in excellent contrast. As compared to computed tomography (CT), MRI, however, requires a long scan time, which inevitably induces motion artifacts and causes patients' discomfort. Thus, there has been a strong clinical motivation to develop techniques to reduce both the scan time and motion artifacts. Given its successful applications in other medical imaging tasks such as MRI super-resolution and CT metal artifact reduction, deep learning is a promising approach for cardiac MRI motion artifact reduction. In this paper, we propose a recurrent neural network to simultaneously extract both spatial and temporal features from under-sampled, motion-blurred cine cardiac images for improved image quality. The experimental results demonstrate substantially improved image quality on two clinical test datasets. Also, our method enables data-driven frame interpolation at an enhanced temporal resolution. Compared with existing methods, our deep learning approach gives a superior performance in terms of structural similarity (SSIM) and peak signal-to-noise ratio (PSNR).

preprint2020arXiv

Clinical Micro-CT Empowered by Interior Tomography, Robotic Scanning, and Deep Learning

While micro-CT systems are instrumental in preclinical research, clinical micro-CT imaging has long been desired with cochlear implantation as a primary example. The structural details of the cochlear implant and the temporal bone require a significantly higher image resolution than that (about 0.2 mm) provided by current medical CT scanners. In this paper, we propose a clinical micro-CT (CMCT) system design integrating conventional spiral cone-beam CT, contemporary interior tomography, deep learning techniques, and technologies of micro-focus X-ray source, photon-counting detector (PCD), and robotic arms for ultrahigh resolution localized tomography of a freely-selected volume of interest (VOI) at a minimized radiation dose level. The whole system consists of a standard CT scanner for a clinical CT exam and VOI specification, and a robotic-arm based micro-CT scanner for a local scan at much higher spatial and spectral resolution as well as much reduced radiation dose. The prior information from global scan is also fully utilized for background compensation to improve interior tomography from local data for accurate and stable VOI reconstruction. Our results and analysis show that the proposed hybrid reconstruction algorithm delivers superior local reconstruction, being insensitive to the misalignment of the isocenter position and initial view angle in the data/image registration while the attenuation error caused by scale mismatch can be effectively addressed with bias correction. These findings demonstrate the feasibility of our system design. We envision that deep learning techniques can be leveraged for optimized imaging performance. With high resolution imaging, high dose efficiency and low system cost synergistically, our proposed CMCT system has great potentials in temporal bone imaging as well as various other clinical applications.

preprint2020arXiv

GATCluster: Self-Supervised Gaussian-Attention Network for Image Clustering

We propose a self-supervised Gaussian ATtention network for image Clustering (GATCluster). Rather than extracting intermediate features first and then performing the traditional clustering algorithm, GATCluster directly outputs semantic cluster labels without further post-processing. Theoretically, we give a Label Feature Theorem to guarantee the learned features are one-hot encoded vectors, and the trivial solutions are avoided. To train the GATCluster in a completely unsupervised manner, we design four self-learning tasks with the constraints of transformation invariance, separability maximization, entropy analysis, and attention mapping. Specifically, the transformation invariance and separability maximization tasks learn the relationships between sample pairs. The entropy analysis task aims to avoid trivial solutions. To capture the object-oriented semantics, we design a self-supervised attention mechanism that includes a parameterized attention module and a soft-attention loss. All the guiding signals for clustering are self-generated during the training process. Moreover, we develop a two-step learning algorithm that is memory-efficient for clustering large-size images. Extensive experiments demonstrate the superiority of our proposed method in comparison with the state-of-the-art image clustering benchmarks. Our code has been made publicly available at https://github.com/niuchuangnn/GATCluster.

preprint2020arXiv

Integrative Analysis for COVID-19 Patient Outcome Prediction

While image analysis of chest computed tomography (CT) for COVID-19 diagnosis has been intensively studied, little work has been performed for image-based patient outcome prediction. Management of high-risk patients with early intervention is a key to lower the fatality rate of COVID-19 pneumonia, as a majority of patients recover naturally. Therefore, an accurate prediction of disease progression with baseline imaging at the time of the initial presentation can help in patient management. In lieu of only size and volume information of pulmonary abnormalities and features through deep learning based image segmentation, here we combine radiomics of lung opacities and non-imaging features from demographic data, vital signs, and laboratory findings to predict need for intensive care unit (ICU) admission. To our knowledge, this is the first study that uses holistic information of a patient including both imaging and non-imaging data for outcome prediction. The proposed methods were thoroughly evaluated on datasets separately collected from three hospitals, one in the United States, one in Iran, and another in Italy, with a total 295 patients with reverse transcription polymerase chain reaction (RT-PCR) assay positive COVID-19 pneumonia. Our experimental results demonstrate that adding non-imaging features can significantly improve the performance of prediction to achieve AUC up to 0.884 and sensitivity as high as 96.1%, which can be valuable to provide clinical decision support in managing COVID-19 patients. Our methods may also be applied to other lung diseases including but not limited to community acquired pneumonia. The source code of our work is available at https://github.com/DIAL-RPI/COVID19-ICUPrediction.

preprint2020arXiv

Low-dimensional Manifold Constrained Disentanglement Network for Metal Artifact Reduction

Deep neural network based methods have achieved promising results for CT metal artifact reduction (MAR), most of which use many synthesized paired images for training. As synthesized metal artifacts in CT images may not accurately reflect the clinical counterparts, an artifact disentanglement network (ADN) was proposed with unpaired clinical images directly, producing promising results on clinical datasets. However, without sufficient supervision, it is difficult for ADN to recover structural details of artifact-affected CT images based on adversarial losses only. To overcome these problems, here we propose a low-dimensional manifold (LDM) constrained disentanglement network (DN), leveraging the image characteristics that the patch manifold is generally low-dimensional. Specifically, we design an LDM-DN learning algorithm to empower the disentanglement network through optimizing the synergistic network loss functions while constraining the recovered images to be on a low-dimensional patch manifold. Moreover, learning from both paired and unpaired data, an efficient hybrid optimization scheme is proposed to further improve the MAR performance on clinical datasets. Extensive experiments demonstrate that the proposed LDM-DN approach can consistently improve the MAR performance in paired and/or unpaired learning settings, outperforming competing methods on synthesized and clinical datasets.

preprint2020arXiv

Parameter-Transferred Wasserstein Generative Adversarial Network (PT-WGAN) for Low-Dose PET Image Denoising

Due to the widespread use of positron emission tomography (PET) in clinical practice, the potential risk of PET-associated radiation dose to patients needs to be minimized. However, with the reduction in the radiation dose, the resultant images may suffer from noise and artifacts that compromise diagnostic performance. In this paper, we propose a parameter-transferred Wasserstein generative adversarial network (PT-WGAN) for low-dose PET image denoising. The contributions of this paper are twofold: i) a PT-WGAN framework is designed to denoise low-dose PET images without compromising structural details, and ii) a task-specific initialization based on transfer learning is developed to train PT-WGAN using trainable parameters transferred from a pretrained model, which significantly improves the training efficiency of PT-WGAN. The experimental results on clinical data show that the proposed network can suppress image noise more effectively while preserving better image fidelity than recently published state-of-the-art methods. We make our code available at https://github.com/90n9-yu/PT-WGAN.

preprint2020arXiv

X-ray Monochromatic Imaging from Single-spectrum CT via Machine Learning

In clinical CT system, the x-ray tube emits polychromatic x-rays, and the x-ray detectors operate in the current-integrating mode. This physical process is accurately described by an energy-dependent non-linear integral equation. However, the non-linear model is not invertible with a computationally efficient solution, and is often approximated as a linear integral model in the form of the Radon transform. Such approximation basically ignores energy-dependent information and would generate beam hardening artifacts. Dual-energy CT (DECT) scans one object using two different x-ray energy spectra for the acquisition of two spectrally distinct projection datasets to improve imaging performance. Thus, DECT can reconstruct energy and material-selective images, realizing monochromatic imaging and material decomposition. Nevertheless, DECT would increase radiation dose, system complexity, and equipment cost relative to single-spectrum CT. In this paper, a machine-learning-based CT reconstruction method is proposed to perform monochromatic image reconstruction using a single-spectrum CT scanner. Specifically, a residual neural network (ResNet) model is adapted to map a CT image to a monochromatic counterpart at a pre-specified energy level. This ResNet is trained on clinical dual-energy data, showing an excellent convergence to a minimal loss. The trained network produces high-quality monochromatic images on testing data, with a relative error of less than 0.2%. This work has great potential in clinical DECT applications such as tissue characterization, beam hardening correction and proton therapy planning.

preprint2020arXiv

X-ray Photon-Counting Data Correction through Deep Learning

X-ray photon-counting detectors (PCDs) are drawing an increasing attention in recent years due to their low noise and energy discrimination capabilities. The energy/spectral dimension associated with PCDs potentially brings great benefits such as for material decomposition, beam hardening and metal artifact reduction, as well as low-dose CT imaging. However, X-ray PCDs are currently limited by several technical issues, particularly charge splitting (including charge sharing and K-shell fluorescence re-absorption or escaping) and pulse pile-up effects which distort the energy spectrum and compromise the data quality. Correction of raw PCD measurements with hardware improvement and analytic modeling is rather expensive and complicated. Hence, here we proposed a deep neural network based PCD data correction approach which directly maps imperfect data to the ideal data in the supervised learning mode. In this work, we first establish a complete simulation model incorporating the charge splitting and pulse pile-up effects. The simulated PCD data and the ground truth counterparts are then fed to a specially designed deep adversarial network for PCD data correction. Next, the trained network is used to correct separately generated PCD data. The test results demonstrate that the trained network successfully recovers the ideal spectrum from the distorted measurement within $\pm6\%$ relative error. Significant data and image fidelity improvements are clearly observed in both projection and reconstruction domains.

preprint2019arXiv

A Method of Rapid Quantification of Patient-Specific Organ Dose for CT Using Coupled Deep-Learning based Multi-Organ Segmentation and GPU-accelerated Monte Carlo Dose Computing

Purpose: This paper describes a new method to apply deep-learning algorithms for automatic segmentation of radiosensitive organs from 3D tomographic CT images before computing organ doses using a GPU-based Monte Carlo code. Methods: A deep convolutional neural network (CNN) for organ segmentation is trained to automatically delineate radiosensitive organs from CT. With a GPU-based Monte Carlo dose engine (ARCHER) to derive CT dose of a phantom made from a subject's CT scan, we are then able to compute the patient-specific CT dose for each of the segmented organs. The developed tool is validated by using Relative Dose Error (RDE) against the organ doses calculated by ARCHER with manual segmentation performed by radiologists. The dose computation results are also compared against organ doses from population-average phantoms to demonstrate the improvement achieved by using the developed tool. In this study, two datasets were used: The Lung CT Segmentation Challenge 2017 (LCTSC) dataset, which contains 60 thoracic CT scan patients each with 5 segmented organs, and the Pancreas-CT (PCT) dataset, which contains 43 abdominal CT scan patients each with 8 segmented organs. Five-fold cross-validation of the new method is performed on both datasets. Results: Comparing with the traditional organ dose evaluation method that based on population-average phantom, our proposed method achieved the smaller RDE range on all organs with -4.3%~1.5% vs -31.5%~33.9% (lung), -7.0%~2.3% vs -15.2%~125.1% (heart), -18.8%~40.2% vs -10.3%~124.1% (esophagus) in the LCTSC dataset and -5.6%~1.6% vs -20.3%~57.4% (spleen), -4.5%~4.6% vs -19.5%~61.0% (pancreas), -2.3%~4.4% vs -37.8%~75.8% (left kidney), -14.9%~5.4% vs -39.9% ~14.6% (gall bladder), -0.9%~1.6% vs -30.1%~72.5% (liver), and -23.0%~11.1% vs -52.5%~-1.3% (stomach) in the PCT dataset.

preprint2019arXiv

MRI Super-Resolution with Ensemble Learning and Complementary Priors

Magnetic resonance imaging (MRI) is a widely used medical imaging modality. However, due to the limitations in hardware, scan time, and throughput, it is often clinically challenging to obtain high-quality MR images. The super-resolution approach is potentially promising to improve MR image quality without any hardware upgrade. In this paper, we propose an ensemble learning and deep learning framework for MR image super-resolution. In our study, we first enlarged low resolution images using 5 commonly used super-resolution algorithms and obtained differentially enlarged image datasets with complementary priors. Then, a generative adversarial network (GAN) is trained with each dataset to generate super-resolution MR images. Finally, a convolutional neural network is used for ensemble learning that synergizes the outputs of GANs into the final MR super-resolution images. According to our results, the ensemble learning results outcome any one of GAN outputs. Compared with some state-of-the-art deep learning-based super-resolution methods, our approach is advantageous in suppressing artifacts and keeping more image details.

preprint2019arXiv

Multi-Contrast Super-Resolution MRI Through a Progressive Network

Magnetic resonance imaging (MRI) is widely used for screening, diagnosis, image-guided therapy, and scientific research. A significant advantage of MRI over other imaging modalities such as computed tomography (CT) and nuclear imaging is that it clearly shows soft tissues in multi-contrasts. Compared with other medical image super-resolution (SR) methods that are in a single contrast, multi-contrast super-resolution studies can synergize multiple contrast images to achieve better super-resolution results. In this paper, we propose a one-level non-progressive neural network for low up-sampling multi-contrast super-resolution and a two-level progressive network for high up-sampling multi-contrast super-resolution. Multi-contrast information is combined in high-level feature space. Our experimental results demonstrate that the proposed networks can produce MRI super-resolution images with good image quality and outperform other multi-contrast super-resolution methods in terms of structural similarity and peak signal-to-noise ratio. Also, the progressive network produces a better SR image quality than the non-progressive network, even if the original low-resolution images were highly down-sampled.

preprint2016arXiv

A Perspective on Deep Imaging

The combination of tomographic imaging and deep learning, or machine learning in general, promises to empower not only image analysis but also image reconstruction. The latter aspect is considered in this perspective article with an emphasis on medical imaging to develop a new generation of image reconstruction theories and techniques. This direction might lead to intelligent utilization of domain knowledge from big data, innovative approaches for image reconstruction, and superior performance in clinical and preclinical applications. To realize the full impact of machine learning on medical imaging, major challenges must be addressed.

preprint2016arXiv

Bibliometric Index for Academic Leadership

Academic leadership is essential for research innovation and impact. Until now, there has been no dedicated measure of leadership by bibliometrics. Popular bibliometric indices are mainly based on academic output, such as the journal impact factor and the number of citations. Here we develop an academic leadership index based on readily available bibliometric data that is sensitive to not only academic output but also research efficiency. Our leadership index was tested in two studies on peer-reviewed journal papers by extramurally-funded principal investigators in the field of life sciences from China and the USA, respectively. The leadership performance of these principal investigators was quantified and compared relative to university rank and other factors. As a validation measure, we show that the highest average leadership index was achieved by principal investigators at top national universities in both countries. More interestingly, our results also indicate that on an individual basis, strong leadership and high efficiency are not necessarily associated with those at top-tier universities nor with the most funding. This leadership index may become the basis of a comprehensive merit system, facilitating academic evaluation and resource management.

preprint2016arXiv

Deep Learning for the Classification of Lung Nodules

Deep learning, as a promising new area of machine learning, has attracted a rapidly increasing attention in the field of medical imaging. Compared to the conventional machine learning methods, deep learning requires no hand-tuned feature extractor, and has shown a superior performance in many visual object recognition applications. In this study, we develop a deep convolutional neural network (CNN) and apply it to thoracic CT images for the classification of lung nodules. We present the CNN architecture and classification accuracy for the original images of lung nodules. In order to understand the features of lung nodules, we further construct new datasets, based on the combination of artificial geometric nodules and some transformations of the original images, as well as a stochastic nodule shape model. It is found that simplistic geometric nodules cannot capture the important features of lung nodules.

preprint2016arXiv

Low-dose CT denoising with convolutional neural network

To reduce the potential radiation risk, low-dose CT has attracted much attention. However, simply lowering the radiation dose will lead to significant deterioration of the image quality. In this paper, we propose a noise reduction method for low-dose CT via deep neural network without accessing original projection data. A deep convolutional neural network is trained to transform low-dose CT images towards normal-dose CT images, patch by patch. Visual and quantitative evaluation demonstrates a competing performance of the proposed method.

preprint2016arXiv

Low-Dose CT via Deep Neural Network

In order to reduce the potential radiation risk, low-dose CT has attracted more and more attention. However, simply lowering the radiation dose will significantly degrade the imaging quality. In this paper, we propose a noise reduction method for low-dose CT via deep learning without accessing the original projection data. An architecture of deep convolutional neural network was considered to map the low-dose CT images into its corresponding normal-dose CT images patch by patch. Qualitative and quantitative evaluations demonstrate a state-the-art performance of the proposed method.

preprint2015arXiv

Theoretical analysis on x-ray cylindrical grating interferometer

Grating interferometer is a state of art x-ray imaging approach, which can simultaneously acquire information of x-ray attenuation, phase shift, and small angle scattering. This approach is very sensitive to micro-structural variation and offers superior contrast resolution for biological soft tissues. The present grating interferometer often uses flat gratings, with serious limitations in the field of view and the flux of photons. The use of curved gratings allows perpendicular incidence of x-rays on the gratings, and gives higher visibility over a larger field of view than a conventional interferometer with flat gratings. In the study, we present a rigorous theoretical analysis of the self-imaging of curved transmission gratings based on Rayleigh-Sommerfeld diffraction. Numerical simulations have demonstrated the self-imaging phenomenon of cylindrical grating interferometer. The theoretical results are in agreement with the results of numerical simulations.

preprint2014arXiv

A Pilot Study on Coupling CT and MRI through Use of Semiconductor Nanoparticles

CT and MRI are the two most widely used imaging modalities in healthcare, each with its own merits and drawbacks. Combining these techniques in one machine could provide unprecedented resolution and sensitivity in a single scan, and serve as an ideal platform to explore physical coupling of x-ray excitation and magnetic resonance. Molecular probes such as functionalized nanophosphors present an opportunity to demonstrate a synergy between these modalities. However, a simultaneous CT-MRI scanner does not exist at this moment. As a pilot study, here we propose a mechanism in which water solutions containing LiGa5O8:Cr3+ nanophosphors can be excited with x-rays to store energy, and these excited particles may subsequently influence the T2 relaxation times of the solutions so that a difference in T2 can be measured by MRI before and after x-ray excitation. The trends seen in our study suggest that a measurable effect may exist from x-ray excitation of the nanophosphors. However, there are several experimental conditions that hinder the clarity of the results to be statistically significant up to a commonly accepted level (p=0.05), including insoluble nanoparticles and inter-scan variability. Nevertheless, the initial results from our experiments seem a consistent and inspiring story that x-rays modify MRI T2 values around nanophosphors. Upon availability of soluble nanophosphors, we will repeat our experiments to confirm these observations.

preprint2014arXiv

First CT-MRI Scanner for Multi-dimensional Synchrony and Multi-physical Coupling

We propose to prototype the first CT-MRI scanner for radiation therapy and basic research, demonstrate its transformative biomedical potential, and initiate a paradigm shift in multimodality imaging. Our design consists of a double donut-shaped pair of permanent magnets to form a regionally uniform ~0.5T magnetic field and leave room for a stationary 9-source interior CT gantry at 3 tube voltages (triple-energy CT). Image reconstruction will be in a compressive sensing framework. Please discuss with Dr. Ge Wang (ge-wang@ieee.org) if you are interested in collaborative opportunities.

preprint2014arXiv

Local Filtering Fundamentally Against Wide Spectrum

Chen et al. (1) applied three-dimensional (3D) Fourier filtering together with equal-slope tomographic reconstruction for an observation of nearly all the atoms in a multiply twinned platinum nanoparticle. However, their methodology suffers from fundamental methodological flaws, as initially brought up by a recent Communications Arising (2) and now analyzed in-depth in this report written on June 20, 2014. The authors of (1) read this report and wrote a reply containing 5 points. While we have solid reasons to disagree with their points, we will not include our responses here, and will address their first two points using Nature's online commenting facility. References 1. Chen, C.C., et al., Three-dimensional imaging of dislocations in a nanoparticle at atomic resolution. Nature 496(7443):74-79, 2013 2. Rez, P. and M.M.J. Treacy, Three-dimensional imaging of dislocations. Nature 503(E1):74-79, 2013

preprint2014arXiv

Modulated Luminescent Tomography

We propose and analyze a mathematical model of Modulated Luminescent Tomography. We show that when single X-rays or focused X-rays are used as an excitation, the problem is similar to the inversion of weighted X-ray transforms. In particular, we give an explicit inversion in the case of Dual Cone X-ray excitation.

preprint2014arXiv

Physical Foundation for General Interior Tomography

Gauge invariability guarantees the same form of the Maxwell equations in different coordinate systems, and is instrumental for electromagnetic cloaking to hide a region of interest (ROI) perfectly. On the other hand, interior tomography is to reconstruct an ROI exactly. In this article, the recent results in these two disconnected areas are brought together to justify the general interior tomography principle. Several opportunities are suggested for tomographic research.

preprint2014arXiv

X-optogenetics and U-optogenetics: Feasibility and Possibilities

To address these limitations of optogenetics, here we propose two new methods for optogenetic stimulation. The first is x-optogenetics, which uses visible light-emitting nanophosphors stimulated by focused x-rays. This idea is not new but the application to optogenetics is novel. X-rays can penetrate much more deeply than infrared light and could allow for nerve cell stimulation in any part of the brain. In this paper, we discuss the feasibility and possibilities of such a method by describing the advances in nanomaterials, x-ray focusing, and x-ray sources. Also, we discuss concerns when dealing with x-rays such as radiation dosage. Through the use of quantities and assumptions backed by recent literature, manufacturer specifications, and personal correspondence, a full feasibility analysis of x-optogenetics is completed. The second proposed method we explore is u-optogenetics, which is the application of sonoluminescence to optogenetics. Such a technique uses ultrasound waves instead of x-rays to induce light emission, so there would be no introduction of radiation. However, the penetration depth of ultrasound is less than that of x-ray. The key issues affecting feasibility are laid out for further investigation into both x-optogenetics and u-optogenetics.

preprint2013arXiv

Analytic Comparison between X-ray Fluorescence CT and K-edge CT

X-ray fluorescence computed tomography (XFCT) and K-edge computed tomography (CT) are two important modalities to quantify a distribution of gold nanoparticles (GNPs) in a small animal for preclinical studies. It is valuable to determine which modality is more efficient for a given application. In this paper, we report a theoretical analysis in terms of signal-to-noise ratio (SNR) for the two modalities, showing that there is a threshold for the GNPs concentration and XFCT has a better SNR than K-edge CT if GNPs concentration is less than this threshold. Numerical simulations are performed and two kinds of phantoms are used to represent multiple concentration levels and feature sizes. Experimental results illustrate that XFCT is superior to K-edge CT when contrast concentration is lower than 0.4% which coincides with the theoretical analysis.

preprint2013arXiv

Attenuation map reconstruction from TOF PET data

To reconstruct a radioactive tracer distribution with positron emission tomography (PET), the background attenuation correction is needed to eliminate image artifacts. Recent research shows that time-of-flight (TOF) PET data determine the attenuation sinogram up to a constant, and its gradient can be computed using an analytic algorithm. In this paper, we study a direct estimation of the sinogram only from TOF PET data. First, the gradient of the attenuation sinogram is estimated using the aforementioned algorithm. Then, a relationship is established to link the differential attenuation sinogram and the underlying attenuation background. Finally, an iterative algorithm is designed to determine the attenuation sinogram accurately and stably. A 2D numerical simulation study is conducted to verify the correctness of our proposed approach.

preprint2013arXiv

Dictionary-Learning-Based Reconstruction Method for Electron Tomography

Electron tomography usually suffers from so called missing wedge artifacts caused by limited tilt angle range. An equally sloped tomography (EST) acquisition scheme (which should be called the linogram sampling scheme) was recently applied to achieve 2.4-angstrom resolution. On the other hand, a compressive sensing-inspired reconstruction algorithm, known as adaptive dictionary based statistical iterative reconstruction (ADSIR), has been reported for x-ray computed tomography. In this paper, we evaluate the EST, ADSIR and an ordered-subset simultaneous algebraic reconstruction technique (OS-SART), and compare the ES and equally angled (EA) data acquisition modes. Our results show that OS-SART is comparable to EST, and the ADSIR outperforms EST and OS-SART. Furthermore, the equally sloped projection data acquisition mode has no advantage over the conventional equally angled mode in the context.

preprint2013arXiv

Dynamic Bowtie for Fan-beam CT

A bowtie is a filter used to shape an x-ray beam and equalize its flux reaching different detector channels. For development of spectral CT with energy-discriminative photon-counting (EDPC) detectors, here we propose and evaluate a dynamic bowtie for performance optimization based on a patient model or a scout scan. Our dynamic bowtie modifies an x-ray beam intensity profile by mechanical rotation and adaptive adjustment of the x-ray source flux. First, a mathematical model for dynamic bowtie filtering is established for an elliptical section in fan-beam geometry, and the contour of the optimal bowtie is derived. Then, numerical simulation is performed to compare the performance of the dynamic bowtie in the cases of an ideal phantom and a realistic cross-section relative to the counterparts without any bowtie and with a fixed bowtie respectively. Our dynamic bowtie can equalize the expected numbers of photons in the case of an ideal phantom. In practical cases, our dynamic bowtie can effectively reduce the dynamic range of detected signals inside the field of view. Although our design is optimized for an elliptical phantom, the resultant dynamic bowtie can be applied to a real fan-beam scan if the underlying cross-section can be approximated as an ellipse. Furthermore, our design methodology can be applied to specify an optimized dynamic bowtie for any cross-section of a patient, preferably using rapid prototyping technology. This fan-beam dynamic bowtie work could be extended to the cone-beam geometry in a follow-up study.

preprint2013arXiv

Meaning of Interior Tomography

The classic imaging geometry for computed tomography is for collection of un-truncated projections and reconstruction of a global image, with the Fourier transform as the theoretical foundation that is intrinsically non-local. Recently, interior tomography research has led to theoretically exact relationships between localities in the projection and image spaces and practically promising reconstruction algorithms. Initially, interior tomography was developed for x-ray computed tomography. Then, it has been elevated as a general imaging principle. Finally, a novel framework known as omni-tomography is being developed for grand fusion of multiple imaging modalities, allowing tomographic synchrony of diversified features.

preprint2013arXiv

Micro-modulated luminescence tomography

Imaging depth of optical microscopy has been fundamentally limited to millimeter or sub-millimeter due to light scattering. X-ray microscopy can resolve spatial details of few microns deeply inside a sample but the contrast resolution is still inadequate to depict heterogeneous features at cellular or sub-cellular levels. To enhance and enrich biological contrast at large imaging depth, various nanoparticles are introduced and become essential to basic research and molecular medicine. Nanoparticles can be functionalized as imaging probes, similar to fluorescent and bioluminescent proteins. LiGa5O8:Cr3+ nanoparticles were recently synthesized to facilitate luminescence energy storage with x-ray pre-excitation and the subsequently stimulated luminescence emission by visible/near-infrared (NIR) light. In this paper, we suggest a micro-modulated luminescence tomography (MLT) approach to quantify a nanophosphor distribution in a thick biological sample with high resolution. Our numerical simulation studies demonstrate the feasibility of the proposed approach.

preprint2013arXiv

Stored Luminescence Computed Tomography

The phosphor nanoparticles made of doped semiconductors, pre-excited by well-collimated X-ray radiation, were recently reported for their light emission upon NIR light stimulation. The characteristics of X-ray energy storage and NIR stimulated emission is highly desirable to design targeting probes and improve molecular and cellular imaging. Here we propose stored luminescence computed tomography (SLCT), perform realistic numerical simulation, and demonstrate a much-improved spatial resolution in a preclinical research context. The future opportunities are also discussed along this direction.

preprint2013arXiv

Top-level Design and Pilot Analysis of Low-end CT Scanners Based on Linear Scanning for Developing Countries

Purpose: The goal is to develop a new architecture for computed tomography (CT) which is at an ultra-low-dose for developing countries, especially in rural areas. Methods: The proposed scheme is inspired by the recently developed compressive sensing and interior tomography techniques, where the data acquisition system targets a region of interest (ROI) to acquire limited and truncated data. The source and detector are translated in opposite directions for either ROI reconstruction with one or more localized linear scans or global reconstruction by combining multiple ROI reconstructions. In other words, the popular slip ring is replaced by a translation based setup, and the instrumentation cost is reduced by a relaxation of the imaging speed requirement. Results: The various translational scanning modes are theoretically analyzed, and the scanning parameters are optimized. The numerical simulation results from different numbers of linear scans confirm the feasibility of the proposed scheme, and suggest two preferred low-end systems for horizontal and vertical patient positions respectively. Conclusion: Ultra-low-cost x-ray CT is feasible with our proposed combination of linear scanning, compressive sensing, and interior tomography. The proposed architecture can be tailored into permanent, movable, or reconfigurable systems as desirable. Advanced image registration and spectral imaging features can be included as well.

preprint2012arXiv

Academic Ranking with Web Mining and Axiomatic Analysis

Academic ranking is a public topic, such as for universities, colleges, or departments, which has significant educational, administrative and social effects. Popular ranking systems include the US News & World Report (USNWR), the Academic Ranking of World Universities (ARWU), and others. The most popular observables for such ranking are academic publications and their citations. However, a rigorous, quantitative and thorough methodology has been missing for this purpose. With modern web technology and axiomatic bibliometric analysis, here we perform a feasibility study on Microsoft Academic Search metadata and obtain the first-of-its-kind ranking results for American departments of computer science. This approach can be extended for fully automatic intuitional and college ranking based on comprehensive data on Internet.

preprint2012arXiv

Omni-tomography: Next-generation Biomedical Imaging

Omni-tomography is enabled by interior tomography that has been developed over the past five years. By omni-tomography, we envision that the next stage of biomedical imaging will be the grand fusion of many tomographic modalities into a single gantry (all in one) for simultaneous data acquisition of numerous complementary features (all at once). This integration has great synergistic potential for development of systems biology, personalized and preventive medicine, because many physiological processes are dynamic and complicated, and must be observed promptly, comprehensively, sensitively, specifically, and non-invasively. In this perspective, we first present the background for and power of omni-tomography, then discuss its important applications in vulnerable plaque characterization and intratumor heterogeneity evaluation, review its enabling theory and technology, explain for the first time the feasibility of the CT-MRI scanner as an example, and finally suggest exciting research opportunities.

preprint2012arXiv

X-ray Fluorescence Sectioning

In this paper, we propose an x-ray fluorescence imaging system for elemental analysis. The key idea is what we call "x-ray fluorescence sectioning". Specifically, a slit collimator in front of an x-ray tube is used to shape x-rays into a fan-beam to illuminate a planar section of an object. Then, relevant elements such as gold nanoparticles on the fan-beam plane are excited to generate x-ray fluorescence signals. One or more 2D spectral detectors are placed to face the fan-beam plane and directly measure x-ray fluorescence data. Detector elements are so collimated that each element only sees a unique area element on the fan-beam plane and records the x-ray fluorescence signal accordingly. The measured 2D x-ray fluorescence data can be refined in reference to the attenuation characteristics of the object and the divergence of the beam for accurate elemental mapping. This x-ray fluorescence sectioning system promises fast fluorescence tomographic imaging without a complex inverse procedure. The design can be adapted in various ways, such as with the use of a larger detector size to improve the signal to noise ratio. In this case, the detector(s) can be shifted multiple times for image deblurring.

preprint2011arXiv

Differential Phase-contrast Interior Tomography

Differential phase contrast interior tomography allows for reconstruction of a refractive index distribution over a region of interest (ROI) for visualization and analysis of internal structures inside a large biological specimen. In this imaging mode, x-ray beams target the ROI with a narrow beam aperture, offering more imaging flexibility at less ionizing radiation. Inspired by recently developed compressive sensing theory, in numerical analysis framework, we prove that exact interior reconstruction can be achieved on an ROI via the total variation minimization from truncated differential projection data through the ROI, assuming a piecewise constant distribution of the refractive index in the ROI. Then, we develop an iterative algorithm for the interior reconstruction and perform numerical simulation experiments to demonstrate the feasibility of our proposed approach.

preprint2011arXiv

Fourier transform based iterative method for x-ray differential phase-contrast computed tomography

Biological soft tissues encountered in clinical and pre-clinical imaging mainly consist of light element atoms, and their composition is nearly uniform with little density variation. Thus, x-ray attenuation imaging suffers from low image contrast resolution. By contrast, x-ray phase shift of soft tissues is about a thousand times greater than x-ray absorption over the diagnostic energy range, thereby a significantly higher sensitivity can be achieved in terms of phase shift. In this paper, we propose a novel Fourier transform based iterative method to perform x-ray tomographic imaging of the refractive index directly from differential phase shift data. This approach offers distinct advantages in cases of incomplete and noisy data than analytic reconstruction, and especially suitable for phase-contrast interior tomography by incorporating prior knowledge in a region of interest (ROI). Biological experiments demonstrate the merits of the proposed approach.

preprint2011arXiv

Omni-tomography/Multi-tomography -- Integrating Multiple Modalities for Simultaneous Imaging

Current tomographic imaging systems need major improvements, especially when multi-dimensional, multi-scale, multi-temporal and multi-parametric phenomena are under investigation. Both preclinical and clinical imaging now depend on in vivo tomography, often requiring separate evaluations by different imaging modalities to define morphologic details, delineate interval changes due to disease or interventions, and study physiological functions that have interconnected aspects. Over the past decade, fusion of multimodality images has emerged with two different approaches: post-hoc image registration and combined acquisition on PET-CT, PET-MRI and other hybrid scanners. There are intrinsic limitations for both the post-hoc image analysis and dual/triple modality approaches defined by registration errors and physical constraints in the acquisition chain. We envision that tomography will evolve beyond current modality fusion and towards grand fusion, a large scale fusion of all or many imaging modalities, which may be referred to as omni-tomography or multi-tomography. Unlike modality fusion, grand fusion is here proposed for truly simultaneous but often localized reconstruction in terms of all or many relevant imaging mechanisms such as CT, MRI, PET, SPECT, US, optical, and possibly more. In this paper, the technical basis for omni-tomography is introduced and illustrated with a top-level design of a next generation scanner, interior tomographic reconstructions of representative modalities, and anticipated applications of omni-tomography.

preprint2010arXiv

Axiomatic Quantification of Co-authors' Relative Contributions

Over the past decades, the competition for academic resources has gradually intensified, and worsened with the current financial crisis. To optimize the resource allocation, individualized assessment of research results is being actively studied but the current indices, such as the number of papers, the number of citations, the h-factor and its variants have limitations, especially their inability of determining co-authors' credit shares fairly. Here we establish an axiomatic system and quantify co-authors' relative contributions. Our methodology avoids subjective assignment of co-authors' credits using the inflated, fractional or harmonic methods, and provides a quantitative tool for scientific management such as funding and tenure decisions.

preprint2010arXiv

Higher-order Reconstruction Method of Differential Phase Shift

In this paper, we develop a novel phase retrieval approach to reconstruct x-ray differential phase shift induced by an object. A primary advantage of our approach is a higher-order accuracy over that with the conventional linear approximation models, relaxing the current restriction of weak absorption and slow phase variation scenario. The optimal utilization of the diffraction images at different distance in Fresnel diffraction region eliminates the nonlinear terms in phase and attenuation, and simplifies the reconstruction to a linear inverse problem. Numerical studies are also described to demonstrate the accuracy and stability of our approach.

Ge Wang

What is connected

Connect this record

See the researcher in context

Building this map preview

58 published item(s)

MobiDiary: Autoregressive Action Captioning with Wearable Devices and Wireless Signals

SlideChain: Semantic Provenance for Lecture Understanding via Blockchain Registration

Graph-level Protein Representation Learning by Structure Knowledge Refinement

Task-based Assessment of Deep Networks for Sinogram Denoising with A Transformer-based Observer

A transformer-based deep learning approach for classifying brain metastases into primary organ sites using clinical whole brain MRI

DLME: Deep Local-flatness Manifold Embedding

GasHis-Transformer: A Multi-scale Visual Transformer Approach for Gastric Histopathological Image Detection

HOME: High-Order Mixed-Moment-based Embedding for Representation Learning

Quasi-Equivalence of Width and Depth of Neural Networks

Research Status of Deep Learning Methods for Rumor Detection

SoftDropConnect (SDC) -- Effective and Efficient Quantification of the Network Uncertainty in Deep MR Image Analysis

Stationary Multi-source AI-powered Real-time Tomography (SMART)

Suppression of Correlated Noise with Similarity-based Unsupervised Deep Learning

Top-level Design and Simulated Performance of the First Portable CT-MR scanner

Unravelling Distance-Dependent Inter-Site Interactions and Magnetic Transition Effects of Heteronuclear Single Atom Catalysts on Electrochemical Oxygen Reduction

X-ray Dissectography Improves Lung Nodule Detection

Noise Entangled GAN For Low-Dose CT Simulation

Phase function estimation from a diffuse optical image via deep learning

Soft Autoencoder and Its Wavelet Adaptation Interpretation

Cine Cardiac MRI Motion Artifact Reduction Using a Recurrent Neural Network

Clinical Micro-CT Empowered by Interior Tomography, Robotic Scanning, and Deep Learning

GATCluster: Self-Supervised Gaussian-Attention Network for Image Clustering

Integrative Analysis for COVID-19 Patient Outcome Prediction

Low-dimensional Manifold Constrained Disentanglement Network for Metal Artifact Reduction

Parameter-Transferred Wasserstein Generative Adversarial Network (PT-WGAN) for Low-Dose PET Image Denoising

X-ray Monochromatic Imaging from Single-spectrum CT via Machine Learning

X-ray Photon-Counting Data Correction through Deep Learning

A Method of Rapid Quantification of Patient-Specific Organ Dose for CT Using Coupled Deep-Learning based Multi-Organ Segmentation and GPU-accelerated Monte Carlo Dose Computing

MRI Super-Resolution with Ensemble Learning and Complementary Priors

Multi-Contrast Super-Resolution MRI Through a Progressive Network

A Perspective on Deep Imaging

Bibliometric Index for Academic Leadership

Deep Learning for the Classification of Lung Nodules

Low-dose CT denoising with convolutional neural network

Low-Dose CT via Deep Neural Network

Theoretical analysis on x-ray cylindrical grating interferometer

A Pilot Study on Coupling CT and MRI through Use of Semiconductor Nanoparticles

First CT-MRI Scanner for Multi-dimensional Synchrony and Multi-physical Coupling

Local Filtering Fundamentally Against Wide Spectrum

Modulated Luminescent Tomography

Physical Foundation for General Interior Tomography

X-optogenetics and U-optogenetics: Feasibility and Possibilities

Analytic Comparison between X-ray Fluorescence CT and K-edge CT

Attenuation map reconstruction from TOF PET data

Dictionary-Learning-Based Reconstruction Method for Electron Tomography

Dynamic Bowtie for Fan-beam CT

Meaning of Interior Tomography

Micro-modulated luminescence tomography

Stored Luminescence Computed Tomography

Top-level Design and Pilot Analysis of Low-end CT Scanners Based on Linear Scanning for Developing Countries

Academic Ranking with Web Mining and Axiomatic Analysis

Omni-tomography: Next-generation Biomedical Imaging

X-ray Fluorescence Sectioning

Differential Phase-contrast Interior Tomography

Fourier transform based iterative method for x-ray differential phase-contrast computed tomography

Omni-tomography/Multi-tomography -- Integrating Multiple Modalities for Simultaneous Imaging

Axiomatic Quantification of Co-authors' Relative Contributions

Higher-order Reconstruction Method of Differential Phase Shift