Researcher profile

Can Cui

Can Cui contributes to research discovery and scholarly infrastructure.

Researcher, Affiliation not imported, Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging, Verification L1, Unclaimed author
15 works
0 followers
14 topics
4 close collaborators

Published work

15 published item(s)

preprint, 2023, arXiv

Omni-Seg: A Scale-aware Dynamic Network for Renal Pathological Image Segmentation

Comprehensive semantic segmentation on renal pathological images is challenging due to the heterogeneous scales of the objects. For example, on a whole slide image (WSI), the cross-sectional area of glomeruli can be 64 times larger than that of the peritubular capillaries, making it impractical to segment both objects on the same patch, at the same scale. To handle this scaling issue, prior studies have typically trained multiple segmentation networks in order to match the optimal pixel resolution of heterogeneous tissue types. This multi-network solution is resource-intensive and fails to model the spatial relationship between tissue types. In this paper, we propose the Omni-Seg+ network, a scale-aware dynamic neural network that achieves multi-object (six tissue types) and multi-scale (5X to 40X scale) pathological image segmentation via a single neural network. The contribution of this paper is three-fold: (1) a novel scale-aware controller is proposed to generalize the dynamic neural network from single-scale to multi-scale; (2) semi-supervised consistency regularization of pseudo-labels is introduced to model the inter-scale correlation of unannotated tissue types into a single end-to-end learning paradigm; and (3) superior scale-aware generalization is evidenced by directly applying a model trained on human kidney images to mouse kidney images, without retraining. By learning from ~150,000 human pathological image patches from six tissue types at three different resolutions, our approach achieved superior segmentation performance according to human visual assessment and evaluation of image-omics (i.e., spatial transcriptomics). The official implementation is available at https://github.com/ddrrnn123/Omni-Seg.

preprint, 2022, arXiv

A Survey on Deep Reinforcement Learning for Data Processing and Analytics

Data processing and analytics are fundamental and pervasive. Algorithms play a vital role in data processing and analytics, and many algorithm designs have incorporated heuristics and general rules from human knowledge and experience to improve their effectiveness. Recently, reinforcement learning, and deep reinforcement learning (DRL) in particular, has been increasingly explored and exploited in many areas because it can learn better strategies than statically designed algorithms in the complicated environments it interacts with. Motivated by this trend, we provide a comprehensive review of recent works focusing on utilizing DRL to improve data processing and analytics. First, we present an introduction to key concepts, theories, and methods in DRL. Next, we discuss DRL deployment on database systems, facilitating data processing and analytics in various aspects, including data organization, scheduling, tuning, and indexing. Then, we survey the application of DRL in data processing and analytics, ranging from data preparation and natural language processing to healthcare, fintech, and other domains. Finally, we discuss important open challenges and future research directions of using DRL in data processing and analytics.

preprint, 2022, arXiv

Cross-scale Attention Guided Multi-instance Learning for Crohn's Disease Diagnosis with Pathological Images

Multi-instance learning (MIL) is widely used in the computer-aided interpretation of pathological Whole Slide Images (WSIs) to address the lack of pixel-wise or patch-wise annotations. Often, this approach directly applies "natural image driven" MIL algorithms which overlook the multi-scale (i.e., pyramidal) nature of WSIs. Off-the-shelf MIL algorithms are typically deployed on a single scale of WSIs (e.g., 20x magnification), while human pathologists usually aggregate the global and local patterns in a multi-scale manner (e.g., by zooming in and out between different magnifications). In this study, we propose a novel cross-scale attention mechanism to explicitly aggregate inter-scale interactions into a single MIL network for Crohn's Disease (CD), which is a form of inflammatory bowel disease. The contribution of this paper is two-fold: (1) a cross-scale attention mechanism is proposed to aggregate features from different resolutions with multi-scale interaction; and (2) differential multi-scale attention visualizations are generated to localize explainable lesion patterns. By training on ~250,000 H&E-stained Ascending Colon (AC) patches from 20 CD patient and 30 healthy control samples at different scales, our approach achieved a superior Area under the Curve (AUC) score of 0.8924 compared with baseline models. The official implementation is publicly available at https://github.com/hrlblab/CS-MIL.

preprint, 2022, arXiv

ModDrop++: A Dynamic Filter Network with Intra-subject Co-training for Multiple Sclerosis Lesion Segmentation with Missing Modalities

Multiple Sclerosis (MS) is a chronic neuroinflammatory disease and multi-modality MRIs are routinely used to monitor MS lesions. Many automatic MS lesion segmentation models have been developed and have reached human-level performance. However, most established methods assume the MRI modalities used during training are also available during testing, which is not guaranteed in clinical practice. Previously, a training strategy termed Modality Dropout (ModDrop) has been applied to MS lesion segmentation to achieve the state-of-the-art performance with missing modality. In this paper, we present a novel method dubbed ModDrop++ to train a unified network adaptive to an arbitrary number of input MRI sequences. ModDrop++ upgrades the main idea of ModDrop in two key ways. First, we devise a plug-and-play dynamic head and adopt a filter scaling strategy to improve the expressiveness of the network. Second, we design a co-training strategy to leverage the intra-subject relation between full modality and missing modality. Specifically, the intra-subject co-training strategy aims to guide the dynamic head to generate similar feature representations between the full- and missing-modality data from the same subject. We use two public MS datasets to show the superiority of ModDrop++. Source code and trained models are available at https://github.com/han-liu/ModDropPlusPlus.
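The modality-dropout idea that ModDrop++ builds on can be illustrated with a minimal sketch. This is not the authors' implementation (which is at the repository linked above): the function name `modality_dropout`, the `p_drop` parameter, and the modality names `T1`, `T2`, and `FLAIR` are hypothetical, and the dynamic head and intra-subject co-training described in the abstract are not reproduced here.

```python
import random


def modality_dropout(modalities, p_drop=0.5, rng=random):
    """Randomly zero out whole input modalities, always keeping at least one.

    modalities: dict mapping a modality name to a flat list of floats.
    Returns (masked, keep): the masked copy of the inputs and a dict of
    booleans saying which modalities were kept.
    All names and the default drop probability are illustrative assumptions.
    """
    names = list(modalities)
    # Decide independently for each modality whether it survives.
    keep = {name: rng.random() >= p_drop for name in names}
    # Never present the network with an entirely empty input.
    if not any(keep.values()):
        keep[rng.choice(names)] = True
    masked = {
        name: list(values) if keep[name] else [0.0] * len(values)
        for name, values in modalities.items()
    }
    return masked, keep


# Hypothetical example: three MRI sequences flattened to short lists.
scan = {"T1": [0.2, 0.4], "T2": [0.1, 0.9], "FLAIR": [0.7, 0.3]}
masked_scan, availability = modality_dropout(scan, p_drop=0.5)
```

Training on such randomly masked inputs exposes a single network to many missing-modality combinations; the `keep` mask is the kind of availability signal a modality-aware head could condition on.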

preprint, 2022, arXiv

Omni-Seg: A Single Dynamic Network for Multi-label Renal Pathology Image Segmentation using Partially Labeled Data

Computer-assisted quantitative analysis of giga-pixel pathology images has provided a new avenue in histology examination. The innovations have been largely focused on cancer pathology (i.e., tumor segmentation and characterization). In non-cancer pathology, the learning algorithms can be asked to examine more comprehensive tissue types simultaneously, as a multi-label setting. Prior approaches typically needed to train multiple segmentation networks in order to match the domain-specific knowledge for heterogeneous tissue types (e.g., glomerular tuft, glomerular unit, proximal tubular, distal tubular, peritubular capillaries, and arteries). In this paper, we propose a dynamic single segmentation network (Omni-Seg) that learns to segment multiple tissue types using partially labeled images (i.e., only one tissue type is labeled for each training image) for renal pathology. By learning from ~150,000 patch-wise pathological images from six tissue types, the proposed Omni-Seg network achieved superior segmentation accuracy and lower resource consumption compared with the previous multiple-network and multi-head designs. In the testing stage, the proposed method obtains "completely labeled" tissue segmentation results using only "partially labeled" training images. The source code is available at https://github.com/ddrrnn123/Omni-Seg.

preprint, 2022, arXiv

Shape-Dependent Multi-Weight Magnetic Artificial Synapses for Neuromorphic Computing

In neuromorphic computing, artificial synapses provide a multi-weight conductance state that is set based on inputs from neurons, analogous to the brain. Additional synapse properties beyond multiple weights may be needed, depending on the application, which requires generating different synapse behaviors from the same materials. Here, we measure artificial synapses based on magnetic materials that use a magnetic tunnel junction and a magnetic domain wall. By fabricating lithographic notches in a domain wall track underneath a single magnetic tunnel junction, we achieve 4-5 stable resistance states that can be repeatably controlled electrically using spin orbit torque. We analyze the effect of geometry on the synapse behavior, showing that a trapezoidal device has asymmetric weight updates with high controllability, while a straight device has higher stochasticity, but with stable resistance levels. The device data is input into neuromorphic computing simulators to show the usefulness of application-specific synaptic functions. Implementing an artificial neural network applied to streamed Fashion-MNIST data, we show that the trapezoidal magnetic synapse can be used as a metaplastic function for efficient online learning. Implementing a convolutional neural network for CIFAR-100 image recognition, we show that the straight magnetic synapse achieves near-ideal inference accuracy, due to the stability of its resistance levels. This work shows multi-weight magnetic synapses are a feasible technology for neuromorphic computing and provides design guidelines for emerging artificial synapse technologies.

preprint, 2022, arXiv

Survival Prediction of Brain Cancer with Incomplete Radiology, Pathology, Genomics, and Demographic Data

Integrating cross-department multi-modal data (e.g., radiological, pathological, genomic, and clinical data) is ubiquitous in brain cancer diagnosis and survival prediction. To date, such an integration is typically conducted by human physicians (and panels of experts), which can be subjective and semi-quantitative. Recent advances in multi-modal deep learning, however, have opened the door to performing such integration in a more objective and quantitative manner. Unfortunately, prior work using four modalities for brain cancer survival prediction is limited to a "complete modalities" setting (i.e., with all modalities available). Thus, there are still open questions on how to effectively predict brain cancer survival from incomplete radiological, pathological, genomic, and demographic data (e.g., one or more modalities might not be collected for a patient). For instance, should we use both complete and incomplete data, and more importantly, how should we use those data? To answer the preceding questions, we generalize multi-modal learning on cross-department multi-modal data to a missing data setting. Our contribution is three-fold: 1) we introduce an optimal multi-modal learning with missing data (MMD) pipeline with optimized hardware consumption and computational efficiency; 2) we extend multi-modal learning on radiological, pathological, genomic, and demographic data into missing data scenarios; 3) we collect a large-scale public dataset (with 962 patients) to systematically evaluate glioma tumor survival prediction using four modalities. The proposed method improved the C-index of survival prediction from 0.7624 to 0.8053.
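For readers unfamiliar with the C-index reported above, the following is a minimal sketch of Harrell's concordance index for right-censored survival data. It is a generic textbook formulation, not the paper's evaluation code; the function name `concordance_index` and its argument names are illustrative assumptions.

```python
def concordance_index(times, events, risks):
    """Harrell's C-index: share of comparable patient pairs ranked correctly.

    times:  observed survival or censoring time per patient
    events: 1 if the event (death) was observed, 0 if the patient was censored
    risks:  predicted risk score per patient (higher means shorter survival)
    """
    concordant = 0.0
    comparable = 0
    n = len(times)
    for i in range(n):
        for j in range(n):
            # A pair is comparable only if patient i had an observed event
            # strictly before patient j's recorded time.
            if events[i] == 1 and times[i] < times[j]:
                comparable += 1
                if risks[i] > risks[j]:
                    concordant += 1.0  # earlier event got the higher risk
                elif risks[i] == risks[j]:
                    concordant += 0.5  # ties in risk count as half
    if comparable == 0:
        raise ValueError("no comparable pairs")
    return concordant / comparable
```

A value of 0.5 corresponds to random ranking and 1.0 to a perfect ranking, which gives a scale for reading an improvement from 0.7624 to 0.8053.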

preprint, 2022, arXiv

The saturation of the VSI in protoplanetary disks via parametric instability

The vertical shear instability (VSI) is a robust and potentially important phenomenon in irradiated protoplanetary disks (PPDs), yet the mechanism by which it saturates remains poorly understood. Global simulations suggest that the non-linear evolution of the VSI is dominated by radially propagating inertial wavetrains (called 'body modes'), but these are known to be susceptible to a parametric instability. In this paper, we propose that the global VSI saturates via this secondary instability, which initiates a redistribution of energy from the large scales to smaller-scale inertial waves, and finally into a turbulent cascade. We present an analytic theory of the instability in a simple idealised model that captures the main physical and mathematical details of the problem. In addition, we conduct numerical simulations with the SNOOPY code to consolidate the theory. Once the parametric instability prevails, the VSI is likely far more disordered and incoherent than current global simulations suggest. We also argue that it is challenging to capture parametric instability in global simulations unless the radial resolution is very fine, possibly $\sim 300$ grid cells per scale height in radius.

preprint, 2022, arXiv

Wavelike nature of the vertical shear instability in global protoplanetary disks

The vertical shear instability (VSI) is a robust phenomenon in irradiated protoplanetary disks (PPDs). The majority of previous numerical simulations have focused on the turbulent properties of its saturated state. However, the saturation of the VSI manifests as large-scale coherent radially travelling inertial waves. In this paper, we study inertial-wave-disk interactions and their impact on VSI saturation. Inertial-wave linear theory is developed and applied to a representative global 2D simulation using the Athena++ code. It is found that the VSI saturates by separating the disk into several radial wave zones roughly demarcated by corotation resonances (turning points); this structure also manifests in modest radial variations in the vertical turbulence strength. Future numerical work should employ large radial domains to accommodate this radial structure of the VSI, while concurrently adopting sufficiently fine resolutions to resolve the parametric instability that attacks the saturated VSI inertial waves.

preprint, 2020, arXiv

A Novel Semi-Supervised Data-Driven Method for Chiller Fault Diagnosis with Unlabeled Data

In practical chiller systems, applying efficient fault diagnosis techniques can significantly reduce energy consumption and improve the energy efficiency of buildings. The success of existing methods for chiller fault diagnosis relies on the condition that sufficient labeled data are available for training. However, label acquisition is laborious and costly in practice. Usually, the amount of labeled data is limited and most available data are unlabeled. Existing methods cannot exploit the information contained in unlabeled data, which significantly limits the improvement of fault diagnosis performance in chiller systems. To make effective use of unlabeled data to further improve fault diagnosis performance and reduce the dependency on labeled data, we propose a novel semi-supervised data-driven fault diagnosis method for chiller systems based on the semi-generative adversarial network, which incorporates both unlabeled and labeled data into the learning process. The semi-generative adversarial network can learn the data distribution from unlabeled data, and this information can help to significantly improve the diagnostic performance. Experimental results demonstrate the effectiveness of the proposed method. In a scenario with only 80 labeled samples and 16000 unlabeled samples, the proposed method improves the diagnostic accuracy to 84%, while the supervised baseline methods reach an accuracy of at most 65%. In addition, the minimal required number of labeled samples can be reduced by about 60% with the proposed method when there are enough unlabeled samples.

preprint, 2020, arXiv

Domain Wall Leaky Integrate-and-Fire Neurons with Shape-Based Configurable Activation Functions

Complementary metal oxide semiconductor (CMOS) devices display volatile characteristics, and are not well suited for analog applications such as neuromorphic computing. Spintronic devices, on the other hand, exhibit both non-volatile and analog features, which are well-suited to neuromorphic computing. Consequently, these novel devices are at the forefront of beyond-CMOS artificial intelligence applications. However, a large quantity of these artificial neuromorphic devices still require the use of CMOS, which decreases the efficiency of the system. To resolve this, we have previously proposed a number of artificial neurons and synapses that do not require CMOS for operation. Although these devices are a significant improvement over previous renditions, their ability to enable neural network learning and recognition is limited by their intrinsic activation functions. This work proposes modifications to these spintronic neurons that enable configuration of the activation functions through control of the shape of a magnetic domain wall track. Linear and sigmoidal activation functions are demonstrated in this work, which can be extended through a similar approach to enable a wide variety of activation functions.

preprint, 2020, arXiv

Global Simulations of the Vertical Shear Instability with Non-ideal Magnetohydrodynamical Effects

The mechanisms of angular momentum transport and the level of turbulence in protoplanetary disks (PPDs) are crucial for understanding many aspects of planet formation. In recent years, it has been realized that the magneto-rotational instability (MRI) tends to be suppressed in PPDs due to non-ideal MHD effects, and the disk is largely laminar with accretion driven by magnetized disk winds. In parallel, several hydrodynamical mechanisms have been identified that likely also generate vigorous turbulence and drive disk accretion. We study the interplay between MHD winds in PPDs and the vertical shear instability (VSI), one of the most promising hydrodynamical mechanisms, through 2D global non-ideal MHD simulations with ambipolar diffusion and Ohmic resistivity. We find that for typical disk parameters, MHD winds can coexist with the VSI, with accretion primarily wind-driven and accompanied by vigorous VSI turbulence. The properties of the VSI remain similar to the unmagnetized case, and the wind and overall field configuration are not strongly affected by VSI turbulence, showing a modest level of variability and corrugation of the midplane current sheet. Enhanced coupling between gas and magnetic field weakens the VSI. The VSI is also weakened with increasing magnetization, and we find that the corrugation motions characteristic of the VSI transition to low-amplitude breathing mode oscillations.

preprint, 2020, arXiv

Large-scale dynamics of winds originated from black hole accretion flows: (I) Hydrodynamics

Winds from black hole accretion flows are ubiquitous. Previous works mainly focus on the launching of wind on the accretion flow scale. It remains unclear how far the winds can propagate outward and what their large-scale dynamics are. As the first paper of this series, we study the large-scale dynamics of thermal wind beyond accretion scales via analytical and numerical methods. Boundary conditions, which are crucial to our problem, are analyzed and presented based on small-scale simulations combined with observations of winds. Both black hole and galaxy potentials are taken into account. For winds originating from hot accretion flows, we find that the wind can reach large scales. The radial profiles of velocity, density, and temperature can be approximated by $v_r\approx v_{r0}$, $\rho\approx \rho_{0}(r/r_0)^{-2}$, and $T\approx T_0 (r/r_0)^{-2(\gamma-1)}$, where $v_{r0}$, $\rho_0$, $T_0$ are the velocity, density, and temperature of winds at the boundary $r_0(\equiv 10^3 r_g)$, and $\gamma$ is the polytropic index. During the outward propagation, the enthalpy and the rotational energy compensate for the increase of gravitational potential. For thin disks, we find that because the Bernoulli parameter is smaller, winds cannot propagate as far as the hot winds, but stop at a certain radius where the Bernoulli parameter is equal to the potential energy. Before the winds stop, the profiles of dynamical quantities can also be approximated by the above relations. In this case the rotational energy alone compensates for the increase of the potential energy.
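The approximate power-law profiles quoted in this abstract can be evaluated with a short sketch. The function name `wind_profiles`, the unit normalizations of the boundary values, and the default polytropic index of 5/3 are assumptions made for illustration; only the scalings themselves and the boundary radius of $10^3 r_g$ come from the abstract.

```python
def wind_profiles(r, r0=1.0e3, v_r0=1.0, rho0=1.0, T0=1.0, gamma=5.0 / 3.0):
    """Evaluate the approximate wind profiles at radius r (in units of r_g).

    v_r ~ v_r0, rho ~ rho0 * (r/r0)**-2, T ~ T0 * (r/r0)**(-2*(gamma-1)).
    Boundary values default to 1 (arbitrary units) and gamma = 5/3 is an
    illustrative assumption, not a value taken from the paper.
    """
    x = r / r0
    v_r = v_r0                                  # radial velocity stays constant
    rho = rho0 * x ** -2                        # density falls as r^-2
    T = T0 * x ** (-2.0 * (gamma - 1.0))        # temperature follows rho^(gamma-1)
    return v_r, rho, T


# Profiles one decade beyond the boundary radius.
v_r, rho, T = wind_profiles(1.0e4)
# With constant v_r and rho ~ r^-2, the mass flux r^2 * rho * v_r is
# independent of radius, as expected for a steady spherical outflow.
mass_flux = (1.0e4) ** 2 * rho * v_r
```

The temperature exponent is what a polytropic relation $T \propto \rho^{\gamma-1}$ gives once $\rho \propto r^{-2}$ is inserted, so the three scalings are mutually consistent.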

preprint, 2020, arXiv

Large-scale dynamics of winds originated from black hole accretion flows: (II) Magnetohydrodynamics

Winds from black hole accretion disks are essential ingredients in understanding the coevolution between the supermassive black hole and its host galaxy. The great difference in dynamical ranges between small-scale accretion disk simulations and large-scale or cosmological simulations makes it difficult to track wind kinematics. In the first paper of this series, we studied the dynamics of disk winds from the outer edge of the accretion disk toward galaxy scales in the hydrodynamical framework. In this paper, we further incorporate magnetic fields to understand the wind dynamics by adopting a one-dimensional magnetohydrodynamical (MHD) model, with boundary conditions set for hot accretion flows. The geometry of the poloidal magnetic field is prescribed as a straight line with an angle $\theta=45^\circ$ from the rotational axis, and the strength satisfies the divergence-free condition. The wind solution is obtained by requiring the gas to pass through the slow, Alfvén and fast magneto-sonic points smoothly. Physical quantities are found to show a power-law dependence on cylindrical radius $R$ beyond the fast magneto-sonic point, for which $\rho\propto R^{-2}$, $v_{\rm p}\propto {\rm const.}$, $v_{\phi}\propto R^{-1}$, $B_{\phi}\propto R^{-1}$, and $\beta\propto \rho^{\gamma-1}$. The magnetization of the wind is dominant in determining the wind properties. The wind is accelerated to a greater terminal velocity with strong magnetization ($v_{\rm Ap0}>1$) compared to the hydrodynamical case, in which the magnetic pressure gradient dominates and the centrifugal potential converts to the kinetic energy. The dependence of wind physical quantities on magnetization, temperature, field line angular velocity, and adiabatic index is also discussed.

preprint, 2020, arXiv

Plasticity-Enhanced Domain-Wall MTJ Neural Networks for Energy-Efficient Online Learning

Machine learning implements backpropagation via abundant training samples. We demonstrate a multi-stage learning system realized by a promising non-volatile memory device, the domain-wall magnetic tunnel junction (DW-MTJ). The system consists of unsupervised (clustering) as well as supervised sub-systems, and generalizes quickly (with few samples). We demonstrate interactions between physical properties of this device and optimal implementation of neuroscience-inspired plasticity learning rules, and highlight performance on a suite of tasks. Our energy analysis confirms the value of the approach, as the learning budget stays below 20 $\mu$J even for large tasks typically used in machine learning.