Source author record

Xundong Wu

Xundong Wu appears in the imported research catalog. Authorship, coauthor and topic links are available while profile ownership is still unclaimed.

ResearcherUnclaimed source record

Machine Learning Neural and Evolutionary Computing Computer Vision physics.optics Biological Physics Hardware Architecture physics.ins-det

Catalog footprint

What is connected

6works

7topics

4close collaborators

Actions

Connect this record

Open graph Browse works

Inspect adjacent papers, topics, institutions and collaborators without losing the researcher page.

Building this map preview

BZPEER is loading the nearby papers, people, topics and institutions for this page.

preprint2023arXiv

Darwin3: A large-scale neuromorphic chip with a Novel ISA and On-Chip Learning

Spiking Neural Networks (SNNs) are gaining increasing attention for their biological plausibility and potential for improved computational efficiency. To match the high spatial-temporal dynamics in SNNs, neuromorphic chips are highly desired to execute SNNs in hardware-based neuron and synapse circuits directly. This paper presents a large-scale neuromorphic chip named Darwin3 with a novel instruction set architecture(ISA), which comprises 10 primary instructions and a few extended instructions. It supports flexible neuron model programming and local learning rule designs. The Darwin3 chip architecture is designed in a mesh of computing nodes with an innovative routing algorithm. We used a compression mechanism to represent synaptic connections, significantly reducing memory usage. The Darwin3 chip supports up to 2.35 million neurons, making it the largest of its kind in neuron scale. The experimental results showed that code density was improved up to 28.3x in Darwin3, and neuron core fan-in and fan-out were improved up to 4096x and 3072x by connection compression compared to the physical memory depth. Our Darwin3 chip also provided memory saving between 6.8X and 200.8X when mapping convolutional spiking neural networks (CSNN) onto the chip, demonstrating state-of-the-art performance in accuracy and latency compared to other neuromorphic chips.

preprint2020arXiv

Cross-Channel Intragroup Sparsity Neural Network

Modern deep neural networks rely on overparameterization to achieve state-of-the-art generalization. But overparameterized models are computationally expensive. Network pruning is often employed to obtain less demanding models for deployment. Fine-grained pruning removes individual weights in parameter tensors and can achieve a high model compression ratio with little accuracy degradation. However, it introduces irregularity into the computing dataflow and often does not yield improved model inference efficiency in practice. Coarse-grained model pruning, while realizing satisfactory inference speedup through removal of network weights in groups, e.g. an entire filter, often lead to significant accuracy degradation. This work introduces the cross-channel intragroup (CCI) sparsity structure, which can prevent the inference inefficiency of fine-grained pruning while maintaining outstanding model performance. We then present a novel training algorithm designed to perform well under the constraint imposed by the CCI-Sparsity. Through a series of comparative experiments we show that our proposed CCI-Sparsity structure and the corresponding pruning algorithm outperform prior art in inference efficiency by a substantial margin given suited hardware acceleration in the future.

preprint2016arXiv

Binarized Neural Networks on the ImageNet Classification Task

We trained Binarized Neural Networks (BNNs) on the high resolution ImageNet ILSVRC-2102 dataset classification task and achieved a good performance. With a moderate size network of 13 layers, we obtained top-5 classification accuracy rate of 84.1 % on validation set through network distillation, much better than previous published results of 73.2% on XNOR network and 69.1% on binarized GoogleNET. We expect networks of better performance can be obtained by following our current strategies. We provide a detailed discussion and preliminary analysis on strategies used in the network training.

preprint2016arXiv

Highly Nonlinear Luminescence Induced by Gold Nanoparticles on Glass Surfaces with Continuous-Wave Laser Illumination

We report on highly nonlinear luminescence being observed from individual spherical gold nanoparticles immobilized on a glass surface and illuminated by continuous-wave (CW) lasers with relatively low power. The nonlinear luminescence shows optical super-resolution beyond the diffraction limit in three dimensions compared to the scatting of the excitation laser light. The luminescence intensity from most nanoparticles is proportional to the 5th--7th power of the excitation laser power and has wide excitation and emission spectra across the visible wavelength range. Strong nonlinear luminescence is only observed near the glass surface. High optical nonlinearity excited by low CW laser power is related to a long-lived dark state of the gold nanoparticles, where the excitation light is strongly absorbed. This phenomenon has potential biological applications in super-resolution and deep tissue imaging.

preprint2015arXiv

An Iterative Convolutional Neural Network Algorithm Improves Electron Microscopy Image Segmentation

To build the connectomics map of the brain, we developed a new algorithm that can automatically refine the Membrane Detection Probability Maps (MDPM) generated to perform automatic segmentation of electron microscopy (EM) images. To achieve this, we executed supervised training of a convolutional neural network to recover the removed center pixel label of patches sampled from a MDPM. MDPM can be generated from other machine learning based algorithms recognizing whether a pixel in an image corresponds to the cell membrane. By iteratively applying this network over MDPM for multiple rounds, we were able to significantly improve membrane segmentation results.

preprint2015arXiv

Resonant Scanning with Large Field of View Reduces Photobleaching and Enhances Fluorescence Yield in STED Microscopy

Photobleaching is a major limitation of superresolution Stimulated Depletion Emission (STED) microscopy. Fast scanning has long been considered an effective means to reduce photobleaching in fluorescence microscopy, but a careful quantitative study of this issue is missing. In this paper, we show that the photobleaching rate in STED microscopy is slowed down and fluorescence yield is enhanced by scanning with high linear speed, enabled by the large field of view in our custom-built resonant-scanning STED microscope. The effect of scanning speed on photobleaching and fluorescence yield is more remarkable at higher levels of depletion laser irradiance, and virtually disappears in conventional confocal microscopy. With a depletion irradiance of >0.2 GW$\cdot$cm$^{-2}$ (time average), we were able to extend the fluorescence survival time of the Atto 647N dye by ~80% with an 8-fold wider field of view. We confirm that STED Photobleaching is primarily caused by the depletion light acting upon the excited fluorophores. Experimental data agree with a theoretical model. Our results encourage further increasing linear scanning speed for photobleaching reduction in STED microscopy.

Xundong Wu

What is connected

Connect this record

See the researcher in context

Building this map preview

6 published item(s)

Darwin3: A large-scale neuromorphic chip with a Novel ISA and On-Chip Learning

Cross-Channel Intragroup Sparsity Neural Network

Binarized Neural Networks on the ImageNet Classification Task

Highly Nonlinear Luminescence Induced by Gold Nanoparticles on Glass Surfaces with Continuous-Wave Laser Illumination

An Iterative Convolutional Neural Network Algorithm Improves Electron Microscopy Image Segmentation

Resonant Scanning with Large Field of View Reduces Photobleaching and Enhances Fluorescence Yield in STED Microscopy