Researcher profile

Peng Lin

Peng Lin contributes to research discovery and scholarly infrastructure.

Researcher | Affiliation not imported | Open to collaborate

Trust snapshot

Quick read

Trust 21 - Emerging | Verification L1 | Unclaimed author
8 works
0 followers
6 topics
4 close collaborators

Published work

8 published items

Preprint | 2023 | arXiv

Darwin3: A large-scale neuromorphic chip with a Novel ISA and On-Chip Learning

Spiking Neural Networks (SNNs) are gaining increasing attention for their biological plausibility and potential for improved computational efficiency. To match the high spatial-temporal dynamics in SNNs, neuromorphic chips are highly desired to execute SNNs in hardware-based neuron and synapse circuits directly. This paper presents a large-scale neuromorphic chip named Darwin3 with a novel instruction set architecture (ISA), which comprises 10 primary instructions and a few extended instructions. It supports flexible neuron model programming and local learning rule designs. The Darwin3 chip architecture is designed as a mesh of computing nodes with an innovative routing algorithm. We used a compression mechanism to represent synaptic connections, significantly reducing memory usage. The Darwin3 chip supports up to 2.35 million neurons, making it the largest of its kind in neuron scale. The experimental results showed that code density was improved by up to 28.3x in Darwin3, and neuron core fan-in and fan-out were improved by up to 4096x and 3072x by connection compression compared to the physical memory depth. Our Darwin3 chip also provided memory savings between 6.8x and 200.8x when mapping convolutional spiking neural networks (CSNN) onto the chip, demonstrating state-of-the-art performance in accuracy and latency compared to other neuromorphic chips.

Preprint | 2021 | arXiv

Echo state graph neural networks with analogue random resistor arrays

Recent years have witnessed an unprecedented surge of interest, from social networks to drug discovery, in learning representations of graph-structured data. However, graph neural networks, the machine learning models for handling graph-structured data, face significant challenges when running on conventional digital hardware, including the von Neumann bottleneck incurred by physically separated memory and processing units, the slowdown of Moore's law due to the transistor scaling limit, and high training cost. Here we present a novel hardware-software co-design, the random resistor array-based echo state graph neural network, which addresses these challenges. The random resistor arrays not only harness low-cost, nanoscale and stackable resistors for highly efficient in-memory computing using simple physical laws, but also leverage the intrinsic stochasticity of dielectric breakdown to implement random projections in hardware for an echo state network that effectively minimizes the training cost thanks to its fixed and random weights. The system demonstrates state-of-the-art performance on both graph classification using the MUTAG and COLLAB datasets and node classification using the CORA dataset, achieving 34.2x, 93.2x, and 570.4x improvement of energy efficiency and 98.27%, 99.46%, and 95.12% reduction of training cost compared to conventional graph learning on digital hardware, respectively, which may pave the way for next-generation AI systems for graph learning.

Preprint | 2021 | arXiv

Implementation of annihilation and junction reactions in vector density-based continuum dislocation dynamics

In a continuum dislocation dynamics formulation by Xia and El-Azab, dislocations are represented by a set of vector density fields, one per crystallographic slip system. The space-time evolution of these densities is obtained by solving a set of dislocation transport equations coupled with crystal mechanics. Here, we present an approach for incorporating dislocation annihilation and junction reactions into the dislocation transport equations. These reactions consume dislocations and either produce nothing, as in annihilation reactions, or produce new dislocations of different types, as in junction reactions. Collinear annihilation, glissile junctions, and sessile junctions are particularly emphasized here. A generalized energy-based criterion for junction reactions is established in terms of the dislocation density and Burgers vectors of the reacting species, and the reaction rate terms for junction reactions are formulated in terms of the dislocation densities. In order to illustrate how the dislocation network changes as a result of junction formation and annihilation in a continuum dislocation dynamics setting, we present some numerical examples focusing on the reaction processes themselves. The results show that our modeling approach is able to capture the respective dislocation network changes associated with dislocation reactions in FCC crystals: dislocations of opposite line directions encountering each other on collinear slip systems annihilate to connect the dislocations on the two slip systems, glissile junctions that form on a new slip system behave like Frank-Read sources, and sessile junctions form and expand along the intersection of the slip planes of the reacting dislocation species. A collective-dynamics test showing the frequency of occurrence of junctions of different types relative to each other is also presented.

Preprint | 2021 | arXiv

On the computational solution of vector-density based continuum dislocation dynamics models: a comparison of two plastic distortion and stress update algorithms

Continuum dislocation dynamics models of mesoscale plasticity consist of dislocation transport-reaction equations coupled with crystal mechanics equations. The coupling between these two sets of equations is such that dislocation transport gives rise to the evolution of plastic distortion (strain), while the evolution of the latter fixes the stress from which the dislocation velocity field is found via a mobility law. Earlier solutions of these equations employed a staggered solution scheme for the two sets of equations in which the plastic distortion was updated via time integration of its rate, as found from Orowan's law. In this work, we show that such a direct time integration scheme can suffer from accumulation of numerical errors. We introduce an alternative scheme based on field dislocation mechanics that ensures consistency between the plastic distortion and the dislocation content in the crystal. The new scheme calculates the compatible and incompatible parts of the plastic distortion separately, with the incompatible part calculated from the current dislocation density field. Stress field and dislocation transport calculations were implemented within a finite element based discretization of the governing equations, with the crystal mechanics part solved by a conventional Galerkin method and the dislocation transport equations by the least squares method. A simple test is first performed to compare the accuracy of the two schemes for updating the plastic distortion, and it shows that the solution method based on field dislocation mechanics is more accurate. This method was then used to simulate an austenitic steel crystal under uniaxial loading and multiple slip conditions.

Preprint | 2020 | arXiv

Distributed Continuous-Time and Discrete-Time Optimization With Nonuniform Unbounded Convex Constraint Sets and Nonuniform Stepsizes

This paper is devoted to distributed continuous-time and discrete-time optimization problems with nonuniform convex constraint sets and nonuniform stepsizes for general differentiable convex objective functions. The communication graphs are not required to be strongly connected at any time, the gradients of the local objective functions are not required to be bounded when their independent variables tend to infinity, and the constraint sets are not required to be bounded. For continuous-time multi-agent systems, a distributed continuous algorithm is first introduced where the stepsizes and the convex constraint sets are both nonuniform. It is shown that all agents reach a consensus while minimizing the team objective function even when the constraint sets are unbounded. After that, the obtained results are extended to discrete-time multi-agent systems, and then the case where each agent remains in a corresponding convex constraint set is studied. To ensure that all agents remain in a bounded region, a switching mechanism is introduced in the algorithms. It is shown that the distributed optimization problems can be solved, even though the discretization of the algorithms might cause the agents' convergence to deviate from the minimum of the objective functions. Finally, numerical examples are included to illustrate the obtained theoretical results.

Preprint | 2020 | arXiv

Distributed optimization with nonconvex velocity constraints, nonuniform position constraints and nonuniform stepsizes

This note is devoted to the distributed optimization problem of multi-agent systems with nonconvex velocity constraints, nonuniform position constraints and nonuniform stepsizes. Two distributed constrained algorithms with nonconvex velocity constraints and nonuniform stepsizes are proposed, one in the absence and one in the presence of nonuniform position constraints, by introducing a switching mechanism to guarantee that all agents' position states remain in a bounded region. The algorithm gains need not be predesigned and can be selected by each agent using its own and its neighbours' information. By a model transformation, the original nonlinear time-varying system is converted into a linear time-varying one with a nonlinear error term. Based on the properties of stochastic matrices, it is shown that the optimization problem can be solved as long as the communication topologies are jointly strongly connected and balanced. Numerical examples are given to illustrate the obtained theoretical results.

Preprint | 2020 | arXiv

Distributed velocity-constrained consensus of discrete-time multi-agent systems with nonconvex constraints, switching topologies, and delays

In this paper, a distributed velocity-constrained consensus problem is studied for discrete-time multi-agent systems, where each agent's velocity is constrained to lie in a nonconvex set. A distributed constrained control algorithm is proposed to enable all agents to converge to a common point using only local information. The gains of the algorithm for all agents need not be the same or predesigned and can be adjusted by each agent itself based on its own and its neighbors' information. It is shown that the algorithm is robust to arbitrarily bounded communication delays and arbitrarily switching communication graphs, provided that the union of the graphs over each time interval of a certain length has directed spanning trees. The analysis approach is based on multiple novel model transformations, proper control parameter selections, boundedness analysis of state-dependent stochastic matrices, exploitation of the convexity of stochastic matrices, and the joint connectivity of the communication graphs. Numerical examples are included to illustrate the theoretical results.
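
The abstract above describes agents that reach a common point while each velocity stays inside a nonconvex set. The following is a minimal Python sketch of that general idea, not the paper's algorithm. Everything in it is an assumption chosen for illustration: the flower-shaped set defined by `radius`, the fixed four-agent ring, the neighbor weight of 0.25, the absence of delays and switching, and all function names. The `constrain` step shrinks a desired velocity along its own direction until it lies in the set, so each update remains a state-dependent convex combination of neighbor positions.

```python
import math

def radius(theta):
    # Boundary of a hypothetical star-shaped, nonconvex velocity set:
    # it reaches 1.0 along the axes and only 0.2 along the diagonals.
    return 0.6 + 0.4 * math.cos(4.0 * theta)

def constrain(vx, vy):
    # Shrink the vector along its own direction until it lies in the set.
    norm = math.hypot(vx, vy)
    if norm == 0.0:
        return 0.0, 0.0
    limit = radius(math.atan2(vy, vx))
    scale = min(1.0, limit / norm)
    return vx * scale, vy * scale

def step(positions, neighbors, weight=0.25):
    # One synchronous update: each agent moves by the constrained,
    # weighted sum of its offsets to its neighbors.
    new_positions = []
    for i, (xi, yi) in enumerate(positions):
        ux = sum(weight * (positions[j][0] - xi) for j in neighbors[i])
        uy = sum(weight * (positions[j][1] - yi) for j in neighbors[i])
        vx, vy = constrain(ux, uy)
        new_positions.append((xi + vx, yi + vy))
    return new_positions

def spread(positions):
    # Largest coordinate-wise distance between any two agents.
    xs = [p[0] for p in positions]
    ys = [p[1] for p in positions]
    return max(max(xs) - min(xs), max(ys) - min(ys))

# Assumed toy setup: four agents on a ring, starting at the corners of a square.
ring = {0: [1, 3], 1: [0, 2], 2: [1, 3], 3: [0, 2]}
positions = [(0.0, 0.0), (10.0, 0.0), (10.0, 10.0), (0.0, 10.0)]
for _ in range(500):
    positions = step(positions, ring)
print(spread(positions))
```

In this toy setup the agents first crawl toward each other at the speed the set allows along the diagonals, then contract freely once their desired velocities fall inside the set, so the printed spread ends up close to zero.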

Preprint | 2020 | arXiv

On the implementation of dislocation reactions in continuum dislocation dynamics modeling of mesoscale plasticity

The continuum dislocation dynamics framework for mesoscale plasticity is intended to capture the dislocation density evolution and the deformation of crystals when subjected to mechanical loading. It does so by solving a set of transport equations for dislocations concurrently with crystal mechanics equations, with the latter being cast in the form of an eigenstrain problem. Incorporating dislocation reactions in the dislocation transport equations is essential for making such continuum dislocation dynamics predictive. A formulation is proposed to incorporate dislocation reactions in the transport equations of the vector density-based continuum dislocation dynamics. This formulation aims to rigorously enforce dislocation line continuity using the concept of virtual dislocations that close all dislocation loops involved in cross slip, annihilation, and glissile and sessile junction reactions. The addition of virtual dislocations enables us to accurately enforce the divergence-free condition upon the numerical solution of the dislocation transport equations for all slip systems individually. A set of tests was performed to illustrate the accuracy of the formulation and the solution of the transport equations within the vector density-based continuum dislocation dynamics. Comparing the results from these tests with an earlier approach, in which the divergence-free constraint was enforced on the total dislocation density tensor or on the sum of two densities when only cross slip is considered, shows that the new approach yields highly accurate results. Bulk simulations were performed for a face-centered cubic crystal based on the new formulation and the results were compared with discrete dislocation dynamics predictions of the same. The microstructural features obtained from continuum dislocation dynamics were also analyzed with reference to relevant experimental observations.